Robert Littlejohn Robert.Littlejohn
Thu Jan 19 12:56:05 PST 2006
Hello,
I have a simple master to one slave cluster setup.  My setup is slony1-1.1.0
with postgresql-7.3.9-2 on a Red Hat ES v3 server (both master and slave and
the same).  

This worked great for several months even with frequent network outages.
Then in Nov the router at the slave site was changed and the replication
stopped but nobody noticed.  Now several months later I have over 5 million
entries in sl_log_1 and over 400, 000 in sl_event.  I see no errors in the
logs, I've vacuumed all tables including the sl_* and pg_* tables - no
change.  I've restarted both servers - I've viewed the network connections
and all are good.  The slon processes run fine and the postres's are talking
to each other  but still no replication.

small example from slave log:
2006-01-19 16:50:02 AST DEBUG2 syncThread: new sl_action_seq 1 - SYNC 953791
2006-01-19 16:50:02 AST DEBUG2 localListenThread: Received event 2,953791
SYNC
2006-01-19 16:50:12 AST DEBUG2 syncThread: new sl_action_seq 1 - SYNC 953792
2006-01-19 16:50:12 AST DEBUG2 localListenThread: Received event 2,953792
SYNC
2006-01-19 16:50:22 AST DEBUG2 syncThread: new sl_action_seq 1 - SYNC 953793

and from the master:
2006-01-19 16:50:02 AST DEBUG2 remoteListenThread_2: queue event 2,953790
SYNC
2006-01-19 16:50:02 AST DEBUG2 remoteWorkerThread_2: Received event 2,953790
SYNC
2006-01-19 16:50:02 AST DEBUG3 calc sync size - last time: 1 last length:
9881 ideal: 6 proposed size: 2
2006-01-19 16:50:02 AST DEBUG2 remoteWorkerThread_2: SYNC 953790 processing
2006-01-19 16:50:02 AST DEBUG2 remoteWorkerThread_2: no sets need syncing
for this event
2006-01-19 16:50:02 AST DEBUG2 syncThread: new sl_action_seq 7342475 - SYNC
1037985
2006-01-19 16:50:02 AST DEBUG2 localListenThread: Received event 1,1037985
SYNC
2006-01-19 16:50:06 AST DEBUG2 syncThread: new sl_action_seq 7342477 - SYNC
1037986
2006-01-19 16:50:07 AST DEBUG2 localListenThread: Received event 1,1037986
SYNC
2006-01-19 16:50:10 AST DEBUG2 syncThread: new sl_action_seq 7342479 - SYNC
1037987
2006-01-19 16:50:11 AST DEBUG2 localListenThread: Received event 1,1037987
SYNC
2006-01-19 16:50:12 AST DEBUG2 remoteListenThread_2: queue event 2,953791
SYNC
2006-01-19 16:50:12 AST DEBUG2 remoteWorkerThread_2: Received event 2,953791
SYNC
2006-01-19 16:50:12 AST DEBUG3 calc sync size - last time: 1 last length:
10001 ideal: 5 proposed size: 2
2006-01-19 16:50:12 AST DEBUG2 remoteWorkerThread_2: SYNC 953791 processing
2006-01-19 16:50:12 AST DEBUG2 remoteWorkerThread_2: no sets need syncing
for this event
2006-01-19 16:50:12 AST DEBUG2 syncThread: new sl_action_seq 7342482 - SYNC
1037988
2006-01-19 16:50:13 AST DEBUG2 localListenThread: Received event 1,1037988
SYNC
2006-01-19 16:50:22 AST DEBUG2 remoteListenThread_2: queue event 2,953792
SYNC
2006-01-19 16:50:22 AST DEBUG2 remoteWorkerThread_2: Received event 2,953792
SYNC
2006-01-19 16:50:22 AST DEBUG3 calc sync size - last time: 1 last length:
10071 ideal: 5 proposed size: 2
2006-01-19 16:50:22 AST DEBUG2 remoteWorkerThread_2: SYNC 953792 processing
2006-01-19 16:50:22 AST DEBUG2 remoteWorkerThread_2: no sets need syncing
for this event

Any help anybody can give me will be appreciated.  BTW if this is the wrong
mailling list could someone point me at the correct location for slony1
problems.

Thanks




More information about the Slony1-general mailing list