[Slony1-general] sl_log_1 and sl_log_2 tables not cleaned up

Mon Feb 23 09:36:31 PST 2015

On 23/02/2015 18:31, Guillaume Lelarge wrote:
> Hi,
> 
> On 23 Feb 2015 16:51, "Clement Thomas" <clement1289 at gmail.com> wrote:
>>
>> Hi all,
>> We are facing a strange problem in our 3-node Slony setup.
>>
>> * node1 (db1.domain.tld) is the master provider, and node2
>> (db2.domain.tld) and node3 (db3.domain.tld) are subscribers.
>> The nodes currently have 5 replication sets, and replication is working
>> fine.
>> * The problem is that the sl_log_1 and sl_log_2 tables on node1 get
>> cleaned up properly, but the same tables on node2 and node3 do not. On
>> node1, sl_log_1 has 24845 rows and sl_log_2 has 0, whereas the
>> subscribers show:
>>
>> node2:
>>
>>                          relation                         |  size
>> ----------------------------------------------------------+---------
>>  _mhb_replication.sl_log_2                                | 130 GB
>>  _mhb_replication.sl_log_2_idx1                           | 47 GB
>>  _mhb_replication.PartInd_mhb_replication_sl_log_2-node-1 | 30 GB
>>
>> node3:
>>                          relation                         |  size
>> ----------------------------------------------------------+--------
>>  _mhb_replication.sl_log_2                                | 133 GB
>>  _mhb_replication.sl_log_2_idx1                           | 47 GB
>>  _mhb_replication.PartInd_mhb_replication_sl_log_2-node-1 | 30 GB
>>  _mhb_replication.sl_log_1                                | 352 MB
>>
>>
>> On node2 and node3 we frequently see the following lines in the slon
>> log:
>>
>> slon[20695]: [4031-1] FATAL  cleanupThread: "delete from
>> "_mhb_replication".sl_log_1 where log_origin = '1' and log_xid <
>> '2130551154'; delete from
>> slon[20695]: [4031-2]  "_mhb_replication".sl_log_2 where log_origin =
>> '1' and log_xid < '2130551154'; delete from
>> "_mhb_replication".sl_seqlog where
>> slon[20695]: [4031-3]  seql_origin = '1' and seql_ev_seqno <
>> '51449379'; select "_mhb_replication".logswitch_finish(); " - ERROR:
>> canceling statement
>> slon[20695]: [4031-4]  due to statement timeout
>> slon[20695]: [4032-1] DEBUG2 slon_retry() from pid=20695
>>
>> Please find the slony_tools.conf here:
>> https://gist.github.com/clement1289/d928acb771ca01a89281
>> and the sl_status / sl_listen output here:
>> https://gist.github.com/clement1289/88df40f77c03c691eee5
>> Hoping for some help.
>>
> 
> You should change your PostgreSQL configuration: statement_timeout is too
> low. Unset it or set it higher.

At least for the slony user. The table and its indexes are really big, so
deleting from them may take a long time.
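
As a rough sketch of what that could look like, assuming the slon daemons
connect as a role named "slony" (the role name is only an assumption here;
use whichever user your slon configuration actually connects with):

```sql
-- Assumption: slon connects as a role called "slony"; substitute your own.

-- See which timeout is in effect for the current session.
SHOW statement_timeout;

-- Disable the statement timeout for that role only (0 means no limit),
-- leaving the server-wide default untouched for other users.
ALTER ROLE slony SET statement_timeout = 0;

-- Confirm that the per-role override has been stored.
SELECT rolname, rolconfig FROM pg_roles WHERE rolname = 'slony';
```

A per-role setting only applies to new sessions, so the slon processes
would need to reconnect (for example, be restarted) before the cleanup
thread picks it up.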

-- 
Stéphane Schildknecht
Regional PostgreSQL contact for French-speaking Europe
Loxodata - Consulting, expertise and training
