David Rees drees76 at gmail.com
Thu Feb 21 12:00:10 PST 2008
On Thu, Feb 21, 2008 at 9:15 AM, Andrew Sullivan <ajs at crankycanuck.ca> wrote:
> On Thu, Feb 21, 2008 at 08:09:08AM -0800, Craig James wrote:
>  > In a situation like this, some sort of ACTIVE response from Slony would be
>  > nice.  Here's an idea.  When the Slony daemon detects an unrecoverable
>  > error, it should STOP, and send an email to a configurable administration
>  > email address.  Something like this:
>
>  No, no, that should not go in the daemon.  That should go in your monitoring
>  system.  I believe there are Nagios plugins floating about.  They could be
>  smarter, though, particularly about this sort of recoverable/non-recoverable
>  distinction you're mentioning.

Yep, we use Nagios to monitor replication status, and it works quite well.
When things get out of sync (has actually never happened in production
yet!) we simply go through the slon/pg logs to figure out what went
wrong.
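
For anyone wanting to try this, here is a minimal sketch of the decision
logic such a Nagios check might use. It is an illustration only, not any
existing plugin: the function name check_lag and the thresholds are
hypothetical, and it assumes the replication lag in seconds has already
been obtained somehow (for example from the st_lag_time column of Slony's
sl_status view). The exit codes are the standard Nagios plugin ones.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_lag(lag_seconds, warn=300, crit=900):
    """Map a replication lag in seconds to an (exit_code, message) pair.

    warn and crit are hypothetical thresholds; tune them for your cluster.
    Pass None when the lag could not be determined.
    """
    if lag_seconds is None:
        return UNKNOWN, "SLONY UNKNOWN - could not read replication lag"
    if lag_seconds >= crit:
        return CRITICAL, "SLONY CRITICAL - lag is %d seconds" % lag_seconds
    if lag_seconds >= warn:
        return WARNING, "SLONY WARNING - lag is %d seconds" % lag_seconds
    return OK, "SLONY OK - lag is %d seconds" % lag_seconds
```

A wrapper script would print the message and exit with the returned code,
which is how Nagios decides the state of the service.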

-Dave

