Submitted by RyanJohnson on Sun, 07/15/2012 - 17:49
Hey guys,
I need your help troubleshooting a problem. I have two monitors on our development server that check the status of HTTP and 'Alive System' from the System and Server Status module. These monitors sporadically report that the server is down throughout the day when it appears that the system is in fact up.
I have a third monitor that checks the net connection before reporting the outages.
This has only recently started happening so I'm wondering if you can point me in the right direction so I can find out what's actually happening.
Could it be that Apache is under high load?
Thanks, Ryan
Status:
Active
Comments
Submitted by JamieCameron on Sun, 07/15/2012 - 19:22 Comment #1
It's possible that a transient networking or load issue is causing these failed status reports.
You could try checking the Apache log to see if any error is logged when a monitor fails.
Also, you could try changing the number of concurrent failures needed before the alert email is sent - this can be set on the Edit Monitor page in Webmin.
Submitted by RyanJohnson on Tue, 07/17/2012 - 18:59 Comment #2
Hi Jamie,
I looked into /var/log/httpd/error_log and nothing seems to indicate a crash. Could it be coming from a particular virtual server? Where should I start with this?
I also have a monitor set up on the server itself and it doesn't report the outage, so this is inconsistent which leads me to believe something else is at play.