monit-general
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Application Not getting monitored on improper reboot


From: Martin Pala
Subject: Re: Application Not getting monitored on improper reboot
Date: Tue, 29 Nov 2005 23:00:07 +0100
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20051007 Debian/1.7.12-1

It could be good to try to run monit in debug mode (using -v option). Then when the described problem occures, save the copy of .monit.state file. When you will attach the configuration, saved monit state file and debug output we can check it.

Thanks,
Martin


Rajesh Gopalan wrote:
Hi All,
We are using monit software for the last 6 months and it has been very useful for all of us. Thanks to all who made this a huge succes. I would really appreciate your help with following problem:
I am using monit V4.4 to monitor a list of services.
On a running system, if some services cannot be restarted for some reason, I specify in the /etc/monitrc file to retry a number of times, before 'timeout'. After a specified number of retries (in the /etc/monitrc file), monit shows that it will give up monitoring the system. For example, I have a service called foobar, which failed to start multiple times (5 times in this case), and exceeded the limit I specified in the /etc/monitrc file:
check process foobar
with pidfile "/var/run/foobar.pid"
start program = "/etc/rc.d/init.d/foobar start"
stop program = "/etc/rc.d/init.d/foobar stop"
if 5 restarts within 6 cycles then timeout
We are calling "monit monitor all" immediately after starting monit to start monitoring all the applications (including timed out ones) once a reboot occurs. But incase of a abnormal/abrupt reboot, i.e When you poweroff the system directly or if you press reset button(Improper reboot), the monit is not monitoring the timed out applications. Only if we forcibly the state file that monit maintains in user's home directory i.e $home/.monit.state, I am able to remonitor the timed out applcations. Is there any other way to remonitor these services? What could be the problem? I failed to find something useful in Google :( Thanks in advance! Thanks and Regards,
Rajesh G
"SASKEN RATED THE BEST COMPANY TO WORK FOR IN INDIA - SURVEY 2005 conducted by the BUSINESS TODAY - Mercer - TNS India"


SASKEN BUSINESS DISCLAIMER
This message may contain confidential, proprietary or legally Privileged information. In case you are not the original intended Recipient of the message, you must not, directly or indirectly, use, Disclose, distribute, print, or copy any part of this message and you are requested to delete it and inform the sender. Any views expressed in this message are those of the individual sender unless otherwise stated. Nothing contained in this message shall be construed as an offer or acceptance of any offer by Sasken Communication Technologies Limited ("Sasken") unless sent with that express intent and with due authority of Sasken. Sasken has taken enough precautions to prevent the spread of viruses. However the company accepts no liability for any damage caused by any virus transmitted by this email


------------------------------------------------------------------------

--
To unsubscribe:
http://lists.nongnu.org/mailman/listinfo/monit-general




reply via email to

[Prev in Thread] Current Thread [Next in Thread]