monit-general

Re: problem with cpu usage (user)


From: Matt Corks
Subject: Re: problem with cpu usage (user)
Date: Thu, 11 Jan 2007 12:48:47 -0500
User-agent: Mutt/1.5.13 (2006-08-11)

Hi, Martin.  Here's the output of that command.

On Thu, Jan 11, 2007 at 06:14:56PM +0100, Martin Pala wrote:
>Hi,
>
>can you send the output of the following command?
>
>  cat /proc/stat && sleep 120 && cat /proc/stat
>
>This will provide the input that monit processes and interprets as
>CPU usage.
>
>Thanks,
>Martin
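
The two snapshots requested above can be turned into per-interval percentages roughly as follows. This is a minimal sketch, not monit's actual code: it assumes the aggregate "cpu" line carries the seven counters documented in proc(5) for 2.6 kernels (user, nice, system, idle, iowait, irq, softirq), and the helper names and sample numbers are made up for illustration.

```python
# Counter order on the aggregate "cpu" line, per proc(5) on 2.6 kernels.
FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq")


def parse_cpu_line(stat_text):
    """Return the aggregate 'cpu' counters from /proc/stat text as a dict."""
    for line in stat_text.splitlines():
        parts = line.split()
        if parts and parts[0] == "cpu":
            values = [int(v) for v in parts[1:1 + len(FIELDS)]]
            return dict(zip(FIELDS, values))
    raise ValueError("no aggregate 'cpu' line found")


def cpu_percentages(before, after):
    """Share of each counter in the time elapsed between two snapshots."""
    deltas = {name: after[name] - before[name] for name in before}
    total = sum(deltas.values())
    if total <= 0:
        raise ValueError("counters did not advance between snapshots")
    return {name: 100.0 * delta / total for name, delta in deltas.items()}


# Hypothetical snapshots (not taken from this thread), 120 seconds apart.
before = parse_cpu_line("cpu 100 0 50 800 50 0 0\n")
after = parse_cpu_line("cpu 130 0 60 1000 60 0 0\n")
shares = cpu_percentages(before, after)
print({name: round(value, 1) for name, value in shares.items()})
```

Dividing by the sum of all deltas, idle included, is what makes the result comparable with the "us" and "id" columns in top and mpstat.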
>
>
>Matt Corks wrote:
>>My apologies -- I've attached those files here.
>>
>>On Wed, Jan 10, 2007 at 03:51:00PM -0500, Matt Corks wrote:
>>>Greetings, all.  I'm having a problem with monit 4.8.2 on gentoo 1.4.16
>>>(Linux kernel 2.6.10-gentoo-r6).  According to top all CPUs are mostly
>>>idle, but monit thinks the cpu user usage is hovering over 70%.  Having
>>>mpstat average CPU usage over the same cycle length as monit (2
>>>minutes) results in the same values as top.  I've read Martin Pala's
>>>description of how monit calculates those figures[1] but it's still not
>>>clear to me why they're so different; surely they can't both be right.
>>>Any help would be appreciated.
>>>
>>>Please find attached /etc/monitrc, /proc/stat, /proc/cpuinfo, and the
>>>output of monit -vI, top -b -n 1, and mpstat -P ALL 120 3.
>>>
>>>Many thanks,
>>>Matt
>>>
>>>[1] http://www.mail-archive.com/address@hidden/msg00794.html
>>
>>
>>------------------------------------------------------------------------
>>
>>set mailserver cpt0.carpooltool.com
>>set daemon 120 # Poll at 2-minute intervals
>>set logfile /var/log/monit.log
>>set alert address@hidden
>>    mail-format {
>>        from: address@hidden
>>    }
>>
>>set httpd port 2812
>>    ssl enable
>>    pemfile /usr/local/share/ssl/monit.pem
>>    clientpemfile /usr/local/share/ssl/monit_client.pem
>>    allowselfcertification
>>    allow monit:k33p1tup
>>
>>check system cpt0.carpooltool.com
>>   if loadavg (1min) > 4 then alert
>>   if loadavg (5min) > 2 then alert
>>   if memory usage > 75% then alert
>>   if cpu usage (user) > 75% then alert
>>   if cpu usage (system) > 40% then alert
>>   if cpu usage (wait) > 40% then alert
>>
>>check process apache
>>   with pidfile "/home/cpt/fgs/www/logs/httpd.pid"
>>   start program = "/home/cpt/fgs/www/bin/apachectl start"
>>   stop program = "/home/cpt/fgs/www/bin/apachectl stop"
>>   #if 2 restarts within 3 cycles then timeout
>>   if totalmem > 225 Mb for 5 cycles then alert
>>   if children > 255 for 5 cycles then alert
>>   if cpu usage > 95% for 3 cycles then alert
>>   #if failed port 443 type tcpssl protocol http then alert
>>   #if failed port 80 protocol http then alert
>>   if failed url http://cpt0.carpooltool.com:80/monit.txt then alert
>>   if failed host cpt0.carpooltool.com
>>        port 80 protocol apache-status
>>      dnslimit > 25% or loglimit > 80% or waitlimit < 20% then alert
>>   group server
>>   depends on httpd.conf, httpd.bin
>>
>>check file httpd.conf
>>    with path /home/cpt/fgs/www/conf/live.httpd.conf
>>    # Reload apache if the httpd.conf file was changed
>>    if changed checksum
>>       then alert
>>       #then exec "/home/cpt/fgs/www/bin/apachectl graceful"
>>
>>check file httpd.bin
>>    with path /home/cpt/fgs/www/bin/httpd
>>    # Alert in the case that the binary was changed and alert in the case that
>>    # the checksum value recovered later
>>    if failed checksum then alert
>>       else if recovered then alert
>>
>>check process sshd with pidfile /var/run/sshd.pid
>>    if failed port 22 protocol ssh then alert
>>    start program = "/etc/init.d/sshd start"
>>    start program = "/etc/init.d/sshd stop"
>>    group server
>>
>>check process postfix with pidfile /var/spool/postfix/pid/master.pid
>>    start program = "/etc/init.d/postfix start"
>>    stop program  = "/etc/init.d/postfix stop"
>>    if failed port 25 protocol smtp then alert
>>    group server
>>
>>check process postgres with pidfile /var/lib/postgresql/data/postmaster.pid
>>    if failed port 5432 protocol pgsql then alert
>>    group server
>>
>>check process fcron with pidfile /var/run/fcron.pid
>>    group server
>>
>>check process pathserv_l0 with pidfile /home/cpt/DEV/source/et/util/route/astar/remoting/dev5/L0/BINDER_PID
>>    group server
>>
>>check process pathserv_l1 with pidfile /home/cpt/DEV/source/et/util/route/astar/remoting/dev5/L1/BINDER_PID
>>    group server
>>
>>check host cpt1 with address cpt1.carpooltool.com
>>    if failed icmp type echo count 5 with timeout 15 seconds then alert
>>    if failed url https://monit:address@hidden:443/monit.txt then alert
>>    if failed port 22 protocol ssh with timeout 15 seconds then alert
>>    if failed port 5432 protocol pgsql with timeout 15 seconds then alert
>>
>>check host cpt2 with address cpt2.carpooltool.com
>>    if failed icmp type echo count 5 with timeout 15 seconds then alert
>>    if failed url https://monit:address@hidden:443/monit.txt then alert
>>    if failed port 22 protocol ssh with timeout 15 seconds then alert
>>
>>
>>------------------------------------------------------------------------
>>
>>cpu  32814510 58374 5646183 5187334727 2828507 50477 304053
>>cpu0 20097737 28264 2694152 1281779892 2424456 32758 201968
>>cpu1 4558020 15507 1100169 1301335227 221997 1467 26810
>>cpu2 5226851 7223 1114364 1300727664 111743 14119 57234
>>cpu3 2931901 7378 737496 1303491942 70310 2132 18040
>>intr 311245708 188786303 2 0 0 0 0 3 0 2 0 0 0 3 0 42512270 15 0 0 0 0 0 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 79947110 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
>>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
>>ctxt 3694955451
>>btime 1155387586
>>processes 3016925
>>procs_running 1
>>procs_blocked 0
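
For reference, the aggregate "cpu" line in the /proc/stat dump above can be read as since-boot shares. The sketch below uses the numbers exactly as posted and assumes the proc(5) field order for 2.6 kernels. It also reports which counters no longer fit in an unsigned 32-bit integer. That second check is only an observation about the data; whether it has any bearing on monit's figures is not established in this thread.

```python
# Aggregate "cpu" line copied from the /proc/stat attachment in this message.
CPU_LINE = "cpu  32814510 58374 5646183 5187334727 2828507 50477 304053"

# Field order assumed from proc(5) for 2.6 kernels.
FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq")

counters = dict(zip(FIELDS, (int(v) for v in CPU_LINE.split()[1:])))
total = sum(counters.values())

# Share of each counter in all time accounted for since boot.
since_boot = {name: 100.0 * value / total for name, value in counters.items()}

# Counters too large for an unsigned 32-bit integer.
UINT32_MAX = 2**32 - 1
over_32bit = [name for name, value in counters.items() if value > UINT32_MAX]

for name in FIELDS:
    print(f"{name:8s} {since_boot[name]:6.2f}%")
print("counters above 2**32 - 1:", over_32bit)
```

Rounded to one decimal place, the user and idle shares come out at 0.6% and 99.2%, which happens to match the "Cpu(s)" summary line in the top output quoted further down.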
>>
>>
>>------------------------------------------------------------------------
>>
>>processor     : 0
>>vendor_id     : GenuineIntel
>>cpu family    : 15
>>model         : 3
>>model name    : Intel(R) Xeon(TM) CPU 3.00GHz
>>stepping      : 4
>>cpu MHz               : 2993.288
>>cache size    : 1024 KB
>>physical id   : 0
>>siblings      : 2
>>fdiv_bug      : no
>>hlt_bug               : no
>>f00f_bug      : no
>>coma_bug      : no
>>fpu           : yes
>>fpu_exception : yes
>>cpuid level   : 5
>>wp            : yes
>>flags         : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca 
>>cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni 
>>monitor ds_cpl cid xtpr
>>bogomips      : 5914.62
>>
>>processor     : 1
>>vendor_id     : GenuineIntel
>>cpu family    : 15
>>model         : 3
>>model name    : Intel(R) Xeon(TM) CPU 3.00GHz
>>stepping      : 4
>>cpu MHz               : 2993.288
>>cache size    : 1024 KB
>>physical id   : 0
>>siblings      : 2
>>fdiv_bug      : no
>>hlt_bug               : no
>>f00f_bug      : no
>>coma_bug      : no
>>fpu           : yes
>>fpu_exception : yes
>>cpuid level   : 5
>>wp            : yes
>>flags         : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca 
>>cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni 
>>monitor ds_cpl cid xtpr
>>bogomips      : 5980.16
>>
>>processor     : 2
>>vendor_id     : GenuineIntel
>>cpu family    : 15
>>model         : 3
>>model name    : Intel(R) Xeon(TM) CPU 3.00GHz
>>stepping      : 4
>>cpu MHz               : 2993.288
>>cache size    : 1024 KB
>>physical id   : 3
>>siblings      : 2
>>fdiv_bug      : no
>>hlt_bug               : no
>>f00f_bug      : no
>>coma_bug      : no
>>fpu           : yes
>>fpu_exception : yes
>>cpuid level   : 5
>>wp            : yes
>>flags         : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca 
>>cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni 
>>monitor ds_cpl cid xtpr
>>bogomips      : 5980.16
>>
>>processor     : 3
>>vendor_id     : GenuineIntel
>>cpu family    : 15
>>model         : 3
>>model name    : Intel(R) Xeon(TM) CPU 3.00GHz
>>stepping      : 4
>>cpu MHz               : 2993.288
>>cache size    : 1024 KB
>>physical id   : 3
>>siblings      : 2
>>fdiv_bug      : no
>>hlt_bug               : no
>>f00f_bug      : no
>>coma_bug      : no
>>fpu           : yes
>>fpu_exception : yes
>>cpuid level   : 5
>>wp            : yes
>>flags         : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca 
>>cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni 
>>monitor ds_cpl cid xtpr
>>bogomips      : 5980.16
>>
>>
>>
>>------------------------------------------------------------------------
>>
>>monit: Debug: Adding credentials for user 'monit'.
>>Runtime constants:
>> Control file       = /etc/monitrc
>> Log file           = /var/log/monit.log
>> Pid file           = /var/run/monit.pid
>> Debug              = True
>> Log                = True
>> Use syslog         = False
>> Is Daemon          = True
>> Use process engine = True
>> Poll time          = 120 seconds
>> Mail server(s)     = cpt0.carpooltool.com 
>> Mail from          = (not defined)
>> Mail subject       = (not defined)
>> Mail message       = (not defined)
>> Start monit httpd  = True
>> httpd bind address = Any/All
>> httpd portnumber   = 2812
>> httpd signature    = True
>> Use ssl encryption = True
>> PEM key/cert file  = /usr/local/share/ssl/monit.pem
>> Client cert file   = /usr/local/share/ssl/monit_client.pem
>> Allow self certs   = True
>> httpd auth. style  = Basic Authentication
>> Alert mail to      = address@hidden
>>   Alert on         = All events
>>
>>The service list contains the following entries:
>>
>>System Name           = cpt0.carpooltool.com
>> Monitoring mode      = active
>> CPU wait limit       = if greater than 40.0% 1 times within 1 cycle(s) 
>> then alert else if passed 1 times within 1 cycle(s) then alert
>> CPU system limit     = if greater than 40.0% 1 times within 1 cycle(s) 
>> then alert else if passed 1 times within 1 cycle(s) then alert
>> CPU user limit       = if greater than 75.0% 1 times within 1 cycle(s) 
>> then alert else if passed 1 times within 1 cycle(s) then alert
>> Memory usage limit   = if greater than 75.0% 1 times within 1 cycle(s) 
>> then alert else if passed 1 times within 1 cycle(s) then alert
>> Load avg. (5min)     = if greater than 2.0 1 times within 1 cycle(s) then 
>> alert else if passed 1 times within 1 cycle(s) then alert
>> Load avg. (1min)     = if greater than 4.0 1 times within 1 cycle(s) then 
>> alert else if passed 1 times within 1 cycle(s) then alert
>>
>>Process Name          = apache
>> Group                = server
>> Pid file             = /home/cpt/fgs/www/logs/httpd.pid
>> Monitoring mode      = active
>> Start program        = /home/cpt/fgs/www/bin/apachectl start
>> Stop program         = /home/cpt/fgs/www/bin/apachectl stop
>> Depends on Service   = httpd.bin
>> Depends on Service   = httpd.conf
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>> Port                 = if failed cpt0.carpooltool.com:80 [APACHESTATUS 
>> via TCP] with timeout 5 seconds 1 times within 1 cycle(s) then alert else 
>> if passed 1 times within 1 cycle(s) then alert
>> Port                 = if failed cpt0.carpooltool.com:80/monit.txt [HTTP 
>> via TCP] with timeout 5 seconds 1 times within 1 cycle(s) then alert else 
>> if passed 1 times within 1 cycle(s) then alert
>> CPU usage limit      = if greater than 95.0% 3 times within 3 cycle(s) 
>> then alert else if passed 1 times within 1 cycle(s) then alert
>> Children             = If greater than 255 5 times within 5 cycle(s) then 
>> alert else if passed 1 times within 1 cycle(s) then alert
>> Memory amount limit (incl. children) = If greater than 230400 5 times 
>> within 5 cycle(s) then alert else if passed 1 times within 1 cycle(s) 
>> then alert
>>
>>File Name             = httpd.conf
>> Path                 = /home/cpt/fgs/www/conf/live.httpd.conf
>> Monitoring mode      = active
>> Checksum             = if changed MD5 1 times within 1 cycle(s) then alert
>>
>>File Name             = httpd.bin
>> Path                 = /home/cpt/fgs/www/bin/httpd
>> Monitoring mode      = active
>> Checksum             = if failed be7f833bd23e9003fbc5d313c95acea6(MD5) 1 
>> times within 1 cycle(s) then alert else if passed 1 times within 1 
>> cycle(s) then alert
>>
>>Process Name          = sshd
>> Group                = server
>> Pid file             = /var/run/sshd.pid
>> Monitoring mode      = active
>> Start program        = /etc/init.d/sshd stop
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>> Port                 = if failed localhost:22 [SSH via TCP] with timeout 
>> 5 seconds 1 times within 1 cycle(s) then alert else if passed 1 times 
>> within 1 cycle(s) then alert
>>
>>Process Name      monit: pidfile '/var/run/monit.pid' does not exist
>>Starting monit daemon with http interface at [*:2812]
>>Starting monit HTTP server at [*:2812]
>>monit HTTP server started
>>Monit started
>>Monit instance changed notification is sent to 
>>address@hidden
>>monit: verify_init(): Loaded SSL pem client file 
>>'/usr/local/share/ssl/monit_client.pem'
>>monit: verify_init(): Loaded monit's SSL pem server file 
>>'/usr/local/share/ssl/monit.pem'
>>monit: verify_info(): Found 1 client certificates
>>'cpt0.carpooltool.com' cpu wait usage check passed [current cpu wait 
>>usage=0.0%]
>>'cpt0.carpooltool.com' cpu system usage check passed [current cpu system 
>>usage=0.0%]
>>'cpt0.carpooltool.com' cpu user usage check passed [current cpu user 
>>usage=0.0%]
>>'cpt0.carpooltool.com' mem usage check passed [current mem usage=47.7%]
>>'cpt0.carpooltool.com' loadavg(5min) check passed [current 
>>loadavg(5min)=0.0]
>>'cpt0.carpooltool.com' loadavg(1min) check passed [current 
>>loadavg(1min)=0.0]
>>'httpd.conf' file existence check passed
>>'httpd.conf' is regular file
>>'httpd.conf' checksum has not changed
>>'httpd.bin' file existence check passed
>>'httpd.bin' is regular file
>>'httpd.bin' has valid checksums
>>'sshd' zombie check passed [status_flag=0000]
>>'sshd' succeeded connecting to INET[localhost:22] via TCP
>>'sshd' succeeded testing protocol [SSH] at INET[localhost:22] via TCP
>>'postfix' zombie check passed [status_flag=0000]
>>'postfix' succeeded connecting to INET[localhost:25] via TCP
>>'postfix' succeeded testing protocol [SMTP] at INET[localhost:25] via TCP
>>'postgres' zombie check passed [status_flag=0000]
>>'postgres' succeeded connecting to INET[localhost:5432] via TCP
>>'postgres' succeeded testing protocol [PGSQL] at INET[localhost:5432] via 
>>TCP
>>'fcron' zombie check passed [status_flag=0000]
>>'pathserv_l0' zombie check passed [status_flag=0000]
>>'pathserv_l1' zombie check passed [status_flag=0000]
>>'cpt1' icmp ping succeeded [response time 0.001s]
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded testing protocol [PGSQL] at 
>>INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:22] via TCP
>>'cpt1' succeeded testing protocol [SSH] at INET[cpt1.carpooltool.com:22] 
>>via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:443] via TCPSSL
>>'cpt1' succeeded testing protocol [HTTP] at INET[cpt1.carpooltool.com:443] 
>>via TCPSSL
>>'cpt2' icmp ping succeeded [response time 0.000s]
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:22] via TCP
>>'cpt2' succeeded testing protocol [SSH] at INET[cpt2.carpooltool.com:22] 
>>via TCP
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:443] via TCPSSL
>>'cpt2' succeeded testing protocol [HTTP] at INET[cpt2.carpooltool.com:443] 
>>via TCPSSL
>>'apache' zombie check passed [status_flag=0000]
>>'apache' cpu usage check passed [current cpu usage=0.0%]
>>'apache' children check passed [current children=37]
>>'apache' total mem amount check passed [current total mem amount=64400kB]
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>Scoreboard: 
>>________W_____________________K____K_...........................................................................................................................................................................................................................
>>'apache' succeeded testing protocol [APACHESTATUS] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded testing protocol [HTTP] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>Monit has not changed
>>'cpt0.carpooltool.com' cpu wait usage check passed [current cpu wait 
>>usage=0.0%]
>>'cpt0.carpooltool.com' cpu system usage check passed [current cpu system 
>>usage=24.6%]
>>'cpt0.carpooltool.com' cpu user usage check passed [current cpu user 
>>usage=73.0%]
>>'cpt0.carpooltool.com' mem usage check passed [current mem usage=47.6%]
>>'cpt0.carpooltool.com' loadavg(5min) check passed [current 
>>loadavg(5min)=0.0]
>>'cpt0.carpooltool.com' loadavg(1min) check passed [current 
>>loadavg(1min)=0.0]
>>'httpd.conf' file existence check passed
>>'httpd.conf' is regular file
>>'httpd.conf' checksum has not changed
>>'httpd.bin' file existence check passed
>>'httpd.bin' is regular file
>>'httpd.bin' has valid checksums
>>'sshd' zombie check passed [status_flag=0000]
>>'sshd' PID has not changed since last cycle
>>'sshd' PPID has not changed since last cycle
>>'sshd' succeeded connecting to INET[localhost:22] via TCP
>>'sshd' succeeded testing protocol [SSH] at INET[localhost:22] via TCP
>>'postfix' zombie check passed [status_flag=0000]
>>'postfix' PID has not changed since last cycle
>>'postfix' PPID has not changed since last cycle
>>'postfix' succeeded connecting to INET[localhost:25] via TCP
>>'postfix' succeeded testing protocol [SMTP] at INET[localhost:25] via TCP
>>'postgres' zombie check passed [status_flag=0000]
>>'postgres' PID has not changed since last cycle
>>'postgres' PPID has not changed since last cycle
>>'postgres' succeeded connecting to INET[localhost:5432] via TCP
>>'postgres' succeeded testing protocol [PGSQL] at INET[localhost:5432] via 
>>TCP
>>'fcron' zombie check passed [status_flag=0000]
>>'fcron' PID has not changed since last cycle
>>'fcron' PPID has not changed since last cycle
>>'pathserv_l0' zombie check passed [status_flag=0000]
>>'pathserv_l0' PID has not changed since last cycle
>>'pathserv_l0' PPID has not changed since last cycle
>>'pathserv_l1' zombie check passed [status_flag=0000]
>>'pathserv_l1' PID has not changed since last cycle
>>'pathserv_l1' PPID has not changed since last cycle
>>'cpt1' icmp ping succeeded [response time 0.000s]
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded testing protocol [PGSQL] at 
>>INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:22] via TCP
>>'cpt1' succeeded testing protocol [SSH] at INET[cpt1.carpooltool.com:22] 
>>via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:443] via TCPSSL
>>'cpt1' succeeded testing protocol [HTTP] at INET[cpt1.carpooltool.com:443] 
>>via TCPSSL
>>'cpt2' icmp ping succeeded [response time 0.000s]
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:22] via TCP
>>'cpt2' succeeded testing protocol [SSH] at INET[cpt2.carpooltool.com:22] 
>>via TCP
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:443] via TCPSSL
>>'cpt2' succeeded testing protocol [HTTP] at INET[cpt2.carpooltool.com:443] 
>>via TCPSSL
>>'apache' zombie check passed [status_flag=0000]
>>'apache' PID has not changed since last cycle
>>'apache' PPID has not changed since last cycle
>>'apache' cpu usage check passed [current cpu usage=0.0%]
>>'apache' children check passed [current children=37]
>>'apache' total mem amount check passed [current total mem amount=64400kB]
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>Scoreboard: 
>>________________K__________________W_...........................................................................................................................................................................................................................
>>'apache' succeeded testing protocol [APACHESTATUS] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded testing protocol [HTTP] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>'cpt0.carpooltool.com' cpu wait usage check passed [current cpu wait 
>>usage=1.7%]
>>'cpt0.carpooltool.com' cpu system usage check passed [current cpu system 
>>usage=23.5%]
>>'cpt0.carpooltool.com' cpu user usage check passed [current cpu user 
>>usage=72.3%]
>>'cpt0.carpooltool.com' mem usage check passed [current mem usage=47.6%]
>>'cpt0.carpooltool.com' loadavg(5min) check passed [current 
>>loadavg(5min)=0.0]
>>'cpt0.carpooltool.com' loadavg(1min) check passed [current 
>>loadavg(1min)=0.1]
>>'httpd.conf' file existence check passed
>>'httpd.conf' is regular file
>>'httpd.conf' checksum has not changed
>>'httpd.bin' file existence check passed
>>'httpd.bin' is regular file
>>'httpd.bin' has valid checksums
>>'sshd' zombie check passed [status_flag=0000]
>>'sshd' PID has not changed since last cycle
>>'sshd' PPID has not changed since last cycle
>>'sshd' succeeded connecting to INET[localhost:22] via TCP
>>'sshd' succeeded testing protocol [SSH] at INET[localhost:22] via TCP
>>'postfix' zombie check passed [status_flag=0000]
>>'postfix' PID has not changed since last cycle
>>'postfix' PPID has not changed since last cycle
>>'postfix' succeeded connecting to INET[localhost:25] via TCP
>>'postfix' succeeded testing protocol [SMTP] at INET[localhost:25] via TCP
>>'postgres' zombie check passed [status_flag=0000]
>>'postgres' PID has not changed since last cycle
>>'postgres' PPID has not changed since last cycle
>>'postgres' succeeded connecting to INET[localhost:5432] via TCP
>>'postgres' succeeded testing protocol [PGSQL] at INET[localhost:5432] via 
>>TCP
>>'fcron' zombie check passed [status_flag=0000]
>>'fcron' PID has not changed since last cycle
>>'fcron' PPID has not changed since last cycle
>>'pathserv_l0' zombie check passed [status_flag=0000]
>>'pathserv_l0' PID has not changed since last cycle
>>'pathserv_l0' PPID has not changed since last cycle
>>'pathserv_l1' zombie check passed [status_flag=0000]
>>'pathserv_l1' PID has not changed since last cycle
>>'pathserv_l1' PPID has not changed since last cycle
>>'cpt1' icmp ping succeeded [response time 0.000s]
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded testing protocol [PGSQL] at 
>>INET[cpt1.carpooltool.com:5432] via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:22] via TCP
>>'cpt1' succeeded testing protocol [SSH] at INET[cpt1.carpooltool.com:22] 
>>via TCP
>>'cpt1' succeeded connecting to INET[cpt1.carpooltool.com:443] via TCPSSL
>>'cpt1' succeeded testing protocol [HTTP] at INET[cpt1.carpooltool.com:443] 
>>via TCPSSL
>>'cpt2' icmp ping succeeded [response time 0.000s]
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:22] via TCP
>>'cpt2' succeeded testing protocol [SSH] at INET[cpt2.carpooltool.com:22] 
>>via TCP
>>'cpt2' succeeded connecting to INET[cpt2.carpooltool.com:443] via TCPSSL
>>'cpt2' succeeded testing protocol [HTTP] at INET[cpt2.carpooltool.com:443] 
>>via TCPSSL
>>'apache' zombie check passed [status_flag=0000]
>>'apache' PID has not changed since last cycle
>>'apache' PPID has not changed since last cycle
>>'apache' cpu usage check passed [current cpu usage=0.0%]
>>'apache' children check passed [current children=37]
>>'apache' total mem amount check passed [current total mem amount=64400kB]
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>Scoreboard: 
>>_W___________________K____K__________...........................................................................................................................................................................................................................
>>'apache' succeeded testing protocol [APACHESTATUS] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded connecting to INET[cpt0.carpooltool.com:80] via TCP
>>'apache' succeeded testing protocol [HTTP] at 
>>INET[cpt0.carpooltool.com:80] via TCP
>>Shutting down monit HTTP server
>>monit HTTP server stopped
>>monit daemon with pid [4893] killed
>>Monit stopped
>>Monit instance changed notification is sent to 
>>address@hidden
>>    = postfix
>> Group                = server
>> Pid file             = /var/spool/postfix/pid/master.pid
>> Monitoring mode      = active
>> Start program        = /etc/init.d/postfix start
>> Stop program         = /etc/init.d/postfix stop
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>> Port                 = if failed localhost:25 [SMTP via TCP] with timeout 
>> 5 seconds 1 times within 1 cycle(s) then alert else if passed 1 times 
>> within 1 cycle(s) then alert
>>
>>Process Name          = postgres
>> Group                = server
>> Pid file             = /var/lib/postgresql/data/postmaster.pid
>> Monitoring mode      = active
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>> Port                 = if failed localhost:5432 [PGSQL via TCP] with 
>> timeout 5 seconds 1 times within 1 cycle(s) then alert else if passed 1 
>> times within 1 cycle(s) then alert
>>
>>Process Name          = fcron
>> Group                = server
>> Pid file             = /var/run/fcron.pid
>> Monitoring mode      = active
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>>
>>Process Name          = pathserv_l0
>> Group                = server
>> Pid file             = 
>> /home/cpt/DEV/source/et/util/route/astar/remoting/dev5/L0/BINDER_PID
>> Monitoring mode      = active
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>>
>>Process Name          = pathserv_l1
>> Group                = server
>> Pid file             = 
>> /home/cpt/DEV/source/et/util/route/astar/remoting/dev5/L1/BINDER_PID
>> Monitoring mode      = active
>> Pid                  = if changed 1 times within 1 cycle(s) then alert
>> Ppid                 = if changed 1 times within 1 cycle(s) then alert
>>
>>Remote Host Name      = cpt1
>> Monitoring mode      = active
>> ICMP                 = if failed Echo Request count 5 with timeout 15 
>> seconds 1 times within 1 cycle(s) then alert else if passed 1 times 
>> within 1 cycle(s) then alert
>> Port                 = if failed cpt1.carpooltool.com:5432 [PGSQL via 
>> TCP] with timeout 15 seconds 1 times within 1 cycle(s) then alert else if 
>> passed 1 times within 1 cycle(s) then alert
>> Port                 = if failed cpt1.carpooltool.com:22 [SSH via TCP] 
>> with timeout 15 seconds 1 times within 1 cycle(s) then alert else if 
>> passed 1 times within 1 cycle(s) then alert
>> Port                 = if failed cpt1.carpooltool.com:443/monit.txt [HTTP 
>> via TCPSSL] with timeout 5 seconds 1 times within 1 cycle(s) then alert 
>> else if passed 1 times within 1 cycle(s) then alert
>>
>>Remote Host Name      = cpt2
>> Monitoring mode      = active
>> ICMP                 = if failed Echo Request count 5 with timeout 15 
>> seconds 1 times within 1 cycle(s) then alert else if passed 1 times 
>> within 1 cycle(s) then alert
>> Port                 = if failed cpt2.carpooltool.com:22 [SSH via TCP] 
>> with timeout 15 seconds 1 times within 1 cycle(s) then alert else if 
>> passed 1 times within 1 cycle(s) then alert
>> Port                 = if failed cpt2.carpooltool.com:443/monit.txt [HTTP 
>> via TCPSSL] with timeout 5 seconds 1 times within 1 cycle(s) then alert 
>> else if passed 1 times within 1 cycle(s) then alert
>>
>>-------------------------------------------------------------------------------
>>
>>
>>------------------------------------------------------------------------
>>
>>top - 15:02:29 up 151 days,  7:02,  2 users,  load average: 0.00, 0.02, 
>>0.00
>>Tasks: 131 total,   1 running, 130 sleeping,   0 stopped,   0 zombie
>>Cpu(s):  0.6% us,  0.1% sy,  0.0% ni, 99.2% id,  0.1% wa,  0.0% hi,  0.0% 
>>si
>>Mem:   2074616k total,  2055692k used,    18924k free,        0k buffers
>>Swap:  7815544k total,   134972k used,  7680572k free,  1067136k cached
>>
>>  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND       
>>    1 root      16   0  1336  472  424 S    0  0.0   0:29.48 init          
>>    2 root      RT   0     0    0    0 S    0  0.0   0:08.03 migration/0   
>>    3 root      34  19     0    0    0 S    0  0.0   0:01.57 ksoftirqd/0   
>>    4 root      RT   0     0    0    0 S    0  0.0   0:02.38 migration/1   
>>    5 root      34  19     0    0    0 S    0  0.0   0:02.10 ksoftirqd/1   
>>    6 root      RT   0     0    0    0 S    0  0.0   0:03.83 migration/2   
>>    7 root      34  19     0    0    0 S    0  0.0   0:04.75 ksoftirqd/2   
>>    8 root      RT   0     0    0    0 S    0  0.0   0:02.29 migration/3   
>>    9 root      34  19     0    0    0 S    0  0.0   0:00.95 ksoftirqd/3   
>>   10 root       5 -10     0    0    0 S    0  0.0   0:00.01 events/0      
>>   11 root       5 -10     0    0    0 S    0  0.0   0:00.00 events/1      
>>   12 root       5 -10     0    0    0 S    0  0.0   0:00.00 events/2      
>>   13 root       5 -10     0    0    0 S    0  0.0   0:00.00 events/3      
>>   14 root       9 -10     0    0    0 S    0  0.0   0:00.00 khelper       
>>   30 root       5 -10     0    0    0 S    0  0.0   0:00.00 kacpid        
>>  137 root       5 -10     0    0    0 S    0  0.0   0:07.71 kblockd/0     
>>  138 root       5 -10     0    0    0 S    0  0.0   0:00.48 kblockd/1     
>>  139 root       5 -10     0    0    0 S    0  0.0   0:00.13 kblockd/2     
>>  140 root       5 -10     0    0    0 S    0  0.0   0:00.12 kblockd/3     
>>  226 root       6 -10     0    0    0 S    0  0.0   0:00.00 aio/0         
>>  225 root      16   0     0    0    0 S    0  0.0   4:10.93 kswapd0       
>>  227 root       5 -10     0    0    0 S    0  0.0   0:00.00 aio/1         
>>  228 root       5 -10     0    0    0 S    0  0.0   0:00.00 aio/2         
>>  229 root       6 -10     0    0    0 S    0  0.0   0:00.00 aio/3         
>>  230 root       5 -10     0    0    0 S    0  0.0   0:49.59 xfslogd/0     
>>  231 root       5 -10     0    0    0 S    0  0.0   0:00.00 xfslogd/1     
>>  232 root       7 -10     0    0    0 S    0  0.0   0:00.00 xfslogd/2     
>>  233 root      12 -10     0    0    0 S    0  0.0   0:00.00 xfslogd/3     
>>  234 root       5 -10     0    0    0 S    0  0.0   0:00.00 xfsdatad/0    
>>  235 root       5 -10     0    0    0 S    0  0.0   0:00.00 xfsdatad/1    
>>  236 root       7 -10     0    0    0 S    0  0.0   0:00.00 xfsdatad/2    
>>  237 root      12 -10     0    0    0 S    0  0.0   0:00.00 xfsdatad/3    
>>  238 root      15   0     0    0    0 S    0  0.0   0:13.37 xfsbufd       
>>  315 root       6 -10     0    0    0 S    0  0.0   0:03.25 vesafb        
>>  322 root      25   0     0    0    0 S    0  0.0   0:00.00 kseriod       
>>  405 root       5 -10     0    0    0 S    0  0.0   0:00.00 ata/0         
>>  406 root       5 -10     0    0    0 S    0  0.0   0:00.00 ata/1         
>>  407 root       5 -10     0    0    0 S    0  0.0   0:00.00 ata/2         
>>  408 root       5 -10     0    0    0 S    0  0.0   0:00.00 ata/3         
>>  410 root      16   0     0    0    0 S    0  0.0   0:00.00 scsi_eh_0     
>>  431 root      15   0     0    0    0 S    0  0.0   0:00.00 kirqd         
>>  434 root      15   0     0    0    0 S    0  0.0   2:44.55 md2_raid1     
>>  436 root      15   0     0    0    0 S    0  0.0   0:00.00 md1_raid1     
>>  437 root      15   0     0    0    0 S    0  0.0   0:00.00 md0_raid1     
>>  605 root      15   0     0    0    0 S    0  0.0   0:00.00 khubd         
>> 1181 root      15   0     0    0    0 S    0  0.0   0:12.57 xfssyncd      
>> 1327 root      16   0  1696  976  624 S    0  0.0   0:18.99 devfsd        
>> 6194 root      15   0  1688  828  548 S    0  0.0   2:55.65 syslog-ng     
>> 6331 rpc       16   0  1568  612  516 S    0  0.0   0:00.00 portmap       
>> 6342 root      19   0  2544 1096  980 S    0  0.1   0:00.00 famd          
>> 6443 root      17   0  1548  672  576 S    0  0.0   0:08.71 fcron         
>> 6664 root      15   0  3028 1408 1180 S    0  0.1   2:27.27 sshd          
>> 6796 root      17   0  1468  644  564 S    0  0.0   0:00.01 agetty        
>> 6797 root      17   0  1468  644  564 S    0  0.0   0:00.00 agetty        
>> 6798 root      16   0  1468  644  564 S    0  0.0   0:00.00 agetty        
>> 6799 root      16   0  1468  644  564 S    0  0.0   0:00.00 agetty        
>> 6800 root      16   0  1468  644  564 S    0  0.0   0:00.00 agetty        
>> 6801 root      16   0  1468  644  564 S    0  0.0   0:00.00 agetty        
>> 7349 root      16   0  2472 1168  916 S    0  0.1   0:00.16 httpd         
>>29066 root      15   0  1696  840  552 S    0  0.0   2:20.31 syslog-ng     
>>11853 root      16   0  4164 1448 1172 S    0  0.1   2:10.93 master        
>>11856 postfix   16   0  5324 2560 1228 S    0  0.1   1:19.32 qmgr          
>>10093 postgres  16   0  261m 4732 4380 S    0  0.2   1:44.38 postmaster    
>>10096 postgres  16   0  6844 2836 1496 S    0  0.1   5:24.97 postmaster    
>>10097 postgres  15   0  5852 1920 1504 S    0  0.1   3:36.27 postmaster    
>>25880 root      15   0     0    0    0 S    0  0.0   0:43.92 pdflush       
>> 1408 root      15   0     0    0    0 S    0  0.0   0:27.82 pdflush       
>> 6196 cpt       16   0 48896  43m 1224 S    0  2.1   0:03.00 pathserv      
>> 6198 cpt       16   0  188m 161m 1224 S    0  7.9   0:17.84 pathserv      
>> 6200 cpt       16   0 11964 9816 1224 S    0  0.5   0:01.22 pathserv      
>> 6202 cpt       16   0 23376  17m 1224 S    0  0.9   0:02.75 pathserv      
>> 6568 cpt       16   0 48896  43m 1224 S    0  2.1   0:00.01 pathserv      
>> 6706 apache    16   0  2848 1804 1124 S    0  0.1   0:00.68 httpd         
>> 6707 apache    16   0  2704 1628 1128 S    0  0.1   0:00.70 httpd         
>> 6708 apache    16   0  2856 1812 1124 S    0  0.1   0:00.70 httpd         
>> 6709 apache    16   0  2848 1808 1128 S    0  0.1   0:00.67 httpd         
>> 6710 apache    16   0  2848 1808 1128 S    0  0.1   0:00.67 httpd         
>> 6711 apache    16   0  2848 1804 1124 S    0  0.1   0:00.83 httpd         
>> 6712 apache    15   0  2704 1620 1128 S    0  0.1   0:00.67 httpd         
>> 6713 apache    15   0  2704 1620 1128 S    0  0.1   0:00.68 httpd         
>> 6714 apache    16   0  2856 1812 1124 S    0  0.1   0:00.73 httpd         
>> 6715 apache    15   0  2704 1628 1128 S    0  0.1   0:00.74 httpd         
>> 6716 apache    16   0  2704 1624 1124 S    0  0.1   0:00.75 httpd         
>> 6717 apache    15   0  2704 1620 1128 S    0  0.1   0:00.73 httpd         
>> 6718 apache    15   0  2704 1616 1124 S    0  0.1   0:00.65 httpd         
>> 6719 apache    16   0  2848 1808 1128 S    0  0.1   0:01.61 httpd         
>> 6720 apache    16   0  2856 1816 1128 S    0  0.1   0:00.78 httpd         
>> 6721 apache    15   0  2856 1832 1144 S    0  0.1   0:00.67 httpd         
>> 6722 apache    16   0  2704 1620 1128 S    0  0.1   0:02.27 httpd         
>> 6723 apache    15   0  2704 1620 1128 S    0  0.1   0:00.78 httpd         
>> 6724 apache    16   0  2856 1812 1124 S    0  0.1   0:00.71 httpd         
>> 6725 apache    16   0  2848 1808 1128 S    0  0.1   0:00.64 httpd         
>> 6731 apache    16   0  2704 1624 1124 S    0  0.1   0:00.70 httpd         
>> 6759 apache    15   0  2856 1816 1128 S    0  0.1   0:00.79 httpd         
>> 6814 apache    15   0  2704 1624 1124 S    0  0.1   0:00.66 httpd         
>> 6822 apache    16   0  2704 1624 1128 S    0  0.1   0:00.75 httpd         
>> 6904 apache    16   0  2704 1620 1128 S    0  0.1   0:00.74 httpd         
>> 7089 cpt       16   0 11964 9816 1224 S    0  0.5   0:00.01 pathserv      
>> 7392 cpt       16   0 23376  17m 1224 S    0  0.9   0:00.00 pathserv      
>> 7815 apache    16   0  2848 1808 1128 S    0  0.1   0:00.65 httpd         
>>10108 apache    15   0  2704 1628 1128 S    0  0.1   0:00.70 httpd         
>>10109 apache    16   0  2704 1624 1124 S    0  0.1   0:00.61 httpd         
>>10189 apache    15   0  2856 1816 1128 S    0  0.1   0:00.65 httpd         
>>17672 apache    16   0  2848 1804 1124 S    0  0.1   0:00.50 httpd         
>>22100 apache    15   0  2856 1812 1124 S    0  0.1   0:00.40 httpd         
>>22101 apache    16   0  2704 1628 1128 S    0  0.1   0:00.45 httpd         
>>22102 apache    16   0  2704 1620 1128 S    0  0.1   0:00.43 httpd         
>>22103 apache    16   0  2848 1804 1124 S    0  0.1   0:00.42 httpd         
>>22104 apache    15   0  2704 1620 1128 S    0  0.1   0:00.47 httpd         
>>22105 apache    16   0  2704 1620 1128 S    0  0.1   0:00.40 httpd         
>>22106 apache    15   0  2704 1620 1128 S    0  0.1   0:00.44 httpd         
>> 8471 cpt       16   0  188m 161m 1224 S    0  7.9   0:00.00 pathserv      
>>17089 root      15   0  5868 1844 1508 S    0  0.1   0:00.01 sshd          
>>17093 cpt       16   0  5868 1900 1556 S    0  0.1   0:00.43 sshd          
>>17094 cpt       15   0  2684 1328 1056 S    0  0.1   0:00.00 bash          
>>17107 root      16   0  2064  952  764 S    0  0.0   0:00.00 su            
>>17108 root      16   0  2180 1312 1056 S    0  0.1   0:00.00 bash          
>>17121 root      16   0  4496 3464 1592 S    0  0.2   0:01.08 zsh           
>>32565 root      15   0  5872 1848 1508 S    0  0.1   0:00.00 sshd          
>>32569 cpt       16   0  5876 1916 1556 S    0  0.1   0:00.03 sshd          
>>32570 cpt       16   0  2560 1328 1056 S    0  0.1   0:00.01 bash          
>> 3422 postfix   17   0  4208 1476 1192 S    0  0.1   0:00.00 pickup        
>> 3624 postfix   16   0  4196 1428 1160 S    0  0.1   0:00.00 proxymap      
>> 5372 postfix   15   0  4216 1460 1180 S    0  0.1   0:00.00 trivial-rewrite
>> 5385 root      16   0  6692 2236 1764 S    0  0.1   0:00.11 monit
>> 5386 root      16   0  6692 2236 1764 S    0  0.1   0:00.00 monit
>> 5387 root      16   0  6692 2236 1764 S    0  0.1   0:00.01 monit
>> 5589 postfix   17   0  4504 1792 1476 S    0  0.1   0:00.00 smtpd
>> 5621 root      16   0  4448 1600 1312 S    0  0.1   0:00.00 sshd          
>> 5622 sshd      15   0  4380 1552 1280 S    0  0.1   0:00.00 sshd          
>> 5623 root      15   0  1984  988  740 R    0  0.0   0:00.00 top           
>>
>>
>>
>>------------------------------------------------------------------------
>>
>>Linux 2.6.10-gentoo-r6 (cpt0)   01/10/07
>>
>>15:04:03     CPU   %user   %nice %system %iowait    %irq   %soft   %idle   intr/s
>>15:06:03     all    0.25    0.00    0.09    0.00    0.00    0.01   99.64  1022.66
>>15:06:03       0    0.15    0.00    0.15    0.00    0.00    0.02   99.68  1002.38
>>15:06:03       1    0.01    0.00    0.02    0.00    0.00    0.00   99.97     0.00
>>15:06:03       2    0.58    0.00    0.13    0.00    0.01    0.02   99.23    20.28
>>15:06:03       3    0.28    0.00    0.07    0.00    0.00    0.00   99.64     0.00
>>
>>15:06:03     CPU   %user   %nice %system %iowait    %irq   %soft   %idle   intr/s
>>15:08:03     all    0.41    0.00    0.14    0.00    0.00    0.01   99.43  1046.03
>>15:08:03       0    0.31    0.00    0.23    0.00    0.00    0.01   99.45  1002.84
>>15:08:03       1    0.02    0.00    0.04    0.00    0.00    0.00   99.93     0.00
>>15:08:03       2    1.05    0.00    0.23    0.00    0.02    0.04   98.66    43.19
>>15:08:03       3    0.25    0.00    0.07    0.00    0.00    0.00   99.68     0.00
>>
>>15:08:03     CPU   %user   %nice %system %iowait    %irq   %soft   %idle   intr/s
>>15:10:03     all    1.02    0.00    0.34    0.02    0.01    0.04   98.58  1119.04
>>15:10:03       0    0.77    0.00    0.45    0.10    0.01    0.02   98.66  1004.23
>>15:10:03       1    0.12    0.00    0.12    0.01    0.00    0.01   99.77     0.00
>>15:10:03       2    2.77    0.00    0.63    0.00    0.02    0.12   96.46   114.81
>>15:10:03       3    0.42    0.00    0.14    0.00    0.00    0.00   99.43     0.00
>>
>>Average:     CPU   %user   %nice %system %iowait    %irq   %soft   %idle   intr/s
>>Average:     all    0.56    0.00    0.19    0.01    0.00    0.02   99.22  1062.58
>>Average:       0    0.41    0.00    0.28    0.03    0.00    0.01   99.26  1003.15
>>Average:       1    0.05    0.00    0.06    0.00    0.00    0.00   99.89     0.00
>>Average:       2    1.47    0.00    0.33    0.00    0.02    0.06   98.12    59.42
>>Average:       3    0.32    0.00    0.09    0.00    0.00    0.00   99.59     0.00
>>
>>
>>------------------------------------------------------------------------
>>
>>--
>>To unsubscribe:
>>http://lists.nongnu.org/mailman/listinfo/monit-general

-- 
address@hidden | http://theorem.ca/~mvcorks/
Calvin: Did you know that's one of the ten warning signs of hopeless
  dweebism?

Attachment: proc_stat_sleep_120_proc_stat.txt
Description: Text document

