Re: monit deadlocks

From: Jan-Henrik Haukeland
Subject: Re: monit deadlocks
Date: Thu, 20 Oct 2005 13:45:27 +0200

On second thought, don't send the monit binary. Do send us the output of configure, though, and please do the following so we can get more information:

1) Start a monit binary that still contains debug information. That is, do NOT use the monit binary installed by 'make install', since the install process strips all debug information. Instead, copy and run the monit binary you built in the monit-4.6 source directory.

2) Attach gdb to the running monit process and issue the gdb command "thread apply all bt", which prints a backtrace for every thread. Send us the output of this command.

Here's an example trace showing how to do this:

address@hidden root]# gdb /usr/bin/monit `cat /var/run/`
GNU gdb Red Hat Linux (6.0post-0.20040223.19rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db library "/lib/tls/".

Attaching to program: /usr/bin/monit, process 3190
Reading symbols from /lib/tls/
[Thread debugging using libthread_db enabled]
[New Thread -1208024960 (LWP 3190)]
[New Thread -1208026192 (LWP 3191)]
Reading symbols <....>
A program is being debugged already.  Kill it? (y or n) n
Program not killed.
(gdb) thread apply all bt

Thread 2 (Thread -1208026192 (LWP 3191)):
#0  0x003267a2 in _dl_sysinfo_int80 () from /lib/
#1  0x003f3491 in ___newselect_nocancel () from /lib/tls/
#2  0x08050d69 in can_read (socket=5, timeout=-514) at net.c:475
#3  0x080605bd in socket_producer (server=4, port=2812, sslserver=0x0)
    at http/engine.c:647
#4  0x0805fe4f in start_httpd (port=2812, backlog=10,
    bindAddr=0xa088ca8 "") at http/engine.c:195
#5  0x0804e9b1 in thread_wrapper (arg=0x0) at http.c:160
#6  0x06b3c98c in start_thread () from /lib/tls/
#7  0x003fa7da in clone () from /lib/tls/

Thread 1 (Thread -1208024960 (LWP 3190)):
#0  0x003267a2 in _dl_sysinfo_int80 () from /lib/
#1  0x003c57f6 in __nanosleep_nocancel () from /lib/tls/
#2  0x003c561f in sleep () from /lib/tls/
#3  0x080503f0 in do_default () at monitor.c:472
#4  0x0804fdbf in main (argc=1, argv=0xbfed72c4) at monitor.c:118
#0  0x003267a2 in _dl_sysinfo_int80 () from /lib/
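
If an interactive session is inconvenient, the same backtrace can probably be captured in one step. The following is only a sketch: the function name dump_threads, the GDB override variable and the output file name are made up for illustration, and it assumes a gdb that accepts the -batch and -ex options (current releases do; the gdb 6.0 shown in the trace may not).

```shell
#!/bin/sh
# Sketch: write "thread apply all bt" for a running process to a file
# without opening an interactive gdb prompt.
#
#   $1  path to the unstripped binary (the one from the source directory)
#   $2  pid of the running process
#   $3  file that receives gdb's output
#
# GDB is a hypothetical override for the gdb executable; it defaults to "gdb".
dump_threads() {
    # "gdb program pid" attaches to the process, -ex runs the command once
    # attached, and -batch makes gdb exit afterwards instead of prompting.
    "${GDB:-gdb}" -batch -ex "thread apply all bt" "$1" "$2" > "$3" 2>&1
}

# Example call (3190 is the pid from the trace; substitute your own):
#   dump_threads ./monit 3190 monit-backtrace.txt
```

The resulting file can then be attached to a mail as-is. Run it as a user allowed to attach to the monit process, typically root.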

Jan-Henrik Haukeland
Mobil +47 97141255

