bug#34033: Offloading sometimes hangs
From: Mathieu Othacehe
Subject: bug#34033: Offloading sometimes hangs
Date: Thu, 02 Jul 2020 16:20:23 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/26.3 (gnu/linux)
Hello,
> (That still doesn’t tell us why our ‘guix offload’ processes would
> occasionally be stuck but at least it ensures the build farm keeps
> making progress even when that happens.)
I'm still not sure it's directly related to this bug, but I observed
several offloading hangs on Berlin today.
For instance, in the Cuirass logs:
--8<---------------cut here---------------start------------->8---
2020-07-02T09:59:45 '/gnu/store/rm8ndiichxhwybaizis5pgck77952ilp-halt.drv' offloaded to '141.80.167.164'
2020-07-02T09:54:30 '/gnu/store/dxczkbf5wa6qr37gm7wr995hcxs8s0ya-motion-4.2.2.drv' offloaded to '141.80.167.170'
--8<---------------cut here---------------end--------------->8---
Those two builds were offloaded around 10:00 today, and there is still no
report from them at 16:00.
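To spot such cases without reading the logs by hand, something along these lines could help. It is only a rough sketch, not code from Cuirass: it assumes each "offloaded to" entry sits on a single log line in the format shown above, that timestamps are naive local times, and that the set of derivations that already reported back (`reported`, a hypothetical input) is obtained some other way.

```python
import re
from datetime import datetime, timedelta

# Matches lines such as:
#   2020-07-02T09:59:45 '/gnu/store/...-halt.drv' offloaded to '141.80.167.164'
OFFLOAD_RE = re.compile(r"^(\S+)\s+'([^']+)'\s+offloaded to\s+'([^']+)'$")

# The two log entries quoted above, one entry per line (an assumption).
SAMPLE_LOG = [
    "2020-07-02T09:59:45 '/gnu/store/rm8ndiichxhwybaizis5pgck77952ilp-halt.drv'"
    " offloaded to '141.80.167.164'",
    "2020-07-02T09:54:30 '/gnu/store/dxczkbf5wa6qr37gm7wr995hcxs8s0ya-motion-4.2.2.drv'"
    " offloaded to '141.80.167.170'",
]


def parse_offloads(lines):
    """Return a list of (timestamp, derivation, host) for offload log lines."""
    offloads = []
    for line in lines:
        match = OFFLOAD_RE.match(line.strip())
        if match:
            stamp, drv, host = match.groups()
            offloads.append(
                (datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S"), drv, host))
    return offloads


def stale_offloads(offloads, reported, now, max_age):
    """Return (derivation, host) pairs offloaded more than max_age ago
    whose derivation is not in the 'reported' set (hypothetical input)."""
    return [(drv, host) for stamp, drv, host in offloads
            if drv not in reported and now - stamp > max_age]


if __name__ == "__main__":
    now = datetime(2020, 7, 2, 16, 0, 0)
    for drv, host in stale_offloads(parse_offloads(SAMPLE_LOG), set(),
                                    now, timedelta(hours=1)):
        print(host, drv)
```

With nothing reported back and a one-hour threshold, both entries above are flagged, together with the build machine each one was sent to.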
On 141.80.167.164 there's a matching build log:
--8<---------------cut here---------------start------------->8---
-rw-r--r-- 1 root root 1735 Jul 2 10:00 /var/log/guix/drvs/rm/8ndiichxhwybaizis5pgck77952ilp-halt.drv.bz2
--8<---------------cut here---------------end--------------->8---
Same on 141.80.167.170:
--8<---------------cut here---------------start------------->8---
-rw-r--r-- 1 root root 6344 Jul 2 09:56 /var/log/guix/drvs/dx/czkbf5wa6qr37gm7wr995hcxs8s0ya-motion-4.2.2.drv.bz2
--8<---------------cut here---------------end--------------->8---
Having those builds "unfinished" keeps the rest of the evaluation
hanging. Running this SQL command in the Cuirass database:
--8<---------------cut here---------------start------------->8---
sqlite> select derivation, datetime(starttime, 'unixepoch', 'localtime'), stoptime from Builds where status=-1 and evaluation=14771;
/gnu/store/ncp59nyidli4lm3ff2lkfjym25yb18j5-guix-1.1.0-14.5bd8033.drv|2020-07-02 09:33:04|0
/gnu/store/rm8ndiichxhwybaizis5pgck77952ilp-halt.drv|2020-07-02 09:59:28|0
/gnu/store/71wnjgm2waqgw3fqmxmc4r3f1ifd1l92-cups-test.drv|2020-07-02 10:00:26|0
/gnu/store/9qsqd7jfwnaw9sm323y45cwymn98kyjl-exim-test.drv|2020-07-02 10:00:51|0
/gnu/store/vhcww4fw4qxw0hl1009npd26b22gfj3c-bitlbee-test.drv|2020-07-02 10:00:24|0
/gnu/store/92jrd6dfzgdifr107hwi64s8hf4mls47-iptables.drv|2020-07-02 09:59:49|0
/gnu/store/380nq6sjphd0agrvl43sr6ypli1yraz4-gnunet-0.12.2.drv|2020-07-02 09:51:32|0
/gnu/store/lqs22nbc6vy2z2524rmkcsmbh5mllm62-cuirass-0.0.1-37.882393d.drv|2020-07-02 10:34:37|0
/gnu/store/dxczkbf5wa6qr37gm7wr995hcxs8s0ya-motion-4.2.2.drv|2020-07-02 09:54:02|0
/gnu/store/5ln3r997ycr7rd6fqahd2d426mjw0rxb-gzochi-0.12.drv|2020-07-02 09:53:51|0
--8<---------------cut here---------------end--------------->8---
shows that the evaluation has been pending pretty much since 10:00.
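The same check could be scripted so that stuck evaluations get noticed earlier. Below is a minimal sketch using Python's sqlite3 module. It reuses the table and column names from the query above (Builds, derivation, starttime, stoptime, status, evaluation); the demo schema, the sample rows and the age threshold are assumptions made up for illustration, not the real Cuirass schema or data.

```python
import sqlite3


def pending_builds(conn, evaluation, now, max_age_seconds):
    """Return (derivation, age_in_seconds) for builds of the given
    evaluation that have status -1 and stoptime 0, and that started
    more than max_age_seconds before 'now' (a Unix timestamp)."""
    rows = conn.execute(
        "SELECT derivation, starttime FROM Builds "
        "WHERE status = -1 AND stoptime = 0 AND evaluation = ? "
        "ORDER BY starttime",
        (evaluation,),
    ).fetchall()
    return [(drv, now - start) for drv, start in rows
            if now - start > max_age_seconds]


def make_demo_db():
    """Build an in-memory database with a hypothetical, reduced schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE Builds (derivation TEXT, starttime INTEGER, "
        "stoptime INTEGER, status INTEGER, evaluation INTEGER)")
    conn.executemany(
        "INSERT INTO Builds VALUES (?, ?, ?, ?, ?)",
        [
            # Made-up rows; the store paths are placeholders.
            ("/gnu/store/aaa-halt.drv", 1000, 0, -1, 14771),
            ("/gnu/store/bbb-motion.drv", 700, 0, -1, 14771),
            ("/gnu/store/ccc-done.drv", 500, 900, 0, 14771),
            ("/gnu/store/ddd-other.drv", 100, 0, -1, 1),
        ])
    return conn


if __name__ == "__main__":
    conn = make_demo_db()
    for drv, age in pending_builds(conn, 14771, now=5000,
                                   max_age_seconds=3600):
        print(f"{drv} pending for {age}s")
```

Against the real database one would pass the current Unix time as `now` and open the Cuirass database file read-only instead of the in-memory demo.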
According to the Cuirass logs again, all those builds were offloaded.
"/gnu/store/380nq6sjphd0agrvl43sr6ypli1yraz4-gnunet-0.12.2.drv",
"/gnu/store/lqs22nbc6vy2z2524rmkcsmbh5mllm62-cuirass-0.0.1-37.882393d.drv"
and "/gnu/store/5ln3r997ycr7rd6fqahd2d426mjw0rxb-gzochi-0.12.drv"
are reported as failed, and all the others are still hanging.
Something is going wrong here! I'll keep investigating.
Thanks,
Mathieu