qemu-discuss

Re: [Qemu-discuss] Latest Qemu-COLO Segfaults


From: Lukas Straub
Subject: Re: [Qemu-discuss] Latest Qemu-COLO Segfaults
Date: Tue, 5 Jun 2018 22:08:09 +0200

Hi Zhang,
I have tested your latest patch, and the crash is gone and the secondary
now synchronizes fine. So far so good. :)

Sadly, however, I found more problems:
First, COLO seems to have problems comparing the packets sent by the VMs,
as it does a lot of checkpoints even during simple network
operations. A ping to the local LAN, for example, takes around 200-4000 ms,
and ssh is unusable because input lags almost 10 seconds behind. I also get
lots of messages on the primary qemu; see the attached log.

Second, I couldn't really fail over. While the secondary VM continues
to run when the primary fails, it is unable to send or receive network
packets. Also, when executing { "execute": "x-colo-lost-heartbeat" } (is
this still needed for failover?), the secondary aborts:

qemu-system-x86_64: Can't receive COLO message: Input/output error
{"timestamp": {"seconds": 1528226730, "microseconds": 287902}, "event":
"COLO_EXIT", "data": {"mode": "secondary", "reason": "error"}}
qemu-system-x86_64: Unable to connect character device red0: Failed to
connect socket: Connection refused
{ "execute": "x-colo-lost-heartbeat" }
{"return": {}}
{"timestamp": {"seconds": 1528226733, "microseconds": 218936}, "event":
"STOP"}
qemu-system-x86_64: Conflicts with use by a block device as 'root',
which does not allow 'resize' on node0
qemu-system-x86_64: 
free(): invalid size


Third, periodic mode (Remus-like, without all the
comparing/filters/rewriters) doesn't seem to work: the primary crashes
at the first STOP:

{"timestamp": {"seconds": 1528228672, "microseconds": 407981}, "event":
"RESUME"}
{"timestamp": {"seconds": 1528228690, "microseconds": 230970}, "event":
"STOP"}
qemu-system-x86_64: util/qemu-thread-posix.c:64: qemu_mutex_lock_impl:
Assertion `mutex->initialized' failed.

I use the attached scripts to test qemu-COLO (simply copy-pasting
the echo output into the QMP monitor).
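
For anyone who would rather not paste by hand, the same commands could
probably be sent from a small script. This is only a rough sketch, not
what the attached scripts do: it assumes QEMU was started with a QMP
server on a Unix socket (for example -qmp unix:/tmp/qmp-secondary.sock,server,nowait,
where the path is made up), and qmp_run is a hypothetical helper name.
It relies only on the documented QMP handshake: read the greeting, send
qmp_capabilities, then send commands as one JSON object per line.

```python
import json
import socket


def qmp_run(sock, commands):
    """Run QMP commands on an already connected socket.

    `commands` is a list of (name, arguments) tuples; `arguments` may be
    None. Returns the list of replies (the "return"/"error" objects).
    Asynchronous events that arrive before a reply are skipped.
    """
    replies = []
    with sock.makefile("rw", encoding="utf-8") as f:

        def send(name, arguments=None):
            msg = {"execute": name}
            if arguments:
                msg["arguments"] = arguments
            f.write(json.dumps(msg) + "\n")
            f.flush()
            while True:
                line = f.readline()
                if not line:
                    raise ConnectionError("QMP connection closed")
                reply = json.loads(line)
                if "event" in reply:
                    continue  # e.g. STOP, RESUME, COLO_EXIT
                return reply

        greeting = json.loads(f.readline())
        if "QMP" not in greeting:
            raise RuntimeError("unexpected QMP greeting: %r" % (greeting,))
        # Commands are rejected until capabilities negotiation is done.
        send("qmp_capabilities")
        for name, arguments in commands:
            replies.append(send(name, arguments))
    return replies


# Usage (socket path is an assumption, adjust to your -qmp option):
#   s = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
#   s.connect("/tmp/qmp-secondary.sock")
#   print(qmp_run(s, [("x-colo-lost-heartbeat", None)]))
```

Note that this skips events instead of logging them, so output like the
STOP and COLO_EXIT lines above would have to be collected separately.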

I'm very glad that you are working on Qemu-COLO.

Many Thanks,
Lukas Straub


On Mon, 4 Jun 2018 00:35:55 +0800
Zhang Chen <address@hidden> wrote:

> Hi Lukas,
> 
> This bug has been fixed in my latest patch.
> Please retest it.
> https://www.mail-archive.com/address@hidden/msg538383.html
> 
> github:
> https://github.com/zhangckid/qemu/tree/qemu-colo-18jun1
> 
> 
> Thanks
> 
> Zhang Chen

Attachment: primary-log.txt
Description: Text document

Attachment: colo-primary.sh
Description: application/shellscript

Attachment: colo-secondary.sh
Description: application/shellscript

