[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the
From: |
Yishai Hadas |
Subject: |
RE: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped |
Date: |
Wed, 19 Oct 2022 15:51:05 +0000 |
> From: Qemu-devel <qemu-devel-
> bounces+yishaih=nvidia.com@nongnu.org> On Behalf Of Jason Gunthorpe
> Sent: Tuesday, 18 October 2022 15:23
> To: Joao Martins <joao.m.martins@oracle.com>
> Cc: quintela@redhat.com; Alex Williamson <alex.williamson@redhat.com>;
> Eric Blake <eblake@redhat.com>; Stefan Hajnoczi <stefanha@redhat.com>;
> Fam Zheng <fam@euphon.net>; qemu-s390x@nongnu.org; Cornelia Huck
> <cohuck@redhat.com>; Thomas Huth <thuth@redhat.com>; Vladimir
> Sementsov-Ogievskiy <vsementsov@yandex-team.ru>; Laurent Vivier
> <lvivier@redhat.com>; John Snow <jsnow@redhat.com>; Dr. David Alan
> Gilbert <dgilbert@redhat.com>; Christian Borntraeger
> <borntraeger@linux.ibm.com>; Halil Pasic <pasic@linux.ibm.com>; Paolo
> Bonzini <pbonzini@redhat.com>; qemu-block@nongnu.org; Eric Farman
> <farman@linux.ibm.com>; Richard Henderson
> <richard.henderson@linaro.org>; David Hildenbrand <david@redhat.com>;
> Avihai Horon <avihaih@nvidia.com>; qemu-devel@nongnu.org
> Subject: Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact()
> with the guest stopped
>
> On Fri, Oct 14, 2022 at 01:29:51PM +0100, Joao Martins wrote:
> > On 14/10/2022 12:28, Juan Quintela wrote:
> > > Joao Martins <joao.m.martins@oracle.com> wrote:
> > >> On 13/10/2022 17:08, Juan Quintela wrote:
> > >>> Oops. My understanding was that once the guest is stopped you can
> > >>> say how big is it.
> > >
> > > Hi
> > >
> > >> It's worth keeping in mind that conceptually a VF won't stop (e.g.
> > >> DMA) until really told through VFIO. So, stopping CPUs (guest) just
> > >> means that guest code does not arm the VF for more I/O but still is
> > >> a weak guarantee as VF still has outstanding I/O to deal with until
> > >> VFIO tells it to quiesce DMA (for devices that support it).
> > >
> > > How can we make sure about that?
> > >
> > > i.e. I know I have a vfio device. I need two things:
> > > - in the iterative stage, I eed to check the size, but a estimate is ok.
> > > for example with RAM, we use whatever is the size of the dirty bitmap
> > > at that moment.
> > > If the estimated size is smaller than the theshold, we
> > > * stop the guest
> > > * sync dirty bitmap
> > > * now we test the (real) dirty bitmap size
> > >
> > > How can we do something like that with a vfio device.
> > >
> > You would have an extra intermediate step that stops the VF prior to
> > asking the device state length. What I am not sure is whether stopping
> > vCPUs can be skipped as an optimization.
>
> It cannot, if you want to stop the VFIO device you must also stop the vCPUs
> because the device is not required to respond properly to MMIO operations
> when stopped.
>
> > > My understanding from NVidia folks was that newer firmware have an
> > > ioctl to return than information.
> >
> > Maybe there's something new. I was thinking from this here:
>
> Juan is talking about the ioctl we had in the pre-copy series.
>
> I expect it to come into some different form to support this RFC.
>
Do we really need to STOP the VM to get the exact data length that will be
required to complete stop copy ?
Can't we simply go with some close estimation when the device is running and
drop all the complexity in QEMU/Kernel to STOP and then RE-START the VM if the
threshold didn't meet, etc.?
Yishai
- [RFC 2/7] migration: No save_live_pending() method uses the QEMUFile parameter, (continued)
- [RFC 2/7] migration: No save_live_pending() method uses the QEMUFile parameter, Juan Quintela, 2022/10/02
- [RFC 5/7] migration: Remove unused threshold_size parameter, Juan Quintela, 2022/10/02
- [RFC 6/7] migration: simplify migration_iteration_run(), Juan Quintela, 2022/10/02
- [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Juan Quintela, 2022/10/02
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Joao Martins, 2022/10/13
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Juan Quintela, 2022/10/13
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Joao Martins, 2022/10/14
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Juan Quintela, 2022/10/14
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Joao Martins, 2022/10/14
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Jason Gunthorpe, 2022/10/18
- RE: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped,
Yishai Hadas <=
- Re: [RFC 7/7] migration: call qemu_savevm_state_pending_exact() with the guest stopped, Jason Gunthorpe, 2022/10/15
RE: [RFC 0/7] migration patches for VFIO, Yishai Hadas, 2022/10/20