Re: [PATCH 0/4] vhost-user-fs: Internal migration

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 0/4] vhost-user-fs: Internal migration

From:	Hanna Czenczek
Subject:	Re: [PATCH 0/4] vhost-user-fs: Internal migration
Date:	Mon, 8 May 2023 19:00:46 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.10.0

On 05.05.23 16:37, Hanna Czenczek wrote:

On 05.05.23 16:26, Eugenio Perez Martin wrote:

On Fri, May 5, 2023 at 11:51 AM Hanna Czenczek <hreitz@redhat.com>wrote:

(By the way, thanks for the explanations :))

On 05.05.23 11:03, Hanna Czenczek wrote:

On 04.05.23 23:14, Stefan Hajnoczi wrote:

[...]

I think it's better to change QEMU's vhost code
to leave stateful devices suspended (but not reset) across
vhost_dev_stop() -> vhost_dev_start(), maybe by introducing
vhost_dev_suspend() and vhost_dev_resume(). Have you thought about
this aspect?

Yes and no; I mean, I haven’t in detail, but I thought this is what’s
meant by suspending instead of resetting when the VM is stopped.

So, now looking at vhost_dev_stop(), one problem I can see is that
depending on the back-end, different operations it does will do
different things.

It tries to stop the whole device via vhost_ops->vhost_dev_start(),

which for vDPA will suspend the device, but for vhost-user willreset it

(if F_STATUS is there).

It disables all vrings, which doesn’t mean stopping, but may be
necessary, too.  (I haven’t yet really understood the use of disabled
vrings, I heard that virtio-net would have a need for it.)

It then also stops all vrings, though, so that’s OK.  And because this
will always do GET_VRING_BASE, this is actually always the same
regardless of transport.

Finally (for this purpose), it resets the device status via
vhost_ops->vhost_reset_status().  This is only implemented on vDPA, and
this is what resets the device there.


So vhost-user resets the device in .vhost_dev_start, but vDPA only does
so in .vhost_reset_status.  It would seem better to me if vhost-user
would also reset the device only in .vhost_reset_status, not in
.vhost_dev_start.  .vhost_dev_start seems precisely like the place to
run SUSPEND/RESUME.

I think the same. I just saw It's been proposed at [1].

Another question I have (but this is basically what I wrote in my last
email) is why we even call .vhost_reset_status here.  If the device
and/or all of the vrings are already stopped, why do we need to reset
it?  Naïvely, I had assumed we only really need to reset the device if

the guest changes, so that a new guest driver sees a freshlyinitialized

device.

I don't know why we didn't need to call it :). I'm assuming the
previous vhost-user net did fine resetting vq indexes, using
VHOST_USER_SET_VRING_BASE. But I don't know about more complex
devices.

The guest can reset the device, or write 0 to the PCI config status,
at any time. How does virtiofs handle it, being stateful?

Honestly a good question because virtiofsd implements neitherSET_STATUS nor RESET_DEVICE. I’ll have to investigate that.

I think when the guest resets the device, SET_VRING_BASE always comesalong some way or another, so that’s how the vrings are reset. Maybethe internal state is reset only following more high-level FUSEcommands like INIT.


So a meeting and one session of looking-into-the-code later:

We reset every virt queue on GET_VRING_BASE, which is wrong, but happensto serve the purpose. (German is currently on that.)

In our meeting, German said the reset would occur when the memoryregions are changed, but I can’t see that in the code. I think it onlyhappens implicitly through the SET_VRING_BASE call, which resets theinternal avail/used pointers.

[This doesn’t seem different from libvhost-user, though, whichimplements neither SET_STATUS nor RESET_DEVICE, and which pretends toreset the device on RESET_OWNER, but really doesn’t (itsvu_reset_device_exec() function just disables all vrings, doesn’t resetor even stop them).]

Consequently, the internal state is never reset. It would be cleared ona FUSE Destroy message, but if you just force-reset the system, thestate remains into the next reboot. Not even FUSE Init clears it, whichseems weird. It happens to work because it’s still the same filesystem,so the existing state fits, but it kind of seems dangerous to keep e.g.files open. I don’t think it’s really exploitable because everythingstill goes through the guest kernel, but, well. We should clear thestate on Init, and probably also implement SET_STATUS and clear thestate there.


Hanna

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/04
- Re: [PATCH 0/4] vhost-user-fs: Internal migration, Stefan Hajnoczi, 2023/05/04
  - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Eugenio Perez Martin, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek <=
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Eugenio Perez Martin, 2023/05/08
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Eugenio Perez Martin, 2023/05/08
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/09
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Stefan Hajnoczi, 2023/05/09
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Eugenio Perez Martin, 2023/05/09
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Eugenio Perez Martin, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/05
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Stefan Hajnoczi, 2023/05/08
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Hanna Czenczek, 2023/05/09
    - Re: [PATCH 0/4] vhost-user-fs: Internal migration, Stefan Hajnoczi, 2023/05/09

Prev by Date: Re: [PATCH v2 00/12] simpletrace: refactor and general improvements
Next by Date: Re: [PATCH RESEND] vhost: fix possible wrap in SVQ descriptor ring
Previous by thread: Re: [PATCH 0/4] vhost-user-fs: Internal migration
Next by thread: Re: [PATCH 0/4] vhost-user-fs: Internal migration
Index(es):
- Date
- Thread