[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [PATCH 2/2] memory: Split zones when do coalesced_io_de
From: |
Paolo Bonzini |
Subject: |
Re: [Qemu-devel] [PATCH 2/2] memory: Split zones when do coalesced_io_del() |
Date: |
Mon, 19 Aug 2019 16:24:08 +0200 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 |
On 17/08/19 11:32, Peter Xu wrote:
> It is a workaround of current KVM's KVM_UNREGISTER_COALESCED_MMIO
> interface. The kernel interface only allows to unregister an mmio
> device with exactly the zone size when registered, or any smaller zone
> that is included in the device mmio zone. It does not support the
> userspace to specify a very large zone to remove all the small mmio
> devices within the zone covered.
>
> Logically speaking it would be nicer to fix this from KVM side, though
> in all cases we still need to coop with old kernels so let's do this.
>
> This patch has nothing to do with 3ac7d43a6fbb5d4a3 because this is
> probably broken from the very beginning when the
> KVM_UNREGISTER_COALESCED_MMIO interface is introduced in kernel.
> However to make the backport to stables easier, I'm still using the
> commit 3ac7d43a6fbb5d4a3 to track this problem because this will
> depend on that otherwise even additions of mmio devices won't work.
>
> Fixes: 3ac7d43a6fbb5d4a3
> Signed-off-by: Peter Xu <address@hidden>
> ---
> memory.c | 30 +++++++++++++++++++++++++++---
> 1 file changed, 27 insertions(+), 3 deletions(-)
This is still messy because memory_region_add_coalescing and
memory_region_clear_coalescing modify fr->mr->colesced.
It's not hard to fix it, but not trivial either. Probably it is
sufficient to replace memory_region_update_coalesced_range and
memory_region_update_coalesced_range_as with two pairs:
- memory_region_add_coalesced_range and
memory_region_add_coalesced_range_as, which call a new function
flat_range_coalesced_io_add_one to call the listener only on the
newly-added range (and set coalesced_mmio_add_done).
memory_region_add_coalescing then can call
memory_region_add_coalesced_range_as
- memory_region_clear_coalesced_ranges and
memory_region_clear_coalesced_ranges_as, which call
flat_range_coalesced_io_del. Now memory_region_clear_coalescing can
call memory_region_clear_coalesced_ranges *before* emptying the list, or
exit immediately if it is empty.
Thanks,
Paolo
> diff --git a/memory.c b/memory.c
> index 1a2b465a96..b24cdd13cf 100644
> --- a/memory.c
> +++ b/memory.c
> @@ -864,6 +864,9 @@ static void address_space_update_ioeventfds(AddressSpace
> *as)
>
> static void flat_range_coalesced_io_del(FlatRange *fr, AddressSpace *as)
> {
> + CoalescedMemoryRange *cmr;
> + AddrRange tmp;
> +
> if (QTAILQ_EMPTY(&fr->mr->coalesced)) {
> return;
> }
> @@ -874,9 +877,30 @@ static void flat_range_coalesced_io_del(FlatRange *fr,
> AddressSpace *as)
>
> fr->coalesced_mmio_del_done = true;
>
> - MEMORY_LISTENER_UPDATE_REGION(fr, as, Reverse, coalesced_io_del,
> - int128_get64(fr->addr.start),
> - int128_get64(fr->addr.size));
> + /*
> + * We split the big region into smaller ones to satisfy KVM's
> + * KVM_UNREGISTER_COALESCED_MMIO interface, where it does not
> + * allow to specify a large region to unregister all the devices
> + * under that zone instead it only accepts exact zones or even a
> + * smaller zone of previously registered mmio device. Logically
> + * speaking we should better fix KVM to allow the userspace to
> + * unregister multiple mmio devices within a large requested zone,
> + * but in all cases we'll still need to live with old kernels. So
> + * let's simply break the zones into exactly the small pieces when
> + * we do coalesced_io_add().
> + */
> + QTAILQ_FOREACH(cmr, &fr->mr->coalesced, link) {
> + tmp = addrrange_shift(cmr->addr,
> + int128_sub(fr->addr.start,
> +
> int128_make64(fr->offset_in_region)));
> + if (!addrrange_intersects(tmp, fr->addr)) {
> + continue;
> + }
> + tmp = addrrange_intersection(tmp, fr->addr);
> + MEMORY_LISTENER_UPDATE_REGION(fr, as, Reverse, coalesced_io_del,
> + int128_get64(tmp.start),
> + int128_get64(tmp.size));
> + }
> }
>
> static void flat_range_coalesced_io_add(FlatRange *fr, AddressSpace *as)
>