[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[PATCH v4 00/14] vfio/migration: Implement VFIO migration protocol v2
From: |
Avihai Horon |
Subject: |
[PATCH v4 00/14] vfio/migration: Implement VFIO migration protocol v2 |
Date: |
Wed, 30 Nov 2022 11:44:00 +0200 |
Hello,
A long time has passed since v2 of this series was posted. During this
time we had several KVM calls discussing the problems that were needed
to be solved in order to move forward.
This version of the series includes quite some changes, and I believe
that it addresses all the major problems we have discussed.
Following VFIO migration protocol v2 acceptance in kernel, this series
implements VFIO migration according to the new v2 protocol and replaces
the now deprecated v1 implementation.
The main differences between v1 and v2 migration protocols are:
1. VFIO device state is represented as a finite state machine instead of
a bitmap.
2. The migration interface with kernel is done using VFIO_DEVICE_FEATURE
ioctl and normal read() and write() instead of the migration region
used in v1.
3. Migration protocol v2 currently doesn't support the pre-copy phase of
migration.
Full description of the v2 protocol and the differences from v1 can be
found here [1].
Patch list:
Patches 1-2 are patches taken from Juan's RFC [2].
As discussed in the KVM call, since we are going to add a new ioctl to
get device data size while it's RUNNING, we don't need the stop and
resume VM functionality from the RFC.
Patches 3-8 are prep patches fixing bugs, adding QEMUFile function
that will be used later and refactoring v1 protocol code to make it
easier to add v2 protocol.
Patches 9-12 implement v2 protocol and remove v1 protocol.
Patches 13-14 are preview patches (which are not for merging yet) that
demonstrate how the new ioctl to get device state size will work once
added.
Thanks.
Changes from v3 [3]:
- Rebased on latest master branch.
- Dropped patch #1 "migration: Remove res_compatible parameter" as
it's not mandatory to this series and needs some further discussion.
- Dropped patch #3 "migration: Block migration comment or code is
wrong" as it has been merged already.
- Addressed overlooked corner case reported by Vladimir in patch #4
"migration: Simplify migration_iteration_run()".
- Dropped patch #5 "vfio/migration: Fix wrong enum usage" as it has
been merged already.
- In patch #12 "vfio/migration: Implement VFIO migration protocol v2":
1. Changed vfio_save_pending() to update res_precopy_only instead of
res_postcopy_only (as VFIO migration doesn’t support postcopy).
2. Moved VFIOMigration->data_buffer allocation to vfio_save_setup()
and its de-allocation to vfio_save_cleanup(), so now it's
allocated when actually used (during migration and only on source
side).
- Addressed Alex's comments:
1. Eliminated code duplication in patch #7 "vfio/migration: Allow
migration without VFIO IOMMU dirty tracking support".
2. Removed redundant initialization of vfio_region_info in patch #10
"vfio/migration: Move migration v1 logic to vfio_migration_init()".
3. Added comment about VFIO_MIG_DATA_BUFFER_SIZE heuristic (and
renamed to VFIO_MIG_DEFAULT_DATA_BUFFER_SIZE).
4. Cast migration structs to their actual types instead of void *.
5. Return -errno and -EBADF instead of -1 in vfio_migration_set_state().
6. Set migration->device_state to new_state even in case of data_fd
out of sync. Although migration will be aborted, setting device
state succeeded so we should reflect that.
7. Renamed VFIO_MIG_PENDING_SIZE to VFIO_MIG_STOP_COPY_SIZE, set it
to 100G and added a comment about the size choice.
8. Changed vfio_save_block() to return -errno on error.
9. Squashed Patch #14 to patch #12.
10. Adjusted migration data buffer size according to MIG_DATA_SIZE
ioctl.
- In preview patch #17 "vfio/migration: Query device data size in
vfio_save_pending()" - changed vfio_save_pending() to report
VFIO_MIG_STOP_COPY_SIZE on any error.
- Added another preview patch "vfio/migration: Optimize
vfio_save_pending()".
- Added ret value on some traces as suggested by David.
- Added Reviewed-By tags.
Changes from v2 [4]:
- Rebased on top of latest master branch.
- Added relevant patches from Juan's RFC [2] with minor changes:
1. Added Reviewed-by tag to patch #3 in the RFC.
2. Adjusted patch #6 to work without patch #4 in the RFC.
- Added a new patch "vfio/migration: Fix wrong enum usage" that fixes a
small bug in v1 code. This patch has been sent a few weeks ago [5] but
wasn't taken yet.
- Patch #2 (vfio/migration: Skip pre-copy if dirty page tracking is not
supported):
1. Dropped this patch and replaced it with
"vfio/migration: Allow migration without VFIO IOMMU dirty tracking
support".
The new patch uses a different approach – instead of skipping
pre-copy phase completely, QEMU VFIO code will mark RAM dirty
(instead of kernel). This ensures that current migration behavior
is not changed and SLA is taken into account.
- Patch #4 (vfio/common: Change vfio_devices_all_running_and_saving()
logic to equivalent one):
1. Improved commit message to better explain the change.
- Patch #7 (vfio/migration: Implement VFIO migration protocol v2):
1. Enhanced vfio_migration_set_state() error reporting.
2. In vfio_save_complete_precopy() of v2 protocol - when changing
device state to STOP, set recover state to ERROR instead of STOP as
suggested by Joao.
3. Constify SaveVMHandlers of v2 protocol.
4. Modified trace_vfio_vmstate_change and
trace_vfio_migration_set_state
to print device state string instead of enum.
5. Replaced qemu_put_buffer_async() with qemu_put_buffer() in
vfio_save_block(), as requested by Juan.
6. Implemented v2 protocol version of vfio_save_pending() as requested
by Juan. Until ioctl to get device state size is added, we just
report some big hard coded value, as agreed in KVM call.
- Patch #9 (vfio/migration: Reset device if setting recover state
fails):
1. Enhanced error reporting.
2. Set VFIOMigration->device_state to RUNNING after device reset.
- Patch #11 (docs/devel: Align vfio-migration docs to VFIO migration
v2):
1. Adjusted vfio migration documentation to the added
vfio_save_pending()
- Added the last patch (which is not for merging yet) that demonstrates
how the new ioctl to get device state size will work once added.
Changes from v1 [6]:
- Split the big patch that replaced v1 with v2 into several patches as
suggested by Joao, to make review easier.
- Change warn_report to warn_report_once when container doesn't support
dirty tracking.
- Add Reviewed-by tag.
[1]
https://lore.kernel.org/all/20220224142024.147653-10-yishaih@nvidia.com/
[2]
https://lore.kernel.org/qemu-devel/20221003031600.20084-1-quintela@redhat.com/T/
[3]
20221103161620.13120-1-avihaih@nvidia.com/">https://lore.kernel.org/qemu-devel/20221103161620.13120-1-avihaih@nvidia.com/
[4]
https://lore.kernel.org/all/20220530170739.19072-1-avihaih@nvidia.com/
[5]
https://lore.kernel.org/all/20221016085752.32740-1-avihaih@nvidia.com/
[6]
https://lore.kernel.org/all/20220512154320.19697-1-avihaih@nvidia.com/
Avihai Horon (12):
vfio/migration: Fix NULL pointer dereference bug
vfio/migration: Allow migration without VFIO IOMMU dirty tracking
support
migration/qemu-file: Add qemu_file_get_to_fd()
vfio/common: Change vfio_devices_all_running_and_saving() logic to
equivalent one
vfio/migration: Move migration v1 logic to vfio_migration_init()
vfio/migration: Rename functions/structs related to v1 protocol
vfio/migration: Implement VFIO migration protocol v2
vfio/migration: Remove VFIO migration protocol v1
vfio: Alphabetize migration section of VFIO trace-events file
docs/devel: Align vfio-migration docs to VFIO migration v2
vfio/migration: Use VFIO_DEVICE_FEATURE_MIG_DATA_SIZE ioctl
vfio/migration: Optimize vfio_save_pending()
Juan Quintela (2):
migration: No save_live_pending() method uses the QEMUFile parameter
migration: Simplify migration_iteration_run()
docs/devel/vfio-migration.rst | 68 ++-
hw/s390x/s390-stattrib.c | 2 +-
hw/vfio/common.c | 119 +++---
hw/vfio/migration.c | 727 +++++++++++----------------------
hw/vfio/trace-events | 29 +-
include/hw/vfio/vfio-common.h | 10 +-
include/migration/register.h | 3 +-
linux-headers/linux/vfio.h | 13 +
migration/block-dirty-bitmap.c | 3 +-
migration/block.c | 2 +-
migration/migration.c | 29 +-
migration/qemu-file.c | 34 ++
migration/qemu-file.h | 1 +
migration/ram.c | 2 +-
migration/savevm.c | 7 +-
migration/savevm.h | 3 +-
16 files changed, 430 insertions(+), 622 deletions(-)
--
2.26.3
- [PATCH v4 00/14] vfio/migration: Implement VFIO migration protocol v2,
Avihai Horon <=
- [PATCH v4 01/14] migration: No save_live_pending() method uses the QEMUFile parameter, Avihai Horon, 2022/11/30
- [PATCH v4 02/14] migration: Simplify migration_iteration_run(), Avihai Horon, 2022/11/30
- [PATCH v4 03/14] vfio/migration: Fix NULL pointer dereference bug, Avihai Horon, 2022/11/30
- [PATCH v4 04/14] vfio/migration: Allow migration without VFIO IOMMU dirty tracking support, Avihai Horon, 2022/11/30
- [PATCH v4 05/14] migration/qemu-file: Add qemu_file_get_to_fd(), Avihai Horon, 2022/11/30
- [PATCH v4 06/14] vfio/common: Change vfio_devices_all_running_and_saving() logic to equivalent one, Avihai Horon, 2022/11/30
- [PATCH v4 07/14] vfio/migration: Move migration v1 logic to vfio_migration_init(), Avihai Horon, 2022/11/30
- [PATCH v4 08/14] vfio/migration: Rename functions/structs related to v1 protocol, Avihai Horon, 2022/11/30
- [PATCH v4 09/14] vfio/migration: Implement VFIO migration protocol v2, Avihai Horon, 2022/11/30
- [PATCH v4 10/14] vfio/migration: Remove VFIO migration protocol v1, Avihai Horon, 2022/11/30