Re: [PATCH v3] blockcommit: Reopen base image as RO after abort

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v3] blockcommit: Reopen base image as RO after abort

From:	Alexander Ivanov
Subject:	Re: [PATCH v3] blockcommit: Reopen base image as RO after abort
Date:	Fri, 15 Mar 2024 10:55:00 +0100
User-agent:	Mozilla Thunderbird



On 2/28/24 17:48, Vladimir Sementsov-Ogievskiy wrote:

On 09.02.24 15:29, Alexander Ivanov wrote:
Could you please review the patch?
Sorry for long delay.
Honestly, I don't like refcnt in block-driver. It violateincapsulation, refcnt is interal thing of common block layer. Andactually, you can't make any assumptions from value of refcnt, as youdon't know which additional parents were created and why, and whenthey are going unref their children.

Hmmm... Maybe I can just exclude refcnt check from the condition, can'tI. If BDS will be removed it doesn't matter if we make it RO. What doyou think?


What was wrong with v2?

My bad, it seems, I didn't send v2 before I decided to change the patch.


On 1/30/24 10:14, Alexander Ivanov wrote:

If a blockcommit is aborted the base image remains in RW mode, thatleads

to a fail of subsequent live migration.

How to reproduce:
   $ virsh snapshot-create-as vm snp1 --disk-only

   *** write something to the disk inside the guest ***

$ virsh blockcommit vm vda --active --shallow && virsh blockjobvm vda --abort

   $ lsof /vzt/vm.qcow2
   COMMAND      PID USER   FD   TYPE DEVICE   SIZE/OFF NODE NAME

qemu-syst 433203 root 45u REG 253,0 1724776448 133/vzt/vm.qcow2

   $ cat /proc/433203/fdinfo/45
   pos:    0
   flags:  02140002 <==== The last 2 means RW mode

If the base image is in RW mode at the end of blockcommit and was in RO

mode before blockcommit, check if src BDS has refcnt > 1. If so, theBDSwill not be removed after blockcommit, and we should make the baseimage

RO. Otherwise check recursively if there is a parent BDS of src BDS and
reopen the base BDS in RO in this case.

Signed-off-by: Alexander Ivanov <alexander.ivanov@virtuozzo.com>
---
  block/mirror.c | 38 ++++++++++++++++++++++++++++++++++++--
  1 file changed, 36 insertions(+), 2 deletions(-)

diff --git a/block/mirror.c b/block/mirror.c
index 5145eb53e1..52a7fee75e 100644
--- a/block/mirror.c
+++ b/block/mirror.c
@@ -93,6 +93,7 @@ typedef struct MirrorBlockJob {
      int64_t active_write_bytes_in_flight;
      bool prepared;
      bool in_drain;
+    bool base_ro;
  } MirrorBlockJob;
  typedef struct MirrorBDSOpaque {

@@ -652,6 +653,32 @@ static void coroutine_fnmirror_wait_for_all_io(MirrorBlockJob *s)

      }
  }
+/*
+ * Check recursively if there is a parent BDS referenced more than
+ * min_refcnt times. This argument is needed because at the first
+ * call there is a bds referenced in blockcommit.
+ */
+static bool bdrv_chain_has_significant_parent(BlockDriverState *bs)
+{
+    BdrvChild *parent;
+    BlockDriverState *parent_bs;
+
+    QLIST_FOREACH(parent, &bs->parents, next) {
+        if (!(parent->klass->parent_is_bds)) {
+            continue;
+        }
+        parent_bs = parent->opaque;
+        if (parent_bs->drv && !parent_bs->drv->is_filter) {
+            return true;
+        }
+        if (bdrv_chain_has_significant_parent(parent_bs)) {
+            return true;
+        }
+    }
+
+    return false;
+}
+
  /**
   * mirror_exit_common: handle both abort() and prepare() cases.
   * for .prepare, returns 0 on success and -errno on failure.
@@ -793,6 +820,11 @@ static int mirror_exit_common(Job *job)
      bdrv_drained_end(target_bs);
      bdrv_unref(target_bs);
+    if (s->base_ro && !bdrv_is_read_only(target_bs) &&
+        (src->refcnt > 1 || bdrv_chain_has_significant_parent(src))) {
+        bdrv_reopen_set_read_only(target_bs, true, NULL);
+    }
+
      bs_opaque->job = NULL;
      bdrv_drained_end(src);
@@ -1715,6 +1747,7 @@ static BlockJob *mirror_start_job(

bool is_none_mode, BlockDriverState*base, bool auto_complete, const char*filter_node_name, bool is_mirror, MirrorCopyModecopy_mode,

+                             bool base_ro,
                               Error **errp)
  {
      MirrorBlockJob *s;
@@ -1798,6 +1831,7 @@ static BlockJob *mirror_start_job(
      bdrv_unref(mirror_top_bs);
      s->mirror_top_bs = mirror_top_bs;
+    s->base_ro = base_ro;

/* No resize for the target either; while the mirror is stillrunning, a * consistent read isn't necessarily possible. We couldpossibly allow@@ -2027,7 +2061,7 @@ void mirror_start(const char *job_id,BlockDriverState *bs, speed, granularity, buf_size, backing_mode,zero_target, on_source_error, on_target_error, unmap,NULL, NULL,

                       &mirror_job_driver, is_none_mode, base, false,
-                     filter_node_name, true, copy_mode, errp);
+                     filter_node_name, true, copy_mode, false, errp);
  }

BlockJob *commit_active_start(const char *job_id, BlockDriverState*bs,@@ -2056,7 +2090,7 @@ BlockJob *commit_active_start(const char*job_id, BlockDriverState *bs,

                       on_error, on_error, true, cb, opaque,

&commit_active_job_driver, false, base,auto_complete, filter_node_name, false,MIRROR_COPY_MODE_BACKGROUND,

-                     errp);
+                     base_read_only, errp);
      if (!job) {
          goto error_restore_flags;
      }


--
Best regards,
Alexander Ivanov

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [PATCH v3] blockcommit: Reopen base image as RO after abort, Alexander Ivanov <=
- Re: [PATCH v3] blockcommit: Reopen base image as RO after abort, Vladimir Sementsov-Ogievskiy, 2024/03/15

Prev by Date: Another CXL/MMIO tcg tlb corner case
Next by Date: Re: [PATCH v4 10/25] migration: Add Error** argument to qemu_savevm_state_setup()
Previous by thread: Another CXL/MMIO tcg tlb corner case
Next by thread: Re: [PATCH v3] blockcommit: Reopen base image as RO after abort
Index(es):
- Date
- Thread