Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying

From:	Max Reitz
Subject:	Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file
Date:	Sat, 11 Oct 2014 11:44:20 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.1.0

Am 10.10.2014 um 13:50 schrieb Benoît Canet:

The Saturday 16 Aug 2014 à 20:54:16 (+0200), Max Reitz wrote :

When falling through to the underlying file in
bdrv_co_get_block_status(), do not let the number of sectors for which
information could be obtained be overwritten.

Signed-off-by: Max Reitz <address@hidden>
---
  block.c | 6 ++++--
  1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/block.c b/block.c
index 3e252a2..c922664 100644
--- a/block.c
+++ b/block.c
@@ -3991,9 +3991,11 @@ static int64_t coroutine_fn 
bdrv_co_get_block_status(BlockDriverState *bs,
      if (bs->file &&
          (ret & BDRV_BLOCK_DATA) && !(ret & BDRV_BLOCK_ZERO) &&
          (ret & BDRV_BLOCK_OFFSET_VALID)) {
+        int backing_pnum;
+
          ret2 = bdrv_co_get_block_status(bs->file, ret >> BDRV_SECTOR_BITS,
-                                        *pnum, pnum);
-        if (ret2 >= 0) {
+                                        *pnum, &backing_pnum);
+        if (ret2 >= 0 && backing_pnum >= *pnum) {

About backing_pnum >= *pnum.

The documentation of bdrv_co_get_block_status says:

  * 'nb_sectors' is the max value 'pnum' should be set to.  If nb_sectors goes
  * beyond the end of the disk image it will be clamped.
  */
static int64_t coroutine_fn bdrv_co_get_block_status(BlockDriverState *bs,
                                                      int64_t sector_num,
                                                      int nb_sectors, int *pnum)

So clearly after the bdrv_co_get_block_status *pnum >= backing_pnum.

This means that  backing_pnum > *pnum will never happen.

I think either this test is wrong or the doc is wrong.


Thank you for confusing me, I had to think quite a while about this. *g*

The condition is not for error checking. If it was, it would be thewrong order (the condition should be true on success, that's why it's"ret2 >= 0" and not "ret2 < 0", so it should then be "backing_pnum <=*pnum"). So what this is testing is whether all sectors in theunderlying file in the queried range are read as zero. But if"backing_pnum < *pnum" that is not the case, some clusters are not zero.So we may not set the zero flag if backing_pnum < *pnum; or as it readsin the code, we may only set it if backing_pnum >= *pnum. This is notabout whether *pnum > backing_pnum, but more about whether backing_pnum== *pnum (but >= would be fine, too, if bdrv_co_get_block_status()supported it, so that's why I wrote it that way).

However, I'm starting to think about whether it would be better, for thebacking_pnum < *pnum case, not to not set the zero flag, but rathersimply set *pnum = backing_pnum. And this in turn would be prettyequivalent to just omitting this patch, because:

If we get to this point where we query the underlying file and itreturns a certain number of sectors is zero; then we therefore want toset *pnum = backing_pnum (both if backing_pnum < *pnum and ifbacking_pnum == *pnum; backing_pnum > *pnum cannot happen, as youpointed out). On the other hand, if the sectors are not reported to bezero, but backing_pnum < *pnum, we want to shorten *pnum accordingly aswell because this may indicate that after another backing_pnum sectors,we arrive at a hole in the file.

There is only one point I can imagine where it makes sense not to letbacking_pnum overwrite *pnum: And that's if bdrv_co_get_block_status()reported BDRV_BLOCK_DATA | BDRV_BLOCK_OFFSET_VALID with an offset beyondthe EOF. I think this might actually happen with qcow2, if one clustersimply lies beyond the EOF (which is perfectly valid). So I concludethat this patch has its use after all but needs to be modified so thatbacking_pnum always overwrites *pnum; except for when backing_pnum iszero (which should only happen at or after the EOF) in which case thezero flag should be set and *pnum should be left as it was.

And now in all honesty: Thanks for confusing me, I guess I can thinkbetter when I'm confused. :-)

Max

Best regards

Benoît

              /* Ignore errors.  This is just providing extra information, it
               * is useful but not necessary.
               */
--
2.0.4

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file, Eric Blake, 2014/10/08
- Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file, Benoît Canet, 2014/10/10
  - Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file, Max Reitz <=
    - Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file, Benoît Canet, 2014/10/11

Prev by Date: Re: [Qemu-devel] how to dynamically add a block device using qmp?
Next by Date: Re: [Qemu-devel] [PATCH 2/3] qemu-io: Respect early image end for map
Previous by thread: Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file
Next by thread: Re: [Qemu-devel] [PATCH 1/3] block: Ignore allocation size in underlying file
Index(es):
- Date
- Thread