qemu-block

From: Dietmar Maurer
Subject: Re: backup_calculate_cluster_size does not consider source
Date: Wed, 6 Nov 2019 14:09:37 +0100 (CET)

> Let me elaborate: Yes, a cluster size generally means that it is most
> “efficient” to access the storage at that size.  But there’s a tradeoff.
>  At some point, reading the data takes sufficiently long that reading a
> bit of metadata doesn’t matter anymore (usually, that is).

Any network storage suffers from long network latencies, so it always
matters if you issue more I/Os than necessary.
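
Something along these lines would already help. This is an untested
sketch in plain C; the 64 KiB default and the function shape are my
assumptions, not the actual block/backup.c code:

#include <inttypes.h>
#include <stdio.h>

/* Assumed to mirror QEMU's default backup cluster size (64 KiB). */
#define BACKUP_CLUSTER_SIZE_DEFAULT (64 * 1024)

static int64_t calculate_cluster_size(int64_t source_cluster_size,
                                      int64_t target_cluster_size)
{
    int64_t cluster_size = BACKUP_CLUSTER_SIZE_DEFAULT;

    if (target_cluster_size > cluster_size) {
        /* Honor the target cluster size to avoid read-modify-write
         * cycles on the target. */
        cluster_size = target_cluster_size;
    }
    if (source_cluster_size > cluster_size) {
        /* Also honor the source cluster size, so the backup loop
         * issues one read per source cluster instead of many small
         * network requests. */
        cluster_size = source_cluster_size;
    }
    return cluster_size;
}

int main(void)
{
    /* rbd's default 4 MiB object size vs. a 64 KiB qcow2 target. */
    printf("%" PRId64 "\n",
           calculate_cluster_size(4 * 1024 * 1024, 64 * 1024));
    return 0;
}

With rbd's default 4 MiB object size this would read each object
exactly once instead of 64 times at 64 KiB.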

> There is a bit of a problem with making the backup copy size rather
> large, and that is the fact that backup’s copy-before-write causes guest
> writes to stall. So if the guest just writes a bit of data, a 4 MB
> buffer size may mean that in the background it will have to wait for 4
> MB of data to be copied.[1]

We have been using this in production for several years now, and it is
not a problem. (Our Ceph storage mostly runs on 10G or faster network
equipment.)
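
For rough scale (back-of-envelope, not a measurement): 4 MiB is about
33.6 Mbit, so at a 10 Gbit/s line rate copying one cluster costs
roughly 33.6 / 10,000 s, i.e. about 3.4 ms per direction. Even the
worst case, a copy-before-write triggered by a tiny guest write, adds
only a few milliseconds of latency.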

> Hm.  OTOH, we have the same problem already with the target’s cluster
> size, which can of course be 4 MB as well.  But I can imagine it to
> actually be important for the target, because otherwise there might be
> read-modify-write cycles.
> 
> But for the source, I still don’t quite understand why rbd has such a
> problem with small read requests.  I don’t doubt that it has (as you
> explained), but again, how is it then even possible to use rbd as the
> backend for a guest that has no idea of this requirement?  Does Linux
> really prefill the page cache with 4 MB of data for each read?

No idea. I have just observed that upstream QEMU backups with Ceph are
quite unusable this way.



