[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH] block/nvme: Fix VFIO_MAP_DMA failed: No space left on device
From: |
Philippe Mathieu-Daudé |
Subject: |
Re: [PATCH] block/nvme: Fix VFIO_MAP_DMA failed: No space left on device |
Date: |
Thu, 17 Jun 2021 17:48:46 +0200 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.10.1 |
On 6/17/21 2:40 PM, Maxim Levitsky wrote:
> On Mon, 2021-06-14 at 18:03 +0200, Philippe Mathieu-Daudé wrote:
>> On 6/11/21 1:46 PM, Philippe Mathieu-Daudé wrote:
>>> When the NVMe block driver was introduced (see commit bdd6a90a9e5,
>>> January 2018), Linux VFIO_IOMMU_MAP_DMA ioctl was only returning
>>> -ENOMEM in case of error. The driver was correctly handling the
>>> error path to recycle its volatile IOVA mappings.
>>>
>>> To fix CVE-2019-3882, Linux commit 492855939bdb ("vfio/type1: Limit
>>> DMA mappings per container", April 2019) added the -ENOSPC error to
>>> signal the user exhausted the DMA mappings available for a container.
>>
>> Hmm this commit has been added before v5.1-rc4.
>>
>> So while this fixes the behavior of v5.1-rc4+ kernels,
>> older kernels using this fix will have the same problem...
>
>
> Hi!
>
> I wonder why not to check for both -ENOMEM and -ENOSPC
> and recycle the mappings in both cases?
>
> I think that would work on both old and new kernels.
>
> What do you think?
Yes, worst case we retry one more time for nothing.
Alex suggested to use VFIO_IOMMU_GET_INFO to get the limit
with VFIO_IOMMU_TYPE1_INFO_DMA_AVAIL, and if not available
the driver should use a reasonable value to limit itself.
I'll try to get it quick, otherwise fall back to your "dual
errno" case.
Thanks both for the ideas,
Phil.
>>> diff --git a/block/nvme.c b/block/nvme.c
>>> index 2b5421e7aa6..12f9dd5cce3 100644
>>> --- a/block/nvme.c
>>> +++ b/block/nvme.c
>>> @@ -1030,7 +1030,7 @@ try_map:
>>> r = qemu_vfio_dma_map(s->vfio,
>>> qiov->iov[i].iov_base,
>>> len, true, &iova);
>>> - if (r == -ENOMEM && retry) {
>>> + if (r == -ENOSPC && retry) {
>>> retry = false;
>>> trace_nvme_dma_flush_queue_wait(s);
>>> if (s->dma_map_count) {
>>>