|
From: | Pádraig Brady |
Subject: | Re: [PATCH] cksum: Implement Chorba algorithm in PCLMUL |
Date: | Wed, 25 Dec 2024 18:57:17 +0000 |
User-agent: | Mozilla Thunderbird Beta |
On 25/12/2024 16:55, Sam Russell wrote:
Thanks for the results, looks like I'll need to get access to some older hardware and try some different combinations. There's a few things I can tune (loading all 8 values at the start vs loading one per fold, different BUFSIZE values), I'd be interested in finding a setup that definitely offers an improvement across the board. Did you test this with the first patch or the second patch? At a minimum cutting out the final table-based fold should be a consistent ~5% improvement on any platform.
It would be good to test chorba without also increasing the buffer size so we're comparing just the algorithms. We can tweak the buffer sizes after, though note ioblksize.h is currently set to 256KiB so it would be good to be <= that. cheers, Pádraig
[Prev in Thread] | Current Thread | [Next in Thread] |