On 7/26/20 7:28 PM, pukkamustard wrote:
Hello Christian,
Thank you for your comments!
For my taste, the block size is much too small. I understand 4k can make sense for page tables and SATA, but looking at benchmarks, 4k is still too small to maximize SATA throughput. I would also worry about 4k as a request size in any database or network protocol. The overheads per request are still too big for modern hardware. You could easily go to 8k, which could be justified with 9k jumbo frames for Ethernet and would at least also utilize all of the bits in your paths. The 32k of ECRS are close to the 64k which are reportedly the optimum for modern M.2 media. IIRC torrents even use 256k.
I agree that increasing the block size makes sense for improving performance in storage and transport.
The overhead from padding may be large for very small files if you go beyond 4k, but you should also think in terms of absolute overhead: even a 3100% overhead doesn't change the fact that the absolute overhead is tiny for a 1k file.
The use-case I have in mind for ERIS is very small pieces of data (not even small files). Examples include ActivityStreams objects or OpenStreetMap nodes.
Ah, that's a different use case than file-sharing, so different trade-offs certainly apply here.
Apparently the average size of individual ActivityStreams objects is less than 1kB (unfortunately I don't have the data to back this up). I agree that the overhead of 3100% for a single 1kB object is acceptable. But I would argue that an overhead of 3100% for very many 1kB objects is not. The difference might be a 32 GB database instead of a 1 GB database.
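To make the numbers concrete, here is a quick back-of-the-envelope sketch in Python, assuming each object fits into exactly one content block and ignoring the internal (non-data) blocks of the tree:

    # Padding overhead for one ~1 kB object stored in a single block.
    object_size = 1024                      # one small object, ~1 kB
    for block_size in (4096, 32768):
        overhead = (block_size - object_size) / object_size
        print(f"{block_size // 1024:2d} kB blocks: {overhead:.0%} overhead per object")

    # Scaled up: 1 million such objects are ~1 GB of raw data,
    # but ~4 GB as 4 kB blocks and ~32 GB as 32 kB blocks.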
Sure, the only question is whether it might not make sense in this case to combine the tiny objects into larger ones, like merging all OSM nodes in a region into one larger download. But of course, it again depends on the use case you are shooting for.
Furthermore, you should consider a trick we use in GNUnet-FS, which is that we share *directories*, and for small files we simply _inline_ the full file data in the metadata of the file that is stored with the directory or search result. So you can basically avoid ever having to download tiny files as separate entities; for files <32k we have zero overhead this way.
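A minimal sketch of that inlining idea (the field names and structure here are illustrative only, not the actual GNUnet-FS metadata format):

    # Illustrative sketch of inlining small files in directory metadata;
    # the fields are hypothetical, not the GNUnet-FS wire format.
    INLINE_THRESHOLD = 32 * 1024  # files below this are embedded directly

    def directory_entry(filename, data, encode_and_publish):
        if len(data) < INLINE_THRESHOLD:
            # Small file: embed the bytes in the directory entry itself,
            # so fetching the directory already yields the file contents.
            return {"name": filename, "inline_data": data}
        # Large file: publish it separately and keep only the reference.
        return {"name": filename, "reference": encode_and_publish(data)}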
That makes a lot of sense.
But packing multiple objects into a single transport packet, or grouping them for storage on disk/in a database, works for small block sizes as well. The optimization just happens at a "different layer".

The key value I see in having small block sizes is that tiny pieces of data can be individually referenced and used (securely).
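For illustration, the grouping described above might look roughly like this at the storage layer (a sketch, not an existing API): many fixed-size encrypted blocks go into one record or message, while each block remains individually addressable by its hash.

    import hashlib

    BLOCK_SIZE = 4096  # small, individually referenceable blocks

    def pack_blocks(blocks):
        """Group many 4 kB blocks into one storage/transport record,
        returning the packed record plus the per-block references."""
        assert all(len(b) == BLOCK_SIZE for b in blocks)
        refs = [hashlib.sha256(b).digest() for b in blocks]  # example hash
        return b"".join(blocks), refs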
Sure, if that's your only use case, 4k could make sense.
I'd be curious to see how much the two-pass encoding costs in practice -- it might be less expensive than ECRS if you are lucky (hashing one big block being cheaper than many small hash operations), or much more expensive if you are unlucky (have to actually read the data twice from disk). I am not sure that it is worth it merely to reduce the number of hashes/keys in the non-data blocks. Would be good to have some data on this, for various file sizes and platforms (to judge IO/RAM caching effects). As I said, I can't tell for sure if the 2nd pass is virtually free or quite expensive -- and that is an important detail. Especially with a larger block size, the overhead of an extra key in the non-data blocks could be quite acceptable.
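One way to gather such data would be a micro-benchmark along these lines (a sketch; BLAKE2b is only a stand-in for whichever hash the encoding actually uses, and since the test data lives in RAM this only captures the CPU side, not the "read the data twice from disk" case):

    import hashlib, os, time

    data = os.urandom(64 * 1024 * 1024)   # 64 MB of test data, held in RAM

    t0 = time.perf_counter()
    single = hashlib.blake2b(data).digest()                  # one extra full pass
    t1 = time.perf_counter()
    per_block = [hashlib.blake2b(data[i:i + 4096]).digest()  # one hash per 4 kB block
                 for i in range(0, len(data), 4096)]
    t2 = time.perf_counter()

    print(f"single full pass: {t1 - t0:.3f} s")
    print(f"per-block hashes: {t2 - t1:.3f} s")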
I think the two-pass encoding in ERIS is quite expensive. Considering that the hash of the individual blocks also needs to be computed (as the reference in parent nodes), I think ECRS will always win performance-wise.
Maybe the answer is not ECRS or ERIS but ECRS and ERIS: ECRS for large pieces of data, where it makes more sense to have a large block size and single-pass encoding, and ERIS for (very many) small pieces of data, where a 3100% overhead is too much but the performance penalty is acceptable and the size of the data is much smaller than memory.
There might be some heuristic that says: if data is larger than 2MB use ECRS, else use ERIS and you get the verification capability.
If using ECRS, you can add the verification capability by encoding the list of all the hash references to the ECRS blocks with ERIS. The ERIS read capability of this list of ECRS blocks is enough to verify the integrity of the original ECRS-encoded content (without revealing the content).
What do you think?
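A rough sketch of that combination (hypothetical helper functions passed in as parameters; ecrs_encode is assumed to return a URI plus the list of block hashes, eris_encode a read capability):

    SMALL_DATA_LIMIT = 2 * 1024 * 1024   # the 2MB heuristic from above

    def encode(data, ecrs_encode, eris_encode):
        if len(data) <= SMALL_DATA_LIMIT:
            # Small data: ERIS directly; the read capability also
            # serves as the verification capability.
            return {"eris": eris_encode(data)}
        # Large data: single-pass ECRS for the content itself ...
        ecrs_uri, block_hashes = ecrs_encode(data)
        # ... plus an ERIS encoding of the list of ECRS block hashes.
        # Sharing this small read capability lets others verify the
        # integrity of the ECRS blocks without revealing the content.
        verification_cap = eris_encode(b"".join(block_hashes))
        return {"ecrs": ecrs_uri, "verify": verification_cap}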
I don't know how important the verification capability is in practice, or how much the block size trade-offs are relevant (vs. grouping tiny objects into larger ones). If we can avoid proliferating encodings and find one that fits all important use cases, that would be ideal. I would not be _opposed_ to adopting ERIS in GNUnet (even considering the possible increase in encoding cost), _except_ for the tiny block size (which I know would be terrible for our use-case).
For 3.4 Namespaces, I would urge you to look at the GNU Name System (GNS). My plan is to (eventually, when I have way too much time and could actually re-do FS...) replace the SBLOCKS and KBLOCKS of ECRS with basically only GNS.
I have been looking into it. It does seem to be a perfect application of GNS.
The crypto is way above my head, and using readily available and already implemented primitives would make implementation much easier for me. But I understand the need for "non-standard" crypto and am following the ongoing discussions.
Great. Feel free to chime in or ask questions. Right now, we're hoping to find the time to update the draft based on the feedback already received, but of course constructive feedback is always welcome.
Cheers!
Christian