
Conversation

@danicuki (Contributor) commented Dec 4, 2025

No description provided.

@zdave-parity (Collaborator) left a comment


The changes look correct to me, just a few minor comments.

\newcommand{\spl}[1]{\text{split}_{#1}}

- The foundation of the data-availability and distribution system of \Jam is a systematic Reed-Solomon erasure coding function in \textsc{gf}($2^{16}$) of rate 342:1023, the same transform as done by the algorithm of \cite{lin2014novel}. We use a little-endian $\blob[2]$ form of the 16-bit \textsc{gf} points with a functional equivalence given by $\fnencode[2]$. From this we may assume the encoding function $\fnerasurecode: \sequence[342]{\blob[2]} \to \sequence[1023]{\blob[2]}$ and the recovery function $\fnecrecover: \protoset{\tuple{\blob[2], \Nmax{1023}}}_{342} \to \sequence[342]{\blob[2]}$. Encoding is done by extrapolating a data blob of size 684 octets (provided in $\fnerasurecode$ here as 342 octet pairs) into 1,023 octet pairs. Recovery is done by collecting together any distinct 342 octet pairs, together with their indices, and transforming this into the original sequence of 342 octet pairs.
+ The foundation of the data-availability and distribution system of \Jam is a systematic Reed-Solomon erasure coding function in \textsc{gf}($2^{16}$) of rate $\nicefrac{\Cecpiecesize}{2}$:$\Cvalcount$, the same transform as done by the algorithm of \cite{lin2014novel}. We use a little-endian $\blob[2]$ form of the 16-bit \textsc{gf} points with a functional equivalence given by $\fnencode[2]$. From this we may assume the encoding function $\fnerasurecode: \sequence[\nicefrac{\Cecpiecesize}{2}]{\blob[2]} \to \sequence[\Cvalcount]{\blob[2]}$ and the recovery function $\fnecrecover: \protoset{\tuple{\blob[2], \Nmax{\Cvalcount}}}_{\nicefrac{\Cecpiecesize}{2}} \to \sequence[\nicefrac{\Cecpiecesize}{2}]{\blob[2]}$. Encoding is done by extrapolating a data blob of size $\Cecpiecesize$ octets (provided in $\fnerasurecode$ here as $\nicefrac{\Cecpiecesize}{2}$ octet pairs) into $\Cvalcount$ octet pairs. Recovery is done by collecting together any distinct $\nicefrac{\Cecpiecesize}{2}$ octet pairs, together with their indices, and transforming this into the original sequence of $\nicefrac{\Cecpiecesize}{2}$ octet pairs.
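As a reader aid (not part of the diff), here is a minimal Rust sketch of the shapes involved, using hypothetical constant names that mirror $\Cecpiecesize = 684$ and $\Cvalcount = 1023$ from the hunk above. It implements only the trivial systematic case of recovery; the general case needs the full transform of \cite{lin2014novel}.

```rust
/// A 16-bit GF(2^16) point in little-endian two-octet form.
type Point = [u8; 2];

// Hypothetical names mirroring the graypaper constants:
// \Cecpiecesize = 684 (piece size in octets), \Cvalcount = 1023.
const PIECE_SIZE: usize = 684;
const ORIGINAL_SHARDS: usize = PIECE_SIZE / 2; // 342 octet pairs
const TOTAL_SHARDS: usize = 1023;              // one chunk per validator

/// Trivial-case recovery: because the code is systematic, output points
/// 0..342 are the input data itself (the remaining 681 are parity), so
/// if the received (point, validator index) pairs cover indices 0..342
/// the piece reassembles with no field arithmetic at all. Returns None
/// when parity chunks would be needed, i.e. when the general decoder
/// must take over.
fn recover_systematic(chunks: &[(Point, usize)]) -> Option<Vec<Point>> {
    let mut original: Vec<Option<Point>> = vec![None; ORIGINAL_SHARDS];
    for &(point, index) in chunks {
        debug_assert!(index < TOTAL_SHARDS);
        if index < ORIGINAL_SHARDS {
            original[index] = Some(point);
        }
    }
    // Collect fails (yields None) if any systematic index is missing.
    original.into_iter().collect()
}
```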
@zdave-parity (Collaborator) commented:

All the /2s make this a bit unpleasant to read. Maybe sensible to introduce another constant for the number of original shards, or change the meaning of W_E to this and use 2W_E for piece size? This is a style question that is probably best answered by Gav though.

@danicuki (Contributor, Author) commented Dec 8, 2025

@zdave-parity I totally agree with you. @gavofyork what is your suggestion?

- Once done, then imported segments must be reconstructed. This process may in fact be lazy as the Refine function makes no usage of the data until the \emph{fetch} host-call is made. Fetching generally implies that, for each imported segment, erasure-coded chunks are retrieved from enough unique validators (342, including the guarantor) and is described in more depth in appendix \ref{sec:erasurecoding}. (Since we specify systematic erasure-coding, its reconstruction is trivial in the case that the correct 342 validators are responsive.) Chunks must be fetched for both the data itself and for justification metadata which allows us to ensure that the data is correct.
+ Once done, then imported segments must be reconstructed. This process may in fact be lazy as the Refine function makes no usage of the data until the \emph{fetch} host-call is made. Fetching generally implies that, for each imported segment, erasure-coded chunks are retrieved from enough unique validators ($\nicefrac{\Cecpiecesize}{2}$, including the guarantor) and is described in more depth in appendix \ref{sec:erasurecoding}. (Since we specify systematic erasure-coding, its reconstruction is trivial in the case that the correct $\nicefrac{\Cecpiecesize}{2}$ validators are responsive.) Chunks must be fetched for both the data itself and for justification metadata which allows us to ensure that the data is correct.

Validators, in their role as availability assurers, should index such chunks according to the index of the segments-tree whose reconstruction they facilitate. Since the data for segment chunks is so small at 12 octets, fixed communications costs should be kept to a bare minimum. A good network protocol (out of scope at present) will allow guarantors to specify only the segments-tree root and index together with a Boolean to indicate whether the proof chunk need be supplied. Since we assume at least 341 other validators are online and benevolent, we can assume that the guarantor can compute $\importsegmentdata$ and $\justifysegmentdata$ above with confidence, based on the general availability of data committed to with $\mathbf{s}^\clubsuit$, which is specified below.
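For concreteness, a minimal Rust sketch of the two ideas in the excerpt above: a chunk request carrying only the segments-tree root, index, and proof flag, and a segment whose reconstruction is deferred until the \emph{fetch} host-call. All names and types here are hypothetical; the graypaper specifies behaviour, not an API, and the network protocol is explicitly out of scope.

```rust
/// Hypothetical request shape per the paragraph above: the segments-tree
/// root and index identify the chunk, and a flag indicates whether the
/// justification (proof) chunk should be supplied as well.
struct SegmentChunkRequest {
    segments_tree_root: [u8; 32],
    segment_index: u32,
    with_justification: bool,
}

/// An imported segment whose bytes are reconstructed lazily: Refine
/// never touches the data until the `fetch` host-call, so chunk
/// retrieval can be deferred until this point.
struct ImportedSegment {
    segments_tree_root: [u8; 32],
    segment_index: u32,
    data: Option<Vec<u8>>, // populated on first fetch
}

impl ImportedSegment {
    fn fetch(&mut self) -> &[u8] {
        if self.data.is_none() {
            // Hypothetical: gather 342 distinct chunks (the guarantor's
            // own included) and run the recovery function; trivial when
            // the responsive validators hold the systematic chunks.
            let req = SegmentChunkRequest {
                segments_tree_root: self.segments_tree_root,
                segment_index: self.segment_index,
                with_justification: true,
            };
            self.data = Some(retrieve_and_recover(&req));
        }
        self.data.as_deref().unwrap()
    }
}

/// Placeholder for the network + decode path described in the text.
fn retrieve_and_recover(_req: &SegmentChunkRequest) -> Vec<u8> {
    unimplemented!("out of scope: network protocol and RS recovery")
}
```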
@zdave-parity (Collaborator) commented:

341 here should be $\nicefrac{\Cecpiecesize}{2} - 1$ I guess, though that is a bit of a mouthful.

@danicuki (Contributor, Author) replied:

maybe $\nicefrac{\Cecpiecesize}{2} - 1 = 341$ ?
