New "packed" random oracle input type #10005

mrmr1993 · 2022-01-11T21:41:57Z

This PR is based off PR #9935, but refactored to change only the code that relates to random oracle input. In order to compile, this includes changes that were previously scattered across PRs #9933, #9934, #9935, #9936, #9937, #9938, #9939, #9940, with some modifications to separate out the other unrelated changes that were also bundled with them.

This PR takes a slightly different strategy to #9935, by keeping the random oracle inputs (and the signatures derived from them) separated as Random_oracle.Input.Chunked and Random_oracle.Input.Legacy (equivalently Schnorr.Chunked, Schnorr.Legacy), so that it is clear at every usage which of the two input systems is used.

This also makes the requisite changes for the client SDK and for rosetta, ensuring that both continue to use the Legacy hashing/signature mode for compatibility purposes. As in the original, 'signed transactions' also use the Legacy hashing/signature mode, to ensure that various implementations do not need to be updated.

The term 'chunk' refers to a value whose bit length is known and smaller than that of a field element. We treat each chunk as an indivisible bitstring and chain chunks together to form field elements by computing a + b * 2^n + c * 2^(n+m) + ..., where n and m are the bit lengths of a and b respectively. For example, consider

u16 // 16-bit integer
u32 // 32-bit integer
u64 // 64-bit integer
// The 'chunking' algorithm will group the following
[u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u32, u32, u32, u32, u32, u32, u32, u32, u64, u64, u64, u64]
// into the following batches
[ [u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16, u16], // 240 bits
  [u16, u32, u32, u32, u32, u32, u32, u32], // 240 bits
  [u32, u64, u64, u64], // 224 bits
  [u64] // 64 bits
]

This differs from the original implementation, which decomposed all non-field-element data into bits and recombined those bits back into field elements. This decomposition and recomposition is significantly more expensive than the extra hashing we incur by not doing so.
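The batching in the example above can be reproduced with a greedy pass over the chunk widths. This is only a minimal sketch, not the code from this PR: the function name `group_chunks` is hypothetical, and the 254-bit limit is an assumption (a batch has to stay strictly below a roughly 255-bit field element).

```python
# Hypothetical sketch: greedy grouping of fixed-width chunks into batches.
# MAX_BITS = 254 is an assumption, not a value stated in the PR description.
MAX_BITS = 254


def group_chunks(widths, max_bits=MAX_BITS):
    """Group chunk bit widths, in order, into batches of at most max_bits."""
    batches, current, used = [], [], 0
    for width in widths:
        if width > max_bits:
            raise ValueError("chunk is wider than a batch")
        # Start a new batch when the next chunk would overflow this one.
        if used + width > max_bits:
            batches.append(current)
            current, used = [], 0
        current.append(width)
        used += width
    if current:
        batches.append(current)
    return batches


# 16 x u16, 8 x u32, 4 x u64, as in the example above.
widths = [16] * 16 + [32] * 8 + [64] * 4
batches = group_chunks(widths)
batch_sizes = [sum(batch) for batch in batches]
print(batch_sizes)  # [240, 240, 224, 64]
```

Chunks are never split across batches, which is why the third batch stops at 224 bits instead of borrowing part of the next u64.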

Checklist:

Modified the current draft of release notes with details on what is completed or incomplete within this project
Document code purpose, how to use it
- Mention expected invariants, implicit constraints
Tests were added for the new behavior
- Document test purpose, significance of failures
- Test names should reflect their purpose
All tests pass (CI will check this if you didn't)
Serialized types are in stable-versioned modules
Does this close issues? List them

bkase

Legacy around client-sdk and Rosetta looks good 👍

This reverts commit 53a929f.

NOTE: fixing this programmatically from the log involves (from vim)

  :tabe src/app/replayer/test/archive_db.sql
  :vsplit ~/Downloads/mina_build_XXXXX_replayer-test.log

You can set up a (slow) macro to perform all of the replacements with

  qq/expected_ledger_hash 2f""ayi"4f""syi"h:%s/a/s/g lq

Then, for N the number of replacements, we run

  N@q

and go grab a coffee. This *WILL* lock up your vim instance for some time.

jspada · 2022-01-13T12:05:05Z

Should #9935 be closed now?

mrmr1993 · 2022-01-13T14:10:50Z

> Should #9935 be closed now?

When this is merged, the surrounding PRs should be readjusted to avoid duplicating it. There's still unmerged code in those 7 other PRs.

jspada · 2022-01-18T18:50:51Z

Based on the description, it seems like this can have collisions.

Simple collision example (assume the field element size is 8 bits to keep it short):

Equation from above: a + b * 2^n + c * 2^(n+m) + ...

                   a0   b0    a1   b1
M = m4 m4 m4 m4 = [0000 0000, 0000 1000]
                = [0 + 0*2^4, 0 + 8*2^4]
                = [0, 128]

             a0    b0
N = n4 n8 = [0000, 10000000]
          = [0, 128]

I've probably missed something important.
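The collision above can be checked mechanically. The following is a toy sketch under the same 8-bit assumption, not the packing code from this PR; the helper `pack` is hypothetical and, as in the example, lets a packed element use all 8 bits.

```python
def pack(chunks, max_bits):
    """Greedily pack (value, width) chunks into integers of at most max_bits.

    Within one packed element the first chunk occupies the low bits:
    a + b * 2^n + c * 2^(n+m) + ...
    """
    packed, acc, used = [], 0, 0
    for value, width in chunks:
        assert width <= max_bits, "chunk is wider than a packed element"
        assert 0 <= value < (1 << width), "value does not fit its width"
        # Start a new element when the next chunk would overflow this one.
        if used + width > max_bits:
            packed.append(acc)
            acc, used = 0, 0
        acc += value << used
        used += width
    if used:
        packed.append(acc)
    return packed


# M = m4 m4 m4 m4 and N = n4 n8 from the example above.
M = [(0b0000, 4), (0b0000, 4), (0b0000, 4), (0b1000, 4)]
N = [(0b0000, 4), (0b10000000, 8)]

packed_m = pack(M, max_bits=8)
packed_n = pack(N, max_bits=8)
print(packed_m, packed_n)  # [0, 128] [0, 128]
```

The two inputs have different chunk structures but produce the same packed elements, so they would hash identically.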

mrmr1993 · 2022-01-19T13:08:44Z

> Based on the description, it seems like this can have collisions.

Indeed it could, except we only use this random oracle input type when the structure of the underlying data is fixed, so it should be a non-issue. Perhaps part of the code review should be confirming that, though?

jspada · 2022-01-19T14:13:02Z

> > Based on the description, it seems like this can have collisions.
>
> Indeed it could, except we only use this random oracle input type when the structure of the underlying data is fixed, so it should be a non-issue. Perhaps part of the code review should be confirming that, though?

Either that or making it safe no matter how it's used. I suspect the reason it's this way is for performance in the snark, so the latter may be too expensive, correct?

mrmr1993 · 2022-01-19T17:17:09Z

> Either that or making it safe no matter how it's used. I suspect the reason it's this way is for performance in the snark, so the latter may be too expensive, correct?

The latter makes the snark more expensive, yes. We should write a variant of this called Chunked_variable_length or some such for this, but I don't think this PR is the place for it.

jspada

Huge diff!
Checked things carefully for mistakes.
Left a lot of questions and some suggestions.
Approved!

src/lib/random_oracle/random_oracle.ml

src/lib/non_zero_curve_point/non_zero_curve_point.ml

src/lib/pickles/pickles.ml

src/lib/pickles/pickles.mli

src/lib/pickles/side_loaded_verification_key.ml

src/lib/mina_base/token_permissions.ml

src/lib/signature_lib/schnorr.ml

src/lib/uptime_service/uptime_service.ml

buildkite/scripts/replayer-test.sh

querolita · 2022-01-20T16:46:49Z

Going back to the encoding of the input: when you say that the underlying data structure is fixed, do you mean that you would never be using different types of input vectors as part of the same scenario? Meaning, by the context one would be able to know what is the underlying composition of the chunk? Otherwise, one would need some bits of information to differentiate between different "recipes" leading to those chunks. For a shorter example of 32-bit chunks, these could either be [u32] or [u16, u16]. When the output domain only has (in this case) 2^32 possible values, in order to avoid collisions, one can only have at most 2^32 different inputs. But given that 32-bit chunks can have 2 compositions, one instead would have twice that input domain size, and thus collisions. So if the underlying structure is "assumed" and known a priori, then there seem to be no collisions.

mrmr1993 · 2022-01-21T15:02:05Z

> Going back to the encoding of the input: when you say that the underlying data structure is fixed, do you mean that you would never be using different types of input vectors as part of the same scenario? Meaning, by the context one would be able to know what is the underlying composition of the chunk?

Correct, yes. We only use Poseidon hashing where we need to mirror the hashing within a snark circuit; because the format of the data in the snark circuit is necessarily fixed by the permutation argument, we always know that the data will be laid out in exactly the same way.

> Otherwise, one would need some bits of information to differentiate between different "recipes" leading to those chunks. For a shorter example of 32-bit chunks, these could either be [u32] or [u16, u16]. When the output domain only has (in this case) 2^32 possible values, in order to avoid collisions, one can only have at most 2^32 different inputs. But given that 32-bit chunks can have 2 compositions, one instead would have twice that input domain size, and thus collisions. So if the underlying structure is "assumed" and known a priori, then there seem to be no collisions.

Agreed. As the proof system becomes more flexible (and as SnarkyJS becomes more able to use that flexibility) we'll probably want to create a version of this that does the 'safe' thing, but for now I believe it isn't an issue.
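The point agreed on here can be illustrated with a small round trip: once the layout is fixed and known a priori, packing can be inverted, so two distinct inputs with the same layout cannot collide. The names `pack_fixed` and `unpack_fixed` and the 32-bit limit are hypothetical, chosen to match the [u32] versus [u16, u16] example rather than the real implementation.

```python
def pack_fixed(values, layout, max_bits):
    """Pack values whose bit widths are given by a fixed layout."""
    assert len(values) == len(layout), "values must match the layout"
    packed, acc, used = [], 0, 0
    for value, width in zip(values, layout):
        assert width <= max_bits
        assert 0 <= value < (1 << width)
        if used + width > max_bits:
            packed.append(acc)
            acc, used = 0, 0
        acc |= value << used
        used += width
    if used:
        packed.append(acc)
    return packed


def unpack_fixed(packed, layout, max_bits):
    """Invert pack_fixed by replaying the same layout."""
    values, index, used = [], 0, 0
    for width in layout:
        if used + width > max_bits:
            index, used = index + 1, 0
        values.append((packed[index] >> used) & ((1 << width) - 1))
        used += width
    return values


LAYOUT_U32 = [32]          # a single u32
LAYOUT_U16_U16 = [16, 16]  # two u16 values

packed = pack_fixed([0x1234, 0xABCD], LAYOUT_U16_U16, max_bits=32)
as_two_u16 = unpack_fixed(packed, LAYOUT_U16_U16, max_bits=32)
as_one_u32 = unpack_fixed(packed, LAYOUT_U32, max_bits=32)
```

With the layout known, `as_two_u16` recovers the original pair. Read under the other layout, the same packed element decodes to the single value `0xABCD1234`, which is the ambiguity that only arises when the layout is not fixed.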

mrmr1993 · 2022-01-21T20:23:27Z

After some debugging, this PR now increases the amounts in the payment test by 10x, to ensure that block rewards do not cause the balance of the timed account to fall back into the valid range. (cc @QuiteStochastic)

Izaak Meckler and others added 6 commits January 11, 2022 12:29

add a new packed random oracle input type

364d9dc

input changes for supporting types

8863636

Rename Random_oracle_input to Random_oracle_input.Chunked

d7bac03

Convert types for chunked random_oracle_input

689e476

Fix compilation of rosetta with new random_oracle inputs

8b350d9

Fix compilation of nonconsensus code / client_sdk

a4e8423

mrmr1993 added the ci-build-me label Jan 11, 2022

mrmr1993 requested review from a team as code owners January 11, 2022 21:41

bkase approved these changes Jan 11, 2022

mrmr1993 and others added 3 commits January 12, 2022 01:10

Merge branch 'develop' into feature/new-random-oracle-input

088c14f

Fixup Account.Index.to_input

f65d03d

TEMP COMMIT; DO NOT MERGE

53a929f

mrmr1993 requested a review from a team as a code owner January 12, 2022 03:38

mrmr1993 added 3 commits January 12, 2022 03:43

Fixup client_sdk compilation

84b6ebd

Different account hash -> different ledger hashes

894602e

mrmr1993 mentioned this pull request Jan 12, 2022

Fix block production race condition in integration test #10009

Merged

6 tasks

Merge branch 'develop' into feature/new-random-oracle-input

c17928e

mrmr1993 added 4 commits January 19, 2022 20:32

Adorn Random_oracle_input.Chunked with doc-comments

80a12f7

Exposition in side_loaded_verification_key

d750afe

Reformat

40e7358

Merge branch 'develop' into feature/new-random-oracle-input

dafa766

jspada approved these changes Jan 20, 2022

Reduce impacts of block rewards in payments test by 10x-ing amounts

e1eb5fa

mrmr1993 force-pushed the feature/new-random-oracle-input branch from dfd8c2c to e1eb5fa January 21, 2022 20:20

Further increase payments test amounts to remove impact of block rewards

070301e

QuiteStochastic approved these changes Jan 21, 2022

mrmr1993 merged commit 8f5adb8 into develop Jan 21, 2022

mrmr1993 deleted the feature/new-random-oracle-input branch January 21, 2022 23:11

mitschabaude mentioned this pull request Mar 1, 2022

New "packed" random oracle input type #9935

Closed
