Signed-digit multi-comb ecmult_gen algorithm #1057

sipa · 2021-12-27T22:51:37Z

A third iteration of the signed-digit multi-comb ecmult_gen algorithm (earlier attempts: #693, and #546 by Peter Dettman). Short summary:

A new constant-time point multiplication algorithm with precomputation (so only used for multiply with G).
Based on section 3.3 of https://eprint.iacr.org/2012/309 by Mike Hamburg.
Configurable through two parameters: COMB_BLOCKS and COMB_TEETH
- Currently only 3 predefined configurations reachable through ./configure (tables 2 kB, 22 kB, 86 kB). All three are included in precomputed_ecmult_gen.c and tested in CI. The 2 kB option is already comparable in speed with the current code.
- Many more configurations can be reached by manually setting the macros. These are not tested.

Compared with the previous PR #693:

Updated to the new static-precomputation-only model (Fully static precomputation tables #893).
Just 3 curated configurations reachable through configure.
Removed some optimizations that do not matter (much).
Do blinding through an final correction add rather than an initial start point, which may later permit usage of incomplete addition formulae (Try a non-uniform group law (e.g., for ecmult_gen)? #1051).
The recoding of the input scalar to signed bit representation is done slightly differently, which needs fewer special cases.

peterdettman · 2021-12-28T11:48:14Z

Just out of curiosity, some perf. numbers (best Min of 3 bench sign, 64-bit, i7-9750H):

branch	ecdsa_sign	schnorrsig_sign
master	29.5	23.1
this PR	25.6	19.2
experimental	23.8	17.7

("experimental" is this PR plus the other PRs for normalize, group formulae, and the "vector" modinv).

This introduces a new secp256k1_scalar_half function which multiplies a scalar with the multiplicative inverse of 2 (modulo order).

Instead of having the starting point of the ecmult_gen computation be offset, do it with the final point. This enables reasoning over the set of points reachable in intermediary computations, which can be leveraged by potential future optimization. Because the final point is in affine coordinates, its projective blinding is no longer possible. It will be reintroduced again in a different way, in a later commit. Also introduce some more comments and more descriptive names.

sipa · 2021-12-29T00:08:41Z

@peterdettman Care to redo benchmarks for the latest commit (I've removed the incomplete comb optimization, and re-added the uint32_t[9] recoded approach)?

siv2r · 2021-12-29T03:28:20Z

These were the best min I got (running the benchmark thrice) on my machine (64-bit, i7-8750H).

branch	ecdsa_sign	schnorrsig_sign
master	64.4	49.6
this PR	57.0	42.0

peterdettman · 2021-12-29T06:00:58Z

Updated perf. numbers (-O3, best Min of 3 bench sign, 64-bit, i7-9750H):

branch	ecdsa_sign	schnorrsig_sign
master	29.6	23.1
this PR	25.7	19.4
experimental	24.2	17.9

("experimental" is this PR plus the other PRs for normalize, group formulae, and the "vector" modinv).

So this looks just slightly slower than before, but perfectly fine if we are merge-focused. We can go hunting the extra 2% once we've booked the 20%.

This introduces the signed-digit multi-comb multiplication algorithm for constant-time G multiplications (ecmult_gen). It is based on section 3.3 of "Fast and compact elliptic-curve cryptography" by Mike Hamburg (see https://eprint.iacr.org/2012/309). Original implementation by Peter Dettman, with changes by Pieter Wuille to use scalars for recoding, and additional comments.

It is unnecessary to recompute the 2^COMB_BITS-1 scalar offset needed by the SDMC algorithm for every multiplication; move it into the context scalar_offset value instead.

The existing code needs to deal with the edge case that bit_pos >= 256, which would lead to an out-of-bounds read from secp256k1_scalar. Instead, recode the scalar into an array of uint32_t with enough zero padding at the end to alleviate the issue. This also simplifies the code, and is necessary for a security improvement in a follow-up commit. Original code by Peter Dettman, with modifications by Pieter Wuille.

sipa · 2021-12-29T21:00:28Z

Restarting this in a new PR to avoid the WIP discussion: #1058.

sipa changed the title ~~WIP Reword of Signed-Digit Multicomb~~ WIP Rework of Signed-Digit Multicomb Dec 27, 2021

sipa force-pushed the 202112_sdmc branch from f404b8a to c914fcc Compare December 28, 2021 01:31

sipa added 2 commits December 28, 2021 19:03

Add secp256k1_scalar_half

f8d148b

This introduces a new secp256k1_scalar_half function which multiplies a scalar with the multiplicative inverse of 2 (modulo order).

sipa force-pushed the 202112_sdmc branch from c914fcc to 151935e Compare December 29, 2021 00:07

sipa force-pushed the 202112_sdmc branch from 151935e to 7498630 Compare December 29, 2021 20:29

sipa changed the title ~~WIP Rework of Signed-Digit Multicomb~~ Signed-digit multi-comb ecmult_gen algorithm Dec 29, 2021

peterdettman and others added 10 commits December 29, 2021 15:54

Always generate tables for current (blocks,teeth) config

1777163

Provide 3 configurations accessible through ./configure

878e678

Optimization: move 2^COMB_BITS-1 offset into ctx->scalar_offset

a23da75

It is unnecessary to recompute the 2^COMB_BITS-1 scalar offset needed by the SDMC algorithm for every multiplication; move it into the context scalar_offset value instead.

Optimization: first table lookup needs no point addition

d4ab037

Optimization: avoid unnecessary doublings in precomputation

10e6d6b

Make secp256k1_scalar_get_bits support 32-bit reads

eab993a

Reduce side channels from single-bit reads

3d90274

Reintroduce projective blinding

4a4d16e

sipa force-pushed the 202112_sdmc branch from 7498630 to 4a4d16e Compare December 29, 2021 20:54

sipa closed this Dec 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Signed-digit multi-comb ecmult_gen algorithm #1057

Signed-digit multi-comb ecmult_gen algorithm #1057

sipa commented Dec 27, 2021 •

edited

Loading

peterdettman commented Dec 28, 2021

sipa commented Dec 29, 2021

siv2r commented Dec 29, 2021

peterdettman commented Dec 29, 2021 •

edited

Loading

sipa commented Dec 29, 2021

Signed-digit multi-comb ecmult_gen algorithm #1057

Signed-digit multi-comb ecmult_gen algorithm #1057

Conversation

sipa commented Dec 27, 2021 • edited Loading

peterdettman commented Dec 28, 2021

sipa commented Dec 29, 2021

siv2r commented Dec 29, 2021

peterdettman commented Dec 29, 2021 • edited Loading

sipa commented Dec 29, 2021

sipa commented Dec 27, 2021 •

edited

Loading

peterdettman commented Dec 29, 2021 •

edited

Loading