perf: math: make Int.Size() faster by computation not len(MarshalledBytes) #16263

odeke-em · 2023-05-23T16:17:19Z

This change computes Int.Size() from the value's bit length, translated into a base-10 digit count, instead of first invoking .Marshal and taking the length of the result. Since .MarshalTo has to call .Size() and then .Marshal, the old approach did the marshalling work twice.

The results show improvements:

$ benchstat before.txt after.txt
name       old time/op    new time/op    delta
IntSize-8    23.9µs ± 3%    20.3µs ± 1%  -15.15%  (p=0.000 n=9+9)

name       old alloc/op   new alloc/op   delta
IntSize-8    6.62kB ± 0%    5.09kB ± 0%  -23.19%  (p=0.000 n=10+10)

name       old allocs/op  new allocs/op  delta
IntSize-8       186 ± 0%       177 ± 0%   -4.84%  (p=0.000 n=10+10)

Fixes #10331

odeke-em · 2023-05-23T16:17:50Z

/cc @elias-orijtech @ValarDragon

ValarDragon · 2023-05-23T18:41:20Z

math/int.go

+	}
+
+	// Use Log10(x) for values less than (1<<64)-1, given it is only defined for [1, (1<<64)-1]
+	if i.Cmp(bigMaxUint64) <= 0 {


Isn't this equivalent to bitLen <= 64?

It is, and changing it to bitLen <= 64 makes no significant difference to the result. I used bigMaxUint64 for readability and because, when writing out the algorithm, I was thinking in terms of log2: a value like 18446744073709551616 is 1 greater than maxUint64, and passing it through math.Log10(float64(i.Uint64())) would result in -Inf. Sure, I'll change it to just that comparison.

ValarDragon · 2023-05-23T18:42:06Z

math/int.go

+
+	// At this point we should just keep reducing by 10^19 as that's the smallest multiple
+	// of 10 that matches the digit length of (1<<64)-1
+	var ri *big.Int


Can we pass this in as a scratch variable each loop, to avoid re-allocations?

ping @odeke-em

ValarDragon · 2023-05-23T18:43:24Z

math/int.go

+	return size + sizeBigInt(ii, alreadyMadeCopy)
+}
+
+func sizeBigInt(i *big.Int, alreadyMadeCopy bool) (size int) {


I am a bit confused, since this feels like we could solve this with more pre-computation. E.g. for every bit length, store "markers" in a map for values of that bit length that correspond to different sizes.

(And then we'd have 0 allocs per Size call)

Nah, it isn't that simple. I had independently tried this approach out: a bunch of numbers sit on the boundary between bit lengths and require a search between the closest values. I also roughly checked the reduction in allocations, and it wasn't significant. I shall refine it in the future, but for the purpose of clear wins this approach will work great as is. The way to get accuracy is to:

while precomputing the lookup table, for the same bit length B: store []*big.Int{startingBitLenValue, ...endBitLenValue}

when you lookup by bit length, perform a binary search (range search) in the retrieved list and see if the value falls within that range

This was the code for my prior experiment which I'll refine after this PR lands:

type savings struct {
	bi *big.Int
	bl int
}

var bLenMap map[int]*savings

func computeLUT() {
	// 1. Compute the tables on which we have a divergence in digits
	// for log values.
	// Goal to lookup the number of digits given a bit length
	bLenMap = map[int]*savings{
		0: {new(big.Int).SetUint64(1), 1},
	}
	for i := uint(0); i <= 258; i++ {
		ii := new(big.Int).SetUint64(1)
		ii = ii.Lsh(ii, i)
		len10Digits := len(ii.String())
		if false {
			fmt.Printf("%-79s bitLen: %-4d %d\n", ii, ii.BitLen(), len10Digits)
		}
		bLenMap[ii.BitLen()] = &savings{bl: len10Digits, bi: ii}
	}
}

func init() {
	computeLUT()
}

func lookup(ii *big.Int) int {
	ii = ii.Abs(ii)
	// Could be lazily computed using sync.Once only when needed
	bits := ii.BitLen()
	closest, ok := bLenMap[bits]
	if !ok {
		return 1
	}

	switch cmp := ii.Cmp(closest.bi); {
	case cmp == 0:
		return closest.bl

	case cmp < 0:
		for {
			bits--
			retr, ok := bLenMap[bits]
			if !ok {
				break
			}
			if ii.Cmp(retr.bi) >= 0 {
				break
			}
			closest = retr
		}

	case cmp > 0:
		for {
			bits++
			retr, ok := bLenMap[bits]
			if !ok {
				break
			}
			if ii.Cmp(retr.bi) <= 0 {
				if retr.bi.BitLen() == ii.BitLen() {
					closest = retr
				}
				break
			}
			closest = retr
		}
	}
	return closest.bl
}

ValarDragon

LGTM, nice job!

This is a simpler version of cosmos#16263 that only optimizes values that fit in 53 bits. It is possible to optimize values that fit in 64 bits by using big.Int.BitLen and math.Log2(10), but it doesn't seem worth the complexity, especially given the revert of cosmos#16263. I failed to beat big.Int.Marshal for values that don't fit in 64 bits.
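The 53-bit limit mentioned here lines up with the 53-bit significand of a float64: integers beyond that width are not all exactly representable, so any digit count derived from a float64 conversion can be off near a power of ten. A small sketch of the precision loss follows; the helper name `roundTrips` is hypothetical and the code is not taken from either PR.

```go
package main

import "fmt"

// roundTrips (hypothetical helper) reports whether u survives a conversion
// to float64 and back unchanged. Inputs are assumed to be below 1<<64 after
// rounding, so the conversion back to uint64 stays in range.
func roundTrips(u uint64) bool {
	return uint64(float64(u)) == u
}

func main() {
	var exact uint64 = 1 << 53
	var inexact uint64 = 1<<53 + 1
	var nineteenNines uint64 = 9_999_999_999_999_999_999 // 10^19 - 1, 19 digits

	fmt.Println(roundTrips(exact))
	fmt.Println(roundTrips(inexact))
	// 10^19 - 1 rounds up to 1e19, which has 20 digits.
	fmt.Println(float64(nineteenNines) == 1e19)
}
```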

odeke-em marked this pull request as ready for review May 23, 2023 16:17

odeke-em requested a review from a team as a code owner May 23, 2023 16:17

github-prbot requested review from a team, alexanderbez and samricotta and removed request for a team May 23, 2023 16:17

odeke-em force-pushed the math-implement-faster-Int.Size branch 2 times, most recently from 107fe85 to ae4bd37 Compare May 23, 2023 16:26

odeke-em changed the title ~~feat: math: make Int.Size() faster by computation not len(MarshalledBytes)~~ perf: math: make Int.Size() faster by computation not len(MarshalledBytes) May 23, 2023

odeke-em force-pushed the math-implement-faster-Int.Size branch from ae4bd37 to 3972f74 Compare May 23, 2023 16:29

ValarDragon reviewed May 23, 2023

odeke-em force-pushed the math-implement-faster-Int.Size branch from 3972f74 to 481f32a Compare May 25, 2023 08:20

ValarDragon approved these changes May 26, 2023

odeke-em force-pushed the math-implement-faster-Int.Size branch 2 times, most recently from f7834c0 to 34980a3 Compare May 30, 2023 08:40

tac0turtle assigned kocubinski and testinginprod Jun 3, 2023

odeke-em force-pushed the math-implement-faster-Int.Size branch from 34980a3 to a90b021 Compare June 6, 2023 17:41

testinginprod approved these changes Jun 8, 2023

tac0turtle enabled auto-merge June 8, 2023 09:27

lint fix

81eba1e

tac0turtle disabled auto-merge June 8, 2023 10:03

julienrbrt added this pull request to the merge queue Jun 8, 2023

julienrbrt removed this pull request from the merge queue due to a manual request Jun 8, 2023

Merge branch 'main' into math-implement-faster-Int.Size

6bd97e1

odeke-em enabled auto-merge June 8, 2023 15:02

odeke-em added this pull request to the merge queue Jun 8, 2023

Merged via the queue into main with commit 9b9e319 Jun 8, 2023

odeke-em deleted the math-implement-faster-Int.Size branch June 8, 2023 15:18

julienrbrt mentioned this pull request Jun 8, 2023

chore: fix concurrency group sims and math changelog #16462

Merged

19 tasks

julienrbrt added a commit that referenced this pull request Aug 21, 2023

fix(math): revert #16263

d4c672a

julienrbrt mentioned this pull request Aug 21, 2023

fix(math): revert #16263 and add test cases #17489

Merged

20 tasks

github-merge-queue bot pushed a commit that referenced this pull request Aug 21, 2023

fix(math): revert #16263 and add test cases (#17489)

bf24916

julienrbrt mentioned this pull request Aug 21, 2023

fix(math): incorrect .Size() value #17485

Closed

20 tasks

elias-orijtech mentioned this pull request Aug 21, 2023

perf: optimize math.Int.Size for small values #17497

Merged
