
feat: gnovm benchmarking tool #2241

Merged: 33 commits merged into gnolang:master on Dec 18, 2024

Conversation

piux2
Contributor

@piux2 piux2 commented May 30, 2024

Contributors' checklist...
  • [x] Added new tests, or not needed, or not feasible
  • [x] Provided an example (e.g. screenshot) to aid review or the PR is self-explanatory
  • [x] Updated the official documentation or not needed
  • [x] No breaking changes were made, or a BREAKING CHANGE: xxx message was included in the description
  • [x] Added references to related issues and PRs
  • [x] Provided any useful hints for running manual tests
  • Added new benchmarks to generated graphs, if any. More info here.

We built this tool mainly to address the following issues:

#1826
#1828
#1281
#1973

We could also use it in the following cases:

#1973
#2222

gnobench benchmarks the time consumed for each VM CPU OpCode and persistent access to the store, including marshalling and unmarshalling of realm objects.

Design consideration

Minimum Overhead and Footprint

  • Constant build flags enable benchmarking (see the sketch after this list).
  • Encode operations and measurements in binary.
  • Dump to a local file in binary.
  • No logging, printout, or network access involved.
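
A minimal sketch of the constant build-flag pattern described above; the file layout, constant name, and functions here are illustrative assumptions, not the PR's actual API:

```go
//go:build benchmarkingops

// Package benchops sketch: with the benchmarkingops tag this file is
// compiled and OpsEnabled is a constant true. A mirror file guarded by
// `//go:build !benchmarkingops` would declare `const OpsEnabled = false`,
// so default builds compile the guarded blocks away entirely.
package benchops

import "time"

const OpsEnabled = true

var opStart time.Time

// StartOp begins timing a single opcode execution.
func StartOp() {
	if OpsEnabled {
		opStart = time.Now()
	}
}

// StopOp returns the elapsed time for the opcode that just ran;
// it compiles to nothing when OpsEnabled is a constant false.
func StopOp() time.Duration {
	if OpsEnabled {
		return time.Since(opStart)
	}
	return 0
}
```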

Accuracy

  • Pause the timer for storage access while performing VM opcode benchmarking.
  • Measure each OpCode execution in nanoseconds.
  • Store access includes the duration for Amino marshalling and unmarshalling.

It is built on top of @deelawn's design and framework with @jaekwon's input.
#2073

Usage

Simple mode

The benchmark only involves the GnoVM and the persistent store. It benchmarks the bare minimum components, and the results are isolated from other components. We use standardized Gno contracts to perform the benchmarking.

This mode is best for benchmarking each major release and/or changes in the GnoVM.

make opcode
make storage

Production mode

It benchmarks the node in a production environment with minimal overhead.
We can not only benchmark with the standardized contracts but also capture live usage in the production environment.
This gives us a complete picture of the node's performance.

  1. Build the production node with benchmarking flags:

go build -tags "benchmarkingstorage benchmarkingops" gno.land/cmd/gnoland

  2. Run the node in the production environment. It will dump benchmark data to a benchmark.bin file.

  3. Call the realm contracts at gno.land/r/x/benchmark/opcodes and gno.land/r/x/benchmark/storage.

  4. Stop the server after the benchmarking session is complete.

  5. Run the following command to convert the binary dump:

gnobench -bin path_to_benchmark_bin

It converts the binary dump to results.csv and results_stats.csv.
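
Put together, a production-mode session looks roughly like the following; the `gnoland start` invocation, the location of benchmark.bin, and the way the realms are exercised are assumptions for illustration, not verbatim from the PR:

```sh
# 1. Build the node with both benchmarking tags enabled.
go build -tags "benchmarkingstorage benchmarkingops" gno.land/cmd/gnoland

# 2. Run the node; benchmark records are appended to benchmark.bin.
./gnoland start

# 3. In another terminal, exercise the standardized realms, e.g. with gnokey:
#    call gno.land/r/x/benchmark/opcodes and gno.land/r/x/benchmark/storage,
#    then stop the server.

# 4. Convert the binary dump into CSV files.
gnobench -bin ./benchmark.bin
#  -> results.csv (raw samples) and results_stats.csv (averages / std dev)
```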

Results (Examples)

The benchmarking results are stored in two files:

  1. The raw results are saved in results.csv.

| Operation | Elapsed Time (ns) | Disk IO Bytes |
|-----------------|--------|---|
| OpEval | 40333 | 0 |
| OpPopBlock | 208 | 0 |
| OpHalt | 167 | 0 |
| OpEval | 500 | 0 |
| OpInterfaceType | 458 | 0 |
| OpPopBlock | 166 | 0 |
| OpHalt | 125 | 0 |
| OpInterfaceType | 21125 | 0 |
| OpEval | 541 | 0 |
| OpEval | 209 | 0 |
| OpInterfaceType | 334 | 0 |
  2. The averages and standard deviations are summarized in results_stats.csv.

| Operation | Avg Time (ns) | Avg Size | Time Std Dev (ns) | Count |
|----------------|-----|---|------|-------|
| OpAdd | 101 | 0 | 45 | 300 |
| OpAddAssign | 309 | 0 | 1620 | 100 |
| OpArrayLit | 242 | 0 | 170 | 700 |
| OpArrayType | 144 | 0 | 100 | 714 |
| OpAssign | 136 | 0 | 95 | 2900 |
| OpBand | 92 | 0 | 30 | 100 |
| OpBandAssign | 127 | 0 | 62 | 100 |
| OpBandn | 97 | 0 | 54 | 100 |
| OpBandnAssign | 125 | 0 | 113 | 100 |
| OpBinary1 | 128 | 0 | 767 | 502 |
| OpBody | 127 | 0 | 145 | 13700 |
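
For quick ad-hoc analysis of the raw dump, a small script can recompute per-opcode averages; this is a rough sketch (not part of the PR), and it assumes the results.csv column order shown above (Operation, Elapsed Time, Disk IO Bytes) with a header row:

```go
package main

import (
	"encoding/csv"
	"fmt"
	"os"
	"strconv"
)

// Recomputes the average elapsed time per opcode from results.csv.
func main() {
	f, err := os.Open("results.csv")
	if err != nil {
		panic(err)
	}
	defer f.Close()

	rows, err := csv.NewReader(f).ReadAll()
	if err != nil {
		panic(err)
	}
	if len(rows) < 2 {
		return // nothing but a header (or empty file)
	}

	sum := map[string]int64{}
	count := map[string]int64{}
	for _, row := range rows[1:] { // skip the header row
		ns, err := strconv.ParseInt(row[1], 10, 64)
		if err != nil {
			continue // skip malformed lines
		}
		sum[row[0]] += ns
		count[row[0]]++
	}
	for op, n := range count {
		fmt.Printf("%-20s avg=%dns samples=%d\n", op, sum[op]/n, n)
	}
}
```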

@piux2 piux2 requested review from jaekwon, moul, deelawn, zivkovicmilos and a team as code owners May 30, 2024 08:34
@github-actions github-actions bot added 🧾 package/realm Tag used for new Realms or Packages. 📦 🤖 gnovm Issues or PRs gnovm related 📦 ⛰️ gno.land Issues or PRs gno.land package related labels May 30, 2024
Contributor

@deelawn deelawn left a comment


Looks good. Perhaps we could move the transient size variable so that it is defined inside the benchmarking package as an exported variable. Then it can be set and read strictly within benchmarking constant conditional blocks, avoiding any unnecessary allocations when benchmarking is not enabled.

What do you think?
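
A rough sketch of what that suggestion could look like; the identifiers are hypothetical, not the PR's actual code:

```go
// Package benchmarking sketch: the transient size lives inside the package
// and is only touched within constant-guarded blocks, so when Enabled is a
// constant false the compiler removes those blocks and the hot path pays
// nothing for it.
package benchmarking

// Enabled would be flipped to true by a build-tag file, as elsewhere in
// this PR; it is a constant, not a variable.
const Enabled = false

// TransientSize records the size of the value currently being persisted.
var TransientSize int64

// SetTransientSize is called by the store before measuring a write.
func SetTransientSize(n int64) {
	if Enabled {
		TransientSize = n
	}
}
```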

@jaekwon
Contributor

jaekwon commented Jun 9, 2024

Waiting for updates as per discussions with Ray

Contributor

@ltzmaxwell ltzmaxwell left a comment


preliminary review.

@zivkovicmilos
Member

Hey @piux2, can you rebase this with master? 🙏

Contributor

@mvertes mvertes left a comment


The PR provides:

  1. Performance measurement capabilities for opcodes (count, execution time) and persistent storage (count, size). This part is similar to performance counter registers in processors. It would be better to call it perf (the VM capability) instead of benchops (a particular usage of this capability, and not the only possible one, e.g. profiling, gas measurement). See for example Linux or FreeBSD.

  2. An executable called gnobench, to perform a benchmark on a specific scenario (a sample forum board in gno). This executable should not be located under gnovm, but in contribs/benchmark, with
    its dependencies:

    • the readme, makefile, etc in contrib/benchmark
    • the executable itself in contrib/benchmark/cmd/gnobench
    • benchmarked gno files in contrib/benchmark/gno
    • the vm part still in gnovm/pkg/perf

Regarding the build tags to enable or disable the counting, I'm not a big fan of that. It brings a lot of hassle, and it's partly useless because counting is necessary in production anyway. The performance gain of using a constant instead of a variable (requiring a different executable) to enable the feature in opcode processing was shown to be negligible when we introduced the debugger; it should be the same here. A former colleague of mine used to say: "Premature optimisation is the root of all evil" :-)

In more details, the measurement of opcodes consists of counting and measuring execution time. Counting is super cheap (a counter increment operation), and could be always performed. Measuring execution time is probably a bit more costly and it makes sense to enable it only for benchmark scenarios. In production, we could have a static pre-computed profile of mean exec time per opcode, for gas estimation. But again, no sufficient reason to use build tags.

Regarding the storage of performance counters, they should be stored per virtual machine instead of in a global variable; global measurement state is generally a bad practice, bringing a lot of unstated constraints and risks. I understand that it is already the case for other parts of the VM, and that it facilitates the coding, but this is something that we should eventually fix. It can be done later; I don't want to block the PR for that.

@Gno2D2
Collaborator

Gno2D2 commented Dec 2, 2024

🛠 PR Checks Summary

All Automated Checks passed. ✅

Manual Checks (for Reviewers):
  • IGNORE the bot requirements for this PR (force green CI check)

@piux2
Contributor Author

piux2 commented Dec 3, 2024

Hey @piux2, can you rebase this with master? 🙏

done

@piux2
Contributor Author

piux2 commented Dec 5, 2024

@mvertes thank you for the comments!

The pr provides:

  1. Performance measurement capabilities for opcodes (count, execution time) and persistent storage (count, size). This part is similar to performance counter registers in processors. It would be better to call it perf (the VM capability) instead of benchops (a particular usage of this capability, and not the only possible one, e.g. profiling, gas measurement). See for example Linux or FreeBSD.

Thank you for the reference!

I can see that Linux's perf tool measures software and hardware events in a way similar to how we measure opcode execution and storage access times. However, in our case, we use a set of standardized contracts in benchops/gno to evaluate the execution time of each opcode in the Gno VM runtime. This forms the essence of our benchmarking.

To truly implement performance measurement of the VM, similar to what Linux perf does with software and hardware events, we would need to develop a comprehensive new tool.

Alternatively, we might reserve the name 'Gno Performance and Profiling Tools' for measuring the VM's execution of user contracts in terms of gas consumption, similar to how the Go runtime measures performance for Go code.

  2. An executable called gnobench, to perform a benchmark on a specific scenario (a sample forum board in gno). This executable should not be located under gnovm, but in contribs/benchmark, with
    its dependencies:

    • the readme, makefile, etc in contrib/benchmark
    • the executable itself in contrib/benchmark/cmd/gnobench
    • benchmarked gno files in contrib/benchmark/gno
    • the vm part still in gnovm/pkg/perf

I moved benchops/cmd out of /pkg and placed it in gnovm/cmd/benchops since it is still part of the VM.

Regarding the build tags to enable or disable the counting, I'm not a big fan of that. It brings a lot of hassle, and it's partly useless because counting is necessary in production anyway. The performance gain of using a constant instead of a variable (requiring a different executable) to enable the feature in opcode processing was shown to be negligible when we introduced the debugger; it should be the same here. A former colleague of mine used to say: "Premature optimisation is the root of all evil" :-)

I totally agree, and I advocate "Premature optimization is the root of all evil".

All we did was use a build flag to set a constant flag, instead of a variable flag, to gate the non-production code. The Go compiler removes the entire if block when its condition is a constant false value. The constant flag removes the 10-18% overhead we would otherwise incur when capturing the accurate duration of an operation in nanoseconds with a timer.
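
To see why the constant matters, here is a rough micro-benchmark (not part of the PR) contrasting a constant-false guard, which the compiler eliminates, with a variable-false guard, which stays on the hot path; the exact numbers will differ from the 10-18% figure above, which was measured in the real opcode loop:

```go
// file: guard_bench_test.go (illustrative only)
package benchflag

import (
	"testing"
	"time"
)

const constEnabled = false // branch removed at compile time

var varEnabled = false // branch evaluated on every iteration

var sink time.Duration

func BenchmarkConstGuard(b *testing.B) {
	for i := 0; i < b.N; i++ {
		if constEnabled {
			start := time.Now()
			sink += time.Since(start)
		}
	}
}

func BenchmarkVarGuard(b *testing.B) {
	for i := 0; i < b.N; i++ {
		if varEnabled {
			start := time.Now()
			sink += time.Since(start)
		}
	}
}
```

Run with `go test -bench=Guard` to compare the two per-iteration costs.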

Contributor

@mvertes mvertes left a comment


OK by me, provided that we move the bench executable and examples into contrib at some point, and that we ensure a single, simple way to build gnovm. This can be addressed in a further PR.

piux2 added a commit that referenced this pull request Dec 17, 2024

## Summary:

This PR updates the CPU and Store gas based on results from the benchmarking tool: #2241

For CPU gas, the measurement is in nanoseconds per opcode execution.
For storage gas, the measurement is in nanoseconds per byte for each
type of Gno store access.

## Changes: 

We moved the gas meter from the underlying store to the upper Gno store
to capture accurate resource consumption for VM transactions. At the
same time, we retain the original gas store and gas meter for the Auth
Keeper to handle regular blockchain transactions that do not necessarily
involve the VM.

We also updated the gas-wanted in the integration test to reflect actual
gas usage. This can serve as a flag to alert us to future changes that
might increase gas assumptions.

Additional reasons for these changes include:

- The Gno VM store processes additional complex encoding and decoding of
data structures, rather than simply reading and writing bytes to disk.
- For the above reason, we benchmarked gas for store access at the Gno
store level.
- We want to avoid consuming gas at two points for a single store access
during a VM transaction.
 
Here are the diagrams explaining the store-access gas before and after the changes:

#### Before:


![image](https://github.com/user-attachments/assets/7f1f7361-623d-4bdf-ba28-9b337da649fc)

#### After:

![image](https://github.com/user-attachments/assets/c5ce6e36-e2f7-4f10-bebf-4278f9d16e47)


Contributors' checklist...

  • [x] Added new tests
  • [ ] Provided an example (e.g. screenshot) to aid review or the PR is self-explanatory
  • [ ] Updated the official documentation or not needed
  • [ ] No breaking changes were made, or a `BREAKING CHANGE: xxx` message was included in the description
  • [x] Added references to related issues and PRs
  • [ ] Provided any useful hints for running manual tests
Member

@thehowl thehowl left a comment


Made a few changes for consistency and usability of the tool, but I'd say we're good to merge and iterate.

@thehowl thehowl merged commit b91f2fb into gnolang:master Dec 18, 2024
106 of 107 checks passed
omarsy pushed a commit to omarsy/gno that referenced this pull request Dec 18, 2024