Continuous Trie Log Pruning #6075

siladu · 2023-10-24T06:26:23Z

Feature toggled by --Xtrie-log-pruning-enabled
Add trie log pruning after a successful TrieLogManager.saveTrieLog
Each time a trie log is persisted, the current trie log is cached and the pruner is run against the oldest entries in the cache.
This makes no attempt to manage the backlog of old trie logs, it will only prune what has been added to the cache, i.e. trie logs that have been added since the feature was enabled.
Pruner limit exists in case of exceptional circumstances, but we should only ever be pruning all the forks for a single block number during each prune execution.

This is a second take of #6026
Currently built on #6072 (diff: siladu/besu@refactor-trie-log-manager...trie-log-pruning-take2)

One downside of this PR compared to #6026 is that restarting besu may leave a gap in the pruned trie logs because we don't preload the cache. This will have to be considered as part of managing the backlog. It may mean that we can't simply rely on a "one-off" resync or prune subcommand. However, these un-pruneable trie logs should be negligible compared to the saving gained for long running nodes.

Signed-off-by: Simon Dudley <[email protected]>

Separate out the concepts of world state caching from trie log management Make AbstractTrieLogManager a concrete implemenation (to be further renamed/refactored next commit) Signed-off-by: Simon Dudley <[email protected]>

Signed-off-by: Simon Dudley <[email protected]>

Feature toggled by --Xtrie-log-pruning-enabled Each time a trie log is persisted, the current trie log is cached and the pruner is run against the oldest entries in the cache. This makes no attempt to manage the backlog of old trie logs, it will only prune what has been added to the cache, i.e. trie logs that have been added since the feature was enabled. Pruner limit exists in case of exceptional circumstances, but we should only ever be pruning all the forks for a single block number during each prune execution. Signed-off-by: Simon Dudley <[email protected]>

Was only used for supporting test code and can instead reuse static factory from InMemoryKeyValueStorageProvider Signed-off-by: Simon Dudley <[email protected]>

github-actions · 2023-10-24T06:26:40Z

I thought about documentation and added the doc-change-required label to this PR if updates are required.
I thought about the changelog and included a changelog update if required.
If my PR includes database changes (e.g. KeyValueSegmentIdentifier) I have thought about compatibility and performed forwards and backwards compatibility tests

...rc/main/java/org/hyperledger/besu/ethereum/referencetests/BonsaiReferenceTestWorldState.java

    @Override
-    public void reset() {}
+    public void saveTrieLog(


Signed-off-by: Simon Dudley <[email protected]>

Inialised once on startup Signed-off-by: Simon Dudley <[email protected]>

Signed-off-by: Simon Dudley <[email protected]>

siladu · 2023-10-26T06:38:06Z

Abandoning this approach in favour of an adapted form of the original #6026

siladu added 6 commits October 23, 2023 13:29

Move TrieLogProvider setup code to AbstractTrieLogManager

048623a

Signed-off-by: Simon Dudley <[email protected]>

Decouple CachedWorldStorageManager from TrieLogManager

24bdcf6

Separate out the concepts of world state caching from trie log management Make AbstractTrieLogManager a concrete implemenation (to be further renamed/refactored next commit) Signed-off-by: Simon Dudley <[email protected]>

Rename AbstractTrieLogManager to DefaultTrieLogManager

7a6552c

Signed-off-by: Simon Dudley <[email protected]>

Make TrieLogManager the default implementation and remove the interface

0bb66f1

Signed-off-by: Simon Dudley <[email protected]>

Remove a BonsaiWorldStateProvider constructor

ec6f52b

Was only used for supporting test code and can instead reuse static factory from InMemoryKeyValueStorageProvider Signed-off-by: Simon Dudley <[email protected]>

siladu changed the title ~~Move TrieLogProvider setup code to AbstractTrieLogManager~~ Continuous Trie Log Pruning Oct 24, 2023

github-advanced-security bot found potential problems Oct 24, 2023

View reviewed changes

siladu added TeamGroot GH issues worked on by Groot Team and removed TeamGroot GH issues worked on by Groot Team labels Oct 24, 2023

siladu added 7 commits October 25, 2023 11:56

Merge branch 'main' into trie-log-pruning-take2

def2160

Signed-off-by: Simon Dudley <[email protected]>

Merge branch 'main' into trie-log-pruning-take2

d007a2c

Signed-off-by: Simon Dudley <[email protected]>

javadoc

164c77f

Signed-off-by: Simon Dudley <[email protected]>

copy paste error

e0ad379

Signed-off-by: Simon Dudley <[email protected]>

Preload cache with trielogs limited by pruningLimit

af1e314

Inialised once on startup Signed-off-by: Simon Dudley <[email protected]>

Log when adding to prune cache

d921ad4

Signed-off-by: Simon Dudley <[email protected]>

Prune any orphaned blocks included in prune window during load

d19a691

Signed-off-by: Simon Dudley <[email protected]>

siladu closed this Oct 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Continuous Trie Log Pruning #6075

Continuous Trie Log Pruning #6075

siladu commented Oct 24, 2023 •

edited

Loading

github-actions bot commented Oct 24, 2023

siladu commented Oct 26, 2023

Continuous Trie Log Pruning #6075

Continuous Trie Log Pruning #6075

Conversation

siladu commented Oct 24, 2023 • edited Loading

github-actions bot commented Oct 24, 2023

siladu commented Oct 26, 2023

siladu commented Oct 24, 2023 •

edited

Loading