all: implement path based storage #23733
Conversation
This comment tries to explain the design of the path-based storage scheme at a high level.

Path based storage scheme

The new state scheme is based on the node path: all nodes are saved to disk with their path as the key. This means there is only a single version of the trie stored on disk, and it is updated in place whenever the chain makes progress. The database key of a trie node is encoded from the node path. To be precise, the key format can be represented by this diagram (the account hash part is only present for storage trie nodes):
+------------+-----------------------+-------------------------------------+
| Key Prefix | 32 bytes account hash | Suffix compact encoding(node path)  |
+------------+-----------------------+-------------------------------------+

Key prefix

The key prefix is a single byte which indicates the database entry type. Since the node key length is not fixed, the prefix is needed to tell trie node entries apart from other database entries. In the implementation, the key prefix is chosen as TrieNodePrefix (the byte "w", see the review discussion below).

Storage namespace

In Ethereum there are two layers of tries: the account trie and a bunch of storage tries. Thus, for a node in a storage trie, the real path consists of two parts: the path in the account trie and the path in the storage trie. The account hash (the path of the node in the account trie) is therefore encoded into the database key.

Internal path

The last part is the internal path of the node within its trie, stored in the suffix-compact encoding shown in the diagram. Note that keys sharing a prefix are compressed well by LevelDB, which is the main reason for choosing this encoding scheme.
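To make the layout concrete, here is a rough Go sketch of how such a key could be assembled. The helper names are invented, the path is assumed to already be suffix-compact encoded, and the prefix byte is taken from the review discussion below; treat this as an illustration, not the PR's actual code.

// Hypothetical sketch of the key layout described above.
package main

import "fmt"

// trieNodePrefix stands in for the single-byte entry-type prefix
// (TrieNodePrefix, the byte "w", per the review discussion below).
var trieNodePrefix = []byte("w")

// accountTrieNodeKey builds the key of an account-trie node: prefix + path.
func accountTrieNodeKey(compactPath []byte) []byte {
	return append(append([]byte{}, trieNodePrefix...), compactPath...)
}

// storageTrieNodeKey builds the key of a storage-trie node:
// prefix + 32-byte account hash (the owner) + path.
func storageTrieNodeKey(accountHash [32]byte, compactPath []byte) []byte {
	key := append(append([]byte{}, trieNodePrefix...), accountHash[:]...)
	return append(key, compactPath...)
}

func main() {
	var owner [32]byte
	fmt.Printf("account node key: %x\n", accountTrieNodeKey([]byte{0x11, 0x23}))
	fmt.Printf("storage node key: %x\n", storageTrieNodeKey(owner, []byte{0x11, 0x23}))
}

Since sibling nodes share the owner hash and a path prefix, their keys end up adjacent in LevelDB, which is what makes the prefix compression mentioned above effective.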
With the new storage scheme, only one version of the trie node belonging to a specific trie path is saved on disk. The entire storage key encoding is implemented in core/rawdb.

Alternatives

There are a few other encoding schemes; check the proposal for more details. Suggestions are appreciated!

In-memory trie nodes

In order to serve shallow reorgs (shorter than the depth of the persistent singleton trie), a bunch of trie nodes need to be maintained in memory for a while and flushed to disk only once they are mature enough (at least 128 block confirmations). The structure managing all in-memory nodes lives in trie.Database, which offers a few functionalities on top of the layers described below.
The structure can be represented by this diagram. Just like the snapshot tree, trie nodes belonging to different states are organized into different layers, and layers are linked to each other by a parent-child relationship. At most 128 diff layers are maintained; the bottom-most diff layer is flushed into disk once too many nodes have accumulated.

+-----------------+-----------------+
| State Y | State Y' |
+-----------------+-----------------+
| State X | State X' |
+-----------------------------------+
| ....... |
+-----------------------------------+
| Bottom-most diff layer |
+-----------------------------------+
| Disk layer(singleton trie) |
+-----------------------------------+

Trie changes

Stateless commit

Previously, the dirty nodes were pushed into the database's dirty set directly during Commit. However, in the new scheme the nodes committed for a block have to form a standalone state diff, so the commit procedure is changed a bit. Instead of pushing committed nodes blindly into the dirty set, the committed nodes are returned by the trie, and callers need to aggregate the results from multiple tries (account trie, storage tries) into a complete state diff for building a new diff layer. Check the linked code for more implementation details.

Self-contained dirty set

After the commit operation, a result object is generated which contains all dirty nodes. Besides returning the result object to the outside, the trie itself also needs to maintain a dirty node set in order to access the latest state. So an additional set is added inside the trie, and all the dirty nodes created since the trie's creation are recorded there. Note that this dirty set is never released unless the trie object is deallocated.

State sync changes

Retire fast sync

In the new scheme the trie nodes are stored by path, so accessing trie nodes via hash is impossible, which breaks fast sync. Retiring fast sync is not done by this PR, but I believe that by the time we try to merge this one, it will already be retired.

Twist snap sync

In the snap sync healing stage, the standard procedure for downloading missing nodes
is slightly changed in the new scheme,
so that stale state with the same path is silently overwritten by the state syncer, and the synced state eventually strictly matches the latest version. Implementation-wise, a new key scheme, the internal key, is introduced. It is widely used in the project (e.g. the state syncer, trie.Database, the sync bloom) and can distinguish different versions of nodes (same path, different hash).

Upgrade path

Upgrading from the legacy scheme to the path-based scheme requires a re-sync in theory. However, a trick can be applied to avoid it (but it's still recommended to do a re-sync in order to slim down your node).
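To make the internal key idea concrete, here is a hedged Go sketch. The exact byte layout is an assumption; the only property taken from this thread is that both the path-based storage key and the node hash can be recovered from it, as DecodeInternalKey does in the diffs further down.

// Rough sketch only; the PR's real internal-key helpers (DecodeInternalKey
// appears in the diffs below) may pack the fields differently.
package main

import "fmt"

const hashLength = 32 // size of a node hash in bytes

// encodeInternalKey appends the node hash to the path-based storage key.
func encodeInternalKey(storageKey []byte, hash [hashLength]byte) []byte {
	return append(append([]byte{}, storageKey...), hash[:]...)
}

// decodeInternalKey splits an internal key back into storage key and hash.
func decodeInternalKey(key []byte) (storageKey []byte, hash [hashLength]byte) {
	n := len(key) - hashLength
	copy(hash[:], key[n:])
	return key[:n], hash
}

func main() {
	var h [hashLength]byte
	h[0] = 0xde
	key := encodeInternalKey([]byte("w\x11\x23"), h)
	path, hash := decodeInternalKey(key)
	fmt.Printf("path=%x hash=%x\n", path, hash)
}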
Impressive piece of work!
It's going to take a while to go through though :)
core/rawdb/database.go
Outdated
case bytes.HasPrefix(key, TrieNodePrefix):
	tries.Add(size)
case len(key) == common.HashLength:
	archiveTries.Add(size)
Won't 1/256th of all archive tries be mis-tagged as tries?
What do you mean? :P
Legacy nodes and archive nodes are all stored with the node hash. I think it's fine to put the legacy nodes into the archive node category.
I mean that out of all un-prefixed 32-byte keys that are raw trie nodes (archive), 1/256 will have their first byte be w, so they will be tagged as prefixed tries instead of raw tries.
It's not highly important here, but in general those types of errors can be messy to debug.
Ah, yes, you are right. It's really annoying to handle the legacy-format trie nodes. I think we have had this issue all along, even in the current master. The archive nodes in the current code should only be a small part (genesis) in most cases, but yes, let's try to make it better.
It would be interesting to test this approach in this context: #22497 (comment)
Actually, in core.StateDB we won't explicitly clean out the storage of a deleted state object. So IIUC, nothing different will happen. It's an interesting question for me: how should we handle the destructed account's storage? It makes no sense to leak it out on disk, but a proper solution needs to be found to avoid unexpected OOM.
Hm, so in the current implementation, the storage trie will be left untouched after the deletion? If so, what will happen if the contract (say Contract A) is destructed and later re-created?
How do you avoid the accidental resurrection of the storage slots (which should now be cleared) if you haven't deleted them by-path?
Actually, it's OK to leave the deleted state data on disk. In the case you described above, Contract A is re-created with an empty storage root hash, so the leaked storage data won't be accessed since the node hash won't match. More specifically: in trie.Database's diffLayer, nodes are retrieved with the internal key, which includes the node hash. https://github.com/sscodereth/go-ethereum/blob/path-based-storage/trie/difflayer.go#L116 In trie.Database's diskLayer, nodes are retrieved with the storage key (path), but the data hash is still compared against the requested one. https://github.com/sscodereth/go-ethereum/blob/path-based-storage/trie/disklayer.go#L81 Going back to the scenario: the re-created Contract A won't load any data while resolving a non-existent root node, so the leftover storage data won't be accessed at all.
But we definitely don't want to leak out any deleted data; I will improve this part for sure. I just want to clarify that even if deleted data is leaked, correctness won't be affected.
But actually I don't fully understand this. If we want to delete the storage slots of a huge contract, the disk write will be chunked into several small writes. The total disk write can still be huge, but I'm not sure it will lead to OOM. Relevant code:

// Ensure we don't delete too much data blindly (contract can be
// huge). It's ok to flush, the root will go missing in case of a
// crash and we'll detect and regenerate the snapshot.
if batch.ValueSize() > ethdb.IdealBatchSize {
if err := batch.Write(); err != nil {
log.Crit("Failed to write storage deletions", "err", err)
}
batch.Reset()
}
I read the whole PR, but I feel I need some more time for things to sink in. From what I can see, it will not conflict too much with my verkle tree work, but adapting it to the new multi-layer format will definitely require a lot of work. In spite of this, I think this is a welcome change and a lot of great work.
I see a lot of the snapshot code has been duplicated. With your model, does the "old" snapshot still make sense, if so how, and if not, should it not be deleted?
@gballet thanks for your feedback!
Yes, the multi-layer structure is duplicated, since I found it's a perfect data structure for handling mini reorgs in Ethereum. The snapshot is still needed since it offers direct state access. I have actually been thinking about abstracting the multi-layer structure out to remove the duplication, but I don't think it's a good time to do that right now, for two reasons: (1) it's already a huge change, and refactoring would make it even harder to review; (2) I still have to finish the reverse-diff part, and it would be nice to give priority to that first. But you are totally right, the duplication is ugly :) I will try to abstract it out a bit later.
Yes, it's changed a bit. For example in trie, originally the dirty nodes were pushed into disk directly in Commit, but they are now wrapped in the CommitResult and returned to the caller. It might affect your verkle trie.
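For illustration only, here is a minimal sketch of the caller-side aggregation this implies; the commitResult shape and the merge helper are invented for the sketch and are not the PR's actual CommitResult API.

// Illustrative only: the PR's real CommitResult may carry different fields.
package main

import "fmt"

// commitResult stands in for the object returned by a trie Commit, carrying
// the dirty nodes keyed by their path-based storage key (nil blob = deleted).
type commitResult struct {
	nodes map[string][]byte
}

// mergeCommitResults aggregates the account-trie result and all touched
// storage-trie results into one state diff for building a new diff layer.
func mergeCommitResults(results ...*commitResult) map[string][]byte {
	diff := make(map[string][]byte)
	for _, res := range results {
		for key, blob := range res.nodes {
			diff[key] = blob
		}
	}
	return diff
}

func main() {
	accountTrie := &commitResult{nodes: map[string][]byte{"w\x11": {0xc0}}}
	storageTrie := &commitResult{nodes: map[string][]byte{"w<owner>\x22": {0xc1}}}
	diff := mergeCommitResults(accountTrie, storageTrie)
	fmt.Println(len(diff), "dirty nodes collected for the new diff layer")
}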
- Fix non-existent node resolving in Node API
- Add benchmarks for NodeBlob API
- Use the node hash as the bloom filter id
- Change parameters for Trie.Database
This brings down

diff --git a/trie/difflayer.go b/trie/difflayer.go
index 518ed76915..1debf377c1 100644
--- a/trie/difflayer.go
+++ b/trie/difflayer.go
@@ -215,27 +215,20 @@ func (dl *diffLayer) NodeBlob(key []byte) ([]byte, error) {
// all the maps in all the layers below
_, hash := DecodeInternalKey(key)
dl.lock.RLock()
- hit := dl.diffed.ContainsHash(binary.BigEndian.Uint64(hash.Bytes()))
- var origin *diskLayer
- if !hit {
- origin = dl.origin // extract origin while holding the lock
+ if dl.diffed.ContainsHash(binary.BigEndian.Uint64(hash.Bytes())) {
+ d, err := dl.nodeBlob(key, 0)
+ dl.lock.RUnlock()
+ return d, err
}
dl.lock.RUnlock()
-
// If the bloom filter misses, don't even bother with traversing the memory
// diff layers, reach straight into the bottom persistent disk layer
- if origin != nil {
- triedbBloomMissMeter.Mark(1)
- return origin.NodeBlob(key)
- }
- return dl.nodeBlob(key, 0)
+ triedbBloomMissMeter.Mark(1)
+ return dl.origin.NodeBlob(key)
}
// nodeBlob is the inner version of NodeBlob which counts the accessed layer depth.
func (dl *diffLayer) nodeBlob(key []byte, depth int) ([]byte, error) {
- dl.lock.RLock()
- defer dl.lock.RUnlock()
-
// If the layer was flattened into, consider it invalid (any live reference to
// the original should be marked as unusable).
	if dl.Stale() {
I pushed an error-check into the benchmark.
Hm, funky... It builds fine if I remove Node from the snapshot interface:

diff --git a/trie/database.go b/trie/database.go
index aa6bc80b68..52cba6f2fc 100644
--- a/trie/database.go
+++ b/trie/database.go
@@ -99,7 +99,7 @@ type snapshot interface {
// Node retrieves the trie node associated with a particular key. The
// passed key should be encoded in internal format with hash encoded.
// No error will be returned if the node is not found.
- Node(internalKey []byte) (node, error)
+ //Node(internalKey []byte) (node, error)
// Parent returns the subsequent layer of a snapshot, or nil if the base was
// reached.
diff --git a/trie/difflayer.go b/trie/difflayer.go
index 518ed76915..c1f3c92cbf 100644
--- a/trie/difflayer.go
+++ b/trie/difflayer.go
@@ -155,59 +155,6 @@ func (dl *diffLayer) Stale() bool {
return atomic.LoadUint32(&dl.stale) != 0
}
-// Node retrieves the trie node associated with a particular key.
-// The given key must be the internal format node key.
-func (dl *diffLayer) Node(key []byte) (node, error) {
- // Check the bloom filter first whether there's even a point in reaching into
- // all the maps in all the layers below
- _, hash := DecodeInternalKey(key)
- dl.lock.RLock()
- hit := dl.diffed.ContainsHash(binary.BigEndian.Uint64(hash.Bytes()))
- var origin *diskLayer
- if !hit {
- origin = dl.origin // extract origin while holding the lock
- }
- dl.lock.RUnlock()
-
- // If the bloom filter misses, don't even bother with traversing the memory
- // diff layers, reach straight into the bottom persistent disk layer
- if origin != nil {
- triedbBloomMissMeter.Mark(1)
- return origin.Node(key)
- }
- return dl.node(key, 0)
-}
-
-// node is the inner version of Node which counts the accessed layer depth.
-func (dl *diffLayer) node(key []byte, depth int) (node, error) {
- // If the layer was flattened into, consider it invalid (any live reference to
- // the original should be marked as unusable).
- if dl.Stale() {
- return nil, ErrSnapshotStale
- }
- // If the trie node is known locally, return it
- if n, ok := dl.nodes[string(key)]; ok {
- triedbDirtyHitMeter.Mark(1)
- triedbDirtyNodeHitDepthHist.Update(int64(depth))
- triedbBloomTrueHitMeter.Mark(1)
-
- // The trie node is marked as deleted, don't bother parent anymore.
- if n == nil {
- return nil, nil
- }
- triedbDirtyReadMeter.Mark(int64(n.size))
- _, hash := DecodeInternalKey(key)
- return n.obj(hash), nil
- }
- // Trie node unknown to this diff, resolve from parent
- if diff, ok := dl.parent.(*diffLayer); ok {
- return diff.node(key, depth+1)
- }
- // Failed to resolve through diff layers, mark a bloom error and use the disk
- triedbBloomFalseHitMeter.Mark(1)
- return dl.parent.Node(key)
-}
-
// NodeBlob retrieves the trie node blob associated with a particular key.
// The given key must be the internal format node key.
func (dl *diffLayer) NodeBlob(key []byte) ([]byte, error) {
So -- we could choose to just store it as blobs, instead of storing as references to node objects.
I tested implementing that, seems pretty straight-forward: holiman@81af9ab
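For readers following along, a minimal sketch of what such a blob-only layer could look like; the type and field names are invented here, and the linked commit is the actual experiment.

// Sketch of the blob-only idea: a diff layer keeping raw node blobs instead
// of decoded node objects.
package main

import (
	"errors"
	"fmt"
)

var errNotCovered = errors.New("not covered by the diff layers")

// blobDiffLayer maps internal keys to raw RLP blobs (nil = deleted here).
type blobDiffLayer struct {
	parent *blobDiffLayer
	blobs  map[string][]byte
}

// nodeBlob walks up the layer chain and returns the first blob found,
// never decoding it into a node object along the way.
func (dl *blobDiffLayer) nodeBlob(key []byte) ([]byte, error) {
	if blob, ok := dl.blobs[string(key)]; ok {
		return blob, nil // may be nil if the node was deleted in this layer
	}
	if dl.parent != nil {
		return dl.parent.nodeBlob(key)
	}
	// A real implementation would fall through to the disk layer here.
	return nil, errNotCovered
}

func main() {
	bottom := &blobDiffLayer{blobs: map[string][]byte{"a": {0xc0}}}
	top := &blobDiffLayer{parent: bottom, blobs: map[string][]byte{"b": {0xc1}}}
	blob, err := top.nodeBlob([]byte("a"))
	fmt.Printf("blob=%x err=%v\n", blob, err)
}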
blob, nodeHash := rawdb.ReadTrieNode(dl.diskdb, path)
if len(blob) == 0 || nodeHash != hash {
	blob = rawdb.ReadArchiveTrieNode(dl.diskdb, hash)
	if len(blob) != 0 {
		triedbFallbackHitMeter.Mark(1)
		triedbFallbackReadMeter.Mark(int64(len(blob)))
	}
}
I'm trying to understand this. In the 'normal' case, will this happen e.g. if someone does an ether-transfer to a non-existing account?
If so, it seems pretty expensive if every non-existing trie access triggers an additional ReadArchiveTrieNode?
But I'm not sure I fully understand what situations would trigger this to happen.
Yes, it's true. All the non-existing entries will trigger an additional ReadArchiveTrieNode. It acts as the fallback, mainly for the nodes upgraded from the legacy version.
force-pushed from d0fe0d4 to 9ea7c24
But it's not used though, is it?
Happened during shutdown on one of the benchmarkers:
Closing in favour of #24597
This PR implements the path-based storage scheme. Note, it's a WIP PR; it is opened just to make sure it's on the right track.