refactor: transaction rlp encoding #1536

morph-dev · 2024-10-18T21:00:00Z

What was wrong?

A transaction can be RLP encoded/decoded with (less common) or without (most common) an additional RLP header.

Currently, we have to handle the less common case manually, which isn't great. As of now, we have two use cases for it: inside BlockBody, and when computing the block size in eth_getBlockBy* calls (which is not implemented).

How was it fixed?

Added a const generic argument to the Transaction struct.

The default value represents the most common case, which means that we don't have to specify it in most of the cases.
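A minimal std-only sketch of the idea, not the code from this PR: the field layout, the WITH_HEADER parameter name, and the string_header helper are all assumptions made for illustration. It assumes the usual typed-transaction rule, where the "header" form wraps the type byte plus payload in an RLP byte-string and legacy transactions are unaffected.

```rust
/// RLP header for a byte-string payload of `len` bytes.
/// Assumes `len >= 2`, so the single-byte special case never applies.
fn string_header(len: usize) -> Vec<u8> {
    if len <= 55 {
        return vec![0x80 + len as u8];
    }
    let be = len.to_be_bytes();
    let skip = be.iter().take_while(|b| **b == 0).count();
    let mut out = vec![0xb7 + (be.len() - skip) as u8];
    out.extend_from_slice(&be[skip..]);
    out
}

/// Hypothetical, simplified stand-in for the real `Transaction` type.
/// The const generic defaults to `false`, i.e. "no extra RLP header".
#[derive(Debug, Clone, PartialEq)]
pub struct Transaction<const WITH_HEADER: bool = false> {
    /// `None` for a legacy transaction, `Some(id)` for a typed one.
    pub type_id: Option<u8>,
    /// The already RLP-encoded (non-empty) list of transaction fields.
    pub rlp_fields: Vec<u8>,
}

impl<const WITH_HEADER: bool> Transaction<WITH_HEADER> {
    pub fn encode(&self) -> Vec<u8> {
        match self.type_id {
            // Legacy: the field list is the whole encoding in both modes.
            None => self.rlp_fields.clone(),
            Some(id) => {
                let mut out = Vec::new();
                if WITH_HEADER {
                    // Wrap `type byte || payload` in an RLP byte-string.
                    out.extend(string_header(1 + self.rlp_fields.len()));
                }
                out.push(id);
                out.extend_from_slice(&self.rlp_fields);
                out
            }
        }
    }
}
```

With this shape, a plain `Transaction` in a type position means `Transaction<false>`, and only the rare call sites would spell out `Transaction<true>`. Note that the default applies to type annotations, not to inference inside expressions.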

To-Do

Clean up commit history and use conventional commits.

morph-dev · 2024-10-18T21:01:13Z

I'm not extremely happy with this approach, as it complicates the Transaction object, which is very commonly used (even tho complexity is mostly hidden away).

The only other reasonable approach that I see is to have another type (e.g. TransactionWithRlpHeader, up for a better name suggestion) that would be a wrapper around existing Transaction with its own Encodable/Decodable implementation.
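For comparison, a rough std-only sketch of the wrapper alternative. The local Encodable trait is a simplified stand-in for alloy_rlp's trait (it writes into a Vec<u8> here), the Transaction layout is hypothetical, and only the encoding side is shown; a Decodable counterpart would mirror it.

```rust
/// Simplified stand-in for an RLP `Encodable` trait (assumption: the real
/// trait writes into a generic buffer; a `Vec<u8>` keeps this std-only).
pub trait Encodable {
    fn encode(&self, out: &mut Vec<u8>);
}

/// Hypothetical, simplified transaction: optional type byte plus the
/// already RLP-encoded list of fields.
#[derive(Debug, Clone, PartialEq)]
pub struct Transaction {
    pub type_id: Option<u8>,
    pub rlp_fields: Vec<u8>,
}

impl Encodable for Transaction {
    /// Common case: no additional RLP header.
    fn encode(&self, out: &mut Vec<u8>) {
        if let Some(id) = self.type_id {
            out.push(id);
        }
        out.extend_from_slice(&self.rlp_fields);
    }
}

/// Wrapper with its own encoding; `Transaction` itself stays untouched.
pub struct TransactionWithRlpHeader(pub Transaction);

impl Encodable for TransactionWithRlpHeader {
    fn encode(&self, out: &mut Vec<u8>) {
        let mut inner = Vec::new();
        self.0.encode(&mut inner);
        // Only typed transactions get the extra byte-string header.
        // `inner` holds at least two bytes here, so no single-byte case.
        if self.0.type_id.is_some() {
            if inner.len() <= 55 {
                out.push(0x80 + inner.len() as u8);
            } else {
                let be = inner.len().to_be_bytes();
                let skip = be.iter().take_while(|b| **b == 0).count();
                out.push(0xb7 + (be.len() - skip) as u8);
                out.extend_from_slice(&be[skip..]);
            }
        }
        out.extend_from_slice(&inner);
    }
}
```

The trade-off is that call sites needing the header form must wrap and unwrap explicitly, while everything else keeps using the plain type.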

KolbyML · 2024-10-18T23:06:30Z

I'm not extremely happy with this approach, as it complicates the Transaction object, which is very commonly used (even tho complexity is mostly hidden away).

The only other reasonable approach that I see is to have another type (e.g. TransactionWithRlpHeader, up for a better name suggestion) that would be a wrapper around existing Transaction with its own Encodable/Decodable implementation.

Couldn't we implement the alloy_rlp length() method with the pre-existing code as well? If neither of the proposed refactors is a "happy approach", we can just implement length() and call it a day, no? That would also be much simpler than the proposed changes.

I think making the manual RLP calls instead of deriving them is fine because, as you said, the "manual" path is hardly used. So is it worth the complexity to implement auto derives for a rare use case in the grand scheme of things?

carver · 2024-10-19T00:26:16Z

Oh yeah this is starting to come back to me, it's been about 3 years!

Since you're looking for alternatives, I'll share what we did in py-evm. We found it helpful to have a unified pre-RLP-encoded representation. So that means the primitives of byte-strings and lists (er, vectors), which can contain more byte-strings and vectors. When we correctly serialize into this format, then the RLP library should handle the actual encoding for us (by prepending the length of the payload recursively).

So the approach we took in py-evm was to offer two separate methods on the global transaction type:

serialize(): generate primitives that are suitable for RLP encoding
encode(): generate a self-contained byte-string representing a single transaction, useful for saving to a database, etc

For a legacy transaction, that looks like:

serialize: return a vector of byte-strings, with an entry for each field of the transaction
encode: rlp.encode(self.serialize())

For a typed transaction, split it into two layers, a wrapping layer that identifies the transaction type and a payload layer that is fully defined by that type ID. The payload layer works effectively the same as a legacy transaction, for serialization.

In the wrapping layer:

serialize: return the self.encode() value as a byte-string at the root level, with no vector
encode: prepend the type byte to the output of payload.encode()

Then the standard RLP encoding should work on a list of transactions that have each been run through transaction.serialize(). I haven't messed around with the alloy::rlp library much yet, but I expect it to be straightforward once you are supplying RLP-primitive values.
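The serialize()/encode() split described above can be sketched in Rust as follows. This is a minimal std-only illustration, not py-evm's code and not the alloy_rlp API: RlpItem, rlp_encode, field_list and encode_transaction_list are hypothetical names, and transaction fields are treated as raw byte-strings.

```rust
/// The two RLP primitives: byte-strings and lists of further items.
#[derive(Debug, Clone, PartialEq)]
pub enum RlpItem {
    Bytes(Vec<u8>),
    List(Vec<RlpItem>),
}

/// Length prefix; `offset` is 0x80 for byte-strings and 0xc0 for lists.
fn header(offset: u8, len: usize) -> Vec<u8> {
    if len <= 55 {
        return vec![offset + len as u8];
    }
    let be = len.to_be_bytes();
    let skip = be.iter().take_while(|b| **b == 0).count();
    let mut out = vec![offset + 55 + (be.len() - skip) as u8];
    out.extend_from_slice(&be[skip..]);
    out
}

/// Generic encoder: prepends payload lengths recursively.
pub fn rlp_encode(item: &RlpItem) -> Vec<u8> {
    match item {
        // A single byte below 0x80 is its own encoding.
        RlpItem::Bytes(b) if b.len() == 1 && b[0] < 0x80 => b.clone(),
        RlpItem::Bytes(b) => {
            let mut out = header(0x80, b.len());
            out.extend_from_slice(b);
            out
        }
        RlpItem::List(items) => {
            let payload: Vec<u8> = items.iter().flat_map(|i| rlp_encode(i)).collect();
            let mut out = header(0xc0, payload.len());
            out.extend(payload);
            out
        }
    }
}

pub enum Transaction {
    Legacy { fields: Vec<Vec<u8>> },
    Typed { type_id: u8, fields: Vec<Vec<u8>> },
}

fn field_list(fields: &[Vec<u8>]) -> RlpItem {
    RlpItem::List(fields.iter().cloned().map(RlpItem::Bytes).collect())
}

impl Transaction {
    /// Primitives suitable for embedding in a larger RLP structure.
    pub fn serialize(&self) -> RlpItem {
        match self {
            Transaction::Legacy { fields } => field_list(fields),
            // Typed: the whole encoding as one root-level byte-string.
            Transaction::Typed { .. } => RlpItem::Bytes(self.encode()),
        }
    }

    /// Self-contained byte-string for a single transaction.
    pub fn encode(&self) -> Vec<u8> {
        match self {
            Transaction::Legacy { .. } => rlp_encode(&self.serialize()),
            Transaction::Typed { type_id, fields } => {
                let mut out = vec![*type_id];
                out.extend(rlp_encode(&field_list(fields)));
                out
            }
        }
    }
}

/// A transaction list is just the generic encoder over serialized items.
pub fn encode_transaction_list(txs: &[Transaction]) -> Vec<u8> {
    rlp_encode(&RlpItem::List(txs.iter().map(Transaction::serialize).collect()))
}
```

In this sketch the extra header on typed transactions falls out of the generic encoder: a typed transaction serializes to a byte-string, so the list encoder adds the string prefix without any special casing.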

This is a lot, so I'm happy to get on a call, if that would help.

carver · 2024-10-19T00:32:28Z

nit: since this adds a new ability in the RPC, I think naming the commit something like this would be clearer:
feat: add size to block returned in eth_getBlock*

Also, it could make sense to split into a refactor: commit and a feat: commit (or PR), especially if the refactor continues to grow.

carver · 2024-10-19T02:10:17Z

Since you're looking for alternatives, I'll share what we did in py-evm.

Hm, the longer I look at this, the more I think it's potentially an orthogonal problem. I guess it's possible we never dealt with this funny case just when calculating block size, where just the typed transactions are double-rlp-encoded (it doesn't seem familiar to me). So I guess I don't have any better ideas. 🤷🏻‍♂️

I guess it's somewhat related to how we have to add an extra rlp-encoding to legacy transactions when calculating the transaction root.

KolbyML · 2024-10-19T02:41:45Z

Nvm, I think I was wrong about the performance claims I made. After reading the implementation of the RlpEncodable macro, I see that it implements the Encodable trait's length() function, which avoids encoding any objects to RLP just to get the length.

So with this refactor, assuming everything implements length(), whether through a macro or manually, there should be a performance increase, as all primitives should rely on .len() instead of encoding the RLP.
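To illustrate the difference, here is a small std-only sketch. The Encodable trait below is a simplified stand-in whose default length() encodes into a scratch buffer (which is how the fallback is described in this thread), and Record is a hypothetical two-field type whose override computes the size arithmetically from .len().

```rust
/// Simplified stand-in for an RLP `Encodable` trait (assumption: writes into
/// a `Vec<u8>` to stay std-only).
pub trait Encodable {
    fn encode(&self, out: &mut Vec<u8>);

    /// Fallback: encode into a scratch buffer just to measure it.
    fn length(&self) -> usize {
        let mut scratch = Vec::new();
        self.encode(&mut scratch);
        scratch.len()
    }
}

/// Size of the RLP header that precedes a payload of `payload_len` bytes.
fn header_len(payload_len: usize) -> usize {
    if payload_len <= 55 {
        1
    } else {
        // One prefix byte plus the minimal big-endian bytes of the length.
        let significant = (usize::BITS - payload_len.leading_zeros() + 7) / 8;
        1 + significant as usize
    }
}

/// Writes the header; `offset` is 0x80 for byte-strings, 0xc0 for lists.
fn write_header(offset: u8, payload_len: usize, out: &mut Vec<u8>) {
    if payload_len <= 55 {
        out.push(offset + payload_len as u8);
    } else {
        let be = payload_len.to_be_bytes();
        let skip = be.iter().take_while(|b| **b == 0).count();
        out.push(offset + 55 + (be.len() - skip) as u8);
        out.extend_from_slice(&be[skip..]);
    }
}

/// Encoded size of one byte-string, computed from `.len()` alone.
fn bytes_length(b: &[u8]) -> usize {
    if b.len() == 1 && b[0] < 0x80 {
        1
    } else {
        header_len(b.len()) + b.len()
    }
}

fn encode_bytes(b: &[u8], out: &mut Vec<u8>) {
    if !(b.len() == 1 && b[0] < 0x80) {
        write_header(0x80, b.len(), out);
    }
    out.extend_from_slice(b);
}

/// Hypothetical two-field record, encoded as an RLP list.
pub struct Record {
    pub a: Vec<u8>,
    pub b: Vec<u8>,
}

impl Encodable for Record {
    fn encode(&self, out: &mut Vec<u8>) {
        let payload = bytes_length(&self.a) + bytes_length(&self.b);
        write_header(0xc0, payload, out);
        encode_bytes(&self.a, out);
        encode_bytes(&self.b, out);
    }

    /// Override: pure arithmetic, nothing is encoded or allocated.
    fn length(&self) -> usize {
        let payload = bytes_length(&self.a) + bytes_length(&self.b);
        header_len(payload) + payload
    }
}
```

A type that omits the length() override still gives the right answer through the fallback, but pays for a full encode each time, which is the cost being discussed for the header type.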

KolbyML · 2024-10-19T02:49:02Z

I gave this PR a brief review. We should implement length() for the header, as currently it will still use encode() to get the length because we don't use the RlpEncodable macro for the header.

I will give a more thorough review once all the tests are passing. Sorry about the confusion in my earlier messages.

KolbyML · 2024-10-19T03:20:59Z

ethportal-api/src/types/execution/block_body.rs

@@ -123,17 +123,16 @@ impl BlockBody {
    /// Returns reference to uncle headers.
    ///
    /// Returns None post Merge fork.


Suggested change

/// Returns None post Merge fork.

/// Returns None post Merge fork.

this doc is no longer true

KolbyML

PR looks good. I think this solution is fine, as I would want to avoid a wrapper type; it just kinda feels off to me.

KolbyML · 2024-10-19T16:53:12Z

rpc/src/eth_rpc.rs

-        let size = None;
+        // Note: transactions are encoded with header
+        let size = {
+            let payload_size = header.length()


Suggested change

let payload_size = header.length()

let payload_size = header.length()

Our Header type here doesn't use the macro or implement length() in its implementation of Encodable, so we are still encoding to find the length of Header. Would you want to implement length() for Header in this PR, or make an issue for it so we can solve it in another PR?

This would be the only place in calculating size where we still are encoding to get the length I believe.

morph-dev · 2024-10-19

I will do it in a separate PR (no need to open an issue about it, I will do it right now).

morph-dev · 2024-10-19T17:50:58Z

@KolbyML @carver I created another PR (very similar to this one) where I used a separate type that is just a wrapper around Transaction with a different RLP encoding.

Now that I see both of them, I like that one a bit more because it doesn't complicate the Transaction object that is very frequently used. But I can be convinced otherwise. So let me know what you think about both of them.

KolbyML · 2024-10-19T17:52:14Z

@KolbyML @carver I created another PR (very similar to this one) where I used separate type that is just a wrapper around Transaction with different RLP encoding.

Now that I see both of them, I like that one a bit more because it doesn't complicate the Transaction object that is very frequently used. But I can be convinced otherwise. So let me know what you think about both of them.

I like it more too. I thought it would look weirder in my head.

carver

Prefer the other approach

morph-dev · 2024-10-20T07:00:12Z

Closing in favor of #1539

refactor: transaction rlp encoding

52d9a38

morph-dev requested review from carver, njgheorghita and KolbyML October 18, 2024 21:00

morph-dev self-assigned this Oct 18, 2024

carver mentioned this pull request Oct 19, 2024

fix: add size to eth_getBlockByNumber and eth_getBlockByHash #1534

Closed

KolbyML reviewed Oct 19, 2024

fix: test and pr comment

0982019

KolbyML approved these changes Oct 19, 2024

morph-dev mentioned this pull request Oct 19, 2024

refactor: transaction rlp encoding #1539

Merged

carver requested changes Oct 20, 2024

morph-dev closed this Oct 20, 2024

morph-dev deleted the rlp_transaction branch October 20, 2024 07:01
