Light Client refactoring #237

romac · 2020-04-24T14:54:48Z

Closes: #230
Closes: #229
Closes: #174

Here's take 2 of the light client spike.

This version follows pretty much the same decomposition as in the ADR (minus the Demuxer for lack of time), and I find It much more readable and understandable than the former one. It also drops the predicate library in favor of plain functions returning Result<(), Error> packed in a trait.

The big downside currently is that the components are now coupled (ie. the Scheduler has a reference to the Verifier and the Rpc components).

I haven’t found a way to decouple them without having to extract some of the logic into what would be the Demuxer router/event-loop.

I believe that the only way to get both decoupling + simple control-flow would be to go with async/await and have components communicate via channels with the Demuxer, and wait for specific responses. I will try to sketch something about that this weekend so that we can discuss both versions on Monday.

Referenced an issue explaining the need for the change
Updated all relevant documentation in docs
Updated all code comments where relevant
Wrote tests
Updated CHANGES.md

brapse · 2020-04-27T07:03:06Z

Just to ensure that I understand what we are trying to solve here. The previous light-spike defined the light-client process as an iterator where each iteration would handle a single event processed by a single component and multiple iterations were necessary to complete a “flow”. This granular process made it very testable but hard to read as your brain had to keep track of multiple iteration to see the whole flow.

This is definitely a valid criticism. We defined it inductively which can be a challenge to read. If it takes three iterations through three different components to get anything meaningful done then we constantly have three things in our head.

In this version we get rid of the iterative nature and just call a function which passes control through the different components. It seems similar although clearer than the current version on master. As you mentioned, we keep the components coupled here which I would say is a significant downside.

I don’t know if async would help here or if we need more complex channel communication. Instead I think we just need to solve the granularity problem directly. With a demuxer (as outlined in ADR-006 which handles state as well routing control between components. This way we can have clear separation of concerns and execute entire flows as a single synchronous function calls.

I hope this approach might be the best of both worlds in terms of testability and readability while using minimal language features.

romac · 2020-04-27T14:13:51Z

Here's take 3: https://github.com/informalsystems/tendermint-rs/tree/romac/light-spike-chan/light-spike

This version uses async channels to decouple components, but can still be run synchronously on a single thread.

Doing this gave me an idea for decoupling components without relying on async-await, so expect yet another take on this soon.

romac · 2020-04-27T19:59:07Z

Take 4: https://github.com/informalsystems/tendermint-rs/tree/romac/light-spike-sync-decouple/light-spike

This version removes all async/await from take 3, and replaces the BiChan with a Router trait which can then be passed to the various components for them to call when they need to query other components. This trait is implemented for the Demuxer, but could be implemented for a standalone struct for the purpose of mocking/testing/etc.

romac · 2020-04-28T10:56:19Z

Just updated this PR to take 4.

…adapted

brapse · 2020-05-26T15:30:29Z

light-client/src/components/io.rs

+
+pub struct ProdIo {
+    rpc_clients: HashMap<PeerId, rpc::Client>,
+    peer_map: HashMap<PeerId, tendermint::net::Address>,


Nice to have the abstraction of peerIDs here from the onset 👍

brapse · 2020-05-26T15:58:27Z

light-client/src/components/scheduler.rs

+#[pre(light_store.latest(VerifiedStatus::Verified).is_some())]
+#[post(valid_schedule(ret, target_height, next_height, light_store))]
+pub fn schedule(
+    light_store: &dyn LightStore,


Why do we need to pass the whole store if all we need is the latest trusted height?

This is to match the spec, as @josef-widder suggested the scheduler could perform some optimizations as to what height to verify next based on what headers of nearby heights are already available in the store, but this is not implemented yet.

liamsi · 2020-05-27T10:39:40Z

light-client/src/tests.rs

+// -----------------------------------------------------------------------------
+// Everything below is a temporary workaround for the lack of `provider` field
+// in the light blocks serialized in the JSON fixtures.
+// -----------------------------------------------------------------------------


cc @greg-szabo @Shivani912 this is relevant to testing / serialization

This seems to be implementation specific to me. Go code has a different approach to identify a node as a primary/secondary provider and does not deal with addresses. But also, this approach looks cleaner and reduces a lot of data from test files. Will need some more work around testing of Go code though. Need more thoughts here! @greg-szabo @liamsi @melekes

liamsi · 2020-05-27T10:51:15Z

light-client/Cargo.toml

+prost-amino = "0.5.0"
+contracts = "0.4.0"
+sled = "0.31.0"
+serde_cbor = "0.11.1"


Yet another binary encoding 😱 😄 I assume this is the most reasonable choice to use in combination with sled?

I have to admit I didn't give it too much thought. I initially considered just serializing keys and values to JSON and then to bytes, but that seemed a bit wasteful, so I figured using a dedicated binary encoding was better. In the end, aside from the extra dependencies, the choice of encoding does not matter since it is internal to the light store and specific to the choice of sled as a database. But I'd be happy to discuss alternatives, either for the binary encoding, or for sled :)

Not a big deal but couldnt prost work here for proto3?

light-client/src/components/scheduler.rs

light-client/src/light_client.rs

light-client/src/macros.rs

liamsi

This looks amazing ❤️

brapse

This is really great. Let's do the rest in the follow ups 👍

romac force-pushed the romac/light-spike-bis branch 2 times, most recently from e233d8d to 0e58ab3 Compare May 5, 2020 16:21

romac added 24 commits May 5, 2020 18:35

Rework predicates

cceaa70

WIP: Add tracing

a982821

Fix verification procedure

7e2a9d5

Rename requester component to rpc

8509a13

Rename Trace::run to Trace::collect

cdc2294

Return meaningful data in errors

be3012f

Proper error handling with thiserror+anomaly

6c8a011

Make events PartialEq+Eq

333afd5

Implement verifier

93dbe53

Implement scheduler and bisection

1f325e0

Remove write access to trusted store for scheduler

93b148b

Add a couple of FIXMEs

614c870

Formatting

ded21c9

Fix clippy warnings

745c001

Fix misplaced attribute

0969b7b

Enable VerificationPredicates to be made into a trait object

5948c7b

Allow cloning TSReader

0661ee9

Shorter method name

de049fb

Decouple components using Router trait

799caaa

Silence a couple Clippy warnings

79b26c6

Cleanup trace module

66ef29f

Revamp errors

ede2b76

Revamp error, part 2

884eebd

Bundle verification options together

af46005

romac added 3 commits May 25, 2020 15:01

Fix tests

6777956

Merge branch 'master' into romac/light-spike-bis

8bbb194

Adapt test to new JSON files organization

948ecb4

informalsystems deleted a comment from codecov-commenter May 25, 2020

romac requested a review from brapse May 25, 2020 14:00

romac mentioned this pull request May 25, 2020

light-client: follow-up #280

Closed

19 tasks

romac added 6 commits May 26, 2020 09:25

Rename light-spike crate to light-client

a17beb7

Turn production predicates into default trait impl

5323bb8

Comment out provider field of LightBlock until conformance tests are …

e287152

…adapted

Refactor is_within_trust_period to better match the spec

3ecdec2

Add core verification loop invariant

4934cbc

WIP: Documentation

862ff72

brapse reviewed May 26, 2020

View reviewed changes

Merge branch 'master' into romac/light-spike-bis

7b39443

informalsystems deleted a comment from codecov-commenter May 26, 2020

romac added 2 commits May 26, 2020 17:55

Make cargo fmt happy

5f3f914

Make clippy happy

97e27b1

brapse reviewed May 26, 2020

View reviewed changes

Re-enable provider field in LightBlock struct

1f12e72

liamsi reviewed May 27, 2020

View reviewed changes

light-client/src/components/scheduler.rs Show resolved Hide resolved

liamsi reviewed May 27, 2020

View reviewed changes

light-client/src/light_client.rs Show resolved Hide resolved

liamsi reviewed May 27, 2020

View reviewed changes

light-client/src/macros.rs Show resolved Hide resolved

liamsi reviewed May 27, 2020

View reviewed changes

informalsystems deleted a comment from codecov-commenter May 27, 2020

brapse approved these changes May 27, 2020

View reviewed changes

brapse merged commit 1ba8a2c into master May 27, 2020

brapse deleted the romac/light-spike-bis branch May 27, 2020 13:10

romac mentioned this pull request Jun 9, 2020

sketching #204

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Light Client refactoring #237

Light Client refactoring #237

romac commented Apr 24, 2020 •

edited

Loading

brapse commented Apr 27, 2020 •

edited

Loading

romac commented Apr 27, 2020

romac commented Apr 27, 2020

romac commented Apr 28, 2020

brapse May 26, 2020

brapse May 26, 2020

romac May 26, 2020

liamsi May 27, 2020

Shivani912 May 27, 2020

liamsi May 27, 2020

romac May 27, 2020

ebuchman Jun 5, 2020

liamsi left a comment

brapse left a comment

Light Client refactoring #237

Light Client refactoring #237

Conversation

romac commented Apr 24, 2020 • edited Loading

brapse commented Apr 27, 2020 • edited Loading

romac commented Apr 27, 2020

romac commented Apr 27, 2020

romac commented Apr 28, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

liamsi left a comment

Choose a reason for hiding this comment

brapse left a comment

Choose a reason for hiding this comment

romac commented Apr 24, 2020 •

edited

Loading

brapse commented Apr 27, 2020 •

edited

Loading