Update synchronization logic for ckb-indexer/light client #90

Keith-CY · 2023-01-03T06:35:55Z

Feature has been discussed in #52

Keith-CY · 2023-01-04T02:37:13Z

@Keith-CY add PRD about switch between ckb-index(full node) and light client

Keith-CY · 2023-01-07T09:18:33Z

Request light client to add an API of light client info, with that neuron could detect the service of an endpoint(ckb node/ckb light client): Adding an API to get light client info nervosnetwork/ckb-light-client#118
Add a built-in light client in network list
Make the tag of networks detailed: Mainnet/Testnet/Devnet Node, Mainnet/Testnet/Devnet Light Client, the tag shows in preference/settings network and sync progress at the bottom left corner.
If the built-in light client is connected
1. preference/settings -> data acts on built-in light client
  1. For the path set: CKB Node Config & Storage -> CKB Light Client Config & Storage, the tooltip should be updated too.
  2. For the cache:
  1. date of cache cleared of the built-in light client is separate from that of the built-in full node. Namely, there will be two dates of cache cleared and displayed according to the network type;
  2. fully rebuild index option in clear cache dialog is hidden
2. menu -> tools -> clear all synchronized data works on built-in light client, light client should re-sync.
Add the version of the built-in light client in About Neuron(macOS: menu -> neuron -> about neuron, windows: menu -> help -> about neuron

yanguoyu · 2023-01-08T08:22:25Z

How to switch light client or full node for users?

Keith-CY · 2023-01-08T09:48:58Z

How to switch light client or full node for users?

Switch between light client and full node by selecting different networks, i.e. Neuron will become network-agnostic.

If the feature request of adding a light client info API mentioned above could be supported, Neuron can detect whether the remote service is a full node or a light client by calling local_node_info and light_client_info:

local_node_info responds while light_client_info fails => full node;
local_node_info fails while light_client_info responds => light client;
local_node_info fails while light_client_info fails => unknown service.

yanguoyu · 2023-01-08T11:19:42Z

As I know, now Neuron can start with testnet, but it's hard for general users to use it with testnet, because ckb network can only be set when init.
So I mean if users want to use Neuron with light client, do they also need to start a light client, Or we will provide choices for users to start a light client or full node?

Keith-CY · 2023-01-08T11:35:44Z

As I know, now Neuron can start with testnet, but it's hard for general users to use it with testnet, because ckb network can only be set when init. So I mean if users want to use Neuron with light client, do they also need to start a light client, Or we will provide choices for users to start a light client or full node?

A built-in light client connected to mainnet will be provided by Neuron, as mentioned in point 2 in #90 (comment)

Once the built-in light client is selected, Neuron boosts the light client inside it.

So we provide the options of connecting to internal full node and internal light client by 2 options in the network list

default node => built-in ckb full node
default light client => built-in ckb light client

Keith-CY · 2023-01-10T10:20:16Z

Recommendations from the core team:

It's not recommended to allow users to set the entrypoint of a light client freely because light client and full node are based on different security assumptions.

Users should be clearly notified that the network is a light client, then they can connect to that one.

With this recommendation, the feature would be as follows:

Add 2 built-in light clients Light Client Mainnet and Light Client Testnet' in the network list, Light Client MainnetandLight Client Testnet` are not editable;
Make the tag of networks detailed: Mainnet/Testnet/Devnet Node, Mainnet/Testnet/Devnet Light Client, the tag shows in preference/settings network and sync progress at the bottom left corner.
If the built-in light client is connected
1. preference/settings -> data acts on built-in light client
  1. For the path set: CKB Node Config & Storage -> CKB Light Client Mainnet Config & Storage, the tooltip should be updated too.(CKB Light Client Testnet Config & Storage if connected to testnet)
  2. For the cache:
  1. date of cache cleared of the built-in light clients are independent, respectively, and separate from that of the built-in full node. Namely, there will be three dates of cache cleared and displayed according to the network type(CKB Node Mainnet, CKB Light Client Mainnet, CKB Light Client Testnet);
  2. fully rebuild index option in clear cache dialog is hidden
2. menu -> tools -> clear all synchronized data works on the built-in light client, the light client should re-sync, light clients of mainnet and testnet work independently.
Add the version of the built-in light client in About Neuron(macOS: menu -> neuron -> about neuron, windows: menu -> help -> about neuron
If an external light client is detected(port 9000), it prompts users with the message Failed to start the CKB light client, please check if there's an external one running with button Dismiss and keep retrying start the built-in one.
Light Client Mainnet network option is hidden for now because the light client feature is not activated on mainnet.

yanguoyu · 2023-01-12T03:49:01Z

Does the Light Client Mainnet has a schedule to publish?

Keith-CY · 2023-01-12T07:55:48Z

Does the Light Client Mainnet has a schedule to publish?

Not yet

Keith-CY · 2023-01-12T07:58:38Z

An optional parameter SetScriptCommand was added in set_script API which is for cases as HD wallet derivation
Ref: https://github.com/nervosnetwork/ckb-light-client#set_scripts

Keith-CY · 2023-01-13T06:05:34Z

CKB Light [email protected] was released and could be used as the built-in one for development

Keith-CY · 2023-01-20T03:00:30Z

CKB Light [email protected] includes a portable version for macOS m1

yanguoyu · 2023-02-23T02:39:42Z

I have two problems with this:

How to calculate the left time when syncing with light client? Total scripts left block numbers * Fixed speed, Or the speed we need to calculate by Total synced block numbers/ cost time?
I found when I add new addresses to sync by the light client, It will pause synced higher script until the new scripts sync to the higher block number. So maybe we need to create more addresses once time like 60 received addresses and 30 change addresses.

@Keith-CY

quake · 2023-02-23T03:04:02Z

I have two problems with this:

How to calculate the left time when syncing with light client? Total scripts left block numbers * Fixed speed, Or the speed we need to calculate by Total synced block numbers/ cost time?

You can estimate the time by calculating the progress of min_block_number / tip_number (min_block_number = min(get_scripts.block_number), tip_number = get_tip_header.number)

Keith-CY · 2023-02-23T03:16:11Z

I have two problems with this:

How to calculate the left time when syncing with light client? Total scripts left block numbers * Fixed speed, Or the speed we need to calculate by Total synced block numbers/ cost time?

The remaining time can be estimated by the rule suggested by @quake but it would be a bit confusing because scripts are synced group by group, if the next group of scripts should be derived is uncertain until the current one has fully synced. If the next group of scripts will be derived, the time of sync will be longer. So the estimated time would be like almost done => need more time => almost done => need more time, any idea about this? @Danie0918

I found when I add new addresses to sync by the light client, It will pause synced higher script until the new scripts sync to the higher block number. So maybe we need to create more addresses once time like 60 received addresses and 30 change addresses.

I didn't get the point. Do you mean, synchronization of groupA will stop if groupB is derived by groupA, until groupB syncs to the same block where groupA reached?

yanguoyu · 2023-02-23T03:20:35Z

I have two problems with this:

How to calculate the left time when syncing with light client? Total scripts left block numbers * Fixed speed, Or the speed we need to calculate by Total synced block numbers/ cost time?

You can estimate the time by calculating the progress of min_block_number / tip_number (min_block_number = min(get_scripts.block_number), tip_number = get_tip_header.number)

Got it, for example, min_block_number= 1,000,000, tip_number = 9,000,000, and it has spent 1 hour, then the left time can estimate to 8 hours.

yanguoyu · 2023-02-23T03:23:22Z

I didn't get the point. Do you mean, synchronization of groupA will stop if groupB is derived by groupA, until groupB syncs to the same block where groupA reached?

I guess yes, I think it will sync the group that its block_number is smaller, utils all the groups have the same block_number, then they will sync at the same time.

quake · 2023-02-23T03:28:07Z

2. I found when I add new addresses to sync by the light client, It will pause synced higher script until the new scripts sync to the higher block number. So maybe we need to create more addresses once time like 60 received addresses and 30 change addresses.

are you setting the starting block number new derived address to 0? you may set it to the block number of last change or receiving address transaction occurred.

yanguoyu · 2023-02-23T03:28:39Z

The remaining time can be estimated by the rule suggested by @quake but it would be a bit confusing because scripts are synced group by group, if the next group of scripts should be derived is uncertain until the current one has fully synced. If the next group of scripts will be derived, the time of sync will be longer. So the estimated time would be like almost done => need more time => almost done => need more time, any idea about this? @Danie0918

The estimate may be not exact when creating new group addresses, and it's the same as the synced block number,
the synced block number is possibly changing from big to small when calling set_script with a new group.

Keith-CY · 2023-02-23T03:31:09Z

I didn't get the point. Do you mean, synchronization of groupA will stop if groupB is derived by groupA, until groupB syncs to the same block where groupA reached?

I guess yes, I think it will sync the group that its block_number is smaller, utils all the groups have the same block_number, then they will sync at the same time.

As we designed at #52 (comment)

Working with the light client, Neuron could sync each key separately because each key has its own cursor/progress. That means it's safe to postpone key derivation until the derived keys are all processed totally. With this characteristic, Neuron could divide synchronization into groups of keys instead of each block, which is efficient.

The derivation would only occur on the last group of addresses is fully synced, which means it reaches the tip block number. So it's fine to stop synchronization of the last group temporarily

yanguoyu · 2023-02-23T03:45:25Z

The derivation would only occur on the last group of addresses is fully synced, which means it reaches the tip block number. So it's fine to stop synchronization of the last group temporarily

Group A synced to header block number -> derived Group B -> Group B synced to header block number
Do you mean like this? If so, it may be synced slowly. Because it means A from 0 to max and B from 0 to max,
but I think A from 0 to block_number_A, B from 0 to block_number_A, and A+B from block_number_A to header tip is faster.

yanguoyu · 2023-02-23T03:46:47Z

are you setting the starting block number new derived address to 0? you may set it to the block number of last change or receiving address transaction occurred.

Yes, I set the new derived address's start block number to 0.
Is there a possible deriving address that has a transaction occurring before the block number of last change or receiving address transaction occurred? @Keith-CY

I think it's a good idea If users use the wallet by Neuron, there will not exist transactions with the derived addresses before they are created.

Keith-CY · 2023-02-23T03:55:48Z

The derivation would only occur on the last group of addresses is fully synced, which means it reaches the tip block number. So it's fine to stop synchronization of the last group temporarily

Group A synced to header block number -> derived Group B -> Group B synced to header block number Do you mean like this? If so, it may be synced slowly. Because it means A from 0 to max and B from 0 to max, but I think A from 0 to block_number_A, B from 0 to block_number_A, and A+B from block_number_A to header tip is faster.

The previous workflow would be simple, and a bit performant because the check if the next group of scripts should be derived is executed once instead of every time a transaction is detected.

Keith-CY · 2023-02-23T03:58:39Z

are you setting the starting block number new derived address to 0? you may set it to the block number of last change or receiving address transaction occurred.

Yes, I set the new derived address's start block number to 0. Is there a possible deriving address that has a transaction occurring before the block number of last change or receiving address transaction occurred? @Keith-CY

I think it's a good idea If users use the wallet by Neuron, there will not exist transactions with the derived addresses before they are created.

But users may not only use Neuron with the same seed. It's possible that an address is used when it's not derived in Neuron

Keith-CY · 2023-02-23T04:03:23Z

The progress/estimated time could be improved later, as mentioned at #52 (comment)

What's more, each script could have its own progress bar and refresh itself independently. If the user declares address A has an invisible asset, he/she could resync the group only.

Each address could have its progress bar, and the incremental progress could be computed from them.

For example, there are 3 groups fully synced, but a new group is generated, the progress turns from 100% to 75%. The more addresses used, the less fallback it will be.

quake · 2023-02-23T04:45:20Z

But users may not only use Neuron with the same seed. It's possible that an address is used when it's not derived in Neuron

This use case exists only in theory, do you know of any actual cases? Let's keep it simple and ignore the use case of sharing the same seed but use different derivation strategies with neuron.

yanguoyu · 2023-02-23T04:47:16Z

For example, there are 3 groups fully synced, but a new group is generated, the progress turns from 100% to 75%. The more addresses used, the less fallback it will be.

Show every progress for every address is good, and If so we should hide the block number of the total at the left-bottom of Neuron. Because it's difficult to calculate.
But after all groups have synced to the header tip, we also need to check whether we should create a new group when a transaction has synced. And I think checking whether we need to derive new addresses is not a performance bottleneck.
The performance bottleneck is node sync speed.

On the other hand, should we derive more addresses once to quicken the sync speed?

Keith-CY · 2023-02-23T04:50:26Z

But users may not only use Neuron with the same seed. It's possible that an address is used when it's not derived in Neuron

This use case exists only in theory, do you know of any actual cases? Let's keep it simple and ignore the use case of sharing the same seed but use different derivation strategies with neuron.

Feasible, a refresh button could be added(in future) next to each address to update transactions of a specific address as mentioned at #52

What's more, each script could have its own progress bar and refresh itself independently. If the user declares address A has an invisible asset, he/she could resync the group only.

So it would be easy to fix data missing in Neuron

cc @yanguoyu

Keith-CY · 2023-02-23T04:54:59Z

For example, there are 3 groups fully synced, but a new group is generated, the progress turns from 100% to 75%. The more addresses used, the less fallback it will be.

Show every progress for every address is good, and If so we should hide the block number of the total at the left-bottom of Neuron. Because it's difficult to calculate. But after all groups have synced to the header tip, we also need to check whether we should create a new group when a transaction has synced. And I think checking whether we need to derive new addresses is not a performance bottleneck. The performance bottleneck is node sync speed.

Got it

On the other hand, should we derive more addresses once to quicken the sync speed?

I would prefer to keep the count of addresses to generate, the first goal is to enable light client in Neuron, then the user experience.

yanguoyu · 2023-02-23T13:15:40Z

nervosnetwork/neuron#2590

Danie0918 · 2023-04-10T02:11:20Z

nervosnetwork/neuron#2615

yanguoyu · 2023-05-15T02:12:49Z

nervosnetwork/neuron#2659

Keith-CY assigned yanguoyu Jan 3, 2023

Keith-CY added the enhancement New feature or request label Jan 3, 2023

Keith-CY added this to Nervos Wallet/Explorer Jan 3, 2023

Keith-CY added this to the 2023/01/11 - 2023/01/18 milestone Jan 3, 2023

Keith-CY moved this to Todo in Nervos Wallet/Explorer Jan 3, 2023

Keith-CY mentioned this issue Jan 3, 2023

Preview functions and design of light client #52

Closed

Keith-CY self-assigned this Jan 4, 2023

Keith-CY removed this from the 2023/01/11 - 2023/01/18 milestone Jan 9, 2023

yanguoyu mentioned this issue Jan 16, 2023

The synchronization is very slow nervosnetwork/neuron#2448

Closed

Keith-CY removed their assignment Jan 18, 2023

yanguoyu moved this from Todo to In Progress in Nervos Wallet/Explorer Jan 20, 2023

Danie0918 added this to Neuron Feb 26, 2023

Danie0918 moved this to 👀 In Review in Neuron Feb 26, 2023

yanguoyu moved this from In Progress to QA in Nervos Wallet/Explorer Feb 27, 2023

Danie0918 assigned Cedar67 Mar 6, 2023

Danie0918 closed this as completed Jun 5, 2023

github-project-automation bot moved this from 👀 Testing to ✅ Done in Neuron Jun 5, 2023

Cedar67 removed their assignment Jun 5, 2023

Keith-CY mentioned this issue Nov 17, 2023

Should we add network type field when the user add network #324

Closed

Keith-CY mentioned this issue Apr 1, 2024

Could NOT specify network information manually. nervosnetwork/neuron#3098

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update synchronization logic for ckb-indexer/light client #90

Update synchronization logic for ckb-indexer/light client #90

Keith-CY commented Jan 3, 2023

Keith-CY commented Jan 4, 2023

Keith-CY commented Jan 7, 2023

yanguoyu commented Jan 8, 2023

Keith-CY commented Jan 8, 2023

yanguoyu commented Jan 8, 2023

Keith-CY commented Jan 8, 2023

Keith-CY commented Jan 10, 2023 •

edited

Loading

yanguoyu commented Jan 12, 2023

Keith-CY commented Jan 12, 2023

Keith-CY commented Jan 12, 2023

Keith-CY commented Jan 13, 2023

Keith-CY commented Jan 20, 2023

yanguoyu commented Feb 23, 2023

quake commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

quake commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

yanguoyu commented Feb 23, 2023 •

edited

Loading

Keith-CY commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

Keith-CY commented Feb 23, 2023 •

edited

Loading

quake commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Keith-CY commented Feb 23, 2023 •

edited

Loading

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Danie0918 commented Apr 10, 2023

yanguoyu commented May 15, 2023

Update synchronization logic for ckb-indexer/light client #90

Update synchronization logic for ckb-indexer/light client #90

Comments

Keith-CY commented Jan 3, 2023

Keith-CY commented Jan 4, 2023

Keith-CY commented Jan 7, 2023

yanguoyu commented Jan 8, 2023

Keith-CY commented Jan 8, 2023

yanguoyu commented Jan 8, 2023

Keith-CY commented Jan 8, 2023

Keith-CY commented Jan 10, 2023 • edited Loading

yanguoyu commented Jan 12, 2023

Keith-CY commented Jan 12, 2023

Keith-CY commented Jan 12, 2023

Keith-CY commented Jan 13, 2023

Keith-CY commented Jan 20, 2023

yanguoyu commented Feb 23, 2023

quake commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

quake commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

yanguoyu commented Feb 23, 2023 • edited Loading

Keith-CY commented Feb 23, 2023

Keith-CY commented Feb 23, 2023

Keith-CY commented Feb 23, 2023 • edited Loading

quake commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Keith-CY commented Feb 23, 2023 • edited Loading

Keith-CY commented Feb 23, 2023

yanguoyu commented Feb 23, 2023

Danie0918 commented Apr 10, 2023

yanguoyu commented May 15, 2023

Keith-CY commented Jan 10, 2023 •

edited

Loading

yanguoyu commented Feb 23, 2023 •

edited

Loading

Keith-CY commented Feb 23, 2023 •

edited

Loading

Keith-CY commented Feb 23, 2023 •

edited

Loading