-
Notifications
You must be signed in to change notification settings - Fork 352
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Event monitor crashes when the RPC node goes away #863
Comments
Hello, I'm bug informant. To reproduce this crashe, see
Then the crashe will be reproduced. |
Thank you for the feedback, @nodebreaker0-0. I would like to know what would be the required behavior in your opinion, if running
Any thoughts? |
I would say 2. |
We could perhaps introduce a thread to monitor the state of all configured chains and trigger a reload/restart when a crashed node comes back online. |
As an initial fix, maybe we do 2 without resuming relaying for the affected chain? |
I confirm that 2. is much preferred. |
In my opinion, number two is appropriate. It is necessary to separate the rpc connection for each chain independently. For example Fundamental Problem Solving
If you think this is very complicated development. Temporary Measure Multiple rpc nodes in config.toml are required. Then, if node 1 is unable to communicate rpc, you can try to connect to node 2 (or up to number 3 and 4). In conclusion, however, the fundamental problem must be solved. |
What are the minimum requirements for recovery when the full node comes back? Is manual restart ok, similar to when a new chain is added? Or need something like @romac proposes here #863 (comment) |
Crate
relayer
Summary of Bug
This was discovered by the DEX team during load testing and happens on hermes multi path startup.
Trace is:
Version
Steps to Reproduce
Acceptance Criteria
For Admin Use
The text was updated successfully, but these errors were encountered: