Validator fallback does not work when the execution engine is offline or unavailable #3641

Spacesider · 2022-10-14T00:38:27Z

Description

My validator has multiple consensus node endpoints configured. The first being a Lighthouse-Nethermind pair, and the second being a Teku-Besu pair. The Lighthouse validator points to Lighthouse node as the primary, and the Teku node as the fallback.

While performing an upgrade on my Nethermind node, I ran into issues and Nethermind was offline for roughly an hour. After it was back up, I discovered that I had missed attestations the entire time, this was despite having a Teku-Besu fallback both configured and running.

It appears that when the execution engine that is paired with Lighthouse goes offline (Or is otherwise unavailable), the lighthouse validator still attempts to use the lighthouse node despite it not being functional. (Side note, I have only tested this with Nethermind as the execution engine, however I would be happy to perform tests with all execution engines, but I will need some time to do this).

Pre-merge this wouldn't be a problem because you didn't need the execution engine to attest, but post-merge this has changed.

Version

Running https://github.com/sigp/lighthouse/releases/tag/v3.1.2 > lighthouse-v3.1.2-x86_64-unknown-linux-gnu.tar.gz

Present Behaviour

At current, the Lighthouse node presents itself as being available for a validator node when the execution node is offline. This prevents the validator from switching over to the other configured endpoint.

Expected Behaviour

The Lighthouse node should report itself as being something along the lines of "offline" or "not in sync" or "unavailable" when the paired execution engine is offline or unavailable. Because while it is in this state, it is unable to process attestations or block proposals. So for validating purposes, it is offline.

Steps to resolve

N/A

michaelsproul · 2022-10-14T00:55:08Z

Sorry you ran into this, it's a known problem with our fallback mechanism post-merge. We are tracking these issues via this tracking issue: #3613.

michaelsproul · 2023-05-16T06:38:18Z

Closing as dupe of #3613. The scenario described will be fixed by #4291

michaelsproul closed this as completed May 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validator fallback does not work when the execution engine is offline or unavailable #3641

Validator fallback does not work when the execution engine is offline or unavailable #3641

Spacesider commented Oct 14, 2022

michaelsproul commented Oct 14, 2022

michaelsproul commented May 16, 2023

Validator fallback does not work when the execution engine is offline or unavailable #3641

Validator fallback does not work when the execution engine is offline or unavailable #3641

Comments

Spacesider commented Oct 14, 2022

Description

Version

Present Behaviour

Expected Behaviour

Steps to resolve

michaelsproul commented Oct 14, 2022

michaelsproul commented May 16, 2023