Handle lock release with SIGHUP in VTGR #8472

5antelope · 2021-07-14T17:05:00Z

Description

VTGR relies on the lock from topo server. Most of the topo lock uses health-check of session to determine if a process is still holding the lock. This is not ideal for lock release process during a restart with SIGHUP. Take consul as an example, the session check by default uses serfHealth, which is a health check on the node level - even if the process restart because the node is healthy, consul will think the lock is still being held until the TTL. As a result, during a deploy, there could be TTL period of time that VTGR cannot grab the lock and fix the cluster.

This PR handle the SIGHUP signal by explicitly release the lock that was held to avoid the situation above.

Related Issue(s)

#8386

Checklist

Tests were added or are not required
Documentation was added or is not required

Deployment Notes

Signed-off-by: crowu <[email protected]>

deepthi

Code looks fine, but it seems to be handling SIGHUP, not SIGTERM.
Which one should it be? or both?

5antelope · 2021-07-16T00:59:51Z

Code looks fine, but it seems to be handling SIGHUP, not SIGTERM.
Which one should it be? or both?

It depends on what signal ppl use for restart, I think we can start with SIGHUP and call it out in the document. Later add supports for SIGTERM if required. wdyt?

deepthi · 2021-07-19T15:57:32Z

Code looks fine, but it seems to be handling SIGHUP, not SIGTERM.
Which one should it be? or both?

It depends on what signal ppl use for restart, I think we can start with SIGHUP and call it out in the document. Later add supports for SIGTERM if required. wdyt?

For safety it is better to handle SIGTERM as well. I don't have a preference on when this should be added, I will leave that up to you.
LMK if that is planned to be a separate PR so that I can approve and merge this one.

5antelope · 2021-07-19T17:13:26Z

Thanks @deepthi, I will try to address SIGTERM separately (hopefully soon :-))

5antelope · 2021-07-21T01:50:12Z

Gentle nudge

Handle SIGTERM in VTGR

d1d5832

Signed-off-by: crowu <[email protected]>

5antelope assigned deepthi Jul 14, 2021

5antelope added Type: Enhancement Logical improvement (somewhere between a bug and feature) Component: Cluster management labels Jul 14, 2021

Fix race in test

b03b0e8

Signed-off-by: crowu <[email protected]>

deepthi reviewed Jul 14, 2021

View reviewed changes

5antelope changed the title ~~Handle lock release with SIGTERM in VTGR~~ Handle lock release with SIGHUP in VTGR Jul 16, 2021

deepthi approved these changes Jul 23, 2021

View reviewed changes

deepthi merged commit 584f829 into vitessio:main Jul 23, 2021

frouioui added the release notes label Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle lock release with SIGHUP in VTGR #8472

Handle lock release with SIGHUP in VTGR #8472

5antelope commented Jul 14, 2021 •

edited by deepthi

Loading

deepthi left a comment

5antelope commented Jul 16, 2021

deepthi commented Jul 19, 2021

5antelope commented Jul 19, 2021

5antelope commented Jul 21, 2021

Handle lock release with SIGHUP in VTGR #8472

Handle lock release with SIGHUP in VTGR #8472

Conversation

5antelope commented Jul 14, 2021 • edited by deepthi Loading

Description

Related Issue(s)

Checklist

Deployment Notes

deepthi left a comment

Choose a reason for hiding this comment

5antelope commented Jul 16, 2021

deepthi commented Jul 19, 2021

5antelope commented Jul 19, 2021

5antelope commented Jul 21, 2021

5antelope commented Jul 14, 2021 •

edited by deepthi

Loading