Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add cleanup of dead connections no longer reported by a worker #1220

Merged
merged 22 commits into from
May 14, 2021
Merged

Conversation

jefferai
Copy link
Member

When a worker no longer reports a connection (e.g. after a restart), the controller will now mark those connections as closed so that sessions can be terminated instead of sitting in canceling state due to stuck connections.

Related to #894
Related to #1055

Copy link
Member

@mgaffney mgaffney left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks really good. Just a few comments and questions.

Co-authored-by: Michael Gaffney <[email protected]>
louisruch
louisruch previously approved these changes May 13, 2021
Copy link
Collaborator

@louisruch louisruch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two minor nits otherwise this looks great

@jefferai jefferai dismissed mgaffney’s stale review May 13, 2021 21:45

Addressed issues

@jefferai jefferai merged commit 6f34da8 into main May 14, 2021
@jefferai jefferai deleted the ICU-1260 branch May 14, 2021 17:23
hugoghx pushed a commit that referenced this pull request Dec 6, 2024
* internal/daemon/controller+worker: propagate downstream timeout

Propagate the downstream worker timeout into the
downstream receiver and downstream ticker
so we have a configurable consistent timeout
value to use for worker to worker interactions.

Also change the atomic.Pointer to an atomic.Int64
for consistency with other values.

* internal: propagate downstream timeout

The downstream worker timeout is now used in all
time bound operations between workers.

* address lint and test failures

---------

Co-authored-by: irenarindos <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants