Statsite telemetry leaves socket lingering to CLOSE_WAIT after remote server restarted #3697
Labels
theme/telemetry
Anything related to telemetry or observability
type/bug
Feature does not function as expected
Milestone
consul version
for both Client and ServerClient: N/A
Server: Consul v0.9.3
Operating system and Environment details
Running as Docker container based on Debian 8.9 image
Consul container is running in AWS on top of ECS (can't reproduce in local environment)
Consul connects to telemetry server via an ELB
Description of the Issue (and unexpected/desired result)
When the telemetry server is restarted / killed / somehow temporarily unavailable Consul will not reconnect until the operating system takes care of the TCP socket left in CLOSE_WAIT state. By default this is in Linux I guess something over 2 hours:
Couldn't get any logs from this specific module (https://github.com/hashicorp/consul/blob/v0.9.3/vendor/github.com/armon/go-metrics/statsite.go), even though TRACE log levels enabled.
Reproduction steps
Log Fragments or Link to gist
Couldn't get any logs related to the statsite, even with TRACE level.
The text was updated successfully, but these errors were encountered: