-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Restart delay not working when agent process exits unexpectedly #27891
Comments
Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
…pectedly (#32150) **Description:** Reset should called only on stopped or expired timers with drained channels. If the timer already expired (and the channel was not cleared) it reads from the timer's channel to clear it. **Link to tracking Issue:** Fixes #27891 **Testing:** <Describe what testing was performed and which tests were added.> **Documentation:** <Describe the documentation added.> --------- Co-authored-by: Evan Bradley <[email protected]>
Component(s)
cmd/opampsupervisor
What happened?
Description
When the supervisor fails to start the collector, it will log that it will restart the collector in a bit, but it doesn't actually wait 5s to restart and instead logs the message over and over.
Steps to Reproduce
Send a bad config from an OpAMP server to the supervisor. The collector will fail to start and exit. The supervisor will be caught in a tight loop logging that it will "restart in a bit..."
Expected Result
The supervisor will wait 5s and then attempt to restart the collector.
Actual Result
The supervisor logs this message over and over.
Collector version
05ec3a2
Environment information
Environment
OS: m1 Mac
Compiler(if manually compiled): go1.21.3 darwin/arm64
OpenTelemetry Collector configuration
No response
Log output
No response
Additional context
Based on a quick review of the code, when the Commander watch() method closes the
doneCh
, the Supervisor gets stuck incase <-s.commander.Done()
as the closed channel will continue to produce signals.Tested using the OpAMP Agent Extension PR
#16594
The text was updated successfully, but these errors were encountered: