Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

V2: Agent starting multiple instances of filebeat or metricbeat and goes into tight restart loop for the beat process #997

Closed
aleksmaus opened this issue Aug 24, 2022 · 2 comments
Assignees
Labels
bug Something isn't working estimation:Week Task that represents a week of work. Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team v8.6.0

Comments

@aleksmaus
Copy link
Member

Agent starting multiple instances of filebeat or metricbeat and goes into tight restart loop for the beat process.
More than 50% of agent runs fail while starting multiple beats processes due to the conflict on data path, which results in the tight restart loop for the beat.

Exiting: data path already locked by another beat. Please make sure that multiple beats are not sharing the same data path (path.data).
Exiting: data path already locked by another beat. Please make sure that multiple beats are not sharing the same data path (path.data).
Exiting: data path already locked by another beat. Please make sure that multiple beats are not sharing the same data path (path.data).
Exiting: data path already locked by another beat. Please make sure that multiple beats are not sharing the same data path (path.data).

This is documented here as well:
elastic/beats#32673 (comment)

Running off of the latest V2 branch:
https://github.com/elastic/elastic-agent/tree/feature-arch-v2

Steps to reproduce:

  1. Enroll the agent from the command line
  2. Run the agent from the command line
@aleksmaus aleksmaus added bug Something isn't working Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team v8.5.0 labels Aug 24, 2022
@blakerouse blakerouse mentioned this issue Aug 31, 2022
3 tasks
@michalpristas michalpristas added the estimation:Week Task that represents a week of work. label Sep 26, 2022
@cmacknz
Copy link
Member

cmacknz commented Sep 29, 2022

Let's wait for the fix to elastic/beats#31670 before addressing this.

That will remove the data path locking failure, but I think that is just a symptom here where the real problem is likely that the V2 agent is rapidly starting and stopping the Filebeat and Metricbeat.

@cmacknz cmacknz assigned blakerouse and unassigned michalpristas Sep 29, 2022
@cmacknz
Copy link
Member

cmacknz commented Sep 29, 2022

@blakerouse reports we've fixed this particular bug already, closing. If we struggle to reproduce the locking issue separately (which I don't think will be a problem) we can try reverting the fix for this on the agent V2 branch.

@cmacknz cmacknz closed this as completed Sep 29, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working estimation:Week Task that represents a week of work. Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team v8.6.0
Projects
None yet
Development

No branches or pull requests

5 participants