Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Buildkite] Agent Manager #95711

Closed
4 of 15 tasks
brianseeders opened this issue Mar 29, 2021 · 1 comment
Closed
4 of 15 tasks

[Buildkite] Agent Manager #95711

brianseeders opened this issue Mar 29, 2021 · 1 comment
Assignees
Labels
Feature:Buildkite Feature:CI Continuous integration Team:Operations Team label for Operations Team

Comments

@brianseeders
Copy link
Contributor

brianseeders commented Mar 29, 2021

Agent Manager

  • Change secs configs to mins
  • Documentation
  • Test coverage
  • Balance agents cross numerous GCP zones
  • Hard and soft time limits for GCP instances
  • APM instrumentation
  • Delete orphaned GCP instances (instance online, not connected to an agent)
  • Scaling idle timeouts
    • e.g. the first agent for a configuration might have an idle timeout of 1 hour, but the 200th might be 5 minutes
  • Automatically replace agents if disk usage is too high?
  • Liveness check that fails if a run hasn't been started in the last 5-10min
  • Add timeouts where appropriate, especially data gathering phase (partially done)
  • Move to elastic org
  • Nested Virtualization license support
  • Scheduled config modifications?
  • Cache-bust pulling JSON config from GitHub, use API instead, or figure out a different solution

See https://github.com/brianseeders/buildkite-agent-manager

@brianseeders brianseeders added Feature:Buildkite Feature:CI Continuous integration Team:Operations Team label for Operations Team labels Mar 29, 2021
@brianseeders brianseeders self-assigned this Mar 29, 2021
@elasticmachine
Copy link
Contributor

Pinging @elastic/kibana-operations (Team:Operations)

@tylersmalley tylersmalley added 1 and removed 1 labels Oct 11, 2021
@exalate-issue-sync exalate-issue-sync bot added impact:low Addressing this issue will have a low level of impact on the quality/strength of our product. loe:small Small Level of Effort labels Oct 12, 2021
@brianseeders brianseeders added EnableJiraSync and removed loe:small Small Level of Effort impact:low Addressing this issue will have a low level of impact on the quality/strength of our product. EnableJiraSync labels Oct 25, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature:Buildkite Feature:CI Continuous integration Team:Operations Team label for Operations Team
Projects
None yet
Development

No branches or pull requests

3 participants