Agent remains Unhealthy even on updating invalid integration configuration to valid input. #2954
Comments
Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)
@manishgupta-qasource Please review.
Secondary review for this ticket is done.
This is the failing input in state.yaml:

- id: system/metrics-default
  state:
    state: 2
    message: 'Healthy: communicating with pid ''5768'''
    units:
      ? unittype: 0
        unitid: system/metrics-default-system/metrics-system-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      : state: 4
        message: '[failed to reload inputs: 1 error: Error creating runner from config:
          1 error: error validating config: invalid core.metrics value ''xxxxxxx''
          (valid options are percentages and ticks)]'
      ? unittype: 1
        unitid: system/metrics-default
      : state: 4
        message: '[failed to reload inputs: 1 error: Error creating runner from config:
          1 error: error validating config: invalid core.metrics value ''xxxxxxx''
          (valid options are percentages and ticks)]'

The configuration for the input:

- data_stream:
    namespace: windows
  id: system/metrics-system-6d2e7b5d-b166-466c-8fb5-db5c3a512387
  meta:
    package:
      name: system
      version: 1.34.0
  name: system-2
  package_policy_id: 6d2e7b5d-b166-466c-8fb5-db5c3a512387
  revision: 3
  streams:
    - core.metrics:
        - percentages
      data_stream:
        dataset: system.core
        type: metrics
      id: system/metrics-system.core-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - core
    - cpu.metrics:
        - percentages
        - normalized_percentages
      data_stream:
        dataset: system.cpu
        type: metrics
      id: system/metrics-system.cpu-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - cpu
      period: 10s
    - data_stream:
        dataset: system.diskio
        type: metrics
      diskio.include_devices: null
      id: system/metrics-system.diskio-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - diskio
      period: 10s
    - data_stream:
        dataset: system.filesystem
        type: metrics
      id: system/metrics-system.filesystem-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - filesystem
      period: 1m
      processors:
        - drop_event:
            when:
              regexp:
                system.filesystem.mount_point: ^/(sys|cgroup|proc|dev|etc|host|lib|snap)($|/)
    - data_stream:
        dataset: system.fsstat
        type: metrics
      id: system/metrics-system.fsstat-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - fsstat
      period: 1m
      processors:
        - drop_event:
            when:
              regexp:
                system.fsstat.mount_point: ^/(sys|cgroup|proc|dev|etc|host|lib|snap)($|/)
    - condition: ${host.platform} != 'windows'
      data_stream:
        dataset: system.load
        type: metrics
      id: system/metrics-system.load-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - load
      period: 10s
    - data_stream:
        dataset: system.memory
        type: metrics
      id: system/metrics-system.memory-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - memory
      period: 10s
    - data_stream:
        dataset: system.network
        type: metrics
      id: system/metrics-system.network-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - network
      network.interfaces: null
      period: 10s
    - data_stream:
        dataset: system.process
        type: metrics
      id: system/metrics-system.process-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - process
      period: 10s
      process.cgroups.enabled: false
      process.cmdline.cache.enabled: true
      process.include_cpu_ticks: false
      process.include_top_n.by_cpu: 5
      process.include_top_n.by_memory: 5
      processes:
        - .*
    - data_stream:
        dataset: system.process.summary
        type: metrics
      id: system/metrics-system.process.summary-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - process_summary
      period: 10s
    - data_stream:
        dataset: system.socket_summary
        type: metrics
      id: system/metrics-system.socket_summary-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - socket_summary
      period: 10s
    - data_stream:
        dataset: system.uptime
        type: metrics
      id: system/metrics-system.uptime-6d2e7b5d-b166-466c-8fb5-db5c3a512387
      metricsets:
        - uptime
      period: 10s
  type: system/metrics
  use_output: default

This is definitely a bug, one we haven't seen before.
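
For triage, the mismatch in the state.yaml excerpt above (component reported as healthy while both units are failed) can be surfaced with a small script. This is only a sketch: the `failing_units` helper is hypothetical, the data is transcribed by hand from the excerpt into plain dictionaries rather than parsed from the file, and the meaning of the numeric codes (2 as healthy, 4 as failed) is an assumption inferred from the messages shown next to them.

```python
# Assumed state codes, inferred from the excerpt: state 2 carries the
# "Healthy" message and state 4 carries the failure message.
HEALTHY = 2


def failing_units(component):
    """Return (unittype, unitid, message) for every unit that is not healthy."""
    return [
        (unit["unittype"], unit["unitid"], unit["message"])
        for unit in component["units"]
        if unit["state"] != HEALTHY
    ]


# Hand-transcribed from the state.yaml excerpt; messages are shortened.
component = {
    "id": "system/metrics-default",
    "state": 2,
    "units": [
        {
            "unittype": 0,
            "unitid": "system/metrics-default-system/metrics-system-6d2e7b5d-b166-466c-8fb5-db5c3a512387",
            "state": 4,
            "message": "invalid core.metrics value 'xxxxxxx' (valid options are percentages and ticks)",
        },
        {
            "unittype": 1,
            "unitid": "system/metrics-default",
            "state": 4,
            "message": "invalid core.metrics value 'xxxxxxx' (valid options are percentages and ticks)",
        },
    ],
}

failures = failing_units(component)
for unit_type, unit_id, message in failures:
    print(f"unittype={unit_type} unitid={unit_id}: {message}")
```

Both units still carry the old validation error even though the configuration shown above already has `core.metrics` set to `percentages`, which is the stale state this issue describes.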
@amolnater-qasource could you re-validate that issue? I cannot reproduce it on
The issue has gone away in 8.10.1.
Hi @AndersonQ
Thank you for the update. We have revalidated this issue on 8.10.2 and 8.11.0-SNAPSHOT Kibana cloud environments and found it fixed.
Observations:
Screen Recording:
ec2amaz-u7odjck.-.Agents.-.Fleet.-.Elastic.-.Google.Chrome.2023-09-29.12-04-45.mp4
8.10.2: Edit.integration.-.Windows.Agent.policy.1.-.Agent.policies.-.Fleet.-.Elastic.-.Google.Chrome.2023-09-29.12-08-59.mp4
Build details:
Hence we are closing this issue and marking it as QA:Validated.
Thanks!!
Issue
Kibana Build details:
Host OS and Browser version: All, All
Preconditions:
Steps to reproduce:
system-1
integration.xxxxxx
.percentages
Expected:
Agent should get back healthy on updating invalid integration configuration to valid input.
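
The expected behaviour can be illustrated with a toy model. This is a sketch only and not the Elastic Agent or Beats implementation: `ToyInputUnit` and its `reload` method are hypothetical names, the state codes 2 and 4 are assumed to mean healthy and failed, and the allowed values are taken from the error message quoted in the comments ("valid options are percentages and ticks").

```python
VALID_CORE_METRICS = ("percentages", "ticks")  # from the quoted error message
HEALTHY, FAILED = 2, 4  # assumed meaning of the codes seen in state.yaml


class ToyInputUnit:
    """Hypothetical unit whose health is recomputed on every reload."""

    def __init__(self):
        self.state = HEALTHY
        self.message = "Healthy"

    def reload(self, stream_config):
        # Collect any core.metrics entries that are not allowed.
        bad = [
            value
            for value in stream_config.get("core.metrics", [])
            if value not in VALID_CORE_METRICS
        ]
        if bad:
            self.state = FAILED
            self.message = (
                f"invalid core.metrics value '{bad[0]}' "
                "(valid options are percentages and ticks)"
            )
        else:
            # A valid configuration clears the earlier failure.
            self.state = HEALTHY
            self.message = "Healthy"
        return self.state


unit = ToyInputUnit()
after_invalid = unit.reload({"core.metrics": ["xxxxxxx"]})
after_valid = unit.reload({"core.metrics": ["percentages"]})
print(after_invalid, after_valid)
```

In this model the second reload returns the unit to healthy. The reported bug is that the real agent stayed in the failed state after the equivalent second step.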
Screen Recording:
Agents.-.Fleet.-.Elastic.-.Google.Chrome.2023-06-28.11-00-17.mp4
ec2amaz-tc0oajr.-.Agents.-.Fleet.-.Elastic.-.Google.Chrome.2023-06-28.11-21-38.mp4
Debug Logs:
elastic-agent-diagnostics-2023-06-28T06-08-54Z-00.zip
Definition of done