NPD v0.7.0 release
Changes: v0.6.4...v0.7.0
NPD Metrics Support
In v0.7.0, @xueweiz added metrics support to NPD and carried out a major refactoring. The detailed design can be found in the proposal.
- System stats monitor. A system stats monitor is added into NPD as a new problem daemon. It collects useful node problem related system stats with OpenCensus such as `disk/io_time`, `disk/weighted_io` and `disk/avg_queue_len`.
- Problems as OpenCensus metrics. Besides node condition and events, problems detected by existing problem daemons are also collected into OpenCensus as metrics: `problem_counter{reason="PROBLEM_REASON"} xxx` for events and `problem_gauge{reason="PROBLEM_REASON",type="PROBLEM_TYPE"} 1 or 0` for conditions.
- Prometheus endpoint. A Prometheus exporter is added to export all OpenCensus metrics collected by NPD through Prometheus.
- Plugin system for problem daemons. A plugin system for problem daemons is added. Problem daemons can be disabled at compile time with build tags, such as `disable_system_stats_monitor`, `disable_system_log_monitor` and `disable_custom_plugin_monitor`. New problem daemon plugins can be added in the future.
- Plugin system for problem exporters. A problem exporter interface is added. The original Kubernetes problem reporting logic was moved into `k8sexporter`. Prometheus support is implemented as `prometheusexporter`. New problem exporters can be added in the future.
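
The two metric shapes and the exporter plugin idea described above can be sketched roughly in Go. This is an illustration only: `Status`, `Condition`, `Exporter` and `textExporter` are hypothetical names rather than NPD's actual types, the reasons and condition type in `main` are sample values, and the sketch formats text lines directly instead of going through OpenCensus or the real Prometheus exporter.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// Condition is a hypothetical, simplified node condition.
type Condition struct {
	Type   string
	Reason string
	Active bool
}

// Status is a hypothetical, simplified report from one problem daemon.
type Status struct {
	EventCounts map[string]int // event reason -> number of occurrences
	Conditions  []Condition
}

// Exporter is a hypothetical stand-in for a problem exporter plugin.
type Exporter interface {
	ExportProblems(s Status)
}

// counterLine renders an event count in the problem_counter shape.
func counterLine(reason string, count int) string {
	return fmt.Sprintf("problem_counter{reason=%q} %d", reason, count)
}

// gaugeLine renders a condition in the problem_gauge shape (1 = active, 0 = not).
func gaugeLine(reason, condType string, active bool) string {
	value := 0
	if active {
		value = 1
	}
	return fmt.Sprintf("problem_gauge{reason=%q,type=%q} %d", reason, condType, value)
}

// textExporter is an assumed example exporter that only collects text lines.
type textExporter struct {
	lines []string
}

// ExportProblems appends one line per event reason and one per condition.
func (e *textExporter) ExportProblems(s Status) {
	// Sort reasons so the output order is deterministic.
	reasons := make([]string, 0, len(s.EventCounts))
	for r := range s.EventCounts {
		reasons = append(reasons, r)
	}
	sort.Strings(reasons)
	for _, r := range reasons {
		e.lines = append(e.lines, counterLine(r, s.EventCounts[r]))
	}
	for _, c := range s.Conditions {
		e.lines = append(e.lines, gaugeLine(c.Reason, c.Type, c.Active))
	}
}

func main() {
	status := Status{
		EventCounts: map[string]int{"OOMKilling": 3},
		Conditions: []Condition{
			{Type: "KernelDeadlock", Reason: "DockerHung", Active: true},
		},
	}

	text := &textExporter{}
	// Every registered exporter receives the same status.
	exporters := []Exporter{text}
	for _, e := range exporters {
		e.ExportProblems(status)
	}
	fmt.Println(strings.Join(text.lines, "\n"))
}
```

Keeping exporters behind a single small interface is what would let a Kubernetes reporter and a Prometheus reporter sit side by side and receive the same problem reports.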
Docker Image: k8s.gcr.io/node-problem-detector:v0.7.0
SHA: 1eea20dff5adb82b26b68781dff7c594b0570fd6
MD5: eb44947c63f4869202ce89474571cded