Add an aggregate metric for the theoretical write capacity #28
The new metric emits a sample of the number of logs per second boltdb could store if every log write operation looked like the one currently being measured, that is, if each batch contained the same number of logs and each txn Commit took the same amount of time. While no two operations will be identical, averaging the emitted sample/summary should give a good picture of what Consul could handle given the current shape of its write operations.
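As a minimal sketch of that computation (not the exact diff in this PR; the metric key and helper name are assumptions, using a go-metrics style sink):

```go
package boltdbmetrics

import (
	"time"

	metrics "github.com/armon/go-metrics"
)

// emitWriteCapacity records the theoretical logs-per-second rate implied by
// one batched write: the rate boltdb could sustain if every batch held the
// same number of logs and every txn Commit took the same amount of time.
func emitWriteCapacity(logsInBatch int, commitElapsed time.Duration) {
	if commitElapsed <= 0 {
		return // guard against division by zero on unmeasured commits
	}
	logsPerSecond := float64(logsInBatch) / commitElapsed.Seconds()
	metrics.AddSample([]string{"raft", "boltdb", "writeCapacity"}, float32(logsPerSecond))
}
```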
It is expected that this value will fluctuate with changes in the size of the logs flowing through Consul and in how many logs get batched into one storage operation.
If someone wanted to monitor this, I think they would want to know when the actual write rate exceeds 75% of this metric's value. That could be due to an increased number of writes, or to a degradation in disk performance that causes similar writes to slow down. Regardless of the cause, getting close to the limit, or seeing a drastic change in the metric, could be indicative of another issue that requires investigation.
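A monitoring check along these lines (purely illustrative; both inputs would come from your metrics pipeline) would fire on either cause, since both show up as the ratio climbing:

```go
// nearWriteCapacity is a hypothetical alert condition: true when the
// observed write rate exceeds 75% of the averaged writeCapacity samples.
func nearWriteCapacity(actualWritesPerSec, avgWriteCapacity float64) bool {
	return actualWritesPerSec > 0.75*avgWriteCapacity
}
```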