Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PLAT-906] counters for sainted read and write (#487) #489

Merged
merged 1 commit into from
May 7, 2019
Merged

Conversation

winmillwill
Copy link
Member

@winmillwill winmillwill commented May 7, 2019

  • PLAT-870 Problem: stats for types of requests not distinct

Solution: Account for their number and latency distinctly.

With the Prometheus Statsd exporter in place, we have real histograms that are
fed by stats_timer, which means we can ditch related calls to stats_counter
and we can just reconfigure the bucket widths in the exporter config rather than
having a const that says how long is too long.

We instrument in print_errors because it has the error and latency information
as well as the function name. To that end, I've changed the function name passed
to process_status to more closely align with what someone looking at logs or
timeseries might understand because

  1. that's easier than more extensive refactoring
  2. the other people who care are the ones reading the source and they can just
    as easily search for calls to print_errors
  • move the timers to process_status to observe successes also

  • counters for sainted read/write

* [PLAT-870] Problem: stats for types of requests not distinct

Solution: Account for their number and latency distinctly.

With the Prometheus Statsd exporter in place, we have real histograms that are
fed by `stats_timer`, which means we can ditch related calls to `stats_counter`
and we can just reconfigure the bucket widths in the exporter config rather than
having a const that says how long is too long.

We instrument in `print_errors` because it has the error and latency information
as well as the function name. To that end, I've changed the function name passed
to `process_status` to more closely align with what someone looking at logs or
timeseries might understand because

1) that's easier than more extensive refactoring
2) the other people who care are the ones reading the source and they can just
as easily search for calls to `print_errors`

* move the timers to process_status to observe successes also

* counters for sainted read/write
@winmillwill winmillwill merged commit 48961c0 into stage May 7, 2019
winmillwill added a commit that referenced this pull request May 8, 2019
* [PLAT-870] Problem: stats for types of requests not distinct

Solution: Account for their number and latency distinctly.

With the Prometheus Statsd exporter in place, we have real histograms that are
fed by `stats_timer`, which means we can ditch related calls to `stats_counter`
and we can just reconfigure the bucket widths in the exporter config rather than
having a const that says how long is too long.

We instrument in `print_errors` because it has the error and latency information
as well as the function name. To that end, I've changed the function name passed
to `process_status` to more closely align with what someone looking at logs or
timeseries might understand because

1) that's easier than more extensive refactoring
2) the other people who care are the ones reading the source and they can just
as easily search for calls to `print_errors`

* move the timers to process_status to observe successes also

* counters for sainted read/write
@jerryblakley jerryblakley deleted the yolo branch May 22, 2019 22:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants