ttljob: only add labels when `ttl_label_metrics` is set #77567

otan · 2022-03-09T20:30:38Z

Release justification: high benefit change to new stuff

Release note (sql change): TTL metrics are labelled by relation name if
SET CLUSTER SETTING server.child_metrics.enabled=true; is set and
the ttl_label_metrics storage parameter is set to true. This is to
prevent a potentially unbounded cardinality on TTL related metrics.

Follow up to [this slack convo](https://cockroachlabs.slack.com/archives/C0168LW5THS/p1645556875126379). Release justification: high benefit change to new stuff Release note (sql change): TTL metrics are labelled by relation name if `SET CLUSTER SETTING server.child_metrics.enabled=true;` is set and the `ttl_label_metrics` storage parameter is set to true. This is to prevent a potentially unbounded cardinality on TTL related metrics.

cockroach-teamcity · 2022-03-09T20:30:46Z

This change is

ajwerner · 2022-03-09T20:39:28Z

this runs afoul of my general concern that letting users create label-values in a cloud setting is a big-time problem for the stability of the system as a whole. I'm willing to be overridden, but I want whatever party does that to understand the implications.

otan · 2022-03-09T21:25:19Z

ah, i may have misread your slack message a little bit and this would implement what you suggested there:

My middle-ground suggestion would be to add some syntax (via one of those storage parameters we’ve embraced) to set the metric label

but yes i do understand the concern, but at the same time these metrics seem pretty important in understanding TTL performance (cc @vy-ton in case you want to weigh in about the acceptability of the tradeoff/implicaitions)

ajwerner · 2022-03-09T21:30:33Z

I did say it, but then I tried to walk it back with:

tl;dr I’d be happy with a constant number of tier labels or something and scared of anything more dynamic and downright opposed to anything that creates labels for users without them going out of their way to do it.

I'm scared of this, but not downright opposed.

My preference would be to create a finite number of labels and let users map tables to them.

otan · 2022-03-10T21:39:59Z

My preference would be to create a finite number of labels and let users map tables to them.

i'm not a big fan of this, seems counterintuitive and not user friendly, but i understand why it's suggested :). my preference is the way presented in this PR.

anyone want to be the tiebreaker? but i bet i know what @vy-ton would say ;)

vy-ton · 2022-03-11T02:36:34Z

My preference would be to create a finite number of labels and let users map tables to them.

I actually don't quite understand this option so will schedule time to chat next week.

otan · 2022-03-11T02:46:02Z

My preference would be to create a finite number of labels and let users map tables to them.

I assume this means we have some hardcoded set of labels , e.g. ttl_1, ttl_2, ... ttl_10
and if we want per-table metrics for TTL, we'd set something like SET (ttl_metric_label = 'ttl_1') for the metrics to be reported under a specific label.

ajwerner · 2022-03-14T18:27:02Z

I prefer this to having the labels added automatically. I didn't realize you had already done the automatic thing.

ajwerner · 2022-03-14T18:57:44Z

Okay, I'm on board. I have modest worries about people going nuts with this, but we can try to deal with that later.

andreimatei

i'm not a big fan of this, seems counterintuitive and not user friendly, but i understand why it's suggested :). my preference is the way presented in this PR.

fwiw, I agree. I didn't like the respective option, with a predefined list of metric names, in the changefeed case either.

But, I would be curious what options we have in CC for configuring our Prometheus for forget about timeseries that haven't been updated in a while (in particular, forget their names for the purposes of whatever indexes it maintains). I'm thinking about tables whose names keep changing. Oliver, it might worth be looking into it.

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @rafiss)

otan · 2022-03-14T20:03:58Z

bors r+

maybe one day we can enjoy the pleasures of high cardinality systems :')

But, I would be curious what options we have in CC for configuring our Prometheus for forget about timeseries that haven't been updated in a while (in particular, forget their names for the purposes of whatever indexes it maintains). I'm thinking about tables whose names keep changing. Oliver, it might worth be looking into it.

i will follow up

craig · 2022-03-14T21:04:33Z

Build failed (retrying...):

GitHub CI (Cockroach)

craig · 2022-03-14T23:47:05Z

Build succeeded:

GitHub CI (Cockroach)

otan requested review from rafiss and a team March 9, 2022 20:30

otan force-pushed the label_metrics branch from b6c0c13 to 621d767 Compare March 9, 2022 20:30

andreimatei reviewed Mar 14, 2022

View reviewed changes

craig bot merged commit 49a24b5 into cockroachdb:master Mar 14, 2022

cockroach-teamcity mentioned this pull request Mar 14, 2022

ttljob: only add labels when ttl_label_metrics is set cockroachdb/docs#13243

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ttljob: only add labels when `ttl_label_metrics` is set #77567

ttljob: only add labels when `ttl_label_metrics` is set #77567

otan commented Mar 9, 2022

cockroach-teamcity commented Mar 9, 2022

ajwerner commented Mar 9, 2022

otan commented Mar 9, 2022

ajwerner commented Mar 9, 2022

otan commented Mar 10, 2022

vy-ton commented Mar 11, 2022

otan commented Mar 11, 2022 •

edited

Loading

ajwerner commented Mar 14, 2022

ajwerner commented Mar 14, 2022

andreimatei left a comment

otan commented Mar 14, 2022

craig bot commented Mar 14, 2022

craig bot commented Mar 14, 2022

ttljob: only add labels when ttl_label_metrics is set #77567

ttljob: only add labels when ttl_label_metrics is set #77567

Conversation

otan commented Mar 9, 2022

cockroach-teamcity commented Mar 9, 2022

ajwerner commented Mar 9, 2022

otan commented Mar 9, 2022

ajwerner commented Mar 9, 2022

otan commented Mar 10, 2022

vy-ton commented Mar 11, 2022

otan commented Mar 11, 2022 • edited Loading

ajwerner commented Mar 14, 2022

ajwerner commented Mar 14, 2022

andreimatei left a comment

Choose a reason for hiding this comment

otan commented Mar 14, 2022

craig bot commented Mar 14, 2022

craig bot commented Mar 14, 2022

ttljob: only add labels when `ttl_label_metrics` is set #77567

ttljob: only add labels when `ttl_label_metrics` is set #77567

otan commented Mar 11, 2022 •

edited

Loading