-
Notifications
You must be signed in to change notification settings - Fork 189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TSDB: Fix "duplicates" #251
Conversation
Now there we're calculating the `_id` from dimensions I found that we have duplicated `_id`s in our test data. But it's not because we actually measure things three time. No! We just handn't annotated all of the dimensions as `time_series_dimension`s.
tsdb/index.json
Outdated
@@ -6563,7 +6565,7 @@ | |||
}, | |||
"name": { | |||
"type": "keyword", | |||
"ignore_above": 1024 | |||
"time_series_dimension": true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sadly, I don't think we can use this one. It's the host name that metricbeat is running on. I don't think that is a sensible thing to split time series on.
I'm pretty sure we do have duplicates. In the |
This doesn't remove all duplicates. But it does remove some duplicates. Which is lovely. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@@ -7816,7 +7818,7 @@ | |||
}, | |||
"name": { | |||
"type": "keyword", | |||
"ignore_above": 1024 | |||
"time_series_dimension": true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice! 😄
Now there we're calculating the
_id
from dimensions I found that wehave duplicated
_id
s in our test data. But it's not because weactually measure things three time. No! We just handn't annotated all of
the dimensions as
time_series_dimension
s.