TSDB: Fix "duplicates" #251

Merged
merged 2 commits into from
Mar 15, 2022

Conversation

Member

@nik9000 nik9000 commented Mar 14, 2022

Now that we're calculating the `_id` from dimensions, I found that we
have duplicated `_id`s in our test data. But it's not because we
actually measure things three times. No! We just hadn't annotated all of
the dimensions as `time_series_dimension`s.
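
To illustrate why a missing annotation can produce colliding `_id`s, here is a minimal Python sketch. It assumes, purely for illustration, that the `_id` is a hash of `@timestamp` plus the values of the fields marked as dimensions. The helper `make_id`, the field names, and the hashing scheme are all hypothetical and are not Elasticsearch's actual implementation.

```python
import hashlib
import json


def make_id(doc, dimensions):
    """Hypothetical _id: hash of the timestamp plus the dimension values only."""
    key = {"@timestamp": doc["@timestamp"]}
    key.update({d: doc.get(d) for d in sorted(dimensions)})
    return hashlib.sha256(json.dumps(key, sort_keys=True).encode()).hexdigest()[:16]


# Two distinct measurements taken at the same instant (made-up field names).
docs = [
    {"@timestamp": "2022-03-14T00:00:00Z", "pod": "a", "container": "c1", "cpu": 0.1},
    {"@timestamp": "2022-03-14T00:00:00Z", "pod": "a", "container": "c2", "cpu": 0.7},
]

# "container" is not annotated as a dimension here, so both docs share an _id.
partial = [make_id(d, {"pod"}) for d in docs]

# Once every distinguishing field is a dimension, the _ids differ.
full = [make_id(d, {"pod", "container"}) for d in docs]
```

Under these assumptions the two entries of `partial` are equal while the two entries of `full` are not, which mirrors the "duplicates" described above.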

@nik9000 nik9000 marked this pull request as ready for review March 14, 2022 17:20
tsdb/index.json Outdated
@@ -6563,7 +6565,7 @@
 },
 "name": {
   "type": "keyword",
-  "ignore_above": 1024
+  "time_series_dimension": true
Member Author


Sadly, I don't think we can use this one. It's the host name that metricbeat is running on. I don't think that is a sensible thing to split time series on.

Member Author

nik9000 commented Mar 14, 2022

> But it's not because we
> actually measure things three times. No! We just hadn't annotated all of
> the dimensions as `time_series_dimension`s.

I'm pretty sure we do have duplicates. In the event data set, each metricbeat logs the event.
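
One way to tell the two cases apart is to group documents by their dimension-derived key and check whether the colliding documents are otherwise identical. The sketch below is only an assumption about how such a check could look: `classify_collisions`, the `agent` field (standing in for the reporting metricbeat), and the sample documents are hypothetical and do not come from the test data.

```python
from collections import defaultdict


def classify_collisions(docs, dimensions, ignore=("agent",)):
    """Count colliding groups that are true duplicates vs. under-annotated."""
    groups = defaultdict(list)
    for doc in docs:
        key = (doc["@timestamp"],) + tuple(doc.get(d) for d in sorted(dimensions))
        groups[key].append(doc)

    counts = {"true_duplicates": 0, "missing_dimension": 0}
    for group in groups.values():
        if len(group) < 2:
            continue  # no collision for this key
        # Drop fields that only identify who reported the document.
        stripped = [{k: v for k, v in d.items() if k not in ignore} for d in group]
        if all(s == stripped[0] for s in stripped):
            counts["true_duplicates"] += 1  # same event logged more than once
        else:
            counts["missing_dimension"] += 1  # some differing field is not a dimension
    return counts


ts = "2022-03-14T00:00:00Z"
sample = [
    {"@timestamp": ts, "event": "restart", "agent": "m1"},
    {"@timestamp": ts, "event": "restart", "agent": "m2"},
    {"@timestamp": ts, "pod": "a", "container": "c1", "cpu": 0.1},
    {"@timestamp": ts, "pod": "a", "container": "c2", "cpu": 0.7},
]
counts = classify_collisions(sample, {"pod"})
```

With this made-up sample, the two `restart` documents count as a true duplicate, while the two `pod` documents collide only because `container` is not treated as a dimension.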

@nik9000 nik9000 requested a review from imotov March 15, 2022 17:47
Member Author

nik9000 commented Mar 15, 2022

This doesn't remove all duplicates. But it does remove some duplicates. Which is lovely.

Contributor

@imotov imotov left a comment


LGTM

@@ -7816,7 +7818,7 @@
 },
 "name": {
   "type": "keyword",
-  "ignore_above": 1024
+  "time_series_dimension": true
Contributor


Nice! 😄

@nik9000 nik9000 merged commit 2a36ba8 into elastic:master Mar 15, 2022