Add points metadata support for archive indices #86655

ywelsch · 2022-05-11T08:03:47Z

Archive indices appear just as regular indices in a cluster, and can be part of index patterns when queried. To allow searches to quickly skip shards of archive indices that might not have relevant data, we're adding support for skipping shards of archive indices here that don't have data falling in the time range being queried. This is critical for the Kibana experience which relies on the date range picker to quickly skip some of the indices in an index pattern.

Doing the actual time-range query on the archive index is much more expensive as on a regular index (as it's using doc-values instead of points to run the query, equating to a full scan of the columnar data). The solution here is to make points metadata available in archive indices, so that the minimum and maximum value can be retrieved in constant time (only a tiny fraction of the full points capabilities).

elasticmachine · 2022-05-11T08:03:50Z

Pinging @elastic/es-search (Team:Search)

ywelsch · 2022-05-11T08:55:17Z

@elasticmachine run elasticsearch-ci/bwc

ywelsch · 2022-05-11T11:15:11Z

.../old-lucene-versions/src/main/java/org/elasticsearch/xpack/lucene/bwc/OldLuceneVersions.java

+        if (indexSettings.getIndexVersionCreated().isLegacyIndexVersion()
+            && indexSettings.getIndexMetadata().isSearchableSnapshot() == false) {
+            return Optional.of(
+                engineConfig -> new ReadOnlyEngine(


Archive indices had already been read-only (enforced by write block that can't be removed). We're using ReadOnlyEngine here now that leverages some of the advantages of statically knowing that the index is read only. In particular, it fixes an issue where points are being used by peer recovery, which ReadOnly nicely sidesteps by making the document-based recovery a NOOP.

dnhatn

LGTM. Thanks Yannick.

ywelsch · 2022-05-16T11:27:57Z

Thanks @dnhatn!

jpountz

LGTM too. (Sorry for being late to the party!)

Add metadata support for points

e2ede67

ywelsch added >non-issue :Search/Search Search-related issues that do not fall into other categories v8.3.0 labels May 11, 2022

elasticmachine added the Team:Search Meta label for search team label May 11, 2022

rename

e5fea23

ywelsch mentioned this pull request May 11, 2022

Snapshots as simple archives #81210

Closed

32 tasks

ywelsch added 3 commits May 11, 2022 12:06

fix recovery

0da971c

fix test

a9e527c

Merge remote-tracking branch 'elastic/master' into points-meta-support

cfb8380

ywelsch commented May 11, 2022

View reviewed changes

ywelsch requested review from dnhatn and jpountz May 11, 2022 11:17

ywelsch added 2 commits May 11, 2022 13:35

only instantiate if not already instantiated by source-only snap repo

72db7cd

spotless

0431bf4

dnhatn approved these changes May 16, 2022

View reviewed changes

ywelsch merged commit 8a58e36 into elastic:master May 16, 2022

jpountz reviewed May 17, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add points metadata support for archive indices #86655

Add points metadata support for archive indices #86655

ywelsch commented May 11, 2022

elasticmachine commented May 11, 2022

ywelsch commented May 11, 2022

ywelsch May 11, 2022

dnhatn left a comment

ywelsch commented May 16, 2022

jpountz left a comment

Add points metadata support for archive indices #86655

Add points metadata support for archive indices #86655

Conversation

ywelsch commented May 11, 2022

elasticmachine commented May 11, 2022

ywelsch commented May 11, 2022

ywelsch May 11, 2022

Choose a reason for hiding this comment

dnhatn left a comment

Choose a reason for hiding this comment

ywelsch commented May 16, 2022

jpountz left a comment

Choose a reason for hiding this comment