Adding a split cache in Searchers #3857

fulmicoton · 2023-09-20T07:17:49Z

Searcher split cache

Quickwit includes a split cache. It can be useful for specific workloads:

to improve performance
to reduce the cost associated with GET requests.

The split cache stores entire split files on disk.
It works under the following configurable constraints:

number of concurrent downloads
amount of disk space
number of on-disk files.

Searchers get tipped by indexers about the existence of splits (for which they have the best affinity).
They may also learn about a split's existence upon read requests.

The searcher is then in charge of maintaining an in-memory data structure with a bounded list of splits it knows about and their score.
The current strategy for admission/eviction is a simple LRU logic.

We consider downloading the most recently accessed split that is not already in the cache.
If the limits have been reached, we only proceed with eviction if one of the splits currently in the cache has been accessed less recently.
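
The admission/eviction rule above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `SplitTable`, `Limits`, `touch`, and `try_admit` are hypothetical names, the score is assumed to be a plain logical clock, and the sketch ignores the concurrent-download limit and the bound on the number of tracked candidates.

```rust
use std::collections::HashMap;

/// Hypothetical limits; the field names are illustrative, not Quickwit's config keys.
struct Limits {
    max_num_bytes: u64,
    max_num_files: usize,
}

struct SplitInfo {
    num_bytes: u64,
    /// Logical clock tick of the last access; larger means more recent.
    last_accessed: u64,
    on_disk: bool,
}

struct SplitTable {
    limits: Limits,
    clock: u64,
    splits: HashMap<String, SplitInfo>,
}

impl SplitTable {
    fn new(limits: Limits) -> Self {
        SplitTable { limits, clock: 0, splits: HashMap::new() }
    }

    /// Records a tip from an indexer or a read request for a split.
    fn touch(&mut self, split_id: &str, num_bytes: u64) {
        self.clock += 1;
        let tick = self.clock;
        self.splits
            .entry(split_id.to_string())
            .and_modify(|info| info.last_accessed = tick)
            .or_insert(SplitInfo { num_bytes, last_accessed: tick, on_disk: false });
    }

    /// Tries to admit the most recently accessed split that is not on disk.
    /// Returns the admitted split id and the ids evicted to make room, or
    /// `None` if there is no candidate or admission would require evicting
    /// a split accessed more recently than the candidate.
    fn try_admit(&mut self) -> Option<(String, Vec<String>)> {
        let (candidate_id, candidate_tick, candidate_bytes) = self
            .splits
            .iter()
            .filter(|(_, info)| !info.on_disk)
            .max_by_key(|(_, info)| info.last_accessed)
            .map(|(id, info)| (id.clone(), info.last_accessed, info.num_bytes))?;

        // On-disk splits, least recently accessed first.
        let mut on_disk: Vec<(u64, String, u64)> = self
            .splits
            .iter()
            .filter(|(_, info)| info.on_disk)
            .map(|(id, info)| (info.last_accessed, id.clone(), info.num_bytes))
            .collect();
        on_disk.sort();

        let mut num_files = on_disk.len();
        let mut num_bytes: u64 = on_disk.iter().map(|(_, _, bytes)| *bytes).sum();
        let mut evicted = Vec::new();
        let mut victims = on_disk.into_iter();
        while num_files + 1 > self.limits.max_num_files
            || num_bytes + candidate_bytes > self.limits.max_num_bytes
        {
            match victims.next() {
                // Only evict splits accessed less recently than the candidate.
                Some((tick, id, bytes)) if tick < candidate_tick => {
                    num_files -= 1;
                    num_bytes -= bytes;
                    evicted.push(id);
                }
                // Nothing evictable is left: give up, leaving the cache untouched.
                _ => return None,
            }
        }
        // Evicted splits stay in the table as candidates for a later admission.
        for id in &evicted {
            if let Some(info) = self.splits.get_mut(id) {
                info.on_disk = false;
            }
        }
        self.splits.get_mut(&candidate_id)?.on_disk = true;
        Some((candidate_id, evicted))
    }
}
```

With a limit of two files, touching `a`, `b`, then `c` (after `a` and `b` are admitted) evicts `a` to admit `c`. A later attempt to re-admit `a` without a fresh access is refused, because both cached splits were accessed more recently than `a`.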

docs/internals/searcher-split-cache.md

quickwit/quickwit-common/src/fs.rs

guilload · 2023-09-20T13:30:26Z

quickwit/quickwit-indexing/src/actors/uploader.rs

@@ -327,6 +343,9 @@ impl Handler<PackagedSplitBatch> for Uploader {
                counters.num_staged_splits.fetch_add(split_metadata_list.len() as u64, Ordering::SeqCst);

                let mut packaged_splits_and_metadata = Vec::with_capacity(batch.splits.len());
+
+                event_broker.publish(ReportSplitsRequest { report_splits });


Why do we report the splits after a successful upload rather than a successful publish?

There were a bunch of reasons that are not necessarily relevant, to be honest.

There were two original motivations:

Getting more info in the ReportSplit message to eventually make it possible to improve on our cache logic:
I strongly suspect we might want to be smart enough to prioritize caching mature splits.
(Aside: interestingly, the number of merge ops before split maturity should not matter much. The idea is that a split with merge ops n+1 is m times larger than a split with merge ops n, and its life expectancy is also m times longer... Overall, you get roughly the same bang for the buck by downloading one or the other.)

One of our customers went back and forth on the need to make sure we hit the cache all of the time. Reporting before publish makes it possible to extend the configuration later to change the behavior. For instance, someone who really cares about caching everything could use a config flag to block publishing until at least one node has a copy in its cache.

There was another place that I considered, but at that point I did not have access to the list of split ids, and only had them one by one. I thought it was a bit sad to increase the number of RPCs.

These are not great reasons. I'll see if I can move stuff to the MetastoreEventPublisher.

Ah no, the thing we are missing in the publisher is the storage URI.

quickwit/quickwit-search/src/search_job_placer.rs

quickwit/quickwit-search/src/service.rs

quickwit/quickwit-serve/src/lib.rs

quickwit/quickwit-storage/src/split_cache/mod.rs

Co-authored-by: Adrien Guillo <[email protected]>

fulmicoton force-pushed the split_cache_rebased branch 9 times, most recently from 1c411ef to 22e07f9 Compare September 20, 2023 08:20

fulmicoton requested a review from guilload September 20, 2023 09:11

Added a split cache

015fe2c

fulmicoton force-pushed the split_cache_rebased branch from 22e07f9 to 015fe2c Compare September 20, 2023 14:35

guilload approved these changes Sep 20, 2023

fulmicoton and others added 5 commits September 21, 2023 10:00

Apply suggestions from code review

a853bef

Co-authored-by: Adrien Guillo <[email protected]>

CR

2eb8e06

Apply suggestions from code review

07a5d49

Co-authored-by: Adrien Guillo <[email protected]>

CR

96f0bb2

Fixed bug introduced by PR

0f04333

fulmicoton force-pushed the split_cache_rebased branch from 32e5d5b to 0f04333 Compare September 21, 2023 06:16

Truncating the number of candidates tracked

ebb9a28

fulmicoton force-pushed the split_cache_rebased branch from 6a56ebb to ebb9a28 Compare September 21, 2023 07:17

Merge branch 'main' into split_cache_rebased

433c4c5

fulmicoton enabled auto-merge (squash) September 21, 2023 13:17

fulmicoton merged commit b9a2215 into main Sep 21, 2023

fulmicoton deleted the split_cache_rebased branch September 21, 2023 13:31

This was referenced Oct 2, 2023

Split cache #3786

Closed

SSD Split "cache" #3443

Closed
