
Tunable Cache Eviction Policies #5445

Open
esatterwhite opened this issue Sep 24, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@esatterwhite
Collaborator

esatterwhite commented Sep 24, 2024

Is your feature request related to a problem? Please describe.

We ingest and search on petabytes of data reaching as far back as 90 days. We maintain multiple terabytes of cache for Quickwit. Our search workload is very front-heavy with respect to time: the further a given document's timestamp is from now, the less it is accessed. Data ingested in the last 3 days is generally the most frequently accessed. However, we have internal jobs that run once a day or once a week that may access older data, and our users may periodically run reports or expensive aggregations across months of data. Because Quickwit uses LRU cache eviction policies, these expensive, one-off, infrequently executed queries can invalidate a large portion of the cache, resulting in degraded performance across the cluster.
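The pollution pattern described above can be sketched with a toy bounded LRU cache (illustrative only; `LruCache` here is a hypothetical stand-in, not Quickwit's actual cache code):

```rust
use std::collections::VecDeque;

// Minimal LRU cache over split IDs (sketch, not Quickwit's implementation).
struct LruCache {
    capacity: usize,
    // Front = most recently used, back = least recently used.
    entries: VecDeque<String>,
}

impl LruCache {
    fn new(capacity: usize) -> Self {
        Self { capacity, entries: VecDeque::new() }
    }

    // Touch a key: move it to the front, evicting the LRU entry if full.
    fn access(&mut self, key: &str) {
        if let Some(pos) = self.entries.iter().position(|k| k == key) {
            self.entries.remove(pos);
        } else if self.entries.len() == self.capacity {
            self.entries.pop_back(); // evict least recently used
        }
        self.entries.push_front(key.to_string());
    }

    fn contains(&self, key: &str) -> bool {
        self.entries.iter().any(|k| k == key)
    }
}

fn main() {
    let mut cache = LruCache::new(3);
    // Hot splits from the last 3 days are accessed repeatedly.
    for _ in 0..10 {
        cache.access("day-1");
        cache.access("day-2");
        cache.access("day-3");
    }
    // A single one-off report scans three old splits...
    cache.access("day-60");
    cache.access("day-61");
    cache.access("day-62");
    // ...and every hot split is gone, despite being accessed far more often.
    assert!(!cache.contains("day-1"));
    assert!(!cache.contains("day-3"));
    assert!(cache.contains("day-62"));
}
```

Recency alone cannot distinguish the hot splits from the cold ones touched by the scan, which is exactly the failure mode this issue describes.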

Describe the solution you'd like
It would be good for our workload to have a way to switch the eviction policy, namely from the default Least Recently Used (LRU) to a Least Frequently Used (LFU) policy. This would allow Quickwit to evict split entries that are much less likely to be accessed again, rather than entries that merely happen to be slightly older than others, keeping hot/relevant data in cache.
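A minimal sketch of the requested LFU behavior, assuming a frequency counter per cached split ID (`LfuCache` is hypothetical, not a Quickwit API):

```rust
use std::collections::HashMap;

// Minimal LFU cache sketch (illustrative; Quickwit's real cache differs).
struct LfuCache {
    capacity: usize,
    counts: HashMap<String, u64>, // key -> access frequency
}

impl LfuCache {
    fn new(capacity: usize) -> Self {
        Self { capacity, counts: HashMap::new() }
    }

    fn access(&mut self, key: &str) {
        if !self.counts.contains_key(key) && self.counts.len() == self.capacity {
            // Evict the least frequently used entry to make room.
            if let Some(victim) = self
                .counts
                .iter()
                .min_by_key(|(_, count)| **count)
                .map(|(k, _)| k.clone())
            {
                self.counts.remove(&victim);
            }
        }
        *self.counts.entry(key.to_string()).or_insert(0) += 1;
    }

    fn contains(&self, key: &str) -> bool {
        self.counts.contains_key(key)
    }
}

fn main() {
    let mut cache = LfuCache::new(3);
    // Hot splits accumulate high access counts.
    for _ in 0..10 {
        cache.access("day-1");
        cache.access("day-2");
        cache.access("day-3");
    }
    // The same one-off scan now only displaces one hot entry: each newly
    // scanned split has count 1 and becomes the next eviction victim itself.
    cache.access("day-60");
    cache.access("day-61");
    cache.access("day-62");
    let hot_survivors = ["day-1", "day-2", "day-3"]
        .iter()
        .filter(|k| cache.contains(k))
        .count();
    assert_eq!(hot_survivors, 2); // most hot data stays cached
    assert!(cache.contains("day-62"));
}
```

Compared with the LRU case, the one-off scan churns through a single slot instead of flushing the whole cache. A production version would also need an aging/decay mechanism so that stale high counts do not pin dead entries forever.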

@esatterwhite esatterwhite added the enhancement New feature or request label Sep 24, 2024
@fulmicoton
Contributor

thanks for the accurate description of your issue!

@esatterwhite
Collaborator Author

esatterwhite commented Oct 10, 2024

@fulmicoton out of curiosity does the split cache operate per index?

Or could it? How complicated would that be? I'm just thinking of ways to remove or reduce the impact of a bad actor or a one-off expensive query so that it doesn't impact the entire cluster.

Even if we moved away from the daily index pattern, the per-customer pattern still leaves us with thousands of indexes and a fairly unbalanced search workload.

@fulmicoton
Contributor

The split cache does not operate per index.
