SSD Split "cache" #3443

fulmicoton · 2023-05-30T14:00:43Z

For some slightly high performance/high traffic we want a solution to have splits data available on a local SSDs.

I put "cache" under quotation mark, because it is not really a cache. It should be populated, upon publish (as opposed to "on first read").
I called it "a cache" because it plays no role in durability. If the node hosting this local copy is down, the query can still hit S3 directly and perform as usual.

This ticket will likely involve

a pre-flight query
a way to lock splits in the cache for a given grace period, to prevent their eviction
some logic (rendez vous hashing?) to place splits on nodes?
some eviction strategy, etc.

imotov · 2023-05-31T03:08:05Z

What is the role of "a pre-flight query" in this context?

fulmicoton · 2023-05-31T08:43:45Z

Right now we do assign split's leaf_search to searchers using some logic that tries to spread the work evenly and maximize affinity, the affinity being defined using rendez-vous hashing.

But @trinity-1686a added a "split search result cache" and you are about to add a split cache.
It might be interesting to have one round of RPC to ask leaf nodes if they have data already in their cache before assigning the leaf search request.

Introduces a distinct Control Plane component, which is now seperate from the Indexing Scheduler. See #3443 See #3622

Introduces a distinct Control Plane component, which is now separate from the Indexing Scheduler. See #3443 See #3622

For #3443 I need to be able to perform initialization on storage factory level and in order to do that I need access to the config parameters during initialization rather than during storage resolution. This PR moves the storage config parameters to the storage initializer.

For #3443 I need to be able to perform initialization on storage factory level and in order to do that I need access to the config parameters during initialization rather than during storage resolution. This PR moves the storage config parameters to the storage initializer. Co-authored-by: Adrien Guillo <[email protected]>

Removes duplicate code from the cluster sandbox and makes it possible to run cluster with custom node configurations. It is needed for SSD Cache testing for now but can be useful for other issues as well. See #3443

guilload · 2023-10-08T21:19:42Z

Closed via #3857.

fulmicoton added enhancement New feature or request high-priority labels May 30, 2023

fulmicoton assigned imotov May 30, 2023

guilload mentioned this issue Jun 2, 2023

Store hot data on searchers' local SSDs #2557

Closed

imotov added a commit that referenced this issue Jul 13, 2023

Separate IndexingScheduler from Control Plane

7adee5f

Introduces a distinct Control Plane component, which is now seperate from the Indexing Scheduler. See #3443 See #3622

imotov added a commit that referenced this issue Jul 13, 2023

Separate IndexingScheduler from Control Plane

015733a

Introduces a distinct Control Plane component, which is now separate from the Indexing Scheduler. See #3443 See #3622

imotov added a commit that referenced this issue Jul 13, 2023

Separate IndexingScheduler from Control Plane

2eeaf6a

Introduces a distinct Control Plane component, which is now separate from the Indexing Scheduler. See #3443 See #3622

imotov mentioned this issue Jul 13, 2023

Separate IndexingScheduler from Control Plane #3625

Merged

imotov added a commit that referenced this issue Jul 13, 2023

Separate IndexingScheduler from Control Plane (#3625)

96a8016

Introduces a distinct Control Plane component, which is now separate from the Indexing Scheduler. See #3443 See #3622

imotov mentioned this issue Aug 4, 2023

Refactor storage factory initialization #3709

Merged

imotov mentioned this issue Aug 8, 2023

POC for ssd cache #3723

Closed

10 tasks

imotov mentioned this issue Aug 18, 2023

Refactor Cluster Sandbox #3764

Merged

imotov assigned fulmicoton and unassigned imotov Aug 23, 2023

guilload closed this as completed Oct 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SSD Split "cache" #3443

SSD Split "cache" #3443

fulmicoton commented May 30, 2023

imotov commented May 31, 2023

fulmicoton commented May 31, 2023

guilload commented Oct 8, 2023

SSD Split "cache" #3443

SSD Split "cache" #3443

Comments

fulmicoton commented May 30, 2023

imotov commented May 31, 2023

fulmicoton commented May 31, 2023

guilload commented Oct 8, 2023