use search_after in scroll #4280
Conversation
Force-pushed from 5666fa7 to 1a68823
quickwit/quickwit-search/src/root.rs
```diff
@@ -491,6 +491,7 @@ async fn search_partial_hits_phase_with_scroll(
         )
         .next_page(leaf_search_resp.partial_hits.len() as u64);

+        scroll_ctx.truncate_start();
```
My understanding is that you are calling truncate_start to handle the case where the request's max_hits is > SCROLL_BATCH_LEN. If so, I don't understand why it isn't truncate (as opposed to truncate_start) that we want to call here.
If this is the use case you are targeting, I suggest we (see the sketch after this list):
- do not cache anything if max_hits is too large
- return an error and log a warning if scroll is used with a max_hits that is too large.
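A minimal sketch of that guard, assuming illustrative names (check_scroll_max_hits, the constant's value, and the String error type are all stand-ins, not the actual quickwit API):

```rust
// assumed batch size; the real constant lives in quickwit-search
const SCROLL_BATCH_LEN: usize = 1_000;

/// Hypothetical guard: refuse to create a scroll context when the requested
/// page size exceeds what we are willing to cache, rather than caching a
/// batch that can only partially serve the first page.
fn check_scroll_max_hits(max_hits: u64) -> Result<(), String> {
    if max_hits as usize > SCROLL_BATCH_LEN {
        // quickwit would presumably log through tracing::warn!; eprintln! stands in here
        eprintln!("warn: scroll requested with max_hits={max_hits} > {SCROLL_BATCH_LEN}");
        return Err(format!("scroll does not support max_hits > {SCROLL_BATCH_LEN}"));
    }
    Ok(())
}
```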
The idea is that the first elements in the cache have already been consumed. To avoid storing too many elements (i.e., to reduce memory pressure), we throw elements away so as to keep at most SCROLL_BATCH_LEN elems.
This is arguably not that useful: we could keep only one element (to know the values to provide for search_after) and throw everything else away. A toy model of the idea follows.
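A rough sketch of that behavior, with a plain Vec of u64 standing in for quickwit's real hit cache (field names are assumptions):

```rust
const SCROLL_BATCH_LEN: usize = 1_000; // assumed value

/// Toy model of the scroll cache; a u64 stands in for a cached hit.
struct ScrollContext {
    cached_hits: Vec<u64>,
    /// Number of hits from the front already served to the client.
    start_offset: usize,
}

impl ScrollContext {
    /// Drop already-consumed hits from the front so the cache holds at most
    /// SCROLL_BATCH_LEN elements. Keeping just the last consumed hit would
    /// be enough to recover the search_after cursor.
    fn truncate_start(&mut self) {
        let excess = self.cached_hits.len().saturating_sub(SCROLL_BATCH_LEN);
        let to_drop = excess.min(self.start_offset);
        self.cached_hits.drain(..to_drop);
        self.start_offset -= to_drop;
    }
}
```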
Got it.
I'd rather remove the truncate_start. If I understand correctly, it is there to optimize memory.
The complexity added is non-trivial.
The truncation relies on a very bad part of the Elasticsearch spec: scrolling is not idempotent. Calling Elasticsearch twice with the same scroll id advances the scroll. (This is bad engineering because it prevents retries on all kinds of errors.)
In addition, the function has a strange behavior that its name does not suggest: it truncates, but attempts to leave at least one element.

> we throw away elements so as to keep at most SCROLL_BATCH_LEN elems.

Even without the truncation, I think we already keep at most SCROLL_BATCH_LEN elements today (maybe that is a change you made recently).
So do we want to keep scrolling idempotent? Before this PR, it was (though at the cost of each page being slower and more memory-intensive to create than the previous one).
After this PR, but with the original truncate_start (or without it being called at all, relying on search_after-based scrolling), it is idempotent in some cases, but not always (and never if a page holds more than SCROLL_BATCH_LEN hits).
With the last commit, it is rarely idempotent.
Not really a change request: I did not understand the truncate_start stuff. Please have a look at my comment.
As discussed offline, let's put the search_after key in the scroll key, and avoid caching when max_hits is too large.
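A sketch of that direction; only the type name ScrollKeyAndStartOffset and the Ulid usage appear in the actual diff below, the fields and constructor are assumptions:

```rust
use ulid::Ulid; // the diff below shows Ulid::new() being used

/// Hypothetical scroll key that carries the search_after cursor with it,
/// so serving the next page never requires replaying the consumed range.
struct ScrollKeyAndStartOffset {
    scroll_ulid: Ulid,
    start_offset: u64,
    /// Sort value of the last hit served; passed as the search_after cursor
    /// of the next underlying query. A u64 stands in for the real sort key.
    search_after: u64,
}

impl ScrollKeyAndStartOffset {
    fn new_scroll(start_offset: u64, last_sort_value: u64) -> Self {
        ScrollKeyAndStartOffset {
            scroll_ulid: Ulid::new(),
            start_offset,
            search_after: last_sort_value,
        }
    }
}
```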
```rust
) -> ScrollKeyAndStartOffset {
    let scroll_ulid: Ulid = Ulid::new();
    // technically we could only initialize search_after on first call to next_page, and use
    // default() before, but that feels like partial initilization.
```
Suggested change:

```suggestion
// default() before, but that feels like partial initialization.
```
Description
fix #3748
- Use search_after as an implementation detail in scroll. We lose the ability to replay a request with the same scroll_id if pages span more than 1k documents (which isn't something that's technically allowed anyway), but we gain that the underlying request is always for the same number of documents, instead of growing linearly like before.
- Fix a bug where asking for pages of more than 1k docs would yield a first page of 1k, and only return the right number of docs on subsequent pages.
- Make it so that we can always answer a page with at most a single underlying query (before, it could take up to 10 queries: if the page was for 10k docs, we would issue 10 requests for 1k docs each). A toy model of the two strategies follows this list.
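The following is not quickwit code: sorted u64 doc ids stand in for hits and their sort values, and both function names are illustrative.

```rust
/// Old strategy: page N requires the leaves to produce the whole
/// top-(offset + page_len) list, so work grows linearly with the position.
fn fetch_page_by_offset(docs: &[u64], offset: usize, page_len: usize) -> Vec<u64> {
    // materialize everything up to the end of the requested page...
    let top: Vec<u64> = docs.iter().take(offset + page_len).copied().collect();
    // ...then discard the first `offset` hits, which were already served
    top.into_iter().skip(offset).collect()
}

/// New strategy: pass the last doc of the previous page as a search_after
/// cursor, so each request produces exactly `page_len` hits.
/// Assumes `docs` is sorted ascending.
fn fetch_page_by_search_after(docs: &[u64], search_after: Option<u64>, page_len: usize) -> Vec<u64> {
    docs.iter()
        .copied()
        .filter(|&doc| search_after.map_or(true, |cursor| doc > cursor))
        .take(page_len)
        .collect()
}
```

With the cursor approach, the tenth page of a 1k-docs-per-page scroll still only asks the leaves for 1k hits, which is why a page can now always be answered with a single underlying query.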
How was this PR tested?
Tested manually and updated unit tests.