Move Aggregator#buildTopLevel() to search worker thread. #98715

iverase · 2023-08-22T09:18:18Z

This PR introduces an AggregatorCollector with a finish method that performs the aggregation post-collection and builds the internal aggregation for this collector. This method is called on the worker thread at the end of the collection phase.
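As a rough illustration of the idea (not the code in this PR), the sketch below collects each slice on a worker thread and calls finish() on that same thread before handing the collector back. `SliceSumCollector` and `FinishOnWorkerDemo` are hypothetical names, and a plain sum stands in for a real aggregation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for a per-slice collector; not the Elasticsearch class.
final class SliceSumCollector {
    private long sum;
    private Long result;        // set by finish()
    private String finishedOn;  // name of the thread that ran finish()

    void collect(int value) {
        sum += value;
    }

    // Post-collection and result building, run by the thread that collected.
    void finish() {
        result = sum;
        finishedOn = Thread.currentThread().getName();
    }

    long result() {
        if (result == null) {
            throw new IllegalStateException("finish() has not been called");
        }
        return result;
    }

    String finishedOn() {
        return finishedOn;
    }
}

final class FinishOnWorkerDemo {
    // Collects every slice on a worker and finishes it on that same worker.
    static List<SliceSumCollector> run(List<int[]> slices) {
        ExecutorService workers = Executors.newFixedThreadPool(2);
        try {
            List<Future<SliceSumCollector>> futures = new ArrayList<>();
            for (int[] slice : slices) {
                futures.add(workers.submit(() -> {
                    SliceSumCollector collector = new SliceSumCollector();
                    for (int v : slice) {
                        collector.collect(v);
                    }
                    collector.finish(); // same thread as collect()
                    return collector;
                }));
            }
            List<SliceSumCollector> done = new ArrayList<>();
            for (Future<SliceSumCollector> future : futures) {
                done.add(future.get());
            }
            return done;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException(e);
        } finally {
            workers.shutdown();
        }
    }
}
```

The calling thread only reads the already-built per-slice results, which is the part that would then be reduced into a single response.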

The PR is set as a draft because it uncovered an issue with global ordinals. In this case you get errors like:

java.lang.AssertionError: Sorted doc values are only supposed to be consumed in the thread in which they have been acquired. But was acquired in Thread[#267,elasticsearch[node_s3][search][T#5],5,TGRP-NestedIT] and consumed in Thread[#273,elasticsearch[node_s3][search_worker][T#4],5,TGRP-NestedIT].
	at __randomizedtesting.SeedInfo.seed([B198436B0B2A21D7]:0)
	at org.apache.lucene.tests.index.AssertingLeafReader.assertThread(AssertingLeafReader.java:67)
	at org.apache.lucene.tests.index.AssertingLeafReader$AssertingSortedDocValues.lookupOrd(AssertingLeafReader.java:908)
	at org.apache.lucene.index.SingletonSortedSetDocValues.lookupOrd(SingletonSortedSetDocValues.java:95)
	at org.elasticsearch.search.aggregations.bucket.terms.GlobalOrdinalsStringTermsAggregator$StandardTermsResults.convertTempBucketToRealBucket(GlobalOrdinalsStringTermsAggregator.java:752)

The issue is that global ordinals are created on the first collector but then reused by the other collectors during the postcollection / internal aggregation building phase.
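To make the failure mode concrete, here is a minimal sketch of a value source that remembers the thread that acquired it and rejects reads from any other thread. It only mimics the kind of check the asserting reader performs in the stack trace above; `ThreadConfinedLookup` and `ThreadConfinementDemo` are hypothetical names, not Lucene or Elasticsearch APIs.

```java
// Hypothetical stand-in for thread-confined doc values; not a Lucene API.
final class ThreadConfinedLookup {
    private final Thread owner = Thread.currentThread(); // acquiring thread
    private final String[] terms;

    ThreadConfinedLookup(String... terms) {
        this.terms = terms.clone();
    }

    // Resolves an ordinal to its term, but only on the acquiring thread.
    String lookupOrd(int ord) {
        Thread current = Thread.currentThread();
        if (current != owner) {
            throw new IllegalStateException(
                "acquired in " + owner.getName() + " but consumed in " + current.getName());
        }
        return terms[ord];
    }
}

final class ThreadConfinementDemo {
    // Returns true if a read from a fresh thread was rejected.
    static boolean rejectedOnOtherThread(ThreadConfinedLookup lookup) {
        boolean[] rejected = new boolean[1];
        Thread other = new Thread(() -> {
            try {
                lookup.lookupOrd(0);
            } catch (IllegalStateException e) {
                rejected[0] = true;
            }
        });
        other.start();
        try {
            other.join(); // join() makes the write to rejected[0] visible here
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return rejected[0];
    }
}
```

A lookup created by one collector's thread and then read while another thread builds results would trip this check in the same way.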

In order to get around the issue we disable the asserting codec.

closes #98705

elasticsearchmachine · 2023-08-22T09:18:42Z

Pinging @elastic/es-analytics-geo (Team:Analytics)

test/framework/src/main/java/org/elasticsearch/test/ESIntegTestCase.java

server/src/main/java/org/elasticsearch/search/aggregations/AggregationPhase.java

# Conflicts: # server/src/main/java/org/elasticsearch/search/profile/query/InternalProfileCollector.java # server/src/main/java/org/elasticsearch/search/query/QueryPhaseCollector.java

# Conflicts: # test/framework/src/main/java/org/elasticsearch/search/aggregations/AggregatorTestCase.java

martijnvg · 2023-09-08T15:49:59Z

I think that, to fix the timeout exception in the search cancellation issue we see with the GraphTests#testTimedoutQueryCrawl() test, we need the ability to not throw the TimeExceededException on search worker threads that are doing post-collection work. I tried the patch below (on top of this PR) and it seemed to fix the test in question locally:

Subject: [PATCH] search-worker-overwrites
---
Index: server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java
IDEA additional info:
Subsystem: com.intellij.openapi.diff.impl.patch.CharsetEP
<+>UTF-8
===================================================================
diff --git a/server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java b/server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java
--- a/server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java	(revision 5ac1b3057cff072e2d82e932697215ea022be7bb)
+++ b/server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java	(date 1694187992273)
@@ -54,10 +54,12 @@
 import java.util.Comparator;
 import java.util.HashSet;
 import java.util.List;
+import java.util.Map;
 import java.util.Objects;
 import java.util.PriorityQueue;
 import java.util.Set;
 import java.util.concurrent.CancellationException;
+import java.util.concurrent.ConcurrentHashMap;
 import java.util.concurrent.ExecutionException;
 import java.util.concurrent.Executor;
 import java.util.concurrent.Future;
@@ -489,7 +491,12 @@
             // otherwise the state of the aggregation might be undefined and running post collection
             // might result in an exception
             if (success || timeExceeded) {
-                doAggregationPostCollection(collector);
+                try {
+                    timeoutOverwrites.put(Thread.currentThread(), true);
+                    doAggregationPostCollection(collector);
+                } finally {
+                    timeoutOverwrites.remove(Thread.currentThread());
+                }
             }
         }
     }
@@ -505,8 +512,12 @@
         return timeExceeded;
     }
 
+    private final Map<Thread, Boolean> timeoutOverwrites = new ConcurrentHashMap<>();
+
     public void throwTimeExceededException() {
-        throw new TimeExceededException();
+        if (timeoutOverwrites.getOrDefault(Thread.currentThread(), false) == false) {
+            throw new TimeExceededException();
+        }
     }
 
     private static class TimeExceededException extends RuntimeException {
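For readers who want to try the mechanism outside the code base, the patch's approach can be reduced to a small standalone sketch: a per-thread flag that suppresses the timeout exception while post-collection runs on that thread. `TimeoutGate`, `TimeExceeded`, and `runWithoutTimeout` are hypothetical names chosen for this illustration; only the map-keyed-by-thread idea comes from the patch.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical reduction of the patch above; not the ContextIndexSearcher code.
final class TimeoutGate {
    static final class TimeExceeded extends RuntimeException {}

    // Threads that are currently allowed to ignore the timeout.
    private final Map<Thread, Boolean> overrides = new ConcurrentHashMap<>();

    // Throws unless the calling thread has an active override.
    void throwTimeExceeded() {
        if (overrides.getOrDefault(Thread.currentThread(), false) == false) {
            throw new TimeExceeded();
        }
    }

    // Runs the task with timeouts suppressed for the calling thread only.
    void runWithoutTimeout(Runnable postCollection) {
        Thread current = Thread.currentThread();
        overrides.put(current, true);
        try {
            postCollection.run();
        } finally {
            overrides.remove(current); // restore normal behaviour even on failure
        }
    }
}
```

Because the flag is keyed by thread, other threads that are still collecting keep observing the timeout, while the thread doing post-collection can finish building its partial result.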

martijnvg

LGTM

server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java

server/src/main/java/org/elasticsearch/search/aggregations/support/AggregationContext.java

jpountz · 2023-09-14T07:32:23Z

This change is a bit hard for me to review because I've been away from this code for some time. That said, given that the approach for aggregations is to treat each slice of segments as a mini-shard, it makes sense to me to run buildTopLevel() in the same thread where we ran the collector. This would not only address this issue, but also better parallelize aggregation execution, as I believe that this buildTopLevel() operation is not always cheap, e.g. terms aggregations?

iverase · 2023-09-14T08:23:47Z

> This would not only address this issue, but also better parallelize aggregation execution, as I believe that this buildTopLevel() operation is not always cheap, e.g. terms aggregations?

That's right

server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java

javanna · 2023-09-14T12:53:42Z

server/src/main/java/org/elasticsearch/search/internal/ContextIndexSearcher.java

@@ -498,7 +516,9 @@ public boolean timeExceeded() {
    }

    public void throwTimeExceededException() {
-        throw new TimeExceededException();
+        if (timeoutOverwrites.getOrDefault(Thread.currentThread(), false) == false) {


Looking at the changes above, I am wondering if there are situations where post-collection does want a timeout to be thrown. Are there? If not, is there a way to disable timeouts in post-collection directly? I am worried that this type of change will make it harder to migrate to Lucene's timeout support.

No, the current logic expects no timeouts during the post-collection phase.

> is there a way to disable timeouts in post collection directly?

Not as far as I know. The main issue is the deferrable aggregations, which run during that phase and actually access the directory, which can throw timeouts.

Could we not do something similar to what we were doing before? I mean, what is the point of having timeouts if we don't throw an exception when there is one? Should we rather remove the timeout runnable at this point, before post-collection? Or are you worried that we may not honour cancellation if we do so?

As you said, we cannot remove the timeout as it affects all running threads. We still want other threads to honour cancellation.

Note that before this change we also just built the top-level internal aggregations when a timeout occurred (in AggregationPhase). This workaround allows us to do the same; otherwise we can't return a partial aggregation response (we would just fail to produce the search response).

Ok got it, thanks for explaining.

server/src/main/java/org/elasticsearch/search/query/QueryPhase.java

test/framework/src/main/java/org/elasticsearch/search/aggregations/AggregatorTestCase.java

iverase · 2023-09-18T13:04:08Z

@ellasticmachine run elasticsearch-ci/part-1

javanna

LGTM

Move Aggregator#buildTopLevel() to search worker thread.

03318f9

iverase added :Analytics/Geo Indexing, search aggregations of geo points and shapes v8.11.0 labels Aug 22, 2023

iverase requested a review from martijnvg August 22, 2023 09:18

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Aug 22, 2023

iverase marked this pull request as draft August 22, 2023 09:18

iverase added >enhancement >non-issue and removed >enhancement labels Aug 22, 2023

martijnvg reviewed Aug 22, 2023


iverase added 13 commits August 23, 2023 16:41

Merge branch 'main' into AggregatorCollector

45670e4

fix profiler test

98572e4

Merge branch 'main' into AggregatorCollector

3454682

# Conflicts: # server/src/main/java/org/elasticsearch/search/profile/query/InternalProfileCollector.java # server/src/main/java/org/elasticsearch/search/query/QueryPhaseCollector.java

Merge branch 'main' into AggregatorCollector

ea356a8

Merge branch 'main' into AggregatorCollector

c8268f6

Update AggregatorTestCase

a642701

Merge branch 'main' into AggregatorCollector

108271a

run always in worker thread in aggregatorTestCase

48cbb41

fix test

7025ae9

Merge branch 'main' into AggregatorCollector

f06a33d

# Conflicts: # test/framework/src/main/java/org/elasticsearch/search/aggregations/AggregatorTestCase.java

fix docvalues access from different threads

684ebab

for partial reduction

4195486

remove @LuceneTestCase.SuppressCodecs("*")

5ac1b30

iverase added 2 commits September 12, 2023 11:54

Merge branch 'main' into AggregatorCollector

3ed684c

disable timeouts per thread

9527a8c

iverase marked this pull request as ready for review September 12, 2023 10:01

iverase requested a review from jpountz September 13, 2023 12:09

martijnvg approved these changes Sep 14, 2023


javanna reviewed Sep 14, 2023


iverase added 2 commits September 18, 2023 09:43

Merge branch 'main' into AggregatorCollector

8930293

address review comments

3f233ab

martijnvg reviewed Sep 18, 2023


test/framework/src/main/java/org/elasticsearch/search/aggregations/AggregatorTestCase.java

iverase added 2 commits September 18, 2023 11:44

address review comments

588cee0

doh

4527c13

javanna approved these changes Sep 19, 2023


iverase merged commit 4bc1afd into elastic:main Sep 19, 2023

iverase deleted the AggregatorCollector branch September 19, 2023 07:46

iverase mentioned this pull request Sep 19, 2023

Remove supportsParallelCollection implementation for some aggs #99654

Merged

iverase mentioned this pull request Sep 27, 2023

Make AggregatorTestCase#searchAndReduce(...) more inline with what happens in Search API #98672

Closed
