Adjust MicroBatchSize dynamically based on throttling rate in BulkExecutor #22290

Merged

Conversation

@FabianMeiswinkel (Member) commented Jun 15, 2021

This PR changes how we determine the micro-batch size when using bulk execution (CosmosAsyncContainer.processBulkOperations). Instead of relying on a user-provided static micro-batch size, the size is adjusted dynamically based on the percentage of throttled operations (either because the entire batch request is throttled or because the batch request is partially successful with some operations being throttled).
To be able to do this in the BulkExecutor I had to change the behavior of the ClientRetryPolicy's ResourceThrottlingRetryPolicy to allow 429s to bubble up to the BulkExecutor, so that the BulkExecutor's ResourceThrottlingRetryPolicy can trigger the retry and account for the throttled operations that way.
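As a rough illustration of the idea (not the actual BulkExecutor code in this PR; all class and field names below are hypothetical), a controller could shrink the micro-batch size whenever the throttled-operation ratio of the last batch response exceeds a threshold and grow it again while throttling stays low:

import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch only - not the BulkExecutor implementation from this PR.
final class MicroBatchSizeController {
    private static final int MAX_BATCH_SIZE = 100; // MAX_OPERATIONS_IN_DIRECT_MODE_BATCH_REQUEST
    private static final int MIN_BATCH_SIZE = 1;
    private static final double THROTTLE_THRESHOLD = 0.1; // assumed target: at most 10% throttled ops

    private final AtomicInteger currentBatchSize = new AtomicInteger(MAX_BATCH_SIZE);

    int getCurrentBatchSize() {
        return currentBatchSize.get();
    }

    // Called after each batch response with the number of throttled (429) and total operations.
    void record(int throttledOperations, int totalOperations) {
        if (totalOperations == 0) {
            return;
        }
        double throttledRatio = (double) throttledOperations / totalOperations;
        currentBatchSize.updateAndGet(size -> {
            if (throttledRatio > THROTTLE_THRESHOLD) {
                // Back off aggressively while a meaningful share of operations is throttled.
                return Math.max(MIN_BATCH_SIZE, size / 2);
            }
            // Recover slowly while throttling stays below the threshold.
            return Math.min(MAX_BATCH_SIZE, size + 1);
        });
    }
}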
The riskiest change was that reactor's bufferTimeout operator doesn't allow specifying a dynamic maxBufferSize, so I had to switch to another operator (bufferUntil) and implement a custom timer-triggered mechanism to flush the buffers and drain the remaining operations from them. This timer-based mechanism is only triggered after the input Flux (the user-provided Flux of operations to be executed) has completed.
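A minimal sketch of how bufferUntil can take over from bufferTimeout, assuming the dynamic size is published through an AtomicInteger. This simplification only checks the elapsed interval when a new operation arrives; the PR's separate timer-triggered flush, which drains partially filled buffers after the input Flux completes, is not reproduced here:

import java.time.Duration;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicLong;
import reactor.core.publisher.Flux;

// Hypothetical sketch only - not the BulkExecutor code from this PR.
final class DynamicBufferingSketch {

    static <T> Flux<List<T>> microBatches(
            Flux<T> operations,
            AtomicInteger dynamicBatchSize,   // adjusted elsewhere based on the throttling rate
            Duration maxMicroBatchInterval) {

        // Reactor serializes onNext signals, so this per-subscription state needs no extra locking.
        AtomicInteger itemsInBuffer = new AtomicInteger();
        AtomicLong bufferOpenedAtNanos = new AtomicLong(System.nanoTime());

        return operations.bufferUntil(operation -> {
            if (itemsInBuffer.getAndIncrement() == 0) {
                bufferOpenedAtNanos.set(System.nanoTime());
            }
            boolean sizeReached = itemsInBuffer.get() >= dynamicBatchSize.get();
            boolean intervalElapsed =
                System.nanoTime() - bufferOpenedAtNanos.get() >= maxMicroBatchInterval.toNanos();
            if (sizeReached || intervalElapsed) {
                itemsInBuffer.set(0);
                return true; // close the current buffer, including this operation
            }
            return false;
        });
    }
}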

From my initial tests with the Spark end-to-end samples this approach works very well. It reduces the percentage of throttled requests significantly when no client throughput control is enabled, and with client throughput control it helps reduce the micro-batch size so that the achievable throughput is as expected while the throughput can still be limited reasonably well.

@FabianMeiswinkel FabianMeiswinkel changed the title [DRAFT - do not review yet]Adjust MicroBatchSize dynamically based on throttling rate in BulkExecutor Adjust MicroBatchSize dynamically based on throttling rate in BulkExecutor Jun 16, 2021
@@ -14,7 +14,9 @@
public static final int MAX_OPERATIONS_IN_DIRECT_MODE_BATCH_REQUEST = 100;

public static final int DEFAULT_MAX_MICRO_BATCH_INTERVAL_IN_MILLISECONDS = 100;
- public static final int DEFAULT_MAX_MICRO_BATCH_CONCURRENCY = 2;
+ public static final int DEFAULT_MAX_MICRO_BATCH_CONCURRENCY = 1;
@xinlian12 (Member) commented Jun 16, 2021

Why change the concurrency here?

@FabianMeiswinkel (Member, Author) replied:

Because a concurrency of 2 is pretty much always wrong. Even a single client with 4 cores can easily saturate the 10,000 RU/s of a physical partition.
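For a rough sense of scale (assuming on the order of 5-10 RU per 1 KB document write; this estimate is illustrative and not from the PR): a full 100-operation micro-batch costs roughly 500-1,000 RU, so about 10-20 such batch requests per second already exhaust a physical partition's 10,000 RU/s budget, a rate a single client thread can easily sustain.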

@FabianMeiswinkel (Member, Author) added:

So this change was intentional - but leaving the comment open for others to chime in as well.

Member:

If the concurrency should always be 1, then should we remove the setMaxMicroBatchConcurrency public API from bulkOption?

@FabianMeiswinkel (Member, Author) replied:

The only scenario where I think a higher maxConcurrency makes sense is if you build a web service accepting requests containing info that results in multiple documents you want to ingest, so where your call to processBulkOperations would only contain, say, a couple dozen or a few hundred documents. I can imagine that latency might be better with higher concurrency, but this is an edge case. Customers can still modify the concurrency, and 1 as the default seems to meet most scenarios better.
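For that edge case, a caller could raise the setting explicitly. A hedged usage sketch follows; the bulk options type was in beta around the time of this PR and has since been renamed, so the exact package, class, and method names below (BulkProcessingOptions, processBulkOperations returning a Flux to drain) are assumptions rather than a definitive reference:

import com.azure.cosmos.BulkProcessingOptions;
import com.azure.cosmos.CosmosAsyncContainer;
import com.azure.cosmos.models.CosmosItemOperation;
import reactor.core.publisher.Flux;

// Usage sketch only - class and method names are assumed from the beta-era bulk API
// discussed in this thread and may differ in the SDK version you are using.
final class BulkConcurrencyExample {

    static void ingestSmallBatch(CosmosAsyncContainer container, Flux<CosmosItemOperation> operations) {
        BulkProcessingOptions<Object> options = new BulkProcessingOptions<>();
        // Only worth raising above the default of 1 for small, latency-sensitive bulk calls.
        options.setMaxMicroBatchConcurrency(2);
        container.processBulkOperations(operations, options).blockLast();
    }
}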

@FabianMeiswinkel FabianMeiswinkel merged commit ddafb5f into Azure:main Jun 23, 2021
azure-sdk pushed a commit to azure-sdk/azure-sdk-for-java that referenced this pull request Feb 28, 2023
[Hub Generated] Review request for Microsoft.DBforPostgreSQL to add version stable/2022-11-08 (Azure#22124)
