fix: stuck queries when too many events with same timestamp #586
Conversation
SQL Server tests are not able to start the database to begin with. Looks like this path
Will try pinning to an earlier version instead.
Force-pushed from e0f3d01 to 6505f54
take(2) shouldBe Set("A1", "A2") | ||
take(3) shouldBe Set("A3", "B3", "C3") | ||
take(2) shouldBe Set("A4", "B4") | ||
take(2) shouldBe Set("A5", "A6") |
Postgres does return the events with the same seq number in insertion order, but we probably shouldn't rely on that in tests.
LGTM
...with a few suggestions
core/src/main/scala/akka/persistence/r2dbc/internal/BySliceQuery.scala (outdated, resolved)
core/src/test/scala/akka/persistence/r2dbc/query/EventsBySliceSpec.scala (outdated, resolved)
Nice!
Force-pushed from 90aeaac to f2ace61
Test failing with …
I've updated to track the timestamp from the previous query, and only apply the extra seq nr filter when it's already known to be the same timestamp for the next query: 09e05fc
looking good
// only filter by highest seen seq nr when the next query is the same timestamp (or when unknown for initial queries)
private def highestSeenSeqNr(previous: TimestampOffset, latest: TimestampOffset): Option[Long] =
  Option.when((previous == TimestampOffset.Zero || previous.timestamp == latest.timestamp) && latest.seen.nonEmpty) {
    latest.seen.values.max
  }
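For illustration, here is a minimal self-contained sketch of the same logic, using a stand-in `Offset` type rather than the real `akka.persistence.query.TimestampOffset` (the names and values here are illustrative only):

```scala
import java.time.Instant

// stand-in for TimestampOffset: latest timestamp plus seen seq nr per persistence id
final case class Offset(timestamp: Instant, seen: Map[String, Long])
object Offset { val Zero: Offset = Offset(Instant.EPOCH, Map.empty) }

def highestSeenSeqNr(previous: Offset, latest: Offset): Option[Long] =
  Option.when((previous == Offset.Zero || previous.timestamp == latest.timestamp) && latest.seen.nonEmpty) {
    latest.seen.values.max
  }

val t1 = Instant.parse("2023-01-01T00:00:00Z")
val t2 = t1.plusSeconds(1)

// initial query, previous offset unknown: the seq nr filter applies
assert(highestSeenSeqNr(Offset.Zero, Offset(t1, Map("pid1" -> 2L))).contains(2L))
// next query starts at the same timestamp: the seq nr filter applies
assert(highestSeenSeqNr(Offset(t1, Map("pid1" -> 1L)), Offset(t1, Map("pid1" -> 3L, "pid2" -> 2L))).contains(3L))
// timestamp advanced: no seq nr filter, the query is by timestamp as usual
assert(highestSeenSeqNr(Offset(t1, Map("pid1" -> 3L)), Offset(t2, Map("pid3" -> 1L))).isEmpty)
```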
Note that sequence numbers are per persistence id, so a later timestamp can have an earlier sequence number.
That is correct. How is that handled? I guess it is caught by backtracking, in the same way as when events for pid-A may become visible before events for pid-B, even though the timestamp of pid-B is before pid-A.
If the events are there, it handles it by only filtering by highest sequence number for the same timestamp as the query starting timestamp. See also where the query is adjusted, and the comment there.
So if there are events (ordered by timestamp, seq nr):

timestamp | seq nr | pid | event
--- | --- | --- | ---
t1 | 1 | pid1 | A1
t1 | 1 | pid2 | B1
t1 | 1 | pid3 | C1
t1 | 2 | pid1 | A2
t1 | 2 | pid2 | B2
t1 | 3 | pid1 | A3
t2 | 2 | pid3 | C2
t2 | 3 | pid2 | B3
t2 | 4 | pid1 | A4
And a buffer size of 4: the first query only gets to the 4th event (A2). Before, it would restart from `timestamp >= t1`, stuck repeating the same 4 events (A1, B1, C1, A2). Now it will use the query `timestamp >= t1 AND (timestamp != t1 OR seq_nr >= 2)` for the next query, starting from the 4th event, processing (A2 (deduplicated), B2, A3, C2). The seq number filter only applies to the starting timestamp, so the query is otherwise just by timestamp as usual (see the simulation sketch below).

This applies to both the regular queries and the backtracking queries. Backtracking catches any late-arriving events as usual.
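To make the paging concrete, here is a minimal simulation of the example above (hypothetical `Row` and `page` names, not the actual query code), applying the same filter shape `timestamp >= from AND (timestamp != from OR seq_nr >= fromSeqNr)` with a buffer size of 4:

```scala
// hypothetical in-memory simulation of the paging in the example above
final case class Row(timestamp: Long, seqNr: Long, pid: String, event: String)

val rows = Seq(
  Row(1, 1, "pid1", "A1"), Row(1, 1, "pid2", "B1"), Row(1, 1, "pid3", "C1"),
  Row(1, 2, "pid1", "A2"), Row(1, 2, "pid2", "B2"), Row(1, 3, "pid1", "A3"),
  Row(2, 2, "pid3", "C2"), Row(2, 3, "pid2", "B3"), Row(2, 4, "pid1", "A4"))

// same shape as: timestamp >= :from AND (timestamp != :from OR seq_nr >= :from_seq_nr)
def page(from: Long, fromSeqNr: Long, bufferSize: Int): Seq[Row] =
  rows
    .filter(r => r.timestamp >= from && (r.timestamp != from || r.seqNr >= fromSeqNr))
    .sortBy(r => (r.timestamp, r.seqNr))
    .take(bufferSize)

val first = page(from = 1, fromSeqNr = 1, bufferSize = 4)
// first:  A1, B1, C1, A2 -- ends at (t1, seq nr 2), timestamp alone can't make progress
val second = page(from = 1, fromSeqNr = 2, bufferSize = 4)
// second: A2 (deduplicated downstream), B2, A3, C2 -- progresses past t1
```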
Good, that should work fine. What I was thinking about would be classified as a late-arriving event.
Some(
  dao
    .rowsBySlices(
      entityType,
      minSlice,
      maxSlice,
      fromTimestamp,
      fromSeqNr,
      toTimestamp,
      behindCurrentTime,
      backtracking = newState.backtracking))
Do we need to adjust BySliceQuery.deserializeAndAddOffset? It has a check on the buffer size and throws IllegalStateException.
That check is whether the seen map exceeds the buffer size (more persistence ids on the same timestamp than the buffer size). We could adjust it to only throw if all the sequence numbers are the same, which would not be handled by this fix (it would be stuck on both timestamp and seq number). But that many persistence ids with events on the exact same timestamp already feels exceptional, so I thought it's fine to leave as is. A sketch of both variants is below.
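For reference, a hedged sketch of what that guard and the considered (not adopted) adjustment could look like; this is illustrative only, not the actual `deserializeAndAddOffset` code, and the names are assumptions:

```scala
// existing behavior (sketch): throw when more persistence ids share the
// latest timestamp than fit in the buffer
def checkSeenSize(seen: Map[String, Long], bufferSize: Int): Unit =
  if (seen.size > bufferSize)
    throw new IllegalStateException(
      s"Too many persistence ids [${seen.size}] with events at the same timestamp, buffer size [$bufferSize]")

// considered adjustment (sketch): only throw when the seq nr filter cannot help
// either, i.e. all seen seq nrs are equal (stuck on both timestamp and seq nr)
def checkSeenSizeAdjusted(seen: Map[String, Long], bufferSize: Int): Unit =
  if (seen.size > bufferSize && seen.values.toSet.size == 1)
    throw new IllegalStateException(
      s"Too many persistence ids [${seen.size}] with the same timestamp and seq nr, buffer size [$bufferSize]")
```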
I agree 👍
Also discussed on an internal issue: we may want to have a limit on the number of events that can be persisted together in the first place, as these are committed atomically in a transaction, and a very large write could indicate a bug in user code. Or if we want to support writing large numbers of events in the same operation, we could group events into batches with separate transactions (it wouldn't be atomic across all events though); see the sketch below.
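As a sketch of that batching idea (a hypothetical helper; `persistBatch` is an assumed callback, not an existing akka-persistence-r2dbc API):

```scala
// split one large write into per-batch transactions,
// trading atomicity across all events for a bounded batch size
def persistInBatches[E](events: Seq[E], maxBatchSize: Int)(persistBatch: Seq[E] => Unit): Unit =
  events.grouped(maxBatchSize).foreach { batch =>
    persistBatch(batch) // each batch committed in its own transaction
  }
```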
LGTM
We've seen a case where more than 1000 events were persisted together (so they have the same timestamp), more than the default query buffer size, and then queries get stuck on the same timestamp and never make further progress.

Since events are ordered by timestamp and sequence number, update the queries to also filter by the highest seen seq number for the latest timestamp (only for events with the same timestamp as this starting timestamp). This fixes the case where all the events are for the same persistence id. Queries will still duplicate events with both the same timestamp and seq number (different persistence ids), to handle events across the buffer limit. The very edge case of more events than the buffer size, all with the same timestamp and the same seq number (from different persistence ids), would not be handled.

So this always ends up adding an additional filter to queries. We could look at only adding this extra conditional check when the latest timestamp is the same as in the previous query (that returned results), to only handle this particular case. The backtracking filter adjustment (updated in this PR) would need to be aware of this too.
Another alternative is to go further and always do all the filtering on the database side, adding conditions for all the latest seen sequence numbers, per persistence id. We would then expect no duplicated events from queries. It complicates the queries even more though, with something like:

db_timestamp >= :from_timestamp AND (persistence_id NOT IN (pid1, pid2, ..., pidN) OR (persistence_id = pid1 AND seq_nr > pid1SeqNr) OR (persistence_id = pid2 AND seq_nr > pid2SeqNr) OR ... OR (persistence_id = pidN AND seq_nr > pidNSeqNr))

But this would also guarantee progress in all cases, and the additional conditions are only for the 'seen' persistence ids that share the same latest timestamp (see the sketch below).

Marking draft while this is under discussion.
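For illustration, a sketch of how such a per-persistence-id condition could be assembled (a hypothetical helper; real code would bind parameters rather than interpolate values into the SQL string):

```scala
// builds the database-side filter described above from the seen map
// (persistence id -> highest seen seq nr for the latest timestamp)
def perPidCondition(seenSeqNrs: Map[String, Long]): String = {
  val pids = seenSeqNrs.keys.map(pid => s"'$pid'").mkString(", ")
  val perPid = seenSeqNrs
    .map { case (pid, seqNr) => s"(persistence_id = '$pid' AND seq_nr > $seqNr)" }
    .mkString(" OR ")
  s"db_timestamp >= :from_timestamp AND (persistence_id NOT IN ($pids) OR $perPid)"
}

// perPidCondition(Map("pid1" -> 3L, "pid2" -> 3L))
// => db_timestamp >= :from_timestamp AND (persistence_id NOT IN ('pid1', 'pid2')
//    OR (persistence_id = 'pid1' AND seq_nr > 3) OR (persistence_id = 'pid2' AND seq_nr > 3))
```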