
sql: fix cluster setting propagation flake take 2 #95583

Merged (1 commit) on Jan 20, 2023

Conversation

@cucaroach (Contributor)

Previously we tried to fix this with one retry but that was
insufficient. Extend it to all queries in this section of the test.

Release note: None
Epic: CRDB-20535
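
A minimal sketch of the pattern this change applies, assuming the logic test framework's retry option (the setting and query here are illustrative, not the actual diff):

statement ok
SET CLUSTER SETTING sql.spatial.experimental_box2d_comparison_operators.enabled = on

# retry re-runs the query with backoff until the expected output matches,
# absorbing the delay before the setting propagates to the gateway node.
query T retry
SHOW CLUSTER SETTING sql.spatial.experimental_box2d_comparison_operators.enabled
----
on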

@cucaroach changed the title from "sql: fix cluster setting propagation flake take #2" to "sql: fix cluster setting propagation flake take 2" on Jan 20, 2023
@cucaroach marked this pull request as ready for review on January 20, 2023 15:31
@cucaroach requested a review from yuzefovich on January 20, 2023 15:31
@cucaroach (Contributor Author)

This test now runs 10k times under stress without flaking.
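
(For reference, a hedged example of the kind of stress invocation meant here; the package, test name, and run count are assumptions, not taken from the thread:)

make stress PKG=./pkg/sql/logictest TESTS=TestLogic STRESSFLAGS='-maxruns 10000'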

@yuzefovich (Member) left a comment


:lgtm:

Re: can we ensure that all nodes in the cluster see the updated cluster setting value? Perhaps we could adjust the logic test to explicitly read the cluster setting from each node (with the nodeidx directive) and with a retry option to wait until the setting is propagated, but we run these tests in 1- and 3-node configs, so we'd also need to skip some of those reads. In short, adding the retries seems like the easiest option.

Reviewed 1 of 1 files at r1, all commit messages.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @cucaroach)
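
A hedged sketch of the per-node read described above (nodeidx and retry are the logic test options the comment refers to; the setting is illustrative):

# Read the setting from node index 2 and retry until it shows the new value.
# This directive would have to be skipped in the 1-node configs, which is
# the complication the comment mentions.
query T nodeidx=2,retry
SHOW CLUSTER SETTING sql.spatial.experimental_box2d_comparison_operators.enabled
----
on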

@cucaroach (Contributor Author)

Re: can we ensure that all nodes in the cluster see the updated cluster setting value? Perhaps we could adjust the logic test to explicitly read the cluster setting from each node (with the nodeidx directive) and with a retry option to wait until the setting is propagated, but we run these tests in 1- and 3-node configs, so we'd also need to skip some of those reads. In short, adding the retries seems like the easiest option.

So basically the read side is the problem: there's a rangefeed on every node that updates the in-memory settings cache. What I'd like to see is a primitive to wait for the rangefeed invocations to be done, e.g.:

# Bounding box operations.
statement ok
SET CLUSTER SETTING sql.spatial.experimental_box2d_comparison_operators.enabled = on

statement ok
SELECT crdb_internal.await_range_feed_progress_on_all_nodes(crdb_internal.get_latest_timestamp('system.settings'))

But I have no idea how that would work; presumably we'd read the latest timestamp from that table and then check that all the other nodes' rangefeeds have advanced to that timestamp. There's some discussion here:

#87201

Basically I agreed with Yahor that retries are the easiest option, but this feels like a problem that could come up again and could use a better solution.
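
(One purely hypothetical refinement of the sketch above: the hidden crdb_internal_mvcc_timestamp column is real and could supply the timestamp of the last settings write, while the await function remains imaginary:)

statement ok
SELECT crdb_internal.await_range_feed_progress_on_all_nodes(
  (SELECT max(crdb_internal_mvcc_timestamp) FROM system.settings)
)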

@cucaroach (Contributor Author)

bors r+

@craig (Contributor)

craig bot commented Jan 20, 2023

Build succeeded.
