This repository has been archived by the owner on Sep 17, 2024. It is now read-only.

kv: add cycle-length to limit set of written keys #50

Merged
merged 1 commit into from
May 9, 2017

Conversation

tbg
Member

@tbg tbg commented May 8, 2017

This parameter truncates & loops the sequence around. As a consequence, the
number of keys is bounded by `concurrency * cycleLength`.

Something odd I've noticed is that this:

```
go run main.go --cycle-length 1 --concurrency 8
_elapsed___errors__ops/sec(inst)___ops/sec(cum)__p95(ms)__p99(ms)_pMax(ms)
      1s        0          344.6          344.6    100.7    134.2    285.2
      2s        0          319.2          331.9    104.9    209.7    285.2
      3s        0          325.0          329.6     96.5    184.5    251.7
      4s        0          322.9          327.9    117.4    167.8    209.7
```

is much slower than this:

```
go run main.go --cycle-length 10 --concurrency 8
_elapsed___errors__ops/sec(inst)___ops/sec(cum)__p95(ms)__p99(ms)_pMax(ms)
      1s        0         3165.0         3164.9      5.5      7.9     10.5
      2s        0         3194.6         3179.8      5.2      8.9     12.6
      3s        0         3093.7         3151.1      5.5      8.4     12.1
      4s        0         2807.6         3065.3      6.3     11.0     13.1
```

Unless I'm mistaken, the eight clients "never" intersect, so the only
difference is that in the first example each client hammers a single key of
its own, while in the second each works over a small set of its own keys.
Perhaps there is more range parallelism in the latter, but you wouldn't
expect it. The difference disappears with `--concurrency=1`.

The motivation for this change is
cockroachdb/cockroach#15756:

Bad behavior in the GC queue can be reproduced by running a single-node
cluster and

```
go run main.go --cycle-length 1 --concurrency 1 --min-block-bytes $((1024*1024)) --max-block-bytes $((1024*1024*2))
```

and, after a few gigs of data have piled up,

```
./cockroach zone set .default -f - --insecure <<EOF
gc:
  ttlseconds: 600
EOF
```

In one run, this resulted in

```
queue_gc_processingnanos{store="1"} 4.26018985343e+11
```

or roughly 426 seconds of cumulative GC processing.

There's also a "mystery" that I haven't really looked into: Replace the `*2`
in the `go run` invocation above with `*10`, and immediately get `driver: bad connection`.


This change is Reviewable

@tbg
Member Author

tbg commented May 8, 2017

@petermattis I looked into adding `--delete-percent` as well (actually wrote the code), but it's less useful for me: I need few total keys with many versions, not many keys each carrying a deletion tombstone; those would split in practice. That could be prevented, but it's easier to deal with low cardinality.

@tbg
Member Author

tbg commented May 8, 2017

(And there's also the issue that a high percentage of deletes would leave few actual values, and a double delete is idempotent, which doesn't help my cause.)


@petermattis petermattis left a comment


LGTM

kv/main.go Outdated
```diff
@@ -129,7 +131,7 @@ type generator struct {
 func newGenerator(seq *sequence) *generator {
 	return &generator{
 		seq: seq,
-		rand: rand.New(rand.NewSource(int64(time.Now().UnixNano()))),
+		rand: rand.New(rand.NewSource(rand.Int63())),
```
Contributor


What motivated this change?

Member Author


If `time.Now().UnixNano()` returns the same timestamp for two generators (it surely shouldn't on reasonable systems, but Never Trust Clocks™), you end up with multiple writers overlapping on the same keys, which is unintentional. Seeding from `rand.Int63()` seems more legit.

Contributor


Do we seed the global rand? If we don't, then this is not doing what you expect.

Contributor


https://golang.org/src/math/rand/rand.go#L235:

```go
var globalRand = New(&lockedSource{src: NewSource(1).(Source64)})
```

Member Author


Good point about seeding the global rand, I've indeed set us up for a problem here. I'll back this out (the concern above is synthetic anyway).

@tbg tbg merged commit b9ecf80 into master May 9, 2017
@tbg tbg deleted the cycle branch May 9, 2017 12:58