YCSB performance analysis #26137

arruw · 2018-05-28T09:18:14Z

Is it normal that CRDB perform worst on three node cluster then on one node. I don't have infrastructure to test performance on more nodes. But I'm assuming that this is because range replication, by default ranges are replicated 3 times. So I'm assuming that after adding more then 3 nodes (with default configuration) I should see performance increase almost linearly?
I also noticed that with increasing connections performance drop much faster then on regular Postgres database, what are your thoughts on that?

The measurements was made using brianfrankcooper/YCSB tool. To get more accurate results, each test was ran 3 times, both databases run on same infrastructure with default configuration. Infrastructure was build out of 4 old computers (i5, HDD, 4GB, 1Gb/s) each running Ubuntu server 16.04 LTE. They ware connected with Gigabit Ethernet switch.

All results are located here YCSB-Results - v2 - EN.xlsx.

NOTE: Number at base of series means number of connections where maximal throughput was reached.

tbg · 2018-05-28T09:30:40Z

I haven't looked into this in detail, but note that there's an open PR that is said to drastically improve YCSB performance: #25014

@nvanbenschoten, could you take a look at this?

nvanbenschoten · 2018-05-29T20:53:40Z

Hi @matjazmav, thanks for performing this benchmarking! We've done previous testing with YCSB (see #20448) and found similar results. As Tobi mentioned, we do have one change in the pipeline that has a lot of promise to dramatically improve some of the contention-heavy YCSB workloads. If you're interested, you could try doing a comparison before and after with that change.

But I'm assuming that this is because range replication, by default ranges are replicated 3 times.

That assumption is correct.

So I'm assuming that after adding more then 3 nodes (with default configuration) I should see performance increase almost linearly?

This would be a safe assumption for a well-distributed workload. Unfortunately YCSB is the opposite of that. It uses a zipf distribution to create large hotspots in activity, which works against the effects of horizontal scalability.

I also noticed that whit increasing connections performance drop much faster then on regular Postgres database, what are your thoughts on that?

I believe that this is the issue that #25014 is trying to fix. Specifically, contended writes are not handled as well as they could be in Cockroach.

arruw · 2018-05-30T06:25:59Z

@nvanbenschoten Thank you for explanation. Are results of YCSB testing that you have done available somewhere, I would like to compare it with mine? Maybe I'll repeat this benchmark on next version, I'm currently limited with time writing thesis :)

I forgot to mention that current testing was done on top of Docker Swarm, each node running Docker version 18.03.0-ce and using official Docker image cockroachdb/cockroach:v2.0.1.

arruw · 2018-05-30T06:29:45Z

@nvanbenschoten I believe you have nightly benchmarks to compare performance over time? Are this results publicly available?

nvanbenschoten · 2018-05-30T16:24:16Z

@matjazmav unfortunately we do not have published YCSB numbers at the moment, as past benchmarking has not been rigorous enough to permit publication. This is in contrast to TPC-C, where we have published results with detailed reproduction steps. The best results we have available in the linked issues above, but none of these are "official".

In terms of nightly benchmarks, we don't currently have a good method of visualizing results over time. This is being tracked in #24366.

arruw · 2018-05-31T06:49:02Z

@nvanbenschoten I'm looking at first two charts at workload C. How can I explain that both have almost same latency, but if I look at throughput Postgres perform more then 2x better?

nvanbenschoten · 2018-08-21T01:25:21Z

I'm looking at first two charts at workload C. How can I explain that both have almost same latency, but if I look at throughput Postgres perform more then 2x better?

@matjazmav that indicates to me that Cockroach needs to perform more work and is, therefore, more resource hungry than Postgres even though it is able to achieve almost the same latency. In some sense, this is expected because Cockroach needs to perform more work to maintain a consistent replication across its three nodes.

That said, we've made a number of performance improvements to CockroachDB for our upcoming 2.1 release. In particular, #25014 landed, which dramatically improves our performance on YCSB. I would be interested in how this affects our comparison to Postgres.

We have a few other big changes on the horizon that should also have dramatic effects on YCSB. The most important of these is the ability to push partial-row update operations directly to the data so that we can avoid the read-then-write operation we currently need to perform for workloads like YCSB.

I'm going to close this for now since there's not anything actionable to do, but please feel free to continue the discussion.

knz added C-investigation Further steps needed to qualify. C-label will change. O-community Originated from the community S-1-blocking-adoption C-performance Perf of queries or internals. Solution not expected to change functional behavior. labels May 28, 2018

tbg added the A-kv-client Relating to the KV client and the KV interface. label Jun 5, 2018

tbg assigned nvanbenschoten Jun 5, 2018

benesch changed the title ~~YCSB performance analisis~~ YCSB performance analysis Jul 19, 2018

nvanbenschoten added this to the 2.1 milestone Jul 23, 2018

nvanbenschoten closed this as completed Aug 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

YCSB performance analysis #26137

YCSB performance analysis #26137

arruw commented May 28, 2018 •

edited

Loading

tbg commented May 28, 2018

nvanbenschoten commented May 29, 2018

arruw commented May 30, 2018

arruw commented May 30, 2018

nvanbenschoten commented May 30, 2018

arruw commented May 31, 2018

nvanbenschoten commented Aug 21, 2018

YCSB performance analysis #26137

YCSB performance analysis #26137

Comments

arruw commented May 28, 2018 • edited Loading

tbg commented May 28, 2018

nvanbenschoten commented May 29, 2018

arruw commented May 30, 2018

arruw commented May 30, 2018

nvanbenschoten commented May 30, 2018

arruw commented May 31, 2018

nvanbenschoten commented Aug 21, 2018

arruw commented May 28, 2018 •

edited

Loading