Improve performance of Query method #40

kevina · 2016-06-26T16:09:25Z

In ipfs/kubo#2760 @whyrusleeping said in a line comment:

Yeah, using a channel as an iterator sucks. If one of you wants to work on improving the perf of query that would be great.

We could change the interface to not use a channel, and have it instead just return the next value directly. Then on top of that we could provide a method for turning the direct query result into a channel buffered one for usecases that need it

kevina · 2016-06-26T16:12:19Z

@whyrusleeping I will be happy to look into this and determine where the bottleneck is. It may be as simple as increasing the buffer size. I will also try a direct iterator approach and see if that helps.

kevina · 2016-06-29T18:20:29Z

Here are some performance numbers for doing a key-only query on the leveldb datastore:

The buffer size is the channel buffer size, direct is the results from querying the level-db directly.

And here are some results from the flatfs datastore:

It seams that at least for key-only 128 in the optimal buffer size.

whyrusleeping · 2016-06-29T20:52:28Z

@kevina thanks for these graphs, i think youre right, we should buffer the channels at 128 for now. And if we need more perf later, give the option for direct iteration.

kevina · 2016-06-30T03:26:21Z

I updated the graph for flatfs queries. It seams there is enough overhead in the filepath.Walk that once the buffer is large enough the overhead of channels and goroutine is insignificant.

Use "make benchmark" to run.

kevina · 2016-06-30T19:39:29Z

I pushed the (somewhat hackish) code to create the graphs on the kevina/query-benchmarks for lack of a better place.

Use "make benchmark" to run.

kevina self-assigned this Jun 28, 2016

kevina mentioned this issue Jun 30, 2016

Set the channel buffer size to 128 for KeysOnly queries. #43

Merged

kevina added a commit that referenced this issue Jun 30, 2016

Code to create graphs for issue #40.

3014194

Use "make benchmark" to run.

kevina added a commit that referenced this issue Jun 30, 2016

Code to create graphs for issue #40.

1b23919

Use "make benchmark" to run.

kevina mentioned this issue Jul 2, 2016

update go-datastore changes 0.1.2 ipfs/kubo#2933

Merged

whyrusleeping added the help wanted Seeking public contribution on this issue label Sep 14, 2016

flyingzumwalt added the status/deferred Conscious decision to pause or backlog label Sep 26, 2016

kevina mentioned this issue Oct 4, 2016

Querying datastore currently very expensive ipfs/kubo#3270

Closed

kevina mentioned this issue Oct 31, 2016

Improve Garbage Collection Performance ipfs/kubo#3333

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of Query method #40

Improve performance of Query method #40

kevina commented Jun 26, 2016

kevina commented Jun 26, 2016

kevina commented Jun 29, 2016 •

edited

Loading

whyrusleeping commented Jun 29, 2016

kevina commented Jun 30, 2016

kevina commented Jun 30, 2016 •

edited

Loading

Improve performance of Query method #40

Improve performance of Query method #40

Comments

kevina commented Jun 26, 2016

kevina commented Jun 26, 2016

kevina commented Jun 29, 2016 • edited Loading

whyrusleeping commented Jun 29, 2016

kevina commented Jun 30, 2016

kevina commented Jun 30, 2016 • edited Loading

kevina commented Jun 29, 2016 •

edited

Loading

kevina commented Jun 30, 2016 •

edited

Loading