Optimize shard index loading #6618

jwilder · 2016-05-12T20:02:10Z

Required for all non-trivial PRs

Rebased/mergable
Tests pass
CHANGELOG.md updated

On data sets with many series, large series keys and many shards,
the cost of parsing the key and re-indexing can be high.

Loading the TSM keys into the index was being done repeatedly for
series that were already loaded into the index by an earlier TSM file. This was
wasted worked and slows down shard loading.

Parsing the key was also innefficient and allocated a new string
slice. This was simplified to remove that allocation.

I tested this on a dataset with two databases containing 155 and 85 shards each with ~300k series keys in each DB and keys ~250bytes in length.

0.13.0 (old/new)

5:32 -> 0:32

master (old/new)

4:25 -> 0.33

master cache flushed (old/new)

5:09 -> 1:42

This should help #6250 in some cases although there may still be other bottlenecks that this data set does not bring out.

mention-bot · 2016-05-12T20:02:12Z

By analyzing the blame information on this pull request, we identified @e-dard to be a potential reviewer

On data sets with many series and potentially large series keys, the cost of parsing the key and re-indexing can be high. Loading the TSM keys into the index was being done repeatedly for series that were already index by an earlier TSM file. This was wasted worked and slows down shard loading. Parsing the key was also innefficient and allocated a new string slice. This was simplified to remove that allocation.

corylanou · 2016-05-13T19:10:17Z

+1

jwilder force-pushed the jw-shard-load branch from 999b662 to 0dbd489 Compare May 12, 2016 20:03

jwilder added this to the 1.0.0 milestone May 12, 2016

jwilder added area/performance area/tsm labels May 12, 2016

jwilder mentioned this pull request May 12, 2016

InfluxDB starts for 2 hours #6250

Closed

jwilder merged commit 1187195 into master May 13, 2016

jwilder deleted the jw-shard-load branch May 13, 2016 20:16

jwilder mentioned this pull request May 13, 2016

High memory usage 0.12.2 #6513

Closed

timhallinflux modified the milestones: 1.0.0, 1.0.0 beta Dec 20, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize shard index loading #6618

Optimize shard index loading #6618

jwilder commented May 12, 2016 •

edited

Loading

mention-bot commented May 12, 2016

corylanou commented May 13, 2016

Optimize shard index loading #6618

Optimize shard index loading #6618

Conversation

jwilder commented May 12, 2016 • edited Loading

Required for all non-trivial PRs

0.13.0 (old/new)

master (old/new)

master cache flushed (old/new)

mention-bot commented May 12, 2016

corylanou commented May 13, 2016

jwilder commented May 12, 2016 •

edited

Loading