
[BREAKING] opt(compactions): Improve compaction performance #1574

Merged — 24 commits merged into master on Oct 26, 2020

Conversation

@manishrjain (Contributor) commented Oct 23, 2020

Implement multiple ideas for speeding up compactions:

  1. Dynamic Level Sizes: https://rocksdb.org/blog/2015/07/23/dynamic-level.html
  2. L0 to L0 compactions: https://rocksdb.org/blog/2017/06/26/17-level-based-changes.html
  3. Sub Compactions: Split up one compaction into multiple sub-compactions using key ranges, which can be run concurrently.
  4. If a table being generated at Li overlaps with 10 or more tables at Li+1, finish the table early. This avoids large overlaps and expensive compactions later.
  5. Update compaction priority based on the priority of the next level, prioritizing compactions of lower levels over upper levels. This keeps the LSM tree structure healthy at all times.

With these changes, we can load 1B entries (160GB of data) into Badger (without the Stream framework) in 1h25m at 31 MB/s. This is a significant improvement over current master.
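To illustrate idea 3, here is a minimal, self-contained sketch of how a compaction can be split into concurrent sub-compactions by key range. This is not Badger's actual code: `keyRange`, `splitRange`, and `runSubcompactions` are simplified stand-ins for the real `keyRange`, iterator, and `subcompact` machinery.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// keyRange is a simplified stand-in for Badger's keyRange type.
type keyRange struct{ left, right string }

// splitRange divides kr into contiguous sub-ranges at the given split keys,
// so each sub-compaction iterates over a disjoint slice of the keyspace.
func splitRange(kr keyRange, splits []string) []keyRange {
	sort.Strings(splits)
	out := []keyRange{}
	left := kr.left
	for _, s := range splits {
		out = append(out, keyRange{left, s})
		left = s
	}
	return append(out, keyRange{left, kr.right})
}

// runSubcompactions runs one goroutine per sub-range. A real implementation
// would iterate the keys in each range and build tables; here each worker
// just records that its range was processed.
func runSubcompactions(ranges []keyRange) int {
	var wg sync.WaitGroup
	var mu sync.Mutex
	done := 0
	for _, r := range ranges {
		wg.Add(1)
		go func(r keyRange) {
			defer wg.Done()
			// Real code: iterate keys in [r.left, r.right) and emit tables.
			mu.Lock()
			done++
			mu.Unlock()
		}(r)
	}
	wg.Wait()
	return done
}

func main() {
	ranges := splitRange(keyRange{"a", "z"}, []string{"h", "p"})
	fmt.Println(len(ranges), runSubcompactions(ranges)) // 3 3
}
```

Because the sub-ranges are disjoint, the workers never contend on keys, and the output tables can simply be collected on a channel, as the real `subcompact` does with its `res` channel.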


@manishrjain manishrjain changed the title opt(compactions): Improve compaction performance [BREAKING] opt(compactions): Improve compaction performance Oct 23, 2020

@codelingo codelingo bot left a comment


2 issues found. 1 rule errored during the review.

levels.go (outdated)
// concurrently, only iterating over the provided key range, generating tables.
// This speeds up the compaction significantly.
func (s *levelsController) subcompact(it y.Iterator, kr keyRange, cd compactDef,
inflightBuilders *y.Throttle, res chan *table.Table) {

Returned channels or channel arguments should generally have a direction.

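The bot's point about channel direction can be shown with a small sketch (the `Table` type and `produce` function here are hypothetical, not Badger's): declaring the parameter as send-only (`chan<-`) lets the compiler reject any accidental receive inside the producer, which is what the rule asks for on `subcompact`'s `res` argument.

```go
package main

import "fmt"

// Table is a hypothetical stand-in for *table.Table.
type Table struct{ id int }

// produce only ever sends on res, so the parameter is declared send-only
// (chan<- *Table); receiving from res inside this function would not compile.
func produce(res chan<- *Table, n int) {
	for i := 0; i < n; i++ {
		res <- &Table{id: i}
	}
	close(res)
}

func main() {
	res := make(chan *Table)
	go produce(res, 3)
	count := 0
	for range res { // the caller keeps the bidirectional channel and receives
		count++
	}
	fmt.Println(count) // 3
}
```

A bidirectional channel converts implicitly to a directional one at the call site, so adopting the suggestion only requires changing the parameter type, not the callers.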

@@ -66,7 +66,7 @@ var (
vlogMaxEntries uint32
loadBloomsOnOpen bool
detectConflicts bool
compression bool
zstdComp bool

Avoid global variables to improve readability and reduce complexity

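The usual remedy for the flagged globals (`zstdComp` and friends) is to gather them into an options struct that is passed explicitly. A minimal sketch under assumed names (`options`, `open`, `compressionName` are illustrative, not Badger's API):

```go
package main

import "fmt"

// options groups what were package-level flag variables into one value
// that is passed explicitly, so each component sees only the config
// it was given rather than mutable global state.
type options struct {
	detectConflicts bool
	zstdComp        bool
}

type db struct{ opt options }

// open threads the options through instead of reading globals.
func open(opt options) *db { return &db{opt: opt} }

func (d *db) compressionName() string {
	if d.opt.zstdComp {
		return "zstd"
	}
	return "snappy"
}

func main() {
	d := open(options{zstdComp: true})
	fmt.Println(d.compressionName()) // zstd
}
```

Badger in fact follows this pattern for its main configuration via `badger.Options`; the bot is flagging test-harness globals that bypass it.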

@manishrjain manishrjain merged commit 45bca18 into master Oct 26, 2020
@manishrjain manishrjain deleted the mrjn/compactions branch October 26, 2020 21:13
NamanJain8 pushed a commit that referenced this pull request Nov 5, 2020

Co-authored-by: Ibrahim Jarif <[email protected]>

fix(tests): Writebatch, Stream, Vlog tests (#1577)

This PR fixes the following issues/tests:
 - Deadlock in write batch - use an atomic to set the value of `writebatch.error`.
 - Vlog truncate test - fix issues with empty memtables.
 - Test options - set the memtable size.
 - Compaction tests - acquire the lock before updating level tables.
 - Vlog write - truncate the file size if the transaction cannot fit in the vlog size.
 - TestPersistLFDiscardStats - set numLevelZeroTables=1 to force compaction.

This PR also fixes the failing bank test by adding an index cache.