Cherry-pick compaction enhancements and add range check function #346

tabokie · 2023-08-16T12:33:09Z

No description provided.

…1375) Summary: during manual compaction (CompactRange()), L0->L1 trivial move is disabled when only L0 overlaps with compacting key range (introduced in facebook#7368 to enforce kForce* contract). This can cause large memory usage due to compaction readahead when number of L0 files is large. This PR allows L0->L1 trivial move in this case, and will do a L1 -> L1 intra-level compaction when needed (`bottommost_level_compaction` is kForce*). In brief, consider a DB with only L0 file, and user calls CompactRange(kForce, nullptr, nullptr), - before this PR, RocksDB does a L0 -> L1 compaction (disallow trivial move), - after this PR, RocksDB does a L0 -> L1 compaction (allow trivial move), and a L1 -> L1 compaction. Users can use kForceOptimized to avoid this extra L1->L1 compaction overhead when L0s are overlapping and cannot be trivial moved. This PR also fixed a bug (see previous discussion in facebook#11041) where `final_output_level` of a manual compaction can be miscalculated when `level_compaction_dynamic_level_bytes=true`. This bug could cause incorrect level being moved when CompactRangeOptions::change_level is specified. Pull Request resolved: facebook#11375 Test Plan: - Added new unit tests to test that L0 -> L1 compaction allows trivial move and L1 -> L1 compaction is done when needed. Reviewed By: ajkr Differential Revision: D44943518 Pulled By: cbi42 fbshipit-source-id: e9fb770d17b163c18a623e1d1bd6b81159192708 Signed-off-by: tabokie <[email protected]>

…action (facebook#11468) Summary: currently for leveled compaction, the max output level of a call to `CompactRange()` is pre-computed before compacting each level. This max output level is the max level whose key range overlaps with the manual compaction key range. However, during manual compaction, files in the max output level may be compacted down further by some background compaction. When this background compaction is a trivial move, there is a race condition and the manual compaction may not be able to compact all keys in the specified key range. This PR updates `CompactRange()` to always compact to the bottommost level to make this race condition more unlikely (it can still happen, see more in comment here: https://github.com/cbi42/rocksdb/blob/796f58f42ad1bdbf49e5fcf480763f11583b790e/db/db_impl/db_impl_compaction_flush.cc#L1180C29-L1184). This PR also changes the behavior of CompactRange() when `bottommost_level_compaction=kIfHaveCompactionFilter` (the default option). The old behavior is that, if a compaction filter is provided, CompactRange() always does an intra-level compaction at the final output level for all files in the manual compaction key range. The only exception when `first_overlapped_level = 0` and `max_overlapped_level = 0`. It’s awkward to maintain the same behavior after this PR since we do not compute max_overlapped_level anymore. So the new behavior is similar to kForceOptimized: always does intra-level compaction at the bottommost level, but not including new files generated during this manual compaction. Several unit tests are updated to work with this new manual compaction behavior. Pull Request resolved: facebook#11468 Test Plan: Add new unit tests `DBCompactionTest.ManualCompactionCompactAllKeysInRange*` Reviewed By: ajkr Differential Revision: D46079619 Pulled By: cbi42 fbshipit-source-id: 19d844ba4ec8dc1a0b8af5d2f36ff15820c6e76f Signed-off-by: tabokie <[email protected]>

Signed-off-by: tabokie <[email protected]>

tonyxuqqi · 2023-08-22T09:13:20Z

db/db_impl/db_impl_compaction_flush.cc

+                break;
+              }
+            } else {
+              // TODO(cbi): there is still a race condition here where


will this still cause the source region's trim to not clean all the data beyond its range?
What if we always set final_output_level to last level (6)

This is for dynamic_level_bytes=false, using 6 will move all files to L6 which makes write amp much larger for smaller dataset.

overvenus · 2023-08-22T10:27:10Z

db/db_impl/db_impl_merge.cc

+  if (begin == nullptr && end == nullptr) {
+    return s;
+  }


What does nullptr mean? Should we return Status::InvalidArgument?

It means infinity. It's a common usage of nullptr within RocksDB codebase.

overvenus · 2023-08-22T10:38:54Z

Rest LGTM

cbi42 and others added 3 commits August 16, 2023 07:34

add check in range interface

e7d2fc3

Signed-off-by: tabokie <[email protected]>

tabokie mentioned this pull request Aug 18, 2023

raftstore-v2: fix compact range bugs that causes false positive clean tablet tikv/tikv#15332

Merged

tonyxuqqi requested review from Connor1996 and overvenus August 22, 2023 09:04

tonyxuqqi reviewed Aug 22, 2023

View reviewed changes

tonyxuqqi approved these changes Aug 22, 2023

View reviewed changes

overvenus reviewed Aug 22, 2023

View reviewed changes

tabokie requested a review from overvenus August 23, 2023 03:24

overvenus approved these changes Aug 23, 2023

View reviewed changes

tabokie merged commit fe76937 into tikv:6.29.tikv Aug 23, 2023

tabokie deleted the fix-compaction-v2 branch August 23, 2023 07:12

ti-chi-bot mentioned this pull request Aug 24, 2023

raftstore-v2: fix compact range bugs that causes false positive clean tablet (#15332) tikv/tikv#15422

Closed

tabokie mentioned this pull request Aug 24, 2023

cherry-pick compaction enhancements #345

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cherry-pick compaction enhancements and add range check function #346

Cherry-pick compaction enhancements and add range check function #346

tabokie commented Aug 16, 2023

tonyxuqqi Aug 22, 2023 •

edited

Loading

tabokie Aug 22, 2023

overvenus Aug 22, 2023

tabokie Aug 22, 2023

overvenus commented Aug 22, 2023

Cherry-pick compaction enhancements and add range check function #346

Cherry-pick compaction enhancements and add range check function #346

Conversation

tabokie commented Aug 16, 2023

tonyxuqqi Aug 22, 2023 • edited Loading

Choose a reason for hiding this comment

tabokie Aug 22, 2023

Choose a reason for hiding this comment

overvenus Aug 22, 2023

Choose a reason for hiding this comment

tabokie Aug 22, 2023

Choose a reason for hiding this comment

overvenus commented Aug 22, 2023

tonyxuqqi Aug 22, 2023 •

edited

Loading