Refine algorithm for WriteAmpBasedRateLimiter #213

Merged
merged 69 commits into tikv:6.4.tikv from limiterv3-pr on Dec 18, 2020

Conversation

@tabokie (Member) commented Nov 23, 2020

Bugfix

Only compaction triggers the auto-tuner to collect the data needed for training the rate limit. When compaction frequency is low, data from a long period of time is fused into one sample, causing inaccurate estimation. Fix this issue by looping through the missing timeslices (a sketch follows this list).

The recent window size (10s) is too small; enlarge it to 30s.
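
As an illustration of the timeslice fix, here is a minimal sketch. It is not the actual WriteAmpBasedRateLimiter code; `FlowSampler`, `kSliceMs`, and `kWindowSlices` are made-up names, and the constants are assumptions.

```cpp
#include <cstddef>
#include <cstdint>
#include <deque>

constexpr int64_t kSliceMs = 1000;    // hypothetical sampling granularity
constexpr size_t kWindowSlices = 30;  // ~30s recent window, per this PR

struct FlowSampler {
  std::deque<int64_t> window;  // per-slice compaction bytes
  int64_t last_slice_ms = 0;

  // Record `bytes` observed at `now_ms`. Slices that elapsed with no
  // compaction are back-filled as zero samples, instead of fusing the whole
  // idle gap into the next (oversized) sample.
  void Record(int64_t now_ms, int64_t bytes) {
    while (now_ms - last_slice_ms >= 2 * kSliceMs) {
      Push(0);
      last_slice_ms += kSliceMs;
    }
    Push(bytes);
    last_slice_ms = now_ms;
  }

  void Push(int64_t v) {
    window.push_back(v);
    if (window.size() > kWindowSlices) window.pop_front();
  }
};
```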

Better support for low-pressure scenarios

Before this PR, flush flow was padded to 20MB/s, which kept the rate limit always above 28MB/s. After removing this restriction, we noticed that pending bytes accumulate more easily under low pressure. Adjust the padding calculation to partially resolve this problem (see the sketch below).

Note that with the new formula, the minimal rate limit is still around 28MB/s.
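
The padding change can be pictured roughly as below. The helper name and the constants/formula are assumptions for illustration, not the code merged here; the point is only that the flush component is no longer clamped up to a fixed 20MB/s floor.

```cpp
#include <algorithm>
#include <cstdint>

constexpr int64_t kMB = 1024 * 1024;

// Hypothetical padding helper: instead of forcing flush flow up to 20MB/s,
// derive the padding from observed compaction flow (with a much smaller
// static floor), so the limit can drop further when write pressure is low.
int64_t PaddedFlushFlow(int64_t flush_bytes_per_sec,
                        int64_t compaction_bytes_per_sec) {
  // Old behavior, for contrast: std::max(flush_bytes_per_sec, 20 * kMB).
  int64_t padding = std::max<int64_t>(compaction_bytes_per_sec / 10, 2 * kMB);
  return flush_bytes_per_sec + padding;
}
```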

Control reshuffle

Remove the long-term sampler and instead enlarge the window of the short-term sampler. Reduce the use of `ratio_delta`, which often causes unnecessary jitter. With the algorithm simplified, the actual limit can now be deduced with the Prometheus expression `sum(rate(tikv_engine_compaction_flow_bytes{instance=~"$instance", db="kv", type="bytes_written"}[5m]))`.
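
A rough sketch of the sampler consolidation, assuming a plain moving average over the enlarged window (the actual smoother in the limiter may differ; names here are illustrative):

```cpp
#include <cstddef>
#include <cstdint>
#include <deque>
#include <numeric>

// One moving-average sampler over a larger window, standing in for the former
// short-term/long-term sampler pair.
class WindowAverage {
 public:
  explicit WindowAverage(size_t slices) : slices_(slices) {}

  // Feed one per-slice observation and return the smoothed flow.
  int64_t Observe(int64_t bytes_per_sec) {
    samples_.push_back(bytes_per_sec);
    if (samples_.size() > slices_) samples_.pop_front();
    int64_t sum =
        std::accumulate(samples_.begin(), samples_.end(), int64_t{0});
    return sum / static_cast<int64_t>(samples_.size());
  }

 private:
  size_t slices_;
  std::deque<int64_t> samples_;
};
```

With a single smoothed signal driving the limit, the Prometheus expression above (the 5m rate of compaction write flow) tracks roughly the same quantity the limiter converges to.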

@tabokie merged commit 7d209a8 into tikv:6.4.tikv on Dec 18, 2020
@tabokie deleted the limiterv3-pr branch on Dec 23, 2020
@tabokie mentioned this pull request on May 9, 2022 (39 tasks)
tabokie added a commit to tabokie/rocksdb that referenced this pull request May 10, 2022
* Bugfix
Only compaction triggers the auto-tuner to collect the data needed for training the rate limit. When compaction frequency is low, data from a long period of time is fused into one sample, causing inaccurate estimation. Fix this issue by looping through the missing timeslices.
The recent window size (10s) is too small; enlarge it to 30s.

* Better support for low-pressure scenarios
Before this PR, flush flow was padded to 20MB/s, which kept the rate limit always above 28MB/s. After removing this restriction, we noticed that pending bytes accumulate more easily under low pressure. Adjust the padding calculation to partially resolve this problem.
Note that with the new formula, the minimal rate limit is still around 28MB/s.

* Control reshuffle
Remove the long-term sampler and instead enlarge the window of the short-term sampler. Reduce the use of `ratio_delta`, which often causes unnecessary jitter. With the algorithm simplified, the actual limit can now be deduced with the Prometheus expression `sum(rate(tikv_engine_compaction_flow_bytes{instance=~"$instance", db="kv", type="bytes_written"}[5m]))`.

* Normal pace up
Add a normal pace-up in addition to the critical pace-up to reduce the pending-bytes issue (see the sketch after this commit message).

Signed-off-by: tabokie <[email protected]>
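
For the "Normal pace up" item above, a hedged sketch of what a two-tier pace-up could look like; the thresholds and multipliers are placeholders, not the values used in the commit.

```cpp
#include <cstdint>

constexpr int64_t kGB = 1024LL * 1024 * 1024;

// Illustrative two-tier pace up: the critical tier already existed; a milder
// "normal" tier kicks in earlier so pending compaction bytes are worked off
// before they become critical. Thresholds and factors are placeholders.
int64_t PaceUp(int64_t rate_limit, int64_t pending_compaction_bytes) {
  if (pending_compaction_bytes > 100 * kGB) {
    return rate_limit * 2;      // critical pace up
  }
  if (pending_compaction_bytes > 20 * kGB) {
    return rate_limit * 5 / 4;  // normal pace up
  }
  return rate_limit;
}
```
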
tabokie added a commit that referenced this pull request May 11, 2022