
feat(benchmark): tui based benchmarking tool #149

Merged: 15 commits, Mar 30, 2023
Conversation

OlivierDehaene (Member) commented Mar 30, 2023

[image]

cc: @njhill

@OlivierDehaene OlivierDehaene merged commit 610bb1f into main Mar 30, 2023
@OlivierDehaene OlivierDehaene deleted the feat/benchmark_tool branch March 30, 2023 13:26
njhill (Contributor) commented Mar 31, 2023

Thanks @OlivierDehaene, looks great! I haven't had a chance to look at it closely yet but will soon.

OlivierDehaene (Member, Author) commented

Thanks!
I find it very useful for comparing modeling code, or seeing how a model behaves with different numbers of shards. It also helps to see the expected latency in worst-case scenarios (maximum expected context size, maximum decode tokens) and to pick the largest max batch size that stays below a latency threshold.
However, it is not perfect: the batches never contain any padding, which makes some models look better than they would in practice. It's a first version.
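The batch-size selection described above can be sketched as follows. This is a minimal illustration, not part of the tool itself: the latency table and helper name are hypothetical, standing in for worst-case latencies the benchmark would measure at maximum context size and maximum decode tokens.

```python
# Hypothetical worst-case per-batch latencies (ms), as the benchmark
# might report at max context size and max decode tokens.
measured_latency_ms = {1: 45.0, 2: 52.0, 4: 68.0, 8: 95.0, 16: 160.0, 32: 290.0}

def pick_max_batch_size(latencies_ms, threshold_ms):
    """Return the largest batch size whose worst-case latency is below the threshold,
    or None if no batch size qualifies."""
    eligible = [bs for bs, lat in sorted(latencies_ms.items()) if lat < threshold_ms]
    return eligible[-1] if eligible else None

# With a 100 ms budget, batch size 8 (95 ms) fits but 16 (160 ms) does not.
print(pick_max_batch_size(measured_latency_ms, 100.0))
```

Because the benchmark's batches carry no padding, real-world latencies for ragged batches may be higher, so a safety margin on the threshold is prudent.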
