Update throughput-latency plot script #881

lekurile · 2024-03-28T22:47:36Z

This PR updates the plot_th_lat.py throughput-latency plot generation script to remove the concept of a backend (aml, fastgen, vllm) and generalize for any result output directory, irrespective of where it was run.

The PR also introduces the concept of an optional plot_config.yaml that resides within each result directory and allows for overrides in the plot formatting. An example config file may look like this:

label: "vLLM"
color: "purple"
marker: "o"
linestyle: "--"
polyfit_degree: 0
x_max : 30
y_max : 10

Each of the config parameters is optional, allowing for override of only the specific plot aspects required, however all parameters may also be provided.

A few nuances for the polyfit_degree and x/y_max parameters:

polyfit_degree: Specifies the polynomial degree for the 'best fit line'. Specifying 0 removes the best fit line and simply connects the scatter plot points.
x/y_max: Clips the x or y axis data using the specified value as the upper bound.

An example command executing the script may look something like this:

python3 src/plot_th_lat.py --data_dirs ./results/results-* --model_name <plot_model_title>

Or each result directory can be enumerated explicitly:

python3 src/plot_th_lat.py --data_dirs ./results/results-1 ./results/results-2 ./results/results-3 --model_name <plot_model_title>

mrwyattii

Please address my comments before merging, but LGTM!

benchmarks/inference/mii/README.md

benchmarks/inference/mii/plot_config.yaml

lekurile added 4 commits March 26, 2024 22:52

Generalize plotting script

a347ff1

Change plot config to YAML

b8ccc56

Add kwargs, add x_max, add plot type, etc

06f9f21

Merge branch 'master' into lekurile/update_plot_scripts

17cab2f

lekurile requested review from HeyangQin and umchand March 28, 2024 22:47

lekurile requested review from tjruwase, conglongli, awan-10, eltonzheng, duli2012, mrwyattii, arashb and xiaoxiawu-microsoft as code owners March 28, 2024 22:47

lekurile and others added 5 commits March 28, 2024 23:43

Remove log_dir, only use data_dir

4c04c12

Update README

0e91cd7

Add backwards compatibility with existing scripts

a903cf3

Merge branch 'master' into lekurile/update_plot_scripts

65a0417

don't create a new dir for backend, instead concatenate to out_json_dir

fca8312

mrwyattii approved these changes Apr 2, 2024

View reviewed changes

benchmarks/inference/mii/README.md Outdated Show resolved Hide resolved

benchmarks/inference/mii/plot_config.yaml Outdated Show resolved Hide resolved

Update plotting script

4234d78

lekurile merged commit fab5d06 into master Apr 9, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update throughput-latency plot script #881

Update throughput-latency plot script #881

lekurile commented Mar 28, 2024 •

edited

Loading

mrwyattii left a comment

Update throughput-latency plot script #881

Update throughput-latency plot script #881

Conversation

lekurile commented Mar 28, 2024 • edited Loading

mrwyattii left a comment

Choose a reason for hiding this comment

lekurile commented Mar 28, 2024 •

edited

Loading