Update throughput-latency plot script #881
Merged
This PR updates the `plot_th_lat.py` throughput-latency plot generation script to remove the concept of a backend (aml, fastgen, vllm) and generalize it to any result output directory, irrespective of where it was run.

The PR also introduces the concept of an optional `plot_config.yaml` that resides within each result directory and allows for overrides in the plot formatting. An example config file may look like this:

Each of the config parameters is optional, so only the specific plot aspects that need changing have to be overridden; all parameters may also be provided.
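As a rough illustration of the "every parameter is optional" behavior, the sketch below merges a set of overrides onto defaults. It is not taken from `plot_th_lat.py`: the function name `resolve_plot_config`, the `DEFAULT_PLOT_CONFIG` table, and all default values are assumptions made for this example. Only the parameter names `polyfit_degree`, `x_max`, and `y_max` come from the PR description.

```python
# Hypothetical defaults; the real script's defaults are not stated in this PR.
DEFAULT_PLOT_CONFIG = {
    "polyfit_degree": 3,  # assumed default degree for the best fit line
    "x_max": None,        # None means "no upper bound on the x axis data"
    "y_max": None,        # None means "no upper bound on the y axis data"
}


def resolve_plot_config(overrides=None):
    """Return the defaults with any provided overrides applied on top.

    `overrides` stands in for the mapping parsed from a result directory's
    plot_config.yaml; it may be None or empty when no config file exists.
    """
    return {**DEFAULT_PLOT_CONFIG, **(overrides or {})}


# Overriding a single parameter leaves the others at their defaults.
config = resolve_plot_config({"x_max": 100})
```

In practice the `overrides` mapping would come from parsing `plot_config.yaml` with a YAML library; that step is left out so the sketch has no third-party dependency.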
A few nuances for the `polyfit_degree` and `x/y_max` parameters:

- `polyfit_degree`: Specifies the polynomial degree for the 'best fit line'. Specifying `0` removes the best fit line and simply connects the scatter plot points.
- `x/y_max`: Clips the x or y axis data using the specified value as the upper bound.

An example command executing the script may look something like this:
Or each result directory can be enumerated explicitly:
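
Separately from the commands above, the `polyfit_degree` and `x/y_max` semantics described earlier can be sketched in a few lines of Python. This is one possible reading, not the script's actual code: the helper names `clip_points` and `fit_curve` are hypothetical, "clips" is interpreted here as dropping points above the bound, and connecting points in order of increasing x is likewise an assumption. The sketch relies on NumPy's `polyfit`, `polyval`, and `linspace`.

```python
import numpy as np


def clip_points(xs, ys, x_max=None, y_max=None):
    """Keep only points at or below the optional upper bounds (assumed meaning of 'clips')."""
    kept = [
        (x, y)
        for x, y in zip(xs, ys)
        if (x_max is None or x <= x_max) and (y_max is None or y <= y_max)
    ]
    return [x for x, _ in kept], [y for _, y in kept]


def fit_curve(xs, ys, polyfit_degree, num_samples=50):
    """Return the (x, y) values of the line to draw through the scatter points."""
    if polyfit_degree == 0:
        # No best fit line: simply connect the points, here in order of increasing x.
        pairs = sorted(zip(xs, ys))
        return [x for x, _ in pairs], [y for _, y in pairs]
    # Least-squares polynomial of the requested degree, sampled across the x range.
    coeffs = np.polyfit(xs, ys, polyfit_degree)
    fit_x = np.linspace(min(xs), max(xs), num_samples)
    return fit_x.tolist(), np.polyval(coeffs, fit_x).tolist()
```

Clipping before fitting means outliers beyond `x_max` or `y_max` do not influence the best fit line; whether the real script applies the two steps in that order is not stated in the PR.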