-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* adding notebooks for V7 LMI (#270) Co-authored-by: “Vivek <“[email protected]”> * feat: jumpstart llm load test benchmarking (#277) * feat: initial jumpstart llm benchmarking commit * fix: throughput robustness and pricing * feat: jumpstart benchmarking, deploy endpoint and model specs * feat: jumpstart benchmarking generalize concurrency probe * fix: jumpstart latency benchmarking logging changes * fix: adjust throughput computations * fix: variety of cleanup to jumpstart llm benchmarking * chore: clean up notebooks * chore: black * chore: cleanup jumpstart inference benchmarking * fix: adjust error logging in concurrency probe * chore: concurrency probe finalization * chore: clean up notebooks * chore: add tranformers requirement install * chore: black * chore: grammar * chore: pip install change * chore: adjust load metrics for missing token statistics * [minor updates] * pinned version for image generation notebook (#282) * pinned version for image generation notebook * re-triggered test * pinned version * dummy commit * code formatting fix (#287) * Badges vivek (#292) * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] * [adding badges for the lmi v7 notebooks] --------- Co-authored-by: “Vivek <“[email protected]”> * fix the badge issue (#289) Co-authored-by: Harsha Reddy <[email protected]> --------- Co-authored-by: Vivek Gangasani <[email protected]> Co-authored-by: “Vivek <“[email protected]”> Co-authored-by: ulrichkr <[email protected]> Co-authored-by: vivekmadan2 <[email protected]> Co-authored-by: xieyongliang <[email protected]> Co-authored-by: Harsha Reddy <[email protected]>
- Loading branch information
1 parent
4abe8c5
commit a09ee0b
Showing
20 changed files
with
3,837 additions
and
48 deletions.
There are no files selected for viewing
1,182 changes: 1,182 additions & 0 deletions
1,182
inference/generativeai/llm-workshop/deploy-V7-lmi/llama2_70b-lmi-trtllm.ipynb
Large diffs are not rendered by default.
Oops, something went wrong.
1,184 changes: 1,184 additions & 0 deletions
1,184
inference/generativeai/llm-workshop/deploy-V7-lmi/llama2_70b_lmi_v7.ipynb
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.