-
Notifications
You must be signed in to change notification settings - Fork 539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[LLM] Llama 3.1 serving example #3780
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome @romilbhardwaj, some quick comments.
- add to docs & repo README's LLM list too, for both finetuning & serving?
@@ -39,6 +39,7 @@ Contents | |||
DBRX (Databricks) <llms/dbrx> | |||
Llama-2 (Meta) <llms/llama-2> | |||
Llama-3 (Meta) <llms/llama-3> | |||
Llama-3.1 (Meta) <llms/llama-3_1> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should add to docs home page's LLM list too.
Thanks, resolved comments. |
llm/llama-3_1/README.md
Outdated
|
||
## Serving Llama 3.1 on your infra | ||
|
||
We will first test the model on a GPU dev node, then package it for deployment using SkyPilot. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we have highlighted sentence for the complete deployment YAML so people can directly jump to that section?
We will first test the model on a GPU dev node, then package it for deployment using SkyPilot. | |
We offer a step-by-step guide for how to use SkyPilot to test a new model on a GPU dev node, and then package it for deployment using SkyPilot. **For the complete deployment guide of Llama 3.1, see [Step 3: Package and deploy using SkyPilot](#Step-3-Package-and-deploy-using-SkyPilot) |
Tested (run the relevant ones):
bash format.sh
pytest tests/test_smoke.py
pytest tests/test_smoke.py::test_fill_in_the_name
conda deactivate; bash -i tests/backward_compatibility_tests.sh