fix tgi xeon tag #641
Conversation
Using |
Because 2.2.0 doesn't include the optimization we need, we will use this SHA-tagged version temporarily and switch to 2.3.0 after its release.
The CI issue is caused by the test structure update; it can be fixed and tested in another PR. @daisy-ycguo
* Add vllm Arc Dockerfile support
  Support vllm inference on Intel ARC GPU
  Signed-off-by: Li Gang <[email protected]>
  Co-authored-by: Chen, Hu1 <[email protected]>
* Add vLLM ARC support
  With vLLM official repo: https://github.com/vllm-project/vllm/ based on openvino backend
  Dockerfile is based on Dockerfile.openvino https://github.com/vllm-project/vllm/blob/main/Dockerfile.openvino
  And add ARC support packages
  Default mode: meta-llama/Llama-3.2-3B-Instruct to fit ARC A770 VRAM
  Signed-off-by: Li Gang <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Add README and .github workflow for vLLM ARC support
  Signed-off-by: Li Gang <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Update comps/llms/text-generation/vllm/langchain/README.md
  Co-authored-by: Eero Tamminen <[email protected]>
* Rename Dockerfile to meet Contribution Guidelines
  Signed-off-by: Li Gang <[email protected]>
* Align image names as opea/vllm-arc:latest
  Signed-off-by: Li Gang <[email protected]>
---------
Signed-off-by: Li Gang <[email protected]>
Co-authored-by: Chen, Hu1 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eero Tamminen <[email protected]>
Description
fix tgi xeon tag
Issues
#636
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
None
Tests
All unit tests (UT)