Add vLLM ARC support with OpenVINO backend (#641)
* Add vllm Arc Dockerfile support
  Support vllm inference on Intel ARC GPU
  Signed-off-by: Li Gang <[email protected]>
  Co-authored-by: Chen, Hu1 <[email protected]>
* Add vLLM ARC support
  With vLLM official repo: https://github.com/vllm-project/vllm/ based on openvino backend
  Dockerfile is based on Dockerfile.openvino https://github.com/vllm-project/vllm/blob/main/Dockerfile.openvino
  And add ARC support packages
  Default mode: meta-llama/Llama-3.2-3B-Instruct to fit ARC A770 VRAM
  Signed-off-by: Li Gang <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Add README and .github workflow for vLLM ARC support
  Signed-off-by: Li Gang <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Update comps/llms/text-generation/vllm/langchain/README.md
  Co-authored-by: Eero Tamminen <[email protected]>
* Rename Dockerfile to meet Contribution Guidelines
  Signed-off-by: Li Gang <[email protected]>
* Align image names as opea/vllm-arc:latest
  Signed-off-by: Li Gang <[email protected]>

---------

Signed-off-by: Li Gang <[email protected]>
Co-authored-by: Chen, Hu1 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eero Tamminen <[email protected]>
1 parent 617e119, commit a2b9d95
Showing 6 changed files with 133 additions and 16 deletions.
43 changes: 43 additions & 0 deletions
comps/llms/text-generation/vllm/langchain/dependency/Dockerfile.intel_gpu
@@ -0,0 +1,43 @@
# The vLLM Dockerfile is used to construct vLLM image that can be directly used
# to run the OpenAI compatible server.
# Based on https://github.com/vllm-project/vllm/blob/main/Dockerfile.openvino
# add Intel ARC support package

FROM ubuntu:22.04 AS dev

RUN apt-get update -y && \
    apt-get install -y \
    git python3-pip \
    ffmpeg libsm6 libxext6 libgl1 \
    gpg-agent wget

RUN wget -qO - https://repositories.intel.com/gpu/intel-graphics.key | gpg --yes --dearmor --output /usr/share/keyrings/intel-graphics.gpg && \
    echo "deb [arch=amd64 signed-by=/usr/share/keyrings/intel-graphics.gpg] https://repositories.intel.com/gpu/ubuntu jammy/lts/2350 unified" | \
    tee /etc/apt/sources.list.d/intel-gpu-jammy.list &&\
    apt update -y &&\
    apt install -y \
    intel-opencl-icd intel-level-zero-gpu level-zero \
    intel-media-va-driver-non-free libmfx1 libmfxgen1 libvpl2 \
    libegl-mesa0 libegl1-mesa libegl1-mesa-dev libgbm1 libgl1-mesa-dev libgl1-mesa-dri \
    libglapi-mesa libgles2-mesa-dev libglx-mesa0 libigdgmm12 libxatracker2 mesa-va-drivers \
    mesa-vdpau-drivers mesa-vulkan-drivers va-driver-all vainfo hwinfo clinfo

WORKDIR /workspace

RUN git clone -b v0.6.3.post1 https://github.com/vllm-project/vllm.git

#ARG GIT_REPO_CHECK=0
#RUN --mount=type=bind,source=.git,target=.git \
# if [ "$GIT_REPO_CHECK" != 0 ]; then bash tools/check_repo.sh ; fi

# install build requirements
RUN PIP_EXTRA_INDEX_URL="https://download.pytorch.org/whl/cpu" python3 -m pip install -r /workspace/vllm/requirements-build.txt
# build vLLM with OpenVINO backend
RUN PIP_EXTRA_INDEX_URL="https://download.pytorch.org/whl/cpu" VLLM_TARGET_DEVICE="openvino" python3 -m pip install /workspace/vllm/

#COPY examples/ /workspace/vllm/examples
#COPY benchmarks/ /workspace/vllm/benchmarks


CMD ["/bin/bash"]
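For orientation, the Dockerfile above could be built and started roughly as follows. This is a sketch under stated assumptions, not the workflow shipped in the commit's README: the image name `opea/vllm-arc:latest` and the model come from the commit message, while the host port, the `--device /dev/dri` pass-through, the `VLLM_OPENVINO_DEVICE=GPU` setting, and the `VLLM_ARC_EXECUTE` dry-run switch are illustrative choices made here. By default the script only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Sketch: build the vLLM ARC image and start the OpenAI-compatible server.
set -euo pipefail

IMAGE="opea/vllm-arc:latest"               # image name from the commit message
DOCKERFILE="comps/llms/text-generation/vllm/langchain/dependency/Dockerfile.intel_gpu"
MODEL="meta-llama/Llama-3.2-3B-Instruct"   # model named in the commit message
PORT="${PORT:-8000}"                       # assumption: host port to publish

# Print the command; execute it only when VLLM_ARC_EXECUTE=1
# (a hypothetical switch used by this sketch, not part of vLLM or Docker).
run() {
  printf '%s\n' "$*"
  if [ "${VLLM_ARC_EXECUTE:-0}" = "1" ]; then
    "$@"
  fi
}

# The Dockerfile copies nothing from the build context, so "." is sufficient.
run docker build -f "$DOCKERFILE" -t "$IMAGE" .

# Assumptions: the GPU is exposed through /dev/dri, and the OpenVINO backend
# is pointed at it with VLLM_OPENVINO_DEVICE=GPU. The image's default CMD is
# /bin/bash, so the server command is passed explicitly.
run docker run --rm --device /dev/dri -p "${PORT}:8000" \
  -e VLLM_OPENVINO_DEVICE=GPU \
  "$IMAGE" \
  python3 -m vllm.entrypoints.openai.api_server \
  --model "$MODEL" --host 0.0.0.0 --port 8000
```

Run it from the repository root with `VLLM_ARC_EXECUTE=1` to actually build and start the container. The Llama 3.2 weights are gated on Hugging Face, so the container would also need an access token to download them.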