forked from opea-project/GenAIComps
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add LVMs LLaVA component (opea-project#181)
* add img2txt microservice Signed-off-by: Spycsh <[email protected]> * fix * rem * fix * fix * debug * add debug * add standalone llava server to avoid tokenization in diff process issue * add another test * endpoint * debug * rem * fix * disable hpu graph * dockernize * add test * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * rem nohup * fix * refactor img2txt to lvms * Update test_reranks_langchain.sh * fix test name * rename * lower wait time --------- Signed-off-by: Spycsh <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <[email protected]> Signed-off-by: sharanshirodkar7 <[email protected]>
- Loading branch information
1 parent
e88b32c
commit c9c229d
Showing
17 changed files
with
498 additions
and
3 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -16,6 +16,7 @@ | |
TextDoc, | ||
RAGASParams, | ||
RAGASScores, | ||
LVMDoc, | ||
) | ||
|
||
# Constants | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
FROM python:3.11-slim | ||
|
||
# Set environment variables | ||
ENV LANG=en_US.UTF-8 | ||
|
||
COPY comps /home/comps | ||
|
||
RUN pip install --no-cache-dir --upgrade pip && \ | ||
pip install --no-cache-dir -r /home/comps/lvms/requirements.txt | ||
|
||
ENV PYTHONPATH=$PYTHONPATH:/home | ||
|
||
WORKDIR /home/comps/lvms | ||
|
||
ENTRYPOINT ["python", "lvm.py"] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,111 @@ | ||
# LVM Microservice | ||
|
||
Visual Question and Answering is one of the multimodal tasks empowered by LVMs (Large Visual Models). This microservice supports visual Q&A by using LLaVA as the base large visual model. It accepts two inputs: a prompt and an image. It outputs the answer to the prompt about the image. | ||
|
||
# 🚀1. Start Microservice with Python (Option 1) | ||
|
||
## 1.1 Install Requirements | ||
|
||
```bash | ||
pip install -r requirements.txt | ||
``` | ||
|
||
## 1.2 Start LLaVA Service/Test | ||
|
||
- Xeon CPU | ||
|
||
```bash | ||
# Start LLaVA service | ||
cd llava/ | ||
nohup python llava_server.py --device=cpu & | ||
# Wait until the server is up | ||
# Test | ||
python check_llava_server.py | ||
``` | ||
|
||
- Gaudi2 HPU | ||
|
||
```bash | ||
pip install optimum[habana] | ||
``` | ||
|
||
```bash | ||
cd llava/ | ||
# Start LLaVA service | ||
nohup python llava_server.py & | ||
# Test | ||
python check_llava_server.py | ||
``` | ||
|
||
## 1.3 Start Image To Text Service/Test | ||
|
||
```bash | ||
cd .. | ||
# Start the OPEA Microservice | ||
python lvm.py | ||
# Test | ||
python check_lvm.py | ||
``` | ||
|
||
# 🚀1. Start Microservice with Docker (Option 2) | ||
|
||
## 1.2 Build Images | ||
|
||
### 1.2.1 LLaVA Server Image | ||
|
||
- Xeon CPU | ||
|
||
```bash | ||
cd ../.. | ||
docker build -t opea/llava:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/lvms/llava/Dockerfile . | ||
``` | ||
|
||
- Gaudi2 HPU | ||
|
||
```bash | ||
cd ../.. | ||
docker build -t opea/llava:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/lvms/llava/Dockerfile_hpu . | ||
``` | ||
|
||
### 1.2.2 LVM Service Image | ||
|
||
```bash | ||
cd ../.. | ||
docker build -t opea/lvm:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/lvms/Dockerfile . | ||
``` | ||
|
||
## 1.3 Start LLaVA and LVM Service | ||
|
||
### 1.3.1 Start LLaVA server | ||
|
||
- Xeon | ||
|
||
```bash | ||
docker run -p 8399:8399 -e http_proxy=$http_proxy --ipc=host -e https_proxy=$https_proxy opea/llava:latest | ||
``` | ||
|
||
- Gaudi2 HPU | ||
|
||
```bash | ||
docker run -p 8399:8399 --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy opea/llava:latest | ||
``` | ||
|
||
### 1.3.2 Start LVM service | ||
|
||
```bash | ||
ip_address=$(hostname -I | awk '{print $1}') | ||
|
||
docker run -p 9399:9399 --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy -e LVM_ENDPOINT=http://$ip_address:8399 opea/lvm:latest | ||
``` | ||
|
||
### 1.3.3 Test | ||
|
||
```bash | ||
# Use curl/python | ||
|
||
# curl | ||
http_proxy="" curl http://localhost:9399/v1/lvm -XPOST -d '{"image": "iVBORw0KGgoAAAANSUhEUgAAAAoAAAAKCAYAAACNMs+9AAAAFUlEQVR42mP8/5+hnoEIwDiqkL4KAcT9GO0U4BxoAAAAAElFTkSuQmCC", "prompt":"What is this?"}' -H 'Content-Type: application/json' | ||
|
||
# python | ||
python check_lvm.py | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
|
||
import base64 | ||
import json | ||
from io import BytesIO | ||
|
||
import PIL.Image | ||
import requests | ||
|
||
image_path = "https://avatars.githubusercontent.com/u/39623753?v=4" | ||
|
||
image = PIL.Image.open(requests.get(image_path, stream=True, timeout=3000).raw) | ||
buffered = BytesIO() | ||
image.save(buffered, format="PNG") | ||
img_b64_str = base64.b64encode(buffered.getvalue()).decode() | ||
|
||
endpoint = "http://localhost:9399/v1/lvm" | ||
inputs = {"image": img_b64_str, "prompt": "What is this?", "max_new_tokens": 32} | ||
response = requests.post(url=endpoint, data=json.dumps(inputs), proxies={"http": None}) | ||
print(response.json()) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
FROM python:3.11-slim | ||
|
||
# Set environment variables | ||
ENV LANG=en_US.UTF-8 | ||
ENV PYTHONPATH=/home/user:/usr/lib/habanalabs/:/optimum-habana | ||
|
||
COPY comps /home/comps | ||
|
||
RUN pip install --no-cache-dir --upgrade pip && \ | ||
pip install --no-cache-dir -r /home/comps/lvms/requirements.txt | ||
|
||
ENV PYTHONPATH=$PYTHONPATH:/home | ||
|
||
WORKDIR /home/comps/lvms/llava | ||
|
||
ENTRYPOINT ["python", "llava_server.py", "--device", "cpu"] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
# HABANA environment | ||
FROM vault.habana.ai/gaudi-docker/1.14.0/ubuntu22.04/habanalabs/pytorch-installer-2.1.1 AS hpu | ||
RUN rm -rf /etc/ssh/ssh_host* | ||
|
||
# Set environment variables | ||
ENV LANG=en_US.UTF-8 | ||
ENV PYTHONPATH=/home/user:/usr/lib/habanalabs/:/optimum-habana | ||
|
||
COPY comps /home/comps | ||
|
||
# Install requirements and optimum habana | ||
RUN pip install --no-cache-dir --upgrade pip && \ | ||
pip install --no-cache-dir -r /home/comps/lvms/requirements.txt && \ | ||
pip install optimum[habana] | ||
|
||
ENV PYTHONPATH=$PYTHONPATH:/home | ||
|
||
WORKDIR /home/comps/lvms/llava | ||
|
||
ENTRYPOINT ["python", "llava_server.py"] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
|
||
import base64 | ||
import json | ||
from io import BytesIO | ||
|
||
import PIL.Image | ||
import requests | ||
|
||
image_path = "https://avatars.githubusercontent.com/u/39623753?v=4" | ||
|
||
image = PIL.Image.open(requests.get(image_path, stream=True, timeout=3000).raw) | ||
buffered = BytesIO() | ||
image.save(buffered, format="PNG") | ||
img_b64_str = base64.b64encode(buffered.getvalue()).decode() | ||
|
||
endpoint = "http://localhost:8399/generate" | ||
inputs = {"img_b64_str": img_b64_str, "prompt": "What is this?", "max_new_tokens": 32} | ||
response = requests.post(url=endpoint, data=json.dumps(inputs), proxies={"http": None}) | ||
print(response.json()) |
Oops, something went wrong.