Added the microservice of vLLM #78
Conversation
Please help review; I look forward to your feedback. Thanks. @Jian-Zhang @xuechendi
This PR has addressed the above comments and is ready to merge. Please help review. Thanks. @lvliang-intel @hshen14 @Jian-Zhang
Force-pushed from 055ae40 to 23d3b63
From the PR title, only the LLM microservice based on vLLM was added, but the code in fact also includes a Ray Serve version, right? It would also be better to move 'text-generation/ray_serve/docker' up a level, as we did for vLLM and the other examples.
Another question: why is there a requirements.txt in the 'text-generation/ray_serve/docker' folder? Would it be feasible to move it into the Dockerfile?
@ftian1 Thanks for your comments. I only renamed the Ray Serve component rather than changing its implementation; the Ray Serve microservice itself is not part of this PR and will be added in a follow-up PR. The code in the 'docker' folder under ray_serve is a self-built Docker image whose only purpose is to launch the Ray Serve LLM engine, unlike vLLM/TGI, whose images can be built from their official GitHub repositories. The requirements.txt under ray_serve is needed to build the Ray Serve engine image (not the Ray LangChain microservice), so I kept it under the docker folder as before.
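For reference, folding the engine dependencies into the image build, as suggested above, could look roughly like the sketch below. This is a hypothetical illustration only: the base image, package list, and entrypoint script are assumptions, not the actual contents of the requirements.txt or Dockerfile in this PR.

```dockerfile
# Hypothetical sketch only (not the code in this PR): replacing a COPY'd
# requirements.txt with an inline pip install during the image build.
FROM python:3.10-slim

# Install the Ray Serve engine dependencies directly in the image
# (package list is an illustrative assumption).
RUN pip install --no-cache-dir "ray[serve]" transformers torch

WORKDIR /app
# serve.py is a placeholder name for the script that launches the Ray Serve LLM engine.
COPY serve.py /app/serve.py
CMD ["python", "serve.py"]
```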
Force-pushed from 21b51da to 122560a
This PR is ready to merge. Would you please help review it? Thanks. @ftian1 @lvliang-intel
looks good to me
Force-pushed from af2a15b to 1782801
* refine the vllm microservice
* [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)
* rename the rayllm to ray_serve
* refactor the ray service code structure
* refine the vllm and readme
* update the readme with correct ray service name
* update the readme
* refine the readme
* [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)

Signed-off-by: tianyil1 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: V, Ganesan <[email protected]>
Description
This PR mainly updates the microservice wrapper for vLLM to align it with the TGI one, renames the Ray backend service, and adds vLLM and Ray introductions to the README.
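For context, a LangChain-based wrapper that forwards requests to a vLLM server exposing the OpenAI-compatible API, in the spirit of the existing TGI microservice, could look roughly like the sketch below. The port, model name, route, and request schema are illustrative assumptions, not the actual code added in this PR.

```python
# Minimal sketch (not the code in this PR): a FastAPI wrapper that forwards
# requests to a vLLM backend through LangChain's VLLMOpenAI client.
# The port, model name, and route are illustrative assumptions.
from fastapi import FastAPI
from langchain_community.llms import VLLMOpenAI
from pydantic import BaseModel

app = FastAPI()

class GenerateRequest(BaseModel):
    query: str
    max_new_tokens: int = 128

@app.post("/v1/chat/completions")
def generate(req: GenerateRequest):
    # Point LangChain at the vLLM serving endpoint (assumed to run on port 8008).
    llm = VLLMOpenAI(
        openai_api_key="EMPTY",
        openai_api_base="http://localhost:8008/v1",
        model_name="meta-llama/Llama-2-7b-chat-hf",
        max_tokens=req.max_new_tokens,
    )
    return {"text": llm.invoke(req.query)}
```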
Issues
n/a
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
No newly introduced 3rd party dependency.
Tests
This PR was tested on a Gaudi2 server with:
2 sockets of Intel(R) Xeon(R) Platinum 8368 CPU @ 2.40GHz
8 Gaudi nodes, HL-SMI Version: hl-1.14.0-fw-48.0.1.0 Driver Version: 1.14.0-9e8ecf8
Both of the following serving paths were tested successfully in this environment (see the sketch after the list):
vLLM backend serving
Langchain vLLM serving
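For reproducibility, the two serving paths listed above can be smoke-tested roughly as in the sketch below. The host, port, and model name are assumptions; substitute the values used in the actual deployment.

```python
# Hypothetical smoke test for the two serving paths listed above.
# Host, port, and model name are assumptions, not values from this PR.
import requests
from langchain_community.llms import VLLMOpenAI

VLLM_ENDPOINT = "http://localhost:8008"
MODEL = "meta-llama/Llama-2-7b-chat-hf"

# 1) vLLM backend serving: call the OpenAI-compatible completions API directly.
resp = requests.post(
    f"{VLLM_ENDPOINT}/v1/completions",
    json={"model": MODEL, "prompt": "What is deep learning?", "max_tokens": 32},
    timeout=60,
)
print(resp.json()["choices"][0]["text"])

# 2) LangChain vLLM serving: go through LangChain's VLLMOpenAI wrapper.
llm = VLLMOpenAI(
    openai_api_key="EMPTY",
    openai_api_base=f"{VLLM_ENDPOINT}/v1",
    model_name=MODEL,
    max_tokens=32,
)
print(llm.invoke("What is deep learning?"))
```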