Add vLLM support to DocSum Helm chart #649

Draft · wants to merge 2 commits into main

Conversation

@eero-t (Contributor) commented Dec 18, 2024

Description

This continues the Helm vLLM support added in #610 by adding vLLM support to the DocSum Helm chart.

(Similar to how it is already done for the ChatQnA app and the Agent component, there are tgi.enabled and vllm.enabled flags for selecting which LLM serving backend is used; see the sketch below.)
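
For illustration, enabling vLLM instead of TGI would look roughly like the following values override. Only the tgi.enabled and vllm.enabled flags come from this PR; the file name and everything else here are assumptions, and the chart's actual defaults may differ:

  # hypothetical my-vllm-values.yaml for the DocSum chart (sketch only)
  tgi:
    enabled: false   # disable the TGI serving backend
  vllm:
    enabled: true    # serve the summarization LLM with vLLM instead

Only one of the two backends is meant to be enabled at a time.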

Type of change

  • New feature (non-breaking change which adds new functionality)

Dependencies

The opea/llm-docsum-vllm:latest image is currently missing from the CI and DockerHub registries:
opea-project/GenAIComps#961

(The corresponding opea/llm-docsum-tgi:latest image for TGI and the opea/llm-vllm:latest vLLM text-generation image already exist.)

Tests

Manual testing with an opea/llm-docsum-vllm:latest image built locally.

Otherwise, "llm-uservice" throws an exception due to a None variable value, or vLLM returns an error due to an unrecognized model ID.
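
For reference, pointing the chart at a locally built image could be done with an override along these lines; the llm-uservice subchart name matches the component mentioned above, but the image.repository/image.tag key paths are an assumption based on common Helm conventions rather than this chart's verified layout:

  # hypothetical override for testing with a locally built image
  llm-uservice:
    image:
      repository: opea/llm-docsum-vllm
      tag: latest
      pullPolicy: IfNotPresent   # prefer the local image over a registry pull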

Signed-off-by: Eero Tamminen <[email protected]>
Signed-off-by: Eero Tamminen <[email protected]>
@eero-t marked this pull request as draft December 18, 2024 18:58

@eero-t (Contributor, Author) commented Dec 18, 2024

Setting this as draft because the required image is still missing from DockerHub, and it needs retesting after the currently pending DocSum changes in the Comps and Examples repos have completed.

@eero-t (Contributor, Author) commented Dec 20, 2024

While the CI "docsum, gaudi, ci-gaudi-vllm-values" test fails as expected, due to the OPEA llm-docsum-vllm image still being missing...

There also seems to be a bug in a component unrelated to this PR: the "llm-uservice, xeon, ci-faqgen-values, common" CI test run fails because of a broken package import in the image:

[pod/llm-uservice20241218190439-5b9b7b79fd-r65l9/llm-uservice20241218190439]
...
   File "/home/user/comps/llms/faq-generation/tgi/langchain/llm.py", line 77, in stream_generator
     from langserve.serialization import WellKnownLCSerializer
   File "/home/user/.local/lib/python3.11/site-packages/langserve/__init__.py", line 8, in <module>
     from langserve.client import RemoteRunnable
   File "/home/user/.local/lib/python3.11/site-packages/langserve/client.py", line 24, in <module>
     from httpx._types import AuthTypes, CertTypes, CookieTypes, HeaderTypes, VerifyTypes
 ImportError: cannot import name 'VerifyTypes' from 'httpx._types' (/home/user/.local/lib/python3.11/site-packages/httpx/_types.py)

=> The requirements.txt used to build the llm-faqgen-tgi:latest image is not up to date in the Comps repo?
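
If that is the cause, one possible fix (an assumption, not verified against the Comps repo) would be to constrain httpx in that requirements.txt to a release that still exports VerifyTypes, or to move to a langserve version that no longer imports it, e.g.:

  # hypothetical requirements.txt adjustment; VerifyTypes appears to have been
  # removed from httpx._types in httpx 0.28
  httpx<0.28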

@lianhao?
