[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci

Signed-off-by: Chun Tao <[email protected]>
pre-commit-ci[bot] authored and ctao456 committed Sep 6, 2024
1 parent b5cbec9 commit 55b824b
Showing 1 changed file with 9 additions and 8 deletions.
17 changes: 9 additions & 8 deletions ChatQnA/README.md
@@ -97,17 +97,18 @@ flowchart LR

This ChatQnA use case performs RAG using LangChain, Redis VectorDB, and Text Generation Inference on Intel Gaudi2 or Intel Xeon Scalable Processors. The Intel Gaudi2 accelerator supports both training and inference for deep learning models, in particular LLMs. Visit [Habana AI products](https://habana.ai/products) for more details.

The table below lists, for each microservice component in the ChatQnA architecture, the default open source project, hardware, port, and endpoint.

<details>
<summary> Gaudi default compose.yaml </summary>

| Microservice | Open Source Project | Hardware | Port | Endpoint             |
| ------------ | ------------------- | -------- | ---- | -------------------- |
| Embedding    | LangChain           | Gaudi    | 6000 | /v1/embeddings       |
| Retriever    | LangChain           | Xeon     | 7000 | /v1/retrieval        |
| Reranking    | LangChain           | Xeon     | 8000 | /v1/reranking        |
| LLM          | LangChain           | Gaudi    | 9000 | /v1/chat/completions |
| Dataprep     | Redis               | Xeon     | 6007 | /v1/dataprep         |

</details>
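
As a rough illustration of how the defaults in the table could be consumed, the sketch below stores each row in a small lookup and builds a base URL per microservice. The `localhost` host, the `Microservice` class, and the `service_url` helper are hypothetical names introduced only for this example, and the embedding path is assumed to be `/v1/embeddings`. Nothing here is taken from the project's own code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Microservice:
    """One row of the Gaudi default compose table (hypothetical helper type)."""
    project: str
    hardware: str
    port: int
    endpoint: str


# Values copied from the table above; the embedding path spelling is an assumption.
GAUDI_DEFAULTS = {
    "Embedding": Microservice("LangChain", "Gaudi", 6000, "/v1/embeddings"),
    "Retriever": Microservice("LangChain", "Xeon", 7000, "/v1/retrieval"),
    "Reranking": Microservice("LangChain", "Xeon", 8000, "/v1/reranking"),
    "LLM": Microservice("LangChain", "Gaudi", 9000, "/v1/chat/completions"),
    "Dataprep": Microservice("Redis", "Xeon", 6007, "/v1/dataprep"),
}


def service_url(name: str, host: str = "localhost") -> str:
    """Build the URL for a microservice; the default host is an assumption."""
    svc = GAUDI_DEFAULTS[name]
    return f"http://{host}:{svc.port}{svc.endpoint}"


if __name__ == "__main__":
    # Print one line per microservice with its assumed URL.
    for name in GAUDI_DEFAULTS:
        print(f"{name:<10} {service_url(name)}")
```

Keeping the ports and paths in one mapping means a change to the compose defaults only needs to be reflected in a single place.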
