Add tei/teirerank usvc helm deployment
Signed-off-by: Dolpher Du <[email protected]>
yongfengdu committed Jan 10, 2025
1 parent a2cf490 commit fa68693
Showing 6 changed files with 90 additions and 0 deletions.
18 changes: 18 additions & 0 deletions comps/3rd_parties/tei/deployment/kubernetes/README.md
@@ -0,0 +1,18 @@
# Deploy TEI on a Kubernetes cluster

- You should have Helm (version >= 3.15) installed. Refer to the [Helm Installation Guide](https://helm.sh/docs/intro/install/) for more information.
- For more deployment options, refer to [helm charts README](https://github.com/opea-project/GenAIInfra/tree/main/helm-charts#readme).

## Deploy on Xeon

```
export HFTOKEN="insert-your-huggingface-token-here"
helm install tei oci://ghcr.io/opea-project/charts/tei --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f cpu-values.yaml
```

## Deploy on Gaudi

```
export HFTOKEN="insert-your-huggingface-token-here"
helm install tei oci://ghcr.io/opea-project/charts/tei --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f gaudi-values.yaml
```
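
The two install commands above differ only in the values file. A minimal sketch of a wrapper that picks the file from a target name and refuses to run without a token might look like this. `deploy_tei` and `DRY_RUN` are hypothetical names introduced here for illustration; they are not part of the chart or of this commit.

```shell
# Hypothetical helper: deploy_tei and DRY_RUN are illustrative names only.
# Usage: deploy_tei xeon|gaudi
deploy_tei() {
  # Map the hardware target to the values file added in this commit.
  case "$1" in
    xeon)  values="cpu-values.yaml" ;;
    gaudi) values="gaudi-values.yaml" ;;
    *)     echo "usage: deploy_tei xeon|gaudi" >&2; return 2 ;;
  esac

  # Fail early rather than install a release with an empty token.
  if [ -z "${HFTOKEN:-}" ]; then
    echo "HFTOKEN is not set" >&2
    return 1
  fi

  # With DRY_RUN=1, print the command (without the token) and stop.
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "helm install tei oci://ghcr.io/opea-project/charts/tei -f ${values}"
    return 0
  fi

  helm install tei oci://ghcr.io/opea-project/charts/tei \
    --set "global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN}" \
    -f "${values}"
}
```

Running `DRY_RUN=1 deploy_tei gaudi` from the directory that holds the values files prints the command that would be executed, which is a cheap way to check the target mapping before touching the cluster.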
5 changes: 5 additions & 0 deletions comps/3rd_parties/tei/deployment/kubernetes/cpu-values.yaml
@@ -0,0 +1,5 @@
# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

image:
  repository: ghcr.io/huggingface/text-embeddings-inference
22 changes: 22 additions & 0 deletions comps/3rd_parties/tei/deployment/kubernetes/gaudi-values.yaml
@@ -0,0 +1,22 @@
# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

accelDevice: "gaudi"

OMPI_MCA_btl_vader_single_copy_mechanism: "none"
MAX_WARMUP_SEQUENCE_LENGTH: "512"
image:
  repository: ghcr.io/huggingface/tei-gaudi
  tag: 1.5.0

securityContext:
  readOnlyRootFilesystem: false

resources:
  limits:
    habana.ai/gaudi: 1

livenessProbe:
  timeoutSeconds: 1
readinessProbe:
  timeoutSeconds: 1
18 changes: 18 additions & 0 deletions comps/3rd_parties/teirerank/deployment/kubernetes/README.md
@@ -0,0 +1,18 @@
# Deploy TEIRERANK on a Kubernetes cluster

- You should have Helm (version >= 3.15) installed. Refer to the [Helm Installation Guide](https://helm.sh/docs/intro/install/) for more information.
- For more deployment options, refer to [helm charts README](https://github.com/opea-project/GenAIInfra/tree/main/helm-charts#readme).

## Deploy on Xeon

```
export HFTOKEN="insert-your-huggingface-token-here"
helm install teirerank oci://ghcr.io/opea-project/charts/teirerank --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f cpu-values.yaml
```

## Deploy on Gaudi

```
export HFTOKEN="insert-your-huggingface-token-here"
helm install teirerank oci://ghcr.io/opea-project/charts/teirerank --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f gaudi-values.yaml
```
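
Once the release is up, a quick request against TEI's `/rerank` route is one way to confirm the deployment works. The sketch below rests on assumptions this commit does not confirm: that the chart creates a Service named `teirerank` listening on port 80, and that local port 2082 is free. `rerank_payload` and `rerank_smoke_test` are hypothetical helper names.

```shell
# Hypothetical helpers; service name, service port and local port are assumptions.

# Build the JSON body for /rerank from a query and two candidate texts.
# No JSON escaping is done, so inputs must not contain double quotes
# or backslashes.
rerank_payload() {
  printf '{"query":"%s","texts":["%s","%s"]}' "$1" "$2" "$3"
}

# Forward a local port to the (assumed) Service and send one rerank request.
rerank_smoke_test() {
  kubectl port-forward svc/teirerank 2082:80 >/dev/null 2>&1 &
  pf_pid=$!
  sleep 3  # crude wait for the port-forward to come up

  curl -sS http://localhost:2082/rerank \
    -X POST -H 'Content-Type: application/json' \
    -d "$(rerank_payload 'What is deep learning?' \
          'Deep learning is a subset of machine learning.' \
          'Paris is the capital of France.')"
  status=$?

  kill "$pf_pid"
  return "$status"
}
```

If the Service has a different name or port in your cluster, `kubectl get svc` shows the values to substitute before calling `rerank_smoke_test`.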
@@ -0,0 +1,5 @@
# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

image:
  repository: ghcr.io/huggingface/text-embeddings-inference
@@ -0,0 +1,22 @@
# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

accelDevice: "gaudi"

OMPI_MCA_btl_vader_single_copy_mechanism: "none"
MAX_WARMUP_SEQUENCE_LENGTH: "512"
image:
  repository: ghcr.io/huggingface/tei-gaudi
  tag: 1.5.0

securityContext:
  readOnlyRootFilesystem: false

resources:
  limits:
    habana.ai/gaudi: 1

livenessProbe:
  timeoutSeconds: 1
readinessProbe:
  timeoutSeconds: 1
