support multiple nodes of a graph, apply different endpoint env for different node #176

Closed
wants to merge 37 commits into from

Collaborator

@KfreeZ KfreeZ commented Jul 16, 2024

Description

#166

This PR also adopts the new manifest. Once the new manifest is merged, some unnecessary code will be removed for simplicity.

Marking it as a draft.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

KfreeZ and others added 27 commits July 16, 2024 22:20
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Dolpher Du <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Generate the component manifests into a single flat directory 'common'

- Add a special version manifest for docsum-llm-uservice

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Remove unused probes

- Support conditionally disabling probes per GMC request

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
This is to prevent pre-commit from linting the whitespace at the end of a line.

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart redis-vector-db

- Update the chart README

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart tgi

- Update the chart README

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart tei

- Update the chart README

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart teirerank

- Update the chart README

Signed-off-by: Lianhao Lu <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
@@ -23,6 +23,7 @@ spec:
serviceName: embedding-svc
config:
  endpoint: /v1/embeddings
  TEI_EMBEDDING_ENDPOINT: TeiEmbedding
Collaborator

This should use the serviceName, in this case tei-embedding-svc.

@@ -33,6 +34,7 @@ spec:
serviceName: retriever-svc
config:
  endpoint: /v1/retrieval
  REDIS_URL: VectorDB
Collaborator

Same here: use redis-vector-db. That way we can support multiple endpoints for the switch scenario.
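
For illustration, the two suggestions above could be applied as in the sketch below. It is not taken from the PR: only the values tei-embedding-svc and redis-vector-db come from the review comments, the parent keys are omitted because the hunks do not show them, and the two fragments are separated with `---` only so that each remains valid YAML on its own.

```yaml
# Hypothetical fragment; surrounding keys are not shown in the hunks.
# Embedding step: the env value is the serviceName named by the reviewer.
serviceName: embedding-svc
config:
  endpoint: /v1/embeddings
  TEI_EMBEDDING_ENDPOINT: tei-embedding-svc
---
# Retriever step: likewise points at the service named by the reviewer.
serviceName: retriever-svc
config:
  endpoint: /v1/retrieval
  REDIS_URL: redis-vector-db
```

Referencing a concrete service name rather than a component type is what would let two nodes of the same type point at different endpoints, which is the switch scenario the reviewer mentions.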

@@ -43,6 +45,7 @@ spec:
serviceName: reranking-svc
config:
  endpoint: /v1/reranking
  TEI_RERANKING_ENDPOINT: TeiReranking
Collaborator

the same

@@ -55,6 +58,7 @@ spec:
serviceName: llm-svc
config:
  endpoint: /v1/chat/completions
  TGI_LLM_ENDPOINT: Tgi
Collaborator

the same

Collaborator

@mkbhanda mkbhanda left a comment

LGTM

@KfreeZ KfreeZ mentioned this pull request Jul 19, 2024
3 tasks
Collaborator Author

KfreeZ commented Jul 19, 2024

Using #203 to commit the code instead.

@KfreeZ KfreeZ closed this Jul 19, 2024
5 participants