-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support multiple nodes of a graph, apply different endpoint env for different node #176
Conversation
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Dolpher Du <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Generate the component manifests into a single flat direcotry 'common' - Add a special version manifest for docsum-llm-uservice Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Remove unused probes - Support conditionally disable probes per GMC request Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
This is to prevent pre-commit lint the whitespace at the end of line. Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart redis-vector-db - Update the chart README Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart tgi - Update the chart README Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart tei - Update the chart README Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
- Added helm test for chart teirerank - Update the chart README Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: Lianhao Lu <[email protected]> Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
@@ -23,6 +23,7 @@ spec: | |||
serviceName: embedding-svc | |||
config: | |||
endpoint: /v1/embeddings | |||
TEI_EMBEDDING_ENDPOINT: TeiEmbedding |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should use servcieName, in this case tei-embedding-svc
.
@@ -33,6 +34,7 @@ spec: | |||
serviceName: retriever-svc | |||
config: | |||
endpoint: /v1/retrieval | |||
REDIS_URL: VectorDB |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
same here, use redis-vector-db
, in this way we can support multiple endpoint for switch scenario
@@ -43,6 +45,7 @@ spec: | |||
serviceName: reranking-svc | |||
config: | |||
endpoint: /v1/reranking | |||
TEI_RERANKING_ENDPOINT: TeiReranking |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the same
@@ -55,6 +58,7 @@ spec: | |||
serviceName: llm-svc | |||
config: | |||
endpoint: /v1/chat/completions | |||
TGI_LLM_ENDPOINT: Tgi |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the same
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
Signed-off-by: KfreeZ <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Signed-off-by: KfreeZ <[email protected]>
use #203 to commit code |
Description
#166
this PR also adopt the new manifest, when new manifest is merged, some unnecessary code will be removed for simplicity
mark it as draft
Issues
List the issue or RFC link this PR is working on. If there is no such link, please mark it as
n/a
.Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
List the newly introduced 3rd party dependency if exists.
Tests
Describe the tests that you ran to verify your changes.