-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[model-controller] Investigate serving integration from cluster data #249
[model-controller] Investigate serving integration from cluster data #249
Comments
New feature implementation in custom branch https://github.com/lampajr/odh-model-controller/tree/lampajr20231219_gh249_reconciler_from_isvc, waiting for a final decision on which workflow we would like to go with. After that I will finalize the selected branch, either MR --> ISVC or ISVC --> MR |
IMO, we need to support both. When do we need to support both may be in question. |
As agreed we will go with this implementation at first round, here the summarized workflow: sequenceDiagram
actor U as UI Dashboard
participant K as Kubernetes
participant MC as ODH Model Controller
participant MR as Model Registry
U->>+MR: Retrieve indexed model version
MR-->>-U: Indexed model version
U->>K: Create InferenceService (ISVC)
Note right of U: Annotate/Label the ISVC with indexed <br/> model information, like RegisteredModel and <br/>ModelVersion IDs.
Note right of K: Here all operators/controllers in charge to deploy<br/> the model will make<br/> their actions, e.g., KServe or ModelMesh.
loop Every ISVC creation/deletion/update
K-->>+MC: Send notification
MC->>+K: Retrieve affected ISVC in the cluster
K-->>-MC: ISVC resource
MC->>+MR: Create/Update InferenceService in Model Registry
Note left of MR: InferenceService records in Model Registry<br/>are used to keep track of every deployment that<br/>occurred in the monitored Kubernetes cluster.
MR-->>-MC: InferenceService record
MC-->>-K: Update ISVC with Model Registry record ID
end
As highlighted here, the ODH Model Controller logic will have a more passive behavior as the InferenceService CRs are still created and managed by the UI Dashboard. The Model Controller will just sync those occurrence into the Model Registry such that we can keep track of every deployment that occurred in the cluster for indexed models. |
@lampajr thank you, please be aware the webhook is no longer active back to JIRA |
Thanks for highlighting that the webhook is no more active 🙏 |
Is your feature request related to a problem? Please describe.
Followup opendatahub-io/model-registry#104
Evaluate/investigate the other direction of the aforementioned workflow:
Describe the solution you'd like
Workflow:
a. NO: then create a new IS in model registry, then update the ISVC labels/annotation adding this new links and removing previous ones
b. YES: TBD
3.a reconcile the data such that it won't be processed the next round but it will go through the original workflow opendatahub-io/model-registry#104
Describe alternatives you've considered
n/a
Additional context
n/a
The text was updated successfully, but these errors were encountered: