[feature] remove hard-coded configs in KFP V2 launcher #9689
Comments
@chensun @zijianjoy I think this is very important before cutting the 2.0.0 release for the backend.
/assign @zijianjoy
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
The upstream pipelines project hardcodes the name of the object storage service to minio-service. The current implementation sets the service to just minio, which collides with what pipelines expect. This PR ensures the Service is created with the expected name and ports. Refer to kubeflow/pipelines#9689 for more information
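For context, a minimal sketch of the kind of Service the fix above describes, assuming the minio-service.kubeflow:9000 address that the v2 launcher currently expects; the selector labels are illustrative and not taken from the referenced PR:

apiVersion: v1
kind: Service
metadata:
  # The v2 launcher resolves the object store at minio-service.kubeflow:9000,
  # so the Service must keep this exact name rather than just "minio".
  name: minio-service
  namespace: kubeflow
spec:
  # Illustrative selector; match it to your MinIO deployment's labels.
  selector:
    app: minio
  ports:
    - name: http
      port: 9000
      targetPort: 9000
      protocol: TCP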
Did this get resolved before the 1.8 release?
@chensun can you confirm what the status on this is?
* update kfp-api's apiserver configuration. This:
  * removes the deprecated `DBCONFIG_USER`, etc. environment variables (they have been replaced by variables of the form `DBCONFIG_[driver]CONFIG_*`)
  * adds `OBJECTSTORECONFIG_HOST`, `_PORT`, and `_REGION`, which previously were required. Currently they seem to be ignored due to kubeflow/pipelines#9689, but in theory they'll matter again? Not sure exactly the scope of that issue.
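A hedged sketch of how those API server environment variables might be wired into its Deployment, assuming the MySQL driver for the `DBCONFIG_[driver]CONFIG_*` pattern; the exact variable set accepted by a given KFP version may differ, and the values are placeholders:

# Illustrative container env block for the ml-pipeline API server (not a complete manifest).
env:
  # Assumed MySQL-driver replacements for the deprecated DBCONFIG_USER / DBCONFIG_PASSWORD.
  - name: DBCONFIG_MYSQLCONFIG_USER
    value: mlpipeline
  - name: DBCONFIG_MYSQLCONFIG_PASSWORD
    valueFrom:
      secretKeyRef:
        name: mysql-secret   # hypothetical Secret name
        key: password
  # Object store settings mentioned above; per this issue they may currently be
  # ignored by the v2 launcher, which hard-codes its own endpoint.
  - name: OBJECTSTORECONFIG_HOST
    value: minio-service.kubeflow
  - name: OBJECTSTORECONFIG_PORT
    value: "9000"
  - name: OBJECTSTORECONFIG_REGION
    value: us-east-1   # illustrative region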
Just seeing this issue after I created #10318 and implemented #10319. If implemented, this would enable users to define arbitrary s3 endpoints for artifact storage. Does this help?
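As an illustration of that approach, a pipeline root that carries the endpoint in its query string could look like the snippet below; this is a sketch assuming gocloud-style blob URL parameters, not necessarily the exact syntax adopted by #10318/#10319:

# Hypothetical kfp-launcher ConfigMap data entry using a query-string endpoint.
# The parameter names (endpoint, region, disableSSL, s3ForcePathStyle) follow the
# gocloud.dev s3blob URL convention and are assumptions here.
defaultPipelineRoot: "s3://my-bucket/v2/artifacts?endpoint=minio.example.com:9000&region=us-east-1&disableSSL=true&s3ForcePathStyle=true"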
Thank you for this @TobiasGoerke, I think this is an improvement. Though the problem of the hardcoded configs still remains; adding the endpoint as part of the query string seems more like a workaround for this issue, and as you said, the issue of propagating credentials remains. There already seems to be a precedent for passing around the mlmd endpoint info via a configmap, so it makes sense to do the same for the s3 endpoint, as suggested by @thesuperzapper. In this same config we can specify the s3 credentials secret name/keys as well.
@zijianjoy this is a priority item for our team, so we'll be looking to resolve this in our downstream, and if you'd like I can help with implementing it here as well. If you are already in progress with this and would like to see it to completion, we'd be interested in your approach so we can be aligned in our solution while we wait for the change in upstream! Some follow-ups:

1. Is there an obvious preference over option 1/2?
Option 1 seems easier to implement as well as easier to extend: one does not need to always update mlmd client code when adding new properties to the config. However, as previously mentioned, it requires that every executor pod fetches the configmap; is this a major concern?
Option 2: we'd follow the same pattern as we do for reading in the pipelineroot, so additional metadata fields for the provider creds would need to be added. In the driver we would find the matching auth provider config in the kfp-launcher configmap and maybe store that as a JSON string; we then query MLMD for the configs within the launcher. I would be interested to hear other options/suggestions under consideration.

2. Configmap structure
I like @thesuperzapper's more flexible configmap suggestions, but I have some further suggestions/amendments:

defaultPipelineRoot: "minio://my-bucket/v2/artifacts"
providers:
minio:
# can be another minio instance
endpoint: "minio.example.com:9000"
disableSSL: true
# by default use this secret for this provider
defaultProviderSecretRef:
secretName: "my-secret-in-profile-namespace"
accessKeyKey: "access_key"
secretKeyKey: "secret_key"
# optional, first matching prefix in pipelineRoot is used
# in this example v2/artifacts would be used if using the
# defaultPipelineRoot above
authConfigs:
- bucketName: "my-bucket"
keyPrefix: "v2/artifacts/123"
secretRef:
secretName: "my-secret-in-profile-namespace-1"
accessKeyKey: "access_key"
secretKeyKey: "secret_key"
- bucketName: "my-bucket"
keyPrefix: "v2/artifacts/"
secretRef:
secretName: "my-secret-in-profile-namespace-2"
accessKeyKey: "access_key"
secretKeyKey: "secret_key"
# Having the ability to override the regions here allows us to leverage other s3 compatible providers
# and not just aws s3
s3:
# allows for pointing to s3 compatible non aws solutions like ceph
endpoint: "https://s3.amazonaws.com"
disableSSL: true
region: us-east-2
defaultProviderSecretRef:
secretName: "my-aws-secret-in-profile-namespace"
accessKeyKey: "access_key"
secretKeyKey: "secret_key"
authConfigs: []
  gs: { }

3. mlPipelineEndpoint
I don't see the need to add this in the kfp-launcher configmap; to me it makes more sense to specify this in the API server as a config/env var and pass it down as parameters within driver/kfp-launcher/etc., just like the mlmd address/port.
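For reference, a minimal sketch of the kind of Secret that the defaultProviderSecretRef / secretRef entries in the config above would point at; the name and key names simply mirror the example config and the values are illustrative assumptions:

apiVersion: v1
kind: Secret
metadata:
  # Lives in the profile (user) namespace so it can be resolved per namespace.
  name: my-secret-in-profile-namespace
type: Opaque
stringData:
  # Key names must match accessKeyKey / secretKeyKey in the config above.
  access_key: EXAMPLEACCESSKEY
  secret_key: EXAMPLESECRETKEY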
@HumairAK I see that you have at least partially fixed this in your fork of KFP:
Would you be willing to raise a PR upstream, so we can all standardize on a fix?
@thesuperzapper that's the plan, I'll send one out soon.
@thesuperzapper I've provided a WIP that is mostly complete; I just need to do more testing and fix an obj store region-related bug. I welcome feedback on the implementation/approach there:
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Don't stale.
/lifecycle frozen
@HumairAK @thesuperzapper should this be closed as complete? Referenced PRs are all merged.
Hardcoded configs for the object store have all been made configurable, and docs are provided here. What now remains is parameterizing the pipeline server endpoint, which is still hardcoded to use the ml-pipeline service: https://github.com/kubeflow/pipelines/blob/2.2.0/backend/src/v2/cacheutils/cache.go#L127-L140. The service doesn't require to be located in … I made an attempt here, but an easier route might be to just overwrite the default … I'm fine with having a separate, more granular issue for it though.
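If that default were made overridable, the driver or launcher deployment could pass something like the following; the variable name here is purely hypothetical and does not correspond to an existing KFP setting:

# Hypothetical env override for the pipeline server endpoint used by the cache client,
# shown only to illustrate replacing the hard-coded ml-pipeline.kubeflow:8887 default.
env:
  - name: ML_PIPELINE_SERVICE_ENDPOINT   # hypothetical name
    value: "ml-pipeline.my-namespace.svc.cluster.local:8887"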
Hi, |
Feature Area
/area backend
Related Issues
The Problem
Currently, we are hardcoding a few important configs in KFP V2's launcher and cache code.
This is making it difficult for distributions to support KFP v2 in a generic way.
* Launcher - The minio credentials secret is hard-coded as `mlpipeline-minio-artifact`: pipelines/backend/src/v2/objectstore/object_store.go, line 292 (commit 35f7507)
* Launcher - The minio credentials secret keys are hard-coded as `accesskey` and `secretkey`: pipelines/backend/src/v2/objectstore/object_store.go, lines 324 to 325 (commit 35f7507)
* Launcher - The minio endpoint is hard-coded as `minio-service.kubeflow:9000`: pipelines/backend/src/v2/objectstore/object_store.go, line 291 (commit 35f7507)
* Cache - The `ml-pipeline` endpoint is hard-coded as `ml-pipeline.kubeflow:8887`: pipelines/backend/src/v2/cacheutils/cache.go, line 26 (commit 35f7507)
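To make those expectations concrete, this is roughly the Secret shape the launcher currently assumes, sketched with illustrative values based on a stock Kubeflow install:

apiVersion: v1
kind: Secret
metadata:
  # Name and key names match what the launcher currently hard-codes.
  name: mlpipeline-minio-artifact
type: Opaque
stringData:
  accesskey: minio      # illustrative value
  secretkey: minio123   # illustrative value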
The Solution
We want to allow the above configs to be changed from their defaults, preferably on a per-namespace basis.
I propose we add these configs to the `ConfigMap/kfp-launcher` which already exists in each profile namespace to set `defaultPipelineRoot`. For example, a new `ConfigMap/kfp-launcher` in a profile namespace might look like this (see the sketch after the two options below).

The question now becomes how to propagate these configs to the `kfp-launcher` container in each workflow Pod. I think there are two options:

1. Make the `kfp-launcher` container read the `ConfigMap/kfp-launcher` itself. Reading the object store config would have to happen before it runs `objectstore.OpenBucket`, and reading `mlPipelineEndpoint` has to be done before it runs `cacheutils.NewClient()`.
2. Read the `ConfigMap/kfp-launcher` in `RootDAG` at the same time as we read the `defaultPipelineRoot` (and pass the values down from there, rather than having every Pod read the `ConfigMap/kfp-launcher`).
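A minimal sketch of what such a per-namespace ConfigMap/kfp-launcher could look like; apart from defaultPipelineRoot and mlPipelineEndpoint, the key names below are assumptions rather than an existing KFP schema:

apiVersion: v1
kind: ConfigMap
metadata:
  name: kfp-launcher
  namespace: my-profile   # each profile namespace gets its own copy
data:
  defaultPipelineRoot: "minio://my-bucket/v2/artifacts"
  # Assumed extensions, mirroring the hard-coded values listed in "The Problem" above.
  objectStoreEndpoint: "minio-service.kubeflow:9000"
  objectStoreSecretName: "mlpipeline-minio-artifact"
  objectStoreAccessKeyKey: "accesskey"
  objectStoreSecretKeyKey: "secretkey"
  mlPipelineEndpoint: "ml-pipeline.kubeflow:8887"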
Other Thoughts
We could make this slightly more advanced to allow for "selecting" credentials based on the bucket & key-prefix being used, because not all pipelines have to use the default bucket_root, so might need different credentials.
For example, we could extend `ConfigMap/kfp-launcher` like this (see the sketch below).

Love this idea? Give it a 👍.
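A minimal sketch of the kind of per-bucket, per-key-prefix credential selection being described, modeled on the authConfigs layout quoted in the comments above; the secret names and key names are assumptions:

# Assumed extension of the ConfigMap/kfp-launcher data: pick credentials by
# bucket and key prefix, falling back to the provider's default secret.
providers:
  minio:
    endpoint: "minio-service.kubeflow:9000"
    defaultProviderSecretRef:
      secretName: "mlpipeline-minio-artifact"
      accessKeyKey: "accesskey"
      secretKeyKey: "secretkey"
    authConfigs:
      # The first entry whose bucket and key prefix match the pipeline root is used.
      - bucketName: "my-bucket"
        keyPrefix: "v2/artifacts/"
        secretRef:
          secretName: "team-a-artifact-credentials"   # hypothetical Secret
          accessKeyKey: "access_key"
          secretKeyKey: "secret_key"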