Retrieving pipeline id in cicd pipelines #76

Gabriel2409 · 2023-09-22T09:52:49Z

I am having a bit of trouble registering a model after launching a pipeline with kedro azureml.
To find the actual id of the pipeline that trained the model, I currently take the most recent one (with a few additional filters to make sure I get the correct one), but I don't think that is a great solution.

I noticed that the pipeline id is available in AzureMLPipelinesClient.run (it is just pipeline_job.name).

Is there an intelligent way to retrieve it?

I could imagine logging it to the output and then doing something like
pipeline_text=$(kedro azureml run ...)
and then filtering pipeline_text to get the actual value, but that seems very ugly.
It would also depend on the pipeline log level being set correctly, so I don't think it is ideal.


marrrcin · 2023-09-22T10:11:19Z

There's a callback (on_job_scheduled) that lets you plug in some behaviour after the job is scheduled:

kedro-azureml/kedro_azureml/cli.py

Line 261 in 5ae7bcf

lambda job: click.echo(job.studio_url),

I think you can write your own CLI for Kedro that replicates what is done in our kedro azureml run. From there you can use this callback to, e.g., save the id you want to have.
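As a rough illustration of what such a callback could do, here is a minimal sketch that persists the job name and Studio URL to a JSON file so a later CI/CD step can read them. It only assumes that the object passed to the callback exposes `name` and `studio_url`, as discussed in this thread. The function names `save_job_info` and `load_job_name` and the file name `azureml_job.json` are hypothetical and not part of kedro-azureml; how the callback gets wired into a custom CLI is not shown.

```python
import json
import tempfile
from pathlib import Path
from types import SimpleNamespace


def save_job_info(job, path="azureml_job.json"):
    """Hypothetical callback: write the job name and Studio URL to a JSON file.

    Assumes only that `job` has `name` and `studio_url` attributes.
    """
    info = {"name": job.name, "studio_url": job.studio_url}
    Path(path).write_text(json.dumps(info))
    return info


def load_job_name(path="azureml_job.json"):
    """Read the job name back, e.g. in a later CI/CD step."""
    return json.loads(Path(path).read_text())["name"]


if __name__ == "__main__":
    # Stand-in for the real job object; the values are made up.
    fake_job = SimpleNamespace(
        name="example_pipeline_run",
        studio_url="https://ml.azure.com/runs/example_pipeline_run",
    )
    with tempfile.TemporaryDirectory() as tmp:
        target = Path(tmp) / "azureml_job.json"
        save_job_info(fake_job, target)
        print(load_job_name(target))
```

Writing to a file avoids depending on log levels or on parsing console output, at the cost of the CI/CD job needing to know where the file is.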

Another way would be to save the pipeline id from within the pipeline (as a Kedro dataset) and read it later from the CI/CD pipeline. I think it's available in some AzureML-set environment variable.

Gabriel2409 · 2023-09-22T10:53:51Z

Thanks @marrrcin, this seems like a great solution.

I think I can do something even simpler, as the job studio url contains the pipeline name (which I did not notice before):
https://ml.azure.com/runs/<pipeline_name>?wsid=/subscriptions/.... so we can extract it from there
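A minimal sketch of that extraction, assuming the URL really has the shape shown above (a `/runs/<pipeline_name>` path followed by a `wsid` query string). The function name `pipeline_name_from_studio_url` is hypothetical, and the example URL values are made up.

```python
from urllib.parse import urlparse


def pipeline_name_from_studio_url(studio_url: str) -> str:
    """Return the path segment that follows 'runs' in a Studio URL."""
    # Only the path is inspected; the ?wsid=... query string is ignored.
    segments = [s for s in urlparse(studio_url).path.split("/") if s]
    try:
        return segments[segments.index("runs") + 1]
    except (ValueError, IndexError):
        raise ValueError(f"No pipeline name found in URL: {studio_url!r}") from None


if __name__ == "__main__":
    url = "https://ml.azure.com/runs/example_pipeline_run?wsid=/subscriptions/0000"
    print(pipeline_name_from_studio_url(url))
```

This still relies on the URL format staying stable, so it is more fragile than reading `job.name` directly.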

However, would you also be open to a modification of the callback so that the intent is a bit clearer? I was thinking of something like

lambda job: click.echo(f"AzureML Studio URL: {job.studio_url}"),

or maybe using a more detailed callback function such as:

def echo_job_info(job):
    click.echo(f"Job studio url: {job.studio_url}")
    click.echo(f"Azure ML Pipeline name: {job.name}")

marrrcin · 2023-09-25T09:01:45Z

Sure :)

Gabriel2409 · 2023-09-29T13:37:39Z

Closing the issue following merge of #78

This was referenced Sep 26, 2023:

Feature/export job name for cicd #77 (Closed)

Feature/on schedule job callback #78 (Merged)

Gabriel2409 closed this as completed Sep 29, 2023
