Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dashboard and monitoring for Kubeflow CD pipelines #580

Closed
jlewi opened this issue Jan 27, 2020 · 4 comments
Closed

Dashboard and monitoring for Kubeflow CD pipelines #580

jlewi opened this issue Jan 27, 2020 · 4 comments

Comments

@jlewi
Copy link
Contributor

jlewi commented Jan 27, 2020

Follow on to #450

We need a way to monitor the continuous delivery of Kubeflow applications.

As a strawman I think we'd like a dashboard that shows a table like the following for each Kubeflow version.

Application Last Commit of Code Image Commit Status of Update Pipeline
Notebook Controller 1234 1234 Green
Profile Controller abcd efgh Red
... ... ... ...

So by looking at the dashboard we'd like to be able to answer the following questions

  1. For Kubeflow version x.y.z which commit of application A is in the manifest?
  2. Is the manifest for Application A up to date?
  3. Are the CD pipelines for Application A failing?

We could probably easily write a python script to compute the above information. We could then perhaps turn that into a report just by having a flask server return it as html.

Perhaps we should start by writing a notebook and then think about how to turn that notebook into
reports being served at some URL?

@issue-label-bot
Copy link

Issue-Label Bot is automatically applying the labels:

Label Probability
kind/feature 0.89

Please mark this comment with 👍 or 👎 to give our bot feedback!
Links: app homepage, dashboard and code for this bot.

@jlewi
Copy link
Contributor Author

jlewi commented Jan 28, 2020

update_kf_apps.py has all the information to compute the above.

So update_kf_apps.py should probably track that information and then make it available via a flask server.

@stale
Copy link

stale bot commented Apr 27, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in one week if no further activity occurs. Thank you for your contributions.

@stale
Copy link

stale bot commented May 4, 2020

This issue has been closed due to inactivity.

@stale stale bot closed this as completed May 4, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant