-
Notifications
You must be signed in to change notification settings - Fork 293
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dashboard support in Kueue #940
Comments
Do you have a high level idea of how to get there? For example, would metrics + grafana yamls be enough for the administrative side? For end users, certainly grafana wouldn't be viable. But what could be the MVP that would keep kueue largely non-opinionated and reusable (so you can integrate it with your own UI, if you already have one). Could we offer a CLI instead? |
Kueue already spits out prometheus metrics. Building a UI based on that can be useful and the UI should be optional to deploy. I do wonder if it is more useful for us to provide general purpose grafana dashboard and make it available in https://grafana.com/grafana/dashboards/ |
Building on metrics is helpful I think, but the dashboard is more than that, like it will display the basic information about the system, how many queues there, what their names are, how many jobs inside the queue, it can be interactive. We can get the information via the apis directly or we can have a lightweight database inside for cache, like sqlite. We may need some frontend volunteers if we want to finish this work. As a MVP, IMHO, I think it should include
|
+1000, this is a much needed experience gap, I would be happy to review proposals. @kerthcet may be we start by looking at similar batch schedulers and see what "screens" they offer to inform and help seed what we need to build? |
@ahg-g However, airflow doesn't provide an API to submit one-time run jobs like ml training jobs, which is the core application for kueue. |
Hi guys, I want to try involved in the prototyping part of the dashbaord Desgin, and provide the prototype like figma. |
Thanks @samzong |
Maybe we can start with a list of views you would like to have and do priority sorting |
@B1F030 is doing this. |
Hey folks, https://github.com/armadaproject/armada has a UI in the form of lookout. Our demo UI is here: https://ui.demo.armadaproject.io/ Let us know what you think of it - I think many parts of it could be suitable for lookout and we would be interested in contributing. Thanks! |
Great, Thank you :)) |
@Sharpz7 what is a lookout in this context? |
The Kubernetes project currently lacks enough contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
/remove-lifecycle stale |
The Kubernetes project currently lacks enough contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
Are you still following this @samzong ? |
Absolutely, I'm still very interested in this. The good news is that I'll have more time to contribute to open source in the coming period. I'll make sure to push this forward as soon as possible. |
Hey, |
I think it might be this one https://github.com/GoogleCloudPlatform/ai-on-gke/blob/main/best-practices/gke-batch-refarch/02_platform/monitoring/deploy-dashboard.yaml but @moficodes and @alizaidis to confirm |
Yep that's the one! |
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle rotten |
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /close not-planned |
@k8s-triage-robot: Closing this issue, marking it as "Not Planned". In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Hi everyone, As a side project to just run a demo I have contributed kueue-viz : as a kueue dashboard. It is still very basic , but every contribution and feedback are welcome. |
/reopen |
@kannon92: Reopened this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
@mwielgus has mentioned that this is still of interest to the Kueue project. |
Thanks @akram for the work! |
/reopen
I'm also open to track the improvements as dedicated issue, just listing them here as a starting point. |
@mimowo: Reopened this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
/remove-lifecycle rotten |
What would you like to be added:
It would be great if we can have an insight about what's our queueing system looks like at real time
Overall, it's a great enhancement especially for production env.
Why is this needed:
A big enhancement and a great insight of kueue system.
Completion requirements:
This enhancement requires the following artifacts:
The artifacts should be linked in subsequent comments.
Some advices here:
The text was updated successfully, but these errors were encountered: