Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New Hub] M2LiNES Pangeo hub #1168

Closed
8 of 9 tasks
choldgraf opened this issue Apr 3, 2022 · 16 comments
Closed
8 of 9 tasks

[New Hub] M2LiNES Pangeo hub #1168

choldgraf opened this issue Apr 3, 2022 · 16 comments
Assignees

Comments

@choldgraf
Copy link
Member

choldgraf commented Apr 3, 2022

Hub Description

M2LINES is an international climate modeling collaboration. They'd like a Pangeo-style hub for their community.

Community Representative(s)

Our contact right now is:

TODO: is there a better person at NYU to be the community rep?

Important dates

  • Required start date: TODO: is there a specific date or just ASAP?
  • Target start date:
  • Any important dates for usage:

Hub Authentication Type

GitHub Authentication (e.g., @MyGitHubHandle)

Hub logo information

Hub user image

Extra features you'd like to enable

  • Specific cloud provider or datacenter: Same as other Pangeo hubs
  • Dedicated Kubernetes cluster
  • Scalable Dask Cluster

Other relevant information

This hub should be similar to the Pangeo and LEAP hubs.

TODO: confirm that github authentication is preferred

Hub URL

m2lines.2i2c.cloud TODO: confirm this is OK w/ community rep

Hub Type

daskhub

Tasks to deploy the hub

  • Engineer who will deploy the hub is assigned
  • Deploy information filled in above
  • Initial Hub deployment PR: Add m2lines hub #1227
  • Administrators able to log on
  • Community Representative satisfied with hub environment
  • Hub now in steady-state
@choldgraf
Copy link
Member Author

cc @rabernat - there are a few "TODO" questions up there that you might be able to help answering (or feel free to loop in any other person that would be able to help clarify).

@yuvipanda
Copy link
Member

Does this get its own cloud project with its own billing for cloud resource use, or is this being rolled into the cloud resource / billing for pangeo-hubs?

I'll pick this one up.

@rabernat
Copy link
Contributor

rabernat commented Apr 4, 2022

TODO: is there a better person at NYU to be the community rep?

@johannag126 may be the best choice here because she is in contact with all the project participants on a regular basis. Johanna, would you be comfortable with the Community Representative Responsibilities? If you have questions about this, feel free to ask them here in this thread.

  • TODO: is there a specific date or just ASAP?

Asap without any specific urgency.

  • TODO: figure out which registry Pangeo uses

We are still using dockerhub. If you want to help move pangeo-docker-images to quay, PR welcome 😉 .

  • TODO: figure out which image should be used (or maybe juts latest)

M2LInES needs here are mostly identical to LEAP. Ideally we would be able to select from all the pangeo-docker-image tags via a customized spawner, as discussed a bit in #1050 (comment). We also need optional GPUs.

TODO: confirm that github authentication is preferred

Yes. The roles are simpler than with LEAP. Let's just allow anyone part of the m2lines org (https://github.com/orgs/m2lines) to access the hub.

m2lines.2i2c.cloud TODO: confirm this is OK w/ community rep

👍

@johannag126
Copy link

TODO: is there a better person at NYU to be the community rep?

@johannag126 may be the best choice here because she is in contact with all the project participants on a regular basis. Johanna, would you be comfortable with the Community Representative Responsibilities? If you have questions about this, feel free to ask them here in this thread.

I am happy to be the relay between the team and hub engineer but would this role require technical knowledge ? The description reads "This role is usually filled by someone that is a member of the hub’s community of practice."

@colliand
Copy link
Contributor

Yes @yuvipanda this hub should be set up on a dedicated GCP cluster with the billing account managed by 2i2c/CS&S. Cloud costs will be passed on to NYU/M2LInES by our colleagues at CS&S.

@damianavila damianavila moved this to Needs Shaping / Refinement in DEPRECATED Engineering and Product Backlog Apr 12, 2022
@yuvipanda
Copy link
Member

@colliand great. I'll take this on and try to get it done by next week.

@yuvipanda yuvipanda self-assigned this Apr 13, 2022
@damianavila damianavila moved this from Needs Shaping / Refinement to Ready to work in DEPRECATED Engineering and Product Backlog Apr 13, 2022
@choldgraf
Copy link
Member Author

Hey all - what is the status on this hub? We have now set up the invoicing for it, so it should be running ASAP.

@yuvipanda
Copy link
Member

@choldgraf I'll get this done this week.

yuvipanda added a commit to yuvipanda/pilot-hubs that referenced this issue Apr 22, 2022
Features enabled:
- GitHub Auth, anyone part of https://github.com/m2lines org can
  log in
- Dask-gateway is enabled
- gh-scoped-creds (https://github.com/yuvipanda/gh-scoped-creds/) is
  enabled for secure pushing to GitHub
- Scratch GCS storage bucket is available, accessed via the
  SCRATCH_BUCKET environment variable

Ref 2i2c-org#1168
@yuvipanda
Copy link
Member

yuvipanda commented Apr 22, 2022

@johannag126 @rabernat this is now up at https://m2lines.2i2c.cloud! Please check it out.

Features:

@yuvipanda
Copy link
Member

I can't test if folks part of the https://github.com/m2lines org can log in - can you test logging in, @johannag126?

@johannag126
Copy link

@yuvipanda thank you! I was able to log in with no issue

@damianavila damianavila moved this from Ready to work to In progress in DEPRECATED Engineering and Product Backlog Apr 22, 2022
@yuvipanda
Copy link
Member

@johannag126 great! I've now merged this and I think the hub is ready to go!

@rabernat @johannag126 can you speak more about the GPU requirement? we can open another issue to discuss that.

@rabernat
Copy link
Contributor

See #1237

@damianavila
Copy link
Contributor

Since we have a follow-up for GPU support, I think we can close this one now.

Repository owner moved this from In progress to Complete in DEPRECATED Engineering and Product Backlog Apr 25, 2022
@rabernat
Copy link
Contributor

rabernat commented May 2, 2022

Sorry to revive this old issue, but there is one final item we need to resolve before launching the LEAP and M2LInES hubs: the ability to customize the user image independently from the hardware profile. This is being discussed in:

Specifically, in jupyterhub/kubespawner#607, Yuvi has implemented the ability to have a dropdown menu of possible images for each profile. I want to state clearly that that is an acceptable solution to this issue and I would prefer to move forward with that, rather than iterating further on the design (as I had suggested earlier in jupyterhub/kubespawner#607 (comment)).

In particular, we need to make sure that GPU users can choose from TWO different possible images:

Given the velocity of change of these images, I do not think it is ever wise to use latest tags. We should always be pinning a specific version (this goes for pangeo-notebook as well). Ideally we would populate the list of available images with all of the possible recent versions, so that users can go back and forth between versions. We want users to become aware of which image and version they are using by exposing this clearly though our UI.

yuvipanda added a commit to yuvipanda/pilot-hubs that referenced this issue May 3, 2022
- Pins image versions too
- Can be consolidated once
  jupyterhub/kubespawner#607 lands

Ref 2i2c-org#1168 (comment)
@yuvipanda
Copy link
Member

@rabernat to not block m2lines usage on that PR merging, I've just added an extra profile with pytorch here: #1267. I've also pinned the images. We can consolidate once that PR lands.

@colliand colliand mentioned this issue Dec 1, 2023
13 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Development

No branches or pull requests

6 participants