-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[New Hub] Callysto #1439
Comments
After following the discussion in the leads repo, I tried to come up with a summary of the things we know so far and the info we need in order to deploy this hub 🚀 Please feel free to update it with any info that I've missed and ask any questions that I didn't. What we know
Info missing about the hub
Questionswhere @yuvipanda's thoughts can help ✨
|
Thanks for working on this, @GeorgianaElena! Cluster should definitely be regional! I think CILogon is the way to go here, as it lets us use Google / MS / Institutional providers. Question is, would we need to limit which institutions? We can probably limit them by listing out their cilogon IdP ids. Re: Toronto vs Montreal, my current suggestion is to pick Montreal because they do have GPU availability (https://cloud.google.com/compute/docs/gpus/gpu-regions-zones) so if we ever need GPUs in the future we can do that. GPUs nor dask are required now. For usernames, we should find something that is:
Does |
Who would manage the GCP billing account? |
Thanks @GeorgianaElena and @yuvipanda. We (Callysto/Cybera/PIMS) are going to manage the billing account and are currently investigating which public cloud provider (AWS, GCP, Azure) can provide us with cost-effective options that are located at a Canadian data centre. If GCP is what we go with then Montreal seems like quite a reasonable choice. Are there any specific requirements that 2i2c has that we should pass onto these cloud providers when we ask for options/quotes? |
Note, I don't think our investigation will take too long as we already have a rough estimate from one provider and have conversations going on with the other two. |
For contextual/additional information about the above assignations, @GeorgianaElena will be the lead developer for this hub with @yuvipanda's assistance as a secondary companion/helper/supporter. This is maybe already obvious from the above comments, just making it explicit 😉. |
@yuvipanda, according to the cilogon docs here and here, I believe so. They mainly talk about the cilogon |
@GeorgianaElena sounds great! So this is now waiting on figuring out cloud provider and getting back to us. |
@byrcyb do you have any updates about the cloud provider preference? Thanks! |
The person who's in charge of this is on vacation until Wednesday of this week. So, unfortunately, no new updates. I'll be in touch shortly once I hear anything. |
@byrcyb, do you have any updates? Btw, can you confirm the "real" usage will start by the Fall term (by the end of August, am I correct)? |
Afternoon team, @damianavila, @GeorgianaElena, @yuvipanda |
Thanks @beakkay! 2i2c regularly launches hub service on Google cloud so a review engagement with our engineering team will not be required on our side. We will just need access to the account to set up the hub infrastructure. This page in our documentation provides insights into cloud cost forecasting. Callysto collaborator @ianabc can likely also provide guidance on cloud costs. |
Morning,
Thank you for the response and information.
The technical meeting was requested by Google to determine infrastructure
needs and cost calculations.
If you can provide all requirements we need from the hosting platform then
I will gladly cancel the meeting and move forward.
Regards,
Elmar Bouwer
Project Manager
cybera.ca <https://www.cybera.ca/> | @cybera <https://twitter.com/cybera>
<http://cybera.ca/>
Alberta's not-for-profit organization responsible for driving economic
growth through the use of digital technology.
…On Fri, Jul 8, 2022 at 9:37 AM James Colliander ***@***.***> wrote:
Thanks @beakkay <https://github.com/beakkay>! 2i2c regularly launches hub
service on Google cloud so a review engagement with our engineering team
will not be required on our side. We will just need access to the account
to set up the hub infrastructure. This page in our documentation provides
insights into cloud cost forecasting
<https://docs.2i2c.org/en/latest/about/sustainability/costs/cloud.html>.
Callysto collaborator @ianabc <https://github.com/ianabc> can likely also
provide guidance on cloud costs.
—
Reply to this email directly, view it on GitHub
<#1439 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AYFKKW632DVMOG4DA6Q3FSDVTBDKTANCNFSM5ZJU4EMQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
@damianavila in terms of "real" usage, yes, Fall term is what we are aiming for but end of September is more the target date than end of August. |
@damianavila asked me to comment about what we would need from a cloud provider as 'requirements'. I think it's:
Additionally, we support all the three major cloud providers, but at least personally I'd love for it to not be Azure haha :) |
Note that these requirements are basically 'standard' - anyone creating any account on these cloud providers with a credit card basically meets all these requirements already. I'm listing them so additional restrictions don't get put on during negotiations with cloud provider. |
Afternoon, We have successfully created and updated the GCP environment on which the Callysto hub will be hosted. Thank you |
Owner access has been granted to the Open Engineering Team - let us know if there's anything else we should do to assist with locking down or removing folks in case of any staffing changes. |
Thanks @Chealion! Btw, can we go back and revisit the questions raised by @GeorgianaElena on this post: #1439 (comment). That information would be needed to start with the deployment process. Thanks! |
Yes, thanks @damianavila, we're working on this now. |
Here's what we have so far and a question about logos:
Montreal
Yes, we're ok with sticking with Google and Microsoft. We don't require the institutional identity providers anymore.
We have quite a few Hub and Image logos to choose from. Are there any specifications you require in terms of layout (e.g. vertical or horizontal), resolution, format or other?
We don't have any extra features needed at the moment but we are interested in having the classic / retro notebook as the default to start with for the moment. I don't know if that's something we would discuss now or later though. Thank you |
Thanks a lot for filling in this info @byrcyb.
Not really. This image will go right above the login button. Checkout the 2i2c staging hub https://staging.2i2c.cloud as an example. Also, do you have any preference regarding the hub URL? Is |
No problem, thanks for your patience on this @GeorgianaElena For the hub image logo you can use https://www.callysto.ca/wp-content/uploads/2022/08/Callysto-HUB_vertical.png We're still figuring out the hub URL question. My preference would be to keep it as hub.callysto.ca but I'm not sure if that will cause any problems since that's our current hub url. @ianabc thoughts? |
Yeah, our current hub is on hub.callysto.ca and it's been around for long enough that there's a lot of links pointing there. It might be worth doing something like 2i2c.callysto.ca for consistency with the domain, then we can update the DNS records for hub.callysto.ca after the move to make sure nobody get's lost. |
Updates
Question
@ianabc, do you want to set a CNAME to |
Thanks @GeorgianaElena. I've gone through the cilogon process and I'll send my openID in slack. I suspect that we will want to open the hub more generally to any microsoft or google account, which is what we are currently doing on hub.callysto.ca. Related to that though, we need to manage the migration of user data from the old hub to the new one. On the old hub, accounts are identified by a hash (computed as a function of their email or account identifier) and I think we are doing something similar on the new hub. I started writing an extension for the old hub to help us capture the relevant information, but we'll need to figure out what their new hash/identifier would be so we can map the accounts. Also, there might be a better way of doing this than the extension I was writing. Do you know how users are identified on the new hub? e.g. is their storage related to their OpenID value or something like that? |
I've added the CNAME record, it should be propagating now. |
Thanks a lot @ianabc! Hub is now running at https://2i2c.callysto.ca and I've added you as an admin and also removed the allowed users list, which means that it matches the access level of the old hub. |
Yes, so the hub should only be aware of the user's OpenIDs since these are the hub usernames. |
hi @GeorgianaElena I haven't had a chance to test all of the functionality yet, but, I was able to login successfully. A couple of initial questions:
|
Yay 🎉
I believe it took this long because the server spawning required a new node to be created and this takes a couple of minutes usually. The node spin up event, was probably followed by the user image needing to be downloaded since it was your first time logging in and starting a server which also took a couple of minutes I suspect.
We have docs about how to use these folders here: https://docs.2i2c.org/en/latest/admin/howto/data.html
I believe we have the infrastructure to change easily everything that's part of the hub UI, but I will need to double check these two as I'm not confident they are part of the hub's UI, but rather the notebook's. I will come back with a more clear answer asap.
Yuhuuuuu 🚀 |
Hi @GeorgianaElena I showed the hub to some high school educators on our team and there were some comments and questions about the CILogon page that is encountered when logging into the hub. Are there ways to customize:
I'm also not sure if this is part of the custom jupyter-server templates issue that you linked 😄 #1697 |
@byrcyb, this is now almost done. I have not deployed it to the main hub yet because I am waiting for feedback from the team, but you can check it out on the staging hub if you want, which is running at https://staging.callysto.2i2c.cloud/hub/home 🚀
Unfortunately these CILogon related customizations are not possible. But we will prioritize deliberating whether this is something we should support as part of our infrastructure. |
Thanks for the update @GeorgianaElena. I was doing more testing and noticed that downloading our Callysto notebooks as PDF via LaTeX or HTML throws a 500: Internal Server Error page. I'm not sure if this is a hub issue, perhaps something with nbconvert, or something else. Any help on this is greatly appreciated. Here's the error thrown when I try to download PDF via HTML:
The error log when I try to download PDF via LaTeX is much longer so I've attached it (as a PDF). |
infrastructure/config/clusters/callysto/common.values.yaml Lines 31 to 34 in 90512f2
|
Hub Description
2i2c will work with @ianabc from PIMS and @byrcyb from Cybera (perhaps others?) to transition the current Callysto hub to a new service managed by 2i2c.
The Callysto hub should be a "standard education hub" since it will be used by students and teachers in grades 5-12 who are making their initial forays into data-intensive studies.
Community Representative(s)
@ianabc @byrcyb
Important dates
Notes: target and required dates are outdated, we need to update them accordingly to new information and prioritization.
2022-07-08end of August?2022-07-15mid September?Hub Authentication Type
Other (may not be possible, please specify in comments)
Hub logo information
Hub user image
Extra features you'd like to enable
Other relevant information
No response
Hub URL
..2i2c.cloud
Hub Type
No response
Tasks to deploy the hub
The text was updated successfully, but these errors were encountered: