Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increase Job Throughput #4959

Closed
jmchilton opened this issue Nov 8, 2017 · 1 comment
Closed

Increase Job Throughput #4959

jmchilton opened this issue Nov 8, 2017 · 1 comment

Comments

@jmchilton
Copy link
Member

jmchilton commented Nov 8, 2017

With metadata handled as part of the remote process - Galaxy still struggles a bit when creating and finishing hundreds of jobs a time. I've spent a good amount of time working on this problem in 2016 as part of scaling up collections but I think I need to take some more passes. Here are some things I think should be done:

TODO:

DONE:

FAILED PRS:

@mvdbeek
Copy link
Member

mvdbeek commented Apr 16, 2021

I'm gonna call this one complete (even though we didn't do batch inserts yet). I think the next step is to break down pieces into tasks we can schedule with celery, where we're not limited by the GIL. Job handlers are often close to 100% in CPU utilization, so I don't think threading is going to help much there. It'll also be much easier to scale celery tasks based on load, and they can be retried and resumed more easily.

@mvdbeek mvdbeek closed this as completed Apr 16, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants