-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Concurrent task iteration support #422
Comments
Do you think batches enumerators could help? (see #409) Regarding actual parallelism when running tasks, it's something that we're thinking about but we haven't made any formal plans so we can't make any promise. We can keep this issue open to continue thinking about it, start fleshing out an API, behaviour, figure out the edge cases (e.g. it will require special handling for custom enumerators which may not have a way to start a cursor randomly, but only give out one item at a time), etc. |
Batches could help with some of our task types, yes! But we have other types of tasks that require, for example, calling an external API with an individual record and then saving that value to our database, so the batching would remove some of the overhead of the job queue itself but wouldn't give us the speed up that we would get from concurrency. |
This issue has been marked as stale because it has not been commented on in two months. |
We would still like this! |
This issue has been marked as stale because it has not been commented on in two months. |
We would still really like this |
This would be incredibly useful. |
This issue has been marked as stale because it has not been commented on in two months. |
Still valid |
This issue has been marked as stale because it has not been commented on in two months. |
still relevant |
This issue has been marked as stale because it has not been commented on in two months. |
Still, I would love this. |
This issue has been marked as stale because it has not been commented on in two months. |
Still want this. |
Over at https://github.com/shopify/flow we have been trying to adopt the maintenance task framework and have enjoyed the benefits for our small data migrations, but our main hangup is the loong runtimes of tasks that need to operate on large datasets (e.g. all records in one table table - tens of thousands now, will be much more the future). When we tried running a recent data migration via maintenance task recently, the total time to execute would have been months.
As such, our main desire with this library would be declarative concurrency support. Is #325 (comment) still the recommendation for concurrency in the future of this library?
No immediate need for action on this - we just wanted to provide feedback on our adoption!
The text was updated successfully, but these errors were encountered: