Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Formalize contract between XArray and the dask.distributed scheduler #1644

Closed
jhamman opened this issue Oct 21, 2017 · 1 comment
Closed

Comments

@jhamman
Copy link
Member

jhamman commented Oct 21, 2017

From @mrocklin in pangeo-data/pangeo#5 (comment):

XArray was designed long before the dask.distributed task scheduler. As a result newer ways of doing things, like asynchronous computing, persist, etc. either don't function well, or were hacked on in a less-than-optimal-way. We should improve this relationship so that XArray can take advantage of newer dask.distributed features today and also adhere to contracts so that it benefits from changes in the future.

There is conversation towards the end of dask/dask#1068 about what such a contract might look like. I think that @jcrist is planning to work on this on the Dask side some time in the next week or two.

There is a new "Dask Collection Interface" implemented in dask/dask#2748 (and the dask docs docs).

I'm creating this issue here (in addition to pangeo-data/pangeo#5) to track design considerations on the xarray side and to get input from the @pydata/xarray team.

cc @mrocklin, @shoyer, @jcrist, @rabernat

@jhamman
Copy link
Member Author

jhamman commented Nov 14, 2017

closed via #1674

@jhamman jhamman closed this as completed Nov 14, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant