-
-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Custom Serialization for Task Args #2953
Comments
cc @mrocklin (since it looks like you requested this issue be raised) |
This was my resposne:
|
What specifically would you see disrupting performance significantly? This would be the cascading lookup to detect serialization overrides on each argument? So long as there is not a performance disruption in the case where there are no overrides present, it's really up to the user if they want to incur the overhead of applying overrides. I would think this is possible to accomplish. |
Right, so I think that we agree that there are two important cases here: future = client.submit(func, my_big_object) # want to serialize separately
future = client.submit(func, 123) # don't want to serialize separately So we would need a clear and generic way to differentiate one from the other that works under most contexts. We do this currently by checking
It's not entirely clear to me how to check for this. There are a few different seriailization families. Also you might (?) want to handle nesting within tuples/lists/dicts. We would also want to apply this uniformly across the various APIs, like |
The arguments of a task submitted to the scheduler are currently serialized using
pickle
and will not use any custom serialization (warn_dumps ⟶ pickle.dumps
#2110 (comment)). This is also demonstrated by the below example.Will always raise the
ValueError
set inFoo
.Originally posted by @milesgranger in #2469 (comment)
The text was updated successfully, but these errors were encountered: