Support shared memory with Queues #539
Comments
I suspect the main optimization targets here are task payloads and results. Are those serialized on the Client and then deserialized on the Worker (or vice versa)? Is it common to have the Client and Workers on the same machine (why wouldn't you simply use plain Dask in that case)?
Yes. Avoiding the serialization of complex objects like functions or locks can also be convenient.
Yes. Also Worker-to-Worker communications.
People do just use the dask.async scheduler in this case today. However, the distributed scheduler has some non-distributed advantages like asynchronous computing, the concurrent.futures interface, the Bokeh dashboard, etc. This would also lay groundwork for thinking about alternative IOStream implementations, which might help with issues like #400 (GPUs), communication between same-node processes, ZeroMQ, etc. None of this is critical though, and there may be more important things to work on.
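For context, a minimal sketch of the `concurrent.futures`-style interface mentioned above, using the public `dask.distributed` Client API; the cluster defaults (local workers, threads vs. processes) are left to the library and are not taken from this issue:

```python
# Sketch: submitting work through the distributed Client's
# concurrent.futures-style interface.  Client() is assumed to start a
# default local cluster; exact defaults are not specified in this issue.
from dask.distributed import Client

def inc(x):
    return x + 1

if __name__ == "__main__":
    client = Client()                     # local scheduler + workers
    future = client.submit(inc, 41)       # returns a Future immediately
    print(future.result())                # blocks until the result arrives: 42
    futures = client.map(inc, range(10))  # submit many tasks at once
    print(client.gather(futures))         # collect all results
    client.close()
```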
I'm curious about the serialization of lock objects; can you point me to where/why that happens?
What I mean is that sometimes people want to send hard-to-serialize objects as part of their function call. One such example is when they want to use a lock. This happens, for example, when people try to use HDF5 as in this issue: dask/dask#1683
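As a small illustration (not taken from the linked issue), a lock fails at the serialization step itself; in a single shared process it could simply be passed by reference instead:

```python
# Sketch: a threading.Lock has no pickle support, so any attempt to ship it
# to a worker as part of a task fails at the dumps step.
import pickle
import threading

lock = threading.Lock()

try:
    pickle.dumps(lock)
except TypeError as exc:
    # e.g. "cannot pickle '_thread.lock' object"
    print("serialization failed:", exc)

# In a single shared process there is nothing to serialize: the same lock
# object could be handed to every task directly.
```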
This relates to #604.
Currently the Client, Scheduler, and Workers always communicate over Tornado IOStreams, serializing messages as defined in `distributed/protocol.py` and `distributed/core.py::dumps()`. This is necessary if workers are on different machines or if the user wants to use multiple processes, but unnecessarily wasteful if the user wants to use a single process on a single machine. In this case it would be nice to pass Python objects directly between Client/Scheduler/Workers without serialization, perhaps using Tornado `Queue`s (or something similar).

However, this isn't as simple as optionally switching IOStreams for Queues. We choose to serialize and deserialize data in certain places. For example, when the client sends functions to the scheduler it serializes on the client side but does not deserialize on the scheduler (the scheduler never deserializes functions or data). When the time is right, the scheduler sends the serialized data to a worker, which then deserializes the bytes back into a function. Because of this, dumps/loads calls are not always symmetric, and they are scattered throughout the code rather than isolated to a particular location.
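A toy illustration of that asymmetry (the function names below are invented for this sketch and do not reflect the actual distributed code path): the client calls dumps, the scheduler only routes opaque bytes, and only the worker calls loads.

```python
# Toy sketch of asymmetric dumps/loads: serialization on the client,
# pass-through on the scheduler, deserialization on the worker.
import pickle

def client_submit(func, *args):
    # client side: serialize the function and its arguments
    return {"task": pickle.dumps((func, args))}

def scheduler_route(msg):
    # scheduler side: never deserializes user functions or data,
    # it only decides where to send the opaque payload
    return msg["task"]

def worker_execute(payload):
    # worker side: turn the bytes back into a function and run it
    func, args = pickle.loads(payload)
    return func(*args)

msg = client_submit(pow, 2, 10)
payload = scheduler_route(msg)
print(worker_execute(payload))   # 1024
```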
This problem might be easier to solve if we consider subclassing IOStream. This would allow us to push some of the `protocol.py` logic into the Stream class and would also let us attach `.dumps` and `.loads` methods onto the object, which could be something like `pickle` for socket-based streams and a no-op for queue-based streams.

cc @pitrou
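A hedged sketch of that idea, with hypothetical class names rather than the real distributed API: each transport carries its own `dumps`/`loads`, which are `pickle` for socket-backed streams and a no-op for same-process, queue-backed streams.

```python
# Sketch only: hypothetical channel classes, not the distributed codebase.
import pickle
from queue import Queue

class SocketChannel:
    """Cross-process transport: messages must become bytes to cross the wire."""
    dumps = staticmethod(pickle.dumps)
    loads = staticmethod(pickle.loads)

    def __init__(self, sock):
        self.sock = sock                      # a connected socket.socket

    def send(self, msg):
        self.sock.sendall(self.dumps(msg))    # framing omitted for brevity

class QueueChannel:
    """Same-process transport: Python objects are handed over untouched."""
    dumps = staticmethod(lambda obj: obj)     # no-op
    loads = staticmethod(lambda obj: obj)     # no-op

    def __init__(self):
        self.queue = Queue()

    def send(self, msg):
        self.queue.put(self.dumps(msg))

    def recv(self):
        return self.loads(self.queue.get())
```

Code that writes to a channel would then call `channel.dumps(msg)` without caring whether that actually serializes anything, which keeps the dumps/loads decision in one place instead of scattered throughout the code.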