You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
File ~/mambaforge/envs/xgboost-2-27/lib/python3.10/site-packages/distributed/core.py:820, in _handle_comm()
818 result = handler(**msg)
819 if inspect.iscoroutine(result):
--> 820 result = await result
821 elif inspect.isawaitable(result):
822 raise RuntimeError(
823 f"Comm handler returned unknown awaitable. Expected coroutine, instead got {type(result)}"
824 )
File ~/mambaforge/envs/xgboost-2-27/lib/python3.10/site-packages/distributed/scheduler.py:5684, in gather()
5681 else:
5682 who_has[key] = []
-> 5684 data, missing_keys, missing_workers = await gather_from_workers(
5685 who_has, rpc=self.rpc, close=False, serializers=serializers
5686 )
5687 if not missing_keys:
5688 result = {"status": "OK", "data": data}
File ~/mambaforge/envs/xgboost-2-27/lib/python3.10/site-packages/distributed/utils_comm.py:91, in gather_from_workers()
89 for worker, c in coroutines.items():
90 try:
---> 91 r = await c
92 except OSError:
93 missing_workers.add(worker)
Training an XGboost model on a dask-cudf dataframe fails due to a serialization error when run on a cluster with
jit_unspill=True
:throws:
This same code works as expected when
jit_unspill=False
Full Traceback
Versions:
Dask-cuDF: 23.04 (also occurs with 23.02)
Dask-cuda: 23.04 (also occurs with 23.02)
XGBoost: 1.7.1
The text was updated successfully, but these errors were encountered: