which fails with [500] Internal: Server error: UDF Exception during Spark execution: ... ValueError: in_executor=True, indicating the UDF ran in an executor
(I stumbled on this issue while working on #437 / Open-EO/openeo-python-driver#197)
#251 / #262 added parallelized UDF execution on vector cubes (udf_apply_feature_dataframe and udf_apply_udf_data entrypoints), as documented at https://github.com/Open-EO/openeo-geopyspark-driver/blob/1f0ad56cc749d9f3ade315a85f39f1200f74168c/docs/vectorcube-run_udf.md . The idea was to get parallelization and executor isolation automatically by using pyspark.pandas with apply.
However, it seems that a pyspark.pandas apply callback does not run in the executors, but just in the driver.

Example snippet to illustrate:
This fails with:
Internal: Server error: ValueError('in_executor=False')
indicating the callback did not run in an executor.
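
For reference, a minimal sketch of how a callback could report where it runs. This is not the snippet from this issue: the names in_executor and callback are hypothetical, and the check assumes that pyspark.TaskContext.get() returns None outside of a running Spark task (for example in the driver).

```python
def in_executor() -> bool:
    """Best-effort check: True when called from inside a running Spark task."""
    try:
        from pyspark import TaskContext
    except ImportError:
        # Without pyspark there is no Spark task this code could be part of.
        return False
    # Assumption: TaskContext.get() is None outside a task (e.g. in the driver).
    return TaskContext.get() is not None


def callback(row):
    # Hypothetical callback: fail loudly and report where it was executed,
    # so the resulting error message shows "in_executor=True" or "in_executor=False".
    raise ValueError(f"in_executor={in_executor()}")
```

Passing such a callback to apply would surface in_executor=False in the error message if the callback is evaluated in the driver process.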