Fix un-merged frames #4666

Merged
merged 3 commits into from
Apr 2, 2021

@mrocklin (Member) commented Apr 1, 2021

mrocklin added 2 commits April 1, 2021 17:30
Previously we would re-serialize an object, even if it was a Serialized
object.  Instead we should just unpack its header and frames and be
done.
mrocklin changed the title from "[WIP] Fix un-merged frames" to "Fix un-merged frames" Apr 1, 2021
@mrocklin (Member, Author) commented Apr 1, 2021

OK, I think that we might be ok here

@jakirkham (Member) commented

Awesome thanks for figuring this out Matt 😄

jakirkham merged commit f2b516a into dask:main Apr 2, 2021
mrocklin deleted the fix-split-frames branch April 2, 2021 01:02
douglasdavis pushed a commit to douglasdavis/distributed that referenced this pull request Apr 2, 2021
* Add test for un-merged frames

* Don't double-split/compress Serialized frames

Previously we would re-serialize an object, even if it was a Serialized
object.  Instead we should just unpack its header and frames and be
done.

* specify num_sub_frames in all cases
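
For readers skimming the commit messages above, the idea can be sketched roughly as follows. This is an illustrative toy, not the code from this PR: the `Serialized` dataclass, the `serialize` helper, the `to_header_and_frames` function, and the `num_sub_frames` header key are all hypothetical stand-ins chosen to mirror the wording of the commits.

```python
import pickle
from dataclasses import dataclass


@dataclass
class Serialized:
    """Hypothetical stand-in for an object that is already serialized."""
    header: dict
    frames: list


def serialize(obj):
    """Toy serializer (assumption): pickle the object into a single frame."""
    return {"serializer": "pickle"}, [pickle.dumps(obj)]


def to_header_and_frames(obj):
    """Return (header, frames) without serializing Serialized objects again."""
    if isinstance(obj, Serialized):
        # Already serialized: unpack the existing header and frames and be done.
        header, frames = obj.header, obj.frames
    else:
        header, frames = serialize(obj)
    # Record the frame count in both branches so a reader of the header
    # always knows how many frames belong to this object.
    header = {**header, "num_sub_frames": len(frames)}
    return header, frames


# Usage: frames of a pre-serialized object pass through untouched.
pre = Serialized({"serializer": "pickle"}, [b"abc", b"def"])
header, frames = to_header_and_frames(pre)
```

In this sketch the two byte strings in `pre.frames` come back unchanged, which is the behavior the second commit message describes; how the real code splits or compresses frames is not shown here.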
Development

Successfully merging this pull request may close these issues.

dd.read_parquet + compute raises _pickle.UnpicklingError: pickle data was truncated for multiple parquet files