-
-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: minimal support for dask.dataframe query planning (dask-expr) #285
Merged
jorisvandenbossche
merged 12 commits into
geopandas:main
from
jorisvandenbossche:dask-expr
May 6, 2024
Merged
Changes from 1 commit
Commits
Show all changes
12 commits
Select commit
Hold shift + click to select a range
9567c21
duplicate core.py
jorisvandenbossche d28521f
support dask-expr
jorisvandenbossche 80976e6
enable in tests
jorisvandenbossche 01f0b86
Merge remote-tracking branch 'upstream/main' into dask-expr
2b9af30
fixed from_dask_dataframe geometry
4ee6a51
Fixed from_geopandas, to_dask_dataframe
720fa97
fixed test
d850239
Merge pull request #1 from TomAugspurger/dask-expr
jorisvandenbossche 1000144
restore to_dask_dataframe (fix to_legacy_dataframe)
jorisvandenbossche 5592911
fix doc requirements
jorisvandenbossche 501f8dc
small cleanup
jorisvandenbossche 8ef4966
add one build with recent but pre-expressions dask version
jorisvandenbossche File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@TomAugspurger I renamed this back to the original name, but now this is fine because dask-expr upstream renamed the
to_dask_dataframe
they added toto_legacy_dataframe
, so there is no longer a name clash.to_legacy_dataframe
will convert to a legacy dask.dataframe collection, which is needed for certain implementations that still use the original implementation (e.g. dask still does this for parquet IO right now).While our
to_dask_dataframe
is meant to convert your dask-geopandas object to a dask object (regardless of it being a legacy collection or a new expression).(and so naming this implementation
to_legacy_dataframe
actually broke the parquet tests, because it is not doing what dask is expecting, i.e. it doesn't return a legacy collection, just the same object but where the partitions are pd.DataFrames instead of GeoDataFrames)There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, makes sense, thanks.