-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Specifying path for pl.read_delta
#5785
Comments
Hmm.. I see that |
Before this new Polars functionality, I would do: from deltalake import DeltaTable
dt = DeltaTable(path)
df = pl.from_arrow(dt.to_pyarrow_table()) But I'm not sure how this would work in a 'scanning' fashion. Maybe @chitralverma has some input here :) |
Let me check it out today |
That's the
Hi @stinodego so there are 2 issues with this currently and both of them were not caught during unit tests,
for the second issue, the reason is that the To get around this i have relied on I think of quickly fixing the Hope this clarifies your doubts? |
I have also opened, an issue on delta side for this, if they fix it then it should be quite straightforward. |
@chitralverma maybe we can pickle a function that imports the If that does not work, we can pickle a string that we can run with |
This is what's happening currently, we are calling the I tried some workarounds but it wont pickle |
Does it cloudpickle? We could also add a cloudpickle version of |
Nope
Neither does dill |
So, got some updates from the delta team regarding this here Apparently, this is not natively supported by pyo3 modules. I've already fixed things for |
Can this one be closed or is this still an issue? |
@ritchie46 it should be closed as I have tested for this, but let's keep this open for a while till someone else confirms? |
Nah, I will reopen if it is not. :) |
Polars version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of Polars.
Issue description
I was very excited to start using the Delta reading functionality added in #5761. There seem to be some issues with the path parameter though.
It appears that:
Reproducible example
This assumes you have a delta table in the
data/test_delta
directory.Expected behavior
I expect the behaviour of
pl.read_delta()
to be identical todeltalake.DeltaTable()
. In the cases above, this is not the case.Installed versions
Deltalake version 0.6.4
The text was updated successfully, but these errors were encountered: