-
Notifications
You must be signed in to change notification settings - Fork 611
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Iceberg table support #7712
Comments
I see a number of technical issues with the iceberg python client that I think are blockers for using it as the basis for iceberg support in Ibis:
At the very least, we'd need to be able to get back a PyArrow Dataset that can be streamed into a query engine like DuckDB before we can consider using the iceberg python client. |
I think a better option might be https://duckdb.org/docs/extensions/iceberg.html at least for DuckDB. |
This, at least, is resolved. :) |
@deepyaman any interest in taking a stab at this? |
Is this ready for implementation? It seems @cpcloud's first concern is resolved, but is the second one? We could get a |
As far as I know, Iceberg is a table format for compute engines (e.g., Spark) to work with. Along that line, I think it is expected for As raised in 2. above,
Looking at the implementation of @cpcloud Do these points make sense to you? If they do, I can take a stab at this issue. Disclaimer: My understanding of Iceberg might not be fully correct as my knowledge of Iceberg is limited :) Refs: |
Is your feature request related to a problem?
Support Iceberg tables in Ibis
Using: https://github.com/apache/iceberg-python
Main blocker is write support, tracked here: apache/iceberg-python#23
Describe the solution you'd like
ibis.read_iceberg
table.to_iceberg
What version of ibis are you running?
n/a
What backend(s) are you using, if any?
local backends that would support this
Code of Conduct
The text was updated successfully, but these errors were encountered: