Support Athena with Delta tables #6351
Comments
Yes, it queries deleted files also.
Some additional context: Blindly exporting symlinks is a bad idea for open table formats. Symlinks work with Hive-style tables, so any Parquet file appearing in the symlink will be queried, and partition information might be lost. So that's a no-go. An almost viable option was to export a "shallow clone" of the Delta table: as a post-hook, write a single file, named I tested this (without lakeFS) by constructing a JSON file that points to arbitrary Other options that might work:
Not a fan of any of the above :)
This issue is now marked as stale after 90 days of inactivity, and will be closed soon. To keep it, mark it with the "no stale" label.
Closing this issue because it has been stale for 7 days with no activity.
Currently, using Athena with lakeFS works by registering symlinks in Glue.
For Delta tables, this won't work (or worse: it will cause deleted Parquet files to be queried as well).
For Delta, we should either generate symlinks based on the Delta log or find another way to query lakeFS from Athena.
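
To make the "symlinks based on the Delta log" option concrete, here is a minimal sketch of how the live file set could be derived by replaying the log. It assumes the commit files under `_delta_log/` have already been read into strings in ascending version order, and it only looks at the `add` and `remove` actions. The function names `live_files` and `symlink_manifest` are hypothetical and not part of lakeFS. Checkpoint files, URL-encoded paths, and per-partition manifests are deliberately ignored here.

```python
import json


def live_files(commits):
    """Replay Delta commit files and return the data files still in the table.

    `commits` is an iterable of strings, each holding the newline-delimited
    JSON content of one _delta_log commit file, in ascending version order.
    Assumption: no checkpoints are involved, only JSON commits.
    """
    live = set()
    for content in commits:
        for line in content.splitlines():
            line = line.strip()
            if not line:
                continue
            action = json.loads(line)
            if "add" in action:
                live.add(action["add"]["path"])
            elif "remove" in action:
                # A removed file must not appear in the symlink manifest.
                live.discard(action["remove"]["path"])
    return sorted(live)


def symlink_manifest(table_root, commits):
    """Build manifest text: one absolute path per live data file."""
    root = table_root.rstrip("/")
    return "\n".join(f"{root}/{path}" for path in live_files(commits))
```

With this approach, a file that was added in one commit and removed in a later one never reaches the manifest, which is exactly what blind symlink export gets wrong. The manifest would still need to be regenerated after every commit to stay consistent with the table.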