-
Notifications
You must be signed in to change notification settings - Fork 416
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Creating DeltaTable object slow #2518
Comments
Did you checkpoint? |
Haven't come across that before, how do I do it? |
With the latest version it should automatically checkpount every 100 commits, but you can also manually do it by doing DeltaTable.create_checkpoint() |
Edit: disregard these numbers, we were running on debug mode 😅 |
@ion-elgreco Shall I make a PR to document checkpointing a bit more? |
@braaannigan improving our documentation is always welcome! I'm going to close this in the meantime |
Environment
Delta-rs version: 0.17.4
Binding: python
Environment:
Bug
What happened:
I have a DeltaTable on S3 partitioned by date with about 60 dates. The partitions have been compacted and vacuumed so have one file each. I append to this table 100 times a day so the transaction log has about 6000 json files.
When I try to create the DeltaTable object it takes 30 seconds.
What you expected to happen: I expected this operation to be faster but I'm not sure if that's a reasonable expectation?
How to reproduce it: No repro example, I'm just trying to establish if there is something unusual here
More details:
The text was updated successfully, but these errors were encountered: