-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cloud versioning: slow #8359
Comments
In the time it takes to complete a regular |
Seconded, it would be nice to at least see some logging output with |
Another concern here is whether the current |
Tested with 2800 images taken from https://github.com/iterative/dataset-registry/blob/master/use-cases/cats-dogs.dvc. Results with cloud versioning:
Results without cloud versioning:
|
Results for a slightly larger dataset. With cloud versioning:
Without cloud versioning:
I can follow up with profiling for each. |
Some profiling done with the data from s3://dave-sandbox-versioning/registry-cloud-versioned/cats-dogs/. For regular cache, here's the zip of the yappi cachegrind output per thread: Here's the same for Edit: And here's Haven't looked through the output at all, but I notice that there are 45 threads/outputs from the regular cache but only 5 from the cloud-versioned ones. Is that expected? |
After iterative/dvc-data#246, the only blocker should be pushing incremental changes to a
The |
Part of #7995
Description
Cloud versioning is working much slower than traditional remotes
Reproduce
The text was updated successfully, but these errors were encountered: