More granular control on caching of logs with dvclive #77
Comments
This will cause those files to be tracked by both DVC (as cached objects) and Git, which is a problem. It will eventually cause certain DVC commands to fail as we do not allow adding/tracking a file which is already tracked by Git. As you noted, what you really need is the ability to set specific Transferring this issue to the dvclive repo for now, but there may be a design reason for this limitation that I'm not aware of @pared @dberenbaum
HTML files are not tracked at all (but can be added under git control). DVC cache can track only the
@Benjamin-Etheredge What do you want to cache with DVC and what do you want to track with Git? It reads to me like you want to cache the logging directory with DVC and track the other live outputs with Git, which I believe is the default behavior.
Sounds like since the
Indeed. I was waiting to close this after the docs are merged
OK I see it's linked so yeah it will get closed. But for clarity maybe you want to explicitly answer here on how things are now relating to granular caching of logs? p.s. maybe even move it to the
The `cache` flag under `live` in `dvc.yaml` seems to be an all-or-nothing flag (i.e., all log files, summaries, and HTML must be cached, or none of them). The logging directories, which contain each iteration of data, are less likely to be checked into Git than the summaries are, because the logged iteration data will always produce a large number of differences. Summaries are generally smaller and thus a prime candidate for being tracked with Git.

Currently, this can be done by setting `cache` to `true` and removing summary files from `.gitignore`. This seems counter to the intentions of DVC providing the `cache` option. I'm also unsure of the implications of doing this. Does DVC still track that item? Is it now duplicated in Git and DVC tracking?

Adding options for individually caching the outputs of the `live` tag would allow for easier workflows when only the summaries from logging are to be tracked.