-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
get: allow downloading regular files/dirs tracked by Git #2515
Comments
Cc @shcheklein Why Thanks |
Related to #2507 |
Is there an agreement on this one, @efiop ? |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
A few questions regarding this issue: What protocols should
Have the questions above by @jorgeorpinel been resolved? Is it supposed to some sort of compatibility with the future Should it be possible to pipe the file to stdout for inspection with a pager such as As for implementation - could it be as simple as checking if the file provided to Lines 21 to 64 in 3de46dd
Is there a helper to identify dvc files besides just looking at the file suffix? |
You mean for |
Import doesn't support regular files IIRC. Cached directories are supported, so it would be great to support non-cached ones too as a part of this ticket 🙂 Recursive option doesn't exist for any of the commands, so it is not part of this ticket (though I'm not even sure what @jorgeorpinel meant by recursive, maybe he could clarify)
Well, it is more about
Not required and I don't really see a need for that functionality. Also, how would that even work with directories? 🙂It is not the functionality meant for
Not sure what you mean, Dvcfiles are not supplied to |
Agree with Ruslan. We would basically be using Git to download files in the repo so we will support everything Git does. I didn't get the part about So
These Qs haven't really been decided Dani, thanks for checking. I vote to work on
What do you mean @efiop? Not sure the cache has much to do with this issue. It's really about files hosted in the Git repo vs. files referenced in the DVC-files hosted in the Git repo, right? Also agree with Ruslan about |
I was talking about outputs of dvc files that are not cached by dvc, so possibly tracked by git @jorgeorpinel . E.g. |
Yes, that is what I was saying. Note that we might have external outputs (e.g. s3://bucket/data), but those are not supported and shouldn't be for now, so I was just clarifying. |
Kudos to @danihodovic for PR #2837 which seems to get most of this resolved, even including support for downloading entire directories from Git. Just pending to check whether |
Allows `dvc get` to copy regular files or directories. fixes: iterative#2515
Allows `dvc get` to copy regular files or directories. fixes: iterative#2515
As we decide how to list data artifacts in external DVC repos in #2509 it became apparent listing regular files along with stage outputs could be especially useful. See #2509 (comment).
This made us think also then users could want to
get
some of those individual regular files after seeing them. (See #2509 (comment))The text was updated successfully, but these errors were encountered: