-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update: how to update specific rev import? Clarify docs. #735
Comments
To confirm that all the files are on the remote, I go to my local copy of the dataset-registry repo and run: $ git remote -v
origin [email protected]:iterative/dataset-registry.git (fetch)
origin [email protected]:iterative/dataset-registry.git (push)
$ git rev-parse --short HEAD
99d1cdb
$ tree use-cases/cats-dogs --filelimit 3
use-cases/cats-dogs
└── data
├── train
│ ├── cats [1000 entries exceeds filelimit, not opening dir]
│ └── dogs [1000 entries exceeds filelimit, not opening dir]
...
$ dvc push
Everything is up to date. Note that the
|
Can reproduce. Investigating. |
This comment has been minimized.
This comment has been minimized.
Ah, alright, I've missed this: |
@jorgeorpinel to update to cats-dogs-v2, you need to re-import it. There is no option for |
@jorgeorpinel We can introduce that option though 🙂 Something like |
@efiop what would happen if I don't specify initially a Btw, what are the differences between running |
@shcheklein Yes, it will import from default branch and if it moves then
Update moves within the specified |
Ah, thanks for the info! Interesting indeed. It makes sense...
Re-import it how? Using the commit hash instead of the tag name? That will detect the commit's branch when updating? (I want to import an older version and then update it to the latest.)
Yes, I was going to suggest that also. |
Hm. Sounds like I can't do what I wanted then (import an older version and then update it to the latest), at least not with What happens if I (re-)import with Besides or instead of adding |
You can (and probably should). As an illustration, it won't be runnable. I think it will a regular workflow in certain case to |
What do you mean?
Even if you specify |
I mean that user won't be able to to run
no, I hope that when you use |
@jorgeorpinel To understand the behavior of
If you understand properly how For the same reason I think that adding a |
Makes sense about moving But I don't agree that it would be wrong or unexpected that However I don't have a strong opinion either way. I don't think either approach is "wrong"; It's just a question of product design. Of course its easier to not change/complicate it if there's no compelling reason. We |
@jorgeorpinel agreed on updating docs! (as usual, haha :)) I would clarify though that It does not update only when you explicitly told it to use an exact commit. It should be the same as regular package managers behave if you specify them a link to an exact commit on Github. Their update logic will keep them as is. And will be updating only if you use a branch name as a rev. |
Agreed. I like the package manager analogy. Maybe we should've called these features |
… but also for use-case: add expandable sections to new data registry case per #679 (comment) and other misc. copy edits. Also standardizes term "external" (repo) vs. "source" data/project in this context and introduces the term "revision fixing".
Adding details on all this to the docs in 131af1e! Should we also open a ticket to give special output from the command itself when it detects a fixed |
It can be just a status message (not WARNING or something like that - since there is nothing wrong) that this file is up to date since exact commit is specified and there is nothing to update. |
Opened iterative/dvc/issues/2696. |
UPDATE: See #735 (comment) and under for concepts that should be better explained in docs.
Notice how the last command still counts 500 images each in cats-dogs/data/train cats/ and dogs/, however the latest version of
cats-dogs.dvc
in https://github.com/iterative/dataset-registry/tree/master/use-cases should track 1000 files in each.The text was updated successfully, but these errors were encountered: