-
-
Notifications
You must be signed in to change notification settings - Fork 16.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataset autodownload feature addition #685
Conversation
Ah! Ok it needs a rebase then. Maybe this will work: /rebase |
7f699d0
to
13e8e6c
Compare
My default unit test passed! I didnβt use any auto download functionality though. How should I use it to download coco128? Would calling train auto download it if I donβt have the folder available? |
@NanoCode012 yes, now you can start training without downloading a dataset first! In this screenshot you can see we clone the repo, install requirements.txt, and run train.py right away. Each of the 3 datasets now have download directions in the yaml (either a zipfile URL, or a bash command). If it all works correctly you should see something like this: |
It works! Very simple and nice! However, I see that you output βtrain2017β directory twice. Is there a reason? Is it meant to be βval2017β? |
Ah, yes this is because coco128 trains and tests on the same 128 images. It's really meant as a sanity check to ensure your setup converges before trying larger datasets. |
* initial commit * move download scripts into data/scripts * new check_dataset() function in general.py * move check_dataset() out of with context * Update general.py * DDP update * Update general.py
* initial commit * move download scripts into data/scripts * new check_dataset() function in general.py * move check_dataset() out of with context * Update general.py * DDP update * Update general.py
* initial commit * move download scripts into data/scripts * new check_dataset() function in general.py * move check_dataset() out of with context * Update general.py * DDP update * Update general.py
Dataset autodownload branch initial commit.
π οΈ PR Summary
Made with β€οΈ by Ultralytics Actions
π Summary
Refinement of dataset handling and download procedures for YOLOv5.
π Key Changes
coco.yaml
,coco128.yaml
, andvoc.yaml
dataset configurations to include optional download commands.get_coco2017.sh
script and replaced it withget_coco.sh
in a new directoryscripts
.get_voc.sh
to thescripts
directory.test.py
andtrain.py
to call the new dataset check function and handle download if needed.π― Purpose & Impact
scripts
directory.