
How to train objects365 without auto download the dataset. #4658

Closed
nocolour opened this issue Sep 3, 2021 · 14 comments
Labels
question (Further information is requested), Stale (scheduled for closing soon)

Comments

@nocolour

nocolour commented Sep 3, 2021

Dear all,

Have a nice day.

I have succeeded in training on VisDrone, and now I would like to try Objects365, but I am having trouble downloading it.

My problem:
The Objects365 dataset is very large, so the download times out and never completes when running train.py --data Objects365.yaml.

To work around this, I am using a download manager to download the files manually, one by one.

My question:

  1. If I download the Objects365 dataset manually, where do I need to unzip it, and what folder structure is expected?
  2. If I run python train.py --data Objects365.yaml after downloading manually, do I need to disable the download in Objects365.yaml?
  3. How much free disk space is needed for Objects365?
  4. How many epochs are needed? Is 300 enough?
  5. Do I need to use --hyp hyp.finetune_objects365.yaml, or the default 'data/hyps/hyp.scratch.yaml'?

Thanks for your help.

@nocolour nocolour added the question label on Sep 3, 2021
@glenn-jocher
Member

@nocolour I haven't had enough opportunity to train on Objects365 to answer your questions well, but yes, if you download it yourself you should comment out the download field in the yaml so it doesn't try to download it again. You should place your data in the structure indicated in the yaml here. You can use either the default hyps or the Objects365 hyps you mentioned to train, though you should not need 300 epochs: that number comes from COCO, and Objects365 is much larger.

# Train/val/test sets as 1) dir: path/to/imgs, 2) file: path/to/imgs.txt, or 3) list: [path/to/imgs1, path/to/imgs2, ..]
path: ../datasets/Objects365 # dataset root dir
train: images/train # train images (relative to 'path') 1742289 images
val: images/val # val images (relative to 'path') 5570 images
test: # test images (optional)
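
For reference, a directory layout matching that yaml would look roughly like the sketch below. The labels/ paths are an assumption based on the usual YOLOv5 convention of a labels/ tree mirroring images/:

../datasets/Objects365/
    images/
        train/   # training images (*.jpg)
        val/     # validation images (*.jpg)
    labels/
        train/   # YOLO-format *.txt labels, one file per image
        val/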

@nocolour
Author

nocolour commented Sep 6, 2021

Thank you for the reply.

@nocolour nocolour changed the title from "How to train object365 without auto download the dataset." to "How to train objects365 without auto download the dataset." on Sep 6, 2021
@nocolour
Author

nocolour commented Sep 7, 2021

@glenn-jocher
In the downloaded zip files I don't see any label text files; there is only the JSON file from zhiyuan_objv2_train.tar.gz.
Is zhiyuan_objv2_train.json the label file? Do I need to convert it, and if so, what do I need to do?

@glenn-jocher
Member

@nocolour autodownload handles all of the conversion; I would recommend you simply use that:

python train.py --data Objects365.yaml
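
For reference, the conversion performed during autodownload is roughly equivalent to the sketch below: it reads the COCO-format JSON with pycocotools and writes one normalized xywh *.txt label file per image. The output path and the category-id offset here are assumptions; the download script inside Objects365.yaml is the authoritative version.

# Minimal sketch, assuming pycocotools is installed and
# zhiyuan_objv2_train.json is in the working directory.
from pathlib import Path
from pycocotools.coco import COCO

coco = COCO('zhiyuan_objv2_train.json')                 # Objects365 train annotations (COCO format)
out_dir = Path('../datasets/Objects365/labels/train')   # assumed YOLO labels directory
out_dir.mkdir(parents=True, exist_ok=True)

for img_id in coco.getImgIds():
    img = coco.loadImgs([img_id])[0]
    w, h = img['width'], img['height']
    lines = []
    for ann in coco.loadAnns(coco.getAnnIds(imgIds=[img_id])):
        x, y, bw, bh = ann['bbox']                      # COCO bbox: top-left x, y, width, height
        cx, cy = x + bw / 2, y + bh / 2                 # YOLO expects the box center
        cls = ann['category_id'] - 1                    # assumed: Objects365 category ids start at 1
        lines.append(f'{cls} {cx / w:.6f} {cy / h:.6f} {bw / w:.6f} {bh / h:.6f}')
    name = Path(img['file_name']).stem + '.txt'
    (out_dir / name).write_text('\n'.join(lines))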

@nocolour
Author

nocolour commented Sep 7, 2021

Yes, I have already found a way to solve it, but I still need to test it first: I will set up a localhost web server to host the dataset I downloaded, and then change the download URLs in Objects365.yaml.
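
For example, a minimal way to do this (assuming Python 3.7+ and that the downloaded archives sit in ./objects365_downloads, a placeholder path; the port is arbitrary):

python -m http.server 8000 --directory ./objects365_downloads

Then the archive URLs in the download section of Objects365.yaml can be pointed at http://localhost:8000/ instead of the original hosts.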

However still thank you.

@glenn-jocher
Member

@nocolour oh good idea!

@github-actions
Contributor

github-actions bot commented Oct 8, 2021

👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!

@github-actions github-actions bot added the Stale label on Oct 8, 2021
@wangsun1996

Can you provide a pretrained model on Objects365?

@wangsun1996

Thank you very much! Furthermore, can you provide a pretrained YOLOv5s6 or YOLOv5s model on Objects365?

@glenn-jocher
Member

glenn-jocher commented Nov 30, 2022 via email

@wangsun1996

OK, thank you very much!

@wangsun1996

I want to train a YOLOv5s6 on the Objects365 dataset to get a pretrained model, but it takes 12 hours to train one epoch. Is there any way to speed up training?

@glenn-jocher
Member

glenn-jocher commented Dec 3, 2022

👋 Hello! Thanks for asking about training speed issues. YOLOv5 🚀 can be trained on CPU (slowest), single-GPU, or multi-GPU (fastest). If you would like to increase your training speed some options are:

  • Increase --batch-size
  • Reduce --img-size
  • Reduce model size, i.e. from YOLOv5x -> YOLOv5l -> YOLOv5m -> YOLOv5s
  • Train with multi-GPU DDP at larger --batch-size
  • Train on cached data: python train.py --cache (RAM caching) or --cache disk (disk caching)
  • Train on faster GPUs, i.e.: P100 -> V100 -> A100
  • Train on free GPU backends with up to 16GB of CUDA memory (Google Colab, Kaggle)
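
As a concrete example, a multi-GPU DDP run with a larger batch size might look like the command below. The device ids, batch size, image size, and weights are placeholders for your setup; note that --cache (RAM caching) for 1.7M Objects365 images needs a very large amount of memory, so --cache disk is usually more realistic:

python -m torch.distributed.run --nproc_per_node 4 train.py --data Objects365.yaml --weights yolov5s6.pt --img 640 --batch-size 128 --device 0,1,2,3 --cache disk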

Good luck 🍀 and let us know if you have any other questions!
