Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hyperparameter & pre-train issues #1

Open
muye5 opened this issue Feb 28, 2018 · 3 comments
Open

hyperparameter & pre-train issues #1

muye5 opened this issue Feb 28, 2018 · 3 comments

Comments

@muye5
Copy link

muye5 commented Feb 28, 2018

  • Does this detection model train totally from scratch with COCO-Dataset or VOC?
    Should we do pre-train with ImageNet?

  • GRID_W = 18 & GRID_H = 10
    How to set these two hyper-parameters ?
    (e.g. input image size: 32 * (13, 13) = (416, 416) as described in paper ? ?)

  • The anchor file missed

Thanks!

@muye5 muye5 changed the title Pre-train or not? hyperparameter & pre-train issues Feb 28, 2018
@wojciechmo
Copy link
Owner

  1. If you would like to train with COCO or VOC you should download and prepare data by yourself. But in such case it is easier to use trained model - as point 2 in description states: Conversion from Darknet to Tensorflow framework done with darkflow project. Point of this project is to train with your own data which are summarized in csv file.

  2. GRID_W and GRID_H are spatial dimensions of final 3D tensor, where each cell of GRID_WxGRID_H makes a vector with predicted rois and classes. For GRID_W = 18 and GRID_H = 10 input training images should be of size 572x300.

  3. As it's written in description you should prepare two files before training - one is anchor file. You can specify anchors dimensions as you wish, or you can use some clustering algorithm as it is done in paper.

Best regards

@colwind
Copy link

colwind commented Apr 3, 2018

could i use the tfrecord file that generated in other repository?

@Tveek
Copy link

Tveek commented Apr 13, 2018

@WojciechMormul ,how to calculate 572x300 by GRID_WxGRID_H ? but,in make_tfrecord.py file,in_w, in_h = grid_w*net_scale, grid_h*net_scale ,so strange?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants