This repository has been archived by the owner on Aug 28, 2021. It is now read-only.

A guide to prepare data to train a detection model? #16

Closed
samson-wang opened this issue Sep 9, 2016 · 9 comments

Comments

@samson-wang

I notice that the training is somewhat hard-coded to specific versions of the PASCAL VOC and COCO datasets. I'm trying to figure out the data flow and the data format requirements to run training on new data, but I still have some problems loading and preparing data before training (even on VOC or COCO data). Could anyone give some advice to help me build up the process?

Right now I have some images and corresponding bounding-box annotations. If I want to train on this data, I need to generate some proposals (e.g. 1,000 per image) and put the annotations and proposals in the "least required" Torch formats.

I think I will have to write some code to implement:

  • data checker / generator
  • configurations

I hope to be a contributor. ;-)

@szagoruyko
Contributor

@samson-wang the code is generic and can be used for COCO/VOC/ImageNet given JSON annotations in the right format similar to http://mscoco.org/external/ and proposals similar to the ones that we provide, in torch format.
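To make the "JSON annotations in the right format" concrete, here is a minimal Python sketch that writes a COCO-style annotation file. The field names follow the public COCO format; the category name and file names are made up for illustration, and the exact fields this repo's loaders expect should be verified against its dataset code.

```python
import json

# Minimal COCO-style annotation structure (field names follow the public
# COCO format; verify against this repo's loaders before relying on it).
dataset = {
    "images": [
        {"id": 1, "file_name": "img_0001.jpg", "width": 640, "height": 480},
    ],
    "categories": [
        {"id": 1, "name": "widget"},  # hypothetical single category
    ],
    "annotations": [
        # bbox is [x, y, width, height] in pixels, as in COCO
        {"id": 1, "image_id": 1, "category_id": 1,
         "bbox": [78, 89, 69, 40], "area": 69 * 40, "iscrowd": 0},
    ],
}

with open("annotations.json", "w") as f:
    json.dump(dataset, f)
```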

@samson-wang
Author

@szagoruyko Thank you! I'm working on it.
For proposals, I want to use deepmask. Referring to data/proposals/coco/deepmask/val.t7, it should contain `images`, `boxes`, and `scores`. In addition, the scores don't seem to be in any particular order, while in the deepmask project there is only a `getTopProps` function, which generates proposals ordered by their scores. Is the score ordering something I need to take care of?
Another thing: the number of boxes varies per image in the PASCAL selective-search proposals, while it stays the same with deepmask. Any reason?

Thanks!
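If the ordering does matter, re-sorting proposals by score is straightforward. A plain-Python sketch (a hypothetical helper, not part of deepmask or this repo) of putting the highest-scoring box first, as `getTopProps` does:

```python
# Hypothetical helper: reorder proposals so boxes[0] carries the highest
# score, matching the order getTopProps produces.
def sort_proposals(boxes, scores):
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [boxes[i] for i in order], [scores[i] for i in order]

boxes = [[0, 0, 10, 10], [5, 5, 20, 20], [1, 1, 3, 3]]
scores = [0.2, 0.9, 0.5]
sorted_boxes, sorted_scores = sort_proposals(boxes, scores)
# sorted_scores == [0.9, 0.5, 0.2]; sorted_boxes[0] == [5, 5, 20, 20]
```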

@samson-wang
Author

@szagoruyko I have trained on my own data, which has only one category of bounding box.
The trick was to change `opt.num_classes = opt.dataset == 'pascal' and 21 or 81` to `opt.num_classes = 2` (background plus one object class); I'm not sure if that makes sense.
However, when I run the demo with the trained model, something weird happens.
After executing `prob, maxes = detections:max(2)`, I get all 1s for both `prob` and `maxes`, which leads to a select on an empty tensor in the following code: `local idx = maxes:squeeze():gt(1):cmul(prob:gt(config.thr)):nonzero():select(2,1)`.
Could you give some advice? Thank you!
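For readers following along, the failing post-processing step amounts to this: per proposal, take the best-scoring class, then keep the proposal only if that class is not background and the score passes the threshold. A plain-Python sketch of the same logic (the real code operates on torch tensors with 1-based indexing, where class 1 is background; here index 0 plays that role):

```python
# Plain-Python sketch of the demo's post-processing (illustrative, not
# the repo's API): keep proposals whose best class is not background
# and whose best score exceeds thr.
def select_detections(detections, thr):
    keep = []
    for i, row in enumerate(detections):
        best = max(range(len(row)), key=lambda c: row[c])
        if best != 0 and row[best] > thr:  # index 0 = background here
            keep.append(i)
    return keep

# With num_classes = 2 each row is [background_score, object_score].
# If background always wins, keep comes back empty -- the failure above.
dets = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
```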

@samson-wang
Author

I think the problem may be that there are too few positive samples in the training dataset, so at prediction time all proposals are classified as negative.
Training data summary:
2,000 images, 1 object category, 1 ground-truth bbox per image, 1,000 proposals per image generated by deepmask, 100 epochs.

Can I set a higher learning rate for positive samples?

@szagoruyko
Contributor

@samson-wang looks like you need to adjust the fraction of positive examples in batches to balance your data, check here https://github.com/facebookresearch/multipathnet/blob/master/BatchProviderROI.lua#L19
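The idea behind that option can be sketched in plain Python: fix the number of foreground ROIs per batch up front, then fill the rest with background, so the loss sees a constant positive fraction regardless of the raw class ratio. Names here are illustrative, not the repo's API.

```python
import random

# Illustrative sketch of fixed foreground-fraction batch sampling, in the
# spirit of the fg_fraction option linked above (not the repo's code).
def sample_rois(fg_idx, bg_idx, batch_size, fg_fraction):
    n_fg = min(int(round(batch_size * fg_fraction)), len(fg_idx))
    n_bg = batch_size - n_fg
    # sample backgrounds with replacement in case there are few of them
    return random.sample(fg_idx, n_fg) + [random.choice(bg_idx) for _ in range(n_bg)]

batch = sample_rois(fg_idx=[0, 1, 2], bg_idx=list(range(3, 100)),
                    batch_size=8, fg_fraction=0.25)
# 2 of the 8 sampled ROIs are foreground, whatever the overall imbalance
```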

@samson-wang
Author

samson-wang commented Sep 20, 2016

@szagoruyko Thank you for the tip!
I found that running demo.lua returns all negatives after `detector:detect(img:float(), bboxes:float())`, even on an image from the training set. On the other hand, running run_test.lua on the training set works fine. Evaluation on the test set gets 0.33 AP @ 0.75 and 0.77 AP @ 0.5, not as bad as the demo result.
After some debugging, though it's not conclusive, there is something weird.
I use the following code to generate proposals with deepmask for training and testing.

    -- load image
    local img = image.load(img_file)
    local h,w = img:size(2),img:size(3)

    -- forward all scales
    infer:forward(img)

    -- get top proposals (masks plus per-proposal scores)
    local masks, props_scores = infer:getTopProps(.2, h, w)
    local rs = maskApi.encode(masks)
    local bbs = maskApi.toBbox(rs)

    table.insert(images, paths.basename(img_file))
    -- keep only the first score column for each proposal
    table.insert(scores, props_scores:index(2, torch.LongTensor{1}))
    table.insert(boxes, bbs)

The generated bounding boxes look like

  78   89   69   40
   0   22  624  618
...

At evaluation time, after `getROIBoxes` executes, the corresponding boxes look like

89  78   40   69
22   0  618 624
...

The column positions have been switched. I'm not sure if this is the problem; still working on it.

@samson-wang
Author

Update: the boxes are permuted by this line:

    boxes = boxes:size(2) ~= 4 and torch.FloatTensor(0,4) or boxes:index(2,permute_tensor)
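Judging from the numbers above (78 89 69 40 becoming 89 78 40 69), the permutation swaps columns 1↔2 and 3↔4, presumably with `permute_tensor = {2, 1, 4, 3}`. A plain-Python sketch of the same column swap:

```python
# Plain-Python equivalent of the Lua boxes:index(2, permute_tensor) above:
# swap the first two and the last two columns of each box.
def permute_boxes(boxes):
    return [[b[1], b[0], b[3], b[2]] for b in boxes]

# permute_boxes([[78, 89, 69, 40]]) -> [[89, 78, 40, 69]]
```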

@samson-wang
Author

@szagoruyko Stupid mistake. The image transformer was not the same for training and evaluation, so the inferred scores came out wrong.

@teezeit

teezeit commented Oct 5, 2016

Hi Samson, did you get it working? I'm also trying to set up my own training pipeline; what did your workflow end up looking like?
