Make sure python Inception and scala Inception examples have the same result #2515

jason-dai · 2018-05-02T07:08:41Z

Especially for the training results.

jason-dai · 2018-05-23T02:12:01Z

Any update?

yiheng · 2018-05-23T03:47:30Z

@jenniew Can you provide the example code to reproduce this issue?

jason-dai · 2018-05-23T05:14:01Z

https://github.com/intel-analytics/BigDL/tree/master/pyspark/bigdl/models/inception

yiheng · 2018-05-23T14:10:05Z

@wzhongyuan please take a look at this issue

wzhongyuan · 2018-05-25T01:54:15Z

The python code and related parameters are same as scala ones, finally found the difference caused by preprocessing :) , Python uses OpenCV while Scala still uses legacy ones

 DataSet.SeqFileFolder.files(path, sc, classNumber).transform(
      MTLabeledBGRImgToBatch[ByteRecord](
        width = imageSize,
        height = imageSize,
        batchSize = batchSize,
        transformer = (BytesToBGRImg() -> BGRImgCropper(imageSize, imageSize)
          -> DatasetHFlip(0.5) -> BGRImgNormalizer(0.485, 0.456, 0.406, 0.229, 0.224, 0.225))
      ))

    train_transformer = Pipeline([PixelBytesToMat(),
                                  RandomCrop(image_size, image_size),
                                  RandomTransformer(HFlip(), 0.5),
                                  ChannelNormalize(0.485, 0.456, 0.406, 0.229, 0.224, 0.225),
                                  MatToTensor(to_rgb=True),
                                  ImageFrameToSample(input_keys=["imageTensor"], target_keys=["label"])
                                  ])

Will raise a PR to fix it

wzhongyuan · 2018-05-25T02:10:47Z

Need to implement a replacement of MTLabeledBGRImgToBatch compatible with OpenCV and then apply new Opencv transformers

jason-dai · 2018-05-25T02:30:28Z

I think the main issue is that the Top5 accuracy results of these two examples are different (the Scala version has higher accuracy). @jenniew can you provide the testing results?

jenniew · 2018-05-25T04:37:21Z

scala: top5: 88.2 top1: 68.2
python: top5: 87.4 top1: 67.1

wzhongyuan · 2018-05-25T05:03:47Z

Seems we cannot refactor scala code for now as python has worse performance... holding the change for now.
I will run a full training to re-produce the number and will try to find if any referenced caffe parameters and run accordingly with the same config to see how it works if caffe has better accuracy

wzhongyuan · 2018-05-25T05:03:59Z

@jenniew thanks for providing the data

jenniew · 2018-05-25T05:07:21Z

Python ChannelNormalize has an issue: #2544
Maybe it's related with the accuracy difference.

jenniew · 2018-05-25T05:09:39Z

caffe also uses opencv to transform, so I think using opencv would not decrease accuracy.

wzhongyuan · 2018-06-04T01:55:53Z

@jenniew I think that's the issue causing the discrepancy. The training result after the fix is much better now. I will put the fix in, quite simple change.

wzhongyuan · 2018-06-04T01:56:29Z

Will refactor the scala transformers as well

jason-dai · 2018-07-24T05:26:35Z

I believe caffe also use opencv based preprocessing? maybe we can just compared to caffe.

wzhongyuan · 2018-07-24T06:52:08Z

We mirrored intel caffe (please refer to #2555), but did not get compatible result, the final accuracy is below than what we've done with legacy transformers.

wzhongyuan · 2018-12-24T07:04:01Z

Closing this issue as it's fixed

jason-dai · 2019-04-23T03:56:49Z

Reopen - cannot reproduce the results

wzhongyuan · 2019-04-24T08:50:11Z

Probably caused by regression from commit ce7e573 , this commit is made after the convergence testing. we'll check and get back.

jason-dai assigned yiheng and qiuxin2012 May 2, 2018

yiheng assigned wzhongyuan and unassigned yiheng and qiuxin2012 May 23, 2018

wzhongyuan closed this as completed Dec 24, 2018

jason-dai reopened this Apr 23, 2019

jason-dai added the high priority label Apr 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make sure python Inception and scala Inception examples have the same result #2515

Make sure python Inception and scala Inception examples have the same result #2515

jason-dai commented May 2, 2018

jason-dai commented May 23, 2018

yiheng commented May 23, 2018

jason-dai commented May 23, 2018

yiheng commented May 23, 2018

wzhongyuan commented May 25, 2018

wzhongyuan commented May 25, 2018

jason-dai commented May 25, 2018

jenniew commented May 25, 2018

wzhongyuan commented May 25, 2018

wzhongyuan commented May 25, 2018

jenniew commented May 25, 2018

jenniew commented May 25, 2018

wzhongyuan commented Jun 4, 2018

wzhongyuan commented Jun 4, 2018

jason-dai commented Jul 24, 2018

wzhongyuan commented Jul 24, 2018

wzhongyuan commented Dec 24, 2018

jason-dai commented Apr 23, 2019

wzhongyuan commented Apr 24, 2019

Make sure python Inception and scala Inception examples have the same result #2515

Make sure python Inception and scala Inception examples have the same result #2515

Comments

jason-dai commented May 2, 2018

jason-dai commented May 23, 2018

yiheng commented May 23, 2018

jason-dai commented May 23, 2018

yiheng commented May 23, 2018

wzhongyuan commented May 25, 2018

wzhongyuan commented May 25, 2018

jason-dai commented May 25, 2018

jenniew commented May 25, 2018

wzhongyuan commented May 25, 2018

wzhongyuan commented May 25, 2018

jenniew commented May 25, 2018

jenniew commented May 25, 2018

wzhongyuan commented Jun 4, 2018

wzhongyuan commented Jun 4, 2018

jason-dai commented Jul 24, 2018

wzhongyuan commented Jul 24, 2018

wzhongyuan commented Dec 24, 2018

jason-dai commented Apr 23, 2019

wzhongyuan commented Apr 24, 2019