
the retrieval loss doesn't converge well #11

Open
qq283215389 opened this issue Feb 26, 2019 · 11 comments

@qq283215389

Hello, Luo,
When I pretrain the VSEFCModel, the vse_loss doesn't converge well; it stays around 51.2. Are there mistakes in my experiments? What was your vse_loss when you pretrained the VSEFCModel?

@ruotianluo
Owner

That's very common in the first several epochs. Try training it a little longer, or just restart the training.
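
For context, the vse_loss being discussed is a max-margin contrastive loss between image and caption embeddings. Here is a minimal sketch of the standard formulation (PyTorch; the margin value and hard-negative mining are assumptions in the style of VSE++, not necessarily this repo's exact implementation):

```python
import torch

def vse_loss(im, s, margin=0.2, max_violation=True):
    """Max-margin contrastive loss between image and sentence embeddings.

    im, s: L2-normalized embeddings of matching pairs, shape (batch, dim);
    row i of `im` matches row i of `s`. With max_violation=True this is the
    VSE++ hard-negative variant; the repo's actual loss may differ.
    """
    scores = im @ s.t()                   # (batch, batch) cosine similarities
    pos = scores.diag().view(-1, 1)       # similarity of the true pairs

    cost_s = (margin + scores - pos).clamp(min=0)        # image -> caption
    cost_im = (margin + scores - pos.t()).clamp(min=0)   # caption -> image

    # Mask out the positives on the diagonal so they don't contribute.
    mask = torch.eye(scores.size(0), dtype=torch.bool, device=scores.device)
    cost_s = cost_s.masked_fill(mask, 0)
    cost_im = cost_im.masked_fill(mask, 0)

    if max_violation:                     # keep only the hardest negative
        cost_s = cost_s.max(dim=1)[0]
        cost_im = cost_im.max(dim=0)[0]

    return cost_s.sum() + cost_im.sum()
```

Incidentally, under the assumed settings of batch size 128 and margin 0.2, fully collapsed embeddings (every image-caption pair scoring the same) would report exactly 2 × 128 × 0.2 = 51.2 with the hard-negative variant, so a plateau at precisely that value can indicate early embedding collapse rather than slow convergence, which is consistent with restarting the training helping.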

@qq283215389
Author

OK, thanks a lot. What about the other VSE model (VSEAttModel) and the "pair loss", whose results aren't shown in your paper "Discriminability Objective for Training Descriptive Captions" (CVPR 2018)?

@ruotianluo
Owner

Pair loss performs worse, and VSEAttModel gives worse results too.

@qq283215389
Author

Thanks! If the retrieval model performs better (like the one in the paper "Stacked Cross Attention for Image-Text Matching"), can we get a better result for the captioning model?

@ruotianluo
Owner

I think it's very likely.

@qq283215389
Author

Hello, Luo,
Here is my result from pre-training the retrieval model after running "run_fc_con.sh"; there is still a gap from the retrieval result presented in your paper.
Result (R@1, R@5, R@10, median rank, mean rank):
Average i2t Recall: 53.9
Image to text: 29.9, 59.2, 72.6, 4.0, 19.6
Average t2i Recall: 42.3
Text to image: 20.6, 46.5, 59.8, 7.0, 40.8
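
For anyone comparing numbers: here is a sketch of how these metrics fall out of an image-caption similarity matrix. It is simplified to one caption per image (the actual COCO protocol uses five captions per image) and is not this repo's evaluation code:

```python
import numpy as np

def i2t_metrics(sims):
    """R@1/R@5/R@10, median and mean rank from an image-by-caption
    similarity matrix, assuming caption i is the match for image i."""
    n = sims.shape[0]
    ranks = np.empty(n)
    for i in range(n):
        order = np.argsort(sims[i])[::-1]        # caption indices, best first
        ranks[i] = np.where(order == i)[0][0]    # 0-based rank of the match
    r1, r5, r10 = (100.0 * np.mean(ranks < k) for k in (1, 5, 10))
    return r1, r5, r10, np.median(ranks) + 1, ranks.mean() + 1
```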

@ruotianluo
Owner

Did you download my pretrained model? Does it perform better, i.e. match what's reported in the paper?
https://drive.google.com/open?id=1oQ_O-O2KoSQv1xdBPKaIOGt-VW0gS-42
These are my training curves, to give you a hint.

@qq283215389
Author

I might have found the problem: I used a size of 7x7 for the COCO fc features. I think you used 14x14?

@ruotianluo
Owner

The fc feature doesn't have spatial dimensions; it's a vector.
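
To make the distinction concrete, here is an illustrative sketch (not this repo's preprocessing script) of how fc and att features are typically extracted from a ResNet-101. The 7x7 vs. 14x14 sizes apply only to the spatial att features:

```python
import torch
import torch.nn as nn
import torchvision.models as models

# Keep everything up to (and including) the last conv block; drop avgpool + fc.
resnet = models.resnet101(weights=models.ResNet101_Weights.DEFAULT)
backbone = nn.Sequential(*list(resnet.children())[:-2])
backbone.eval()

img = torch.randn(1, 3, 224, 224)          # one preprocessed image
with torch.no_grad():
    att_feat = backbone(img)               # (1, 2048, 7, 7): spatial "att" feature map
    fc_feat = att_feat.mean(dim=(2, 3))    # (1, 2048): pooled "fc" feature, no spatial dims

# A 448x448 input would give a 14x14 att map; the fc feature is 2048-d either way.
```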

@qq283215389
Author

I found that other papers use Karpathy's split for COCO, while your paper uses Rama's split. Are their test data the same? How can you compare your result with the result in self-critical?

@ruotianluo
Owner

The splits are different. The self-critical number is from my own implementation run on Rama's split. We use Rama's split because we need to compare our results to Rama's.
