update
this repository is for one summarization and inference competition using pytorch launched by baidu.
details can be found from: https://aistudio.baidu.com/aistudio/competition/detail/3.
I use the pointer-generator-network and get the score(ROUGE_L) about 37.
It is not a very good result, cause somebody reached about 50 and even 70. In the case I try to pretrain the word vector, but the result is behind the one which does not pretrain,It's so strange.
I tried to use the default parameters from this paper, which uses adagrad to optimize the loss function, but I find the result is not good as using adam.I also use the small learing rate and cliping norm to try to avoid gradient explore.
I encounter the problem that the loss will change to be NAN after thousands steps, which is about 10 epoachs. It's hard for me to find the reason. Maybe I will fix this problem in the future.
In the future, I will try different approaches to solve this competition, and try my best the get higher scores.
In the last, I refer some codes and ideas from Luojie and Jiangxingfa.
If you are interested about this competiton or you have great interest in summarizition and NLP related task, do not hesitate to contact me by [email protected]
how to run :
- put the raw data to the data dir.
- bash data_prepare.sh
- bash train.sh
- bash predict.sh