Skip to content

quanterk/pgn_summarization_for_baidu_competition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pgn_for_baidu_competition

update

this repository is for one summarization and inference competition using pytorch launched by baidu.

details can be found from: https://aistudio.baidu.com/aistudio/competition/detail/3.

I use the pointer-generator-network and get the score(ROUGE_L) about 37.

It is not a very good result, cause somebody reached about 50 and even 70. In the case I try to pretrain the word vector, but the result is behind the one which does not pretrain,It's so strange.

I tried to use the default parameters from this paper, which uses adagrad to optimize the loss function, but I find the result is not good as using adam.I also use the small learing rate and cliping norm to try to avoid gradient explore.

I encounter the problem that the loss will change to be NAN after thousands steps, which is about 10 epoachs. It's hard for me to find the reason. Maybe I will fix this problem in the future.

In the future, I will try different approaches to solve this competition, and try my best the get higher scores.

In the last, I refer some codes and ideas from Luojie and Jiangxingfa.

If you are interested about this competiton or you have great interest in summarizition and NLP related task, do not hesitate to contact me by [email protected]

how to run :

  1. put the raw data to the data dir.
  2. bash data_prepare.sh
  3. bash train.sh
  4. bash predict.sh

About

this repository is for one summarization launched by baidu.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published