Data for NLPCC2018 Shared Task--Grammatical Error Correction (GEC).
The segmentations of the training data and the gold sentences are implemented with the PKUNLP tool (http://www.icst.pku.edu.cn/lcwm/pkunlp/downloads/libgrass-ui.tar.gz).
The evaluation tool is m2scorer (https://github.com/nusnlp/m2scorer).
If you use the dataset, please cite this paper:
Yuanyuan Zhao, Nan Jiang, Weiwei Sun and Xiaojun Wan. Overview of the NLPCC 2018 Shared Task: Grammatical Error Correction. Natural Language Processing and Chinese Computing (NLPCC 2018).