Grammatical Error Correction - Groups

Corpora Generation for Grammatical Error Correction

Two approaches for generating large parallel datasets for Grammatical Error Correction (GEC) using publicly available Wikipedia data.

Lang8

This dataset is used for training and evaluating the proposed SynGEC approach.

NLPCC-18

This dataset is used for training and evaluating the proposed SynGEC approach.

MuCGEC

This dataset is used for training and evaluating the proposed SynGEC approach.

4 datasets found