Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 4 datasets found Groups: Natural Language Processing Organizations: No Organization Filter Results Corpora Generation for Grammatical Error Correction Two approaches for generating large parallel datasets for Grammatical Error Correction (GEC) using publicly available Wikipedia data. Dataset JSON Lang8 This dataset is used for training and evaluating the proposed SynGEC approach. Dataset JSON NLPCC-18 This dataset is used for training and evaluating the proposed SynGEC approach. Dataset JSON MuCGEC This dataset is used for training and evaluating the proposed SynGEC approach. Dataset JSON