You're currently viewing an old version of this dataset. To see the current version, click here.

Corpus of Linguistic Acceptability (CoLA)

The Corpus of Linguistic Acceptability (CoLA) is a set of 10,657 English sentences labeled as grammatical or ungrammatical from published linguistics literature.

Data and Resources

Cite this as

Alex Warstadt, Amanpreet Singh, Samuel R. Bowman (2024). Dataset: Corpus of Linguistic Acceptability (CoLA). https://doi.org/10.57702/5scywcng

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.1805.12471
Author Alex Warstadt
More Authors
Amanpreet Singh
Samuel R. Bowman
Homepage https://nyu-mll.github.io/CoLA/