-
Universal Dependencies (UD) treebanks
The dataset used in the paper is not explicitly mentioned, but it is mentioned that the authors used the Universal Dependencies (UD) treebanks. -
Data Management Operations and Recipes
A dataset management operations and recipes for NLP data production -
A Workflow Manager for Complex NLP and Content Curation Pipelines
A workflow manager for the flexible creation and customisation of NLP processing pipelines. -
MatSci-NLP
The MatSci-NLP dataset is a collection of materials science text for NLP tasks. -
Towards Dark Jargon Interpretation in Underground Forums
Dark jargons are benign-looking words that have hidden, sinister meanings and are used by participants of underground forums for illicit behavior. -
ACL Anthology
The ACL Anthology dataset contains papers on natural language processing, including citation patterns, authorship, and language use over time. -
Cross-lingual semantic representation for NLP with UCCA
The UCCA dataset is used to test the annotation scheme in cross-lingual semantic representation for NLP. -
Multilingual Misinformation & Its Evolution
The dataset used in this study is a combination of data from Google Fact-Check explorer and data directly crawled from the websites of verified signatories of the International... -
GLUE benchmark
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used three downstream tasks from the GLUE benchmark: Stanford Sentiment Treebank... -
Social Chemistry 101
Social Chemistry 101 dataset is a collection of social norms and rules of thumb (ROTs) for evaluating people's behavior in everyday social situations. -
NLPositionality
NLPositionality is a framework for characterizing design biases and quantifying the positionality of NLP datasets and models.