-
Quantum Process Logic - Take IIb
The dataset consists of a graphical language for describing quantum phenomena and meaning-related linguistic phenomena. -
Quantum Process Logic - Take IIa
The dataset consists of a graphical language for describing quantum phenomena and meaning-related linguistic phenomena. -
Quantum Process Logic
The dataset consists of a graphical language for describing quantum phenomena and meaning-related linguistic phenomena. -
PFN Picking Instructions for Commodities Dataset (PFN-PIC)
A new challenging dataset for real-world object picking tasks, consisting of 1,180 images with bounding boxes and text instructions annotated. -
ECC Analyzer
The ECC Analyzer dataset is a collection of earnings conference calls (ECCs) with their corresponding transcripts and audio recordings. -
Furiously Can Colourless Green Ideas Sleep?
The dataset used in the paper to study the influence of context on sentence acceptability. -
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction...
The dataset used in the paper to evaluate the effectiveness of the BEEAR method in mitigating safety backdoors in instruction-tuned LLMs. -
Existing ACQ datasets
A few existing datasets for asking clarification questions -
FLM-HotpotQA
A dataset for pragmatic evaluation of clarifying questions and fact-level masking -
NLPeer dataset
A unified resource for the computational study of peer review. -
Racist and sexist hate speech detection: Literature review
A review of studies on the detection of racist and sexist hate speech. -
YOSM: A new Yorùbá Sentiment Corpus for Movie Reviews
A dataset for sentiment analysis of Yoruba movie reviews. -
SemEval-2023 Task 10: Explainable Detection of Online Sexism
The dataset used for the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and... -
ANTHROSCORE: A Computational Linguistic Measure of Anthropomorphism
Anthropomorphism in research papers and downstream news headlines -
Patent corpus
A dataset of over 100,000 patent documents from the Cooperative Patent Classification scheme (CPC) category A61.