-
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction...
The dataset used in the paper to evaluate the effectiveness of the BEEAR method in mitigating safety backdoors in instruction-tuned LLMs. -
Existing ACQ datasets
A few existing datasets for asking clarification questions -
FLM-HotpotQA
A dataset for pragmatic evaluation of clarifying questions and fact-level masking -
NLPeer dataset
A unified resource for the computational study of peer review. -
ASAP AEG dataset
The ASAP AEG dataset contains approximately 13,000 essays, across 8 essay sets. The dataset has approximately 13,000 essays, across 8 essay sets. -
Racist and sexist hate speech detection: Literature review
A review of studies on the detection of racist and sexist hate speech. -
YOSM: A new Yorùbá Sentiment Corpus for Movie Reviews
A dataset for sentiment analysis of Yoruba movie reviews. -
SemEval-2023 Task 10: Explainable Detection of Online Sexism
The dataset used for the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and... -
ANTHROSCORE: A Computational Linguistic Measure of Anthropomorphism
Anthropomorphism in research papers and downstream news headlines -
Patent corpus
A dataset of over 100,000 patent documents from the Cooperative Patent Classification scheme (CPC) category A61. -
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation
Modern NLP models are often trained over large untrusted datasets, raising the potential for a malicious adversary to compromise model behaviour. -
FIPO Dataset
The dataset used for Free-form Instruction-oriented Prompt Optimization (FIPO) with Preference Dataset and Modular Fine-tuning Schema. -
Identifying machine-paraphrased plagiarism
This dataset is used to identify machine-generated paraphrased plagiarism. -
Dialogue Dataset for Detecting Sentences that Do Not Require Factual Correctn...
A dialogue dataset annotated with fact-check-needed label (DDFC) for detecting sentences that do not require factual correctness judgment -
Scaling laws and fluctuations in the statistics of word frequencies
The dataset consists of three large databases: Google-ngram, English Wikipedia, and a collection of scientific articles. -
Penn Treebank corpus
The Penn Treebank corpus contains 49,208 sentences and over 1 million words, and is used to test the proposed algorithm on a real-world dataset. -
Wall Street Journal (WSJ) dataset
The Wall Street Journal (WSJ) dataset is a standard benchmark dataset for coherence modeling.