-
Core Dative PRIME-LM Corpus
The dataset used in the paper to study the inverse frequency effect (IFE) in structural priming. -
Context versus Prior Knowledge in Language Models
The dataset used in the paper to measure the persuasion and susceptibility scores of language models. -
Anthropic Helpfulness Base eval
The dataset used in the paper is the Anthropic Helpfulness Base eval dataset. -
Anthropic Helpfulness Base
The dataset used in the paper is the Anthropic Helpfulness Base train dataset and the Anthropic Helpfulness eval dataset. -
OpenAssistant dataset
The dataset used for the experiments in the paper, consisting of 1000 benign instruction examples. -
AdvBench dataset
The dataset used for the experiments in the paper, consisting of 60 harmful instructions from the AdvBench dataset. -
Grammaticality Judgment Task
The dataset used in the paper is a grammaticality judgment task featuring four linguistic phenomena: anaphora, center embedding, comparatives, and negative polarity constructions. -
Finetuned language models are zero-shot learners
Finetuned language models are zero-shot learners -
SafeDecoding dataset
The dataset used in the SafeDecoding paper, which contains 32 harmful queries spanning 16 harmful categories. -
ETHICS benchmark
The ETHICS benchmark is a dataset for evaluating the ethics of language models. -
HumanEval, MBPP, APPS
The datasets used in the paper are code generation benchmarks, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an... -
Comprehensive Assessment of Jailbreak Attacks against LLMs
The Comprehensive Assessment of Jailbreak Attacks against LLMs dataset is used to evaluate the effectiveness of jailbreak attacks on language models. -
Demonstration ITerated Task Optimization (DITTO)
The dataset used in the paper is a collection of emails and blog posts from 20 distinct authors, with a focus on few-shot alignment of large language models. -
Towards the Scalable Evaluation of Cooperativeness in Language Models
The dataset is used to evaluate the cooperative tendencies of language models. It consists of scenarios with particular game-theoretic structures, generated through both...