Language Models - Groups

Core Dative PRIME-LM Corpus

The dataset used in the paper to study the inverse frequency effect (IFE) in structural priming.

Dataset
JSON

Context versus Prior Knowledge in Language Models

The dataset used in the paper to test the persuasion and susceptibility scores of language models.

Dataset
JSON

Anthropic Helpfulness Base eval

The dataset used in the paper is the Anthropic Helpfulness Base eval dataset.

Dataset
JSON

Anthropic Helpfulness Base

The dataset used in the paper is the Anthropic Helpfulness Base train dataset and the Anthropic Helpfulness eval dataset.

Dataset
JSON

OpenAssistant dataset

The dataset used for the experiments in the paper, consisting of 1000 benign instruction examples.

Dataset
JSON

AdvBench dataset

The dataset used for the experiments in the paper, consisting of 60 harmful instructions from the AdvBench dataset.

Dataset
JSON

LLaMA

The dataset used in the paper is LLaMA, a large language model.

Dataset
JSON

Grammaticality Judgment Task

The dataset used in the paper is a grammaticality judgment task featuring four linguistic phenomena: anaphora, center embedding, comparatives, and negative polarity constructions.

Dataset
JSON

Finetuned language models are zero-shot learners

Dataset
JSON

SafeDecoding dataset

The dataset used in the SafeDecoding paper, which contains 32 harmful queries spanning 16 harmful categories.

Dataset
JSON

KIT

Human motion modeling is a critical component of animating virtual characters to imitate vivid and rich human movements, which has been a vital topic for many applications, such...

Dataset
JSON

HH-RLHF

The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback.

Dataset
JSON

ETHICS benchmark

The ETHICS benchmark is a dataset for evaluating the ethics of language models.

Dataset
JSON

HumanEval, MBPP, APPS

The dataset used in the paper is a code generation benchmark, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an...

Dataset
JSON

Comprehensive Assessment of Jailbreak Attacks against LLMs

The Comprehensive Assessment of Jailbreak Attacks against LLMs dataset is used to evaluate the effectiveness of jailbreak attacks on language models.

Dataset
JSON

GPT-4

The dataset used in this paper is a large language model, GPT-4, and its residual stream activations.

Dataset
JSON

Demonstration ITerated Task Optimization (DITTO)

The dataset used in the paper is a collection of email and blog posts from 20 distinct authors, with a focus on few-shot alignment of large language models.

Dataset
JSON

Towards the Scalable Evaluation of Cooperativeness in Language Models

The dataset is used to evaluate the cooperative tendencies of language models. It consists of scenarios with particular game-theoretic structures, generated through both...

Dataset
JSON

18 datasets found