- Core Dative PRIME-LM Corpus
  The dataset used in the paper to study the inverse frequency effect (IFE) in structural priming.
- Context versus Prior Knowledge in Language Models
  The dataset used in the paper to test the persuasion and susceptibility scores of language models.
- Anthropic Helpfulness Base eval
  The Anthropic Helpfulness Base eval dataset, as used in the paper.
- Anthropic Helpfulness Base
  The Anthropic Helpfulness Base train dataset and the Anthropic Helpfulness eval dataset, as used in the paper.
- OpenAssistant dataset
  The dataset used for the experiments in the paper, consisting of 1000 benign instruction examples.
- AdvBench dataset
  A set of 60 harmful instructions drawn from the AdvBench dataset, used for the experiments in the paper.
- CALaMo: a Constructionist Assessment of Language Models
  The CHILDES corpus, used to train a character-based LSTM model that was then evaluated on a set of tasks.
- Grammaticality Judgment Task
  A grammaticality judgment task covering four linguistic phenomena: anaphora, center embedding, comparatives, and negative polarity constructions.
- Finetuned language models are zero-shot learners
  The data used in the paper, which finetunes language models on instruction-formatted tasks to improve zero-shot performance.
- Edit Distance Robust Watermarks for Language Models
  Sequences of tokens generated by a language model, used in the paper to evaluate watermarking.
- SafeDecoding dataset
  The dataset from the SafeDecoding paper, containing 32 harmful queries spanning 16 harmful categories.
- Enhancing chat language models by scaling high-quality instructional conversations
  The high-quality instructional conversation data used in the paper.
- Llama: Open and efficient foundation language models
  The pre-training data used for the LLaMA models, drawn from publicly available corpora.
- Fine-tuning Language Models with Advantage-Induced Policy Alignment
  The Anthropic Helpfulness and Harmlessness dataset and the StackExchange dataset, as used in the paper.
- Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture
  The dataset used in the paper to evaluate LLM safety alignment under mixed-language queries.