- Grammaticality Judgment Task
  The dataset used in the paper is a grammaticality judgment task featuring four linguistic phenomena: anaphora, center embedding, comparatives, and negative polarity constructions. (A minimal-pair scoring sketch appears after this list.)
- Finetuned language models are zero-shot learners
  This paper introduces FLAN, which finetunes a pretrained language model on a collection of NLP datasets phrased as natural-language instructions and shows that this instruction tuning improves zero-shot performance on unseen tasks.
- Llama: Open and efficient foundation language models
  The paper presents LLaMA, a family of open foundation language models ranging from 7B to 65B parameters, trained on publicly available data; it is a model family rather than a dataset.
- Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture
  This paper evaluates LLM safety alignment using queries that mix multiple languages within a single prompt.
- Self-Supervised Alignment with Mutual Information
  The dataset is used to train a language model to follow behavioral principles without preference labels, demonstrations, or human oversight.
- GPT-2 small
  The data used in this paper are the residual stream activations of GPT-2 small, rather than a conventional dataset. (A sketch of capturing such activations appears after this list.)
- BERT: Pre-training of deep bidirectional transformers for language understanding
  This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding.
- Demonstration ITerated Task Optimization (DITTO)
  The dataset used in the paper is a collection of emails and blog posts from 20 distinct authors, used to study few-shot alignment of large language models.
- Interpreting Learned Feedback Patterns in Large Language Models
  The dataset used in the paper is not explicitly described; the authors use a condensed representation of LLM activations obtained from sparse autoencoders. (A minimal sparse autoencoder sketch appears after this list.)