Language Models - Groups

AANN construction dataset

The AANN construction dataset

Dataset
JSON

CoLA corpus and AANN construction dataset

The CoLA corpus of acceptability judgments and the AANN construction dataset

Dataset
JSON

ETHICS benchmark

The ETHICS benchmark is a dataset for evaluating the ethics of language models.

Dataset
JSON

HumanEval, MBPP, APPS

The dataset used in the paper is a code generation benchmark, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an...

Dataset
JSON

Comprehensive Assessment of Jailbreak Attacks against LLMs

The Comprehensive Assessment of Jailbreak Attacks against LLMs dataset is used to evaluate the effectiveness of jailbreak attacks on language models.

Dataset
JSON

Laion-5b

A large-scale dataset of text and images for training next-generation language models.

Dataset
JSON

Self-Supervised Alignment with Mutual Information

The dataset is used for training a language model to follow behavioral principles without the use of preference labels, demonstrations, or human oversight.

Dataset
JSON

GPT-2 small

The dataset used in this paper is a large language model, GPT-2 small, and its residual stream activations.

Dataset
JSON

GPT-4

The dataset used in this paper is a large language model, GPT-4, and its residual stream activations.

Dataset
JSON

BERT: Pre-training of deep bidirectional transformers for language understanding

This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding.

Dataset
JSON

GPT-4 Dataset

The GPT-4 dataset used for fine-tuning the Qwen model.

Dataset
JSON

Demonstration ITerated Task Optimization (DITTO)

The dataset used in the paper is a collection of email and blog posts from 20 distinct authors, with a focus on few-shot alignment of large language models.

Dataset
JSON

Towards the Scalable Evaluation of Cooperativeness in Language Models

The dataset is used to evaluate the cooperative tendencies of language models. It consists of scenarios with particular game-theoretic structures, generated through both...

Dataset
JSON

SHP dataset

The SHP dataset is used to evaluate the performance of the proposed Compositional Preference Models (CPMs).

Dataset
JSON

HH-RLHF dataset

The HH-RLHF dataset is used to evaluate the performance of the proposed Compositional Preference Models (CPMs).

Dataset
JSON

Training Language Models to Perform Tasks

A dataset for training language models to perform tasks such as question answering and text classification.

Dataset
JSON

Interpreting Learned Feedback Patterns in Large Language Models

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a condensed representation of LLM activations obtained from sparse...

Dataset
JSON

37 datasets found