Llama: Open and efficient foundation language models
This paper introduces LLaMA, a family of open and efficient foundation language models trained on publicly available data. -
BERT: Pre-training of deep bidirectional transformers for language understanding
This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding. -
Training Language Models to Perform Tasks
This work presents a dataset for training language models to perform tasks such as question answering and text classification. -
Interpreting Learned Feedback Patterns in Large Language Models
The paper does not explicitly describe its dataset, but it notes that the authors used a condensed representation of LLM activations obtained from sparse...