-
Phi-2: A Dataset for Language Model Evaluation
The Phi-2 dataset is a collection of language models used to evaluate the performance of language models. -
MBPP: A Dataset for Language Model Evaluation
The MBPP dataset is a collection of basic programming questions used to evaluate the performance of language models. -
Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization
Dialogue summarization aims to generate a succinct summary while retaining essential information of the dialogue. -
A general language assistant as a laboratory for alignment
A general language assistant for aligning language models with human users -
Realtoxicityprompts: Evaluating neural toxic degeneration in language models
A dataset for evaluating neural toxic degeneration in language models -
GPT-2 small
The dataset used in this paper is a large language model, GPT-2 small, and its residual stream activations. -
Direct preference optimization: Your language model is secretly a reward model
The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used a language model to optimize the performance of a reinforcement... -
RedPajama 3B
This dataset has no description
-
GPT-4 Dataset
The GPT-4 dataset used for fine-tuning the Qwen model.