-
Collective Constitutional AI
A platform for aligning a language model with public input. -
Phi-2: A Dataset for Language Model Evaluation
The Phi-2 dataset is a collection of language models used to evaluate the performance of language models. -
MBPP: A Dataset for Language Model Evaluation
The MBPP dataset is a collection of basic programming questions used to evaluate the performance of language models. -
Wikipedia Corpus
The dataset used in the paper is a subset of the Wikipedia corpus, consisting of 7500 English Wikipedia articles belonging to one of the following categories: People, Cities,... -
Direct preference optimization: Your language model is secretly a reward model
The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used a language model to optimize the performance of a reinforcement... -
RedPajama 3B
This dataset has no description