- Pick-a-Pic dataset
  The dataset used in the paper is the Pick-a-Pic dataset, which consists of 87,687 pairs of text prompts and images.
- PartiPrompts dataset
  The dataset used in the paper is the PartiPrompts dataset, which consists of 851,293 pairs of text prompts and images.
- Human Preference Data
  Human preference data is collected for reward modeling.
- Baidu Search
  Online human behavior data is collected from Baidu Search. Human preference data is collected for reward modeling.
- Human Preference Data about Helpfulness and Harmlessness
  The dataset is used for human alignment in large language models.
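
Several entries above mention human preference data collected for reward modeling. The list does not say which training objective is used, so the following is only a minimal sketch under an assumption: that each preference record reduces to a pair of scalar reward scores (one for the preferred item, one for the rejected item) and that the reward model is fit with a Bradley-Terry style pairwise loss, a common choice for this kind of data. The function names `pairwise_preference_loss` and `mean_preference_loss` are hypothetical and chosen for illustration.

```python
import math


def pairwise_preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Bradley-Terry negative log-likelihood for one preference pair.

    Computes -log(sigmoid(reward_chosen - reward_rejected)).
    This objective is an assumption; the source does not specify one.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) == log(1 + exp(-m)); branch on the sign of the
    # margin so that exp() is only ever called on a non-positive value.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def mean_preference_loss(pairs: list[tuple[float, float]]) -> float:
    """Average the pairwise loss over (chosen, rejected) reward pairs."""
    return sum(pairwise_preference_loss(c, r) for c, r in pairs) / len(pairs)


# Hypothetical reward scores for three preference pairs.
example_pairs = [(1.2, 0.3), (0.1, 0.4), (2.0, -1.0)]
example_loss = mean_preference_loss(example_pairs)
```

The loss shrinks as the reward model scores the preferred item higher than the rejected one, and equals log 2 when the two scores tie.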