The Stanford Human Preferences (SHP) dataset is sourced from Reddit, drawing on various subreddits that focus on question answering (QA). Preferences are derived from the community's accumulated upvotes and downvotes.
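As a rough illustration of how vote totals can be turned into preference labels, the sketch below pairs up replies to the same post and treats the higher-scored one as preferred. It is a simplified assumption for illustration, not the dataset's actual construction procedure; the `Comment` class, its `score` field, and `preference_pairs` are hypothetical names.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass
class Comment:
    text: str
    score: int  # hypothetical field: net votes (upvotes minus downvotes)


def preference_pairs(comments):
    """Return (preferred_text, rejected_text) pairs based on net vote score."""
    pairs = []
    for a, b in combinations(comments, 2):
        if a.score == b.score:
            continue  # a tie carries no preference signal
        preferred, rejected = (a, b) if a.score > b.score else (b, a)
        pairs.append((preferred.text, rejected.text))
    return pairs


# Made-up replies to a single post, for illustration only.
replies = [
    Comment("Use a cast-iron pan.", 42),
    Comment("Microwave it.", -3),
    Comment("Try the oven.", 42),
]
pairs = preference_pairs(replies)
```

Tied replies are skipped here because equal scores give no basis for ranking one above the other.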
BibTex: