Human Preference Data

Human preference data is collected for reward modeling.

BibTex: