Direct Preference Optimization With Unobserved Preference Heterogeneity

The dataset used in the paper is a binary preference dataset from heterogeneous annotators.

Data and Resources

Cite this as

Keertana Chidambaram, Karthik Vinay Seetharaman, Vasilis Syrgkanis (2024). Dataset: Direct Preference Optimization With Unobserved Preference Heterogeneity. https://doi.org/10.57702/h23d5f8z

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2405.15065
Author Keertana Chidambaram
More Authors
Karthik Vinay Seetharaman
Vasilis Syrgkanis