Cite this as

Alex J. Chan, Hao Sun, Samuel Holt, Mihaela van der Schaar (2024). Dataset: Dense Reward for Free in RLHF. Resource: Original Metadata. https://doi.org/10.57702/netp9p5i

DOI retrieved: December 16, 2024

Additional Information

Field Value
Created December 16, 2024
Last updated December 16, 2024
Format JSON