Cite this as

Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei (2024). Dataset: Learning to summarize with human feedback. Resource: Original Metadata. https://doi.org/10.57702/bakxgny5

DOI retrieved: December 16, 2024

Additional Information

Field Value
Created December 16, 2024
Last updated December 16, 2024
Format JSON