Reward Model Ensembles

The authors used three datasets: TL;DR, HELPFULNESS, and XSUM/NLI.

BibTeX:

@dataset{Jacob_Eisenstein_and_Chirag_Nagpal_and_Alekh_Agarwal_and_Ahmad_Beirami_and_Alex_D’Amour_and_DJ_Dvijotham_and_Adam_Fisch_and_Katherine_Heller_and_Stephen_Pfohl_and_Deepak_Ramachandran_and_Peter_Shaw_and_Jonathan_Berant_2024,
  abstract = {The authors used three datasets: TL;DR, HELPFULNESS, and XSUM/NLI.},
  author = {Jacob Eisenstein and Chirag Nagpal and Alekh Agarwal and Ahmad Beirami and Alex D’Amour and DJ Dvijotham and Adam Fisch and Katherine Heller and Stephen Pfohl and Deepak Ramachandran and Peter Shaw and Jonathan Berant},
  doi = {10.57702/axhbsmh3},
  institution = {No Organization},
  keyword = {ensemble methods, machine learning, natural language processing, reward models},
  month = {dec},
  publisher = {TIB},
  title = {Reward Model Ensembles},
  url = {https://service.tib.eu/ldmservice/dataset/reward-model-ensembles},
  year = {2024}
}