-
Learning to summarize with human feedback
The paper presents a study on the impact of synthetic data on large language models (LLMs) and proposes a method to steer LLMs towards desirable non-differentiable attributes. -
SummEval and Topical-Chat
This paper uses SummEval and Topical-Chat datasets for evaluating the quality of summaries and responses.