Anthropic's HH-RLHF and OpenAI's summarization datasets

The datasets used in the paper are Anthropic's HH-RLHF and OpenAI's summarization datasets.

BibTeX: