UltraRM-13B

The UltraRM-13B dataset is a collection of human feedback for language model training.

BibTex: