Okapi

The dataset is used for instruction-tuning of LLMs in multiple languages using reinforcement learning from human feedback.

Data and Resources

Cite this as

Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen (2024). Dataset: Okapi. https://doi.org/10.57702/veefae2j

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Author Viet Dac Lai
More Authors
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
Homepage https://github.com/nlp-uoregon/Okapi