Okapi

doi:10.57702/veefae2j

Okapi is a dataset for instruction-tuning large language models (LLMs) in multiple languages with reinforcement learning from human feedback (RLHF).

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen (2024). Dataset: Okapi. https://doi.org/10.57702/veefae2j

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Author	Viet Dac Lai
More Authors	Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen
Homepage	https://github.com/nlp-uoregon/Okapi