CodeContest

The dataset used in the paper for training and testing the DPO and PPO models.

Cite this as

Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu (2024). Dataset: CodeContest. https://doi.org/10.57702/1vqkr7cn

DOI retrieved: December 16, 2024

Additional Info

Field         Value
Created       December 16, 2024
Last update   December 16, 2024
Author        Shusheng Xu
More Authors  Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu
Homepage      https://huggingface.co/OpenAssistant/oasst-rm-2-pythia-6.9b-epoch-1