Code Generation - Groups

HumanEval

The dataset used in the paper is the HumanEval dataset, which is used to evaluate the performance of language models.
- Dataset
- JSON
HumanEval, MBPP, APPS

The dataset used in the paper is a code generation benchmark, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an...
- Dataset
- JSON

Before browse our site, please accept our cookies policy

2 datasets found