HumanEval

The dataset used in the paper is the HumanEval dataset, which is used to evaluate the performance of language models.

BibTex: