2 datasets found

Groups: Language Models Formats: JSON

Filter Results
  • HumanEval

    The dataset used in the paper is the HumanEval dataset, which is used to evaluate the performance of language models.
  • HumanEval, MBPP, APPS

    The dataset used in the paper is a code generation benchmark, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an...