- WizardCoder
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
- PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models
Instruction-finetuned code language models have shown promise in various programming tasks. They are trained, using a language modeling objective, on natural language...
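The idea in the title, deriving preference pairs from test-case outcomes, can be sketched as follows. This is an illustrative reconstruction, not PLUM's actual pipeline, and all function names are hypothetical:

```python
# Illustrative sketch only (not PLUM's actual pipeline): sampled candidate
# solutions are executed against test cases, and pass/fail outcomes become
# chosen/rejected pairs for preference learning. All names are hypothetical.
from itertools import product

def passes_tests(candidate_code: str, tests: list[str]) -> bool:
    """Run a candidate solution, then its assert-style tests, in a fresh namespace."""
    namespace: dict = {}
    try:
        exec(candidate_code, namespace)   # define the candidate solution
        for test in tests:
            exec(test, namespace)         # a failing assert raises AssertionError
        return True
    except Exception:
        return False

def build_preference_pairs(prompt: str, candidates: list[str], tests: list[str]) -> list[dict]:
    """Pair every passing candidate (chosen) with every failing one (rejected)."""
    graded = [(c, passes_tests(c, tests)) for c in candidates]
    passing = [c for c, ok in graded if ok]
    failing = [c for c, ok in graded if not ok]
    return [{"prompt": prompt, "chosen": good, "rejected": bad}
            for good, bad in product(passing, failing)]
```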
- MBPP dataset
MBPP (Mostly Basic Python Problems) contains crowd-sourced Python programming problems, each with a natural language description, a reference solution, and test cases for checking functional correctness.
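A minimal sketch of loading MBPP from the Hugging Face Hub and checking a reference solution against its test cases; the split and field names ("text", "code", "test_list") reflect the Hub copy and should be treated as assumptions:

```python
# Minimal sketch, assuming the Hugging Face Hub copy of MBPP with its usual
# fields ("text", "code", "test_list"); field and split names may differ.
from datasets import load_dataset

mbpp = load_dataset("mbpp", split="test")
example = mbpp[0]
print(example["text"])            # natural language task description

# Execute the reference solution, then its asserts, in one namespace.
namespace: dict = {}
exec(example["code"], namespace)
for test in example["test_list"]:
    exec(test, namespace)         # each entry is an `assert ...` statement
print("reference solution passes all test cases")
```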
- APPS: A Dataset for Code Generation Evaluation
The APPS dataset is a benchmark of 10,000 programming problems collected from coding websites, ranging from introductory exercises to competition-level tasks, used to evaluate code generation models.
- Evaluating large language models trained on code
This paper introduces OpenAI Codex and the HumanEval benchmark, and evaluates the model's ability to generate functionally correct Python code from docstrings.
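The paper also defines the pass@k metric together with an unbiased estimator; a short Python rendering of that estimator:

```python
# Unbiased pass@k estimator from "Evaluating Large Language Models Trained
# on Code": given n samples per problem of which c pass, estimate
# pass@k = 1 - C(n - c, k) / C(n, k), computed as a numerically stable product.
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Probability that at least one of k samples drawn from n passes."""
    if n - c < k:
        return 1.0
    return 1.0 - float(np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))

# Example usage: 200 samples per problem, 37 correct, estimate pass@10.
print(round(pass_at_k(n=200, c=37, k=10), 4))
```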
- Execution-based Evaluation for NL2Bash
A set of 50 prompts for execution-based evaluation of the NL2Bash task.
- CodeUltraFeedback
CodeUltraFeedback is a preference dataset of 10,000 complex instructions used to tune and align LLMs to coding preferences through AI feedback.
- HumanEval, MBPP, APPS
Together these code generation benchmarks cover HumanEval (164 function declarations with accompanying documentation and unit tests), the MBPP test split (500 crowd-sourced Python problems), and APPS (programming problems spanning introductory to competition level).
- Large language models of code fail at completing code with potential bugs
Code generation models fail at completing code when the given context contains potential bugs.
- SLTrans: A Source Code to LLVM IR Translation Pairs Dataset
SLTrans is a parallel dataset consisting of nearly 4M pairs of self-contained source code and corresponding LLVM IR.
- Evol-Instruct-Code-80k
Evol-Instruct-Code-80k is a dataset of roughly 80,000 evolved code instruction-response pairs, generated with the Evol-Instruct method, for instruction-tuning code language models.
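A rough sketch of how Evol-Instruct-style evolution produces such data; the prompt template below is illustrative rather than the one used to build this dataset, and `ask_llm` is a placeholder for any chat-model call:

```python
# Illustrative Evol-Instruct-style evolution: a seed coding instruction is
# rewritten into a harder variant by an LLM. The template is an assumption,
# not the dataset's actual prompt; `ask_llm` stands in for any model call.
from typing import Callable

EVOLVE_TEMPLATE = (
    "Rewrite the following programming task so that it is more complex, "
    "for example by adding constraints or requiring better time complexity, "
    "while keeping it solvable:\n\n{instruction}"
)

def evolve(instruction: str, ask_llm: Callable[[str], str], rounds: int = 1) -> list[str]:
    """Return the chain of progressively harder instructions, seed first."""
    chain = [instruction]
    for _ in range(rounds):
        chain.append(ask_llm(EVOLVE_TEMPLATE.format(instruction=chain[-1])))
    return chain
```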