-
SMARTPASTE Dataset
The dataset contains 4.8 million lines of C# code from 27 projects. -
SMARTPASTE
The SMARTPASTE task is a challenge for machine learning modeling of source code, that requires to learn (some) semantics of programs. -
AdaptivePaste Dataset
A dataset of code snippets in Python programming language, used for training and evaluating the AdaptivePaste model. -
SLTrans: A Source Code to LLVM IR Translation Pairs Dataset
SLTrans is a parallel dataset consisting of nearly 4M pairs of self-contained source code and corresponding LLVM IR. -
Codeforces
The Codeforces dataset contains solutions to competitive programming problems.