-
Block-based programming dataset
The dataset is a block-based programming dataset used to train a code classification model to predict students' success on a given problem. -
Various Datasets
The datasets used in the paper are described as follows: WikiMIA, BookMIA, Temporal Wiki, Temporal arXiv, ArXiv-1 month, Multi-Webdata, LAION-MI, Gutenberg. -
POJ-104 Dataset
The POJ-104 dataset is a collection of 104 program classes written by 500 different people randomly selected per class.