-
WIDER Dataset
A benchmark dataset for face detection, with 32,203 images and 393,703 faces. -
MoleculeNet
The MoleculeNet dataset is a collection of molecular property prediction tasks. It contains 17 datasets, each with a different type of molecular graph. -
CLIMABENCH
CLIMABENCH is a benchmark of climate-related text classification tasks. It collates five existing climate change-related text datasets, including CLIMATEXT, CLIMATESTANCE,... -
UKB object recognition benchmark
The UKB object recognition benchmark. -
Holidays dataset
The Holidays dataset is used for testing the performance of the HAE on visual feature translation. -
Urban100 dataset
The Urban100 dataset is a benchmark for image denoising, containing 100 images with varying levels of noise. -
BSDS500 dataset
The dataset used in this paper is the BSDS500 dataset, which contains 200 natural images with over 1000 ground truth labellings. -
speechocean762
speechocean762: An open-source non-native English speech corpus for pronunciation assessment. -
HumanEval, MBPP, APPS
The dataset used in the paper is a code generation benchmark, consisting of 164 function declarations alongside their documentation, 500 test examples, each one is an... -
VLCS Dataset
VLCS dataset is a benchmark for image classification, containing images from four different datasets (domains): VOC2007, LabelMe, Caltech, and SUN09. -
RL Unplugged
The RL Unplugged dataset, a benchmark for offline reinforcement learning, consisting of 20 tasks with varying difficulty levels. -
YCB Object and Model Set
The YCB object and model set is a benchmark for manipulation research, consisting of 15 object categories and 3D models. -
2018 Data Science Bowl dataset
A benchmark dataset for cell nuclei segmentation, featuring approximately 700 segmented cell images.