- Sunnybrook Cardiac MRI Datasets
The Sunnybrook dataset includes cine-MRI images from patients with different pathologies and expert-drawn contours for validation.
- Cornell Grasp Detection Dataset
The Cornell grasp detection dataset contains 855 images of 240 different objects with ground truth labels indicating graspable and not-graspable rectangles.
- Mapillary Traffic Sign Dataset
The Mapillary Traffic Sign Dataset (MTSD) is a collection of images for traffic sign detection and classification, containing various examples of traffic signs used to evaluate...
- FashionMNIST
FashionMNIST is an image dataset of clothing items, used here to evaluate the performance of STN and P-STN models in recovering transformations and augmentations.
- Stanford 3D Semantic Parsing Dataset
The Stanford 3D semantic parsing dataset contains 3D scans from Matterport scanners in indoor areas with full annotations for various semantic classes including structural...
- ShapeNet Part Segmentation Dataset
The ShapeNet part segmentation dataset contains 16,881 shapes from 16 categories, annotated with 50 parts in total, with a part label assigned to every 3D point.
- KITTI Object Detection
The KITTI Object Detection dataset provides a comprehensive set of 2D object detection data for real-world driving scenarios, particularly focusing on car detection.
- Bosch Small Traffic Lights Dataset
The Bosch Small Traffic Lights Dataset presents a challenge for detecting small objects with partial occlusions, especially useful for localizing small objects under weak...
- MIOvision Traffic Camera Dataset (MIO-TCD)
MIOvision Traffic Camera Dataset (MIO-TCD) is the largest public benchmark for object detection in traffic surveillance images, with a vast array of annotated images for training...
- SQuAD dataset
BERT is pretrained on a concatenation of Wikipedia and BooksCorpus; SQuAD (the Stanford Question Answering Dataset) is the downstream question-answering task it is applied to here.
- PennTreebank
The PennTreebank dataset is used for language modeling, containing a large annotated corpus of English text to evaluate the task of predicting the next character or word based...
- Nottingham
The Nottingham dataset contains British and American folk tunes and is used to evaluate models' capabilities in polyphonic music modeling.