Dataset - LDM

Sunnybrook Cardiac MRI Datasets

The Sunnybrook dataset includes cine-MRI images from patients with different pathologies and expert-drawn contours for validation.
- Dataset
- JSON
DirLab

The DirLab dataset consists of ten 4D CT scans of the thoracic region, each containing ten 3D CT volumes and manually placed reference landmarks in the lungs for registration evaluation.
- Dataset
- JSON
Cornell Grasp Detection Dataset

The Cornell grasp detection dataset contains 855 images of 240 different objects with ground truth labels indicating graspable and not-graspable rectangles.
- Dataset
- JSON
Mapillary Traffic Sign Dataset

The Mapillary Traffic Sign Dataset (MTSD) is a collection of images for traffic sign detection and classification, containing various examples of traffic signs used to evaluate...
- Dataset
- JSON
FashionMNIST

FashionMNIST is an image dataset of clothing items, used here to evaluate the performance of STN and P-STN models in recovering transformations and augmentations.
- Dataset
- JSON
Stanford 3D Semantic Parsing Dataset

The Stanford 3D semantic parsing dataset contains 3D scans from Matterport scanners in indoor areas with full annotations for various semantic classes including structural...
- Dataset
- JSON
ShapeNet Part Segmentation Dataset

The ShapeNet part segmentation dataset contains 16,881 shapes from 16 categories, annotated with 50 parts in total, with a part category label defined for each 3D point.
- Dataset
- JSON
KITTI Object Detection

The KITTI Object Detection dataset provides a comprehensive set of 2D object detection data for real-world driving scenarios, particularly focusing on car detection.
- Dataset
- JSON
Bosch Small Traffic Lights Dataset

The Bosch Small Traffic Lights Dataset presents a challenge for detecting small objects with partial occlusions, especially useful for localizing small objects under weak...
- Dataset
- JSON
MIOvision Traffic Camera Dataset (MIO-TCD)

MIOvision Traffic Camera Dataset (MIO-TCD) is the largest public benchmark for object detection in traffic surveillance images, with a vast array of annotated images for training...
- Dataset
- JSON
WMT2017

The WMT17 dataset is used for neural machine translation tasks, providing data for multiple language pairs.
- Dataset
- JSON
IWSLT2014

The IWSLT14 dataset is used for neural machine translation tasks, containing various parallel text translations.
- Dataset
- JSON
CIFAR10

The CIFAR10 dataset is used for training and evaluating deep neural networks, specifically in this study for assessing the performance of decision gates in ResNet-101 and...
- Dataset
- JSON
SQuAD dataset

This entry describes the data used to train BERT for the SQuAD question-answering task: a concatenation of Wikipedia and BooksCorpus.
- Dataset
- JSON
VIPeR

The VIPeR dataset consists of 632 persons, each with two images captured from different cameras, and is used for person re-identification tasks.
- Dataset
- JSON
CUHK01

CUHK01 is a medium-sized dataset containing 3,884 images of 971 identities, intended for testing person re-identification methods.
- Dataset
- JSON
CUHK03

CUHK03 is a large dataset containing 13,164 images for 1,360 identities captured by 6 cameras. It includes both detected and labeled images for training and testing.
- Dataset
- JSON
PennTreebank

The PennTreebank dataset is used for language modeling, containing a large annotated corpus of English text to evaluate the task of predicting the next character or word based...
- Dataset
- JSON
Nottingham

The Nottingham dataset contains British and American folk tunes and is used to evaluate models' capabilities in polyphonic music modeling.
- Dataset
- JSON
Pano2Vid

Pano2Vid is a real-world 360-degree video dataset containing videos from several categories. The frames are sampled and resized to 640x320 resolution for training and testing,...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

20,491 datasets found