20,499 datasets found

Filter Results
  • NIH Cancer Imaging Archive CT Scans

    The dataset consists of 17 full-body CT scans from the NIH Cancer Imaging Archive used for training, with one CT scan reserved for testing. The pelvis bone was segmented from...
  • Massively Multilingual Machine Translation Dataset

    A corpus of parallel documents over 102 languages and English, containing 25 billion training examples across a diverse set of languages used for multilingual neural machine...
  • ELI5

    ELI5 is a long-form question answering dataset where the answers are generated based on the concatenation of a question and relevant supporting documents.
  • Downscaled ImageNet

    Downscaled ImageNet is a modified version of the standard ImageNet dataset, containing a reduced size of images and fewer classes for training models efficiently.
  • Partial-iLIDS

    Partial-iLIDS dataset is a simulated partial dataset based on iLIDs, consisting of 119 persons with 238 images captured by multiple non-overlapping cameras.
  • Partial-ReID

    Partial-ReID dataset includes 600 images of 60 persons, with 5 full-body images and 5 partial images per person, collected at a university campus with various viewpoints and...
  • Cornell eRulemaking Corpus (CDCP)

    The Cornell eRulemaking Corpus (CDCP) consists of 731 user comments collected from an eRulemaking website, totaling about 4,700 propositions, all considered argumentative. The...
  • Popi Dataset

    The Popi dataset consists of six 4D CTs showing the lung region, each including 10 3D CTs and landmarks for registration evaluation.
  • Sunnybrook Cardiac MRI Datasets

    The Sunnybrook dataset includes cine-MRI images from patients with different pathologies and expert-drawn contours for validation.
  • DirLab

    The DirLab dataset consists of 10 4D CTs of the thoracic region, each containing 10 3D CTs and manually set reference landmarks in the lungs for registration evaluation.
  • Cornell Grasp Detection Dataset

    The Cornell grasp detection dataset contains 855 images of 240 different objects with ground truth labels indicating graspable and not-graspable rectangles.
  • Mapillary Traffic Sign Dataset

    The Mapillary Traffic Sign Dataset (MTSD) is a collection of images for traffic sign detection and classification, containing various examples of traffic signs used to evaluate...
  • FashionMNIST

    FashionMNIST is an image dataset of clothing items, used here to evaluate the performance of STN and P-STN models in recovering transformations and augmentations.
  • Stanford 3D Semantic Parsing Dataset

    The Stanford 3D semantic parsing dataset contains 3D scans from Matterport scanners in indoor areas with full annotations for various semantic classes including structural...
  • ShapeNet Part Segmentation Dataset

    ShapeNet part segmentation dataset contains 16,881 shapes from 16 categories, annotated with 50 parts in total, where each part category label is defined for each 3D point.
  • KITTI Object Detection

    The KITTI Object Detection dataset provides a comprehensive set of 2D object detection data for real-world driving scenarios, particularly focusing on car detection.
  • Bosch Small Traffic Lights Dataset

    The Bosch Small Traffic Lights Dataset presents a challenge for detecting small objects with partial occlusions, especially useful for localizing small objects under weak...
  • MIOvision Traffic Camera Dataset (MIO-TCD)

    MIOvision Traffic Camera Dataset (MIO-TCD) is the largest public benchmark for object detection in traffic surveillance images, with a vast array of annotated images for training...
  • WMT2017

    The WMT17 dataset is used for neural machine translation tasks, providing data for multiple language pairs.
  • IWSLT2014

    The IWSLT14 dataset is used for neural machine translation tasks, containing various parallel text translations.