15 datasets found

Tags: recognition

Filter Results
  • VideoLT

    The VideoLT dataset contains 1,004 classes and about 256,218 untrimmed videos collected from YouTube, covering a wide range of human activities, including everyday life,...
  • IAM

    The dataset used for handwritten text recognition, containing handwritten text images.
  • NTU-RGBD120

    A real-world skeleton-based human action recognition dataset.
  • NTU-RGBD60

    A real-world skeleton-based human action recognition dataset.
  • Affect-in-the-wild

    Facial expression recognition dataset in the wild
  • CSL-Daily

    CSL-Daily is a Chinese sign language (CSL) dataset that mainly focuses on people’s daily lives. It includes 18401, 1077, and 1176 available examples in the training, validation,...
  • MPI3D dataset

    The dataset used in the paper is a MPI3D dataset, which contains 3D images of objects with varying sizes and colors.
  • Human Activity Recognition Dataset

    The UCI machine learning repository contains a dataset of human activity recognition from inertial sensors.
  • GTZAN

    The GTZAN dataset is a comprehensive collection of 1000 audio tracks, each 30 seconds long, representing ten diverse music genres.
  • Table Tennis Stroke Detection and Recognition Using Ball Trajectory Data

    Table tennis stroke detection and recognition using ball trajectory data
  • USPS dataset

    The USPS dataset consists of 9298 images of handwritten digits 0-9 (10 classes) of 16x16 pixels in gray scale.
  • HMDB-51

    Motion has shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
  • Librispeech

    The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.
  • DeepFashion dataset

    The DeepFashion dataset is a large-scale dataset for person image synthesis, containing 101,966 pairs of images with different poses and clothing.
  • FER+

    The dataset used for emotion recognition, featuring facial expression images.
You can also access this registry using the API (see API Docs).