Dataset - LDM

OuluVS2

OuluVS2 is a multi-view audiovisual database for non-rigid mouth motion analysis.
- Dataset
- JSON
AV Digits

The AV Digits database contains both normal and silent speech.
- Dataset
- JSON
LRS3-TED: A Large-Scale Dataset for Visual Speech Recognition

LRS3-TED: a large-scale dataset for visual speech recognition.
- Dataset
- JSON
LRS3

The LRS3 dataset is a large-scale dataset for visual speech recognition. It consists of thousands of spoken sentences from TED videos.
- Dataset
- JSON
Kinetics-600

The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

5 datasets found