-
LRS3-TED: A Large-Scale Dataset for Visual Speech Recognition
LRS3-TED: a large-scale dataset for visual speech recognition. -
Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.