2 datasets found

Tags: Audio-Visual Dataset

  • VoxCeleb

    Speaker verification systems suffer significant performance degradation on short-duration trial recordings. To address this challenge, a multi-scale feature...
  • MEAD

    MEAD is a large-scale, high-quality emotional audio-visual dataset featuring 60 actors, covering 8 basic emotions and 3 different emotional-intensity...
You can also access this registry using the API (see API Docs).