4 datasets found

Filter Results
  • Voxceleb2

    The Voxceleb2 dataset is a large-scale speaker recognition dataset, containing 2442 hours raw speech from 6112 speakers.
  • MEAD and HDTF datasets

    MEAD and HDTF datasets are used for training and testing the proposed SAAS model.
  • HDTF

    The dataset used in the paper for 3D head avatar reconstruction from monocular RGB videos.
  • MEAD

    The MEAD dataset is a large-scale, high-quality emotional audio-visual dataset, which consists of 60 actors, including 8 basic emotions and 3 different emotional-intensity...