1 dataset found

Groups: Audio-Visual Speech Separation Organizations: No Organization

Filter Results
  • Voxceleb2

    The Voxceleb2 dataset is a large-scale speaker recognition dataset, containing 2442 hours raw speech from 6112 speakers.