7 datasets found

Groups: Speaker Recognition
Organizations: No Organization

  • Free Spoken Digit Dataset

    The dataset is a collection of 8 kHz audio recordings of spoken digits from 'zero' to 'nine'.
  • TIMIT Corpus

    The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks.
  • TIMIT

    The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...
  • A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognitio...

    The Kazakh speech corpus (KSC) contains around 332 hours of transcribed audio comprising over 153,000 utterances spoken by participants from different regions and age groups, as...
  • VoxCeleb

    Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...
  • VoxCeleb1

    Speaker recognition aims to identify speaker information from input speech. A type of speaker recognition is speaker verification (SV). It determines whether the test speaker's...
  • LibriTTS

    A popular text-based voice conversion (VC) approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgrams (PPGs) as the content representation.