12 datasets found

Tags: Speaker Recognition

Filter Results
  • VOiCES

    The VOiCES dataset is used for testing the speaker recognition system. The dataset contains 7323 identities combined. The dataset is used for testing.
  • NIST SRE16

    Speaker recognition evaluation dataset
  • Free Spoken Digit Dataset

    The dataset is a collection of 8kHz audio recordings of spoken digits from 'zero' to 'nine'.
  • TIMIT Corpus

    The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks.
  • Voxceleb2: Deep speaker recognition

    Voxceleb2: Deep speaker recognition.
  • NIST-SRE 2016

    The NIST-SRE 2016 dataset contains recordings of speakers from different domains.
  • Ted Talks

    The Ted Talks dataset is a speaker recognition dataset, consisting of audio recordings of speakers.
  • VCTK Corpus

    The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers.
  • VoxCeleb

    Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...
  • VoxCeleb1

    Speaker recognition aims to identify speaker information from input speech. A type of speaker recognition is speaker verification (SV). It determines whether the test speaker's...
  • SimpleQuestion dataset for Wikidata

    The dataset used in this paper is a reinforcement learning dataset, specifically the SimpleQuestion dataset, which contains questions answerable using Wikidata as the knowledge...
  • LibriTTS

    A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.
You can also access this registry using the API (see API Docs).