Speaker Recognition - Groups

SincNet: A Novel CNN Architecture for Speaker Recognition from Raw Waveforms

Speaker recognition is a very active research area with no-table applications in various fields such as biometric authentication, forensics, security, speech recognition, and...

Dataset
JSON

Voxceleb2

The Voxceleb2 dataset is a large-scale speaker recognition dataset, containing 2442 hours raw speech from 6112 speakers.

Dataset
JSON

BUT retransmitted audio dataset

The dataset of retransmitted audio used for PLDA adaptation

Dataset
JSON

VOiCES 2019 Speaker Recognition Challenge

The dataset used for the VOiCES 2019 Speaker Recognition challenge

Dataset
JSON

Timbre Dataset Generation

The proposed model uses the timbral properties of voice, that is hardly used in any other research endeavors. The model is tested against a real-world continuous stream of...

Dataset
JSON

VOiCES

The VOiCES dataset is used for testing the speaker recognition system. The dataset contains 7323 identities combined. The dataset is used for testing.

Dataset
JSON

NIST SRE16

Speaker recognition evaluation dataset

Dataset
JSON

2000 NIST Speaker Recognition Evaluation

The dataset is used for speaker diarization tasks.

Dataset
JSON

NIST SRE 2000 CALLHOME

The dataset is used for speaker diarization tasks.

Dataset
JSON

ASVspoof2019

The ASVspoof2019 LA subset consists of three parts, training, development, and evaluation. Each partition has a disjoint set of speakers. The average duration of the utterances...

Dataset
JSON

Free Spoken Digit Dataset

The dataset is a collection of 8kHz audio recordings of spoken digits from 'zero' to 'nine'.

Dataset
JSON

VoxCeleb dataset

The VoxCeleb dataset is a large-scale speaker identification dataset, used to evaluate the performance of face recognition systems.

Dataset
JSON

TIMIT Corpus

The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks.

Dataset
JSON

TIMIT

The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...

Dataset
JSON

Voxceleb2: Deep speaker recognition

Voxceleb2: Deep speaker recognition.

Dataset
JSON

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognitio...

The Kazakh speech corpus (KSC) contains around 332 hours of transcribed audio comprising over 153,000 utterances spoken by participants from different regions and age groups, as...

Dataset
JSON

Ted Talks

The Ted Talks dataset is a speaker recognition dataset, consisting of audio recordings of speakers.

Dataset
JSON

VCTK Corpus

The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers.

Dataset
JSON

VoxCeleb

Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...

Dataset
JSON

VoxCeleb1

Speaker recognition aims to identify speaker information from input speech. A type of speaker recognition is speaker verification (SV). It determines whether the test speaker's...

Dataset
JSON

22 datasets found