Speaker Identification - Groups

HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-...

An end-to-end (E2E) speaker-attributed automatic speech recognition (SA-ASR) model was proposed recently to jointly perform speaker counting, speech recognition and speaker...

Dataset
JSON

FOOLHD: FOOLING SPEAKER IDENTIFICATION BY HIGHLY IMPERCEPTIBLE ADVERSARIAL DI...

Speaker identiﬁcation models are vulnerable to carefully designed adversarial perturbations of their input signals that induce misclas-siﬁcation.

Dataset
JSON

VoxCeleb dataset

The VoxCeleb dataset is a large-scale speaker identification dataset, used to evaluate the performance of face recognition systems.

Dataset
JSON

VoxCeleb: A Large-Scale Speaker Identification Dataset

Dataset
JSON

VoxCeleb

Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...

Dataset
JSON

VoxCeleb1

Speaker recognition aims to identify speaker information from input speech. A type of speaker recognition is speaker verification (SV). It determines whether the test speaker's...

Dataset
JSON