-
HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-...
An end-to-end (E2E) speaker-attributed automatic speech recognition (SA-ASR) model was proposed recently to jointly perform speaker counting, speech recognition and speaker... -
FOOLHD: FOOLING SPEAKER IDENTIFICATION BY HIGHLY IMPERCEPTIBLE ADVERSARIAL DI...
Speaker identification models are vulnerable to carefully designed adversarial perturbations of their input signals that induce misclas-sification. -
VoxCeleb dataset
The VoxCeleb dataset is a large-scale speaker identification dataset, used to evaluate the performance of face recognition systems. -
VoxCeleb: A Large-Scale Speaker Identification Dataset
VoxCeleb: A Large-Scale Speaker Identification Dataset