- End-to-end speaker-attributed ASR with Transformer
- Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End ...
This paper presents Transcribe-to-Diarize, a new approach for neural speaker diarization that uses end-to-end (E2E) speaker-attributed automatic speech recognition (SA-ASR).
- The AMI Corpus: A Spoken Language Corpus for Speaker Diarization and Emotion ...
The AMI corpus: A spoken language corpus for speaker diarization and emotion recognition
- End-to-End Neural Speaker Diarization with Permutation-Free Objectives
The End-to-End Neural Speaker Diarization dataset is a benchmark for speaker diarization.
- The Third DIHARD Diarization Challenge
The DIHARD dataset is a benchmark for speaker diarization.
- 2000 NIST Speaker Recognition Evaluation
The dataset is used for speaker diarization tasks.
- NIST RT-03 English CTS
The dataset is used for speaker diarization tasks.
- NIST SRE 2000 CALLHOME
The dataset is used for speaker diarization tasks.
- Speaker Diarization with LSTM
Speaker diarization is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity.