Dataset - LDM

End-to-end speaker-attributed ASR with Transformer

- Dataset
- JSON
Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End ...

This paper presents Transcribe-to-Diarize, a new approach for neural speaker diarization that uses an end-to-end (E2E) speaker-attributed automatic speech recognition (SA-ASR) model.
- Dataset
- JSON
MiniVox

MiniVox is an automatic framework to transform any speaker-labelled dataset into a continuous speech datastream with episodically revealed label feedbacks.
- Dataset
- JSON
The AMI Corpus: A Spoken Language Corpus for Speaker Diarization and Emotion ...

The AMI corpus: A spoken language corpus for speaker diarization and emotion recognition
- Dataset
- JSON
End-to-End Neural Speaker Diarization with Permutation-Free Objectives

The End-to-End Neural Speaker Diarization dataset is a benchmark for speaker diarization.
- Dataset
- JSON
The Third DIHARD Diarization Challenge

The DIHARD dataset is a benchmark for speaker diarization.
- Dataset
- JSON
2000 NIST Speaker Recognition Evaluation

The dataset is used for speaker diarization tasks.
- Dataset
- JSON
NIST RT-03 English CTS

The dataset is used for speaker diarization tasks.
- Dataset
- JSON
NIST SRE 2000 CALLHOME

The dataset is used for speaker diarization tasks.
- Dataset
- JSON
Speaker Diarization with LSTM

Speaker diarization is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

10 datasets found