-
Amazon Alexa Dataset
A 23 thousand hour corpus of untranscribed, de-identified, far-field, English voice command and voice query speech. -
Open Subtitles dataset
The Open Subtitles dataset consists of transcriptions of spoken dialog in movies and television shows. -
Loss Prediction: End-to-End Active Learning for Speech Recognition
End-to-end speech recognition systems usually require huge amounts of labeling resource, while annotating the speech data is complicated and expensive. Active learning is the... -
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech...
This paper presents a well-known music identification method and implements it as a neural net. -
BABEL-Pashto
The BABEL-Pashto dataset is a multilingual speech recognition dataset containing Pashto speech recordings. -
Speech Intelligibility Prediction with DNN-based Performance Measures
The dataset used for speech intelligibility prediction with DNN-based performance measures -
DNS-5 dataset
The dataset used in the paper is a benchmarking dataset for speech-to-speech translation. -
Bengali Medical Corpus
A comprehensive 46-hour Bengali medical corpus encompassing disease names, symptoms, and symptom severity. -
Highly-Reverberant Real Environment database (HRRE)
Highly-Reverberant Real Environment database (HRRE) contains 13.4 hours of data recorded in real reverberant environments and consists of 20 different testing conditions. -
OpenSeq2Seq
The OpenSeq2Seq dataset is a speech recognition dataset used in the OpenSeq2Seq framework. -
Kaldi Speech Recognition Toolkit
The Kaldi Speech Recognition Toolkit is a widely used dataset for speech recognition. -
WAV2LETTER++
The dataset used in this paper is not explicitly mentioned, but it is implied to be a speech recognition dataset.