Speech Recognition - Groups

Speech EEG Database

Two simultaneous speech EEG recording databases for this work. For database A five female and five male subjects took part in the experiment. For database B five male and three...

Dataset
JSON

Isolet

The Isolet dataset is a spoken letter (A-Z) data set with 26 classes and approximately 297 examples per class.

Dataset
JSON

LibriLight: A Benchmark for ASR with Limited or No Supervision

The LibriLight dataset is a large-scale speech corpus used for self-supervised speech recognition tasks.

Dataset
JSON

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CDROM

The TIMIT Acoustic-Phonetic Continuous Speech Corpus CDROM is a widely used dataset for speech recognition tasks.

Dataset
JSON

UNSUPERVISED SPEECH RECOGNITION WITH N-SKIPGRAM AND POSITIONAL UNIGRAM MATCHING

Training unsupervised speech recognition systems presents challenges due to GAN-associated instability, misalignment between speech and text, and significant memory demands. To...

Dataset
JSON

TI-46 Spoken Digits Recognition

The TI-46 spoken digits dataset comprises of 5 speakers uttering 10 times each of the 10 digits (500 samples)

Dataset
JSON

The REPERE Corpus: a multimodal corpus for person recognition

The REPERE Corpus: a multimodal corpus for person recognition contains TV broadcasts.

Dataset
JSON

The ESTER phase II evaluation campaign for the rich transcription of French b...

The ESTER phase II evaluation campaign for the rich transcription of French broadcast news contains news reports.

Dataset
JSON

The ETAPE corpus for the evaluation of speech-based TV content processing in ...

The ETAPE corpus for the evaluation of speech-based TV content processing in the French language contains TV broadcasts.

Dataset
JSON

DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM

The TIMIT acoustic-phonetic continuous speech corpus CD-ROM contains read speech from 2500 speakers.

Dataset
JSON

indic-punct

The dataset is used for automatic punctuation restoration and inverse text normalization for Indic languages.

Dataset
JSON

CallHome

The dataset used for comparing human and machine transcription errors in conversational speech.

Dataset
JSON

Stanford Neural Machine Translation Systems for Spoken Language Domain

Stanford neural machine translation systems for spoken language domain.

Dataset
JSON

Arabic Digits Dataset

The dataset used in this paper is a dataset for spoken digit recognition of Arabic digits from 0 to 9.

Dataset
JSON

Unsupervised word segmentation and lexicon discovery using acoustic word embe...

A dataset for the Zero Resource Speech Challenge 2015.

Dataset
JSON

Fixed-dimensional acoustic embeddings of variable-length segments in low-reso...

A dataset for the Zero Resource Speech Challenge 2015.

Dataset
JSON

The Zero Resource Speech Challenge 2015

A dataset for the Zero Resource Speech Challenge 2015.

Dataset
JSON

A segmental Bayesian framework for fully-unsupervised large-vocabulary speech...

A segmental Bayesian model for full-coverage segmentation and clustering of conversational speech audio.

Dataset
JSON

Amazon Alexa Dataset

A 23 thousand hour corpus of untranscribed, de-identified, far-field, English voice command and voice query speech.

Dataset
JSON

DeepSpeech

The DeepSpeech dataset used for evaluation of the proposed watermarking scheme.

Dataset
JSON

194 datasets found