Speech Recognition - Groups

ASRU 2019 Mandarin-English code-switching speech recognition challenge

The ASRU 2019 Mandarin-English code-switching speech recognition challenge dataset.

Dataset
JSON

Wall Street Journal

The Wall Street Journal dataset is used for syntactic linearization. It contains a large corpus of news articles with their corresponding syntactic trees.

Dataset
JSON

Video Corpus

A corpus of free and representative video content was gathered. This corpus includes videos having progressive scanning, 1280x720 resolution, and framerates between 24-30 frames...

Dataset
JSON

Correction Focused Language Model Training for Speech Recognition

Language models have been commonly adopted to boost the performance of automatic speech recognition (ASR) particularly in domain adaptation tasks. Conventional way of LM...

Dataset
JSON

AudioMNIST dataset

The dataset used in the paper is the AudioMNIST dataset, which contains 30,000 audio recordings.

Dataset
JSON

Convolutional Neural Networks for Speech Recognition

The Speech Recognition dataset is used for speech recognition tasks.

Dataset
JSON

Data set B

The dataset used for performing continuous speech recognition experiments using EEG features.

Dataset
JSON

Data set A and B

The dataset used for performing isolated and continuous speech recognition experiments using EEG features.

Dataset
JSON

LibriSpeech: An ASR Corpus Based on Public Domain Audio Books

LibriSpeech: an ASR corpus based on public domain audio books.

Dataset
JSON

Tedlium3

Tedlium3: A large-scale English speech corpus for speaker adaptation.

Dataset
JSON

GTZAN dataset

The GTZAN dataset is a small but popular dataset for genre classification, containing 10 musical genres, with each genre having 100 audio snippets of 30 s length.

Dataset
JSON

Free Spoken Digit Dataset

The dataset is a collection of 8kHz audio recordings of spoken digits from 'zero' to 'nine'.

Dataset
JSON

Lwazi speech corpus

Collecting and evaluating speech recognition corpora for nine southern bantu languages

Dataset
JSON

NCHLT speech corpus

The NCHLT speech corpus of the South African languages

Dataset
JSON

EasyASR

The dataset used in this paper is EasyASR, a distributed machine learning platform for end-to-end automatic speech recognition.

Dataset
JSON

INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile ...

The dataset used in this paper is a Conv1D equipped ASR model deployed on mobile devices.

Dataset
JSON

Attention-based beamformers for multi-channel speech recognition

The proposed 2D Conv-Attention model is compared with a traditional neural beamformer and multi-head attention based model.

Dataset
JSON

People’s Speech

The People’s Speech: A large-scale diverse English speech recognition dataset for commercial usage.

Dataset
JSON

LIBRIHEAVY: A 50,000 HOURS ASR CORPUS WITH PUNCTUATION CASING AND CONTEXT

Libriheavy is a large-scale ASR corpus consisting of 50,000 hours of read English speech derived from LibriVox. To the best of our knowledge, Libriheavy is the largest...

Dataset
JSON

CHiME-2

The CHiME-2 dataset is a speech separation and recognition challenge dataset. It contains 7138 utterances of 8 speakers, each with 10 seconds of speech.

Dataset
JSON

160 datasets found