Speech Enhancement - Groups

MHINT

The MHINT corpus is a Mandarin Chinese speech corpus used for speech recognition and speech enhancement. It contains 480 utterances of 10 speakers, each with 10 seconds of speech.

Dataset
JSON

TIMIT dataset

The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated...

Dataset
JSON

TIMIT

The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...

Dataset
JSON

LRS2

The LRS2 dataset consists of 48,164 video clips from outdoor shows on BBC television. Each video is accompanied by an audio corresponding to a sentence with up to 100 characters.

Dataset
JSON

4 datasets found

MHINT

TIMIT dataset

TIMIT

LRS2