9 datasets found

Formats: JSON

Filter Results
  • Freesound Dataset

    The Freesound dataset consists of 18,873 audio files, each assigned one of the 41 unique audio events from the Google's Audioset Ontology.
  • Bach The Well-Tempered Clavier Book One and Two

    Bach The Well-Tempered Clavier Book One (WTC B1) and Bach The Well-Tempered Clavier Book Two (WTC B2) datasets.
  • COVID-19 Identiļ¬cation ResNet (CIdeR)

    The COVID-19 Identiļ¬cation ResNet (CIdeR) dataset consists of 517 crowdsourced coughing and breathing audio recordings from 355 participants, of which 62 participants had tested...
  • VoiceBank DEMAND dataset

    Speech enhancement dataset
  • TIMIT dataset

    The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated...
  • VCTK Corpus

    The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers.
  • Librispeech

    The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.
  • VCTK

    Voice conversion (VC) is a technique that alters the voice of a source speaker to a target style, such as speaker identity, prosody, and emotion, while keeping the linguistic...
  • LibriTTS

    A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.