2 datasets found

Tags: audio clips

Filter Results
  • DCASE 2021 Challenge Dataset

    The DCASE 2021 challenge dataset consists of 1578 weakly-labelled, 10000 synthesized strongly-labelled and 14412 unlabelled audio clips.
  • Clotho

    Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences.
You can also access this registry using the API (see API Docs).