Dataset - LDM

DCASE 2021 Challenge Dataset

The DCASE 2021 challenge dataset consists of 1578 weakly-labelled, 10000 synthesized strongly-labelled and 14412 unlabelled audio clips.
- Dataset
- JSON
Clotho

Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

2 datasets found