Clotho v2

Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences.

Data and Resources

Cite this as

Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang (2024). Dataset: Clotho v2. https://doi.org/10.57702/rv90u3ey

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.1109/TASLP.2024.3416686
Author Xinhao Mei
More Authors
Xubo Liu
Jianyuan Sun
Mark D. Plumbley
Wenwu Wang
Homepage https://doi.org/10.1109/ICASSP.2020.9123107