Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 2 datasets found Tags: natural language sentences Filter Results Clotho v2 Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences. Dataset JSON Clotho Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences. Dataset JSON