-
CL4AC: A CONTRASTIVE LOSS FOR AUDIO CAPTIONING
Automated Audio captioning (AAC) is a cross-modal translation task that aims to use natural language to describe the content of an audio clip. -
Clotho: An audio captioning dataset
Audio captioning is a multi-modal task, focusing on using natural language for describing the contents of general audio. Most audio captioning methods are based on deep neural...