-
CL4AC: A CONTRASTIVE LOSS FOR AUDIO CAPTIONING
Automated Audio captioning (AAC) is a cross-modal translation task that aims to use natural language to describe the content of an audio clip. -
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastiv...
Emotion recognition is involved in several real-world applications. With an increase in available modalities, automatic understanding of emotions is being performed more...