The OMG Emotion Behavior Dataset is a multimodal emotion recognition dataset, consisting of 178 long videos divided into 2725 short utterances of varying length.
RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) dataset contains 24 professional actors (12 female, 12 male) to offer the performance with good quality and...
The CREMA-D dataset is an audio-visual dataset for emotion recognition task, each video in which consists of both facial and acoustic emotional expressions.
Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...