5 datasets found

Filter Results
  • The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)

    A dynamic, multi-modal set of facial and vocal expressions in North American English
  • RAVDESS

    RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) dataset contains 24 professional actors (12 female, 12 male) to offer the performance with good quality and...
  • EMODB

    The EMODB dataset is a German language speech library containing about 535 audio clips, each ranging from 1 to 10 seconds long, covering seven different emotional expressions.
  • SAVEE

    The SAVEE dataset contains 480 acted English utterances recorded by four male actors and consists of seven emotion categories: anger, fear, disgust, happiness, neutral, sadness,...
  • CREMA-D

    The CREMA-D dataset is an audio-visual dataset for emotion recognition task, each video in which consists of both facial and acoustic emotional expressions.