Emotional Speech - Groups

The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)

A dynamic, multi-modal set of facial and vocal expressions in North American English

Dataset
JSON

RAVDESS

RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) dataset contains 24 professional actors (12 female, 12 male) to offer the performance with good quality and...

Dataset
JSON

EMODB

The EMODB dataset is a German language speech library containing about 535 audio clips, each ranging from 1 to 10 seconds long, covering seven different emotional expressions.

Dataset
JSON

SAVEE

The SAVEE dataset contains 480 acted English utterances recorded by four male actors and consists of seven emotion categories: anger, fear, disgust, happiness, neutral, sadness,...

Dataset
JSON

CREMA-D

The CREMA-D dataset is an audio-visual dataset for emotion recognition task, each video in which consists of both facial and acoustic emotional expressions.

Dataset
JSON

5 datasets found

The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)

RAVDESS

EMODB

SAVEE

CREMA-D