Table 1 Summary of the speech and non-speech datasets.
From: Emotion recognition and confidence ratings predicted by vocal stimulus type and prosodic parameters
Speech corpora | Description of content | Initial content | Number of selected files |
|---|---|---|---|
Anna (Hammerschmidt and Juergens, 2007) | Name “Anna” uttered for 8 emotions [(anger, affection, contempt, despair, fear, happiness, sensual satisfaction, triumph) + neutral (baseline expression)] by 22 German drama students [10 males (M); 12 females (F)] (same for all emotions). | 198 audio files | 88 [(emotion category of interest + baseline expression) ×22 speakers] |
Montreal Affective Voices (Belin et al., 2008) | Portrayals of non-verbal emotional sounds/affect bursts (e.g., laughing, crying) for 8 emotions [(anger, disgust, fear, happiness, pain, pleasure, sadness, surprise) + baseline expression (neutral)] by 10 francophone actors (5 M; 5 F) (same for all emotions). | 90 audio files | 70 [(emotion category of interest + baseline expression) ×10 Speakers] |
Berlin Database of Emotional Speech (Burkhardt et al., 2005) | Portrayals of 6 emotions [(anger, boredom, disgust, fear, happiness, sadness) + baseline expression (neutral)] by 10 German untrained actors (5 M; 5 F). The database consists of 10 semantic neutral sentences (same for all emotions). | 816 audio files | 120 [(emotion category of interest + baseline expression) ×2 Speakers × 10 sentences] |
Magdeburg Prosody Corpus (Wendt and Scheich, 2002) | Portrayals of 5 emotions [(anger, disgust, fear, happiness, sadness) + baseline expression (neutral)] by 2 German actors (1 M; 1 F). The corpus consists of 3318 nouns classified according to their positive-, negative- and neutral semantic content and of 222 pseudo-words (same for all emotions). | 3318 audio files (nouns)+222 audio files (pseudo-words) | 480 [(all emotions + baseline expression) ×2 Speakers × 10 nouns per semantic category (i.e., positive, negative, neutral)/10 Pseudo-words] |
Paulmann Prosodic Stimuli (Paulmann and Kotz, 2008; Paulmann et al., 2008) | Portrayals of 6 emotions (anger, disgust, fear, happiness, sadness, surprise) + baseline expression (neutral) by 2 German actors (1 M; 1 F). The stimulus set consists of 210 lexical sentences and 210 pseudo-sentences (different for each emotion). | 420 audio files | 280 [(10 lexical sentences & 10 pseudo-sentences for each emotion + baseline expression) ×2 speakers] |