Table 1 Summary of the speech and non-speech datasets.

From: Emotion recognition and confidence ratings predicted by vocal stimulus type and prosodic parameters

Speech corpora

Description of content

Initial content

Number of selected files

Anna (Hammerschmidt and Juergens, 2007)

Name “Anna” uttered for 8 emotions [(anger, affection, contempt, despair, fear, happiness, sensual satisfaction, triumph) + neutral (baseline expression)] by 22 German drama students [10 males (M); 12 females (F)] (same for all emotions).

198 audio files

88 [(emotion category of interest + baseline expression) ×22 speakers]

Montreal Affective Voices (Belin et al., 2008)

Portrayals of non-verbal emotional sounds/affect bursts (e.g., laughing, crying) for 8 emotions [(anger, disgust, fear, happiness, pain, pleasure, sadness, surprise) + baseline expression (neutral)] by 10 francophone actors (5 M; 5 F) (same for all emotions).

90 audio files

70 [(emotion category of interest + baseline expression) ×10 Speakers]

Berlin Database of Emotional Speech (Burkhardt et al., 2005)

Portrayals of 6 emotions [(anger, boredom, disgust, fear, happiness, sadness) + baseline expression (neutral)] by 10 German untrained actors (5 M; 5 F). The database consists of 10 semantic neutral sentences (same for all emotions).

816 audio files

120 [(emotion category of interest + baseline expression) ×2 Speakers × 10 sentences]

Magdeburg Prosody Corpus (Wendt and Scheich, 2002)

Portrayals of 5 emotions [(anger, disgust, fear, happiness, sadness) + baseline expression (neutral)] by 2 German actors (1 M; 1 F). The corpus consists of 3318 nouns classified according to their positive-, negative- and neutral semantic content and of 222 pseudo-words (same for all emotions).

3318 audio files (nouns)+222 audio files (pseudo-words)

480 [(all emotions + baseline expression) ×2 Speakers × 10 nouns per semantic category (i.e., positive, negative, neutral)/10 Pseudo-words]

Paulmann Prosodic Stimuli (Paulmann and Kotz, 2008; Paulmann et al., 2008)

Portrayals of 6 emotions (anger, disgust, fear, happiness, sadness, surprise) + baseline expression (neutral) by 2 German actors (1 M; 1 F). The stimulus set consists of 210 lexical sentences and 210 pseudo-sentences (different for each emotion).

420 audio files

280 [(10 lexical sentences & 10 pseudo-sentences for each emotion + baseline expression) ×2 speakers]