Table 11 Sensitivity and specificity of the sconn model for each emotion class using the mel Spectrogram, MFCC, and mel Spectrogram + MFCC features of the RAVDESS dataset.
From: Stacked convolutional neural network for emotion recognition using multi feature speech analysis
Emotion Classes | Mel Spectrogram | MFCC | Mel Spectrogram + MFCC | |||
|---|---|---|---|---|---|---|
Sensitivity | Specificity | Sensitivity | Specificity | Sensitivity | Specificity | |
angry | 92.89% | 99.13% | 95.43% | 98.61% | 97.97% | 99.04% |
calm | 93.33% | 98.20% | 98.33% | 98.20% | 96.67% | 99.66% |
disgust | 91.01% | 97.92% | 89.42% | 98.70% | 92.06% | 97.75% |
fearful | 87.21% | 98.38% | 84.88% | 98.81% | 89.53% | 99.40% |
happy | 89.40% | 99.29% | 90.32% | 97.96% | 90.32% | 98.94% |
sad | 84.29% | 97.92% | 89.01% | 98.61% | 90.58% | 99.05% |
surprised | 95.96% | 98.25% | 92.93% | 99.21% | 95.96% | 98.34% |