Table 9 Classification accuracy of the proposed sconn deep learning models on the RAVDESS, SAVEE and TESS datasets.

From: Stacked convolutional neural network for emotion recognition using multi feature speech analysis

| Feature set | RAVDESS | SAVEE | TESS |
| --- | --- | --- | --- |
| MFCC (40) | 91.51% | 91.43% | 99.93% |
| Mel Spectrogram (128) | 90.63% | 94.76% | 99.68% |
| MFCC (40) + Mel Spectrogram (128) | 93.30% | 95.00% | 99.93% |
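
To make the comparison in the table concrete, the following is a minimal sketch that computes how much the combined feature set gains over the best single feature set on each dataset. The accuracies are copied from the table; the names `accuracy`, `COMBINED`, and `gain_over_best_single` are hypothetical and are not taken from the paper.

```python
# Accuracies (%) copied from Table 9: one entry per feature set, one value per dataset.
accuracy = {
    "MFCC (40)": {"RAVDESS": 91.51, "SAVEE": 91.43, "TESS": 99.93},
    "Mel Spectrogram (128)": {"RAVDESS": 90.63, "SAVEE": 94.76, "TESS": 99.68},
    "MFCC (40) + Mel Spectrogram (128)": {"RAVDESS": 93.30, "SAVEE": 95.00, "TESS": 99.93},
}

# Hypothetical label for the row that combines both feature sets.
COMBINED = "MFCC (40) + Mel Spectrogram (128)"


def gain_over_best_single(table, combined=COMBINED):
    """Per dataset: combined accuracy minus the best single-feature accuracy,
    in percentage points, rounded to two decimals."""
    gains = {}
    for dataset in table[combined]:
        best_single = max(
            row[dataset] for name, row in table.items() if name != combined
        )
        gains[dataset] = round(table[combined][dataset] - best_single, 2)
    return gains


gains = gain_over_best_single(accuracy)
for dataset, gain in gains.items():
    print(f"{dataset}: {gain:+.2f} percentage points")
```

Under these numbers the combined feature set gains 1.79 percentage points on RAVDESS and 0.24 on SAVEE, and matches the MFCC-only result on TESS.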