Fig. 2

Classification accuracy as a function of the number of selected features for emotion recognition based on arousal levels. The curves show performance over five emotional categories for male and female speakers, using prosodic and spectral features. Accuracy peaks at 800 features for female speakers and at 900 features for male speakers.