Table 1 Classification accuracy of various methods.

From: Acoustic scene classification based on three-dimensional multi-channel feature-correlated deep learning networks

Method

Data augmentation

Features extraction

Sample rate (kHz)

Accuracy (%)

CNNs(5/9/13)23

\

Mel energies

32

69.2

ResNet43

Mix-up

HPSS

48

71.9

CNNs_Averaging844

\

HRTF, NNF

44.1

64.0

CDNN46

Mix-up

Single frequency cepstral coefficients (SFCC), log-Mel energies

48

70.4

3D-SEResNet

Mix-up

Mel Spectrogram

16

80.1

3D-SEResNet

Mix-up

Log-Mel Spectrogram

16

81.3

3D-CNN

Mix-up

STFT

16

83.5

Our Method

Mix-up

Mel Spectrogram, Chromagram, STFT

16

86.4