Table 4 Comparison of our proposed approach with state of the art algorithms (testing accuracy range), on different available datasets.
From: Gun identification from gunshot audios for secure public places using transformer learning
Model | Our dataset (%) | TREC (%) | Urban (%) |
---|---|---|---|
Resnet50 (Baseline: raw audio) | 76.0–78.0 | 71.0–73.0 | 70.0–73.4 |
Capsule network37 | 82.2–83.6 | 83.1–84.3 | 80.1–81.9 |
DNN ensemble40 | 83.0–84.5 | 82.2–83.4 | 84.0–85.0 |
Zero-shot federated learning41 | 83.5–86.0 | 82.9–24.8 | 83.0–85.0 |
Resnet50+MFCC+MelSpectogram | 84.0–87.0 | 83.0–84.5 | 82.9–85.0 |
(PA) VT+MFCC+MelSpectogram | 89.0–90.0 | 88.0–89.5 | 87.4–89.0 |