Table 1 Comparison of Lung Sound Classification Models.

From: Deep Learning-Driven Early Diagnosis of Respiratory Diseases using CNN-RNN Fusion on Lung Sound Data

Author

Model Type

Dataset Used

Disease(s) Targeted

Accuracy (%)

F1-Score

Remarks

Proposed Model

CNN + RNN (LSTM Fusion)

ICBHI, Coswara

Healthy, Pneumonia, Asthma, COPD, COVID-19

94.0

0.94

Multi-disease classification with data augmentation and weighted mapping

Kim et al. (2025) 25

Transformer

(AST Model)

Pediatric (2019–2020)

Wheezes, Other Respiratory Sounds

91.1

0.82

High accuracy in wheeze detection, uses Score-CAM for explainability, focused on pediatric data

Wang et al. (2025) 26

Transformer (REDT)

HF_Lung_V1

Inspiration, Expiration, Continuous/Discontinuous Adventitious Sounds

90.5% (Insp), 77.3% (Exp), 78.9% (Cont), 59.4% (Disc)

Not Mentioned

Event-based transformer for respiratory phase and adventitious sound detection, excels in detailed event localization

Luay et al. (2021) 14

Deep CNN Ensemble

ICBHI

Pediatric Asthma

93.1

0.92

Focused on asthma using ensemble CNN architecture

Demir et al. (2020) 22

RNN Ensemble

ICBHI

Respiratory Sound Classes

94.6

0.93

High temporal modeling, but limited disease annotation

Borwankar et al. (2022) 17

Multi-layer CNN

Custom Dataset

Respiratory Pathologies

91.7

0.91

Did not include COVID-19 or temporal modeling

Ijaz et al. (2022) 16

CNN

Custom Dataset

Abnormal Lung Sounds

94.2

0.93

No fusion with temporal analysis (RNN absent)

Goyal et al. (2024) 11

CNN-Based Classifier

 ~ 1500 recordings

Pneumonia

 > 90

0.90

Dataset focused on pneumonia only

Chen et al. (2018) 23

CNN

Lung Sound Corpus

Crackles and Wheezes

92.7

0.92

No multi-class classification

Pahar et al. (2022) 28

CNN + STFT features

R.A.L.E Dataset

Wheeze Detection

90.3

0.89

Limited to binary classification

  1. Metrics = τ(RNNmodel(FeaturesCNN), Mtest), where τ is a function for calculating the evaluation metrics.
  2. Significant values are in bold.