Introduction

Chronic Obstructive Pulmonary Disease (COPD) is estimated to become the third leading cause of mortality worldwide by the year 20301. The main concerns that this disease poses to the medical community are:

  a. it is not completely curable,

  b. it can only be managed throughout the lifetime of the patient, and

  c. it can worsen at any point in time.

It is a non-communicable disease whose prevalence has been increasing year after year because of the alarming rise in air pollutants and smoking habits. The disease is also considered to have peaked after the onset of the coronavirus pandemic, which left lasting disturbances on human lungs. A recent study reveals that around 24 million people in the United States are affected by this disease annually, and about half of them go undiagnosed and succumb to the deadly disorder2. Traditional diagnosis of COPD remains difficult even with the numerous advancements in medical science. The main reason is that the disease shares common symptoms with other respiratory disorders and can therefore be easily misdiagnosed3.

Hence there is a pressing need for an automated system that identifies and classifies the various stages of COPD depending upon severity. The economic burden that the disease places on its patients is also exceedingly high, estimated at around 6% of the European Union's annual financial budget. COPD is a heterogeneous disease that encompasses several smaller respiratory disorders. It is usually caused by continuous exposure of the airways and lungs to microparticles that are often toxic or allergenic4. The Global Initiative for Chronic Obstructive Lung Disease (GOLD) classifies it into four distinct stages: stage 1 (mild), stage 2 (moderate), stage 3 (severe), and stage 4 (very severe). As the disease progresses, symptoms such as breathlessness, cough, dyspnea, and exacerbations intensify5. Although many imaging modalities such as magnetic resonance imaging, ultrasound, and computed tomography are available, specialists believe that CT images of the affected lungs form the proper basis for identifying COPD. Other signals, such as electromyography signals and breathing sounds, can also be used to identify the disease. By identifying the disease in its earlier stages, we can prevent it from progressing further and also reduce the costs associated with treatment.

Research on COPD has proceeded in several directions:

  a. proper identification and classification of the grade of the disease,

  b. measures to control the progression of the disease from one stage to the next,

  c. biological markers that can indicate the risk of the disease prior to its onset,

  d. studies that aim to distinguish this disease from associated disorders that are similar in nature to COPD6, and

  e. the effect of active and passive smoking, and comparisons of patients living in rural and urban areas to analyze the aftermath of air pollution.

Artificial Intelligence (AI) and deep learning have been widely applied in the field of medical diagnosis and have been quite successful over the years7. Medical practitioners have stated that AI has greatly helped them in identifying and treating several heart and brain related diseases, so applying deep learning to pulmonary diseases should also prove fruitful. In order to keep the rising COPD curve in check, it is essential to develop a computerized and automated system that can aid medical professionals, especially pulmonologists, in the early identification of the disease. The main contributions of the proposed work are:

  • The proposed work assembles a hybrid dataset comprising CT lung images and a COPD patient dataset from the Kaggle website. The obtained images are augmented, bilateral filtered, and contrast enhanced using dynamic histogram equalization.

  • Segmentation is performed using the SuperCut algorithm, and binary feature fusion is conducted by concatenating UNet and AlexNet features.

  • Classification is done using KELM to categorize COPD into four grades. The results are further optimized using the multi-objective dragonfly optimization algorithm to improve performance.

  • Statistical analysis is conducted to evaluate the classification performance of the KELM model using accuracy, precision, recall, F1-Score, specificity, MCC, ROC and AUC values.

  • Comparative study with existing systems such as Logistic Regression, Random Forest, Recurrent Neural Network, Long Short-Term Memory, and Inception V3 is done to prove the enhanced performance of the proposed work.

The paper is organized as follows. A thorough review of similar works on COPD grade classification is presented in Sect. “Related works”. A detailed description of the proposed model is given in Sect. “Proposed system”. Experimentation and results are discussed in Sect. “Results and discussion”. Concluding remarks are given in Sect. “Conclusion”.

Related works

The work in literature8 describes the severity analysis of chronic obstructive pulmonary disease using multi-channel lung sounds trained with deep learning models. The authors used the RespiratoryDatabase@TR, which contains 12 different lung sounds and 4 heart sounds along with spirometry and chest X-rays of 41 patients. Abnormalities were detected using cuboid quantization and octant quantization coupled with the second-order difference chaos plot. Deep extreme learning machines with Hess and Lu autoencoder kernels were used to classify the five severities of the disease. The proposed system achieved an accuracy of 94.31%, a sensitivity of 94.28%, a specificity of 98.76%, and an area under the curve of about 0.9659.

The authors of9 and10 highlighted the use of an ensemble learning approach for the analysis of pulmonary diseases such as COVID-19, tuberculosis, and pneumonia from automated chest X-rays. In both articles, they demonstrate the efficiency of deep learning and machine learning models in identifying diseases in the medical domain. Data was initially augmented and then resized and converted to grayscale for further processing. Features were extracted using histogram of oriented gradients (HOG) and local binary pattern (LBP) techniques. VGG-16, DenseNet-201, and EfficientNet-B0 models were involved in the classification process, achieving an accuracy of 98%.

The research work carried out in11,12,13,14,15,16 describes the detection of lung nodules using convolutional neural networks, InceptionNet-V3, and weighted filters. The classification results are further optimized using the grey wolf optimization algorithm. The proposed system achieved an accuracy of 98.33%, a sensitivity of 100%, and a specificity of 93.33%. The LIDC-IDRI dataset is used. The images are converted into JPEG format and adaptive filtering is applied to remove any noise present. Features are then selected using grey wolf optimization, and the Inception-V3 classifier classifies the input images into three cases: normal, benign, or malignant.

Findings in17 present a machine learning based CT imaging model used to differentiate COPD from asthma. Patients suffering from both diseases were recruited for this analysis from Heidelberg University Hospital in Germany. 93 features related to both diseases, such as low attenuation clusters, estimated airway wall thickness, airway perimeter, and total airway count, were collected. The dataset included 48 COPD patients and 47 asthma patients. Pulmonary function tests evaluating forced expiratory volume and forced vital capacity were performed, and CT images of the lungs were collected. Machine learning algorithms were trained with hybrid features and implemented in Python 3.7.3 and Matlab R2019. The dataset was preprocessed using normalization and Z-score standardization. Training and testing data were split in the ratio 70:30. Bartlett's sphericity and Kaiser–Meyer–Olkin metrics were used to analyze the classification outputs, and particle swarm optimization was applied to the classified results for further improvement. The chosen classifier was the support vector machine, and features with eigenvalues greater than 1 and Pearson correlation coefficients greater than 0.9 were used for the analysis. The system produced an accuracy of 80%, a sensitivity of 87%, a specificity of 71%, and an F1-Score of 81%.

The authors of18 provide a comparative study of CT images for classifying the spirometric severity of COPD using deep learning techniques. 80 patients, for whom a post-bronchodilator pulmonary function test was taken, participated in the study. A total of 26,794 images were collected; they were preprocessed by resizing to 512*512 pixels and rotating from -25 to 25 degrees. Only images that satisfied the threshold value of 950 in the lung field were considered for further processing. During feature extraction, parameters such as age, sex, body mass index, smoking habits, FVC, FEV, reversibility, and diffusing capacity for carbon monoxide were extracted. Statistical analysis was conducted by building a confusion matrix, and two-class and four-class analyses were performed using the McNemar–Bowker test.

The authors of19 developed a system that automatically classifies CT images into different emphysema patterns using various deep learning methods. They conducted a retrospective cohort study involving 10,192 patients with a smoking history. 2407 were used for training the classifier, 100 for parameter tuning, and the remaining 7143 for testing. Features such as the FEV/FVC ratio, dyspnea score, walking ability, sex, race, height, weight, and smoking history were collected. Feature extraction was done with a convolutional neural network and classification with a long short-term memory network.

Paper20 presents a new multimodal system capable of identifying and classifying COPD and other pulmonary diseases such as tuberculosis, bronchitis, and upper and lower respiratory tract infections using CT scan images. Three techniques, namely a fusion model using machine learning and deep learning, a cough sample method based on lung sounds, and a CT scan based identification model, were employed in the study, giving accuracies of 97.50%, 95.30%, and 98%, respectively. From this, the authors concluded that diagnosis and classification of COPD using CT images is the most effective approach.

The research work in21 studies the identification and classification of COPD patients using the Rome severity model. The study involved 364 patients hospitalized at Maastricht University Medical Center, all of whom were diagnosed with COPD along with exacerbations. According to the reports, 14.3% of patients had grade 1 COPD, 56.0% had grade 2, and 29.7% had grade 3. The paper performs a post hoc analysis of previously published data using characteristics such as age, locality, gender, GOLD stage, residential information, Charlson index, and smoking status. The Shapiro–Wilk normality test, ANOVA, Fisher's exact test, and Tukey's test were performed on the dataset using IBM SPSS version 28.0 for both parametric and non-parametric data.

The authors of22 studied the overlap between diagnoses made by a machine learning algorithm and by pulmonologists. The study involves 400,000 pulmonology patients from five different countries, and 360 respiratory professionals were included in the review process. The study tested the diagnostic performance of the machine learning tool against that of practicing doctors. Factors considered for the case study were FEV, the FEV/FVC ratio, pack years of smoking, height-to-weight ratio, dyspnea, cough and wheezing, rhinitis status, therapies undergone, etc. The detection accuracy achieved by the AC/DC tool was 73% versus 61% for the pulmonologists, demonstrating the superior performance of the machine learning models.

The work in23 presents multi-omics classification of COPD using deep learning graph models. For identification of the disease, a graph convolutional neural network and protein–protein interaction (PPI) networks were used. The AhGlasso algorithm was applied on the STRING PPI database and the COPDGene dataset. Gene ontology analysis revealed that heparin signaling and carbohydrate derivative signaling were major factors in the classification of COPD.

Findings of24 show an anomaly-based quantitative evaluation of CT images for classifying and predicting the severity grades of COPD. The authors used four deep learning models: a recurrent neural network (RNN), multiple instance learning (MIL), MIL combined with RNN, and an attention-based MIL model. Data was acquired from the COPDGene dataset and the COSYCONET cohort study, including 446 patients. Demographics, spirometry, and smoking information were taken as the prime factors, and the prediction score was evaluated using linear mixed models.

Proposed system

The proposed KELM based COPD classification system utilizes a hybrid dataset consisting of CT images of the affected lungs and a COPD patient dataset from the Kaggle website. The database contains details of 101 patients across 24 disease-related factors such as Forced Expiratory Volume (FEV), Forced Vital Capacity (FVC), atrial fibrillation, ability to walk, smoking habits, diabetes, hypertension, pack history of smoking, COPD Assessment Test (CAT) score, HIV Associated Dementia (HAD), St. George's Respiratory Questionnaire (SGRQ), and Ischemic Heart Disease (IHD). In addition, 340 CT images were self-acquired from a reputed hospital and expanded to 560 through augmentation. While each dataset is sufficient on its own for classifying the various grades of COPD, combining them improves the classification performance considerably. Since the acquired images are not sufficient for executing the proposed system, they are augmented using rotation, horizontal flipping, vertical flipping, scaling, and translation. The augmented images are then preprocessed using a bilateral filter and contrast enhanced using dynamic histogram equalization, while the patient database is normalized using outlier rejection and redundancy removal. The filtered images are then segmented using the SuperCut algorithm, and binary feature fusion is performed using UNet and AlexNet: once the features are extracted by both networks, they are concatenated to form the feature vector. Classification is conducted with a kernel extreme learning machine classifier, and the results are further optimized using the dragonfly optimization algorithm to maximize the classification accuracy of the proposed system. Performance metrics such as accuracy, precision, recall, F1 score, Matthew's Correlation Coefficient (MCC), Receiver Operating Characteristic (ROC) curve, and Area Under the Curve (AUC) are calculated and compared with existing systems: Logistic Regression, Random Forest, Recurrent Neural Network, Long Short-Term Memory, and InceptionV3. In practice, the proposed system can serve as an aid to medical professionals in identifying and grading COPD. Classifying the specific grade of the disease supports choosing the right mode of treatment for the patient and also reduces the cost and time involved. Figure 1 shows the workflow of the proposed model.

Fig. 1
figure 1

Workflow of proposed dragonfly optimized KELM model.

Data acquisition

The proposed system makes use of a hybrid dataset that combines the COPD patient dataset with CT images of affected lungs. The patient dataset was obtained from the Kaggle website (https://www.kaggle.com/datasets/prakharrathi25/copd-student-dataset) and contains the disease details of 101 COPD patients. It includes 24 factors related to the disease and other health parameters of the patient, such as FVC, FEV, atrial fibrillation, ability to walk, smoking habits, diabetes, hypertension, pack history of smoking, CAT score, HAD, SGRQ, and IHD values. In addition, 340 CT images were acquired for training and evaluating the proposed KELM classifier. The images were divided in the ratio 80:20 to train and evaluate the classification performance of the chosen classifier.

Data augmentation

Since the acquired CT images are few in number and not sufficient to proceed further, the input images are augmented using rotation, horizontal flipping, vertical flipping, scaling, and translation. These augmentation techniques produce generalized samples that are useful for the classification process, are simple to apply, and help provide robust performance.

In this way, 220 new images were produced, bringing the total to 560 CT images. The data augmentation operations are defined in Eqs. (1)–(6) below.

$$A_{n} \left( {a,b} \right) = \frac{{A\left( {a,b} \right) - {\text{min}}\left( A \right)}}{{\max \left( A \right) - {\text{min}}\left( A \right)}}$$
(1)

where A is the selected image, min(A) and max(A) are its minimum and maximum pixel values, and \(A_{n}\) represents the normalized image.

$$A_{r} \left( {a,b} \right) = A_{n} \left( {a \cos \theta - b \sin \theta , a \sin \theta + b \cos \theta } \right)$$
(2)
$$A_{hf} \left( {a,b} \right) = A_{n} \left( {a, Q - b - 1} \right)$$
(3)
$$A_{vf} \left( {a,b} \right) = A_{n} \left( {P - a - 1,b} \right)$$
(4)
$$A_{s} \left( {a,b} \right) = A _{n} \left( {\frac{a}{s},\frac{b}{s}} \right)$$
(5)
$$A_{t} \left( {a,b} \right) = A_{n} \left( {a + \Delta a, b + \Delta b} \right)$$
(6)

where \(A_{r} \left( {a,b} \right)\) represents rotation, \(A_{hf} \left( {a,b} \right)\) horizontal flipping, \(A_{vf} \left( {a,b} \right)\) vertical flipping, \(A_{s} \left( {a,b} \right)\) scaling, and \(A_{t} \left( {a,b} \right)\) translation; Q and P denote the image width and height, s is the scaling factor, and \(\Delta a\), \(\Delta b\) are the translation offsets.
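A minimal NumPy/SciPy sketch of the augmentation operations in Eqs. (1)–(6) is given below; the function names, parameter values, and the random test image are illustrative and not taken from the original implementation.

```python
# Sketch of the augmentation operations in Eqs. (1)-(6); names and values are illustrative.
import numpy as np
from scipy import ndimage

def normalize(img):                       # Eq. (1): min-max normalization
    return (img - img.min()) / (img.max() - img.min() + 1e-8)

def rotate(img, angle_deg):               # Eq. (2): rotation about the image centre
    return ndimage.rotate(img, angle_deg, reshape=False, mode="nearest")

def flip_horizontal(img):                 # Eq. (3): A_hf(a, b) = A_n(a, Q - b - 1)
    return img[:, ::-1]

def flip_vertical(img):                   # Eq. (4): A_vf(a, b) = A_n(P - a - 1, b)
    return img[::-1, :]

def scale(img, s):                        # Eq. (5): zoom by factor s, crop/pad back to original size
    zoomed = ndimage.zoom(img, s, mode="nearest")
    out = np.zeros_like(img)
    h = min(img.shape[0], zoomed.shape[0])
    w = min(img.shape[1], zoomed.shape[1])
    out[:h, :w] = zoomed[:h, :w]
    return out

def translate(img, da, db):               # Eq. (6): shift by (da, db) pixels
    return ndimage.shift(img, (da, db), mode="nearest")

# Example: produce the five augmented variants of one normalized CT slice.
ct = normalize(np.random.rand(512, 512))  # stand-in for a loaded CT image
augmented = [rotate(ct, 30), flip_horizontal(ct), flip_vertical(ct),
             scale(ct, 1.25), translate(ct, 10, -10)]
```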

Preprocessing

As there are two types of input data, they are preprocessed separately. The COPD patient dataset is normalized using outlier rejection and redundancy removal so that repeated values and unwanted data are eliminated, and any missing data is filled in using the mean of the corresponding attribute. Once the patient dataset is normalized, the CT images are preprocessed using a bilateral filter and contrast enhanced using dynamic histogram equalization.
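A hedged pandas sketch of the patient-table normalization step follows; the file name, column handling, and z-score threshold are assumptions made for illustration rather than the authors' exact procedure.

```python
# Sketch of the patient-table normalization: duplicate removal, z-score based outlier
# rejection, mean imputation (averaging method), and min-max scaling.
# The file name and threshold are placeholders, not the actual Kaggle settings.
import pandas as pd
import numpy as np

def preprocess_patient_table(df, z_thresh=3.0):
    df = df.drop_duplicates()                       # redundancy removal
    numeric = df.select_dtypes(include=[np.number]).columns
    # Outlier rejection: mask values more than z_thresh standard deviations from the mean.
    z = (df[numeric] - df[numeric].mean()) / df[numeric].std(ddof=0)
    df[numeric] = df[numeric].mask(z.abs() > z_thresh)
    # Missing data filled with the column mean.
    df[numeric] = df[numeric].fillna(df[numeric].mean())
    # Min-max normalization to [0, 1].
    df[numeric] = (df[numeric] - df[numeric].min()) / (df[numeric].max() - df[numeric].min())
    return df

patients = preprocess_patient_table(pd.read_csv("copd_patient_data.csv"))  # hypothetical file name
```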

Bilateral filter

The main problem with linear filters is that, while removing noise, they also tend to smooth away the edges surrounding that noise25, which leads to a loss of vital information. To overcome this issue, a non-linear filter was designed that removes noise without blurring nearby edges. This filter came to be known as the bilateral filter, which performs noise removal while preserving sharp edges. It achieves this because it considers radiometric variations of the image pixels, such as color intensity and depth, in addition to the Euclidean distance between them. The proposed system uses bilateral filtering because it has several advantages over other filtering techniques: it is easy to use and has simple parameters. It is non-iterative, which makes it computationally fast, and its edge preservation and noise reduction properties are very effective.

It has three components: a normalization factor, a space weight, and a range weight. In addition to considering the extent and size of the neighboring pixels, it also accounts for the amplitude differences across sharp edges. This ensures that pixels close in intensity to the central value are blurred, while pixels whose intensity does not match the central one are treated as edges and preserved. The working of the bilateral filter is given below in Eqs. (7)–(11).

$$bl_{f} \left[ i \right] = \frac{1}{{O_{P} }} \mathop \sum \limits_{h \in S} S_{{\sigma_{s} }} \left( {\left\| {g - h} \right\|} \right)R_{{\sigma_{r} }} \left( {\left| {i_{g} - i_{h} } \right|} \right) i_{h}$$
(7)
$$E\left( i \right) = k_{i}^{ - 1} \mathop \sum \limits_{{a \in R_{b} }} B_{r} \left( {C_{ab} } \right) \times W_{s} \left( {D_{ab} } \right) \times F\left( i \right)$$
(8)
$$k_{i} = \mathop \sum \limits_{{a \in R_{b} }} B_{r} \left( {C_{ab} } \right) \times W_{s} \left( {D_{ab} } \right)$$
(9)
$$B_{r} \left( {C_{ab} } \right) = exp\left( { - \frac{{C_{ab}^{2} }}{{2\sigma_{r}^{2} }}} \right)$$
(10)
$$W_{s} \left( {D_{ab} } \right) = exp\left( { - \frac{{D_{ab}^{2} }}{{2\sigma_{s}^{2} }}} \right)$$
(11)

where \(O_{P}\) is the normalization factor, g and h are pixel coordinates, \(S_{{\sigma_{s} }}\) is the space parameter and \(R_{{\sigma_{r} }}\) the range parameter, \(E\left( i \right)\) is the filtered output, \(k_{i}\) is the normalization term, \(B_{r}\) and \(W_{s}\) denote the kernel weights, and \(C_{ab}\) and \(D_{ab}\) are Euclidean distances.
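In practice, this filtering step can be reproduced with OpenCV's built-in bilateral filter; the sketch below uses illustrative values for the neighbourhood diameter and the space and range sigmas rather than the tuned parameters of the proposed system.

```python
# Sketch of the bilateral filtering step with OpenCV; parameter values are illustrative.
import cv2

ct_slice = cv2.imread("ct_slice.png", cv2.IMREAD_GRAYSCALE)   # hypothetical file name
filtered = cv2.bilateralFilter(
    ct_slice,
    d=9,              # neighbourhood diameter
    sigmaColor=75,    # range parameter sigma_r: how dissimilar intensities are mixed
    sigmaSpace=75,    # space parameter sigma_s: how far pixels influence each other
)
```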

Dynamic histogram equalization (DHE)

Dynamic histogram equalization was chosen because it has the ability to elevate the contrast and quality of the image to a great extent without adding much to the computational complexity of the process. It can also shed light on minute details of the image thereby leading to better classification performance.

It is an iterative algorithm that partitions the histogram of the image under consideration into numerous sub-histograms26. The equalization process stops when all the partitioned sub-histograms share the same level of contrast. First, the histogram is computed and subdivided at its local minimum points; partitioning continues until no dominating regions remain. A gray-level range is then allocated to each of the resulting sub-histograms, based on the cumulative distribution function and the gray range available in the output image. The span and range are calculated using Eqs. (12) and (13) below.

$$gray_{s} = lm_{s} - lm_{s - 1}$$
(12)
$$gray_{r} = \frac{{gray_{s} }}{{\sum gray_{s} }}*L - 1$$
(13)

where \(gray_{s}\) is the gray-level span, \(gray_{r}\) is the allocated gray range, \(lm_{s}\) is the s-th local minimum, and L is the end limit of the gray scale.

This division of the contrast range prevents large components from overpowering small ones, ensuring that every image detail is maintained. Lastly, the sub-histograms are equalized separately using a transformation function that enforces the allocated pixel ranges, producing an image whose contrast is enhanced in a dynamic manner. The method can be applied to images that require both local and global contrast enhancement and ensures that no information is lost or hidden during the process. This type of histogram equalization is better than others because it does not introduce any checkerboard effect and does not change the clarity or appearance of the image.
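A simplified sketch of this procedure is shown below: the histogram is split at its local minima, each sub-histogram receives an output range proportional to its span (Eqs. 12–13), and each segment is equalized independently. The smoothing window and the exact splitting rule are assumptions made for illustration, not the paper's exact algorithm.

```python
# Simplified dynamic histogram equalization: split the histogram at local minima and
# equalize each segment within a proportionally allocated gray range (Eqs. 12-13).
import numpy as np

def dhe(img, L=256):
    hist, _ = np.histogram(img, bins=L, range=(0, L))
    smooth = np.convolve(hist, np.ones(5) / 5, mode="same")        # reduce spurious minima
    minima = [i for i in range(1, L - 1)
              if smooth[i] <= smooth[i - 1] and smooth[i] <= smooth[i + 1] and hist[i] > 0]
    cuts = [0] + minima + [L - 1]

    spans = np.diff(cuts)                                           # Eq. (12): gray-level spans
    ranges = spans / spans.sum() * (L - 1)                          # Eq. (13): allocated output ranges

    out = np.zeros_like(img, dtype=np.float64)
    start = 0.0
    for (lo, hi), rng in zip(zip(cuts[:-1], cuts[1:]), ranges):
        mask = (img >= lo) & (img <= hi) if hi == L - 1 else (img >= lo) & (img < hi)
        cdf = np.cumsum(hist[lo:hi + 1].astype(np.float64))
        if cdf[-1] > 0:
            cdf = cdf / cdf[-1]
            # Map each pixel of the segment into its allocated output range.
            out[mask] = start + cdf[np.clip(img[mask] - lo, 0, hi - lo)] * rng
        start += rng
    return out.astype(np.uint8)
```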

Super cut based segmentation

The SuperCut algorithm integrates the SuperPoint and CUT algorithms and was designed to combine the efficiency and advantages of both techniques. It is unsupervised in nature and works on the principle of Bayes decision theory27. It belongs to the category of interactive image segmentation algorithms, which makes it powerful and popular. SuperCut based segmentation was preferred over other commonly used segmentation approaches because of its user-interactive nature and its ability to produce improved segmented images. It also avoids the problem of getting stuck in local minima that affects its predecessors GrowCut and GrabCut. The images are pre-segmented into superpixels with accurate boundaries based on Bayes theory; for pre-segmentation, a mean-shift algorithm that is confident about the boundaries is used.

The segmented superpixels are grouped into clusters based on Bayes decision theory. A Gaussian mixture model is then applied, combined with an interactive rectangular box, to assign the boundaries of the objects segmented so far. This yields more accurate edges and resolves unclear boundaries. Secondary clustering of pixels is performed again with the help of the chosen rectangular box. An energy function, given in Eq. (14), is also used for this purpose.

$${\text{E}}\left( {{\text{ec}}} \right) \, = \sum E_{1} \left( {ec_{i} } \right) + \lambda \sum E_{2} \left( {ec_{i} , ec_{j} } \right)$$
(14)

where E(ec) is the energy of the labeling, and i and j are adjacent pixels. Pixels are classified as foreground or background based on their position relative to the bounding box: pixels lying outside the box are considered background and vice versa. Equations (15)–(17) give the posterior probabilities of the probable background and foreground classes.

$${\text{P }}[{\text{bcg}}|{\text{s }}\left( {{\text{p}},{\text{ q}}} \right)]{ } = \frac{{{\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right){\text{|bcg}}} \right]P\left( {bcg} \right)}}{{{\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right)} \right]}}$$
(15)
$${\text{P}}[{\text{fg}}|{\text{ s}}\left( {{\text{p}},{\text{q}}} \right)]{ } = \frac{{{\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right){\text{|fg}}} \right]P\left( {fg} \right)}}{{{\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right)} \right]}}$$
(16)

where bcg is the background, \({\text{s}}\left( {{\text{p}},{\text{q}}} \right)\) is the set of features and fg is the foreground.

$${ } {\text{P}}[{\text{bcg}}|{\text{s}}\left( {{\text{p}},{\text{q}}} \right)\left] { \, > {\text{ P}}} \right[{\text{fg}}|{\text{ s}}\left( {{\text{p}},{\text{q}}} \right)]$$
(17)

The evidence term P[s(p,q)] is expanded by the law of total probability in Eq. (18).

$${\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right)} \right] = {\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right){\text{|bcg}}} \right]P\left( {bcg} \right) + {\text{P}}\left[ {{\text{s}}\left( {{\text{p}},{\text{q}}} \right){\text{|fg}}} \right]P\left( {fg} \right)$$
(18)
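The sketch below illustrates the same idea with off-the-shelf components: SLIC superpixels stand in for the SuperPoint/mean-shift pre-segmentation, and each superpixel is labeled by the Bayes decision of Eqs. (15)–(17) relative to a user-drawn rectangular box. This is an illustrative analog, not the authors' SuperCut implementation.

```python
# Illustrative SuperCut-like pipeline: superpixel pre-segmentation + Bayes decision
# (Eqs. 15-17) with respect to an interactive rectangular box.
import numpy as np
from skimage.segmentation import slic
from scipy.stats import norm

def supercut_like(img, box, n_segments=400):
    r0, c0, r1, c1 = box                                  # rectangle assumed to contain the lungs
    labels = slic(img, n_segments=n_segments, channel_axis=None, start_label=0)

    inside = np.zeros(img.shape, dtype=bool)
    inside[r0:r1, c0:c1] = True

    # Simple Gaussian intensity models for foreground (inside box) and background.
    fg_mu, fg_sd = img[inside].mean(), img[inside].std() + 1e-6
    bg_mu, bg_sd = img[~inside].mean(), img[~inside].std() + 1e-6
    p_fg = inside.mean()                                  # prior P(fg)
    p_bg = 1.0 - p_fg                                     # prior P(bcg)

    mask = np.zeros(img.shape, dtype=bool)
    for lab in np.unique(labels):
        s = img[labels == lab].mean()                     # feature s(p, q): mean superpixel intensity
        like_fg = norm.pdf(s, fg_mu, fg_sd) * p_fg        # numerator of Eq. (16)
        like_bg = norm.pdf(s, bg_mu, bg_sd) * p_bg        # numerator of Eq. (15)
        mask[labels == lab] = like_fg >= like_bg          # Bayes decision, cf. Eq. (17)
    return mask
```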

Binary feature fusion

Feature extraction is conducted using two different deep learning architectures, UNet and AlexNet. The main reason for choosing UNet and AlexNet as feature extractors is the efficiency that both networks have shown in segmentation and image recognition tasks. The outputs of the two networks are fused to form the desired feature vector.

UNet

UNet is a contemporary, fully convolutional CNN model. The specific advantage of the UNet architecture is its fine-grained object boundary identification, which helps greatly in further processing of the image. It is also fast and handles large images well. It gains its name from its U-shaped appearance, which arises from its descending and ascending networks that take the roles of an encoder and a decoder28. There are four encoder blocks and four decoder blocks, connected by a bottleneck layer and tied together by skip connections. In total there are nine blocks that first contract and then expand, enabling superior feature extraction. The encoder reduces the spatial information and augments the feature information within the image using max pooling layers, rectified linear unit activations, and several 3*3 convolutions.

The downsampled image is then upsampled by the decoder, which performs 2*2 transposed convolutions and a single 1*1 convolution at the end to produce the required number of feature channels. The large number of feature channels is considered a reason behind the success of the U-Net architecture. The expanding network combines the spatial and feature information produced by the contracting path through a series of up-convolutions. Information is passed between the encoding and decoding paths using skip connections located between the convolution layers, facilitating the flow of information back and forth. The two networks are similar to each other in many respects. Figure 2 shows the structure of UNet.

Fig. 2
figure 2

Structure of UNet.

The energy function of the feature channels is given by the Eqs. (19) and (20).

$$F = \mathop \sum \limits_{t = 1}^{N} g\left( t \right)\log \left( {s_{l\left( t \right)} \left( t \right)} \right)$$
(19)
$$s_{l} \left( t \right) = exp\left( {a_{l} \left( t \right)} \right)/\mathop \sum \limits_{{l^{\prime} = 1}}^{L} exp\left( {a_{{l^{\prime}}} \left( t \right)} \right)$$
(20)

where \(F\) is the energy function, \(t\) indexes the extracted feature positions, \(g\) represents the weight, \(s_{l}\) denotes the SoftMax function, l is the channel, and \(a_{l}\) is the activation of channel l.
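A compact Keras sketch of a U-Net style encoder-decoder with skip connections is given below. The depth and filter counts are reduced for illustration, and the pooled bottleneck is exposed as the feature vector later used for fusion; these choices are assumptions, not the exact configuration of the paper.

```python
# Compact U-Net style encoder-decoder; the pooled bottleneck is used as a feature vector.
from tensorflow.keras import layers, Model

def conv_block(x, filters):
    x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    return layers.Conv2D(filters, 3, padding="same", activation="relu")(x)

def build_unet(input_shape=(256, 256, 1)):
    inp = layers.Input(input_shape)
    skips, x = [], inp
    for f in (32, 64, 128, 256):                                 # contracting path (encoder)
        x = conv_block(x, f)
        skips.append(x)
        x = layers.MaxPooling2D(2)(x)
    x = conv_block(x, 512)                                       # bottleneck
    bottleneck = x
    for f, skip in zip((256, 128, 64, 32), reversed(skips)):     # expanding path (decoder)
        x = layers.Conv2DTranspose(f, 2, strides=2, padding="same")(x)
        x = layers.Concatenate()([x, skip])                      # skip connection
        x = conv_block(x, f)
    out = layers.Conv2D(1, 1, activation="sigmoid")(x)           # segmentation map
    return Model(inp, out), Model(inp, layers.GlobalAveragePooling2D()(bottleneck))

unet, unet_feature_extractor = build_unet()   # the second model yields a 512-d feature vector
```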

AlexNet

AlexNet is a modern type of convolutional neural network that won the ImageNet visual recognition challenge, whose dataset contains almost 14 million images29. It is a phenomenally successful model that has been employed for different image processing tasks such as segmentation, feature extraction, pattern recognition, and image classification. The major advantage of AlexNet is its capacity to extract precise and complicated features from the underlying image through the deep layers of its architecture. The architecture contains eight learned layers comprising convolutions and fully connected layers, together with max pooling and ReLU functions; dropout and response normalization layers are also present. It is fast in computation and takes inputs of size 227*227*3. The first convolution layer contains 96 filters of size 11*11, and the output produced is of size 55*55*96. Max pooling layers generally use a 3*3 window. Mathematical expressions for the AlexNet model are given in Eqs. (21), (22) and (23).

$$f_{i} = fm*k_{i} + b_{i}$$
(21)

where fm is the feature map, b is the bias, and k is the convolutional kernel. The activation function is represented as

$$AF_{i,j} = a\left( {\mathop \sum \limits_{c = 0}^{D - 1} \mathop \sum \limits_{e = 0}^{s - 1} \mathop \sum \limits_{g = 0}^{s - 1} w_{c,e,g} fm_{i + c,j + e,g} + b} \right)$$
(22)

where \(i,j\) is the location in the feature map. The pooling operation in AlexNet is represented as

$$p_{i,j,k} = L_{{\left( {c,e} \right) \in R_{i,j} }} \left( {fm_{c,e,g} } \right)$$
(23)

where R stands for the pooling region and \(L\) indicates the pooling function. The second layer has more filters, 256 in total, but with a reduced size of 5*5.

The ReLU activation function is used here, and padding is applied to obtain an output of 27*27*256. The next convolution layer uses 384 filters of size 3*3, producing a feature map of 13*13*384. The fourth layer is similar to the third but with additional padding30. The final convolution layer contains 256 filters of size 3*3. The number of filters increases from 96 to 384 to capture richer features, and the filter sizes are correspondingly reduced for the same purpose. The next layers are fully connected, with 4096 and 1000 neurons and a dropout factor of 0.5. Finally, a softmax classification function is added.
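The sketch below shows the binary feature fusion idea: an AlexNet-style extractor (layer sizes follow the description above) and concatenation of its feature vector with the U-Net bottleneck vector from the previous sketch. The fusion details in the paper may differ; this only illustrates the concatenation step.

```python
# AlexNet-style feature extractor and concatenation with the U-Net feature vector.
from tensorflow.keras import layers, Model
import numpy as np

def build_alexnet_extractor(input_shape=(227, 227, 3)):
    inp = layers.Input(input_shape)
    x = layers.Conv2D(96, 11, strides=4, activation="relu")(inp)        # -> 55x55x96
    x = layers.MaxPooling2D(3, strides=2)(x)
    x = layers.Conv2D(256, 5, padding="same", activation="relu")(x)     # -> 27x27x256
    x = layers.MaxPooling2D(3, strides=2)(x)
    x = layers.Conv2D(384, 3, padding="same", activation="relu")(x)     # -> 13x13x384
    x = layers.Conv2D(384, 3, padding="same", activation="relu")(x)
    x = layers.Conv2D(256, 3, padding="same", activation="relu")(x)
    x = layers.MaxPooling2D(3, strides=2)(x)
    x = layers.Flatten()(x)
    x = layers.Dense(4096, activation="relu")(x)
    x = layers.Dropout(0.5)(x)
    return Model(inp, x)                                                # 4096-d feature vector

alexnet_extractor = build_alexnet_extractor()

# Fusion: extract features from both backbones and concatenate them per image.
# `unet_feature_extractor` is the model from the previous U-Net sketch.
def fuse_features(ct_batch_unet, ct_batch_alexnet):
    f_unet = unet_feature_extractor.predict(ct_batch_unet)      # (n, 512)
    f_alex = alexnet_extractor.predict(ct_batch_alexnet)        # (n, 4096)
    return np.concatenate([f_unet, f_alex], axis=1)             # fused feature vector (n, 4608)
```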

KELM based classification

The Kernel Extreme Learning Machine (KELM) is a variant of the Extreme Learning Machine (ELM), a feed-forward network with three layers: an input layer, a hidden layer, and an output layer31. Among the many variations of ELM, the kernel extreme learning machine uses a kernel to map inputs to outputs. These machines are considered far better than conventional artificial neural networks and tend to produce better results and classification performance. KELM has several benefits over traditional classification methods such as the Support Vector Machine (SVM): SVM does not handle large datasets well and suffers from outlier interference, while the random forest model requires considerably more memory and sometimes extensive time for classification. KELM, on the other hand, is computationally inexpensive, fast in terms of classification, and easy to implement without requiring additional human expertise.

The regular algorithms belonging to the family of artificial neural networks suffer from many problems because they rely on back-propagation and gradient descent, which often causes them to get stuck in local minima. Because of its superior classification results and fast learning, ELM has a wide range of applications in different fields of research32. It can be used for classification, regression, clustering, feature learning and mapping, approximation, and prediction. Researchers have reported that extreme learning machine algorithms reach the global optimum faster and do not get stuck in local minima like traditional neural networks.

In a basic ELM, data from the input layer is randomly mapped to the hidden layer, which causes slight variations in the accuracies produced at the output layer in each run. To overcome this problem, KELM was introduced, which maps the input to the hidden layer using an orthogonal projection-based kernel calculation. The number of hidden layers can be one or many depending upon the architecture chosen, and the nodes in the hidden layers are the actual computational engines that perform the task under consideration. Propagation of data from the hidden layer to the output layer uses a nonlinear activation function, most commonly the rectified linear unit, given in Eq. (24).

$${\text{f}}\left( {\text{W}} \right){ } = {\text{ max}}\left\{ {0,{\text{W}}} \right\}{ } = { }\left\{ {\begin{array}{*{20}c} { 0 , if W_{i,j} < 0 } \\ {W_{i,j} otherwise } \\ \end{array} } \right.$$
(24)

The working of the kernel extreme learning machine has two basic steps33. The first is random initialization of parameters; the second is arriving at an analytical solution using activation functions such as sigmoid, hyperbolic tangent, radial basis, and quadratic functions, together with the Moore–Penrose generalized inverse. Classification is then done by mapping the input to the output class with the maximum similarity. The output function is given by Eq. (25),

$$\mathop \sum \limits_{i = 1}^{H} k_{i} \left( x \right)\beta_{i} = b_{i}$$
(25)

where \(\left( {a_{i} ,b_{i} } \right)\) is the training data and \(H\) is the number of nodes in the hidden layer.

The above equation \(k_{i} \left( x \right)\beta = b_{i}\) can be rewritten in matrix form as \(HL\beta = S\), where \(\beta\) is the output weight vector and HL is the hidden layer output matrix defined in Eq. (26).

$$HL = \left[ {\begin{array}{*{20}c} {k\left( {a_{1} } \right)} \\ \vdots \\ {k\left( {a_{n} } \right)} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {k_{1} \left( {a_{1} } \right)} & \cdots & {k_{H} \left( {a_{1} } \right)} \\ \vdots & \ddots & \vdots \\ {k_{1} \left( {a_{n} } \right)} & \cdots & {k_{H} \left( {a_{n} } \right)} \\ \end{array} } \right]$$
(26)

The least-squares solution given by the ELM classifier can be calculated as

$$\tilde{\beta } = HL^{ + } S$$
(27)

where \(HL^{ + }\) stands for the Moore–Penrose generalized inverse of HL. Similarly, the objective function is given in Eqs. (28)–(30).

$$minimize:\left\| {HL\beta - S} \right\|$$
(28)
$$minimize:H_{{P_{ELM} }} = \frac{1}{2}\left\| \beta \right\|^{2} + D\frac{1}{2}\mathop \sum \limits_{i = 1}^{N} \xi_{i}^{2}$$
(29)
$$h\left( {x_{i} } \right)\beta = t_{i}^{T} - \xi_{i}^{T} , i = 1,2, \ldots , N$$
(30)

The final classification mapping of input to output is done by the following Eqs. (31) and (32).

$$\hat{U} = V_{o } W = V_{o } f\left( {V_{i } Z} \right)$$
(31)
$$\hat{u} = \arg {\text{ max}}\hat{U}$$
(32)

where \(Z\) is the input matrix, \(\hat{U}\) is the predicted one-hot label representation of the input, \(V_{i}\) and \(V_{o}\) are the input-to-hidden and hidden-to-output weight matrices, \(f\) is the activation function, and \(W = f\left( {V_{i} Z} \right)\) is the hidden-layer output. The error of the complete process can be formulated as in (33).

$$error = \mathop \sum \limits_{a = 1}^{m} ||g_{a} - o_{a} ||^{2}$$
(33)

where \(g_{a}\) is the ground truth and \(o_{a}\) is the output for sample a.
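A minimal NumPy sketch of a kernel ELM with an RBF kernel is given below; it follows the standard regularized KELM closed-form solution β = (K + I/C)⁻¹T, and the kernel width and regularization constant are illustrative rather than the tuned values of the proposed system.

```python
# Minimal kernel extreme learning machine with an RBF kernel (standard KELM formulation).
import numpy as np

class KELM:
    def __init__(self, C=1.0, gamma=0.1):
        self.C, self.gamma = C, gamma

    def _kernel(self, A, B):
        # RBF kernel matrix K[i, j] = exp(-gamma * ||A_i - B_j||^2)
        d = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
        return np.exp(-self.gamma * d)

    def fit(self, X, y, n_classes):
        self.X = X
        T = np.eye(n_classes)[y]                                  # one-hot targets
        K = self._kernel(X, X)
        self.beta = np.linalg.solve(K + np.eye(len(X)) / self.C, T)
        return self

    def predict(self, X_new):
        scores = self._kernel(X_new, self.X) @ self.beta          # cf. the output mapping of Eqs. (31)-(32)
        return scores.argmax(axis=1)                              # class with maximum similarity

# Example on the fused feature vectors (grades encoded as integers 0-3):
# clf = KELM(C=10, gamma=0.01).fit(train_features, train_grades, n_classes=4)
# pred = clf.predict(test_features)
```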

Dragonfly optimization algorithm

This meta-heuristic, multi-objective optimization algorithm is inspired by the swarming behavior of dragonflies. The proposed work utilizes the dragonfly optimization technique in the classification of the COPD dataset. Its biggest benefits are adaptability and ease of implementation; it is a widely used optimizer for various real-world problems and can search for solutions globally. While genetic algorithms suffer from a variety of disadvantages, such as getting stuck in local minima, additional computational cost, and inefficiency in handling multifaceted datasets, the dragonfly optimizer alleviates these shortcomings. Dragonflies are small predators and follow two swarming patterns, called the feeding swarm and the migratory swarm34. The former, also known as static swarming, consists of feeding on local prey using abrupt movements and rapid changes of flight path. The second type, called dynamic or migratory swarming, comprises a large population of dragonflies flying over long distances in one direction.

Researchers have taken inspiration from these behaviors and constructed an optimization algorithm that can be binary or multi-objective in nature and can be used to solve both discrete and continuous problems. Because of its ease of computation and its ability to attain a global optimum with minimal effort, this optimization algorithm has been used frequently over the past few years. The two swarming patterns of the dragonfly correspond to the exploration and exploitation characteristics of optimization. The algorithm is governed by five factors: separation, alignment, cohesion, attraction, and repulsion. Separation refers to avoiding conflicts with neighbors; alignment is the process of all individuals in the group maintaining the same velocity; cohesion refers to moving towards the center of the population. The other two principles are attraction towards the food source and repulsion from the predator, with reference to the location of the dragonflies35. The mathematical formulations for all five principles are given in Eqs. (34)–(38).

$$S_{a} = \mathop \sum \limits_{b = 1}^{NB} \left( {y - y_{b} } \right)$$
(34)

\(S_{a}\) stands for the separation factor, y is the current position, and NB is the number of neighbors.

$$A_{a} = \frac{{\mathop \sum \nolimits_{b = 1}^{NB} L_{b} }}{NB}$$
(35)

\(A_{a}\) is the alignment factor, L is the velocity of the bth neighbor.

$$C_{a} = \left\{ {\frac{{\mathop \sum \nolimits_{b = 1}^{NB} y_{b} }}{NB} - y} \right\}$$
(36)

\(C_{a}\) stands for cohesion here.

$$att_{a} = \left\{ {y^{ + } - y} \right\}$$
(37)

\(att_{a}\) is the attraction and \(y^{ + }\) is the location of the food.

$$rep_{a} = \{ y^{ - } + y\}$$
(38)

\(rep_{a}\) is the repulsion factor and \(y^{ - }\) is the location of the predator.

Figure 3 shows the illustration of these factors of dragonfly.

Fig. 3
figure 3

Dragonfly algorithm.

Two vectors, called the step vector and the position vector, are updated iteratively depending upon the current location of the dragonflies, as shown in Eqs. (39) and (40). During the exploration phase, which reflects the static swarming behavior of dragonflies, weights are assigned such that cohesion has low values and alignment has high values. Conversely, during the exploitation phase, which corresponds to the dynamic swarming pattern, weights are assigned so that alignment has a low value and cohesion a high value36. Food and enemy are also crucial factors in this algorithm and are denoted by F and E. The neighbors also play a key role, and therefore the neighborhood radius (NR) around each dragonfly is considered. The two phases of exploration and exploitation must be balanced by tuning the swarming factors across the iterations of the algorithm; as the optimization progresses, the dragonflies change their weights, flight paths, and neighborhood radii accordingly.

$$\Delta y_{t + 1} = \left( {w_{1} S_{a} + w_{2} A_{a} + w_{3} C_{a} + w_{4} att_{a} + w_{5} rep_{a} } \right) + ia\Delta y_{t}$$
(39)

where \(w_{1} , w_{2} , w_{3} , w_{4} , w_{5}\) are the weights of the corresponding factors, ia is the inertia weight, and t is the iteration number.

$$y_{t + 1} = \left( {y_{t} + \Delta y_{t + 1} } \right)$$
(40)

This updating of the swarm factors converges at some point, arriving at the final stage of optimization where the best solutions are found37. To guarantee the random behavior of the dragonflies, the algorithm uses an additional mechanism called the Lévy flight, a random walk performed when a dragonfly has no neighboring solutions. Equation (41) is used during the Lévy flight phase to update the position vector of the dragonfly.

$$y_{t + 1} = \left( {y_{t} + y_{t} .Levy\left( d \right)} \right)$$
(41)

where the Lévy term is defined as in Eq. (42).

$${\text{Levy}}\left( {\text{d}} \right) = 0.01{ }\frac{{e_{1} .\sigma }}{{\left| {e_{2} } \right|^{\beta } }}$$
(42)

where \(e_{1}\) and \(e_{2}\) are random numbers in the range [0, 1] and \(\beta\) is a constant. The calculation of \(\sigma\) is given in (43).

$$\sigma = ( \frac{{\beta ! .(\sin \frac{\pi \beta }{2})}}{{\left( {\frac{\beta - 1}{2}} \right)!.\beta .2^{{\frac{\beta - 1}{2}}} }} )^{{\frac{1}{\beta }}}$$
(43)
figure a

Pseudocode for dragonfly optimization.
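A simplified sketch of the dragonfly update rules (Eqs. 34–40) applied to a generic fitness function, e.g. KELM validation accuracy over its hyperparameters, is shown below. The weight schedule, neighborhood handling, and Lévy flight are simplified, and the helper `validate_kelm` in the usage comment is hypothetical.

```python
# Simplified dragonfly algorithm for maximizing a generic fitness function.
import numpy as np

def dragonfly_optimize(fitness, dim, bounds, n_agents=20, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    pos = rng.uniform(lo, hi, (n_agents, dim))
    step = np.zeros((n_agents, dim))
    w = dict(s=0.1, a=0.1, c=0.7, f=1.0, e=1.0, inertia=0.9)      # illustrative weights

    fit = np.array([fitness(p) for p in pos])
    food, enemy = pos[fit.argmax()].copy(), pos[fit.argmin()].copy()

    for _ in range(n_iter):
        for i in range(n_agents):
            others = np.delete(pos, i, axis=0)
            S = -np.sum(pos[i] - others, axis=0)                  # separation (standard DA form), cf. Eq. (34)
            A = np.delete(step, i, axis=0).mean(axis=0)           # alignment, Eq. (35)
            C = others.mean(axis=0) - pos[i]                      # cohesion, Eq. (36)
            F = food - pos[i]                                     # attraction to food, Eq. (37)
            E = enemy + pos[i]                                    # distraction from enemy, Eq. (38)
            step[i] = (w["s"]*S + w["a"]*A + w["c"]*C +
                       w["f"]*F + w["e"]*E) + w["inertia"]*step[i]   # step vector update, Eq. (39)
            pos[i] = np.clip(pos[i] + step[i], lo, hi)            # position update, Eq. (40)
        fit = np.array([fitness(p) for p in pos])
        food, enemy = pos[fit.argmax()].copy(), pos[fit.argmin()].copy()
    return food, fit.max()

# Example (hypothetical helper): best_params, best_acc = dragonfly_optimize(
#     lambda p: validate_kelm(C=p[0], gamma=p[1]), dim=2, bounds=(0.001, 100))
```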

Results and discussion

The proposed system is implemented in Python using the TensorFlow and Keras libraries. The 560 CT lung images were divided in the ratio 80:20 for KELM classifier training: 448 images are used for training the classifier and 112 images for evaluating it. For grade 1 COPD, 104 images were used for training and 30 for testing; for grade 2, 113 images were used for training and 25 for testing. For grades 3 and 4, a total of 131 and 157 images were employed, respectively. The details of the dataset division are given in Table 1 below.

Table 1 CT image dataset division.

Figure 4 shows the sample view of the COPD Patient dataset collected from the Kaggle website.

Fig. 4
figure 4

Kaggle COPD patient dataset sample.

Figure 5 shows the sample CT images of patient’s lungs affected by COPD.

Fig. 5
figure 5

Sample CT images.

Table 2 below lists the details of the data augmentation processes along with their value ranges. The images are rotated by up to 180 degrees and the scaling factor is around 0.25. Both horizontal and vertical flipping are performed to create new images, and translation is also carried out.

Table 2 Parameters of data augmentation.

Figure 6 shows the bilateral filtered and DHE based contrast enhanced lung CT images.

Fig. 6
figure 6

Preprocessed CT images of the affected lungs.

SuperCut algorithm-based segmentation results are shown in Fig. 7.

Fig. 7
figure 7

Segmented CT images.

Table 3 lists the simulation parameters chosen for executing the proposed system. The population size is 50 with a learning rate of 0.01. The number of hidden units in the classifier is 256 and the dropout rate is 0.3. The batch size is 32, the number of epochs is 200, the number of iterations is 100, and the kernel size is 5 for the chosen classifier.

Table 3 Simulation details.

The classification performance of the proposed KELM model is measured using accuracy, precision, recall, F1-Score, specificity, Matthew's Correlation Coefficient (MCC), the Receiver Operating Characteristic (ROC) curve, and Area Under the Curve (AUC) values. These metrics reflect the actual performance of the classifier, account for negative instances as well, and thus evaluate the quality of the classifier more thoroughly. Table 4 shows the scores achieved by the proposed system during the training and testing phases. The proposed dragonfly optimized KELM classifier produced an accuracy of 98.82%, precision of 99.01%, recall of 94.98%, F1 score of 96.11%, MCC score of 94.33%, specificity of 98.09%, ROC value of 0.95, and AUC score of 0.996.

Table 4 Performance evaluation.
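These metrics can be computed with standard scikit-learn utilities, as in the sketch below; `y_true`, `y_pred`, and `y_score` are placeholders for the test labels, predicted grades, and per-class probability scores of the classifier.

```python
# Sketch of the evaluation metrics used in Table 4, computed with scikit-learn.
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, matthews_corrcoef, roc_auc_score,
                             confusion_matrix)
import numpy as np

def evaluate(y_true, y_pred, y_score):
    cm = confusion_matrix(y_true, y_pred)
    # Per-class specificity = TN / (TN + FP), macro-averaged over the four grades.
    spec = np.mean([(cm.sum() - cm[i].sum() - cm[:, i].sum() + cm[i, i]) /
                    (cm.sum() - cm[i].sum()) for i in range(cm.shape[0])])
    return {
        "accuracy":    accuracy_score(y_true, y_pred),
        "precision":   precision_score(y_true, y_pred, average="macro"),
        "recall":      recall_score(y_true, y_pred, average="macro"),
        "f1":          f1_score(y_true, y_pred, average="macro"),
        "mcc":         matthews_corrcoef(y_true, y_pred),
        "specificity": spec,
        "auc":         roc_auc_score(y_true, y_score, multi_class="ovr"),
    }
```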

The training and validation accuracy curves are presented in Fig. 8 and the training and validation loss curves in Fig. 9. Accuracy rises with each epoch, peaking at 98.91% during the training phase, while the training loss decreases steeply.

Fig. 8
figure 8

Training and validation accuracy curve.

Fig. 9
figure 9

Training and validation loss curve.

Figure 10 shows the analysis of precision and recall scores. The average precision is above 0.991, indicating the strong performance of the proposed model. The average precision for grade 1 COPD is 1.00, for grade 2 it is 0.998, for grade 3 it is 0.991, and for grade 4 it is 1.00.

Fig. 10
figure 10

Precision recall analyses.

Table 5 shows the precision scores attained by the proposed KELM classifier at regular intervals and compares them with the precision values of existing systems such as Logistic Regression (LR), Random Forest (RF), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), and InceptionV338. At epoch 50 the proposed system reaches 92.5, and at epoch 100 it attains a precision score of 93.1. After the 150th epoch, a score of 97.3 is achieved, and the final precision value attained by the KELM classifier is 99.01.

Table 5 Precision analysis.

Figure 11 illustrates the comparison of precision with the other models under study. The precision of the proposed KELM model is clearly higher than that of the other models, demonstrating the strong performance of the chosen classifier.

Fig. 11
figure 11

Comparative analysis of Precision.

Table 6 provides the recall values scored by the proposed model at regular epoch intervals (the 50th, 100th, 150th, and 200th epochs) as well as those of existing systems39. As the number of epochs increases, the recall score also increases: after 50 epochs the proposed model scores 91.9, at epoch 100 it reaches 92.4, and at epoch 200 the final recall value reaches 94.98, showing the improved performance of the proposed system over the other models.

Table 6 Recall analysis.

Figure 12 depicts the comparison graph of recall values achieved by the proposed system and existing classifiers of COPD. It is evident from the figure that the recall value scored by the proposed system is better than all of them.

Fig. 12
figure 12

Comparison of Recall values.

The F1 scores achieved by the proposed model are listed in Table 740. After 50 epochs the proposed model attains 93.8, and at epoch 100 the F1 score is 94.9. Similarly, at the 150th and 200th epochs, the values are 95.2 and 96.11, respectively.

Table 7 F1-Score analysis.

The comparative analysis of F1 score is shown in Fig. 13. As with the other performance metrics, the F1 score recorded by the proposed KELM model is higher than that of the compared models.

Fig. 13
figure 13

Comparison of F1-Score.

Table 8 shows the MCC scores attained by the proposed and existing systems41. MCC scores are shown for four selected epochs, namely the 50th, 100th, 150th, and 200th, where the proposed model scores 89.6, 92.4, 93.1, and 94.33, respectively.

Table 8 MCC analysis.

Figure 14 presents the MCC values achieved by the proposed and existing classifiers. The proposed system attains the highest MCC score among them.

Fig. 14
figure 14

MCC analysis.

Table 9 shows the specificity values attained by the proposed and existing algorithms at the four selected epochs42. After the 50th epoch, the specificity of the proposed KELM model is 94.5; at the 100th, 150th, and 200th epochs, the values are 95.1, 96.7, and 98.09, respectively.

Table 9 Specificity analysis.

Figure 15 graphically presents the specificity scores attained by the proposed system and compares them with those of the other models under study.

Fig. 15
figure 15

Specificity analysis.

Table 10 specifies the accuracy values of the proposed and existing algorithms43. After the 50th epoch, the accuracy of the proposed KELM model is 93.8; after another 50 epochs, it reaches 94.6. At the 150th and 200th epochs, the values are 97.1 and 98.82, respectively.

Table 10 Accuracy analysis.

Figure 16 below depicts the accuracy of the proposed system in comparison with the existing classifiers. It can be seen that the classification performance of the proposed dragonfly optimized KELM model is considerably better than that of the existing deep learning techniques.

Fig. 16
figure 16

Comparative analysis of accuracy.

Figure 17 shows the ROC curve attained by the proposed system using the KELM classifier and the dragonfly optimization algorithm. The ROC curve is a graphical representation of the classifier's performance at different threshold values, plotting the true positive rate against the false positive rate.

Fig. 17
figure 17

ROC curve of proposed KELM model.

The AUC values attained for the different grades of COPD are 0.999, 0.994, 0.998, and 1.000 for grades 1, 2, 3, and 4, respectively. The average AUC of the proposed model is 0.996, which is particularly good. The ROC curve and its corresponding AUC values demonstrate the superior performance of the chosen classifier for COPD disease classification.

Conclusion

Chronic obstructive pulmonary disease is becoming a global threat that needs to be curbed immediately. The proposed kernel extreme learning machine based classifier, coupled with the dragonfly optimization technique, used a hybrid dataset to identify the different grades of COPD. The pipeline uses state-of-the-art techniques such as the bilateral filter, dynamic histogram equalization, SuperCut segmentation, and binary feature fusion involving UNet and AlexNet. The proposed KELM classifier achieves an accuracy of 98.82%, precision of 99.01%, recall of 94.98%, F1 score of 96.11%, specificity of 98.09%, MCC value of 94.33%, and AUC value of 0.996. The comparative analysis with existing COPD classification models such as logistic regression, random forest, recurrent neural network, long short-term memory, and Inception V3 shows that the proposed dragonfly optimized KELM classifier outperforms all the other models and produces enhanced classification results. In future, the model can be extended to incorporate larger datasets and identify more complicated pulmonary disorders to aid medical professionals. A potential limitation of the proposed work is that accuracy may vary slightly when the model is generalized to larger and more diverse datasets.