Spatial attention-based residual network for human burn identification and classification

Yadav, D. P.; Aljrees, Turki; Kumar, Deepak; Kumar, Ankit; Singh, Kamred Udham; Singh, Teekam

doi:10.1038/s41598-023-39618-0

Download PDF

Article
Open access
Published: 02 August 2023

Spatial attention-based residual network for human burn identification and classification

D. P. Yadav¹,
Turki Aljrees²,
Deepak Kumar³,
Ankit Kumar¹,
Kamred Udham Singh⁴ &
…
Teekam Singh⁵

Scientific Reports volume 13, Article number: 12516 (2023) Cite this article

4070 Accesses
22 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Diagnosing burns in humans has become critical, as early identification can save lives. The manual process of burn diagnosis is time-consuming and complex, even for experienced doctors. Machine learning (ML) and deep convolutional neural network (CNN) models have emerged as the standard for medical image diagnosis. The ML-based approach typically requires handcrafted features for training, which may result in suboptimal performance. Conversely, DL-based methods automatically extract features, but designing a robust model is challenging. Additionally, shallow DL methods lack long-range feature dependency, decreasing efficiency in various applications. We implemented several deep CNN models, ResNeXt, VGG16, and AlexNet, for human burn diagnosis. The results obtained from these models were found to be less reliable since shallow deep CNN models need improved attention modules to preserve the feature dependencies. Therefore, in the proposed study, the feature map is divided into several categories, and the channel dependencies between any two channel mappings within a given class are highlighted. A spatial attention map is built by considering the links between features and their locations. Our attention-based model BuRnGANeXt50 kernel and convolutional layers are also optimized for human burn diagnosis. The earlier study classified the burn based on depth of graft and non-graft. We first classified the burn based on the degree. Subsequently, it is classified into graft and non-graft. Furthermore, the proposed model performance is evaluated on Burns_BIP_US_database. The sensitivity of the BuRnGANeXt50 is 97.22% and 99.14%, respectively, for classifying burns based on degree and depth. This model may be used for quick screening of burn patients and can be executed in the cloud or on a local machine. The code of the proposed method can be accessed at https://github.com/dhirujis02/Journal.git for the sake of reproducibility.

A lightweight Deeplab V3+ network integrating deep transitive transfer learning and attention mechanism for burned area identification

Article Open access 07 May 2025

Precision diagnosis of burn injuries using imaging and predictive modeling for clinical applications

Article Open access 04 March 2025

Development of a framework for managing severe burns through a 17-year retrospective analysis of burn epidemiology and outcomes

Article Open access 30 April 2021

Introduction

Burn is a life-threatening condition that needs early treatment. It is classified into various categories based on its severity and the affected tissues. The most prevalent method for categorization of burns is the "degree" mechanism, which divides burns into three primary categories: first-degree (Superficial dermal), second-degree (Deep dermal), and third-degree (Full thickness) burns. Superficial burn only affects the top layer of the skin (epidermis). The main symptoms include redness, pain, and minor swelling. Healing usually occurs within a few days without scarring¹. Deep dermal burn affects the epidermis and part of the dermis (the second layer of skin). Symptoms include redness, blistering, severe pain, and swelling. Healing time can vary, and scarring may occur depending on the depth and extent of the burn. Full-thickness burn extends through the entire epidermis and dermis, reaching into the subcutaneous tissue. Symptoms may include a leathery or charred appearance, insensitivity to pain (due to nerve damage), and white or dark brown coloration. Healing is slow and may require skin grafting, and scarring is common. For human burn treatment, first aid can’t be administered to a burn victim before properly diagnosing the injury². The deeper the burn, the more severe the injury. A dermatologist assesses burn severity before grafting is performed. Grafting involves replacing the damaged skin with healthy tissue from an unburned area. After 14–21 days of therapy, a superficial (first-degree) burn will recover. In Table 1, we see how a doctor determines the severity of burns based on the color of the affected areas.

Table 1 The classification of human burn skin according to its color.

Full size table

The manual burn diagnosis process requires expert involvement, making the process time-consuming and expensive. Dermatological experts employ fluorescence fluorometry, fluorescence, and ultrasound imaging to predict burn depth, achieving diagnostic accuracy between 50 and 80%³. Deep-dermal burns affect the second skin layer, while full-thickness burns penetrate the third layer and often involve damaged tissues, muscles, and scarring, significantly impacting a patient’s life. Effective treatment for burn scars is essential, and doctors utilize anti-scarring techniques⁴. The severity of burns can also have long-term negative consequences for patients⁵. Past studies have employed machine learning methods for burn diagnosis, which typically involve preprocessing burnt images to downsize and reduce noise. Handcrafted texture and shape features are manually extracted for training and classifying burn types, but this approach requires a small dataset and specialized expertise, leading to potential errors that reduce model performance.

In contrast, deep learning models can automatically learn features through their layers, demonstrating promising capabilities for medical image recognition in recent years⁶. However, the deep CNN model performance depends on the dataset size and model architecture⁷. Previous research utilizing deep CNN techniques has shown improved performance^{8,9,10,11,12,13}. Yet, some deep learning models with few layers and limited training datasets have led to suboptimal performance for burn diagnosis. Models like ResNeXt, AlexNet, and VGG16 were computationally expensive and did not achieve remarkable accuracy in burn diagnosis. For burn degree categorization, ResNeXt, AlexNet, and VGG16 achieved classification accuracies of 84.31%, 70.57%, and 76.32%, respectively, similar to the manual approach. We propose a spatial attention-based model called BuRnGANeXt50 to address these challenges. This model utilizes a feature map divided into categories, highlights channel dependencies within each class, and builds a spatial attention map to improve classification accuracy. Efficiently capturing information regarding the depth of the burn region is crucial for severity assessment and surgical recommendations for grafting. The proposed model demonstrates excellent performance in quickly screening different types of burns while also being computationally efficient.

The significant contribution of the manuscript is as follows.

(1)
The proposed BuRnGANeXt50 is a residual network that takes less computation time than ResNeXt. Since ReNext has 23 × 10⁶ and our model has 5 × 10⁶ neurons
(2)
Two-channel maps are separated into categories, and the channel dependencies between them are highlighted. Meanwhile, a spatial attention map is built from the spatial relationships between features
(3)
The training and validation loss on BIP_US Database is significantly less, which confirms that the proposed model is sensitive for burn diagnosis

The rest of the paper is organized as follows:

“Literature review” Section describes a study involving a detailed review of human burns. At the same time, “Proposed method” section describes the architecture of the BuRnGANeXt50 model. “Results” section describes the experimental procedures for the diagnosis of urn based on degree and depth. Finally, in “Discussion” section, the comparative study of different models and BuRnGANXt50 is described in detail.

Literature review

To segment renal tissue and identify immunological (CD3 +) inflammatory cells, Hermsen et al.¹⁴ developed two CNNs models. Human evaluation of the Banff lesion types was compared with automated measurement of glomeruli, interstitial fibrosis, and (total) inflammation, and strong correlations were found. Long-term changes in estimated glomerular filtration rate (eGFR) are inversely related to inflammation inside scarred regions, according to automated and visual examination of a small cohort¹⁴. The machine learning technique used by Abubakar et al.^15,16,17 classifies the human burn using the African dataset. Burn healing times may be used to predict burn depth. Specifically, they used One-versus-One SVM to examine the efficacy of leveraging deep features obtained from a pretrained model to address a multiclass problem. Relevant discriminating characteristics of the images were obtained using VGG16 and pretrained ResNet50. With VGG16 features (VggFeat16), the proposed method achieved a prediction accuracy of 85.67%, whereas The ResNet50 model achieved a maximum classification accuracy of 95.43%¹⁵.

Furthermore, Suha et al.¹⁸ proposed a Deep Convolutional Neural Network (DCNN)-based model with transfer learning and fine-tune for assessing skin burn degree from real-time burn photos of patients. The design utilizes several convolutional layers and hyperparameter tuning for feature extraction and image classification into three distinct classes. The traditional approach, which used digital image processing and regular machine learning classifiers, was also tested and evaluated for this multiclass classification problem¹⁸. Abubakar et al.^15,16,17 observed that 90.45% of the time, it was possible to identify the different types of burns correctly. This study lays the groundwork for future investigations, notably in healthcare, to focus on how racial feature representations may be integrated with training data to produce effective and widely used diagnostic tools¹⁶.

The convolutional neural network-based approach for body part burn image identification Chauhan et al. (2020) has introduced a deep CNN model that can be used to develop more effective computer-assisted burn diagnostic tools by combining non-burn images with the body part-specific burn severity rating model. The burn image body part classification (BI-BPC) and Body Part-specific Burn Severity Assessment Model utilized deep convolutional neural networks for evaluating the two labelled burn image datasets (BPBSAM). Using BI-BPC and ResNet50 for feature extraction in severity evaluation shows maximum efficiency¹⁹. Pabitha et al.²⁰ presented a hybrid model combining DenseMask RCNNs with transfer learning to classify skin burns accurately. They engage in dense pose estimation²⁰ to split the burn zone, classify it into varying degrees, and compute the burned depth according to the severity of the lesion.

Khan et al.²¹ collected burn images from people of diverse ages and ethnicities. It was nearly impossible to collect images from healthcare facilities due to ethical concerns. Using image mining and DCNN classification, a method is described for segmenting damaged skin and calculating burn depth. A hybrid segmentation method eliminates the background of a picture of burned flesh. The burn depths were classified using a DCNN with an accuracy of 79.4%²¹. Wu et al.²² developed a convolution neural network (CNN) method for burn image recognition. The experimental results in this work show that the CNN based model can effectively classify and detect burn areas²².

Recent research by Khan et al.²¹ discriminated between mild and severe burn symptoms using preprocessing and image-down sampling methods. Otsu’s technique is used to remove the scorched area from the image. Their method categorizes burns as first, second, or third-degree. The performance of the model can be better using more complex CNN models and bigger data sets²¹. Recently, Rostami et al.²³ developed a deep learning model for diagnosing human burns. Their method employs a deep CNN model for feature extraction, whereas SVM is employed for classification. The fivefold cross-validation method achieves 87.7% accuracy in multiclass classification, whereas the binary class achieves 94.28% accuracy²³. Some of the recent method used for burn diagnosis using ML and DL has been shown in Table 2.

Table 2 Summary of the recent work using ML and DL.

Full size table

Proposed method

Accurate diagnosis of human burns requires a sensitive model. ML and DL are commonly employed in medical imaging for disease diagnosis. ResNeXt, AlexNet, and VGG16 are state-of-the-art deep-learning models frequently utilized for medical image diagnosis. In this study, we evaluated and compared the performance of these models for diagnosing burn images. However, these models showed limited effectiveness in accurate diagnosis of burn degree and distinguishing grafts from non-grafts.

ResNeXt, a deep residual model, consists of 50 layers, while AlexNet and VGG16 are sequential models with eight and 16 layers, respectively. These layers extract features from the burned images during the model’s training process. Unfortunately, distinguishing between deep dermal and full-thickness burns can be challenging, as they share similar white, dark red, and brown colors. Consequently, highly delicate and stringent methods are required for accurate differentiation. AlexNet and VGG16, being sequential models, mainly extract low-level features, whereas ResNeXt excels in extracting high-dimensional features. A limitation is that these models can only learn positive weight features due to the ReLu activation function. This constraint may hinder their ability to precisely identify critical burn characteristics. The DL models, AlexNet, ResNeXt, VGG16, and InceptionV3 are widely used for medical image diagnosis, however, these models encounter challenges in accurately categorizing burn degrees and differentiating grafts from non-grafts. Finding effective ways to handle these challenges and improve feature extraction could lead to more sensitive and reliable burn diagnosis models.

The ResNeXt model³³ influenced the BuRnGANeXt50 model. To construct a BuRnGANeXt50 model, the original ResNeXt model’s topology is modified. Moreover, the original ResNeXt was created to classify images into several categories with high computation costs. In this study, the method performs a multiclass and binary class classification task. Multiclass classification is used to assess burn severity based on burn depth. After that, based on depth, burns may be broken down into two distinct types: graft and non-graft. Reducing the first layer filter size from 7 × 7 to 5 × 5 is the first change to the original ResNext model’s design because a larger filter size resulted in lower pixel intensity in the burnt region. This has led to a rise in the frequency of spurious negative results for both grafts and non-grafts. Furthermore, the convolution sizes of Conv1, Conv2, Conv3, Conv4, and Conv5 are also changed to reduce the computation cost while maintaining cardinality. Furthermore, we applied Leaky ReLu instead of the ReLU activation for faster model convergence. Table 2 also shows that conv2, conv3, and conv4 are shrinking in size. After implementing all modifications, neurons decreased from 23 × 10⁶ to 5 × 10⁶, as shown in Table 3. The detailed architecture of the proposed model is shown in Fig. 1.

Table 3 BuRnGANeXt50 model versus original topology.

Full size table

This model has several essential building blocks, including convolution, residual, ReLU, activation, softmax, and flattened layer. The results of groups’ convolution of neurons inside the same kernel map are summed together by pooling layers, which reduce the input dimensionality and enhance the model performance. The pooling units in the proposed model constitute a grid, with each pixel representing a single voting location, and the value is selected to gain overlap while reducing overfitting. Figure 2 describes the structure of the model’s convolution layer. Polling units form a grid, each pixel representing a single voting place being centered $z \times z$. In the provided model, we employ the standard CNN with parameters set to $S = z$, but we add a charge of $S < z$ to increase overlap and decrease overfitting³⁴. The proposed architecture was developed to handle the unique issues of burn diagnosis, emphasizing decreasing overfitting and enhancing model accuracy.

The inner dot product is an essential part that neurons perform for the foundation of an artificial neural network’s convolutional and fully connected layers. The inner dot product may compute the aggregate transform, as illustrated in Eq. (1).

$$\mathop \sum \limits_{i = 1}^{K} w_{i} \rho_{i}$$

(1)

represents the neuron’s k-channel input vector. Filter weight is given by $w_{i}$for i-the neurons. This model replaces the elementary transformations with a more generic function $\left( {w_{i} \rho_{i} } \right)$. By expanding along a new dimension, this generic function reduces depth. This model calculates the aggregated transformations as follows:

$${\Im }\left( \rho \right) = \mathop \sum \limits_{i = 1}^{{\mathbb{C}}} \Upsilon_{i} \left( \rho \right)$$

(2)

The function $\Upsilon_{i} (\rho )$ is arbitrarily defined. $\Upsilon_{i}$ project $\rho$ into low-dimensional embedding and then change it, similar to a primary neuron. ${\mathbb{C}}$ represents the number of transforms to be summed in Eq. (2). ${\mathbb{C}}$ is known as cardinality³⁵. As the residual function, Eq. (2)‘s aggregated transformation serves³⁶. (Fig. 3):

$$x = \rho + \mathop \sum \limits_{i = 1}^{{\mathbb{C}}} \Upsilon_{i} \left( \rho \right)$$

(3)

where $x$ is the model’s predicted result.

Finally, at the top of the model a flattened and a global average pooling is added. The Softmax activation classifies burn into binary and multiclass. The softmax optimizer uses the exponent of each output layer to convert logits to probabilities³⁷. The vector $\Phi$ is the system input, representing the feature set. Our study uses k classification when there are three levels of burn severity (k = 3) and two levels of graft versus non-graft (k = 2). For predicting classification results, the bias $W_{0} X_{0}$ is added to each iteration.

$$p(\rho = i|\Phi^{\left( j \right)} ) = \frac{{e^{{\Phi^{\left( j \right)} }} }}{{\mathop \sum \nolimits_{i = 0}^{k} e^{{\Phi_{k}^{\left( j \right)} }} }}$$

(4)

$${\text{In}}\;{\text{which}}\;\Phi = W_{0} X_{0} + W_{1} X_{1} + \ldots + W_{k} X_{k}$$

(5)

Residual attention module

The residual attention block, which allows attention to be routed across groups of separate feature maps, is shown in Fig. 3. Furthermore, the channel’s extra feature map groups combine the spatial information of all groups via the spatial attention module, boosting CNN’s capacity to represent features. It comprises feature map groups, feature transformation channels, spatial attention algorithms, etc. Convolution procedures can be performed on feature groups, and cardinality specifies the number of feature map groups. A new parameter, "S," indicates the total number of groups in the channel set³⁸ and the number of subgroups in each of the N input feature groups. A channel scheduler is a tool that optimizes the processing of incoming data through channels. This method transforms feature subsets. G = N * S is the formula for the total number of feature groups.

Using Eq. (6), we conduct an essential feature modification on subgroups inside each group after channel shuffling.

$$g\left( {r,i,j} \right) = \left[ {\begin{array}{*{20}c} {\cos \frac{r\pi }{2}} & { - \sin \frac{r\pi }{2}} \\ {\sin \frac{r\pi }{2}} & {\cos \frac{r\pi }{2}} \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} i \\ j \\ \end{array} } \right]$$

(6)

Here $0\le r<4,\left(i,j\right)$ stands for the original matrix’s coordinates. K represents the 3 × 3 convolution of the bottleneck block, and Output is written as $y_{s}$. Then, for each $x_{s}$ input

we have:

$$y_{s} = \left\{ {\begin{array}{*{20}c} {K\left( {g_{r} \left( {x_{s} } \right)} \right)r,} & {s = 0} \\ {K\left( {g_{r} \left( {x_{s} } \right)} \right) \odot y_{0} } & {0 < r = s < 4} \\ \end{array} } \right.$$

(7)

$g\& r$ here represents the input $x_{s}$. “$\odot$” corresponds to element multiplication in the matrix’s related feature transformation. Features of x being transformed are shared across the three 3 × 3 convolution operators K.

Channel and spatial attention modules

Semantic-specific feature representations can be improved by exploiting the interdependencies among channel graphs. We use the feature map’s channels as individual detectors. Figure 3A depicts how we send the feature map of the $no\in \mathrm{1,2},...,N$ group ${G}^{no}\in {R}^{C/N\times H\times W}$ to the channel attention module. As a first step, we use geographic average pooling (GAP) to gather global context information linked to channel statistics³⁹. The 1D channel attention maps ${C}^{no}\in {R}^{C/N}$ are then inferred using the shared fully connected layers.

$$C^{n} = D_{sigmoid} \left( {D_{{{\text{Re}} LU}} \left( {GAP\left( {G_{n} } \right)} \right)} \right)$$

(8)

$"{D}_{sigmoid}and{D}_{\mathit{Re}LU}"$ represents a fully linked layer that uses both "Sigmoid" and "ReLU" as activation functions. At last, Hadamard products are used to infer a group’s attention map and the corresponding input features. Then the components from each group are weighted and added together to produce an output feature vector. The final channel attention map

$$C \in R^{C/N \times H \times W} C = \mathop \sum \limits_{n = 1}^{N} \left( {C^{n} \odot G^{n} } \right)$$

(9)

Each group’s 1 × 1 convolution kernel weight is multiplied by the 3 × 3 kernel weight from the subgroup’s convolutional layer. The global feature dependency is preserved by adding the group’s channel attention weights, which all add up to the same value.

A spatial attention module is used to synthesize spatial links and increase the spatial size of associated features. The channel attention module is separate from that component. The spatial information of feature maps is first aggregated using global average pooling (GAP) and maximum global pooling (GMP)³⁹ to obtain two distinct contextual descriptors. Next, by joining $GAP(C)\in {R}^{1\times H\times W}andGMP(C)\in {R}^{1\times H\times W}$ connect to get ${S}_{c}\in {R}^{2\times H\times W}$.

$$S_{c} = GAP\left( C \right) + GMP\left( C \right)$$

(10)

The plus sign “+”denotes a linked feature map. The regular convolutional layer retrieves the spatial dimensional weight information to round things out. $S_{conv}$ Final spatial attention map $S\in {R}^{C/N\times H\times W}$ is obtained by element-wise multiplying the input feature map $C$ with itself.

$$S = Conv_{3 \times 3} \left( {S_{C} } \right) \odot C$$

(11)

$"Con{v}_{3\times 3}"$ means regular convolution, while "Sigmoid" denotes the activation function.

Local response normalization

Leaky ReLU activation-based deep learning models do not rely on input normalization for saturation. Neurons in this model are more efficient at learning from negative inputs. Despite this, neural activity is calculated ${\alpha }_{u,v}^{i}$ At a point $(u,v)$ by using the kernel $i$, which facilitates generalization. The ReLU nonlinearity is then implemented. The ReLU nonlinearity is then implemented. The response normalized ${\alpha }_{u,v}^{i}$ is determined using the provided Eq. (12).

$$b_{u,v}^{i} = \frac{{\alpha_{u,v}^{i} }}{{\left( {t + \alpha \mathop \sum \nolimits_{j - \max (0,i,n/2)}^{\min (N,1,i + n/2)} (\alpha_{u,v}^{j} )^{2} } \right)^{\beta } }}$$

(12)

where $N$ are the total number of layers and $t,\alpha ,n,\beta$ are constants? This $\sum {}$ is computed for each of the $n$ neighboring⁴⁰. We trained the network using a $100 \times 100 \times 3$ picture and the original ResNeXt CNN topology’s cardinality hyper-parameter ${\mathbb{C}}=32$. The algorithm of the proposed method is shown below.

Algorithm of the proposed method.

Ethical approval

All authors contributed to the conception and design of the study. All authors read and approved the final manuscript.

Results

Dataset

The dataset includes images of human burns having several depths (super-dermal, deep-dermal, and full-thickness) and types (graft, non-graft). The University of Seville’s (Spain) Signal Theory and communications department’s biomedical image processing (BIP) Group at the Virgen del Roco Hospital (Spain) collected images. There are 94 images with varying sizes in the original dataset compiled from⁴¹. Four types of data augmentation techniques were applied to the burn image: horizontal flip, vertical flip, rotation through 30°, and rotation through 30°. Finally, 6000 images retained the augmented dataset. Figure 4a–e show the augmented images of the modified dataset.

The proposed method is implemented on Nvidia GeForce GTX TITAN X GPU using Python 3.8 and Tensor Flow 2.0. The BuRnGANeXt50 is trained with images of batch size 32 and an initial learning rate of 1e^-3 on the Windows 10 operating system.

The mathematical technique of model performance analysis

The system’s efficacy is evaluated using a confusion matrix and its f1-score, accuracy, precision, recall, sensitivity, and specificity values. They are calculated using the true positive (TP), false positive (FP), false negative (FN), and true negative (TN) indicators (True Negative).

$$Acc = \frac{TP + TN}{{TP + TN + FP + FN}}$$

(13)

$$PRE = \frac{TP}{{TP + FP}}$$

(14)

$$RE = \frac{TP}{{TP + FN}}$$

(15)

$$f1 - score = 2{*}\frac{{PRE{*}RE}}{PRE + RE}$$

(16)

$$Sen = \frac{TP}{{TP + FN}}$$

(17)

$$Spec = \frac{TN}{{TP + TN}}$$

(18)

where $TN=$ The model labels it as unfavorable since it is a negative number. $TP=$ It is a genuine positive value, and the model also classifies it as positive. $FP=$ The model incorrectly interprets a negative value as positive. $FN=$ It is a positive number and a negative category of models.

Classification of burn degrees (multiclass)

The extended dataset⁴¹ comprises images of superficial burns, deep burn, and full-thickness burns. To ensure an unbiased model, a fivefold cross-validation is employed. In this process, the dataset is divided into 80% for training and 20% for validation, with each fold using different partitions. The training is conducted with an initial learning rate of 1e-3, and the input image size is downsampled to 100 × 100 pixels. The model undergoes 100 training iterations, utilizing a mini-batch size of 32. After training on each of the five test data folds, confusion matrices (CM) are generated, as depicted in Fig. 5. The obtained confusion matrices are shown in Fig. 5a–e. The results of the provided BuRnGANeXt50 model for each fold are shown in Table 4. Average values for the model’s sensitivity, specificity, F1-score, recall, and accuracy were 97.25%, 97.22%, 97.2%, 98.65%, and 97.17%. Classification accuracy for burns of varying depths in the skin (superficial, deep, and full thickness) is more than 98% using this approach.

Table 4 The multiclass classification effectiveness measures of the BuRnGANeXt50 model.

Full size table

The model training and validation loss are displayed in Fig. 6. Figure 6a shows that the model has a training accuracy of around 100% and a validation accuracy of over 98%. Figure 6b shows that after 80 iterations, the training loss and validation loss have reduced to almost zero.

Burn’s graft and non-graft diagnosis (binary classification)

For further diagnosis, the degree of burn is required. The severity of a burn can be determined by depth⁴². A doctor uses the grafting procedure to replace the burned skin on a patient’s body. Often, grafting is necessary for full-thickness and severe burns⁴³. The improved burn dataset consists of graft and non-graft. The grafts represent full-thickness and deep dermal burns, whereas non-grafts represent superficial burns. Four thousand images of human burns were utilized for the binary classification. Moreover, fivefold cross-validation was performed on the dataset. The data set is split into two halves, 80% and 20%, using a fivefold cross-validation. The training set uses 80% of the data for every fold, while the validation set uses 20%. The input image is scaled to 100 × 100 × 3 pixels, and the model’s initial learning rate is set to 1e-3. After that, a mini-batch size of 32 and 100 iterations was used to train the model. Figure 7 displays the confusion matrices (CM) acquired after training for each of the 5 test data folds. Figures 7a–e are five confusion matrices obtained for graft and non-graft. The performance measures of the BuRnGANeXt50 model across all folds are shown in Table 4. The model also has a sensitivity of 99.14%, a specificity of 99.84%, and an accuracy of 99.48% when classifying data into binary categories.

Classification accuracy of the training and validation as well as loss, are displayed in Fig. 8. After 45 iterations, the accuracy during training is close to 100% (Fig. 8a). Similarly, the model’s training and validation losses are near 0 and saturated after 45 iterations (Fig. 8b). Table 5 depicts the performance of all 5 folds of the proposed BuRnGANeXt50 model.

Table 5 performance metrics for binary class classification of the BuRnGANeXt50 model.

Full size table

Discussion

Automated methods for human burn diagnosis, utilizing deep learning, machine learning, and transfer learning, have been explored in various studies. For instance, Abubakar et al.^15,16,17 employed deep transfer learning with ResNet50 and VGG16 to extract visual patterns from a dataset of 2080 RGB images containing healthy skin and burns. Their proposed technique achieved a maximum prediction accuracy of 95.43% using ResNet50 and 85.63% with VggFeat16. Similarly⁴⁴, utilized ResNet101 for burn skin prediction, achieving a 95.9% accuracy rate.

Another study by Yadav et al.⁴⁰ focused on diagnosing burns and categorizing them as graft or non-grafts, achieving an accuracy of 82%. Abubakar et al.^15,16,17 employed transfer learning to classify skin as burnt or healthy, reaching 99.3% accuracy with the Caucasian dataset and 97% with the African dataset.

Machine learning models for evaluating burn severity were presented by Shin et al., achieving a 70.0% accuracy rate on an unlabelled dataset of 170 images learned via self-supervised learning techniques. Rahman et al.⁴⁶ suggested a vision-based method for identifying burns on the skin, with their SVM model achieving a maximum accuracy of 93.33%.

Despite their usefulness, these approaches have drawbacks, such as high computation costs and decreased efficiency in predicting grafts and non-grafts. To address these challenges, we proposed BuRnGANeXt50, an attention-based residual network that is less expensive and more efficient than ResNeXt. The suggested approach optimizes the convolution size and kernel size, and Leaky ReLu activation is employed to accelerate convergence. A channel and spatial attention module are also included to improve the local feature co-relationship. The existing methodologies for diagnosing burns using diverse datasets are summarised in Table 6. In Table 6, we can notice that the automatically trained classification models outperform compared to manually extracted feature classification models.

Table 6 Past work on the diagnosis of human burns using various ML and DL.

Full size table

An accuracy of 80%, 82.23%, and 84% were achieved using machine learning techniques such as SVM and kNN on the dataset used in this research^16,20,47. For the fair performance comparison, we utilized ResNeXt, AlexNet, and VGG16 for the multiclass (superficial vs. deep dermal vs. full thickness) and binary class (graft vs. non-graft) classification. In addition, the same training setup and dataset were used to evaluate the model’s performance. In Table 7, we summarize the performance of multiclass classification, and in Table 7, the performance of graft and non-graft is discussed. In Table 6, we can see AlexNet achieved the lowest classification accuracy of 70.57%, Whereas ResNeXt obtained 84.31% and the proposed BuRnGANeXt50 achieved a classification accuracy of 98.14%.

Table 7 Evaluation of the presented model versus the previous model for multiclass classification.

Full size table

In Table 8, we can see that the precision and F1-score of the AlexNet are 73.14% and 71.62%, respectively. Slight improvement was noticed in VGG16. The second highest precision and F1-score achieved by ResNeXt. Whereas the proposed model achieved 99. 86% and 99.49% precision and recall values, respectively.

Table 8 Evaluation of the presented model versus the previous model for binary classification.

Full size table

The provided BuRnGANeXt50 model showed the best results for multiclass and binary class classification. In addition, the computation time per epoch and trainable parameters are very less, as shown in Table 9. The BuRnGANeXt50 model can be used for real-time applications and provide a second healthcare opinion.

Table 9 Comparison of the computation time per epoch.

Full size table

We compare the performance of the proposed method and ResNeXt, AlexNet, and VGG16 for multiclass and binary class classification shown in Figs. 9 and 10, respectively. In Fig. 9, we can notice that all performance measure bars of the BuRnGANeXt50 are much higher than the other state-of-the-art methods. Similarly, in Fig. 10, we can notice ResNeXt performance measure bars are much better than the AlexNet and VGG16. However, the proposed method’s performance measures precision, recall, F1-score, and accuracy is much better than ResNeXt.

Furthermore, we plotted the ROC (Receiver Operating Characteristic) curve of the proposed method’s true positive rate and false positive rate for multiclass classification, shown in Fig. 11. We can notice that the ROC value of superficial and full thickness burn is 1, whereas the deep dermal burn is 0.99. This confirms the proposed BuRnGANeXt50 is highly sensitive for burn diagnosis.

Limitations

The proposed model algorithm computation cost is still a challenge. In addition, the attention module provides only local dependencies of the features. That may reduce the performance in some scenarios.

Conclusion

Burn diagnosis is timely and accurately is necessary to save the patient’s life. The traditional method of burn diagnosis is time-consuming, and the accuracy depends on the dermatologist’s expertise. Recent advancement in ML and DL in medical imaging has improved the accuracy and reduced the diagnosis time. However, ML-based methods require handcrafted features for model training that may reduce efficiency. Conversely, the Shallow DL method extracts features automatically but lacks the feature correlation dependency. We conducted experiments using AlexNet, VGG16, and ResNext. However, these models’ performance for classifying burns could be more optimal, and the computation costs are high due to high trainable parameters. The original ResNext performance is better compared to AlexNet and VGG16 due to the capability of capturing high-dimensional features. Many trainable parameters and activation functions make the model less reliable for real-time applications.

In this study, we proposed a modified residual network with less trainable parameters and an attention block for burn diagnosis. After extensive experiments, the convolution and filter size are optimized. Further, instead of ReLu activation, Leaky ReLu activation is utilized, which improves the convergence rate. The spatial attention module enables the model to focus on significant regions of interest, such as burn edges, blisters, and regions with varying degrees of injury. In the meantime, the channel attention module concentrates on crucial characteristics within each network layer, enabling the model to extract the most informative aspects from the input data. Combining spatial and channel attention mechanisms enables our model to learn discriminative patterns from burn images, resulting in superior diagnostic performance. The model’s performance for classifying burns based on degree and depth into three classes and binary class is much better than the state-of-the-art method. The precision and accuracy of the BuRnGANeXt50 for multiclass classification are 97.22% and 98.14%, respectively. Furthermore, the proposed model classifies the burn into graft and non-graft with a precision and accuracy of 99.86% and 99.48%, respectively. This confirms the model is highly sensitive for burn diagnosis and can provide a second opinion to a doctor. In addition, the model computation time per epoch is much less, making it suitable for real-time applications.

The computation time of the proposed is still a challenge that needs further improvement. In addition, the model needs to test on other diverse datasets and a real-time dataset for further evaluation. We found some images of deep dermal classified to full thickness due to similar texture and color characteristics. Furthermore, the results need to be evaluated by the healthcare expert. In future research, we will explore capturing the global relation of the features using a vision transformer-based model to improve the long-range dependency of the features. In addition, the extracted features can be optimized using nature-inspired algorithms to enhance the classification accuracy. Furthermore, a calibration technique can be applied to measure the model’s bias. Furthermore, addressing the challenges associated with model interpretability can be improved using a grad cam.

Data availability

The data supporting this study’s findings are available upon request from the corresponding authors.

References

Van Yperen, D. T. et al. Adherence to the emergency management of severe burns referral criteria in burn patients admitted to a hospital with or without a specialized burn center. Burns 47(8), 1810–1817 (2021).
Article PubMed Google Scholar
Brekke, R. L., Almeland, S. K., Hufthammer, K. O. & Hansson, E. Agreement of clinical assessment of burn size and burn depth between referring hospitals and burn centers: A systematic review. Burns 49(3), 493–515 (2022).
Article PubMed Google Scholar
Lee, S. et al. Real-time burn classification using ultrasound imaging. Sci. Rep. 10(1), 5829 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Deflorin, C. et al. Physical management of scar tissue: A systematic review and meta-analysis. J. Altern. Complement. Med. 26(10), 854–865 (2020).
Article PubMed PubMed Central Google Scholar
Rangel-Olvera, B. & Rosas-Romero, R. Detection and classification of burnt skin via sparse representation of signals by over-redundant dictionaries. Comput. Biol. Med. 132, 104310 (2021).
Article PubMed Google Scholar
Rahman, A. U. et al. A framework for susceptibility analysis of brain tumours based on uncertain analytical cum algorithmic modeling. Bioengineering 10(2), 147 (2023).
Article PubMed Central Google Scholar
Husham, S. et al. Comparative analysis between active contour and otsu thresholding segmentation algorithms in segmenting brain tumor magnetic resonance imaging. J. Inf. Technol. Manag. 12(Special Issue: Deep Learning for Visual Information Analytics and Management), 48–61 (2020).
Google Scholar
Kapoor, B., Nagpal, B., Jain, P. K., Abraham, A. & Gabralla, L. A. Epileptic seizure prediction based on hybrid seek optimization tuned ensemble classifier using EEG signals. Sensors 23(1), 423 (2023).
Article ADS Google Scholar
Rathor, S. & Agrawal, S. A robust model for domain recognition of acoustic communication using Bidirectional LSTM and deep neural network. Neural Comput. Appl. 33(17), 11223–11232 (2021).
Article Google Scholar
Sharma, H. & Srivastava, S. Visual question answering model based on the fusion of multimodal features by a two-way co-attention mechanism. Imaging Sci. J. 69(1–4), 177–189 (2022).
Google Scholar
Shen, C., Zhang, K. & Tang, J. A covid-19 detection algorithm using deep features and discrete social learning particle swarm optimization for edge computing devices. ACM Trans. Int. Technol 22(3), 1–17 (2021).
Article Google Scholar
Hua, X. et al. A novel method for ECG signal classification via one-dimensional convolutional neural network. Multimed. Syst. 28, 1–13 (2020).
Google Scholar
Rajinikanth, V., Kadry, S., Damaševičius, R., Sujitha, R. A., Balaji, G., & Mohammed, M. A. Glioma/Glioblastoma Detection in Brain MRI using Pre-Trained Deep-Learning Scheme. 2022 Third International Conference on Intelligent Computing Instrumentation and Control Technologies (ICICICT) 987–990 (IEEE, 2022).
Hermsen, M. et al. Convolutional neural networks for the evaluation of chronic and inflammatory lesions in kidney transplant biopsies. Am. J. Pathol. 192(10), 1418–1432 (2022).
Article CAS PubMed Google Scholar
Abubakar, A., Ugail, H., Smith, K. M., Bukar, A. M. & Elmahmudi, A. Burns depth assessment using deep learning features. J. Med. Biol. Eng. 40, 923–933 (2020).
Article Google Scholar
Abubakar, A., Ugail, H. & Bukar, A. M. Assessment of human skin burns: A deep transfer learning approach. J. Med. Biol. Eng. 40, 321–333 (2020).
Article Google Scholar
Abubakar, A., Ugail, H. & Bukar, A. M. Noninvasive assessment and classification of human skin burns using images of Caucasian and African patients. J. Electron. Imaging 29(4), 041002–041002 (2020).
ADS Google Scholar
Suha, S. A. & Sanam, T. F. A deep convolutional neural network-based approach for detecting burn severity from skin burn images. Mach. Learn. Appl. 9, 100371 (2022).
Google Scholar
Chauhan, J. & Goyal, P. BPBSAM: Body part-specific burn severity assessment model. Burns 46(6), 1407–1423 (2020).
Article PubMed Google Scholar
Pabitha, C. & Vanathi, B. Densemask RCNN: A hybrid model for skin burn image classification and severity grading. Neural Process. Lett. 53, 319–337 (2021).
Article Google Scholar
Khan, F. A. et al. Computer-aided diagnosis for burnt skin images using deep convolutional neural network. Multimed. Tools Appl. 79, 34545–34568 (2020).
Article Google Scholar
Wu, X., Chen, H., Wu, X., Wu, S. & Huang, J. Burn image recognition of medical images based on deep learning: From CNNs to advanced networks. Neural Process. Lett. 53, 2439–2456 (2021).
Article Google Scholar
Rostami, B. et al. multiclass wound image classification using an ensemble deep CNN-based classifier. Comput. Biol. Med. 134, 104536 (2021).
Article PubMed Google Scholar
Boissin, C. et al. Development and evaluation of deep learning algorithms for assessment of acute burns and the need for surgery. Sci. Rep. 13(1), 1794 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Anisuzzaman, D. M., Patel, Y., Niezgoda, J. A., Gopalakrishnan, S. & Yu, Z. A mobile app for wound localization using deep learning. IEEE Access 10, 61398–61409 (2022).
Article Google Scholar
Khani, M. E. et al. Accurate and early prediction of the wound healing outcome of burn injuries using the wavelet Shannon entropy of terahertz time-domain waveforms. J. Biomed. Opt. 27(11), 116001–116001 (2022).
Article ADS PubMed PubMed Central Google Scholar
Anisuzzaman, D. M. et al. Multi-modal wound classification using wound image and location by deep neural network. Sci. Rep. 12(1), 20057 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, S. et al. A deep learning model for burn depth classification using ultrasound imaging. J. Mech. Behav. Biomed. Mater. 125, 104930 (2022).
Article PubMed Google Scholar
Kumar, B. K. S., Anandakrishan, K. C., Sumant, M. & Jayaraman, S. Wound care: Wound management system. IEEE Access 11, 45301–45312. https://doi.org/10.1109/ACCESS.2023.3271011 (2023).
Article Google Scholar
Santos, E., Santos, F., Dallyson, J., Aires, K., Tavares, J. M. R., & Veras, R Diabetic Foot Ulcers Classification using a fine-tuned CNNs Ensemble. 2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS) 282–287 (IEEE, 2022).
Eldem, H., Ülker, E. & Işıklı, O. Y. Effects of training parameters of AlexNet architecture on wound image classification. Traitement du Signal 40(2), 811–817 (2023).
Article Google Scholar
Ahsan, M., Naz, S., Ahmad, R., Ehsan, H. & Sikandar, A. A deep learning approach for diabetic foot ulcer classification and recognition. Information 14(1), 36 (2023).
Article Google Scholar
Zhang, G. et al. Classification of lung nodules based on CT images using squeeze-and-excitation network and aggregated residual transformations. Radiol. Med. 125, 374–383 (2020).
Article PubMed Google Scholar
Learning, D. Deep learning. High-dimensional fuzzy clustering (2020).
Brekke, R. L., Almeland, S. K., Hufthammer, K. O. & Hansson, E. Agreement of clinical assessment of burn size and burn depth between referring hospitals and burn centres: A systematic review. Burns 49(3), 493–515 (2022).
Article PubMed Google Scholar
Ye, P., Tang, S., Li, B., Chen, T., & Ouyang, W. Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing. Preprint at arXiv:2210.04153 (2022).
Shafiq, M. & Gu, Z. Deep residual learning for image recognition: A survey. Appl. Sci. 12(18), 8972 (2022).
Article CAS Google Scholar
Zhang, X. et al. SAMS-Net: Fusion of attention mechanism and multi-scale features network for tumor infiltrating lymphocytes segmentation. Math. Biosci. Eng. 20(2), 2964–2979 (2023).
Article PubMed Google Scholar
Lei, B. et al. Skin lesion segmentation via generative adversarial networks with dual discriminators. Med. Image Anal. 64, 101716 (2020).
Article PubMed Google Scholar
Yadav, D. P., Sharma, A., Singh, M. & Goyal, A. Feature extraction based machine learning for human burn diagnosis from burn images. IEEE J. TranslEng. Health Med. 7, 1800507. https://doi.org/10.1109/JTEHM.2019.2923628 (2019).
Article CAS Google Scholar
Biomedical Image Processing (BIP) Group from the Signal Theory and Communications Department (University of Seville, SPAIN) and Virgen del Rocío Hospital (Seville, Spain). Burns BIP_US Database, accessed 5 January 2023. http://personal.us.es/rboloix/Burns_BIP_US_database.zip.
Wang, R. et al. Diagnostic accuracy of laser Doppler imaging for the assessment of burn depth: a meta-analysis and systematic review. J. Burn Care Res. 41(3), 619–625 (2020).
Article PubMed Google Scholar
Przekora, A. A concise review on tissue engineered artificial skin grafts for chronic wound treatment: can we reconstruct functional skin tissue in vitro?. Cells 9(7), 1622 (2020).
Article CAS PubMed PubMed Central Google Scholar
Abubakar, A., & Ugail, H. Discrimination of human skin burns using machine learning. Intelligent Computing: Proceedings of the 2019 Computing Conference, Vol. 1 641–647 (Springer, 2019).
Shin, H. et al. Sample-efficient deep learning techniques for burn severity assessment with limited data conditions. Appl. Sci. 12(14), 7317 (2022).
Article CAS Google Scholar
Rahman, M. A., Hasan, S. T., & Kader, M. A. Computer vision-based skin burn detection using a support vector machine (SVM). 2022 International Conference on Innovations in Science, Engineering, and Technology (ICISET) 233–238 (IEEE, 2022).
Butt, A. U. R., Ahmad,W., Ashraf, R., Asif, M., & Cheema, S. A. Computer Aided Diagnosis(CAD) for Segmentation and Classification of Burnt Human skin. 2019 International Conference onElectrical, Communication, and Computer Engineering (ICECCE). 1–5 (IEEE, 2019).
Chang, C. W. et al. Application of multiple deep learning models for automatic burn wound assessment. Burns 49(5), 1039–1051 (2023).
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering and Applications, GLA University, Mathura, India
D. P. Yadav & Ankit Kumar
Department College of Computer Sci. and Eng., University of Hafr Al-Batin, Hafar Al-Batin, 39524, Saudi Arabia
Turki Aljrees
Department of Computer Science, NIT Meghalaya, Shillong, India
Deepak Kumar
School of Computing, Graphic Era Hill University, Dehradun, 248002, India
Kamred Udham Singh
Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun, 248002, India
Teekam Singh

Authors

D. P. Yadav
View author publications
Search author on:PubMed Google Scholar
Turki Aljrees
View author publications
Search author on:PubMed Google Scholar
Deepak Kumar
View author publications
Search author on:PubMed Google Scholar
Ankit Kumar
View author publications
Search author on:PubMed Google Scholar
Kamred Udham Singh
View author publications
Search author on:PubMed Google Scholar
Teekam Singh
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, software, T.A.: methodology, validation, D.P.Y.: formal analysis, investigation, K.U.S.: data curation, writing—original draft preparation D.K.: writing—review and editing, A.K.: visualization and T.S.: supervision. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Ankit Kumar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yadav, D.P., Aljrees, T., Kumar, D. et al. Spatial attention-based residual network for human burn identification and classification. Sci Rep 13, 12516 (2023). https://doi.org/10.1038/s41598-023-39618-0

Download citation

Received: 24 April 2023
Accepted: 27 July 2023
Published: 02 August 2023
Version of record: 02 August 2023
DOI: https://doi.org/10.1038/s41598-023-39618-0

This article is cited by

Precision diagnosis of burn injuries using imaging and predictive modeling for clinical applications
- Pramod K. B. Rangaiah
- B P Pradeep kumar
- Robin Augustine
Scientific Reports (2025)
MCTFWC: a multiscale CNN-transformer fusion-based model for wound image classification
- Lalit Maurya
- Sarfaraj Mirza
Signal, Image and Video Processing (2025)
Dense Mesh RCNN: assessment of human skin burn and burn depth severity
- C. Pabitha
- B. Vanathi
The Journal of Supercomputing (2024)

Subjects

Abstract

Similar content being viewed by others

A lightweight Deeplab V3+ network integrating deep transitive transfer learning and attention mechanism for burned area identification

Precision diagnosis of burn injuries using imaging and predictive modeling for clinical applications

Development of a framework for managing severe burns through a 17-year retrospective analysis of burn epidemiology and outcomes

Introduction

Literature review

Proposed method

Residual attention module

Channel and spatial attention modules

Local response normalization

Ethical approval

Results

Dataset

The mathematical technique of model performance analysis

Classification of burn degrees (multiclass)

Burn’s graft and non-graft diagnosis (binary classification)

Discussion

Limitations

Conclusion

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Precision diagnosis of burn injuries using imaging and predictive modeling for clinical applications

MCTFWC: a multiscale CNN-transformer fusion-based model for wound image classification

Dense Mesh RCNN: assessment of human skin burn and burn depth severity

Search

Quick links