Skin disease diagnostics through federated transfer learning on heterogeneous data

Sharma, Shikha; Mittal, Ruchi; Goyal, Nitin; Goyal, S. B.; Verma, Chaman

doi:10.1038/s41598-025-31730-7

Article
Open access
Published: 15 January 2026

Scientific Reports volume 16, Article number: 1991 (2026)

Abstract

Skin diseases frequently cause mental and physical distress and are a major global health concern. Early detection is crucial to successful treatment, yet accurate diagnosis remains a challenge even for dermatologists. Diagnostic accuracy could be significantly enhanced using methods such as machine learning (ML) and deep learning (DL). However, these models require substantial datasets to make accurate predictions, while healthcare providers frequently encounter data shortages and privacy regulations restrict data sharing. This work presents a privacy-preserving federated transfer learning approach for diagnosing skin diseases that incorporates four key strategies to enhance effectiveness. First, transfer learning is used to train a model with a dense neural network (DNN) for skin disease detection. Second, feature extraction is performed using pre-trained architectures, with a DNN used for classification. Third, federated learning (FL) replaces transfer learning to train the model across distributed nodes, with the DNN used for disease detection. Fourth, FL is combined with transfer learning to build a cohesive ecosystem in which data privacy is maintained. Model performance was validated on both IID and non-IID datasets, with the proposed feature extraction with FL model achieving cross-validation accuracy of 99.528% and 99.689% on the IID and non-IID datasets, respectively. The results indicate that feature extraction with FL can produce efficient, lightweight models that are well suited to resource-constrained devices, while ensemble learning enhances edge device performance, offering a powerful and privacy-preserving solution for skin disease diagnosis in modern healthcare.

Introduction

Globally, millions of people of all ages and demographics suffer from skin problems. Skin ailments range from eczema, psoriasis, and acne to melanoma and other skin malignancies¹. Chronic illnesses such as psoriasis can cause physical discomfort, emotional suffering, and social isolation². Non-fatal skin diseases account for a large portion of global healthcare costs. The scarcity of dermatologists in many places delays diagnoses and worsens patient outcomes³. Skin illnesses can indicate underlying health problems, so early and precise diagnosis is crucial to preserving patient health and possibly detecting additional systemic diseases⁴. Dermatologists directly examine lesions, pigmentation, and texture changes to diagnose skin illnesses⁵,⁶. Artificial intelligence (AI) based techniques that analyze large datasets of skin images and find disease patterns are also improving diagnostic accuracy⁷. Despite these technological advances, such equipment and technical expertise are scarce, especially in low-resource areas⁸. In dermatology, virtual and real-time diagnosis of skin conditions is now possible through advanced digital tools⁹,¹⁰. Patients benefit from quick assessments, and teledermatology consultation improves the accessibility of dermatological care¹⁰. Continuous observation allows for personalized treatment adjustments, improving patient outcomes and adherence¹¹. Additionally, AI models can analyze patient data to detect early skin abnormalities and potentially identify skin cancers or other serious conditions¹¹,¹². However, as these digital healthcare ecosystems expand, concerns about data security and privacy become increasingly significant, particularly in dermatology, where sensitive medical data is transmitted and stored¹².

Medical imaging and diagnosis involve capturing and sharing sensitive health data across platforms, making data privacy a serious problem¹³. Medical images used in dermatology contain visual data about skin problems and information that could reveal patients' identities if privacy protections are insufficient. Centralized storage systems, which hold patient data from numerous sources, are particularly vulnerable to hackers, threatening patient privacy and confidence in digital healthcare systems¹⁴. Federated learning (FL) allows data to be used in a decentralized manner on local devices while keeping it secure, so that a shared model can be improved without transferring patient data¹⁵. To prevent data leaks during training, FL requires strong encryption and secure aggregation. These requirements make it harder to balance data utility and privacy, since models need enough data to be clinically useful without violating patient privacy¹⁶. FL and transfer learning have become popular in medical applications because they address data privacy, limited resources, and model adaptability¹⁷. FL makes it possible to train machine learning (ML) and deep learning (DL) models on dispersed datasets, such as those held on medical servers, without the need for centralized collection¹⁸. Transfer learning allows models pre-trained on huge, publicly available datasets to be tailored to specific medical applications with less task-specific data¹⁹, and lets models adapt to diverse healthcare domains such as dermatology and radiology. Combined with FL, transfer learning can improve medical diagnostic accuracy by using information from many data sources, even in resource-limited medical environments²⁰. These methods promise to improve model performance while protecting privacy and managing data scarcity, enabling ethical and practical AI use in healthcare.
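
The aggregation step that lets a shared model improve without patient data leaving each site can be illustrated with a minimal sketch. It assumes the widely used federated averaging (FedAvg) rule, in which the server averages client parameters weighted by local sample counts; the text above does not state which aggregation rule the proposed model uses, and the function name `fed_avg`, the flat parameter lists, and the three example clinics are hypothetical.

```python
from typing import List, Sequence


def fed_avg(client_weights: Sequence[Sequence[float]],
            client_sizes: Sequence[int]) -> List[float]:
    """Average client parameter vectors, weighted by local sample counts."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need at least one client and one sample count per client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")
    dim = len(client_weights[0])
    if any(len(w) != dim for w in client_weights):
        raise ValueError("all clients must send vectors of the same length")

    global_weights = [0.0] * dim
    for weights, n_samples in zip(client_weights, client_sizes):
        share = n_samples / total  # this client's fraction of all training samples
        for i, value in enumerate(weights):
            global_weights[i] += share * value
    return global_weights


# Hypothetical round: three clinics send locally trained parameters, never images.
client_weights = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
client_sizes = [100, 100, 200]
global_weights = fed_avg(client_weights, client_sizes)
print(global_weights)
```

In a full FL round the server would send `global_weights` back to every client for the next round of local training. Secure aggregation and encryption, which the paragraph above identifies as necessary, are deliberately left out of this sketch.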
FL models with decentralized data are motivated by the privacy concerns of the traditional ML/DL techniques discussed above. Each local model is trained on its own local data, which prevents sensitive information from being shared across a server network.

The rest of this paper is organized as follows. The literature on skin disease diagnosis using ML/DL techniques is reviewed in Section "Related work". The proposed model for diagnosing skin diseases using transfer learning, pre-trained feature extraction models, federated feature extraction, and federated transfer learning is presented in Section "Methodology skin disease diagnosis". The experimental setup and a comparison of the results of the skin disease detection models are described in Sections "Results analysis" and "Discussion". The conclusion and future scope are discussed in Section "Conclusion".

Related work

By handling visual complexity and model generalization through image augmentation, a convolutional neural network (CNN) can be trained on a more diverse dataset that better captures the variability of skin conditions²¹. The model's accuracy of 86% and recall of 81% across seven disease classes show that it can recognize the features of skin disorders. The FL framework²² aggregates predictions without sharing sensitive data. An FL architecture with differential privacy enables cooperative, decentralized model training without transferring confidential patient data to central servers²³. An implementation on Amazon's AWS cloud platform showed ease of use and scalability²⁴, which benefits mobile health diagnostics. A hybrid model combining a CNN with an optimization module²⁵ is used to improve gesture recognition. FL pre-trains the hybrid model without revealing sensitive surface electromyography (sEMG) data, and transfer learning then fine-tunes the model for each subject based on that subject's features. According to the experimental results, this approach improves recognition accuracy by 12.01% over a conventional FL model and by 28.52% over local training, overcoming data shortage while prioritizing privacy. In another scheme, FL is used to train a global model, and encrypted parameters are shared via a permissioned blockchain to address privacy and trust issues²⁶. According to the reported results, the scheme outperforms baseline models in segmentation by 19.08% in Hausdorff distance for whole tumors and by 1.99% in Dice similarity coefficient for enhancing tumors. Local devices run computations on their own datasets without transferring sensitive health data, which addresses privacy concerns²⁷. Radar-based heartbeat and activity monitoring is implemented using federated multi-task transfer learning²⁸. FedRadar outperforms locally trained models in heart rate prediction and activity recognition on real radar datasets by 2.8% and 2.5%, respectively. FL with decentralized data storage improves the detection rate²⁹. A data balancing strategy improves classifier performance and achieves 95% accuracy by correcting the dataset's class imbalance. FRESH is a smart healthcare architecture that combines FL with ring signatures to safeguard against privacy attacks³⁰. Its modified batch verification takes advantage of the additivity of linear operations on elliptic curves to ease the server's processing load.

Review summary

The literature review (Table 1) of DL techniques highlights the problems of using FL for skin disease diagnosis²¹⁻³⁰. The inherent non-IID distribution and data imbalance in skin disease datasets are significant issues. Patients from different demographic groups, geographical locations, and healthcare facilities have varying disease frequencies and image features, which leads to biased models that do not generalize well to other populations. Threats to security and privacy are another significant obstacle. In a medical context, protecting the security and confidentiality of patient information is essential. The FL system³¹, which uses a dataset of over 10,000 photos and decentralized data, initially demonstrates an overall accuracy of around 79% in the classification of skin disorders. In another study, a CNN³² classifies four categories of skin disease, with its parameters optimized through hyper-parameter tuning. Even though FL is decentralized, sensitive patient data, such as images of skin lesions, is still susceptible to reconstruction or inference attacks during model updates. The varied nature of medical imaging data, which can unintentionally expose identifying characteristics, increases this danger²¹,²³.
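
To make the non-IID issue concrete, the sketch below shows one common way to simulate label-skewed clients from a single labelled dataset: for each class, client shares are drawn from a Dirichlet distribution, so a small concentration parameter gives each simulated site a very different disease mix. This is an illustrative assumption and not necessarily how the IID and non-IID datasets in this paper were built; the function name `dirichlet_partition`, the parameter `alpha`, and the toy labels are hypothetical.

```python
import random
from collections import defaultdict
from typing import List, Sequence


def dirichlet_partition(labels: Sequence[int], num_clients: int,
                        alpha: float, seed: int = 0) -> List[List[int]]:
    """Split sample indices across clients with Dirichlet-skewed class shares."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    clients: List[List[int]] = [[] for _ in range(num_clients)]
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # A Dirichlet(alpha) draw is a vector of Gamma(alpha, 1) draws, normalised.
        gammas = [rng.gammavariate(alpha, 1.0) for _ in range(num_clients)]
        total = sum(gammas)
        if total > 0:
            shares = [g / total for g in gammas]
        else:  # degenerate draw: fall back to equal shares
            shares = [1.0 / num_clients] * num_clients

        start, cumulative = 0, 0.0
        for c in range(num_clients):
            cumulative += shares[c]
            if c == num_clients - 1:
                end = len(indices)  # last client takes whatever remains
            else:
                end = min(max(round(cumulative * len(indices)), start), len(indices))
            clients[c].extend(indices[start:end])
            start = end
    return clients


# Toy data: 200 samples over 4 hypothetical disease classes, split across 5 sites.
labels = [i % 4 for i in range(200)]
parts = dirichlet_partition(labels, num_clients=5, alpha=0.3, seed=42)
for client_id, idxs in enumerate(parts):
    counts = [sum(1 for i in idxs if labels[i] == k) for k in range(4)]
    print(client_id, counts)
```

A large `alpha` makes every client's class mix approach the overall distribution, which approximates the IID case, while a small `alpha` concentrates each class on a few clients.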

Table 1 Research gap summary of existing FL frameworks for disease diagnosis.
