Abstract
Diabetes mellitus remains one of the most widespread and burdensome chronic diseases worldwide, yet invasive assays and high costs constrain early detection. Existing machine-learning studies often reduce diagnosis to a binary task and overlook the clinically important pre-diabetic stage; additionally, many deep models act as uninterpretable “black boxes”. To address these gaps, we propose ProgMDD, an interpretable progressive residual network for multiclass diabetes diagnosis using routine clinical biomarkers. Employing a strict, leakage-free pipeline, LASSO-based feature selection and resampling were applied exclusively to the training set, yielding a compact, robust input panel. After comparing PCA, t-SNE, and UMAP, we selected UMAP for visualization because it optimally balances global and local structure to illustrate progressive class separation. ProgMDD integrates a progressive residual architecture with channel attention and multi-level regularization to enhance feature learning. Rigorously compared against multiple baselines, ProgMDD achieved 97.02% mean accuracy under 5-fold cross-validation, reinforced by a 97.59% accuracy on the purely original, imbalanced hold-out test set and supported by multiple ablation studies. The concordance between LASSO and SHAP rankings supports biological plausibility and model transparency. By uniting interpretable deep learning with low-cost clinical data, ProgMDD furnishes a feasible approach for early screening and risk stratification in primary care, providing a transferable methodological paradigm for other chronic-disease prediction tasks.
Similar content being viewed by others
Acknowledgements
This work was supported by the National Natural Science Foundation of China (No. 62562051), the National Science Foundation of Ningxia (No. 2025AAC020023), and the College Students’ Innovation and Entrepreneurship Training Program (No. S202510749056). We also thank Ningxia University for providing computational re-sources and the providers of the Iraqi diabetes dataset for the clinical data used in this study.
Funding
This work was supported in part by the National Natural Science Foundation of China under Grant 62562051; and in part by the National Science Foundation of Ningxia under Grant 2025AAC020023; and in part by the College Students’ Innovation and Entrepreneurship Training Program Grant S202510749056.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical statement
This study utilized a de-identified, public dataset from Mendeley Data. All data were anonymized, and further ethical approval was waived for this retrospective analysis of secondary data according to the Declaration of Helsinki.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fan, H., Li, Z., Yan, N. et al. An interpretable progressive residual network for automated multiclass diabetes diagnosis. Sci Rep (2026). https://doi.org/10.1038/s41598-026-51603-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-51603-x


