Abstract
Population-based diabetic retinopathy (DR) screening requires diagnostic strategies that optimize clinical utility by balancing missed disease against referral burden. We performed a Preferred Reporting Items for Systematic Reviews and Meta-Analyses of Diagnostic Test Accuracy studies (PRISMA-DTA)-guided systematic review and meta-analysis comparing autonomous artificial intelligence (AI) screening with store-and-forward (SAF) or conventional image-based teleophthalmology pathways, using manual, expert, or reading-center grading as the reference standard, across any DR, referable DR (RDR), vision-threatening DR (VTDR), and diabetic macular edema (DME). Twenty-eight diagnostic accuracy studies were included. AI showed higher pooled sensitivity than SAF for any DR (86.9% vs 80.9%), RDR (96.2% vs 88.6%), VTDR (96.2% vs 84.2%), and DME (97.2% vs 87.4%). AI also showed higher pooled specificity for any DR, RDR, and VTDR, whereas DME specificity was similar between pathways. Translating operating characteristics into decision consequences demonstrated that pathway preference depends on prevalence, decision thresholds, and misclassification weighting: at 15% prevalence, AI yielded higher net benefit (140.7 vs 120.8 net true-positive decisions per 1000 screened at pā = 0.10). These findings support pathway-specific deployment strategies rather than direct superiority claims.
Similar content being viewed by others
Acknowledgements
This work was supported by the National Science and Technology Council Taiwan under grant numbers NSTC-114-2314-B-567-002 and NSTC-113-2314-B-567-001 and by Cardinal Tien Hospital under grant numbers CTH-110A-2217, CTH-112A-2215 and CTH-113A-NDMC-2233.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisherās note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, KY., Chan, HC. & Chan, CM. Clinical setting-dependent diagnostic accuracy of artificial intelligence and store-and-forward diabetic retinopathy screening: a systematic review and meta-analysis. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02627-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41746-026-02627-0


