Abstract
In the context of an increasing need for clinical assessments of foundation models, we developed EyeFM, a multimodal vision–language eyecare copilot, and conducted a multifaceted evaluation comprising retrospective validations, a multicountry efficacy validation as a clinical copilot and a double-masked randomized controlled trial (RCT). EyeFM was pretrained on 14.5 million ocular images from five imaging modalities paired with clinical texts from global, multiethnic datasets. The efficacy validation involved 44 ophthalmologists across North America, Europe, Asia and Africa in primary and specialty care settings, highlighting its utility as a clinical copilot. The RCT, a parallel, single-center, double-masked study, assessed EyeFM as a clinical copilot in retinal disease screening among a high-risk population in China. A total of 668 participants (mean age 57.5 years, 79.5% male) were randomized to 16 ophthalmologists, who were equally allocated into intervention (with the EyeFM copilot) and control (standard care) groups. The primary endpoint analysis indicated that ophthalmologists with the EyeFM copilot achieved a higher correct diagnosis rate (92.2% versus 75.4%, P < 0.001) and correct referral rate (92.2% versus 80.5%, P < 0.001). A secondary outcome indicated an improved standardization score of clinical reports (median 33 versus 37, P < 0.001). Participant satisfaction with the screening was similar between groups, whereas the intervention group demonstrated higher compliance with self-management (70.1% versus 49.1%, P < 0.001) and with referral suggestions (33.7% versus 20.2%, P < 0.001) at follow-up. Post-deployment evaluations indicated strong user acceptance. Our study provides evidence that implementing the EyeFM copilot can improve the performance of ophthalmologists and the outcomes of patients. Chinese Clinical Trial Registry registration: ChiCTR2500095518.
Data availability
For reproduction of our algorithm code, we have deposited a minimum dataset (https://zenodo.org/records/15546254; ref. 41), which is publicly available for scientific research and non-commercial use. The data supporting the findings of this trial are available within the paper and its supplementary information files. All requests for further data sharing will be reviewed by the data management committees of the participating institutions and by the ethics committee of the Shanghai Health and Medical Centre, China, to verify whether the request is subject to any intellectual property or confidentiality obligations; data approved for sharing will be accessible with informed consent. Requests for access to deidentified individual-level data from this trial can be submitted via email to B.S. (shengbin@sjtu.edu.cn) with detailed proposals for approval; they will be evaluated on a case-by-case basis and responded to within 60 days. Investigators who consent to the terms of the data transfer agreement, including, but not limited to, using these data only for academic purposes, protecting the confidentiality of the data and limiting the possibility of patient identification, will be granted access. Source data are provided with this paper.
Code availability
The code used in the current study to develop the algorithm is available at https://github.com/eyefm/EyeFM.
References
Zhou, Y. et al. A foundation model for generalizable disease detection from retinal images. Nature 622, 156–163 (2023).
Chen, R. J. et al. Towards a general-purpose foundation model for computational pathology. Nat. Med. 30, 850–862 (2024).
Lu, M. Y. et al. A visual-language foundation model for computational pathology. Nat. Med. 30, 863–874 (2024).
Topol, E. J. High-performance medicine: the convergence of human and artificial intelligence. Nat. Med. 25, 44–56 (2019).
Tanno, R. et al. Collaboration between clinicians and vision–language models in radiology report generation. Nat. Med. 31, 599–608 (2025).
Norden, J. G. & Shah, N. R. What AI in health care can learn from the long road to autonomous vehicles. NEJM Catalyst https://catalyst.nejm.org/doi/abs/10.1056/CAT.21.0458 (2022).
You, J. G., Hernandez-Boussard, T., Pfeffer, M. A., Landman, A. & Mishuris, R. G. Clinical trials informed framework for real world clinical implementation and deployment of artificial intelligence applications. NPJ Digit. Med. 8, 107 (2025).
Longhurst, C. A., Singh, K., Chopra, A., Atreja, A. & Brownstein, J. S. A call for artificial intelligence implementation science centers to evaluate clinical effectiveness. NEJM AI https://doi.org/10.1056/AIp2400223 (2024).
Gupta, A., Savarese, S., Ganguli, S. & Fei-Fei, L. Embodied intelligence via learning and evolution. Nat. Commun. 12, 5721 (2021).
Colunga-Lozano, L. E. et al. Clinical judgment shows similar and sometimes superior discrimination compared to prognostic clinical prediction models: a systematic review. J. Clin. Epidemiol. 165, 111200 (2024).
Bommasani, R. et al. On the opportunities and risks of foundation models. Preprint at https://arxiv.org/abs/2108.07258 (2022).
Howell, M. D., Corrado, G. S. & DeSalvo, K. B. Three epochs of artificial intelligence in health care. JAMA 331, 242–244 (2024).
Lu, M. Y. et al. A multimodal generative AI copilot for human pathology. Nature 634, 466–473 (2024).
Future of Health: The Emerging Landscape of Augmented Intelligence in Health Care. https://www.ama-assn.org/system/files/future-health-augmented-intelligence-health-care.pdf (American Medical Association, 2024).
Yang, J. et al. Generalizability assessment of AI models across hospitals in a low-middle and high income country. Nat. Commun. 15, 8270 (2024).
Avram, O. et al. Accurate prediction of disease-risk factors from volumetric medical scans by a deep vision model pre-trained with 2D scans. Nat. Biomed. Eng. 9, 507–520 (2024).
Street, A., Kersaudy Kerhoas, M. & Ndlovu, Z. From equitable access to equitable innovation: rethinking bioengineering for global health. Nat. Rev. Bioeng. 2, 444–446 (2024).
Matheny, M. E., Whicher, D. & Thadaney Israni, S. Artificial intelligence in health care: a report from the National Academy of Medicine. JAMA 323, 509–510 (2020).
van de Sande, D. et al. To warrant clinical adoption AI models require a multi-faceted implementation evaluation. NPJ Digit. Med. 7, 58 (2024).
Hadziahmetovic, M., Nicholas, P., Jindal, S., Mettu, P. S. & Cousins, S. W. Evaluation of a remote diagnosis imaging model vs dilated eye examination in referable macular degeneration. JAMA Ophthalmol. 137, 802–808 (2019).
Liu, H., Li, C., Wu, Q. & Lee, Y. J. Visual instruction tuning. In Advances in Neural Information Processing Systems 36 (eds Oh, A. et al.) https://papers.nips.cc/paper_files/paper/2023/file/6dcf277ea32ce3288914faf369fe6de0-Paper-Conference.pdf (Curran Associates, 2023).
Touvron, H. et al. Llama 2: open foundation and fine-tuned chat models. Preprint at https://arxiv.org/abs/2307.09288 (2023).
Bachmann, R., Mizrahi, D., Atanov, A. & Zamir, A. MultiMAE: multi-modal multi-task masked autoencoders. In Computer Vision – ECCV 2022 (eds Avidan, S. et al.) 348–367 (Springer-Verlag, 2022).
Rafailov, R. et al. Direct preference optimization: your language model is secretly a reward model. In Proc. of the 37th International Conference on Neural Information Processing Systems (eds Oh, A. et al.) 53728–53741 (Curran Associates, 2023).
McMahan, B., Moore, E., Ramage, D., Hampson, S. & Arcas, B. A. Y. Communication-efficient learning of deep networks from decentralized data. In Proc. of the 20th International Conference on Artificial Intelligence and Statistics (eds Singh, A. & Zhu, J.) 1273–1282 (PMLR, 2017).
Chen, X. et al. FFA-GPT: an automated pipeline for fundus fluorescein angiography interpretation and question-answer. NPJ Digit. Med. 7, 111 (2024).
Singhal, K. et al. Toward expert-level medical question answering with large language models. Nat. Med. 31, 943–950 (2025).
McDuff, D. et al. Towards accurate differential diagnosis with large language models. Nature 642, 451–457 (2025).
Ting, D. S. W. et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes. JAMA 318, 2211–2223 (2017).
Wang, W. et al. Learning two-stream CNN for multi-modal age-related macular degeneration categorization. IEEE J. Biomed. Health Inform. 26, 4111–4122 (2022).
He, M. et al. Prevalence and clinical characteristics of glaucoma in adult Chinese: a population-based study in Liwan District, Guangzhou. Invest. Ophthalmol. Vis. Sci. 47, 2782–2788 (2006).
Liu, X. et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. Nat. Med. 26, 1364–1374 (2020).
Bourne, R. et al. Trends in prevalence of blindness and distance and near vision impairment over 30 years: an analysis for the Global Burden of Disease Study. Lancet Glob. Health 9, e130–e143 (2021).
Trott, M. et al. Eye disease and mortality, cognition, disease, and modifiable risk factors: an umbrella review of meta-analyses of observational studies. Eye 36, 369–378 (2022).
Xiong, K., Mao, H., Zhang, Q., Lei, C. & Liang, Y. Associations between vision impairment and multimorbidity among older Chinese adults: results from the China health and retirement longitudinal study. BMC Geriatr. 23, 688 (2023).
Zheng, D. D. et al. Patterns of chronic conditions and their association with visual impairment and health care use. JAMA Ophthalmol. 138, 387–394 (2020).
Holden, B. A. et al. Global prevalence of myopia and high myopia and temporal trends from 2000 through 2050. Ophthalmology 123, 1036–1042 (2016).
Singhal, K. et al. Large language models encode clinical knowledge. Nature 620, 172–180 (2023).
Lewis, J. R. & Sauro, J. in Human Centered Design (ed. Kurosu, M.) 94–103 (Springer, 2009).
Hillis, S. L. & Soh, B. P. Obuchowski-Rockette analysis for multi-reader multi-case (MRMC) readers-nested-in-test study design with unequal numbers of readers. Proc. SPIE Int. Soc. Opt. Eng. 12467, 124670F (2023).
EyeFM Study Group. EyeFM sample dataset. Zenodo https://zenodo.org/records/15546254 (2025).
Acknowledgements
We thank H. Li and Z. Li for creating the illustrations and icons. This study was supported by the National Key R&D Program of China (2022YFC2502800), the National Natural Science Foundation of China (82388101) and the Beijing Natural Science Foundation (IS23096) to T.Y.W.; the National Natural Science Foundation of China (62272298), the Noncommunicable Chronic Diseases-National Science and Technology Major Project (2023ZD0509202 and 2023ZD0509201) and the National Key Research and Development Program of China (2022YFC2407000) to B.S.; and the Noncommunicable Chronic Diseases-National Science and Technology Major Project (2023ZD0509202 and 2023ZD0509201), the Clinical Special Program of Shanghai Municipal Health Commission (20224044) and the Three-Year Action Plan to Strengthen the Construction of the Public Health System in Shanghai (2023–2025, GWVI-11.1-28) to T.C. These funders/sponsors had no role in the design or conduct of the study.
Author information
Authors and Affiliations
Consortia
Contributions
T.Y.W. and B.S. conceived and supervised the project. T.Y.W., B.S., Yilan Wu, Z.G., D.Z. and Y.F.Z. designed the study. B. Qian, Y.Q. and P.Z. designed the deep learning algorithm and the computational framework. T.Y.W., Yilan Wu, B. Qian, T.L., Y.Q., Z.G., D.Z. and Y.F.Z. contributed to the initial drafting of the manuscript. Y.J., P.Z., Y.Z., Q.P., C.Y., J.S., A.G., M.G.-B., M.G., A.S., W.S., L.Z. and You Wu helped with data collection. S.M., R.R., B.S.T., J.A.O.Ñ., T.A.K., H.L., Y.J., A.R.R., D.Y., Z.M., D.W., Y.C., W.Y., R.D., X. Zhao, C.Z., X.W., Y.C., Q.W., H.X., S.K.H.S., J.Y.Y.C., V.T.T.C., H.-T.X., R.W., J.L., Shan Lin, Z.X., N.G., J.E., A.L., F.D., M.A., P.C., T.A.M., Y.H., Y.Z., Shiqun Lin, X.B., J.W., X.Y., H.Z., Y.L., B. Qu, H.Y., M.G., M.Z., W.S., L.M.S., F.P., B.S.S., A.A.T., C.E.N.M., P.V., D.S., A.K.T., D.B., U.K., A.K., T.I., P.L.P.W., M.J.A., N.N.A. and I.E.-T. participated in prospective validations. T.C., X. Zhang, Y.H., X.B., J.W., X.Y., H.Z. and Y.L. conducted the data collection and analysis in the RCT. J.G., P.R., S.S., P.A.K., L.-L.L., C.Y.C., G.S.W.T., Y.X.W., Y.-C.T., C.-Y.C., Y.F.Z., B.S. and T.Y.W. contributed to collaboration organization and provided critical revision of the manuscript for important intellectual content. All authors provided critical comments and reviewed the manuscript. All authors discussed the results and approved the final version before submission.
Corresponding authors
Ethics declarations
Competing interests
Y.J. is a patent holder of Optovue/Visionix, Inc., Optos plc and Genentech, Inc. She receives financial support from Genentech, Inc., and she receives financial compensation from Optovue/Visionix, Inc. and Genentech, Inc. P.A.K. is a co-founder of Cascader Ltd., has acted as a consultant for Retina Consultants of America, Roche, Boehringer Ingelheim and Bitfount and is an equity owner in Big Picture Medical. He has received speaker fees from Zeiss, Thea, Apellis and Roche. He has received travel support from Bayer and Roche. He has attended advisory boards for Topcon, Bayer, Boehringer Ingelheim and Roche. T.Y.W. is a consultant for AbbVie Pte Ltd., Aldropika Therapeutics, Bayer, Boehringer Ingelheim, Zeiss, Genentech, Iveric Bio, Novartis, Opthea Limited, Plano, Quaerite Biopharm Research Ltd., Roche, Sanofi and Shanghai Henlius. He is an inventor, holds patents and is a co-founder of start-up companies EyRiS and Visre, which have interests in, and develop digital solutions for, eye diseases. All potential conflicts of interest for consultancy, advisory boards and positions in the start-up companies and financial remuneration, if any, are managed by institutional policies under SingHealth and Tsinghua University. The other authors declare no competing interests.
Peer review
Peer review information
Nature Medicine thanks Tae Keun Yoo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Lorenzo Righetto, in collaboration with the Nature Medicine team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Envisioned use of EyeFM as a whole-workflow copilot for eyecare across different clinical settings.
For patients attending ocular disease screening in primary care centres, where only low-cost examination techniques are available, EyeFM can use its single-modality and cross-modality abilities to assist disease detection, followed by screening report writing assisted by its vision question answering ability. Some patients are then referred to specialty care settings for further examination, where EyeFM can use its integrated-modality disease detection or even its zero-shot ability to facilitate further diagnosis. The image-report writing and vision question answering abilities can also improve the efficiency of drafting clinical reports and patient letters in specialty care settings.
Extended Data Fig. 2 Structural diagram of human-knowledge encoding in the pretraining and application phases of EyeFM.
a) Diagram of the pretraining process for EyeFM. The image encoder of EyeFM was first pretrained on images from five modalities, followed by vision–language joint pretraining. The image module includes one encoder and five decoders. The encoder comprises 24 Transformer blocks; each decoder comprises two Transformer blocks. The linear projection layer is implemented with a single convolutional layer. In the vision–language module, the projection is implemented as a single linear layer, which connects the image encoder with the language module. The language module is based on the LLaMA 2 architecture with 7 billion parameters. b) Diagram of the human-in-the-loop process for EyeFM, which uses direct preference optimization (DPO) and federated learning for distributed knowledge evolution.
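As a concrete reading of panel a, the following is a minimal PyTorch sketch of the described wiring: a shared 24-block Transformer image encoder, five two-block per-modality decoders, a convolutional patch-embedding (linear projection) layer and a single linear projection into a LLaMA 2 (7B)-sized hidden space. All class and variable names, the 1,024-dimensional encoder width and the modality labels are our assumptions for illustration, not taken from the released EyeFM code.

```python
# Hypothetical sketch of the Extended Data Fig. 2a architecture; names,
# dimensions and modality labels are illustrative assumptions.
import torch
import torch.nn as nn

MODALITIES = ["cfp", "oct", "eep", "ffa", "slit_lamp"]  # assumed labels for the five modalities

class EyeFMSkeleton(nn.Module):
    def __init__(self, dim: int = 1024, lm_dim: int = 4096, patch: int = 16):
        super().__init__()
        # Linear projection layer implemented as a single convolution (patch embedding).
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        # Shared image encoder: 24 Transformer blocks.
        block = nn.TransformerEncoderLayer(d_model=dim, nhead=16, batch_first=True)
        self.encoder = nn.TransformerEncoder(block, num_layers=24)
        # One lightweight decoder (two Transformer blocks) per imaging modality.
        self.decoders = nn.ModuleDict({
            m: nn.TransformerEncoder(
                nn.TransformerEncoderLayer(d_model=dim, nhead=16, batch_first=True),
                num_layers=2,
            )
            for m in MODALITIES
        })
        # Single linear projection bridging the image encoder and the
        # LLaMA 2-based language module (hidden size 4,096 for the 7B model).
        self.vl_projection = nn.Linear(dim, lm_dim)

    def forward(self, image: torch.Tensor, modality: str):
        tokens = self.patch_embed(image).flatten(2).transpose(1, 2)  # (B, N, dim)
        latent = self.encoder(tokens)
        recon = self.decoders[modality](latent)   # modality-specific decoding path
        lm_tokens = self.vl_projection(latent)    # visual tokens handed to the language module
        return recon, lm_tokens
```

A forward pass such as EyeFMSkeleton()(torch.randn(1, 3, 224, 224), "cfp") returns the modality-specific reconstruction tokens alongside the projected visual tokens for the language module, mirroring the two pretraining stages described above.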
Extended Data Fig. 3 Glossary table and the clinical tasks related to each validation experiment.
First, we conducted retrospective validations, comparing EyeFM with prior benchmarks of medical foundation models. This step serves as the foundation for evaluating the model's performance and safety when progressing to clinical applications. Second, we prospectively conducted reader studies and a real-world study to test the efficiency of EyeFM as a clinical copilot assisting ophthalmologists. This step bridged the gap between the model's standalone performance and its efficiency when applied by clinicians. Finally, we validated EyeFM in a randomised controlled trial (RCT). Coloured chart, above: validation experiments and their correspondence with the functions of EyeFM. The x axis represents different tasks of EyeFM and the y axis represents validation experiments. Cells are coloured if the experiment validated the corresponding function. Below: glossary table of the tasks, clinical scenarios and experiments. RCT, randomised controlled trial.
Extended Data Fig. 4 Experiment 1 – Retrospective validation of EyeFM on multi-ethnic datasets.
a) For disease detection on CFP, the sample sizes and P values are: DR (n = 1501, P = 0.042), glaucoma suspect (n = 405, P = 0.533), AMD suspect (n = 370, P = 0.627) and MMD (n = 643, P = 0.030). For disease detection on OCT, the sample sizes and P values are: ciDME (n = 523, P = 0.002), glaucoma (n = 412, P = 0.333) and AMD (n = 379, P = 0.036). The sample size for cataract detection on external eye photos was 198 and the P value was 0.102. Error bars represent 95% CI. b) Segmentation Dice similarity coefficient. The sample sizes for segmentation on CFP are: HE (n = 13), SE (n = 14), HM (n = 27) and MA (n = 27). The P value for haemorrhages was 0.083. The sample size was 759 for OCT segmentation. Error bars represent 95% CI. c) Cross-modality detection of ciDME, which usually needs to be diagnosed by OCT, with CFP inputs only (left); the sample size was 405 and the P value was <0.001. Cross-modality detection of wet AMD, which usually needs to be diagnosed by CFP, with external eye photo inputs only (right); the sample size was 332 and the P value was 0.583. Boxes indicate quartile values and whiskers indicate 1.5× the interquartile range. d) Image-report generation; model performance was evaluated by the automatic metrics labelled on the x axis. The sample size was 500. Boxes indicate quartile values and whiskers indicate 1.5× the interquartile range. e) Head-to-head comparison of answers generated by EyeFM and ophthalmologists; the measurement was the summed score for quality, safety and empathy, ranging from 3 to 15, presented as a kernel density plot. The sample size was 300 for EyeFM and 1200 for ophthalmologists. P values were calculated with two-sided t-tests between EyeFM and the better-performing reference model. *** denotes P < 0.001; n.s. represents P > 0.05. CFP, colour fundus photo; OCT, optical coherence tomography; EEP, external eye photo; DR, diabetic retinopathy; DME, diabetic macular oedema; AMD, age-related macular degeneration; MMD, myopic macular degeneration; MA, microaneurysms; HE, hard exudates; HM, haemorrhages; SE, soft exudates.
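To make the two statistics used throughout this figure concrete, below is a minimal sketch in Python using placeholder arrays rather than study data; the function and variable names are ours, and the two-sided t-test mirrors the comparison against the better-performing reference model described above.

```python
# Minimal sketch of the figure's statistics, with placeholder data only.
import numpy as np
from scipy import stats

def dice_coefficient(pred: np.ndarray, target: np.ndarray, eps: float = 1e-8) -> float:
    """Dice similarity coefficient between two binary segmentation masks."""
    pred, target = pred.astype(bool), target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return 2.0 * intersection / (pred.sum() + target.sum() + eps)

# Placeholder per-case scores for EyeFM and the better-performing reference
# model (illustrative numbers, not from the study).
rng = np.random.default_rng(0)
scores_eyefm = rng.normal(0.85, 0.05, size=100)
scores_reference = rng.normal(0.80, 0.05, size=100)

# Two-sided t-test between the two models, as used for the panel P values.
t_stat, p_value = stats.ttest_ind(scores_eyefm, scores_reference)
print(f"t = {t_stat:.3f}, two-sided P = {p_value:.4g}")
```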
Extended Data Fig. 5 Workflow for the diagnostic study and management study in the RCT.
Participants included in the trial first received a diagnosis and report from ophthalmologists based on CFP and then received additional OCT examinations, allowing consultant-level reviewers to assess and revise the diagnosis and report. All diagnoses and reports before revision by consultant-level reviewers were included in the analyses of correct diagnosis rate, correct referral rate and standardization score of reports. Only participants correctly diagnosed as 'with fundus abnormality' were included in the follow-up for the patient compliance analysis. CFP, colour fundus photo; OCT, optical coherence tomography.
Supplementary information
Supplementary Information
Supplementary Figs. 1–4, Tables 1–30, protocols and examples.
Source data
Statistical source data for Figs. 3 and 5 and Extended Data Fig. 4.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wu, Y., Qian, B., Li, T. et al. An eyecare foundation model for clinical assistance: a randomized controlled trial. Nat. Med. (2025). https://doi.org/10.1038/s41591-025-03900-7