Performance of breast cancer risk prediction algorithms across mammography systems in the UK screening programme

Rothwell, Joshua; Payne, Nicholas; Kilburn-Toppin, Fleur; Huang, Yuan; Kaggie, Joshua; Black, Richard; Hickman, Sarah; Kasmai, Bahman; Juette, Arne; Gilbert, Fiona

doi:10.1038/s41746-026-02507-7

Download PDF

Article
Open access
Published: 08 March 2026

Performance of breast cancer risk prediction algorithms across mammography systems in the UK screening programme

Joshua Rothwell¹,
Nicholas Payne¹,
Fleur Kilburn-Toppin^1,2,
Yuan Huang^1,3,
Joshua Kaggie¹,
Richard Black²,
Sarah Hickman^1,4,
Bahman Kasmai⁵,
Arne Juette⁵ &
…
Fiona Gilbert^1,2

npj Digital Medicine , Article number: (2026) Cite this article

2573 Accesses
15 Altmetric
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Thirty percent of interval breast cancers, diagnosed between routine screening mammograms, have a poorer prognosis than screen-detected cancers. Deep learning algorithms can estimate short-term risk from negative mammograms to guide supplemental imaging or screening intervals, but comparative validation on complete national screening data is lacking. We retrospectively evaluated four risk algorithms (Mirai, iCAD, Transpara, and Google) using 112,621 negative mammograms from two UK NHS Breast Screening Programme sites with different mammography systems (Philips, GE) over one screening round (2014–2017) with five-year follow-up, including 1225 future cancers. There was a distinct ranking in discriminative ability; overall AUCs ranged 0.65–0.72, only one algorithm significantly differed between systems. For interval cancers, AUCs ranged 0.67–0.77. Within the highest 4.0% of risk scores, top algorithms identified ~20% of future cancers, including ~27% of interval cancers, doubling at the 14.0% threshold. These differences highlight the need for multi-algorithm prospective trials and potential fine-tuning to improve generalisation across unseen systems.

Validation of a new fully automated software for 2D digital mammographic breast density evaluation in predicting breast cancer risk

Article Open access 06 October 2021

Identifying breast cancer risk factors and evaluating biennial mammography screening efficacy using big data analysis in Taiwan

Article Open access 09 May 2025

Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis

Article Open access 07 April 2021

Data availability

Study data from the UK NHS Breast Screening Programme were pseudonymised. Access is restricted, and the data cannot be publicly shared. However, researchers may request access through the relevant NHS data governance and research ethics procedures, subject to review and appropriate data sharing agreements. Requests should be made to the corresponding author and will be considered on a case-by-case basis.

Code availability

The deep learning algorithms evaluated in this study were provided for research purposes by their respective commercial companies and cannot be shared by the authors. Researchers wishing to access these algorithms should contact the respective companies directly. Mirai has been made available by Yala et al. on GitHub: https://github.com/yala/Mirai. Analysis code for this study has been made available on GitHub: https://github.com/RandomForestJosh/breast_cancer_risk_prediction.

References

Marmot, M. G. et al. The benefits and harms of breast cancer screening: an independent review. Br. J. Cancer 108, 2205–2240 (2013).
Google Scholar
Schumann, L., Hadwiger, M., Eisemann, N. & Katalinic, A. Lead-time corrected effect on breast cancer survival in germany by mode of detection. Cancers 16, 1326 (2024).
Google Scholar
Dibden, A. et al. Reduction in interval cancer rates following the introduction of two-view mammography in the UK breast screening programme. Br. J. Cancer 110, 560–564 (2014).
Google Scholar
Henderson, L. M. et al. Breast Cancer Characteristics Associated With Digital Versus Film-Screen Mammography for Screen-Detected and Interval Cancers. Am. J. Roentgenol. 205, 676–684 (2015).
Google Scholar
Kerlikowske, K. et al. Advanced Breast Cancer Definitions by Staging System Examined in the Breast Cancer Surveillance Consortium. JNCI J. Natl. Cancer Inst. 113, 909–916 (2021).
Google Scholar
Törnberg, S. et al. A pooled analysis of interval cancer rates in six European countries. Eur. J. Cancer Prev. 19, 87–93 (2010).
Google Scholar
Bennett, R. L., Sellars, S. J. & Moss, S. M. Interval cancers in the NHS breast cancer screening programme in England, Wales and Northern Ireland. Br. J. Cancer 104, 571–577 (2011).
Google Scholar
Yala, A. et al. Optimizing risk-based breast cancer screening policies with reinforcement learning. Nat. Med. 28, 136–143 (2022).
Google Scholar
Kerlikowske, K. et al. Cumulative Advanced Breast Cancer Risk Prediction Model Developed in a Screening Mammography Population. JNCI J. Natl. Cancer Inst. 114, 676–685 (2022).
Google Scholar
Van Winkel, S. L. et al. Using AI to Select Women with Intermediate Breast Cancer Risk for Breast Screening with MRI. Radiology 314, e233067 (2025).
Google Scholar
Gilbert, F. J. et al. Comparison of supplemental breast cancer imaging techniques—interim results from the BRAID randomised controlled trial. Lancet 405, 1935–1944 (2025).
Google Scholar
Lang, K., Hofvind, S., Rodriguez-Ruiz, A. & Andersson, I. Can artificial intelligence reduce the interval cancer rate in mammography screening? Eur. Radiol. 31, 5940–5947 (2021).
Google Scholar
Eriksson, M. et al. European validation of an image-derived AI-based short-term risk model for individualized breast cancer screening—a nested case-control study. Lancet Reg. Health - Eur. 37, 100798 (2024).
Google Scholar
Yala, A. et al. Multi-Institutional Validation of a Mammography-Based Breast Cancer Risk Model. J. Clin. Oncol. 40, 1732–1740 (2022).
Google Scholar
Rothwell, J. W. D. et al. Evaluation of a Mammography-based Deep Learning Model for Breast Cancer Risk Prediction in a Triennial Screening Program. Radiology 317, e250391 (2025).
Google Scholar
Hussain, S. et al. Breast cancer risk prediction using machine learning: a systematic review. Front. Oncol. 14, 1343627 (2024).
Google Scholar
Lehman, C. et al. Deep Learning vs Traditional Breast Cancer Risk Models to Support Risk-Based Mammography Screening. J. Natl. Cancer Inst. https://doi.org/10.1093/jnci/djac142 (2022).
Hill, H., Roadevin, C., Duffy, S., Mandrik, O. & Brentnall, A. Cost-Effectiveness of AI for Risk-Stratified Breast Cancer Screening. JAMA Netw. Open 7, e2431715 (2024).
Google Scholar
Isautier, J. M. J. et al. Clinical guidelines for the management of mammographic density: a systematic review of breast screening guidelines worldwide. JNCI Cancer Spectr. 8, pkae103 (2024).
Google Scholar
Bakker, M. F. et al. Supplemental MRI Screening for Women with Extremely Dense Breast Tissue. N. Engl. J. Med. 381, 2091–2102 (2019).
Google Scholar
Salim, M. et al. AI-based selection of individuals for supplemental MRI in population-based breast cancer screening: the randomized ScreenTrustMRI trial. Nat. Med. https://doi.org/10.1038/s41591-024-03093-5 (2024).
Halligan, S., Altman, D. G. & Mallett, S. Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: A discussion and proposal for an alternative approach. Eur. Radiol. 25, 932–939 (2015).
Google Scholar
Gastounioti, A. et al. External Validation of a Mammography-Derived AI-Based Risk Model in a U.S. Breast Cancer Screening Cohort of White and Black Women. Cancers 14, 4803 (2022).
Google Scholar
Lauritzen, A. D. et al. Assessing Breast Cancer Risk by Combining AI for Lesion Detection and Mammographic Texture. Radiology 308, e230227 (2023).
Google Scholar
Kerlikowske, K. et al. Identifying women with dense breasts at high risk for interval cancer: a cohort study. Ann. Intern. Med. 162, 673–681 (2015).
Google Scholar
Arasu, V. A. et al. Comparison of Mammography AI Algorithms with a Clinical Risk Model for 5-year Breast Cancer Risk Prediction: An Observational Study. Radiology 307, e222733 (2023).
Google Scholar
NHS England. Protocols for the surveillance of women at higher risk of developing breast cancer. GOV.UK https://www.gov.uk/government/publications/breast-screening-higher-risk-women-surveillance-protocols/protocols-for-surveillance-of-women-at-higher-risk-of-developing-breast-cancer (2025).
iCAD, Inc. ProFound AI Breast Cancer Health Suite. https://www.icadmed.com/breast-health/.
Choosing Transpara. ScreenPoint Medical BV https://screenpoint-medical.com/choosing-transpara/.
Yala, A. et al. Toward robust mammography-based models for breast cancer risk. Sci. Transl. Med. 13, eaba4373 (2021).
Google Scholar
Yala, A. yala/OncoServe_Public. (2025).
McKinney, S. M. et al. International evaluation of an AI system for breast cancer screening. Nature 577, (2020).
NHS England. NHS Breast screening programme screening standards valid for data collected from 1 April 2021. GOV.UK https://www.gov.uk/government/publications/breast-screening-consolidated-programme-standards/nhs-breast-screening-programme-screening-standards-valid-for-data-collected-from-1-april-2021 (2024).
Grabler, P., Sighoko, D., Wang, L., Allgood, K. & Ansell, D. Recall and Cancer Detection Rates for Screening Mammography: Finding the Sweet Spot. Am. J. Roentgenol. 208, 208–213 (2017).
Google Scholar

Download references

Acknowledgements

We would like to thank all the women who contributed their deidentified data to CC-MEDIA for research purposes, and consequently, this research. We would also like to thank the associated commercial companies for providing research access to their risk prediction algorithms, and Yala et al. for making the Mirai code base freely available under the MIT license. This research was supported by the Future Dreams Breast Cancer Charity, the National Institute for Health and Care Research (NIHR) Cambridge Biomedical Research Centre (NIHR203312*), and the Cancer Research UK early detection program grant (C543/A26884). The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.

Author information

Authors and Affiliations

University of Cambridge, Department of Radiology, Cambridge, UK
Joshua Rothwell, Nicholas Payne, Fleur Kilburn-Toppin, Yuan Huang, Joshua Kaggie, Sarah Hickman & Fiona Gilbert
Cambridge University Hospitals NHS Foundation Trust, Department of Radiology, Cambridge, UK
Fleur Kilburn-Toppin, Richard Black & Fiona Gilbert
University of Cambridge, EPSRC Cambridge Mathematics of Information in Healthcare Hub, Cambridge, UK
Yuan Huang
Barts Health NHS Trust, Department of Radiology, London, UK
Sarah Hickman
Norfolk and Norwich University Hospital, Department of Radiology, Norwich, UK
Bahman Kasmai & Arne Juette

Authors

Joshua Rothwell
View author publications
Search author on:PubMed Google Scholar
Nicholas Payne
View author publications
Search author on:PubMed Google Scholar
Fleur Kilburn-Toppin
View author publications
Search author on:PubMed Google Scholar
Yuan Huang
View author publications
Search author on:PubMed Google Scholar
Joshua Kaggie
View author publications
Search author on:PubMed Google Scholar
Richard Black
View author publications
Search author on:PubMed Google Scholar
Sarah Hickman
View author publications
Search author on:PubMed Google Scholar
Bahman Kasmai
View author publications
Search author on:PubMed Google Scholar
Arne Juette
View author publications
Search author on:PubMed Google Scholar
Fiona Gilbert
View author publications
Search author on:PubMed Google Scholar

Contributions

J.R., N.R.P., F.KT., and F.J.G. conceptualized the study. J.R., N.R.P., F.KT., S.E.H., B.K., and A.J. collected and curated screening data. J.R., N.R.P., J.D.K., R.T.B., and S.E.H. coordinated the technical setup of algorithms and the research environment in which they processed data. J.R. conducted failure analyses. J.R., N.R.P., and Y.H. coordinated and conducted analyses. Y.H. provided statistical guidance and supervised the implementation of analyses. J.R. assembled all figures. J.R. wrote the manuscript. All authors reviewed and contributed to manuscript editing.

Corresponding author

Correspondence to Fiona Gilbert.

Ethics declarations

Competing interests

J.R., N.R.P., S.E.H. and F.J.G. research agreements with Merantix, ScreenPoint Medical, Lunit, iCAD, Google, Therapixel and Volpara. S.E.H. Radiology Artificial Intelligence Trainee Editorial Board member. J.R. and F.KT. were supported by a Future Dreams breast cancer charity grant awarded to F.J.G. F.KT. consulting for Genesis Care. J.D.K. research support from the NIHR Cambridge Biomedical Research Centre and the Wellcome Trust; grants from Cancer Research UK, AstraZeneca, and GE HealthCare. F.J.G., recipient of the Cancer Research UK Early Detection Programme grant; consulting for Alphabet and Kheiron; honoraria for lectures from GE HealthCare; participation on an advisory board for Bayer; past president (2020–2022) of the European Society of Breast Imaging; current Clinical Radiology AI Lead Advisor at the Royal College of Radiologists; contrast media for unrelated trial from Bayer. Y.H., R.T.B., B.K. and A.J. have no competing interests to declare.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rothwell, J., Payne, N., Kilburn-Toppin, F. et al. Performance of breast cancer risk prediction algorithms across mammography systems in the UK screening programme. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02507-7

Download citation

Received: 07 October 2025
Accepted: 20 February 2026
Published: 08 March 2026
DOI: https://doi.org/10.1038/s41746-026-02507-7