Evaluating breast cancer screening performance without registries using medico-administrative data

Jemelen, Emilien; Orchard, Francisco; Madie, William; Valentin, Bernard; Belin, Josine; Laas, Enora; Jeannerod, Guillaume; Mares, Pierre; Katsahian, Sandrine; Guilloux, Agathe

doi:10.1038/s41598-025-10115-w

Download PDF

Article
Open access
Published: 11 July 2025

Evaluating breast cancer screening performance without registries using medico-administrative data

Emilien Jemelen^1,2,
Francisco Orchard²,
William Madie²,
Bernard Valentin³,
Josine Belin³,
Enora Laas⁴,
Guillaume Jeannerod⁵,
Pierre Mares³,
Sandrine Katsahian⁶ &
…
Agathe Guilloux¹

Scientific Reports volume 15, Article number: 25096 (2025) Cite this article

2427 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The French Breast Cancer Screening Program (DOCS) was created to detect early Breast Cancer (BC). Key performance indicators for digital mammography include sensitivity (SE), positive predictive value (PPV), interval cancer rate (ICR) and cancer detection rate (CDR). Calculating these metrics requires a linkage between screening data and BC registries; however, registries are scarce in France and often inaccessible for research. We therefore used medico-administrative data as an alternative. We linked regional screening data to the French National Health Data System (SNDS) between 2011 and 2020. Women were followed for 24 months post-screening. Screen-detected cancers and those identified with the SNDS were included. Performance metrics were calculated based on these linked datasets. A total of 252,786 screening exams were analyzed, covering 29,661–33,447 screenings annually, with a mean age of 61 years. SE was 77.9% (95% CI 76.3–79.3), indicating that approximately four in five cancers were detected through mammography. PPV was 19.8% (95% CI 19–20.5), meaning that one in five women with a positive screening test were confirmed with cancer within 24 months. CDR was 10.9 per 1000 exams (95% CI 10.5–11.3), equating to one detected case per 100 screenings. ICR was 2.4 per 1000 exams (95% CI 2.2–2.6), meaning that more than two interval cancers were detected per 1000 screenings. This identification approach using medico-administrative data offers a reproducible alternative for regions where cancer registries are unavailable. A future study applying this methodology in a registry-covered region could further validate the effectiveness of linking screenings to SNDS data for systematic cancer identification.

Breast cancer screening patterns and associated factors in Iranian women over 40 years

Article Open access 03 July 2024

Long-term trends in incidence, characteristics and prognosis of screen-detected and interval cancers in women participating in the Dutch breast cancer screening programme

Article 11 March 2024

Identifying breast cancer risk factors and evaluating biennial mammography screening efficacy using big data analysis in Taiwan

Article Open access 09 May 2025

Introduction

Breast Cancer (BC) is currently the most commonly diagnosed cancer and the leading cause of cancer mortality among women in Europe, with an estimated 557,900 new cases and 144,500 deaths in 2022. In France alone, 2023 saw an estimated 61,200 new diagnoses and 12,100 deaths^1,2.

To detect BC, public health agencies recommend tailored screening strategies based on individual risk profiles (Fig. 1)³.

Clinical trials indicate that mammography can reduce BC mortality by 10–20%, aiding in early detection and thereby improving treatment outcomes^4,5,6,7. Expert groups presented substantial evidence supporting the performance and cost-effectiveness of mammographic screening⁵, which prompted the initiation of organized screening programs across Europe in the 2000s⁸. The French Organized Screening Program for Breast Cancer (DOCS) was introduced in 2004, reaching a 47.7% participation rate among 10.8 million eligible women in 2020–2021⁹.

In 2023, a meta-review of BC screening guidelines reported the five most recommended metrics to assess mammography performance: sensitivity (recommended in 11 guidelines), cancer detection rate per 1000 women screened (10 guidelines), cancer size, interval cancer rate, and positive predictive value (7 guidelines)¹⁰. Public health institutions and other studies emphasize the importance of these metrics to evaluate mammography performance.

In 2004, the National Agency for Healthcare Accreditation and Evaluation (ANAES) recommended two types of test validity measures: sensitivity and specificity, assessing the intrinsic validity of screening tests, and predictive values, relevant to the screened population¹¹. In 2008, the European Commission Initiative on Breast Cancer (ECIBC) recommended organized screening mammography based on findings from a Danish study indicating that its sensitivity and specificity outperformed those of opportunistic or non-organized screening programs^12,13. Regional studies in France and Germany estimated that approximately 15–20% of cancers detected within screened populations are identified within 24 months following a negative mammography^14,15.

In 2019, a report by Santé Publique France on the performance of DOCS mammography highlighted the necessity of aligning with European guidelines to improve sensitivity and predictive values while pointing to the need for registries to track post-negative-screening cases¹⁶. In France in 2018, 33 cancer registries covered 24 out of 101 departments^17,18, with databases format heterogeneity, making it difficult to use them for data linkage¹⁹.

This study aimed (1) to assess the performance of mammography by estimating the sensitivity, specificity, predictive values, cancer detection rate, and interval cancer rates of the screening program without cancer registry and (2) to compare these estimates with findings from other studies.

Material and methods

Data sources

Screening data were obtained from the Coordination Center of Cancer Screening in the Occitanie region (CRCDC-OC) and included individual patient characteristics, full-field digital mammography (FFDM) images, and mammography results. Mammography exams comprise a clinical assessment (palpation and observation) and a double reading of four images: two per breast, one cranio-caudal (CC) view and one mediolateral oblique (MLO) view. Following the initial reading, additional exams such as image enlargement or fine-needle biopsy may be conducted as needed²⁰. Data were collected from centers in Gard and Lozère departments (see Supplementary Figure S1) between 2004 and 2020, supported by individual or collective communications. Patients’ data were pseudonymized twice, and the data processing was designated of public interest and complied with the European General Data Protection Regulation act (GDPR) and French privacy standards, as authorized by the French National Data Protection Authority (CNIL) with authorization number DR-2020-365, granted in 2020. All methods were performed in accordance with these guidelines and regulations.

For all screened women, medico-administrative data from the National Health Data System (SNDS) were also collected. Established in 2016 to unify public insurance databases, the SNDS provides general information (age, sex and residence location), treatment and acts data (SNIIRAM tables), patient records from healthcare institutions (PMSI tables), and cause-of-death data (CepiDC tables), generally retained for 20 years from the date of inclusion^21,22. No individual socioeconomic data can be found for the general population in the SNDS²³.

The linkage between the screening database and the SNDS database was facilitated through a technological platform provided by the Health Data Hub (HDH), a public entity formed in 2019 to support SNDS-related research projects^24,25.

Study population

Every two years, the DOCS screening program invites women aged 50 to 74 years with a Social Security (French healthcare reimbursement system) number, without reported risk factors and without any previously reported breast lesions²⁶. Our study population included all women invited to the program with at least one screening mammography done between 2011 and 2018. The start year, 2011, was chosen to avoid the impact of a technology shift from analog to digital mammography between 2008 and 2010, which likely affected false-negative rates in 2010 (see Supplementary Figure S2). The endpoint, 2018, was selected to allow a two-year follow-up period in the SNDS database for cancer detection. Mean age in the study cohort was 61 years. 1,407 examinations (0.2%) that did not comply with DOCS guidelines, such as negative first readings not followed by a second reading, were excluded²⁷. In total, 252,786 screening exams (29,661 to 33,447 annually) from 111,783 women were included.

BC identification

BC cases were either identified by the screening program or through the SNDS medico-administrative follow-up for the shorter of two periods: within 24 months or until next screening, if applicable (Fig. 2). Women with screen-detected cancer were also followed up in the SNDS to complete an initial assessment of the accuracy of our identification method.

In the SNDS, five criteria were used for BC identification. BC deaths were identified in the Center of Epidemiology on Medical Causes of Death (CépiDc) tables (1). BC surgeries, such as total mastectomy or partial mastectomy with axillary lymph node dissection, were identified in the SNIIRAM tables, with the first qualifying surgery found up to 24 months after mammography serving as a BC identification proxy (2). Breast surgeries potentially indicating cancer (e.g. partial mastectomy or tumorectomy without axillary dissection) were identified in the SNIIRAM tables with cancer treatments used to confirm BC diagnosis (3). Surgery to treatments intervals were based on the existing oncological literature²⁸. Targeted therapy (TT): between 250 days before and up to 180 days after surgery. Endocrine therapy (ET): between 250 and up to 365 days. Radiotherapy (RT): between 150 and up to 365 days. Chemotherapy (CT): between 250 and up to 180 days.

Other cases without recorded death or surgery were identified up to 24 months post-screening either via the SNIIRAM Long-Term Conditions (LTC, ALD in French) tables, where CIM-10 diagnosis codes C50 and D05 were confirmed with treatment (4), or with BC diagnoses found in the hospitals stays data tables (PMSI in French), with confirmatory treatments (5). Among LTC cases, 2% had no record of TT, ET, RT, or CT. Among PMSI cases, 64% had no record of TT, ET, RT or CT.

Mammography result and screening result

The American College of Radiology Breast Imaging Reporting and Data System (BI-RADS)²⁹ reports seven levels to categorize breast imaging tests results (categories definitions are detailed in Supplementary Figure S3). In the context of the DOCS screening program, mammography result ranges from 1 (normal) to 5 (highly suggestive of malignancy). Following DOCS evaluation methods of national authorities, a mammography result was deemed positive if rated 3, 4, or 5 and negative if rated 1 or 2^16,30,31. The screening outcome was classified as positive or negative, integrating both mammography results and any additional exam findings.

Interval cancers

Interval cancers, defined as cases detected after a negative mammography within 24 months and before next screening, were identified by SNDS follow-up or women self-reporting their cancer diagnosis to the program (Fig. 2)^{12,13,14,15,16}. Interval cancer rate was defined as the number of interval cancer cases per 1,000 screenings. Relative interval cancer rate was defined as the proportion of cancers detected after a negative mammography.

Metrics definition

Sensitivity (SE), specificity (SP), positive and negative predictive values (PPV/NPV), cancer detection rate (CDR) and interval cancer rate (ICR) and relative interval cancer rate (RICR) were computed with different combinations of SNDS criteria for cancer identification, in addition to screen-detected cases (Table 1).

Table 1 Definition of mammography performance metrics.

Full size table

Outcomes and statistical analyses

Metrics were computed over the whole 2011–2018 period and annually, with and without stratification.

To stratify the metrics, the following features were considered: age at the exam, rank of the screening exam and ACR level of the exam.

Confidence intervals (CI) were computed with a dedicated method for each metric^32,33,34. Results were obtained with a Python 3 kernel in the HDH environment.

Results

Additional cases identified in the SNDS

Among positive mammography exams, the DOCS screening program identified 1,611 cases, and an additional 387 cases were identified by SNDS follow-up up to 24 months after screening.

Among negative mammography exams, the DOCS identified 101 cases, and 670 additional cases were identified in the SNDS (Fig. 3 for the full distribution).

95.3% of screen-detected cases were also identified through SNDS follow-up.

Global performance of mammography

Over the period, SE of mammography without incorporating SNDS-identified cancers was 98.8% (95% CI 98.2–99.2). Including SNDS cases, SE was 77.9% (76.3–79.3) (Fig. 4), indicating that 77.9% of cases appearing within 24 months after screening had a positive mammography result. This result is above the European minimum standard of 70%³⁵ and aligns with findings from studies using cancer registries^13,36,37.

PPV of mammography was 15.1% (14.5–15.8) without SNDS-identified cases, and 19.8% (19–20.5) with SNDS cases. Over time, an upward trend of PPV was observed both with and without SNDS cases. In 2017, 22.4% (20.3–24.5) of patients with a positive mammography were confirmed with cancer within 24 months.

CDR without SNDS cases was 6.6‰ (6.2–6.8). Including SNDS cases, CDR was 10.9‰ (10.5–11.3), meaning that approximately 1 in 100 screened patients was diagnosed with cancer within 24 months. This result is consistent with reported CDRs for organised screening with FFDM^38,40.

ICR with SNDS cases was 2.4‰ (2.2-2.6), indicating that more than 2 in every 1000 mammographies led to an interval cancer within 24 months. ICR for the first year post-screening was 0.8‰ (0.7–0.9), increasing to 1.7‰ (1.5–1.9) in the second year (Fig. 4). These values are in line with the literature^37,38 and comply with European guidelines (details on the ICR guidelines are available at the end of Supplementary Materials)³⁵.

RICR with SNDS cases was 22.1% (20.6–23.7) over the whole period, meaning that 22.1% of cases identified within 24 months after screening occurred after a negative mammography. First year post-screening RICR was 9% (7.9–10.2), and second year RICR was 80% (76.5–83.1). In other words, approximately 1 in 10 cancers diagnosed within the first year after screening had previously been classified as negative on mammography, compared to about 4 in 5 for cancers diagnosed in the second year following screening.

Mammography performance by exam rank

PPV significantly increased with the exam rank (Table 2). This may reflect improved radiologist accuracy with a cumulative screening record, enhancing understanding of the current exam. Alternatively, increased PPV might indicate severity differences among patients with higher screening participation. However, using mean positive ACR result as a severity proxy showed no evidence of correlation between severity and participation rates.

A slight decrease in CDR was observed with higher exam rank, potentially reflecting a reduced cancer incidence in populations undergoing repeated testing.

Table 2 Mammography PPV and CDR by screening exam rank (95% CI).

Full size table

Mammography performance by age

PPV varied significantly by age (Table 3). For women aged 50–59, PPV was 14.6%, when it reached 28.1% for women aged 70–74, suggesting that a positive mammography result was more predictive of cancer for older patients. CDR also increased with age, from 9.1‰for women aged 50-59 to 14.8‰for women aged 70–74.

Table 3 Mammography PPV and CDR by age (95% CI).

Full size table

Mammography performance by ACR level

PPV varied considerably by ACR level. Over the 2011–2018 period, PPV for ACR level 3 was 6.1% (5.6–6.8), PPV was 50.2% (47.8–52.5) for ACR 4, and 94.2% (92.5–95.6) for ACR 5, without significant changes over time (see Supplementary Figure S4). These findings align with radiologists’ observations on PPV by ACR level⁴¹ and comply with French national BC screening guidelines²⁷.

Discussions

In this study, interval cancers were defined as mammography false negatives, whether or not precursor signs could be seen on the mammograms. This definition assumes that truly undetectable interval cancers could stem from the screening program’s limitations, in particular the time interval between two consecutive invitations. By distinguishing interval cancers missed by radiologists (false negatives) from those retrospectively undetectable (true negatives), we could better separate screening inaccuracies from the inherent constraints of the program.

Our stratified analyses showed notable variations in performance metrics, specifically in positive predictive value (PPV) and cancer detection rate (CDR), across age groups and with successive screening exams. These findings may suggest that breast cancer in older patients has characteristics that make it easier to detect, as evidenced by the increase in ACR scores along with age (3.35 average for women aged 50–59, 3.46 for 60–69, and 3.54 for 70–74, see Supplementary Figure S5). The observed improvement in PPV with subsequent screening exams requires further investigation to identify causal factors.

Although an initial validation showed that our BC identification algorithm is 95.3% sensitive to screen-detected cases, the only way to test whether the algorithm over-identifies cases is to link a BC registry to the SNDS in a French region equipped with a population-based cancer registry. In other words, a future study applying this methodology in a registry-covered department should validate the effectiveness of linking screening data to SNDS data for systematic BC identification.

Conclusion

Our findings indicate substantial changes in SE, PPV, and CDR with SNDS-identified cancers. Additionally, SNDS follow-up enables the computation of ICR and RICR. Most metrics align with European guidelines and findings from registry-based studies (when available). Given the national coverage of the SNDS, this approach has the potential to bridge the gap created by the limited availability of registries in France. Future studies applying this methodology in regions with registries could validate the effectiveness of linking screenings to the SNDS for cancer identification.

Although breast cancer is relatively well contained compared to other cancers, with only 5% of cases diagnosed at stage 4, versus 57% for lung and 65% for colorectal cancers^42,43, approximately 30% of survivors eventually develop metastases⁴⁴. This underscores that, while early detection and treatment are effective, the disease mortality burden remains considerable. Thus, advancing the prediction of cancer severity and outcomes based on patient characteristics is essential. The database used in this study, one of the largest imaging resources for organized breast cancer screening research with more than 250,000 mammographic images and linked medico-administrative follow-up⁴⁵, offers a valuable opportunity to address these challenges.

Data availability

All data and materials built for this study are available with restricted access in a virtual environment of the Health Data Hub technological platform. Emilien Jemelen (corresponding author) should be contacted to request the data from this study. The search engine developed during the study to identify breast cancer in the SNDS administrative database is planned for release in 2025 on the Health Data Hub open source softwares library (BOAS).

Abbreviations

ACR:: American College of Radiology
ANAES (in French):: National Agency for Healthcare Accreditation and Evaluation
BC:: Breast cancer
BI-RADS:: Breast imaging-reporting and data system
CC:: Cranio-caudal
CDR:: Cander detection rate
CepiDC (in French):: Causes of death national database
CI:: Confidence interval
CNIL (in French):: French national data protection authority
CRCDC (in French):: Center for the Coordination of Cancer Screening
CT:: Chemotherapy
DOCS (in French):: French Breast Cancer Screening Program
ECIBC:: European Commission Initiative on Breast Cancer
ET:: Endocrine therapy
FFDM:: Full-field digital mammography
HDH:: Health Data Hub
ICR:: Interval cancer rate
LTC:: Long-term conditions/illnesses
MLO:: Medio-lateral oblique
NPV:: Negative predictive value
PMSI (in French):: Hospitals medico-administrative database
PPV:: Positive predictive value
RICR:: Relative interval cancer rate (relative to the incidence of BC in the population)
RT:: Radiotherapy
SE:: Sensitivity
SNDS (in French):: French National Health Data System, administrative database for reimbursements purposes
SNIIRAM (in French):: Medical treatments and acts administrative database
SP:: Specificity
TT:: Targeted therapy

References

International Agency for Research on Cancer, WHO. Global cancer observatory. https://gco.iarc.fr/. Accessed: 2024-01-29.
Ligue contre le Cancer website, Breast cancer section. https://www.ligue-cancer.net/questce-que-le-cancer/les-types-de-cancer/cancer-du-sein, Accessed: 2024/05/07.
Haute Autorité de Santé. Cancer du sein: quel dépistage selon vos facteurs de risque ? (Technical report, Haute Autorité de Santé, 2014).
Peter, C. Gøtzsche (International agency for research on cancer (IARC) handbooks of cancer prevention, Breast cancer screening, 2002).
Google Scholar
Tabár, L., Vitak, B., Chen, T.H., Yen, A.M., Cohen, A. Tot, T., Chiu, S.Y. Chen, S.L., Fann, J.C., Rosell, J., Fohlin, H., Smith, R.A., & Duffy, S.W. Swedish two-county trial: impact of mammographic screening on breast cancer mortality during 3 decades. Radiology 260(3), 658 (2011).
Marmort, M. G., Altman, D. G., Cameron, D. A., Dewar, J. A., Thompson, S. G. & Wilcox, M. The benefits and harms of breast cancer screening: an independent review. The Lancet 380(9855), 1778–1786 (2012).
Gøtzsche, P. C. & Jørgensen, K. J. Screening for breast cancer with mammography. Cochrane Database Syst. Rev. 6 (2013).
Antonio Ponti, A., Anttila, G. Ronco. & Senore, C. Cancer screening in the european union,. report on the implementation of the council recommendation on cancer screening 2017 (Technical report, European Commission, 2017).
Institut National du Cancer. Panorama des cancers en france, edition 2023 (Technical report, Institut National du Cancer, 2023).
Google Scholar
Selby, K., Sedki, M., Levine, E., Kamineni, A., Green, B.B., Vachani, A., Haas, J.S., Ritzwoller, D.P., Croswell, J.M., Ohikere, K., Doria-Rose, V.P., Rendle, K.A., Chubak, J., Lafata, J.E., Inadomi, J. & Corley, D.A. Test performance metrics for breast, cervical, colon, and lung cancer screening: a systematic review. J. Natl. Cancer Inst. 115(4), 375–384 (2023).
Corbillon, E., Poullié, A.-I., Blondet, E., & Missour, S. Guide méthodologique: comment évaluer a priori un programme de dépistage? uppercaseANAES, Service évaluation technologique, (2004).
European guidelines on breast cancer screening and diagnosis. https://healthcare-quality.jrc.ec.europa.eu/en/ecibc/european-breast-cancer-guidelines, Accessed: 2024-01-30.
Bihrmann, K., Jensen, A., Olsen, A.H., Njor, S., Schwartz, W., Vejborg, I., & Lynge, E. Performance of systematic and non-systematic (‘opportunistic’) screening mammography: A comparative study from Denmark. J. Med. Screen. 15(2), 23–26 (2008).
Bertrand, C., Le Bihan-Benjamin, C., de Bels, F. & Bousquet, P.-J. Identification des cancers du sein de l’intervalle à partir de données médico-administratives. Revue d’Epidémiologie et de Santé Publique 67(2), S99 (2019).
Article Google Scholar
Urbschat, I. & Heidinger, O. Determination of interval cancer rates in the german mammography screening program using population-based cancer registry data. Bundesgesundheitsblatt-Gesundheitsforschung-Gesundheitsschutz 57, 68–76 (2014).
Quintin, C., Rogel, A. & Évaluation du programme de dépistage organisé du cancer du sein: résultats et évolution des indicateurs de performance depuis,. en france métropolitaine 2019 (Santé publique France, Saint-Maurice, 2004).
Francis, F. Les registres de morbidité en France : état des lieux, enjeux et perspectives, thèse pour l’obtention du diplôme d’Etat de docteur en médecine. PhD thesis, Université de Bordeux, 2018. soutenue le 11 juin à Bordeaux (2018 ).
INSEE webpage. https://www.insee.fr/, Accessed: 2024-02-01.
Sollogoub, N. Proposition de loi visant à mettre en place un registre national des cancers. https://www.senat.fr/rap/l22-703/l22-7034.html, Accessed: 2024-01-30.
National institute of cancer (INCA). https://www.e-cancer.fr/. Accessed: 2024-01-30.
Tuppin, P., Rudant, J., Constantinou, P., Gastaldi-ménager, C., Rachas, A., de Roquefeuil, Maura, G., Caillol, H., Tajahmady, A., Coste, J., Gissot, C., Weill, A., & Fagot-Campagna, A. Value of a national administrative database to guide public decisions: From the système national d’information interrégimes de l’assurance maladie (sniiram) to the système national des données de santé (snds) in france. Revue d’Épidémiologie et de Santé Publique, 65(4):S149–S167, (2017).
SNDS webpage on the CNIL website. https://www.cnil.fr/fr/snds-systeme-national-des-donnees-de-sante, Accessed: 2024-02-02.
Online documentation on Sociodemographic variables in the SNDS. https://documentation-snds.health-data-hub.fr/snds/fiches/variables_sociodemo.html. Accessed: 2025-05-31.
Moore, N., Blin, P., Lassalle, R., Thurin, N., Bosco-Levy, P. & Droz, C. National Health Insurance Claims Database in France (SNIRAM), Système Nationale des Données de Santé (SNDS) and Health Data Hub (HDH), chapter Databases for Pharmacoepidemiological Research, pages 131–140. Springer, )(2021).
Health data hub (HDH) webpage. https://www.health-data-hub.fr/qui-sommes-nous, Accessed: 2024/02/26.
Arrêté du 16 janvier 2024 relatif aux programmes de dépistages organisés des cancers, Annexe II, Section VII. https://www.legifrance.gouv.fr/eli/arrete/2024/1/16/TSSP2332083A/jo/texte. Accessed: 2025-05-27.
Dépistage Organisé du Cancer du Sein. Cahiers des charges du dépistage organisé du cancer du sein, cahier des charges pour les radiologues. Journal officiel du 21 décembre 2006, 2006.
Dumas, E., Laot, L., Coussy, F., Grandal Rejo, B., Daoud, E., Laas, E., Kassara, A., Majdling, A., Kabirian, R., Jochum, F. et al. The french early breast cancer cohort (fresh): a resource for breast cancer research and evaluations of oncology practices based on the french national healthcare system database (snds). Cancers 14(11), 2671 (2022).
American College of Radiology. Breast Imaging Reporting and Data System (BI-RADS). 4th Ed. American College of Radiology, (2004).
Agence Nationale d’Accréditation et d’Évaluation de la Santé. Breast imaging reporting and data system (bi-rads) 4th edn. (Agence Nationale d’Accréditation et d’Évaluation de la Santé, Technical report, 2004).
Google Scholar
Société Française de Radiologie. BI-RADS (Breast Imaging Reporting and Data System). Atlas d’imagerie du sein - Mammographie. Deuxième édition française basée sur la 4ème édition américaine. Société Française de Radiologie, 2004.
Altman, D., Machin, D., Bryant, T. & Gardner,M. Statistics with Confidence: Confidence Intervals and Statistical Guidelines, 2nd Edition, pages 45–47. BMJ Books, (2000).
Gildenblat, J. A python library for confidence intervals. https://github.com/jacobgil/confidenceinterval, (2023).
Stein Emil Vollset. Confidence intervals for a binomial proportion. Stat. Med. 12(9), 809–824 (1993).
Article Google Scholar
Perry, N. et al. European guidelines for quality assurance in breast cancer screening and diagnosis 4th edn. (Technical report, European Union, 2006).
Google Scholar
Geertse, T. D., Paap, E., Waal, D. van der, Duijm, L. E.M., Pijnappel, R.M., & Broeders, M. J.M. Utility of supplemental training to improve radiologist performance in breast cancer screening: A literature review. J. Am. Coll. Radiol. 16(11), 1528–1546 (2019).
Heinze, F., Czwikla, J., Heinig, M., Langner, I. & Haug, U. German mammography screening program: program sensitivity between 2010 and 2016 estimated based on german health claims data. BMC Cancer 23(852) (2023).
Houssami, N., Zackrisson, S., Blazek, K., Hunter, K., Bernardi, D., Lång, K. & Hofvind, S. Meta-analysis of prospective studies evaluating breast cancer detection and interval cancer rates for digital breast tomosynthesis versus mammography population screening. Eur. J. Cancer 148, 14–23 (2021).
Karsa, L. V., Holland, R., Broeders, M., Wolf, C., Perry, N. & Törnberg, S. European guidelines for quality assurance in breast cancer screening and diagnosis : fourth edition. Technical report, European Commission, Directorate-General for Health and Consumers, (2013).
Miglioretti, D. L., Bissell, M. C. S., Kerlikowske, K., Buist, D. S. M., Cummings, S. R., Henderson, L. M., Onega, T., O’Meara, E. S., Rauscher, G. H., Sprague, B. L., Tosteson, A. N. A., Wernli, K. J., Lee, J. M., & Lee, C. I. Assessment of a risk-based approach for triaging mammography examinations during periods of reduced capacity. JAMA Netw. Open 4(3) (2021).
Le Roquais, P. La classification bi-rads: lecture d’images ou ligne-guide clinique? conséquences dans le quotidien du radiologue. In 27\(^{\circ }\) Journées de la Société française de sénologie et de pathologie mammaire (SFSPM), Deauville, 2005. Dogmes et doutes, pages 235–240. Datebe SAS, (2005).
Debieuvre, Didier, Molinier, O., Falchero, L., Locher, C., & Templement-Grangerat, D. Lung cancer trends and tumor characteristic changes over 20 years (2000-2020): Results of three french consecutive nationwide prospective cohorts’ studies. The Lancet Regional Health - Europe, page 100492, (2022).
Cancer center website, Colorectal cancer section. https://www.cancercenter.com/cancer-types/colorectal-cancer/types/metastatic-colorectal-cancer, Accessed: 2024/07/22.
Redig, A. J., & McAllister, S. S. Breast cancer as a systemic disease: a view of metastasis. J. Internal Med. 274(2), 113–126 (2013).
Logan, J., Kennedy, P.J., & Catchpoole, D. A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future. Sci. Data 10(595) (2023).

Download references

Funding

This work was supported by the Health Data Hub as part of the Deep.piste project, which funded the sharing of reusable code components.

Author information

Authors and Affiliations

French Institute for Research in Computer Science and Automation (INRIA), Paris, France
Emilien Jemelen & Agathe Guilloux
Data Science Department, Epiconcept, Paris, France
Emilien Jemelen, Francisco Orchard & William Madie
Regional Coordination Center for Cancer Screening (CRCDC), Nîmes, France
Bernard Valentin, Josine Belin & Pierre Mares
Institut Curie, Foundation for Cancer Research, Paris, France
Enora Laas
Epiconcept, Paris, France
Guillaume Jeannerod
Georges Pompidou European Hospital, Paris, France
Sandrine Katsahian

Authors

Emilien Jemelen
View author publications
Search author on:PubMed Google Scholar
Francisco Orchard
View author publications
Search author on:PubMed Google Scholar
William Madie
View author publications
Search author on:PubMed Google Scholar
Bernard Valentin
View author publications
Search author on:PubMed Google Scholar
Josine Belin
View author publications
Search author on:PubMed Google Scholar
Enora Laas
View author publications
Search author on:PubMed Google Scholar
Guillaume Jeannerod
View author publications
Search author on:PubMed Google Scholar
Pierre Mares
View author publications
Search author on:PubMed Google Scholar
Sandrine Katsahian
View author publications
Search author on:PubMed Google Scholar
Agathe Guilloux
View author publications
Search author on:PubMed Google Scholar

Contributions

FO, GJ, BV and PM managed the linkage between the CRCDC screening database and the SNDS database. After the linkage was authorized by the CNIL, FO and WM cleaned the database on the Health Data Hub workspace and started building the SNDS search engine. EJ finished the search engine, performed the analyses and wrote the paper, with the technical support of WM and with the helpful counseling of SK, AG and FO. EL brought her medical expertise to help us understand the intricacies of SNDS data and improve our cancer identification algorithm. JB actively took part in the reading of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Emilien Jemelen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval and consent to participate

All screened women within the scope of this study received either individual or collective information letters asking for their Informed consent before data inclusion. The data linkage of the study as well as all experimental protocols were approved by the CNIL (French national data protection authority) in 2019.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Jemelen, E., Orchard, F., Madie, W. et al. Evaluating breast cancer screening performance without registries using medico-administrative data. Sci Rep 15, 25096 (2025). https://doi.org/10.1038/s41598-025-10115-w

Download citation

Received: 16 December 2024
Accepted: 02 July 2025
Published: 11 July 2025
Version of record: 11 July 2025
DOI: https://doi.org/10.1038/s41598-025-10115-w

Keywords

This article is cited by

The plasma nanoDSF denaturation profiles predict the presence of breast cancer
- Mathilde Guerin
- Rémi Eyraud
- François Bertucci
Journal of Translational Medicine (2025)

Subjects

Abstract

Similar content being viewed by others

Breast cancer screening patterns and associated factors in Iranian women over 40 years

Long-term trends in incidence, characteristics and prognosis of screen-detected and interval cancers in women participating in the Dutch breast cancer screening programme

Identifying breast cancer risk factors and evaluating biennial mammography screening efficacy using big data analysis in Taiwan

Introduction

Material and methods

Data sources

Study population

BC identification

Mammography result and screening result

Interval cancers

Metrics definition

Outcomes and statistical analyses

Results

Additional cases identified in the SNDS

Global performance of mammography

Mammography performance by exam rank

Mammography performance by age

Mammography performance by ACR level

Discussions

Conclusion

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval and consent to participate

Additional information

Publisher’s note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

The plasma nanoDSF denaturation profiles predict the presence of breast cancer

Search

Quick links