A 20-feature radiomic signature of triple-negative breast cancer identifies patients at high risk of death

Noor, Humaira; Zheng, Yuanning; Mantz, Adam B.; Zhou, Ryle; Kozlov, Andrew; DeMartini, Wendy B.; Chen, Shu-Tian; Okamoto, Satoko; Ikeda, Debra M.; Telli, Melinda L.; Kurian, Allison W.; Ford, James M.; Vinayak, Shaveta; Satoyoshi, Mina; Joshi, Vishal; Mattonen, Sarah A.; Lee, Kevin; Gevaert, Olivier; Sledge, George W.; Itakura, Haruka

doi:10.1038/s41523-025-00790-3

Download PDF

Article
Open access
Published: 26 July 2025

A 20-feature radiomic signature of triple-negative breast cancer identifies patients at high risk of death

Humaira Noor¹,
Yuanning Zheng¹,
Adam B. Mantz²,
Ryle Zhou²,
Andrew Kozlov³,
Wendy B. DeMartini³,
Shu-Tian Chen³,
Satoko Okamoto³,
Debra M. Ikeda³,
Melinda L. Telli²,
Allison W. Kurian^2,4,
James M. Ford²,
Shaveta Vinayak²,
Mina Satoyoshi⁵,
Vishal Joshi²,
Sarah A. Mattonen³,
Kevin Lee²,
Olivier Gevaert^1,6,
George W. Sledge² &
…
Haruka Itakura²

npj Breast Cancer volume 11, Article number: 79 (2025) Cite this article

3303 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

A substantial proportion of patients with non-metastatic triple-negative breast cancer (TNBC) experience disease progression and death despite treatment. However, no tool currently exists to discriminate those at higher risk of death. To identify high-risk TNBC, we conducted a retrospective analysis of 749 patients from two independent cohorts. We built a prediction model that leverages breast magnetic resonance imaging (MRI) features to predict risk groups based on a 50-gene Transcriptomics Signature (TS). The TS distinguished patients with high-risk for death in multivariate survival analysis (Transcriptomic cohort: [HR] = 13.6, 95% confidence interval [CI] = 1.56-1, p = 0.02; SCAN-B cohort: HR = 1.45, CI 1.04-2.03, p = 0.02). The model identified a 20-feature radiomic signature derived from breast MRI that predicted the TS-based risk groups. This imaging-based classifier was applied to a validation cohort (log rank p = 0.013, accuracy 0.72, AUC 0.71, F1 0.74, precision 0.67, and recall 0.82), detecting a 25% absolute survival difference between high- and low-risk groups after 5 years.

Prognosis prediction and risk stratification of breast cancer patients based on a mitochondria-related gene signature

Article Open access 03 February 2024

Survival prediction in triple negative breast cancer using multiple instance learning of histopathological images

Article Open access 25 August 2022

Discovery of a new mitophagy-related gene signature for predicting the outlook and immunotherapy in triple-negative breast cancer

Article Open access 25 February 2025

Introduction

Triple-negative breast cancer (TNBC) is an aggressive breast cancer subtype^1,2,3,4 marked by shorter relapse-free survival, higher risk of distant recurrence within five years, and lower 5-year survival rate compared with other subtypes^5,6,7. While systemic chemotherapy regimens have improved long-term survival in a subset of patients⁸ in the nonmetastatic setting, a significant proportion of patients progress to metastatic disease and ultimately death despite standard treatment⁵. Molecular heterogeneity in TNBC^{9,10,11,12,13} likely contributes to variability in tumoral response to treatment¹⁴, but in practice, no clinical tool has yet been developed that identifies the subgroup of patients with a higher proclivity for disease progression and accelerated death. A pre-treatment prognostic tool that identifies patients biologically destined for poor overall survival (OS) could enable earlier, proactive implementation of increased surveillance or consideration of clinical trials with alternative therapy and avoid toxicities – physical, time, and financial – on ineffective treatments. There remains a critical unmet need for an effective and easily accessible prognostic tool for high-risk patients among those with early-stage TNBC.

Transcriptomic profiling for prognostication or risk stratification has been explored across various cancer types with varying degrees of translated clinical utility^{15,16,17,18,19,20,21,22,23}. In breast cancer, there is no better example of a transcriptomic signature impacting clinical decision-making than in early stage, hormone receptor-positive, HER2-negative cancer, where a 21-gene expression profile provides prognostication of recurrence and prediction of benefit from adjuvant chemotherapy²⁴. However, no such tool exists in practice despite efforts to build prognostic signatures for clinical outcomes in TNBC^{25,26,27,28,29,30,31}. In our study, we sought to prognosticate OS, focusing on the identification of patients with non-metastatic TNBC with a high risk of death despite receiving standard treatment. We further sought to improve upon prior attempts^32,33 to develop a prognostic transcriptomic signature by leveraging a pre-identified set of high-risk genes and validating our findings in an external validation cohort. Previously, Kim et al. presented a set of 5 radiomic features that were associated with disease-free survival in TNBCs³³. However, the derivation of the radiomic signature was not based on an association with an underlying transcriptomic profile, leaving potential associations with molecular changes unexamined. Their limited long-term follow-up data also precluded the study of a prognostic connection between their 5-feature set and OS. Here, we employ a radiogenomic approach - for identifying a radiomic signature that predicts prognostic risk groups stratified by a transcriptomic signature - on multiple cohorts with sufficient long-term follow-up data to analyze OS in TNBC. You et al. presented a radio-multiomic study for cohorts that included all breast cancer subtypes³². Due to the molecular heterogeneity between breast cancer subtypes, a more subtype-specific analysis, such as the present study, is warranted.

We had previously identified a composite mRNA expression level set of 50 genes that was predictive of breast cancer recurrence across unselected subtypes of breast cancer patients (n = 858) in The Cancer Genome Atlas (TCGA) - Breast Cancer cohort^34,35,36. This 50-gene transcriptomic signature (TS) collectively represents a number of critical cellular regulatory functions, including cell-cycle phase transition, inflammatory response, and apoptosis, among others (Supplementary Fig. S1)³⁴. We hypothesized that the TS embodies a set of molecular alterations promoting an aggressive tumoral phenotype with a propensity for progressive disease, and that in TNBC, can prognosticate OS to identify the subgroup of patients with a higher risk of death.

We additionally sought to develop a method by which the prognostic ability of TS could be accessed more easily and noninvasively. We employed dynamic contrast-enhanced magnetic resonance imaging (MRI), a highly sensitive and noninvasive modality for breast cancer imaging^37,38, and hypothesized that a signature set of breast MRI-derived imaging features could predict TS using radiogenomics^39,40, an analytic approach that links imaging features with molecular data. Radiomic, or computationally-derived quantitative imaging, features derived from MRI⁴¹, delineating the three-dimensional tumor shape, texture, and edge characteristics, have demonstrated prognostic ability across various cancer types^{42,43,44,45,46,47,48,49,50,51,52,53,54}. In our prior radiogenomic work in glioblastoma, radiomic features defined distinct subtypes, each characterized by differential survival outcomes and distinct patterns of differential oncogenic pathway activation⁵⁵, supporting the idea that radiomic features can represent underlying molecular activities. More specifically, in breast cancer, imaging phenotypes have been associated with gene expression⁵⁶ and molecular profiles^57,58,59,60, further indicating the potential of radiomic features to capture a molecular signature in TNBC. We thus specifically hypothesized that a signature set of radiomic features can predict TS, behaving as its surrogate.

In this study, we analyzed both transcriptomic and radiomic features of TNBC from pre-treatment breast MRI to prognosticate OS and identify high-risk patients with early stage, non-metastatic TNBC. We hypothesized that the 50-gene TS could identify patients at a high risk of death and that a signature set of radiomic features could be identified using a radiogenomic model that maps to the TS, thereby establishing a radiomic signature (RS) for the subgroup of patients with high-risk TNBC. The ultimate goal of our study was to demonstrate the ability of imaging data to be used as an effective, noninvasive prognostication tool in TNBC that is biologically substantiated by TS. Given that many patients with TNBC undergo breast MRI, RS could contribute added value through noninvasive prognostication while decreasing the need for tissue sampling and lab testing.

Results

Characteristics of two TNBC cohorts

All patients in the Institutional cohort (n = 179) were female. Baseline characteristics were closely matched between the Institutional cohort, and its subset, the Transcriptomic cohort (n = 63)⁶¹, including median age at diagnosis, racial composition with predominance of White patients, majority tumor grade 3, proportion of carriers with germline pathogenic BRCA 1/2 mutations, and median follow up time (Table 1). Fewer stage I patients were observed in the Transcriptomic cohort compared with the superset Institutional cohort. Within the Transcriptomic cohort, the distribution of TNBC molecular subtypes as per Lehmann 2011⁹ was as follows: immunomodulatory, IM (n = 15, 23.8%); mesenchymal, M (n = 14, 22.2%); mesenchymal stem like, MSL (n = 11, 17.5%); basal-like 1, BL1, (n = 9, 14.3%), basal-like 2, BL2, (n = 7, 11.1%), and luminal androgen receptor (LAR) (n = 7, 11.1%).

Table 1 Institutional cohort patient and tumor characteristics

Full size table

In the SCAN-B cohort⁶², the majority had stage I or II disease (n = 510; 95.8%) rather than stage III (n = 23; 4.2%). Baseline characteristics of the full SCAN-B cohort have previously been described⁶². The majority of patients in the SCAN-B cohort had grade 3 TNBC (n = 447; 83.86%) compared with grade 1 (n = 7; 0.01%) and grade 2 (n = 79, 14.82%) disease. There were no differences between the Institutional and SCAN-B cohorts in patient age, grade and race. All patients in the Transcriptomic cohort underwent neoadjuvant chemotherapy with carboplatin and gemcitabine, and a subset of patients also received adjuvant chemotherapy⁶³, whereas the chemotherapy regimen used in the SCAN-B cohort was unspecified.

The analytic steps of this study and the final evaluable samples included in each are summarized in Fig. 1. Results from each step are described in detail below.

**Fig. 1: Overall radiogenomic study design and workflow.**

The 50-gene transcriptomic signature set prognosticates OS in TNBC

The 50-gene transcriptomic signature set, previously developed to predict cancer recurrence in 858 TCGA breast cancers was prognostic of OS in two independent TNBC cohorts. In our Transcriptomic cohort, the Transcriptomic Signature (TS)-High and TS-Low groups as stratified by the transcriptomic signature demonstrated a significant difference in OS (Kaplan-Meier log-rank p = 0.012) (Fig. 2A). In the TS-Low group, only one death occurred at 5.5 years, and the median survival was not reached during the 10-year follow up period. In the TS-High group, 8 deaths occurred in the first 5.5 years. Kaplan Meier survival analyses showed a trend in survival difference for tumor grade (p = 0.087) (Supplementary Fig. S2). No other variables exhibited significant survival differences.

**Fig. 2: A 50-gene transcriptomic signature (TS) distinguishes the high-risk patients with poor overall survival (OS) in the Institutional cohort with transcriptomic data (Transcriptomic cohort).**

In the univariate Cox survival analyses, TS-High (HR: 9.09, 95% CI: 1.13–71.40) and high tumor grade (HR: 4.14, 95%CI: 1.19–14.31) were statistically significant predictors of poor survival (Fig. 2B-i). In the multivariate Cox regression model that also included age, stage, and Ki67, only TS-High (HR: 13.60, 95% CI: 1.56–125.00) and tumor grade (HR: 8.20, 95% CI: 1.74–38.61) remained statistically significant (Fig. 2B-ii). The concordance index for the multivariate model with TS-High and tumor grade was 0.827 (SE 0.057).

In the SCAN-B cohort, TS-High demonstrated a significantly worse OS compared with TS-Low (Kaplan-Meier log-rank p = 0.032) (Fig. 3A). While median survival was not reached by either group at 10 years, the upper quartile (75%) survival difference between TS-High and TS-Low groups was 3.9 years (4.9 years vs 8.8 years), and event rates diminished for both risk groups by years 8–10. Kaplan Meier survival analysis showed a significant survival difference for tumor stages I/II versus III (p < 0.0001) and age when stratified by median age of 65 (p < 0.0001) (Supplementary Figs. S3 and S4). No other variables exhibited significant survival differences.

**Fig. 3: A 50-gene transcriptomic signature (TS) distinguishes the high-risk patients with poor overall survival (OS) in the SCAN-B cohort.**

In the univariate Cox survival analyses, TS-High (HR: 1.42 95% CI: 1.02–1.95), age above the median (>65) (HR 1.05, 95% CI 1.04–1.06), and stage III (HR: 3.53, 95% CI 2.07–6.03) were statistically significant predictors of poor survival (Fig. 3B-i, p < 0.05 for all). In the multivariate Cox model that included age, stage and grade, the transcriptomic signature remained a statistically significant prognosticator along with age and stage: TS-High (HR: 1.45, 95% CI: 1.04–2.03), older age (HR: 1.05, 95% CI: 1.03–1.06), and stage (HR: 2.72, 95% CI: 1.54–4.79) (Fig. 3B-ii, p < 0.001 for all).

Comparison of clinical and treatment characteristics between TS risk groups

Comparing clinical features between TS-High and TS-Low risk groups in the Transcriptomic cohort (Table 2), there was a statistically significant difference in the racial composition between the two groups, with fewer Asian and more White patients represented in the TS-High group. Absolute lymphocyte count (ALC) was also statistically significantly lower in the TS-High group (1.20 ± 0.71 vs 0.81 ± 0.43, p = 0.015). We found no significant differences in age, tumor grade, stage, BRCA1/2 germline pathogenic variant status, Ki-67%, or TNBC molecular subtype. In the SCAN-B cohort, no statistically significant differences in age, tumor grade, or stage were observed between the TS-High and TS-Low risk groups, indicating their independence of these clinical features (Supplementary Table S2).

Table 2 Comparison of clinical features between TS-High and TS-Low risk groups in the Institutional-Transcriptomic cohort

Full size table

Comparing pre- and post-neoadjuvant treatment-related features in the Transcriptomic cohort (Table 3), there were no differences between the type of surgery performed (mastectomy versus lumpectomy), or in pre- and post-treatment tumor measurements between the TS risk groups. However, we observed a lower rate of pathologic complete response (pCR) in the TS-High compared with the TS-Low group (23.1% versus 45.2%, p = 0.047). Commensurately, the TS-High group demonstrated a significantly higher residual cancer burden (RCB) value compared with the TS-Low-risk group (p = 0.006).

Table 3 Comparison of treatment-associated features between TS-High and TS-Low risk groups in the Institutional-Transcriptomic cohort

Full size table

MRI radiomic features predict transcriptomic-based prognostic risk-groups

Our radiogenomic model identified a signature set of 20 radiomic features that best predicts the 50-gene transcriptomic-based prognostic risk groups, TS-High versus TS-Low (Fig. 4A). The 20-feature radiomic signature (RS) set, which compositely describes tumor texture (12 features), intensity (6 features), and shape (2 features), resulted in high prediction performance, with mean accuracy score of 0.79 on the training set employing leave-one-out cross-validation. When tested on the held-out test set, accuracy was 0.72, with mean AUC 0.71, F1 0.74, precision 0.67, and recall 0.82. In the Radiomic cohort, the Kaplan-Meier survival analysis demonstrates a clear separation between the RS-High- and RS-Low-risk groups predicted by the 20-feature radiomic signature (log rank p = 0.013) (Fig. 4B). No deaths were observed in the RS-Low-risk group during the 16-year follow up period.

**Fig. 4: A signature set of 20 radiomic features best identifies patients at high risk for death.**

Discussion

We identified a 20-feature MRI-based radiomic signature (RS) of TNBC that predicts a high risk of death. From our retrospective analysis of 749 patients from two independent cohorts, we demonstrated that this RS is a direct proxy for a 50-gene transcriptomic signature (TS) that predicts poor OS in TNBC, providing the underlying biologic substantiation for the high-risk assessment. After 5 years, the RS-High risk group demonstrated a 25% absolute OS difference, or a relative risk of death of 25, compared with the counterpart RS-Low risk group, which exhibited 100% survival.

RS has several potential clinical applications. Identifying the high-risk TNBC subgroup enables the early and proactive consideration of clinical trials of novel therapies since standard treatment will likely be inadequate in producing a favorable long-term clinical outcome. Conversely, identifying the low-risk TNBC subgroup distinguishes patients who not only will respond well to standard treatment, but may also be candidates for clinical trials of de-escalated treatment strategies. RS could also be used for candidate selection in clinical trial enrollment, preferentially including patients with high-risk TNBC or TNBC that harbors the 50-gene transcriptomic alterations for therapeutic targeting. The ability to leverage imaging data in lieu of transcriptomic profiling can expedite the identification of appropriate candidates for alternative treatments and/or clinical trials, while also enhancing knowledge accessibility in resource-limited settings.

Moreover, the ubiquity of imaging data in clinical practice makes RS a more accessible clinical tool than transcriptomic profiling, which is costly, not widely available, and currently, largely limited to research use. The ready availability and non-invasiveness of imaging-based RS make it an ideal surrogate for TS, which is generally harder to acquire. In addition, 3D imaging captures information from the entire tumor volume and avoids sampling bias that accompanies tissue biopsy for transcriptomic profiling. Finally, breast MRI^64,65,66 and radiomic features have demonstrated utility as predictors of tumor malignancy⁶⁷, progression-free survival^64,65,66, response to neoadjuvant treatment^{65,66,68,69,70,71,72,73,74,75}, and for discriminating breast cancer subtypes⁷⁶. These prior experiences strongly suggest the ability of imaging data to capture underlying biological signals that manifest as differences in clinical phenotypes and outcomes.

Delving into the biological underpinning of RS, we demonstrated that the RS is a proxy for the 50-gene TS, which, in turn, is an independent predictor of OS in multivariate Cox analyses of two separate TNBC cohorts. The additional contributions of tumor grade in our Institutional cohort, and of stage and older age in the SCAN-B cohort, align with current clinical views of their importance in impacting OS. The TS signal was higher in the Institutional cohort, likely due to the relative uniformity in ranges of patient age, stage, and treatment, as part of a clinical trial⁶³. In comparison, the SCAN-B cohort entails a wider range of patient demographics, tumor characteristics, and treatments received. These two cohorts were notably generated from disparate health care systems in different countries (U.S., Sweden). Despite these differences, TS remained an independent predictor of survival, suggesting its robustness. Importantly, however, these cohorts were matched in their patient accrual period and predated recent therapeutic changes, including immune checkpoint inhibitor use, which has positively impacted survival. Nevertheless, our radiogenomic model is expected to remain a viable analytic approach that could be applied in newer cohorts of patients receiving immune checkpoint inhibitor treatments.

We identified no apparent associations between TS and clinical features, suggesting the ability of TS to provide added value with additional information. Racial composition and absolute lymphocyte count (ALC) differed between TS-High and TS-Low prognostic groups, but the small sample size precluded a detailed analysis of whether Asian patients are more highly represented in the TS-Low group. The higher ALC in the TS-Low group is consistent with its known ability to predict lower mortality in early-stage TNBC as a surrogate for higher levels of tumor-infiltrating lymphocytes⁷⁷. TS-High was associated with lower pCR rates (p = 0.047) and higher RCB scores (p = 0.006), consistent with poor treatment response and lower survival likelihood, as previously reported⁷⁸.

Molecularly, TS was first developed in predicting breast cancer recurrence across all subtypes in the TCGA-Breast Cohort. In a separate exploratory analysis, we observed a trend towards significance between TS and survival in its TNBC subset (n = 172; log-rank p = 0.097; Supplementary Fig. S6). The ability of TS to accurately predict survival in our TNBC study suggests the presence of a unique set of shared molecular activities that drive tumor progression and accelerate lethal outcomes. Not surprisingly, TS was independent of those encoding estrogen, progesterone, and HER2/neu receptors. Pathway analysis of the 50 TS genes revealed significant associations with eight oncogenic pathways, among which the glucocorticoid receptor pathway was highly enriched (Supplementary Fig. S1). Notably, studies have shown that glucocorticoid receptor levels are altered in up to 40% of TNBC and are also associated with both chemotherapy response rate and time to recurrence in breast cancer^{79,80,81,82,83,84}. In addition, upregulation of the brain development pathway, also highly enriched, has been implicated in breast cancer metastasis⁷⁹. The precise oncogenic roles played by the 50 genes in the context of these pathways warrant further examination. Meanwhile, this 50-gene TS differs from that used for prior TNBC molecular subtype profiling⁹, and while the distribution of molecular subtypes in our Institutional cohort analysis varied from the usual composition, the significance of this difference remains unclear, and we observed no overlap between them.

There are some limitations to this study. The size of the dataset for building our radiogenomic model (n = 63) may have attenuated the predictive signal of TS and RS. We attempted to overcome this limitation by using larger, independent datasets to validate TS (n = 570) and RS (n = 116). However, larger cohorts with both transcriptomic and MRI data may strengthen potentially hidden signals that can improve prediction modeling. Second, the validation dataset for RS (n = 116) from the same institution could introduce unforeseen demographic or treatment biases. Rebuilding the radiogenomic model on a diverse, multi-institutional cohort incorporating both transcriptomic and MRI data would be beneficial. Third, we enhanced the reliability of TS and RS using machine learning techniques, including multicollinearity removal, feature selection, and cross-validation. However, it would be crucial to evaluate the performance and generalizability of our signatures in external, independent patient cohorts from varied clinical settings. Lastly, while the retrospective study design allowed for the long follow-up time and OS data, a prospective study controlling for confounding variables and biases would provide higher evidence for definitive management recommendations, particularly in predicting response to neoadjuvant treatment.

In conclusion, we present a 20-feature MRI-based radiomic signature in non-metastatic TNBC that identifies a subgroup of patients at high risk of death despite standard treatment. This signature could be used in lieu of its surrogate 50-gene transcriptomic signature for identifying patients who warrant exploration of alternative treatments and expediting candidate selection for clinical trials, while simultaneously providing biological substantiation for the high-risk assessment, as well as possible molecular markers for therapeutic targeting. The general availability and noninvasiveness of imaging data render them a relatively accessible and transferable tool that can help support clinical decision-making.

Methods

Datasets

We built our primary study cohort (Institutional cohort) using our pre-existing institutional breast cancer research database, Oncoshare, that integrates electronic medical record data from two different regional healthcare systems (the university-based Stanford Health Care system and the community-based Sutter Health network) with the California Cancer Registry database for patients who have had health care for diagnosis of breast cancer^80,81. All research reported here was performed in accordance with the Declaration of Helsinki, including a waiver of individual consent for research use of de-identified data, and was approved by the Institutional Review Board (IRB Approval number 19482). Using our institutional database, which captured clinical, imaging and survival data between January 2000 and May 2022, we identified patients for our study cohort based on the following inclusion criteria: (1) diagnosed with pathology-confirmed TNBC; (2) staged as non-metastatic (stages I–III); (3) had pre-treatment breast MRI available for analysis; and (4) availability of at least 5 years of follow up data. A total of 179 patients (MRI scanned between 2001 and 2014) met the inclusion criteria for our training dataset derived from our institution. 63 patients from this institutional cohort were previously enrolled in a clinical trial (NCT00813956), had pre-existing transcriptomic (mRNA expression) microarray data obtained from their primary tumor (“Transcriptomic cohort”)^61,63, and were studied separately. The remaining 116 patients from the training cohort with only imaging (radiomic) data available were designated as the “Radiomics cohort”.

The Sweden CAncerome Analysis Network-Breast (SCAN-B) database is derived from a multicenter, Swedish population-based observational study of breast cancer and consists of clinical, survival, and whole transcriptome RNA-sequencing data from over 20,000 patients. We obtained previously published SCAN-B transcriptomic and clinical data⁶² and selected patients with TNBC who had transcriptomic and complete follow-up data for analysis (n = 604). Of these, 71 patients were eliminated (four with stage IV TNBC, 30 with unrecorded tumor stage, and 37 with unrecorded tumor grade), leaving a total of 533 evaluable patients for multivariate analysis.

Predicting overall survival using a 50-gene transcriptomic signature set developed on the TCGA-BRCA cohort

We first aimed to establish the molecular predictor of poor OS using pre-treatment tumoral transcriptomic data, examining whether the differential expression levels of the 50-gene TS identified high-risk patients. We preprocessed the raw microarray data (Affymetrix U133 plus 2.0; Santa Clara, CA) from our Transcriptomic cohort (n = 63) using the Affy Bioconductor R package⁸². We conducted probe-level data extraction, background correction, and quantile normalization for the samples using the robust multi-array average (RMA) algorithm⁸³.

We applied on the Transcriptomic cohort a 50-gene transcriptomic signature set that we previously developed for predicting breast cancer recurrence across all breast cancer subtypes in the TCGA-BRCA cohort^34,35,36. Coefficients by gene are listed in Supplementary Table S1. In our study, we tested the ability of the 50-gene set to prognosticate OS, defined as the time from diagnosis to date of death or of last follow-up. Specifically, we measured the transcriptomic expression levels of the 50 signature genes for each patient and calculated their individual probability of OS based on the linear combination of coefficient weights of the gene expression levels. We used the median risk score value cutoff to stratify patients into TS-High- and TS-Low survival risk groups in the Transcriptomic and SCAN-B cohorts.

In the Transcriptomic cohort, we compared baseline clinical and tumor characteristics between the TS-High and TS-Low risk groups, including age, race, grade, stage, germline BRCA1/2 mutation status, tumor proliferation index (Ki67%), absolute minimum value of the circulating absolute lymphocyte count (ALC), and TNBC molecular subtypes. We also compared treatment-related features, including type of surgery performed (mastectomy or lumpectomy), tumor measurements in the longest dimension both before and after neoadjuvant treatment, achievement of pathologic complete response (pCR) after neoadjuvant treatment, and residual cancer burden (RCB). RCB was estimated following neoadjuvant therapy, using measurements of the primary tumor bed area, cancer cellularity, percentage of in situ disease, number of positive lymph nodes, and the diameter of largest metastasis⁸⁴. Comparisons were made for each variable using t-test, Chi Square test, or the Fisher’s Exact test.

We performed Kaplan-Meier analyses on the Transcriptomic and SCAN-B cohorts with a log-rank test to compare survival between the TS-High- and TS-Low risk groups. We conducted univariate and multivariate Cox Proportional Hazards regression analyses, evaluating the relative impacts of the survival risk strata compared with clinical variables, including age, race, stage (stages I/II or stage III), grade (2 or 3), and tumor proliferation index (Ki67%). We used the “survival” package in R for survival analysis. We reported hazard ratios associated with each variable with p values < 0.05 to signify statistical significance.

Imaging data acquisition and preprocessing

We first curated de-identified, pre-treatment, gadolinium dynamic contrast-enhanced (DCE), T1-weighted 3D breast MRI studies from each of the 179 patients in the Institutional cohort, who underwent imaging between 2001 and 2014. Images were acquired on 1.5 T General Electric scanners (n = 135), 3 T General Electric (n = 36), 1.5 T Siemens (n = 2), 1.5 T Aurora (n = 1), 3T Siemens (n = 1). DCE slice thickness ranged between 0.5 to 2.4 mm, varying by the MRI scanner (1.5 T General Electric: 1—2.4 mm, 3 T General Electric: 0.5–1.4 mm, and 1.5 T Aurora: 1.4 mm). Examination protocols met the general quality acquisition specifications, including for slice thickness and early and delayed DCE series (as per guideline link).

Imaging data preprocessing included spatial normalization to account for variation in image dimensions, voxel spacing, and intensity distributions. Non-informative regions were removed by eliminating background voxels, and image volumes were cropped to retain the anatomical ROI. Following spatial standardization, we performed voxel resampling to unify inter-slice spacing, thereby mitigating discrepancies introduced by acquisition protocols across different scanners. We applied a series of controlled intensity and spatial perturbations. These included randomized cropping and zero-padding to maintain structural consistency, probabilistic flipping along all three spatial axes to simulate anatomical variability, and stochastic intensity transformations involving slight shifts and scaling of normalized pixel values.

Imaging data analysis: Radiomic feature extraction from institutional imaging data

We sought to identify pre-treatment imaging characteristics that best predicted the 50-gene transcriptomic signature. Tumor segmentations around regions-of-interest (ROI) employed 3D U-Net⁸⁵ a convolutional neural network for volumetric image segmentation, augmented by manual segmentation. After undergoing standard preprocessing, 67 manually segmented ROI were initially used to train the 3D U-Net.

Subsequent U-Net-generated segmentation masks were reviewed and manually edited for accuracy using the Multi-Image Analysis GUI software (https://mangoviewer.com/) by three breast radiologists (A.K., S.C., S.O.; 5–9 years of experience), then iteratively added to the training set to improve subsequent performance. Representative segmentation images for three cases are shown in Supplementary Fig. S5. Where smaller satellite lesions were concurrently identified, we selected the largest tumor to segment for each patient. We applied PyRadiomics version 2.2.0⁸⁶ to extract from each segmented tumor a total of 900 radiomic features, representing tumor shape, size, texture, and edge characteristics⁸⁶. Dataset preprocessing included Z-score normalization, followed by removal of multicollinearity using a correlation threshold of 0.65.

TNBC molecular subtype characterization in the Transcriptomic cohort

To examine the association between transcriptomic features with TNBC molecular subtypes as characterized by Lehmann et al. in 2011⁹, we characterized the molecular subtypes in our Transcriptomic cohort using the TNBCtype calculator⁸⁷. For samples that were reported as unclassified using the TNBCtype tool (n = 22), we used the PAMR R package to complete the classification⁸⁸.

Development and validation of radiogenomic prediction model for overall survival associating transcriptomic and radiomic features

We built a radiogenomic prediction model on the Transcriptomic cohort subset with both radiomic and transcriptomic data to establish the association between the pre-treatment radiomic features and the 50-gene TS, seeking to delineate a radiomic signature (RS) that could distinguish between high-risk patients with poor OS and low-risk patients with favorable OS. The 900-feature radiomic data from the Transcriptomic cohort was sub-divided into training (n = 44) and test (n = 19) datasets in a 70:30 split. On the training dataset, after removing feature multicollinearity using a correlation threshold of 0.65, we performed feature selection using the top 20 Kendall’s rank coefficient values, optimized model hyperparameters using GridSearchCV, and applied the Decision Tree Classifier with LeaveOneOut cross-validation to predict the binary outcome of TS-High- or TS-Low-risk OS. The final model developed on the training dataset was applied and evaluated on the test set, and performance metrics were reported.

To increase rigor in our prediction model assessment, we further tested the final model on the institutional Radiomic cohort (n = 116) as a secondary internal validation cohort. We evaluated the differential survival using Kaplan-Meier curves, with censoring at the time of last follow-up, and stratified by Radiomic Signature (RS)-High and RS-Low groups, respectively predicting TS-High- and TS-Low risk groups. We assessed the difference between the two risk groups using the log-rank test. The Cox Proportional Hazards regression model could not be applied because of the lack of death events in the RS-Low group during the follow-up period.

Data availability

The SCAN-B dataset can be obtained from a previously published study⁶². The Institutional data may be available from the corresponding author upon request with potential collaboration.

Code availability

The code has been made available at (https://github.com/Humaira77/TNBC_Radiogenomic) for reproducibility.

References

Almansour, N. M. Triple-negative breast cancer: a brief review about epidemiology, risk factors, signaling pathways, treatment and role of artificial intelligence. Front. Mol. Biosci. 9, 836417 (2022).
Article PubMed PubMed Central CAS Google Scholar
Howard, F. M. & Olopade, O. I. Epidemiology of triple-negative breast cancer: a review. Cancer J. 27, 8–16 (2021).
Article PubMed PubMed Central CAS Google Scholar
Waks, A. G. & Winer, E. P. Breast cancer treatment: a review. JAMA 321, 288–300 (2019).
Article PubMed CAS Google Scholar
Criscitiello, C. et al. Understanding the biology of triple-negative breast cancer. Ann. Oncol. 23, vi13–vi18 (2012).
Article PubMed Google Scholar
Sharma, P. Biology and management of patients with triple-negative breast cancer. Oncologist 21, 1050–1062 (2016).
Article PubMed PubMed Central Google Scholar
Dent, R. et al. Triple-negative breast cancer: clinical features and patterns of recurrence. Clin. Cancer Res. 13,4429–4434 (2007).
Article PubMed Google Scholar
Jwa, E. et al. Locoregional recurrence by tumor biology in breast cancer patients after preoperative chemotherapy and breast conservation treatment. Cancer Res. Treat. 48, 1363–1372 (2016).
Article PubMed PubMed Central CAS Google Scholar
Spring, L. M. et al. Pathologic complete response after neoadjuvant chemotherapy and impact on breast cancer recurrence and survival: a comprehensive meta-analysis. Clin. Cancer Res. 26, 2838–2848 (2020).
Article PubMed PubMed Central Google Scholar
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. Invest. 121, 2750–2767 (2011).
Article PubMed PubMed Central CAS Google Scholar
Lehmann, B. D. et al. Refinement of triple-negative breast cancer molecular subtypes: implications for neoadjuvant chemotherapy selection. PLoS One 11, e0157368 (2016).
Article PubMed PubMed Central Google Scholar
Burstein, M. D. et al. Comprehensive genomic analysis identifies novel subtypes and targets of triple-negative breast cancer. Clin. Cancer Res. 21, 1688–1698 (2015).
Article PubMed CAS Google Scholar
Perou, C. M. Molecular stratification of triple-negative breast cancers. Oncologist 15, 39–48 (2010).
Article PubMed CAS Google Scholar
Jiang, Y. Z. et al. Genomic and transcriptomic landscape of triple-negative breast cancers: subtypes and treatment strategies. Cancer Cell 35, 428–440.e5 (2019).
Article PubMed CAS Google Scholar
Asleh, K., Riaz, N. & Nielsen, T. O. Heterogeneity of triple negative breast cancer: Current advances in subtyping and treatment implications. J. Exp. Clin. Cancer Res. 41, 265 (2022).
Article PubMed PubMed Central Google Scholar
Zhang, D. et al. Comprehensive single-cell transcriptomic profiling reveals molecular subtypes and prognostic biomarkers with implications for targeted therapy in esophageal squamous cell carcinoma. Transl. Oncol. 44, 101948 (2024).
Article PubMed PubMed Central CAS Google Scholar
Sutic, M. et al. Transcriptomic profiling for prognostic biomarkers in early-stage squamous Cell Lung Cancer (SqCLC). Cancers, 16, 720 (2024).
Li, C. et al. Multiregional transcriptomic profiling provides improved prognostic insight in localized non-small cell lung cancer. NPJ Precis. Oncol. 8, 225 (2024).
Article PubMed PubMed Central CAS Google Scholar
Zhao, S. et al. Prognostic relevance of pancreatic adenocarcinoma whole-tumor transcriptomic subtypes and components. Clin. Cancer Res. 27, 6491–6499 (2021).
Article PubMed CAS Google Scholar
Mo, S. et al. Comprehensive transcriptomic analysis reveals prognostic value of an EMT-related gene signature in colorectal cancer. Front. Cell Dev. Biol. 9, 681431 (2021).
Article PubMed PubMed Central Google Scholar
Merry, E. et al. Predictive and prognostic transcriptomic biomarkers in soft tissue sarcomas. NPJ Precis. Oncol. 5, 17 (2021).
Article PubMed PubMed Central Google Scholar
Al-Harazi, O. et al. Identification of gene signature as diagnostic and prognostic blood biomarker for early hepatocellular carcinoma using integrated cross-species transcriptomic and network analyses. Front. Genet. 12, 710049 (2021).
Article PubMed PubMed Central CAS Google Scholar
Thakur, R. et al. Transcriptomic analysis reveals prognostic molecular signatures of Stage I Melanoma. Clin. Cancer Res. 25, 7424–7435 (2019).
Article PubMed PubMed Central CAS Google Scholar
Koshkin, V. S. et al. Transcriptomic and protein analysis of Small-cell Bladder Cancer (SCBC) identifies prognostic biomarkers and DLL3 as a Relevant Therapeutic Target. Clin. Cancer Res. 25, 210–221 (2019).
Article PubMed CAS Google Scholar
Sparano, J. A. et al. Adjuvant chemotherapy guided by a 21-Gene expression assay in breast cancer. N. Engl. J. Med. 379, 111–121 (2018).
Article PubMed PubMed Central CAS Google Scholar
Kim, C. M. et al. A 10-gene signature to predict the prognosis of early-stage triple-negative breast cancer. Cancer Res. Treat. 56, 1113–1125 (2024).
Article PubMed PubMed Central CAS Google Scholar
Yang, Y. S. et al. The early-stage triple-negative breast cancer landscape derives a novel prognostic signature and therapeutic target. Breast Cancer Res. Treat. 193, 319–330 (2022).
Article PubMed CAS Google Scholar
Wang, D. et al. A 16-gene signature associated with homologous recombination deficiency for prognosis prediction in patients with triple-negative breast cancer. Open Med.17, 882–896 (2022).
Article CAS Google Scholar
Su, G. H. et al. A multiomics signature highlights alterations underlying homologous recombination deficiency in triple-negative breast cancer. Ann. Surg. Oncol. 29, 7165–7175 (2022).
Article PubMed Google Scholar
Qin, Y. et al. Tumor microenvironment characterization in triple-negative breast cancer identifies prognostic gene signature. Aging 13, 5485–5505 (2021).
Article PubMed PubMed Central CAS Google Scholar
Liu, Q. et al. Prognostic alternative mRNA splicing signature and a novel biomarker in triple-negative breast cancer. DNA Cell Biol. 39, 1051–1063 (2020).
Article PubMed CAS Google Scholar
Perez-Pena, J. et al. A Transcriptomic immunologic signature predicts favorable outcome in neoadjuvant chemotherapy treated triple negative breast tumors. Front. Immunol. 10, 2802 (2019).
Article PubMed PubMed Central CAS Google Scholar
You, C. et al. Multicenter radio-multiomic analysis for predicting breast cancer outcome and unravelling imaging-biological connection. NPJ Precis. Oncol. 8, 193 (2024).
Article PubMed PubMed Central CAS Google Scholar
Kim, S. et al. MRI radiomic features: association with disease-free survival in patients with triple-negative breast cancer. Sci. Rep. 10, 3750 (2020).
Article PubMed PubMed Central CAS Google Scholar
Pizurica, M. et al. Digital profiling of gene expression from histology images with linearized attention. Nat. Commun. 15, 9886 (2024).
Article PubMed PubMed Central CAS Google Scholar
Ciriello, G. et al. Comprehensive molecular portraits of invasive lobular breast cancer. Cell 163, 506–519 (2015).
Article PubMed PubMed Central CAS Google Scholar
Cancer Genome Atlas, N. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Article Google Scholar
Holbrook, A. I. & Newell, M. S. Magnetic resonance imaging of the breast. Clin. Obstet. Gynecol. 59, 394–402 (2016).
Article PubMed Google Scholar
Leithner, D. et al. Clinical role of breast MRI now and going forward. Clin. Radio. 73, 700–714 (2018).
Article CAS Google Scholar
Yeh, A. C. et al. Radiogenomics of breast cancer using dynamic contrast-enhanced MRI and gene expression profiling. Cancer Imaging 19, 48 (2019).
Article PubMed PubMed Central Google Scholar
Zhu, Y. et al. Deciphering genomic underpinnings of quantitative MRI-based radiomic phenotypes of invasive breast carcinoma. Sci. Rep. 5, 17787 (2015).
Article PubMed PubMed Central CAS Google Scholar
Lambin, P. et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48, 441–446 (2012).
Article PubMed PubMed Central Google Scholar
Zheng, F. et al. Fusion radiomics-based prediction of response to neoadjuvant chemotherapy for osteosarcoma. Acad. Radio. 31, 2444–2455 (2024).
Article Google Scholar
Yan, Q. et al. MRI radiomics and nutritional-inflammatory biomarkers: a powerful combination for predicting progression-free survival in cervical cancer patients undergoing concurrent chemoradiotherapy. Cancer Imaging 24, 144 (2024).
Article PubMed PubMed Central Google Scholar
Wang, T. W. et al. Radiomics of metastatic brain tumor as a predictive image biomarker of progression-free survival in patients with non-small-cell lung cancer with brain metastasis receiving tyrosine kinase inhibitors. Transl. Oncol. 39, 101826 (2024).
Article PubMed CAS Google Scholar
Song, J. Multi-modality MRI radiomics phenotypes in intermediate-high risk endometrial cancer: correlations with histopathology and prognosis. Jpn J. Radiol. 43, 68–77 (2024).
Article PubMed Google Scholar
Marsilla, J. et al. A T2-weighted MRI-based radiomic signature for disease-free survival in locally advanced cervical cancer following chemoradiation: An international, multicentre study. Radiother. Oncol. 199, 110463 (2024).
Article PubMed CAS Google Scholar
Liu, X. et al. Early prognostication of overall survival for pediatric diffuse midline gliomas using MRI radiomics and machine learning: a two-center study. Neuro-Oncology Advances. 6, vdae108 (2024).
Article PubMed PubMed Central Google Scholar
Wagner, M. W. et al. Radiomic features based on MRI predict progression-free survival in pediatric diffuse midline glioma/diffuse intrinsic pontine glioma. Can. Assoc. Radio. J. 74, 119–126 (2023).
Article Google Scholar
Su, Y. et al. Prognostic value of the radiomics-based model in the disease-free survival of pretreatment uveal melanoma: an initial result. J. Comput. Assist. Tomogr. 47, 151–159 (2023).
Article PubMed Google Scholar
Liao, C. Y. et al. Predicting survival after radiosurgery in patients with lung cancer brain metastases using deep learning of radiomics and EGFR status. Phys. Eng. Sci. Med. 46, 585–596 (2023).
Article PubMed Google Scholar
Wang, X. et al. Radiomics predicts the prognosis of patients with locally advanced breast cancer by reflecting the heterogeneity of tumor cells and the tumor microenvironment. Breast Cancer Res. 24, 20 (2022).
Article PubMed PubMed Central CAS Google Scholar
Choi, Y. et al. Radiomics may increase the prognostic value for survival in glioblastoma patients when combined with conventional clinical and genetic prognostic models. Eur. Radio. 31, 2084–2093 (2021).
Article CAS Google Scholar
Wormald, B. W. et al. Radiomic features of cervical cancer on T2-and diffusion-weighted MRI: Prognostic value in low-volume tumors suitable for trachelectomy. Gynecol. Oncol. 156, 107–114 (2020).
Article PubMed PubMed Central Google Scholar
Jiang, L. et al. Radiogenomic analysis reveals tumor heterogeneity of triple-negative breast cancer. Cell Rep. Med. 3, 100694 (2022).
Article PubMed PubMed Central CAS Google Scholar
Itakura, H. et al. Magnetic resonance image features identify glioblastoma phenotypic subtypes with distinct molecular pathway activities. Sci. Transl. Med. 7, 303ra138 (2015).
Article PubMed PubMed Central Google Scholar
Liu, Z. et al. Radiogenomics: a key component of precision cancer medicine. Br. J. Cancer 129, 741–753 (2023).
Article PubMed PubMed Central Google Scholar
Ashraf, A. B. et al. Identification of intrinsic imaging phenotypes for breast cancer tumors: preliminary associations with gene expression profiles. Radiology 272, 374–384 (2014).
Article PubMed Google Scholar
Thakur, S. B. et al. Apparent diffusion coefficient in estrogen receptor-positive and lymph node-negative invasive breast cancers at 3.0T DW-MRI: A potential predictor for an oncotype Dx test recurrence score. J. Magn. Reson. Imaging 47, 401–409 (2018).
Article PubMed Google Scholar
Ha, R. et al. Convolutional neural network using a breast MRI tumor dataset can predict oncotype Dx recurrence score. J. Magn. Reson. Imaging 49, 518–524 (2019).
Article PubMed Google Scholar
Li, H. et al. Deep learning-based pathology image analysis enhances Magee feature correlation with oncotype DX breast recurrence score. Front Med. 9, 886763 (2022).
Article Google Scholar
Vinayak, S. et al. Abstract 2826: Multi-omic profiling to predict response to gemcitabine/ carboplatin (GC) plus iniparib (BSI-201) as neoadjuvant therapy for triple-negative (TN) and BRCA1/2 mutation-associated breast cancer using a pathway-based approach. Cancer Res. 74, 2826–2826 (2014).
Article Google Scholar
Staaf, J. et al. RNA sequencing-based single sample predictors of molecular subtype and risk of recurrence for clinical assessment of early-stage breast cancer. NPJ Breast Cancer 8, 94 (2022).
Article PubMed PubMed Central CAS Google Scholar
Telli, M. L. et al. Phase II Study of Gemcitabine, Carboplatin, and Iniparib as neoadjuvant therapy for triple-negative and BRCA1/2 mutation-associated breast cancer with assessment of a tumor-based measure of genomic instability: PrECOG 0105. J. Clin. Oncol. 33, 1895–1901 (2015).
Article PubMed PubMed Central CAS Google Scholar
Zhao, J. et al. MRI-based radiomics approach for the prediction of recurrence-free survival in triple-negative breast cancer after breast-conserving surgery or mastectomy. Medicine 102, e35646 (2023).
Article PubMed PubMed Central CAS Google Scholar
Yu, X. et al. Triple-negative breast cancer: predictive model of early recurrence based on MRI features. Clin. Radio. 78, e798–e807 (2023).
Article CAS Google Scholar
Ma, M. et al. Radiomics features based on automatic segmented MRI images: Prognostic biomarkers for triple-negative breast cancer treated with neoadjuvant chemotherapy. Eur. J. Radio. 146, 110095 (2022).
Article Google Scholar
Pesapane, F. et al. Recent Radiomics advancements in breast cancer: lessons and pitfalls for the next future. Curr. Oncol. 28, 2351–2372 (2021).
Article PubMed PubMed Central Google Scholar
Yamaguchi, A. et al. Kinetic information from dynamic contrast-enhanced MRI enables prediction of residual cancer burden and prognosis in triple-negative breast cancer: a retrospective study. Sci. Rep. 11, 10112 (2021).
Article PubMed PubMed Central CAS Google Scholar
Eom, H. J. et al. Predictive clinicopathologic and dynamic contrast-enhanced MRI findings for tumor response to neoadjuvant chemotherapy in triple-negative breast cancer. AJR Am. J. Roentgenol. 208, W225–W230 (2017).
Article PubMed Google Scholar
Nemeth, A. et al. Multicontrast MRI-based radiomics for the prediction of pathological complete response to neoadjuvant chemotherapy in patients with early triple negative breast cancer. MAGMA 34, 833–844 (2021).
Article PubMed CAS Google Scholar
Jimenez, J. E. et al. A model combining pretreatment MRI radiomic features and tumor-infiltrating lymphocytes to predict response to neoadjuvant systemic therapy in triple-negative breast cancer. Eur. J. Radio. 149, 110220 (2022).
Article Google Scholar
Wu, C. et al. MRI-based digital models forecast patient-specific treatment responses to neoadjuvant chemotherapy in triple-negative breast cancer. Cancer Res. 82, 3394–3404 (2022).
Article PubMed CAS Google Scholar
Houser, M. & Rapelyea, J. A. Using synthetic MRI and radiomics to predict treatment response in triple-negative breast cancer. Radio. Imaging Cancer 5, e230095 (2023).
Article Google Scholar
Hwang, K. P. et al. A radiomics model based on synthetic MRI acquisition for predicting neoadjuvant systemic treatment response in triple-negative breast cancer. Radio. Imaging Cancer 5, e230009 (2023).
Article Google Scholar
Panthi, B. et al. Assessment of response to neoadjuvant systemic treatment in triple-negative breast cancer using functional tumor volumes from longitudinal dynamic contrast-enhanced MRI. Cancers 15, 1025 (2023).
Article PubMed PubMed Central CAS Google Scholar
Romeo, V. et al. A simultaneous multiparametric (18)F-FDG PET/MRI radiomics model for the diagnosis of triple negative breast cancer. Cancers 14, 3944 (2022).
Article PubMed PubMed Central CAS Google Scholar
Afghahi, A. et al. Higher absolute lymphocyte counts predict lower mortality from early-stage triple-negative breast cancer. Clin. Cancer Res. 24, 2851–2858 (2018).
Article PubMed PubMed Central Google Scholar
Marczyk, M. et al. Treatment Efficacy Score-continuous residual cancer burden-based metric to compare neoadjuvant chemotherapy efficacy between randomized trial arms in breast cancer trials. Ann. Oncol. 33, 814–823 (2022).
Article PubMed CAS Google Scholar
Zheng, T. et al. Molecular mechanisms of breast cancer metastasis by gene expression profile analysis. Mol. Med. Rep. 16, 4671–4677 (2017).
Article PubMed PubMed Central CAS Google Scholar
Weber, S. C. et al. Oncoshare: lessons learned from building an integrated multi-institutional database for comparative effectiveness research. AMIA Annu. Symp. Proc. 2012, 970–978 (2012).
PubMed PubMed Central Google Scholar
Kurian, A. W. et al. Breast cancer treatment across health care systems: linking electronic medical records and state registry data to enable outcomes research. Cancer 120, 103–111 (2014).
Article PubMed Google Scholar
Gautier, L. et al. Affy-analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315 (2004).
Article PubMed CAS Google Scholar
Irizarry, R. A. et al. Exploration, normalization, and summaries of high-density oligonucleotide array probe-level data. Biostatistics 4, 249–264 (2003).
Article PubMed Google Scholar
Symmans, W. F. et al. Measurement of residual breast cancer burden to predict survival after neoadjuvant chemotherapy. J. Clin. Oncol. 25, 4414–4422 (2007).
Article PubMed Google Scholar
Çiçek, Ö. et al. 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation. Cham: Springer International Publishing. (2016).
van Griethuysen, J. J. M. et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 77, e104–e107 (2017).
Article PubMed PubMed Central Google Scholar
Chen, X. et al. TNBCtype: A subtyping tool for triple-negative breast cancer. Cancer Inf. 11, 147–156 (2012).
Google Scholar
Tibshirani, R. et al. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. USA 99, 6567–6572 (2002).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

The authors would like to thank Dr. Sandy Napel for his technical contributions to this study. This research was supported by the NIH Big Data 2 Knowledge initiative via the National Institute of Environmental Health Sciences under Award Number K01ES026832, the Stanford Cancer Institute Women’s Cancer Innovation Award, the Emerson Collective Cancer Research Award, the Cancer League Award, the Smart Foundation Research Award, and funds from the Division of Oncology in the Department of Medicine at the Stanford University School of Medicine. This work was also supported by the Breast Cancer Research Foundation, the Susan and Richard Levy Gift Fund, the Suzanne Pride Bryan Fund for Breast Cancer Research, the Jan Weimer Junior Faculty Chair in Breast Oncology, the Regents of the University of California’s California Breast Cancer Research Program (16OB-0149 and 19IB-0124), the BRCA Foundation, the G. Willard Miller Foundation, the Carole and Alan Kushnir Charitable Fund, the Nancy McDaniel PlayForHer Fund, and the Biostatistics Shared Resource of the NIH-funded Stanford Cancer Institute (P30CA124435). The collection of cancer incidence data used in this study was supported by the California Department of Public Health pursuant to California Health and Safety Code Section 103885; the Centers for Disease Control and Prevention’s National Program of Cancer Registries, under Cooperative Agreement No. 1NU58DP007156; and the National Cancer Institute’s SEER Program under Contract No. HHSN261201800032I awarded to the University of California, San Francisco, Contract No. HHSN261201800015I awarded to the University of Southern California, and Contract No. HHSN261201800009I awarded to the Public Health Institute, Cancer Registry of Greater California. This research used data or services provided by STARR, “STAnford medicine Research data Repository,” a clinical data warehouse containing live Epic data from Stanford Health Care, the Stanford Children’s Hospital, the University Healthcare Alliance and Packard Children’s Health Alliance clinics and other auxiliary data from Hospital applications such as radiology PACS. STARR platform is developed and operated by the Stanford Medicine Research Technology team and is made possible by the Stanford School of Medicine Research Office and the Stanford CTSA Award Number UL1TR003142. The ideas and opinions expressed herein are those of the authors and do not necessarily reflect the opinions of the State of California, Department of Public Health, the National Cancer Institute, and the Centers for Disease Control and Prevention or their contractors and subcontractors.

Author information

Authors and Affiliations

Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
Humaira Noor, Yuanning Zheng & Olivier Gevaert
Department of Medicine, Division of Oncology, Stanford University School of Medicine, Stanford, CA, USA
Adam B. Mantz, Ryle Zhou, Melinda L. Telli, Allison W. Kurian, James M. Ford, Shaveta Vinayak, Vishal Joshi, Kevin Lee, George W. Sledge & Haruka Itakura
Department of Radiology, Stanford University School of Medicine, Stanford, CA, USA
Andrew Kozlov, Wendy B. DeMartini, Shu-Tian Chen, Satoko Okamoto, Debra M. Ikeda & Sarah A. Mattonen
Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
Allison W. Kurian
Stanford University Research Technology/Data Services, Stanford, CA, USA
Mina Satoyoshi
Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
Olivier Gevaert

Authors

Humaira Noor
View author publications
Search author on:PubMed Google Scholar
Yuanning Zheng
View author publications
Search author on:PubMed Google Scholar
Adam B. Mantz
View author publications
Search author on:PubMed Google Scholar
Ryle Zhou
View author publications
Search author on:PubMed Google Scholar
Andrew Kozlov
View author publications
Search author on:PubMed Google Scholar
Wendy B. DeMartini
View author publications
Search author on:PubMed Google Scholar
Shu-Tian Chen
View author publications
Search author on:PubMed Google Scholar
Satoko Okamoto
View author publications
Search author on:PubMed Google Scholar
Debra M. Ikeda
View author publications
Search author on:PubMed Google Scholar
Melinda L. Telli
View author publications
Search author on:PubMed Google Scholar
Allison W. Kurian
View author publications
Search author on:PubMed Google Scholar
James M. Ford
View author publications
Search author on:PubMed Google Scholar
Shaveta Vinayak
View author publications
Search author on:PubMed Google Scholar
Mina Satoyoshi
View author publications
Search author on:PubMed Google Scholar
Vishal Joshi
View author publications
Search author on:PubMed Google Scholar
Sarah A. Mattonen
View author publications
Search author on:PubMed Google Scholar
Kevin Lee
View author publications
Search author on:PubMed Google Scholar
Olivier Gevaert
View author publications
Search author on:PubMed Google Scholar
George W. Sledge
View author publications
Search author on:PubMed Google Scholar
Haruka Itakura
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization: HI. Data curation: HI, YZ, ABM, RZ, AK, WBD, SC, SO, DMI, MLT, AWK, JMF, SV, MS, VJ, SM, KL. Data analysis: HN, YZ, ABM. Funding acquisition: HI, AWK, GWS. Investigation: HI, HN, ABM, RZ. Methodology: HI, OG, HN. Project administration: HI. Data interpretation: HI, HN, GWS. Validation: HN. Visualization: HN. Writing – original draft preparation: HI, HN. Writing – review & editing: HI, HN; all authors contributed to the final review.

Corresponding author

Correspondence to Haruka Itakura.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary File 1_revised_27.5.25

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Noor, H., Zheng, Y., Mantz, A.B. et al. A 20-feature radiomic signature of triple-negative breast cancer identifies patients at high risk of death. npj Breast Cancer 11, 79 (2025). https://doi.org/10.1038/s41523-025-00790-3

Download citation

Received: 05 December 2024
Accepted: 30 June 2025
Published: 26 July 2025
Version of record: 26 July 2025
DOI: https://doi.org/10.1038/s41523-025-00790-3