Utilizing cohort-level and individual networks to predict best response in patients with metastatic triple negative breast cancer

Bottomly, Daniel; Zheng, Christina; Creason, Allison L.; Mitri, Zahi I.; Mills, Gordon B.; McWeeney, Shannon K.

doi:10.1038/s41698-025-00959-w

Download PDF

Article
Open access
Published: 13 June 2025

Utilizing cohort-level and individual networks to predict best response in patients with metastatic triple negative breast cancer

npj Precision Oncology volume 9, Article number: 179 (2025) Cite this article

2325 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Given the highly aggressive and heterogeneous nature of metastatic triple-negative breast cancer, molecular subtypes have been evaluated for their utility in patient stratification and therapeutic selection. Leveraging both our unique longitudinal multimodal analysis of serial tumor biopsies, as well as existing public reference cohorts, we refined clinically relevant molecular subtypes through de-novo network-based approaches. A plasma/B-cell related co-expression module emerged as a robust predictor of clinical response. Refinements of this module were significantly associated with pathological complete response and survival in the CALGB and METABRIC cohorts, as well as dramatically improving the call rate in a CLIA setting. We explored patient-specific networks to monitor individual adaptive responses to therapy, allowing for dynamic adjustments in treatment strategies. Our work supports the shift from traditional molecular subtyping towards a more integrated view that includes the tumor microenvironment and immune landscape in a network-based context.

Identification of HER2-positive breast cancer molecular subtypes with potential clinical implications in the ALTTO clinical trial

Article Open access 29 November 2024

Discovery of a new mitophagy-related gene signature for predicting the outlook and immunotherapy in triple-negative breast cancer

Article Open access 25 February 2025

Molecular subtype conversion in CTCs as indicator of treatment adequacy associated with metastasis-free survival in breast cancer

Article Open access 05 December 2022

Introduction

Triple negative breast cancer (TNBC), characterized by a lack of expression of estrogen (ER) and progesterone (PR) receptors and a lack of human epidermal growth factor receptor 2 (HER2) amplification, is the most aggressive breast cancer subtype with the least favorable outcomes. TNBC is also characterized by high tumor heterogeneity, which has made the development of therapies that provide a durable response challenging. The development of a TNBC molecular classification system for patient stratification has been an area of focus over the last two decades^1,2,3,4,5.

Our phase II clinical trial (NCT03801369; Adaptive multi-drug treatment of evolving cancers (AMTEC)) is evaluating the efficacy of the combination of the Poly (ADP-ribose) polymerase (PARP) inhibitors (PARPi) olaparib and the programmed death-ligand 1 (PD-L1) inhibitor durvalumab for the treatment of BRCA^wt metastatic TNBC (mTNBC) patients with a longitudinal analysis of serial tumor samples in real-time to identify adaptive mechanisms of resistance as they emerge in response to treatment. This longitudinal characterization includes comprehensive multimodal analysis of serial liquid and tumor biopsies utilizing the Oregon Health & Science University Knight Cancer Institute precision oncology platform, Serial Measurements of Molecular and Architectural Responses to Therapy (SMMART).

Initial analysis of the AMTEC data⁶ indicated that one of the most informative predictors of response was the molecular subtypes Basal-Like Immune Activated (BLIA), Luminal Androgen Receptor (LAR) or Basal-Like Immunosuppressed (BLIS), termed the Burstein subtypes⁴. The Burstein expression subtypes were originally identified in TNBC tumors, with those with the BLIS or LAR subtypes having poor prognosis while BLIA tumors had improved outcomes⁴. In AMTEC, we used collapsed versions of these subtypes, which were seen to correspond to poor survival outcomes (BLIS/LAR) vs better survival outcomes (Non-BLIS/LAR). Although they were highly predictive in our cohort, one challenge with the Burstein subtypes in the clinical setting (i.e., CLIA laboratory) was the classification of a subset of patient samples correlated with multiple subtypes. Currently, they are given a “No Call” or “Indeterminant (IND)” in the CLIA setting by our diagnostic laboratory and not reported (12/26 (46.2%) patient samples) for clinical use.

We wanted to determine if the BLIA/BLIS/LAR subtypes could be further refined and reduce the number of “No Call” determinations. In order to explore the prognostic immune signatures displayed by our multi-modal dataset, we examined the utility of de-novo coexpression network-based approaches given that progression would likely reflect underlying perturbations of complex intracellular networks. Recognizing the limited sample size in the AMTEC trial for network inference, we leveraged The Cancer Genome Atlas (TCGA) Breast Cancer samples⁷ to create a reference (pre-treatment) cohort. We evaluated the network-based signatures as predictors of best clinical response. In addition, we wanted to assess the utility of patient specific networks to allow us to identify individual network rewiring (due to adaptive responses to perturbation such as treatment) which could be utilized for monitoring disease outcome and therapy selection in our precision oncology tumor boards.

Results

Development of a reference cohort and reference networks

We leveraged 152 Basal-classified Cancer Genome Atlas (TCGA) Project Breast Cancer samples⁷ to provide a pre-therapy reference cohort for assessment of the degree of therapeutic changes in clinical trial patient samples. Given that these therapy-related expression perturbations often lead to the rewiring or alteration of relevant gene networks, we performed weighted gene co-expression network analysis (WGCNA⁸) to provide the pre-treatment reference networks and identified five topologically significant gene expression modules (also known as subnetworks) across the basal TCGA samples (See Fig. 1 for a diagram of the design and Supplementary Data S1 for gene module membership) and Methods for description of topological significance.

**Fig. 1: Overall workflow for co-expression module formation.**

Clinical trial cohort

We leveraged 13 patients with metastatic triple negative breast cancer (mTNBC) from the phase II clinical trial (NCT03801369; Adaptive multi-drug treatment of evolving cancers (AMTEC))^6,9,10 each having paired pre-treatment (Bx1) and on-treatment (combination of olaparib and durvalumab; Bx2) samples (denoted as the “AMTEC cohort”). Additionally, we formed a separate group of 10 patient samples (5 Bx1 and 5 Bx2, 3 of which were paired) termed the “Validation cohort”. For this study, the main outcome was based on the best response achieved by a given patient as part of the trial. These outcomes were defined as either progressive disease (PD), stable disease (SD), or partial response (PR). For classification, we further binned patients into those who achieved a best response of SD or PR, termed responders, vs those who did not (PD), termed non-responders.

Module characterization

Three of the five modules (subnetworks) were significantly enriched for MSigDB Hallmarks¹¹ gene signatures (FDR < 0.05; Fig. 2a), therefore, for the purposes of this manuscript, those are the ones that were focused on. As the modules by definition are highly correlated gene sets, we computed the first two principal component (PC) scores for our three modules of interest and predicted the corresponding values for our AMTEC cohort to ensure compatibility. Both biopsies from AMTEC patients completely overlapped within the range of PC1 and PC2 values of TCGA (Supplementary Fig. S1). For both TCGA and AMTEC patients, we used the PC1 score as the representative value for each module, termed the “eigengene”¹² or “module eigengene”.

**Fig. 2: Co-expression modules formed from Basal classified TCGA reflect EMT and Immune processes.**

Module 1 (Mod1 or Turquoise) was most significantly enriched for epithelial-to-mesenchymal transition (EMT). Module 2 (Mod2 or Blue) was enriched for allograft rejection, complement, inflammatory response, interferon alpha and gamma response, as well as TNFA and IL6-JAK-STAT3 signaling. While Module 3 (Mod3 or Brown) was only enriched for Xenobiotic metabolism. Correlating the modules with Reverse Phase Protein Array (RPPA) antibodies in TCGA showed that both Mod2 (Blue) and Mod3 (Brown) were associated with immune response while Mod1 (Turquoise) was characterized by genes influencing EMT processes such as Vinculin^13,14 and PDGFR-B^15,16 as well as cell adhesion such as MYOSIN1A¹⁷ and FAK¹⁸ (Fig. 2b). Modules 1-3 were likewise significantly associated with the “Burstein” BLIA and BLIS subtypes⁴ with Mod2 (Blue) and Mod3 (Brown) increased in BLIA relative to BLIS while Mod1 (Turquoise) was increased in BLIS (Fig. 2c). Interestingly, the Mod2 (Blue) eigengene by itself could predict BLIA/BLIS status in TCGA with 93.6% accuracy (MCC: 0.873, F1: 0.938) while Mod3 (Brown) was less predictive at 77.3% (MCC: 0.545, F1: 0.783) using single-feature logistic regression models. Overall, this indicated that although Modules 2 and 3 were related, Mod2 (Blue) serves as an immunomodulatory feature. The correlation of a gene’s expression profile with its assigned module eigengene (termed kME) can serve as a measure of membership in that module¹². In practice, this means that high kME values indicate that the gene’s expression pattern closely mirrors its corresponding module eigengene. Given the uninformative of the MSigDB Hallmarks, we further assessed Mod3 (Brown) by annotating the most influential genes in the signature as indicated by their correlation with the module eigengene. As can be seen in Fig. 2d, the Mod3 (Brown) expression pattern reflects genes associated with B and Plasma cells, such as MZB1, CD79A, and MS4A1 (CD20), a standard B-cell marker. Significant Pearson’s correlations in both Bx1 (P-value = 0.023) and Bx2 (P-value < 0.001) were seen between the Mod3 (Brown) eigengenes and GSVA scores of a previously reported B-cell gene set¹⁹ (Fig. 2e). This was further confirmed by our multiplex immunohistochemistry (mIHC) data as the Mod3 (Brown) eigengene highly correlates with CD20+ cell density in AMTEC Bx1 samples (Fig. 2f). In total, this led to us to attributing Mod2 (Blue) and Mod3 (Brown) to distinct immune processes/cell types while Mod1 (Turquoise) likely represented EMT/cell adhesion processes.

Shepard et al.²⁰ examined RNA sequencing from pre-treatment TNBC tumor biopsies as part of the CALGB 40603 clinical trial. After analyzing the B and T cell repertoire, they found that low diversity (in terms of the Evenness measure²⁰—see below) of immunoglobulin G (IgG) was associated with both pathologic complete response and event-free survival²⁰. To explore this further, we additionally analyzed the B and T cell repertoires in the AMTEC samples. In agreement with our characterization of Mod3 (Brown), the eigengene was significantly correlated (all P-values < 0.001) with the abundance of all three immunoglobulin chains (IGH, IGK, and IGL; Supplementary Fig. S2a). We also assessed diversity measures for the IgG class for AMTEC. Depending on the measure, patients whose best response was PD tended to have low abundance and trended towards lower or no difference in diversity as compared to responders (SD/PR) (Supplementary Fig. S2b).

The Evenness measure²⁰, which is defined as the entropy normalized by the log of the number clonotypes, showed no difference between response groups (Supplementary Fig. S2c). Adjusting for the large difference in read counts for the IGH chain between samples by down-sampling, we observed that non-responders (PD) had marginally less (P-value = 0.048) Evenness in Bx1 samples compared to responders (SD/PR) (Supplementary Fig. S2d).

Evaluation of potential confounders

Next, we explored the relationship between biopsy tissue site and tumor purity (two key potential confounders) with clinical response across our three modules. Tumor purity was not seen to be predictive of response to therapy using logistic regression in either Bx1 (P-value = 0.210) or Bx2 (P-value = 0.811) (Supplementary Fig. S3a). However, in Bx1, there were suggestive linear correlations with Mod1 (Turquoise; P-value = 0.044) and Mod3 (Brown; P-value = 0.013), but not in Bx2 (Supplementary Fig. S3b). From this analysis we observed that lymph node biopsies tended to have higher Mod3 (Brown) eigengene values in Bx1 but not in Bx2. Although most (5/8) samples from patients who achieved a response were derived from lymph node biopsies, it was not seen to be predictive of clinical response (logistic regression; P-value = 0.155). Similarly, the module eigengenes were not significantly different by Welch’s T-test between patient samples derived from lymph node vs those derived from other tissues (Supplementary Fig. S3c). Therefore, based on this data, purity and tissue seemed to have limited impact on clinical outcomes and co-expression modules.

Predicting AMTEC patient response using Mod3 (Brown)

First, we examined the pattern of the eigengenes (PC1 scores) for the three modules in AMTEC with respect to the corresponding Burstein CLIA calls and clinical response (Supplementary Fig. S4). From this comparison, both Mod2 (blue) and Mod3 (brown) had low values for PD patients and patients with Burstein BLIS CLIA calls in both biopsies. For most patient samples, Mod2 and 3 were overall increased for SD or PR patients in Bx2. On the other hand, Mod1 (turquoise) had a mixture of high and low scoring patient samples in Bx1 though they were consistently increased in PD patients for Bx2.

Next, we compared the ability of the three expression module signatures to separate PD from SD or PR patients using only their module eigengenes in Bx1. Mod1 (turquoise) and Mod2 (blue) both showed poor accuracy (61.5%, MCC: N/A, F1: N/A), Mod3 (Brown), however, was of particular interest because it achieved 92.3% accuracy (MCC: 0.854, F1: 0.909) in the main AMTEC cohort (Fig. 3a, Supplementary Fig. S1). For Bx2, for Mod1 (turquoise) again had poor performance in the main cohort while both Mod2 (blue) and Mod3 (brown) had increased accuracy (84.6% (MCC: 0.732, F1: 0.833) and 100% (MCC: 1.0, F1: 1.0), respectively; Supplementary Fig. S1). Based on its promising performance for both biopsies in the AMTEC cohort as compared to Mod1 and Mod2, and well as the desire to use this for longitudinal monitoring, we focused on Mod3 (brown) and termed the classifier based on its eigengene mTNBC3e.

**Fig. 3: Co-expression module 3 (Brown) is highly prognostic in the AMTEC cohort.**

However, one challenge with using eigengene-based classifiers is the relative difficulty of externally validating co-expression subnetwork results due to differences in assay platforms and annotation. To increase applicability, gene sets like the MSigDB Hallmarks¹¹ are derived from consistently expressed gene sets. Therefore, one approach to validating expression modules would be to transform them into distinct and consistently expressed gene sets and use a rank-based method such as single-sample GSEA²¹ (or alternatively GSVA²²) to score each sample with respect to each module. In this manner, we leveraged the kME to choose two smaller sets of genes which could be used in more robust rank-based approaches (Fig. 3b). The first of these gene-sets was formed by keeping the thirty-four genes with a kME greater than 0.9, which we referred to as mTNBC3s while the second gene-set used only the top 3 genes (MZB1, IGKC, and CD79A) by kME (termed mTNBC3s_top3). We scored samples using single-sample GSEA and learned the optimal score to separate non-responders (PD) from responders (SD/PR) (see Methods). We found that resulting classifiers based on either of the two gene-sets had the same or better performance as using the mTNBC3e (Fig. 3c). The accuracy for mTNBC3s was 100% (MCC: 1.0, F1: 1.0) and for mTNBC3s_top3 was 92.3% (MCC: 0.854, F1: 0.909).

Finally, in addition to using the eigengene as the module representative, often the gene with the highest kME can be considered a module hub gene while maintaining similar predictive performance²³. In this case, using the centered and scaled expression of only the MZB1 gene maintains similar accuracy (92.3%, MCC: 0.843, F1: 0.889) as mTNBC3e (Fig. 3c).

Given the strong B cell context of Mod 3 (Brown), we assessed the overlap with other similar published signatures. B-cells have been proposed to be a strong predictor of response to either chemotherapy or anti-PD-L1 + chemotherapy²⁴. Sub-clustering of the Zhang et al.²⁴ single-cell data led to the formation of four main B-cell type signatures (pB, Bfoc, BN, and Bmem). Of these, 3/4 genes in the pB gene-set overlapped with genes in Mod3 (Brown). Using the scoring methodology described in Zhang et al.²⁴, we observed moderate performance in the AMTEC cohort (Accuracy: 84.6%, MCC: 0.732, F1: 0.833) but lower than our mTNBC3s series classifiers (Fig. 3c).

A key question was if the network-based signatures could “rescue” the patient samples with Burstein indeterminate subtypes (CLIA “No Calls”), allowing these patients to be classified to aid in therapeutic clinical trial decision aids. Utilizing the mTNBC3s predictor to assign predicted progressive disease calls (pPD) as BLIS and predicted responders (pSD/PR) as BLIA for the indeterminate samples resulted in a subtype/classification approach with high accuracy for either biopsy (Bx1 Accuracy: 92.3%, MCC: 0.843, F1: 0.889; Bx2 Accuracy: 100%, MCC: 1.0, F1: 1.0) and allowed us to provide calls for the remaining 46.2% of the ATMEC samples that were classified as “No Calls” (Fig. 3c).

Validation of B-cell related classifiers

Four of the five classifiers trained on the AMTEC data, including the Zhang’s pB signature, were able to predict patient response in our hold-out validation cohort with high accuracy (100% for Bx1, MCC: 1.0, F1: 1.0), MZB1 was the exception (Accuracy: 80%, MCC: 0.612, F1: 0.857). Three of the five achieved 80% accuracy (mTNBC3s and mTNBC3s_top3 MCC: 0.612, F1: 0.667; Zhang’s pB MCC: 0.667, F1: 0.8) for Bx2. Again, MZB1 as well as mTNBC3e only achieved 60% accuracy (MCC: N/A; F1: N/A) on Bx2. Due to the performance in the pre-treatment biopsy of AMTEC, we believe that these classifiers are potentially prognostic and do not appear to predict response to the AMTEC therapy. To further explore the generalizability of these classifiers as potential prognostic markers, we performed validation in two external breast cancer datasets. The CALGB 40603 dataset consisted of 389 locally advanced TNBC patients who received neoadjuvant chemotherapy with pre-treatment RNASeq samples²⁰. Using a 277 patient subset that were classified as having Basal subtypes, we found that classifications based on MZB1, mTNBC3s and mTNBC3s_top3 were significantly associated with pathological complete response (pCR) in the breast (logistic regression; OR: 2.097, 95% CI: 1.274–3.479, P-val: 0.004; OR: 2.315, 95% CI: 1.217–4.526, P-val: 0.012; OR: 1.918, 95% CI: 1.109–3.353, P-val: 0.021; respectively; Supplementary Data S2a). The mTNBC3s classifier was further able to significantly differentiate patients based on event-free survival (single-predictor CoxPH; HR: 0.577, 95% CI: 0.346–0.961, P-val: 0.035) (Fig. 4a; Supplementary Data S2a). We noted that the mTNBC3s_top3 was not significant based on the AMTEC data cutoff for high vs low (single-predictor CoxPH; HR: 0.705, 95% CI: 0.445–1.117, P-val: 0.136; Supplementary Data S2a). However, single-predictor Cox Proportional Hazards models for both mTNBC3s and mTNBCs_Top3 were significant (HR: 0.311, 95% CI: 0.120–0.804, P-val: 0.016; HR: 0.439, 95% CI: 0.206–0.936, P-val: 0.033; respectively) without binning the actual scores; highlighting the potential for further refinement of the cutoff value (Supplementary Data S2a).

**Fig. 4: Validation of prognostic B-cell biomarkers.**

Although mTNBC3s couldn’t be tested in Basal-classified METABRIC²⁵ patients as only 5/34 genes were present in the processed microarray data, mTNBC3s_top3 was able to differentiate a subset of 20 patients having poor overall survival (P-value = 0.001; logRank; Fig. 4b; Supplementary Data S2b). In addition, it remained significant after adjusting for clinical covariates in the Cox proportional hazards model discussed previously²⁵ (Cox PH Likelihood ratio test; P-value = 0.009; Supplementary Data S2c). These results indicated that the network-based mTNBC3 and mTNBC3_top3 classifiers, although originally derived in the context of metastatic TNBC, are potentially prognostic and generalizable beyond their original context.

Patient-specific network measures predict best response

To realize the full utility of network medicine and provide informative readouts for precision oncology tumor boards, patient-specific networks are key to assess individual baseline and response to treatment. In order to explore individual changes in wiring, we adapted the LIONESS approach^26,27 to further decompose each of the 3 co-expression modules of interest into their estimated patient sample-specific subnetworks based on interpolation from the reference cohort. This resulted in 78 total subnetworks. For each subnetwork we computed a set of gene-specific summary measures including Connectivity, the Maximum Adjacency Ratio (MAR), and the Clustering Coefficient²⁸ (Fig. 5a). Adjacency and Connectivity are fundamental network measures with Adjacency defined as the strength of the connection/association between two genes ranging between 0-1, whereas Connectivity is defined as the per-gene sum of the Adjacencies to all the other genes (i.e., sum of its connection strengths with all other genes in the network). We also computed overall measures including Density, Centralization and Heterogeneity, metrics that could quantify potential re-wiring at the module level²⁸ (Fig. 5a). However, in AMTEC, these measures were not predictive of response so only the gene-specific network features (Connectivity, MAR, and Clustering Coefficient) were considered further.

**Fig. 5: Patient-specific subnetwork rewiring predicts response.**

Given the strength of the observed expression differences in Mod3 (Brown), we were interested in determining if these gene-specific network measures provided additional information beyond simply reflecting overall differences in expression. We first carried out a differential expression analysis between responders (SD/PR) and non-responder patients (PD) separately for each biopsy. Next, we carried out a similar analysis for each of the three gene-level network features. We visualized the T-statistics from these results using ‘sector’ plots²³. In these plots, sectors 2 and 5 indicate differences due only to sub-network differences, while sectors 1,3,4, and 6 indicate genes which may have both an expression and network component (Supplementary Fig. S5a). Focusing on the latter sectors for Connectivity we found that the IFI27 gene from Mod2 (Blue) was one of the top genes (not in Mod3 (Brown)) for Bx1 (Fig. 5b). Examining the average Adjacency matrices for Mod2 (Blue), we see that there is a clear decrease in connection strength for the responders (SD/PR) relative to non-responders (PD) for IFI27 (Fig. 5c). Note that no genes had significantly different expression in Bx1, but 77 were found in Bx2 (FDR < 0.05; Supplementary Data S3a). Similarly, no genes had significantly different network features for either biopsy (Supplementary Data S3b). Interestingly, for IFI27, a combination of expression and connectivity provided separation of responders from non-responders in both AMTEC and the Validation cohort (Fig. 5d).

Temporal differences in gene-level network measures predict best response

Given the paired biopsy nature of the study, we were interested in assessing the informativeness of the temporal differences in expression and network features. We performed differential expression and network-wiring testing to determine whether the average differences between Bx2 and Bx1 were different between responders (SD/PR) and non-responders (PD). Again, we used sector plots to visualize the relationship between expression and the network features (Supplementary Fig. S5b). The KRT23 gene relative to the EMT-related Mod1 (turquoise) was one of the top genes in sector 2 for the Maximum Adjacency Ratio (MAR) network measure (Fig. 6a). A small increase in MAR was seen in Bx2 samples that could differentiate responders (SD/PR) from non-responders (PD) (Fig. 6b). Independent of expression, KRT23 MAR was able to achieve separation of responders (SD/PR) from non-responders (PD) patients in both AMTEC and the validation cohort (Fig. 6c), warranting further examination of patient specific temporal network measures in other studies. Note that no genes had significantly different expression or network features (FDR < 0.05; Supplementary Data S4a,b) indicating the potential utility of this approach for identifying patient-specific temporal changes as they would not have been identified through traditional approaches.

**Fig. 6: Temporal differences between KRT23 subnetwork-based MAR values predict response.**

Discussion

Based on initial analysis of the AMTEC cohort, which indicated that one of the most informative predictors of response in TNBC was the expression-based Burstein molecular subtypes, we further expanded on these data through network-based analyses. In the CLIA environment, when the calls were definitively BLIA or BLIS, they were highly predictive. However, there were a number of samples that did not have a strong signal that identified a sample as either BLIA or BLIS and were thus considered “Indeterminant (IND)” and assigned a “No Call” label. Our network-based analysis was used to identify approaches to improve the clinical utility of the Burstein molecular subtypes.

Three out of the five co-expression modules learned from Basal-classified TCGA BRCA patient reference cohort were significantly enriched for a MSigDB Hallmarks. Based on this annotation as well as orthogonal multi-modal support of these modules (RPPA and mIHC), we attributed Mod1 (Turquoise) to EMT/cell adhesion, Mod2 (Blue) represented immunomodulatory and Mod3 (Brown) reflected plasma/B-cell processes.

Immune repertoire profiling of the AMTEC cohort RNASeq data further reinforced the idea that the eigengene from Mod 3 (Brown) tracks the pattern of relative expression of B-cell-related genes, especially immunoglobulin genes. Harris et al.²⁹ found that tumor-infiltrating B lymphocyte-enriched tumors showed preferential clonal expansion of IgG isotypes and were associated with improved clinical outcome²⁹. Similarly, Shephard et al. 2022 saw reductions of IgG diversity, defined as low Evenness, in TNBC patients achieving a clinical response. They concluded that this potentially indicated clonal expansion and Ig class switching associated with a directed immune response²⁰. However, we saw no significant difference in the Evenness measure between patients with and without a clinical response. This is likely due to several factors. For instance, our cohort size was small and might not be able to detect a subtle effect. Our study focused on metastatic as opposed to primary TNBC. Additionally, patients who did not achieve a response tended to have low abundance of the immunoglobulin chains. This, in turn, would lead to poor sampling of available clonotypes, potentially contributing to observed low diversity in the un-normalized measures such as Shannon’s entropy, which disappears after normalizing to form the evenness measure. This can only be corrected to a certain point using down-sampling, since down-sampling results in further information loss. Finally, there were differences in protocols, such as the use of a different alignment and post-processing pipeline. Importantly, we discarded samples that had too little data to be informative. Considering samples with low representation as having high evenness had the potential to artificially create differences in evenness, especially in our clinical trial dataset.

The Mod2 (blue) module demonstrated superior ability in distinguishing between BLIA and BLIS samples within the TCGA dataset, outperforming both Mod3 (brown) and Mod1 (turquoise). Despite its effectiveness in TCGA, predictive models derived from Mod2’s eigengene showed less temporal consistency compared to Mod3 in AMTEC. Mod2 demonstrated predictive capability only in the second biopsy (Bx2), while Mod3 had high predictive performance in both biopsy timepoints. Mod1 was not considered further as it exhibited low predictive value across both biopsy timepoints.

Predictive models formed from the plasma/B-cell related Mod3 (Brown) co-expression module mTNBC3e, as well as more clinically accessible mTNBC3s versions, achieved excellent classification accuracy with separating responders (high values) from non-responders (low values) in AMTEC and our holdout Validation cohort. When applied to the CALGB 40603 clinical cohort and METABRIC, the mTNBC3s classifiers were significantly associated with pathological complete response and event free survival in CALGB and identified a subset of 20 patient samples with poor response in METABRIC, highlighting that this signature had prognostic activity and was not directly related to response to a specific therapy. However, it remains possible that there could be an additional component of predictive value for specific therapies. A single-cell RNASeq experiment of tumors from TNBC patients treated with paclitaxel or paclitaxel in combination with atezolizumab found that B-cells were the most predictive immune cell type for patients achieving a response²⁴. However, they found that follicular B-cells (Bfoc) were the most important B-cell subset for their cohort. Interestingly, of the four B-cell subsets evaluated in the AMTEC samples, plasma B-cells (pB), not Bfoc, were the strongest predictor of response. We were able to independently derive biologically similar small gene-sets from bulk RNA sequencing using co-expression-based network methodology, highlighting the potential value of generating and re-analyzing existing large datasets with network-based approaches.

The plasma and B-cell related mTNBC3s signature was derived from an adjuvant study (TCGA) and independently validated in neoadjuvant (CALGB) studies as well as in our AMTEC metastatic cohort. In addition, it was independently validated in METABRIC. This suggests a potentially generalized utility related to prognosis and the potential that the gene signature may identify metastatic potential as well as aggressiveness of metastatic tumors that determine the overall outcomes in TNBC.

Despite the AMTEC cohort comprising patients with metastatic breast cancer—a population historically associated with poor survival outcomes—the development of prognostic or predictive biomarkers retains critical clinical relevance. Such markers could enhance therapeutic decision-making by identifying subgroups likely to derive sustained benefit from stratified treatment approaches. As noted in a recent evaluation of biomarkers for immune checkpoint inhibitors (ICIs) in advanced melanoma, robust prognostic risk stratification can guide more precise utilization of ICIs to reduce over-treatment³⁰. Furthermore, prognostic biomarkers may serve dual purposes within the broader framework of Awareness of Disease Status (ADS) facilitating earlier transitions to palliative care when appropriate, allowing patients and providers to align care plans with clinical trajectories and personal priorities³¹.

In addition to serving as a stand-alone predictor, the plasma and B-cell related mTNBC3s signature also augmented the Burstein BLIA-BLIS CLIA calls, resolving indeterminate (No call) samples. This immediately extends the utility and applicability of this approach.

It is important to note that there are limitations to this study. The first is the small sample size of the primary clinical cohort, a known challenge in precision oncology trials focused on in-depth longitudinal characterization. This is ameliorated to some degree by the use of large public datasets and orthogonal data to help validate our findings. Another potential limitation was the use of non-metastatic patients from TCGA to build the initial network. While this could impact generalizability to the metastatic patient population, we instead found that the TCGA-based eigengene and geneset signatures are predictive in our metastatic cohort. This highlights the potential preservation of prognostic gene expression profiles between primary and metastatic disease, which has been previously observed³². This preservation is not entirely unexpected, as long-term patient outcomes often depend on the development and aggressiveness of metastases, given that primary disease is typically well-controlled with current therapeutic approaches.

In addition to the evaluation of our WGCNA co-expression module eigengene-based predictors, we also formed patient-specific subnetworks based on the module genes. We showed that features derived from these patient-specific subnetworks could potentially be used as prognostic or predictive biomarkers, both in conjunction with gene-expression or without. However, given sample size constraints, the clinical utility of these signatures and this approach in general is yet to be determined. Our network-based approach provided many benefits over traditional differential expression. For example, neither the standard paired T-test between biopsies, the test of paired differences, nor Bx1-only samples between patient response groups provided a significant result after FDR adjustment. Only the comparison of Bx2 samples between response groups provided significant differential expression. These 77 DE genes included both MZB1 and IGKC but not CD79A, which make up mTNCBC3s_top3. However, IGKC was ranked 51/77, and MZB1 was ranked 58/77 based on fold change. These genes would not have been associated together using standard differential expression analysis, highlighting again the strength of network-based predictors. In addition, we highlighted the informativeness of patient-specific network perturbations, which also would be missed (e.g., IFI27 and KRT23 were non-significant in the traditional DE comparisons as well). By leveraging the highly correlated subnetworks from WGCNA, we were more easily able to identify the most prominent biological themes. We were then able to use multiple orthogonal approaches to identify both expression and co-expression subnetwork patterns potentially associated with patient prognosis and response.

Methods

Reference and clinical cohort

To allow assessment of the degree of therapeutic changes in on-therapy clinical trial samples, we utilized 152 Basal-classified Breast Cancer samples from The Cancer Genome Atlas (TCGA) Project⁷ as our “reference cohort”. For TCGA, Institutional review boards at each tissue source site reviewed protocols and consent documentation and approved submission of cases to TCGA⁷. For our clinical cohort, all patients gave informed consent to participate in this study, which had the approval and guidance of the Institutional Review Board at Oregon Health and Science University (OHSU IRB #18504). All human subjects research was performed in accordance with the Declaration of Helsinki.

For the clinical trial cohort comparator, 13 patients with metastatic triple negative breast cancer (mTNBC) from the phase II clinical trial (NCT03801369; Adaptive multi-drug treatment of evolving cancers (AMTEC))^6,9,10 each having paired pre-treatment (Bx1) and on-treatment (Bx2) samples, were used (denoted as the “AMTEC cohort”). Additionally, we held out a second group of 10 patient samples (5 Bx1 and 5 Bx2, 3 of which were paired), denoted as the “Validation cohort”. For this study, our main outcome was based on the best response achieved by a given patient as part of the trial. These outcomes were defined as either progressive disease (PD), stable disease (SD), or partial response (PR). For classification, we further binned patients into those who achieved a “best response” (SD/PR) vs those who did not (PD). For external independent validation, we used RNA-Seq from 277 Basal classified patients from CALGB 40603 clinical trial²⁰ (re-processed as below), as well as microarray expression data for 199 Basal classified patients from METABRIC²⁵.

RNA sequencing data processing

Kallisto (an RNA-seq quantification algorithm³³) processed abundance values (Transcripts Per Million, known as TPM) were retrieved for TCGA³⁴ and limited to the Basal subtype samples. Fourteen samples were removed due to having relatively low expression, leaving 152 samples.

For the AMTEC samples, preparation of RNA and transcriptome sequencing was performed at the Knight Diagnostics Laboratories. Total nucleic acid was extracted from macro-dissected, tumor-rich areas from FFPE sections, purified, and used for next generation sequencing (NGS). Libraries were prepared using the TruSeq RNA Access library preparation kit and sequenced on the Illumina NextSeq500. Approximately 100 million reads were generated per sample. For both the AMTEC and CALGB cohort samples, gene expression was quantified relative to Gencode v24³⁵ transcripts using Kallisto (v0.43)³³. The AMTEC patient cohort was limited to 13 Bx1 and Bx2 pairs excluding CLIA-classified LAR samples.

Weighted Gene Coexpression Network Analysis (WGCNA)

Coexpression network modules for the TCGA data were formed using WGCNA (v1.71)³⁶ using the top 2000 most variable genes after log2 transformation. A range of parameters were assessed for WGCNA using stability assessment of module assignments using 50 iterations of 63.2% subsampling and assessment of module quality³⁷. Topological significance was based on the Z-scores of the density-based measures relative to 100 random gene sets. We required a median Z-score of 2 or greater. The final WGCNA parameters were a signed hybrid network with power of 5 using bicor correlation, deepSplit =2, detectCutHeight = 0.995, minimum module size of 30, and pamStage=TRUE. For comparison with AMTEC and the validation cohort, the abundance values were batch corrected using ComBat from the SVA package (v3.44.0)³⁸ using TCGA as the reference batch. Principal component scores were computed for the TCGA cohort after centering and scaling. AMTEC cohort PC scores values were ‘predicted’ using the means, standard deviations, and eigenvectors from the TCGA cohort. Gene set enrichment for the MSigDB Hallmarks was performed using clusterProfiler (v4.4.4)³⁹. Benjamini-Yekutieli⁴⁰ false discovery rate (FDR) adjustment was used. Single-sample GSEA was limited to the WGCNA gene universe using GSVA (v1.44.5). The ssGSEA normalization was not performed.

Immune cell type scoring

The immune cell type analysis followed a prior approach⁴¹ for calculating immune cell scores. Briefly, Gene Set Variation Analysis (GSVA)²² (v1.44.5) was performed on the log2 TPM values from the AMTEC cohort relative to 16 immune cell gene sets¹⁹ using a Gaussian kernel cumulative density function (kcdf = ”Gaussian”). The GSVA enrichment statistic was calculated as the magnitude difference between the highest and lowest random walk deviations (mx.diff=TRUE).

Immune Repetroire profiling

We used TRUST4 (v1.0.8)⁴² to assemble BCR and TCR repertoires from AMTEC bulk RNASeq data. As part of the TRUST4 post-processing, we defined BCR clonotypes by clustering CDR3 sequences after matching on length, and assigned V and J genes with a cutoff of 0.8. TCR clonotypes were based on the CDR3 sequence only. To define the IGH and IGHG classes, we used the distinct clonotypes defined for each of the isotypes. For each class, we computed several diversity measures metrics to assess the variety and distribution of different types or entities within the group) including Shannons’ Entropy, Evenness (normalized Shannon’s Entropy, which is a measure of how evenly distributed the entities are), D50, Gini-Shannon as well as the Gini Coefficient. Diversity measures were examined both in the original values as well as after downsampling reads to 8500 for Bx1 and 400 for Bx2, levels that ensured at least three PD patients would remain after excluding samples lower than the corresponding thresholds. Downsampling was repeated five times, with the median used for comparison between groups. Missing values were removed prior to testing.

Patient-specific networks

We used the Linear Interpolation to Obtain Network Estimates for Single Samples (LIONESS) method (v1.10.0)⁴³ to generate patient-specific sub-networks for each AMTEC patient sample and gene coexpression module. As LIONESS interpolates networks based on gene correlation values for a single cohort, we combined each AMTEC patient sample with the TCGA cohort in turn to compute the sample’s network. We converted each subnetwork to a standard adjacency network by removing negatively weighted edges and scaling the remainder to be between 0 and 1. The WGCNA function ‘fundamentalNetworkConcepts‘ was used to compute the overall and gene-specific network features. Differential network feature analysis was performed using limma (v3.52.4)⁴⁴ after log2 transformation.

Classification

Either logistic regression or the Conditional Inference Tree (ctree)⁴⁵ methodology was used for the best response classification as indicated in the text. We used the ctree implementation from the ‘partykit‘ R package (1.2-16). Single variable models required ‘minbucket‘ and ‘minsplit‘ to be 3, limiting depth to 1. The Zhang et al.²⁴ genesets were scored by averaging the expression values across the genes, with the high vs low categorization performed using the median of those scores as described in their manuscript²⁴. We computed accuracy and the F1 measure using the caret (6.0-94)⁴⁶ R package and Mathews correlation coefficient (MCC) using yardstick (1.2.0)⁴⁷.

Burstein (BLIA/BLIS/LAR) subtype calling

The Burstein subtypes were computed for TCGA using the refined list of 77 genes⁴⁸. Spearman’s correlation was computed between the gene expression values and each centroid. Patient samples were assigned to the centroid with the highest correlation. Calls were considered indeterminate if the difference between the top two centroids was less than 0.1.

Differential expression analysis

For the AMTEC cohort, counts were formed from scaled TPM abundance using the tximport (v1.24.0) package. The limma-trend pipeline (limma v3.52.4 and edgeR v3.38.4) was used for model fitting⁴⁹, including TMM normalization⁵⁰.

Reverse phase protein array

The TCGA Processed RPPA data was downloaded from The Cancer Proteome Atlas (TCPA). Data had been already been standardized and normalized by TCPA as described previously⁵¹.

Multiplex immunohistochemistry

For the AMTEC cohort, multiplex immunohistochemistry (mIHC) was performed as previously described⁵². Cell phenotypes were assigned with hierarchically gating and quantified to cell densities (cells/mm²). Cell phenotype densities were log10 transformed.

Statistics and reproducibility

All analyses were carried out using R v4.3.1. Visualizations were generated using ggplot2 v3.5.0⁵³ or ComplexHeatmap v2.16.0⁵⁴. All P-values are reported unadjusted unless otherwise specified.

Data availability

All data is available through the HTAN Data Portal as part of the HTAN OHSU Atlas (https://data.humantumoratlas.org/). Mapping to HTAN patient identifiers is provided in Supplementary Data S5. Raw sequencing data have been deposited in dbGAP (Project phs002371.v1.p1). RPPA data was from TCPA⁵⁵. METABRIC data was downloaded from CBioPortal⁵⁶. CALGB RNASeq data were retrieved from SRA through dbGaP (phs001863.v1.p1). The mIHC data is provided in Supplementary Data S6.

Code availability

The code (and corresponding parameters) to reproduce manuscript results is freely available in our GitHub repository under a GPL-3.0 license at: https://github.com/biodev/amtec_manuscript.

References

Elsawaf, Z. et al. Biological subtypes of triple-negative breast cancer are associated with distinct morphological changes and clinical behaviour. Breast 22, 986–992 (2013).
Article PubMed Google Scholar
Lehmann, B. D. et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. Investig. 121, 2750–2767 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lehmann, B. D. et al. Refinement of triple-negative breast cancer molecular subtypes: implications for neoadjuvant chemotherapy selection. PloS One 11, e0157368 (2016).
Article PubMed PubMed Central Google Scholar
Burstein, M. D. et al. Comprehensive genomic analysis identifies novel subtypes and targets of triple-negative breast canceridentification of four unique subtypes of TNBCs. Clin. Cancer Res. 21, 1688–1698 (2015).
Article CAS PubMed Google Scholar
Jézéquel, P. et al. Identification of three subtypes of triple-negative breast cancer with potential therapeutic implications. Breast Cancer Res. 21, 1–14 (2019).
Article Google Scholar
Mitri, Z. I. et al. Abstract 2149: Biomarker-driven selection of polyADP ribose polymerase inhibitors (PARPi)-based combination therapies in patients with metastatic triple negative breast cancer (mTNBC). Cancer Res 82, 2149–2149 (2022).
Article Google Scholar
The Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Zhang, B. & Horvath, S. A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 4, 17 (2005).
Article Google Scholar
Hobbs, E. et al. Abstract OT3-05-01: Adaptive Multi-Drug Treatment of Evolving Cancers (AMTEC): A Phase II, open-label, study of Olaparib in combination with either Durvalumab, Selumetinib or Capivasertib, or Ceralasertib monotherapy in patients with metastatic TNBC. Cancer Res 83, OT3-05-01-OT03-05-01 (2023).
Article Google Scholar
Mitri, Z. I. et al. Abstract CT203: Multi-omic analysis of serial biopsies to inform biomarkers of sensitivity to olaparib and durvalumab in patients with metastatic BRCA-wildtype triple negative breast cancer (mTNBC). Cancer Res. 84, CT203–CT203 (2024).
Article Google Scholar
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 1, 417–425 (2015).
Article CAS PubMed PubMed Central Google Scholar
Horvath, S. & Dong, J. Geometric interpretation of gene coexpression network analysis. PLOS Comput. Biol. 4, e1000117 (2008).
Article PubMed PubMed Central Google Scholar
Li, H. et al. High expression of vinculin predicts poor prognosis and distant metastasis and associates with influencing tumor-associated NK cell infiltration and epithelial-mesenchymal transition in gastric cancer. Aging 13, 5197–5225 (2021).
Article CAS PubMed PubMed Central Google Scholar
Li, T. et al. Loss of vinculin and membrane-bound β-catenin promotes metastasis and predicts poor prognosis in colorectal cancer. Mol. Cancer 13, 263 (2014).
Article PubMed PubMed Central Google Scholar
Yin, J., Guo, Y. & Li, Z. Platelet-derived growth factor-B signalling might promote epithelial-mesenchymal transition in gastric carcinoma cells through activation of the MAPK/ERK pathway. Contemp. Oncol. 25, 1–6 (2021).
CAS Google Scholar
Shenoy, A. K. et al. Epithelial-to-mesenchymal transition confers pericyte properties on cancer cells. J. Clin. Investig. 126, 4174–4186 (2016).
Article PubMed PubMed Central Google Scholar
Nambiar, R., McConnell, R. E. & Tyska, M. J. Control of cell membrane tension by myosin-I. Proc. Natl. Acad. Sci. 106, 11972–11977 (2009).
Article CAS PubMed PubMed Central Google Scholar
Golubovskaya, V. M. et al. A small molecule focal adhesion kinase (FAK) inhibitor, targeting Y397 site: 1-(2-hydroxyethyl)-3, 5, 7-triaza-1-azoniatricyclo [3.3.1.1(3,7)]decane; bromide effectively inhibits FAK autophosphorylation activity and decreases cancer cell viability, clonogenicity and tumor growth in vivo. Carcinogenesis 33, 1004–1013 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tamborero, D. et al. A Pan-cancer landscape of interactions between solid tumors and infiltrating immune cell populations. Clin. Cancer Res. 24, 3717–3728 (2018).
Article CAS PubMed Google Scholar
Shepherd, J. H. et al. CALGB 40603 (Alliance): Long-term outcomes and genomic correlates of response and survival after neoadjuvant chemotherapy with or without carboplatin and bevacizumab in triple-negative breast cancer. J. Clin. Oncol. 40, 1323–1334 (2022).
Article CAS PubMed PubMed Central Google Scholar
Barbie, D. A. et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature 462, 108–112 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 14, 1–15 (2013).
Article Google Scholar
Fuller, T. F. et al. Weighted gene coexpression network analysis strategies applied to mouse weight. Mamm. Genome 18, 463–472 (2007).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. Single-cell analyses reveal key immune cell subsets associated with response to PD-L1 blockade in triple-negative breast cancer. Cancer Cell 39, 1578–1593.e1578 (2021).
Article CAS PubMed Google Scholar
Curtis, C. et al. The genomic and transcriptomic architecture of 2000 breast tumours reveals novel subgroups. Nature 486, 346–352 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kuijjer, M. L., Tung, M. G., Yuan, G., Quackenbush, J. & Glass, K. Estimating sample-specific regulatory networks. Iscience 14, 226–240 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lopes-Ramos, C. M. et al. Regulatory network of PD1 signaling is associated with prognosis in Glioblastoma Multiforme. Cancer Res. 81, 5401–5412 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dong, J. & Horvath, S. Understanding network concepts in modules. BMC Syst. Biol. 1, 1–20 (2007).
Article Google Scholar
Harris, R. J. et al. Tumor-infiltrating B lymphocyte profiling identifies IgG-biased, clonally expanded prognostic phenotypes in triple-negative breast cancer. Cancer Res. 81, 4290–4304 (2021).
Article CAS PubMed PubMed Central Google Scholar
Grad, R. N. et al. Prognostic risk stratification and end-of-life care outcomes in patients with metastatic melanoma treated with immune checkpoint inhibitors. Oncologist 28, 911–916 (2023).
Article PubMed PubMed Central Google Scholar
Finlayson, C. S. et al. Awareness of disease status among patients with cancer: an integrative review. Cancer Nurs. 47, 189–197 (2024).
Article Google Scholar
Weigelt, B. et al. Molecular portraits and 70-gene prognosis signature are preserved throughout the metastatic process of breast cancer. Cancer Res. 65, 9155–9158 (2005).
Article CAS PubMed Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Article CAS PubMed Google Scholar
Tatlow, P. & Piccolo, S. R. A cloud-based workflow to quantify transcript-expression levels in public cancer compendia. Sci. Rep. 6, 1–11 (2016).
Article Google Scholar
Frankish, A. et al. GENCODE 2021. Nucleic Acids Res. 49, D916–D923 (2021).
Article CAS PubMed Google Scholar
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinforma. 9, 559 (2008).
Article Google Scholar
Langfelder, P., Luo, R., Oldham, M. C. & Horvath, S. Is my network module preserved and reproducible?. PLOS Comput. Biol. 7, e1001057 (2011).
Article CAS PubMed PubMed Central Google Scholar
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: J. Integr. Biol. 16, 284–287 (2012).
Article CAS Google Scholar
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001).
Bareche, Y. et al. Unraveling triple-negative breast cancer tumor microenvironment heterogeneity: towards an optimized treatment approach. J. Natl. Cancer Inst. 112, 708–719 (2020).
Article PubMed Google Scholar
Song, L. et al. TRUST4: immune repertoire reconstruction from bulk and single-cell RNA-seq data. Nat. Methods 18, 627–630 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kuijjer, M. L., Hsieh, P.-H., Quackenbush, J. & Glass, K. lionessR: single sample network inference in R. BMC Cancer 19, 1–6 (2019).
Article Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47 (2015).
Article PubMed PubMed Central Google Scholar
Hothorn, T., Hornik, K. & Zeileis, A. Unbiased recursive partitioning: A conditional inference framework. J. Comput. Graph. Stat. 15, 651–674 (2006).
Article Google Scholar
Kuhn, M. Building predictive models in R using the caret Package. J. Stat. Softw. 28, 1–26 (2008).
Article Google Scholar
Kuhn, M., Vaughan, D. & Hvitfeldt, E. yardstick: Tidy Characterizations of Model Performance. https://github.com/tidymodels/yardstick (2025).
Ding, Y. C. et al. Molecular subtypes of triple-negative breast cancer in women of different race and ethnicity. Oncotarget 10, 198 (2019).
Article PubMed PubMed Central Google Scholar
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Article PubMed PubMed Central Google Scholar
Robinson, M. D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 11, 1–9 (2010).
Article Google Scholar
Chen, M. M. et al. TCPA v3.0: An integrative platform to explore the pan-cancer analysis of functional proteomic data. Mol. Cell Proteom. 18, S15–s25 (2019).
Article CAS Google Scholar
Banik, G. et al. in Methods in enzymology Vol. 635 1-20 (Elsevier, 2020).
Wilkinson, L. (Oxford University Press, 2011).
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).
Article CAS PubMed Google Scholar
Li, J. et al. TCPA: a resource for cancer functional proteomics data. Nat. Methods 10, 1046–1047 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 6, pl1–pl1 (2013).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This project was carried out with major support from the OHSU SMMART Program, National Institutes of Health (NIH), National Cancer Institute (NCI) Human Tumor Atlas Network (HTAN) Research Center (U2CCA233280), Prospect Creek Foundation, Breast Cancer Research Foundation (BCRF-24-110) and AstraZeneca Pharmaceuticals LP. The program was initiated with support from a Stand Up to Cancer-American Association for Cancer Research Dream Team Translational Cancer Research Grant, SU2C-AACR-DT0409. Additional support came from the OHSU Brenden-Colson Center for Pancreatic Care, the W.M. Keck Foundation, NIH/NCI Cancer Target Discovery and Development (CTD²) (U01CA217842), the NIH/NCI Cancer Systems Biology Consortium Center (U54CA209988), and the M.J. Murdock Charitable Trust. Sequencing and multiscale microscopy were supported by a Knight Cancer Institute Cancer Center Support grant (5P30CA69533). Short-read sequencing assays were performed by the OHSU Massively Parallel Sequencing Shared Resource. RPPA was performed at MD Anderson. We are grateful to CALGB for allowing us access to their data for the purpose of validating our signatures. Finally, we want to acknowledge the incredible team effort over the years by the SMMART Program including the former Management team (Joe Gray, Annette Kolodzie), the Clinical Operations team (Kiara Siex, Julian Amaya, Alisa Pairmore, Nat Tilden, Marlana Klinger, Swapnil Parmar, Maddy Barrett, Annie Yang, Taylor Kelley), Translational Research team (Jayne Stommel, Jamie Keck, Brett Johnson, Ben Kong), Data team (Lauren Murray, Ana Olson, Dayana Rojas-Rodriguez, Jordan Teicher, Zainab Anbari, Souraya Mitri, Becky Goodford, Nick Van Marter-Sanders, Michele Czajkowski), Collaborations (Allison Solanki, Rochelle Williams-Belizaire) and Knight Data Management System (Matt Viehdorfer, Andrew Silvernail, Patrick Leyshock, Georgia Mayfield, Imogen Bentley) and Knight Data Operations (Ocean Murff, David McCoy, Leah Schwartz, Vijayalakshmi Subbiah). We thank Shamilene Sivagnanam and Lisa Coussens for generating the mIHC data, and their insightful input on the integration of that data. We also recognize the outstanding efforts of the Knight Diagnostic Laboratory (Chris Corless, Chris Suciu, Carol Beading, Jinho Lee, John Letaw) and the OHSU Knight Breast Cancer Disease team.

Author information

Authors and Affiliations

Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
Daniel Bottomly, Christina Zheng, Allison L. Creason, Gordon B. Mills & Shannon K. McWeeney
Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA
Allison L. Creason
Division of Medical Oncology, University of British Columbia, Vancouver, BC, Canada
Zahi I. Mitri
British Columbia Cancer Agency, Vancouver, BC, Canada
Zahi I. Mitri
Department of Cell, Developmental & Cancer Biology, Oregon Health & Science University, Portland, OR, USA
Gordon B. Mills
Division of Oncological Sciences, Oregon Health & Science University, Portland, OR, USA
Shannon K. McWeeney

Authors

Daniel Bottomly
View author publications
Search author on:PubMed Google Scholar
Christina Zheng
View author publications
Search author on:PubMed Google Scholar
Allison L. Creason
View author publications
Search author on:PubMed Google Scholar
Zahi I. Mitri
View author publications
Search author on:PubMed Google Scholar
Gordon B. Mills
View author publications
Search author on:PubMed Google Scholar
Shannon K. McWeeney
View author publications
Search author on:PubMed Google Scholar

Contributions

D.B. co-led the design, performed analysis, modeling, and writing, and performed data management and processing of the CALGB patient samples. C.Z. oversaw clinical data management, de-identification and assisted with QA/QC; A.L.C. performed the Kallisto processing of the AMTEC and Validation samples and the coordination with HTAN for data dissemination; Z.I.M. provided key input into model interpretation and assisted with writing; G.B.M. led the orthogonal validation, provided critical input into the modeling, interpretation, validation, and testing of network signatures; S.K.M. provided oversight for analysis and modeling, co-led design and writing.

Corresponding author

Correspondence to Shannon K. McWeeney.

Ethics declarations

Competing interests

The Authors declare no Competing Non-Financial Interests but the following Competing Financial Interests: G.B.M. has licensed technologies to Myriad Genetics and NanoString; is on the SAB or is a consultant to Amphista, Astex, AstraZeneca, BlueDot, Ellipses Pharmaceuticals, ImmunoMET, Leapfrog Bio, Bruker/Nanostring, Neophore, Nerviano, Nuvectis, Pangea, PDX Pharmaceuticals, Qureator, RyboDyne, Signalchem Lifesciences, Turbine and Zentalis Pharmaceuticals; and has stock/options/financial interests in Bluedot, ImmunoMet, Nuvectis, RyboDyne, SignalChem Lifesciences, and Turbine; Sponsored research: AstraZeneca, Zentalis and Nanostring.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Figures S1-5 and Supplemental Data legends

Supplemental Data S1

Supplemental Data S2

Supplemental Data S3

Supplemental Data S4

Supplemental Data S5

Supplemental Data S6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bottomly, D., Zheng, C., Creason, A.L. et al. Utilizing cohort-level and individual networks to predict best response in patients with metastatic triple negative breast cancer. npj Precis. Onc. 9, 179 (2025). https://doi.org/10.1038/s41698-025-00959-w

Download citation

Received: 23 July 2024
Accepted: 16 May 2025
Published: 13 June 2025
Version of record: 13 June 2025
DOI: https://doi.org/10.1038/s41698-025-00959-w