Introduction

Triple negative breast cancer (TNBC) is a highly aggressive subtype characterized by poor prognosis, early recurrence, and high mortality rates. While first-line (1 L) combination therapies for metastatic TNBC (mTNBC) have shown promise, effective second-line and later (2 L + ) treatment options remain limited, with a median overall survival (OS) of only 8 to 15 months for these patients, highlighting a significant unmet need for new therapeutic strategies to improve outcomes in later-line settings1. Eribulin mesylate, a microtubule inhibitor, has also shown promise in mTNBC treatment by improving OS in patients who have received multiple prior chemotherapy lines. The EMBRACE trial was a phase 3 trial in patients with heavily pretreated metastatic breast cancer, including mTNBC, which showed significantly increased overall survival (OS) at 13.1 months versus 10.6 months in patients treated with eribulin versus physician’s choice2, leading to the inclusion of eribulin into standard treatment guidelines for mTNBC. Subgroup analyses indicated that patients with HER2-negative and triple-negative disease experienced increased survival when treated with eribulin versus capecitabine, with an increase in median OS from 9.4 months to 14.4 months3.

Pembrolizumab combined with chemotherapy is the current standard of care in the first line treatment of mTNBC tumors expressing PD-L1 with CPS ≥ 10. The KEYNOTE-086 study, a phase II trial which evaluated pembrolizumab as a monotherapy in mTNBC patients reported an overall response rate (ORR) of 5% in pre-treated Cohort A, indicating modest efficacy in this subset of patients4. KEYNOTE-355, a phase 3 trial which randomized patients to either pembrolizumab in combination with chemotherapy or chemotherapy alone, showed that the addition of pembrolizumab improved median overall survival from 16.1 months to 23.0 months in patients with a PD-L1 combined positive score of at least 105. In the neoadjuvant setting, KEYNOTE-522 was a phase 3 trial which demonstrated that the addition of pembrolizumab to neoadjuvant chemotherapy significantly increased the pathologic complete response (pCR) from 51.2% to 64.8% in patients with early TNBC6. Additionally, there are ongoing studies evaluating antibody drug conjugates in combination with pembrolizumab, such as the ASCENT-04 study evaluating Trop2 targeted sacituzumab govitecan7,8. Given the individual efficacies of eribulin and pembrolizumab, the ENHANCE-1 trial was conducted as an open-label, single-arm, phase 1b/2 study which aimed to evaluate the safety and efficacy of eribulin plus pembrolizumab as 1–3 L treatment in patients with mTNBC. As ENHANCE-1 was conducted prior to treatment with pembrolizumab as standard of care in TNBC, none of the patients had received prior immunotherapy. Patients with PD-L1 positive tumors exhibited increased ORR (28.4% versus 17.3%)9.

Pembrolizumab is thought to reshape the tumor-immune microenvironment into an immune activated state, enhancing cytotoxic T cell function10, reducing immunosuppressive mechanisms11 and improving tumor antigen presentation12. Eribulin can complement this effect by modulating the tumor-immune microenvironment through vascular remodeling and cytokine profile modulation13, which can alter tumor-associated immune cell infiltration14. We explored potential biomarkers associated with response to pembrolizumab plus eribulin.

In 142 patients enrolled in the ENHANCE-1 trial with available tissue, we perform an exploratory analysis to characterize the tumor-immune microenvironment via assessment of stromal tumor-infiltrating lymphocytes (sTILs) and multiplexed immunofluorescence to search for biomarkers associated with response. The assessment of TILs in breast cancer has gained attention as a potential prognostic and predictive biomarker in breast cancer, with the presence of sTILs being associated with improved clinical outcomes in response to cytotoxic treatments and immunotherapies15. The International TILs Working Group has established guidelines for evaluating TILs in breast cancer using hematoxylin and eosin (H&E) stained sections16. These guidelines outline methods to quantify sTILs as a percentage, which has been shown to correlate with patient prognosis. High levels of sTILs are associated with better overall survival and disease-free survival, particularly in patients with TNBC17,18. We assessed associations of response with sTILs on digitized H&E stained tissue sections with manual assessment by trained pathologists and a deep learning approach known as MuTILs19.

While pathologist assessment of sTILs has shown clinical utility, more granular characterizations of lymphocytes can provide insight for developing next generation biomarkers20. While T cells have been extensively investigated in their role of tumor regulation21,22, innate and adaptive immune cells in the tumor-immune microenvironment can act in concert to regulate tumor growth23,24. Cell types other than T cells can present opportunities for novel therapeutic targeting11. For example, myeloid cells constitute a significant portion of the breast cancer TME25, though the prognostic role of macrophages remains controversial26,27. We characterized granular populations of lymphocytes and myeloid cells using multiplexed immunofluorescence (mIF), which allows us to assess multiple proteins simultaneously while maintaining spatial context. We employed two panels of mIF markers tailored for the breast cancer microenvironment including T-cells, B-cells, NK cells, macrophages, tumor, MHC Class-II and epithelial-mesenchymal transition (EMT). These panels provide a more in-depth view of the immune landscape and tumor biology which contribute to differential therapy response.

Results

ENHANCE-1 trial

ENHANCE-1 was a single arm phase Ib/II trial conducted from 2015 to 2021 which evaluated the efficacy and safety of eribulin and pembrolizumab in 167 patients with mTNBC across 18 sites in the United States (NCT02513472) (Fig. 1A). Patients received 0-2 prior lines in the metastatic setting and were enrolled either into stratum 1 (0 prior lines of therapy—66 patients) or stratum 2 (1-2 prior lines of therapy—101 patients). Patients in stratum 1 could have received prior therapy, including hormonal therapy or chemotherapy in either the adjuvant or neoadjuvant setting, prior to being diagnosed with metastatic disease. Evaluable tissue was available for 142 patients for our study including 58 in stratum 1 and 84 in stratum 2 (clinical characteristics are shown in Fig. 1B, Supplementary Table 1). The samples analyzed were archival FFPE tissues obtained via either core biopsy or a resection which could be from excisional biopsy, lumpectomy or mastectomy. Objective response rate (ORR) was defined as the percentage of patients with either complete response (CR) or partial response (PR) by RECIST 1.1. The ORR was 25.8% in stratum 1 and 22.5% in stratum 2 for the 142 patients analyzed. A summary of the clinical trial findings has been previously published9. Best overall response (BOR) was assessed via RECIST v1.1 with responders (n = 34, n = 15 in stratum 1, n = 19 in stratum 2) defined as patients with complete or partial radiographic responses and non-responders (n = 95, n = 39 in stratum 1, n = 56 in stratum 2) defined as those with stable disease or progressive disease. We excluded patients for which only lymph node tissue or pleural fluid was submitted, resulting in 130/142 samples. While all 130 samples in our study were acquired prior to enrollment in the trial, ENHANCE-1 did not require fresh tissue biopsies immediately prior to trial enrollment. Tissue samples correspond to archival FFPE material acquired prior to enrollment in ENHANCE-1 from a metastatic or primary site. As the tumor-immune microenvironment can be modulated by intervening treatments, we evaluate candidate biomarkers across all patients and in a subset of patients who have not been treated between the time of tissue acquisition and trial enrollment. We refer to these tissues as the TME-naive samples, as they are most representative of the pre-treatment tumor microenvironment. These criteria resulted in 43/142 patient samples in the TME-naive group: 31 from stratum 1 and 12 from stratum 2. PD-L1 combined positive score (CPS), a commonly assessed biomarker for pembrolizumab, was assessed for each patient and was not significantly associated with response across all available samples and in the TME-naive group.

Fig. 1: Patient characteristics in the ENHANCE-1 trial.
figure 1

A Patient selection starting from total ENHANCE-1 enrollment and (B) heatmap showing clinical and pathological characteristics for n = 142 patients with histological sections.

H&E evaluation of sTILs

We evaluated sTILs across H&Es drawn from all available samples with evaluable disease (n = 118/130) and the TME-naive group (n = 41/43). sTILs were evaluated with two methods (see Methods—Tumor infiltrating lymphocytes analysis): pathologist (HG) assessment and with MuTILs, a deep learning model (Fig. 2). In samples where TILs could be evaluated, pathologist-assessed sTILs were not significantly different across response groups when considering all available samples (n = 115/118), but were significant in the TME-naive group (n = 40/41) (Fig. 2A). Restricting to primary site tissue (n = 80/115) did not reveal associations to response in all available samples or in the TME-naive group (n = 18/40) (Fig. 2B). As MuTILs was only trained on breast tissue, we computationally assessed TILs in breast tissue samples from the primary site (n = 73/118), which was correlated to pathologist assessment but not associated with response (Fig. 2C). MuTILs was not significantly correlated to pathologist assessment in the TME-naïve subset restricted to breast tissue samples from the primary site (n = 15/41) and was also not associated with response (Fig. 2D).

Fig. 2: sTILs were evaluated via manual assessment and with MuTILs.
figure 2

Pathologist-assessed sTILs stratified by response across all samples and the TME-naive subset in (A) primary and metastatic sites and (B) the primary site. MuTILs computationally assessed sTILs in breast samples compared to pathologist assessed sTILs and stratified by response (C) and the TME-naive subset in breast samples (D).

Multiplexed immunofluorescence immune populations

We evaluated lymphocyte and myeloid populations using multiplexed immunofluorescence using 2 panels (Supplementary Data 1). We identified all biologically possible combinations of markers and assessed corresponding intratumoral and stromal populations. We used Cox proportional hazards to model PFS using all samples and each immune population individually with the following covariates: number of treatments between tissue collection and enrollment in ENHANCE-1, stratum and site of tissue (primary versus metastatic). No populations were significantly associated with PFS. We then evaluated associations between immune populations and response (Fig. 3) across all samples and in the TME-naive group. As our study is exploratory and an arbitrary number of cellular phenotypes can be defined from marker combinations, we did not correct for multiple hypotheses. Across all available samples, there were no significant associations with either lymphocyte or myeloid populations and response. In the TME-naive group, we found several significant lymphocyte-related findings including: increased intratumoral CD4 + CD8+ in non-responders (p = 0.021) when only breast samples were considered (Fig. 3A) and increased stromal CD56 + , a marker associated with NK cells, in non-responders (p = 0.047) (Fig. 3B). We also identified significant findings among stromal myeloid cells (Fig. 3C), including increased CD68+ cells in responders (p = 0.009). CD68 + CD163-HLA-DR- and CD68 + CD163 + , which could represent M1 and M2 macrophage populations, were not significantly different across response groups (Supplementary Data 1). Sub-populations CD68 + CD163+Vimentin+ (p = 0.014) and CD68+Vimentin+ (p = 0.005) were increased in responders. Furthermore, we found increased stromal HLA-DR+ in responders (p = 0.039).

Fig. 3: Immune populations associated with response groups in TME-naive samples.
figure 3

Populations include: (A) intratumoral CD4 + CD8 + , (B) stromal CD56+ and (C) stromal CD68 + , CD68 + CD163+Vimentin + , CD68+Vimentin+ and HLA-DR + .

Discussion

In our study, only pathologist assessed sTILs in the TME-naive group associated with response. However, the TME-naive subset only consists of n = 43 samples and the association should be treated cautiously. While we found MuTILs to be correlated to pathologist assessed sTILs, the correlation has been shown to be imperfect19. The correlation could be affected by the subjective nature of TIL assessment and the heterogeneity of TIL spatial distributions which could affect accuracy. Additionally, the presence of necrotic debris and karyorrhectic debris, which can be visually similar to TILs, could be enhanced in our heavily pre-treated cohort. While studies have reported associations between sTILs in TNBC and efficacy of immune checkpoint inhibitors17,18,28, one study showed higher sTILs were associated with pCR overall cohort but did not predict the efficacy of PD-1/PD-L1 inhibitors29. The extent of disease (presence of nodal involvement or metastases) and treatment combinations may confound the association of sTILs with efficacy in immune checkpoint inhibitor treated patients with TNBC28.

Our study identified several more granular immune populations associated with response, including the enrichment of intratumoral CD4 + CD8+ and stromal CD56+ in non-responders. The CD4 + CD8+ population was visually assessed by a pathologist (DC) to confirm that these cells expressed both markers (Supplementary Fig. 1). CD4 + CD8+ double positive cells are a rarely studied population30 which has observed in breast cancer and associated pleural effusions31, melanoma32, Hodgkin lymphoma33 and urological cancers34 with conflicting roles and cytokine profiles that vary between cytotoxic and immunosuppressive capacities. For example, using flow cytometry, in patients with renal cell carcinoma, up to 50% of TILs were identified as CD4 + CD8+ with high PD-1 and TIM3 expression, suggesting a dysfunctional phenotype35. In melanoma patients, transcriptional analysis showed that CD4 + CD8+ TILs had consistent expression with CD8 + T-cells but also showed evidence of being poorly cytolytic, transiting towards features of CD4 + T-cells36. In our study, CD4 + CD8+ cells were enriched in non-responders, indicating a possible immunosuppressive effect which could be further elucidated with cytokine profiling. However, the presence of this population should be treated cautiously as limitations in spatial resolution can result in identifying CD4 + CD8+ cells which are tightly clustered CD4+ and CD8+ populations. In addition to CD4 + CD8 + , we found enrichment of stromal CD56 + , a marker of natural killer or NK-T cells, in non-responders, which has been previously associated with poor overall survival in TNBC as well as increased in comparison to non-TNBC37. In vivo animal studies have demonstrated that tumor-infiltrating NK cells exhibit impaired cytotoxic functions, dysfunctional states, and features consistent with exhaustion in the later stages of TNBC38. The dysfunction of these NK cells could be involved in breast cancer progression39. Our observed CD56+ enrichment may reflect a population with reduced functional capacity for cytotoxic activity against tumor cells.

We observed increased stromal HLA-DR expression in responders, which may reflect enhanced immune activation and T-cell recruitment, critical for anti-tumor immunity and response to therapy. One previous study using serial single RNA sequencing found that higher HLA-DR expression in macrophages was enriched in responders in mTNBC patients treated with nab-paclitaxel and pembrolizumab40. Other studies have suggested higher intraepithelial HLA-DR expression was associated with better responses to treatments combining PD-1/PD-L1 inhibitors with chemotherapy in patients with TNBC41,42.

Eribulin has been found to reverse EMT in TNBC cells in vivo and in vitro43. Eribulin has also been associated with EMT changes in patients with mTNBC, shifting tumors to a more epithelial phenotype44. Moreover, increased vascular remodeling and cell adherence induced by eribulin may increase infiltration of immune cells into tumors45. While we do not observe associations between response and tumor vimentin expression, we do observe enrichment of M2 macrophages expressing vimentin (CD68 + CD163+Vimentin + ) in responders and no difference in CD68 + CD163+Vimentin- across response groups. Future studies could evaluate the modulation of vimentin expression, which could be related to pro-inflammatory pathways or cell migration, on macrophages with pre and post paired samples46,47. Combining anti-PD-1 therapy, which primarily reactivates T cells but can influence macrophage polarization, may further enhance this reprogramming effect48,49. Our findings suggest that increased immune stimulatory cells may not necessarily associate with response. Instead, increased immunosuppressive cells, such as Vimentin expressing M2 macrophages, appear to be associated with response, which may be due to the overall suppressive tumor-immune microenvironment in patients with mTNBC and the functional state of immune cells in this context.

Our findings suggest associations between the pre-treatment tumor-immune microenvironment in mTNBC and response to eribulin plus pembrolizumab. However, our study has several limitations. Most importantly, given that the tumor-immune microenvironment is influenced by therapy, and our objective was to characterize the relationship between representative pre-treatment tissue and response to eribulin plus pembrolizumab, our analysis was constrained by the limited number of tissue samples available that had not undergone treatment between tissue collection and trial enrollment. As a result of the limited number of samples and large number of comparisons possible with multiplexed immunofluorescence analysis, our findings were not corrected for multiple hypotheses, raising the possibility of type I error. While not obtaining tissue prior to trial enrollment was a pragmatic decision, our ability to characterize pre-treatment tissue was limited. Another limitation originates from heterogeneity in the tumor-immune microenvironment from treatment prior to tissue collection, as most of the patients were not treatment-naive. The lack of tissue samples post-treatment limited our ability to assess treatment induced immune modulation. Lastly, as the trial was single-armed, there were no comparison arms to assess eribulin-induced changes in the tumor-immune microenvironment over pembrolizumab alone.

Despite these limitations, our results suggest a role for macrophages as a novel biomarker, which is not included in established estimates of sTILs, in patients with mTNBC treated with eribulin plus pembrolizumab.

Methods

Tumor infiltrating lymphocyte analysis

Manual pathologist assessment of stromal and intratumoral TILs was performed according to the International Immuno-Oncology Biomarker Working Group on Breast Cancer16. MuTILs, a deep learning approach, was used to simultaneously classify tissue regions and segment individual cell nuclei to assess sTILs19.

Multiplex immunofluorescence staining and analysis

Patient samples were stained using 2 panels of antibodies (Supplementary Fig. 2, Supplementary Table 2). Panel 1: CD56 (Cell Marque, Cat# 156R-95; 1:200), Granzyme B (Leica, Cat# NCL-L-GRANB; 1:40), PanCK (Biolegend, Cat# 914201; 1:160), CD4 (Abcam, Cat# ab133616; 1:500), CD8 (Leica, Cat# PA0183; 1:2), FoxP3 (Abcam, Cat# ab20034; 1:200); Panel 2: CD68 (Biogenex, Cat# AM416-5M; 1), Vimentin (Leica, Cat# PA0640; 1:2), CD163 (Leica, Cat# CD163-L-CE; 1:125), HLA-DR (Abcam, Cat# ab20181; 1:6000), PanCK, and CD20 (Leica, Cat# NCL-L-CD20-L26; 1:4) with the Opal 7-Color Automation IHC Kit (Akoya Biosciences) on a Leica Bond RX automated stainer (Leica Biosystems). FFPE tissue sections (4 μm) were baked for 2 h at 60 °C, followed by automatic deparaffinization, rehydration, and The Heat Induced Epitope Retrieval (HIER) in BOND Epitope Retrieval Solution 2, pH 9 (Leica Biosystems) for 30 min at 95 °C. Primary antibodies against each marker were diluted using Akoya 1× Antibody Diluent Block (Akoya, Cat# ARD1001EA) sequentially applied, with optimized dilutions and orders (Supplementary Table 2) for 30 min per marker at room temperature. Signal amplification was achieved using tyramide signal amplification (TSA) reagents performed in six cycles. In each cycle, the tissue was incubated sequentially with a primary antibody for 30 min at room temperature, the secondary antibody conjugated to polymeric horseradish peroxidase (HRP), an Opal fluorophore in TSA buffer, and BOND Epitope Retrieval Solution 1, pH 6 (Leica Biosystems) for 20 min at 95 °C to strip the tissue-bound primary–secondary antibody complexes before the next staining cycle. After nuclear counterstaining with DAPI, slides were coverslipped with Vectashield HardSet Antifade mounting medium (Vector Laboratories). Whole slide scans (WSS) were captured at 10× magnification using the Vectra Polaris automated multispectral microscope (Akoya/PerkinElmer) with Vectra Polaris 1.0.13 software. Regions of interest were chosen for multispectral imaging (MSI) at 20× magnification using the Phenochart 1.1.0 software (Akoya). Spectral unmixing and image exportation were performed using the InForm v2.5.1 software (Akoya).

Images were analyzed using HALO AI software (Indica Labs). Whole-slide scans (WSS) were first preprocessed for image alignment and quality assurance to ensure accurate segmentation of tissue regions and fluorescence channels. Tissue segmentation was performed by selecting regions of interest (ROIs) based on morphological features and fluorescence intensity patterns. Tissue classification and single-cell analysis were conducted using the integrated AI-driven tissue classifier (DenseNet V2) and cell phenotyping modules (HighPlex FL v4.0.4) to identify and quantify cells expressing specific markers, including CD56, Granzyme B, CD4, CD8, FoxP3, CD68, Vimentin, CD163, HLA-DR, and CD20. The pathologist’s annotations on H&E-stained slides served as a reference for confirming ROI selection and cell phenotype. Fluorescence intensities were adjusted and normalized across slides to reduce inter-sample variability. The data output included marker co-expression profiles, cell density maps, and spatial distributions of immune and tumor cell populations for further statistical analysis.

Statistical analysis

Mann-Whitney statistical tests were used to compare responders and non-responders in the analysis of sTILs and multiplex immunofluorescence derived populations. Correlation between computational and pathologist sTILs was performed with Spearman’s rank correlation test.