Introduction

Triple-negative breast cancer (TNBC) accounts for ~20% of all breast cancers and is characterized by the lack of expression of estrogen receptor (ER), progesterone receptor (PR), and epidermal growth factor receptor 2 expression (HER-2). NAST is the standard of care for early-stage TNBC and offers in vivo evaluation of the tumor’s chemosensitivity, as well as downstaging, which allows more conservative surgery. TNBC is a heterogenous disease with variable clinical outcomes and response to neoadjuvant systemic therapy (NAST). Pathologic complete response (pCR) to NAST at surgery is a clinically useful predictor for improved disease-free and overall survival1,2,3. The rate of pCR to NAST regimens in TNBC is variable ranging from 18% to 65% and differs by TNBC subtype4,5.

Breast MRI is emerging as the imaging modality of choice to monitor breast cancer response to treatment because of its reproducibility and high soft tissue contrast resolution6. Currently, tumor size on dynamic contrast-enhanced MRI (DCE-MRI) is the primary method to monitor treatment response. In the ACRIN 6657/I-SPY trial, functional tumor volume measured by DCE-MRI was more accurate than the longest tumor diameter or clinical exam at predicting pCR; area under the curve (AUC) of 0.75, 0.64, and 0.68, respectively7. Functional tumor volume is defined as the subset of enhancing tumor voxels that demonstrate enhancement above a certain preset percent enhancement threshold. A more recent study by Panthi et al. confirmed that functional tumor volume is superior to the longest tumor diameter in predicting pCR8. However, acquisition of functional tumor volumes requires dedicated software and postprocessing that can be time consuming and may not be readily available in clinical practice, especially in the community practice setting. Golshan et al. demonstrated the utility of MRI tumor response by investigating complete MRI response and the change in longest tumor diameter as predictors of pCR9. Building on these prior results, we hypothesized that early changes in tumor volume based on three-dimensional manual MRI measurements might be more reliable than the longest tumor diameter, and a more readily obtainable imaging biomarker than functional tumor volume changes at identifying responders versus nonresponders to NAST. Additionally, our study investigates the possibility of predicting pCR at an earlier timepoint during NAST. If successful, it can allow triage of responders to standard-of-care NAST or to de-escalation trials, and triage of nonresponders to novel clinical trials or targeted treatments. To our knowledge, this novel yet simple approach has not been demonstrated previously in the literature. As such, the purpose of this study was to determine if changes in tumor volume based on manual DCE-MRI measurements between baseline and after two and/or four cycles of NAST can be used as a clinically relevant, early noninvasive biomarker to predict pCR and guide treatment for these patients.

Results

Patients

The clinical and tumor characteristics of the 287 patients included in the analysis are summarized in Table 1. Following NAST, 132 patients (132 of 287, 46%) achieved pCR, 64 (44.1%) in the discovery cohort and 68 (47.9%) in the validation cohort (Supplementary Data 1). NAST consisted of doxorubicin/cyclophosphamide (AC) for four cycles every 2 weeks followed by paclitaxel weekly for 12 doses. DCE-MRI after two and four cycles of AC was used for imaging response assessment (Fig. 1). Figure 2 shows a representative example of a patient with excellent treatment response following two and four cycles of NAST, with pCR confirmed by surgical pathology.

Table 1 Patient and tumor characteristics
Fig. 1
figure 1

Imaging and treatment protocol for patients participating in the ARTEMIS (A Robust TNBC Evaluation Framework to Improve Survival) trial.

Fig. 2: A 44 year-old woman with TNBC undergoing NAST.
figure 2

MRI at baseline (A), after two cycles (B), and four cycles (C) of AC shows an oval, heterogeneously enhancing mass with associated clip marker (arrows). MRI shows concentric shrinkage and tumor volume reduction of 83% after two cycles, and 96% after four cycles. Pathology after the completion of NAST confirmed pathologic complete response.

Tumor volume reduction after two cycles of NAST

In the discovery cohort, tumor volume reduction > 80% after two cycles of NAST predicted pCR with a PPV of 82% (18 of 22 patients had pCR) (Fig. 3a). The results in the validation cohort confirmed these findings: tumor volume reduction > 80% predicted pCR with a PPV of 83% (29 of 35 patients had pCR) (Fig. 3c).

Fig. 3: PPV and NPV for probability of pCR by the percent tumor volume reduction after two cycles of NAST in the discovery cohort (A, B) and the validation cohort (C, D).
figure 3

PPV was 82% in the discovery cohort (A) and 83% in the validation cohort (C) at tumor volume reduction > 80% (vertical line). NPV was 91% in the discovery cohort (B) and 84% in the validation cohort (D) at tumor volume reduction < 55% (vertical line). The dotted lines represent 95% confidence intervals. Area under the curve (AUC) 0.82 (0.74-0.90, p < 0.0001) in the discovery cohort and 0.81(0.73-0.89, p < 0.0001) in the validation cohort.

In the discovery cohort, tumor volume reduction <55% after two cycles of NAST predicted residual disease with an NPV of 91% (39 of 43 patients had residual disease) (Fig. 3b). In the validation cohort, tumor volume reduction < 55% predicted residual disease with an NPV of 84% (31 of 37 patients had residual disease) (Fig. 3d).

The AUC for tumor volume reduction to predict pCR after two cycles of NAST was 0.82 (0.74-0.90) in the discovery cohort and 0.81(0.73-0.89) in the validation cohort (p < 0.0001) (Fig. 4a). For a subset of 90 patients, the AUCs for tumor volume reduction and for functional tumor volume reduction after two cycles of NAST were comparable (p = 0.67, Table 2) and were 0.80 (0.70-0.89) and 0.79 (0.69-0.88), respectively.

Fig. 4: Area under curve (AUC) for tumor volume reduction to predict pCR after two cycles and four cycles of NAST.
figure 4

AUC for tumor volume reduction to predict pCR after two cycles of NAST was 0.82 (0.74-0.90) in the discovery cohort and 0.81(0.73-0.89) in the validation cohort (p < 0.0001) (A). AUC for tumor volume reduction to predict pCR after four cycles of NAST was 0.82 (0.75–0.90) in the discovery cohort and 0.79 (0.71-0.86) in validation cohort (p < 0.0001) (B).

Table 2 Comparison of the areas under the receiver operating characteristics curves (AUCs) for percent changes of functional tumor volumes and tumor volume reduction for a subset of 90 patients

Tumor volume reduction after four cycles of NAST

In the discovery cohort, tumor volume reduction > 90% after four cycles of NAST predicted pCR with a PPV of 80% (37 of 46 patients had pCR) (Fig. 5a). In the validation cohort, tumor volume reduction > 90% predicted pCR with a PPV of 75% (45 of 60 patients had pCR) (Fig. 5c).

Fig. 5: PPV and NPV for probability of pCR by the percent tumor volume reduction after four cycles of NAST in the discovery cohort (A, B) and the validation cohort (C, D).
figure 5

PPV was 80% in the discovery cohort (A) and 75% in the validation cohort (C) at tumor volume reduction >90%. NPV was 84% in the discovery cohort (B) and 81% in the validation cohort (D) at tumor volume reduction <80% (vertical line). The dotted lines represent 95% confidence intervals. AUC 0.82 (0.75–0.90, p < 0.0001) in the discovery cohort; AUC 0.79 (0.71–0.86, p < 0.001) in the validation cohort.

In the discovery cohort, tumor volume reduction <80% after 4 cycles of NAST predicted residual disease with an NPV of 84% (46 of 55 patients had residual disease) (Fig. 5b). In the validation cohort, tumor volume reduction < 80% predicted residual disease with an NPV of 81% (29 of 36 patients had residual disease) (Fig. 5d).

The AUC to predict pCR after four cycles of NAST was 0.82 (0.75–0.90) in the discovery cohort and 0.79 (0.71-0.86) in validation cohort (p < 0.0001) (Fig. 4b). For a subset of 90 patients, the AUCs for tumor volume reduction and for functional tumor reduction after four cycles of NAST were comparable (p = 0.85, Table 2) and were 0.81 (0.73–0.90) and 0.82 (0.73–0.91), respectively.

Inter-reader variability

ICCs were calculated for all DCE-MRI timepoints using measurements from both readers in the study. An ICC value closer to 1 means better reproducibility, and values > 0.81 indicate excellent reproducibility10. The ICC (95% CI) was 0.951 (0.939–0.961) for baseline measurements, 0.987 (0.983-0.990) for measurements after two cycles, and 0.996 (0.995-0.997) for measurements after four cycles, indicating excellent inter-reader reproducibility.

Discussion

Our analysis of this large prospective cohort of patients with TNBC showed that early DCE-MRI tumor volume changes can differentiate responders from non-responders and serve as a clinically relevant noninvasive predictor of pCR just after two or four cycles of NAST with AUC of 0.80. Tumor volume reduction greater than 80% after two cycles of NAST was predictive of pCR with high PPV of 80%. Less than 55% tumor volume reduction early after two cycles of NAST initiation was predictive of residual disease with NPV of 84%. The percentage tumor volume reduction thresholds shown in our study can serve as a practical, clinically applicable model in patients with TNBC undergoing NAST. Patients who achieve tumor volume reduction > 80% after two cycles of NAST, or greater than 90% after four cycles of NAST have a high likelihood of pCR, potentially triaging them to standard-of-care NAST or to investigational de-escalation therapies and sparing them unnecessary toxicity and cost. Similarly, tumor volume reduction < 55% after two cycles, or less than 80% after four cycles suggests chemoresistance to standard NAST, allowing these patients the opportunity to be considered for investigational treatments, and biomarker-directed novel targeted treatment regimens to improve the likelihood of pCR.

The clinical utility of using early DCE-MRI response to tailor NAST therapy was demonstrated in a retrospective study by Fatayer et al. in which DCE-MRI was performed after two cycles of NAST11. Tumor response was measured using the modified RECIST criteria and all molecular subtypes of breast cancer were included. Partial response was defined as 50% reduction in the diameter of the index tumor mass, or 25–50% reduction with improvement of the enhancement curve; minimal response was defined as reduction in tumor diameter of 25–50% without improvement in the enhancement curve. Of 72 patients with minimal response on DCE-MRI after two NAST cycles, 62 (86%) were switched to a different regimen, with improvement in tumor response seen in 74.2% following this change11. The utility of early DCE-MRI response to predict pCR was shown by Golshan et al.9. In their prospective study of 519 patients with TNBC, DCE-MRI response was assessed after 12 weeks of paclitaxel-based therapy using percentage reduction in the largest lesion diameter. Complete MRI response was defined as resolution of all target lesions, while partial response was defined as reduction in largest tumor diameter by 50% or greater. Complete DCE-MRI response at 12 weeks of paclitaxel-based therapy was seen in 116 patients (22%), of whom 91 (78.5%) achieved pCR at final pathology. Partial DCE-MRI response was seen in 166 (32%), of whom 95 (57.2%) had pCR. The PPV of pCR based on complete DCE-MRI response after 12 weeks of NAST was 79%. Our study builds on these results in that it attempts to predict pCR at earlier timepoints, after two and four cycles of NAST, to allow for more time for risk-adapted therapy. Additionally, our study identified percentage tumor volume reduction values that identified which early partial responders were likely to achieve pCR. Furthermore, our study differed from these studies in that we used tumor volume and tumor volume reduction, instead of percent difference in largest single target lesion diameter.

The clinical utility of functional tumor volumes was demonstrated by the I-SPY 1 Trial, which demonstrated that breast DCE-MRI was more accurate than clinical examination at predicting pCR7. Functional tumor volume is the number of pixels within the tumor that reach a minimum predefined threshold of signal enhancement and requires specialized software12. Hylton et al. showed that the use of functional tumor volumes was more accurate than the use of the longest tumor diameter for pCR prediction. In their study, functional tumor volume after one cycle of NAST was the most predictive of pCR with AUC of 0.707. Functional tumor volume after one cycle of NAST also had the strongest predictive performance for recurrence-free survival12. Recurrence-free survival was also shown in a retrospective study by Gampenrieder et al. to be significantly associated with tumor response based on DCE-MRI13.

In a similar study by Henderson et al., enhancing tumor volume was shown to be a comparable or a more useful model than functional tumor volume at predicting pCR: sensitivity of 81% and specificity 91.8%, compared to 80% and 76.8% for functional tumor volume14. Enhancing tumor volume, defined as the total number of pixels that enhance, is a another measure of tumor volume and does not inadvertently exclude foci of tumor that do not meet a predefined threshold. In our study, an a simpler form of volume estimation was performed by measuring the tumor dimensions in all three planes during the early phase of DCE-MRI images. These results are promising as this method of volume estimation is more accessible to radiologists and does not require dedicated software. The excellent interreader reproducibility shown in our study further supports the clinical utility of manual volume measurements and volume reduction calculations.

Our findings are consistent with our earlier study using B scale ultrasound to estimate tumor volume reduction and to predict pCR; in that study tumor volume reduction >80% after two cycles of AC was predictive of pCR15. However, although ultrasound is an inexpensive and widely available tool, breast MRI is more reproducible across centers and demonstrates less inter-reader variability and operator dependency14. Breast MRI is the most sensitive breast imaging modality and is superior to physical examination, mammography, and sonography at estimating tumor volume and tumor response when compared to surgical pathology16. Studies have shown that breast MRI is more accurate and sensitive than conventional imaging for assessing treatment response after conclusion of NAST6,17,18,19,20,21,22,23,24,25. Newly developed American College of Radiology appropriateness criteria as well as the guidelines from the European Society of Breast Imaging recommend MRI as the modality of choice for tumor response assessment26.

With the introduction of immunotherapy in the treatment of early TNBC following the KEYNOTE-522 trial, it has become evident that not all patients benefit from this treatment4,27. Although the patients receiving pembrolizumab had higher rates of pCR and improved disease-free survival than the control arm, 64.8% versus 51.2%, patients who received this regimen had higher incidence of treatment-related adverse events, including death4,27. Therefore, development of noninvasive imaging biomarkers that allow identification of responders versus nonresponders early during NAST is of even greater importance. If these findings are confirmed in additional datasets, this model could identify patients who could be spared immunotherapy and avoid its associated toxicities if projected to have a high chance of pCR on standard chemotherapy alone.

The main novelty of our study is in using changes in tumor volume based on three dimensional measurements from standard of care DCE-MRI performed early during NAST as a clinically relevant noninvasive biomarker for pCR prediction. This model could be easily obtained and adapted in daily clinical practice without the need of specialized software for post processing or additional analysis. High inter-reader reproducibility of DCE-MRI tumor measurements further supports the use of this model across different practices. Furthermore, a subset analysis confirmed that the AUC for tumor volume reduction is comparable to the AUC of functional tumor volume reduction.

The main limitation of our study is that it was performed at a single institution. This study included only patients with TNBC; therefore, results cannot be directly applied to other subtypes of breast cancer. Our prospective protocol finished enrollment before US Food and Drug Administration approval of immunotherapy for TNBC; therefore, only patients undergoing chemotherapy were included in this study. We are evaluating the applicability of these results to an immunotherapy cohort in an ongoing prospective clinical trial.

DCE-MRI tumor volume reduction early during NAST is a useful tool for predicting pCR in TNBC. Our study proposes a novel, clinically relevant noninvasive imaging biomarker that can be easily used for predicting pCR and non-PCR status as early as after two cycles of NAST. These findings may allow patients with a high likelihood of pCR to avoid toxicities of a complete chemotherapeutic regimen, including those of immunotherapy. Excellent responders may be triaged to de-escalation treatment strategies, including shortening of NAST regimens and early surgery, while nonresponders can be triaged to clinical trials for treatment escalation and targeted therapy. Clinical application of these results will require multi-institutional prospective trials including other breast cancer molecular subtypes and different treatment regimens.

Methods

Patients

Patients were enrolled in the ARTEMIS (A Robust TNBC Evaluation Framework to Improve Survival) prospective clinical trial for patients with stage I to III TNBC (NCT02276443). The study was approved by the Institutional Review Board at the University of Texas MD Anderson Cancer Center. Informed consent was obtained from all patients before study enrollment. The study was performed in accordance with all relevant ethical regulations and those specified under the Declaration of Helsinki.

A total of 326 patients with stage I-III TNBC who underwent standard of care NAST and were monitored during therapy using serial DCE-MRI imaging were evaluated for this analysis (Fig. 1). Of 326 patients, 39 were excluded because they had inflammatory breast cancer (3 patients); achieved pCR following experimental targeted therapies (14 patients); had MRI artifact or diffuse disease making tumor response difficult to assess (12 patients); or lacked MRI for both time points: after two cycles and after four cycles of NAST (10 patients). Our study cohort therefore consisted of 287 patients, who were randomly assigned in a 1:1 ratio to a discovery cohort (145 patients) and a validation cohort (142 patients) (Supplementary Data 1).

Chemotherapy

NAST consisted of standard of care doxorubicin/cyclophosphamide (AC) for four cycles every 2 weeks followed by paclitaxel weekly for 12 doses. Imaging response was assessed using DCE-MRI after two and four cycles of AC (Fig. 2). After completion of NAST, patients underwent either mastectomy or breast conserving surgery, and axillary lymph node surgery. Residual disease or pCR was assessed following surgery.

Histopathology

Core needle biopsy of the index tumor was performed in all patients before treatment. ER, PR, HER-2 and Ki-67 results were obtained via immunohistochemical staining of the biopsy specimen. TNBC was defined as tumor in which <10% of invasive tumor cells had positive nuclear staining for ER and PR receptors. HER-2 negativity was defined according to the American Society of Clinical Oncology/College of American Pathologists guidelines28,29. Surgical specimens were examined by dedicated breast pathologists. pCR was defined as no residual invasive disease (ypT0 or ypTis) in the breast and the resected nodal tissue (ypN0).

MRI and tumor volumes

Patients were imaged with DCE-MRI at baseline, after two cycles and four cycles of NAST (GE 3 Tesla MR750w whole body scanner (Waukesha, WI) with bilateral 8-channel phased array RF coil). Imaging was performed in the prone, feet first position. A single bolus of gadobutrol (Gadovist, Bayer Health Care) was injected (0.1 mL/kg at 2 mL/second followed by saline flush). At least one pre-contrast phase was obtained. The imaging protocol included a T2-weighted series and DCE-MRI series based on differential subsampling with Cartesian ordering (DISCO) sequence. Typical MRI scan parameters used for DISCO acquisition were as follows: field of view = 34 × 34 cm, slice thickness = 3.0 mm, slice spacing = −1.5 mm, flip angle = 12°, repetition time = 7.6 ms, echo time 1/echo time 2 = 1.1/2.3 ms, total acquisition time = 7 min, matrix = 320 × 320, number of acquired slices = 60 to 115, in-plane spatial resolution = 0.6−0.8 mm, temporal resolution of DISCO series = 8−15.5 s, receiver bandwidth = ±166.7 kHz, and number of excitations = 0.69.

Tumor volumes were calculated using manual measurements in the anterior-posterior, craniocaudal, and transverse dimension (width, height, length,) of the enhancing index lesion obtained during the early phase of DCE-MRI (acquired at 60 s after contrast injection) using the ellipse volume equation:

$${\rm{Volume}}={\rm{width}}\times {\rm{length}}\times {\rm{height}}\times 0.52$$

Tumors were independently measured by two fellowship-trained breast imaging radiologists with 10 and 16 years of experience, respectively, to allow evaluation of inter-reader variability. Percentage tumor volume reduction was calculated after two and four cycles of NAST with respect to the volume at baseline, using the following formula:

$$\% {\rm{Tumor\; volume\; reduction}}=({{\rm{Volume}}}_{{\rm{baseline}}}-{{\rm{Volume}}}_{2{\rm{cycles\; or}}4{\rm{cycles}}})/{{\rm{Volume}}}_{{\rm{baseline}}}$$

Statistical analysis

Receiver operating characteristic (ROC) analysis and assessment of AUC were used to evaluate the predictability of pCR based on the percentage reduction in tumor volume between baseline and after two cycles of NAST and between baseline and after four cycles of NAST. ROC curves were plotted with sensitivity on the y-axis and 1-specificity on the x-axis. Positive predictive value (PPV) and negative predictive value (NPV) curve analyses with 95% confidence intervals were plotted, along with Youden index calculations. Combination of the highest PPV with best Youden’s index was used to identify the specific tumor volume reduction threshold that could predict the occurrence of pCR and highest NPV and Youden’s index combination for the presence of residual disease, after two and four cycles of NAST. Patient age was compared between the discovery and validation cohorts using a Wilcoxon rank-sum test. Categorical variables were compared between the two cohorts using either chi-square tests or Fisher exact tests. The interreader variability in volumetric measurements was assessed by calculating the intraclass correlation coefficients (ICCs). A p-value <0.05 was considered statistically significant. All statistical analyses were carried out using R (version 3.6.1, R Development Core Team, Vienna, Austria). For a subset of 90 patients that were included in both, this cohort and the cohort from Panthi et al.8, the AUCs for the percentage changes in functional tumor volumes were compared with those for tumor volume reduction using DeLong’s test.