Precision medicine solutions for breast cancer include gene expression-based molecular subtyping1 and quantification of the expression of biomarkers such as the estrogen receptor, progesterone receptor, and the human epidermal growth factor receptor 2 (HER2), encoded by the ERBB2 gene, to direct targeted treatment. HER2 inhibitors have been a hallmark success in oncology therapies2 since HER2 gene amplification, occurring in around 15% of invasive breast cancers3, was found to be associated with poor prognosis. Standard methods for determining a tumor’s HER2 status include immunohistochemistry (IHC) and in situ hybridization methods such as FISH, which measure ERBB2 gene copy numbers4.

An important extension has recently been made with the demonstration of improved outcome in metastatic HER2-low patients receiving antibody–drug conjugates (ADCs)5,6. The ASCO/CAP 2018 HER2 testing guidelines7 have been updated8 to acknowledge the new indication for ADC therapy, providing best practice recommendations to identify the often subtle differences between HER2-zero (IHC 0) and HER2-low (IHC 1+ or IHC 2+ with negative ISH) in metastatic breast cancer. Meanwhile, recent evidence from the DESTINY-Breast06 trial suggests the existence of a distinct HER2-ultralow category, defined as IHC 0 with faint/incomplete membranous staining in ≤10% of tumor cells—reinforcing the growing recognition of heterogeneity within the HER2-low subgroup. While still formally HER2-negative, such cases may still respond to HER2-directed ADCs due to minimal antigen expression. Notably, Huang et al.9 have emphasized the lack of robust clinical evidence for efficacy in true IHC 0 tumors without any membranous staining, underscoring the need for more precise discrimination among these categories. The updated testing guidelines, however, ultimately affirmed the existing HER2 reporting recommendations, while admitting that it remains challenging to differentiate between HER2-low and HER2-zero based on standard IHC staining, which could result in erroneous treatment recommendations and potentially impact ongoing clinical trials8.

This shortcoming could be rectified by including quantitative ERBB2 mRNA measurements by nCounter-based platforms10,11 or high-throughput transcriptomics such as DNA microarrays12,13 or next generation sequencing (NGS)14,15, as a robust biomarker complementary to conventional IHC in predicting therapeutic benefit. Notably, these methods are, by design, free of well-described pitfalls4 of IHC and FISH such as variations in fixation, choice of antibody (IHC only), and scoring interpretation.

In this study, we sought to explore the relevance of transcriptomics-derived measurement of HER2 in distinguishing between HER2-zero and HER2-low cancers, compared with the histopathology-derived assessment of HER2 status. We collected transcriptomic profiles of 3564 primary breast cancer tumors from 3512 patients treated at Copenhagen University Hospital, Rigshospitalet generated between the years 2014 and 2022. After discarding samples not assigned IHC-derived scores for HER2 (Supplementary Fig. 1), this led to a collection of 3182 samples from 3137 patients, 830 of which had been processed with RNA-Seq and 2352 with tissue microarrays. These two technologies have been shown to generate results with high concordance16, and we indeed observed a similar distribution of expression values between the two sets of sample populations (Fig. 1).

Fig. 1: Histogram of ERBB2 expression of samples colored by HER2 IHC scoring.
figure 1

a Histogram of samples profiled with RNA-Seq; b Histogram of samples profiled with microarrays. Distribution of HER2 IHC score is indicated in each bin by a different color; blue: 0; yellow: 1+; green: 2+/normal; orange:2+/amplified; pink: 3+. x-axis: normalized log2-transformed ERBB2 values measured with transcriptomics; y-axis: the number of samples in each bin.

As expression measurements from the two transcriptomic platforms come in different scales and could not be directly compared, we divided the expression value distributions into five expression classes (“very low”, “low”, “intermediate”, “high” and “very high”) delineated by four quantiles (0.05, 0.25, 0.75, 0.95) (Fig. 1). Interestingly, we noticed that for 86% of all samples that had been assigned IHC 0 by standard classification, their ERBB2 expression fell under “low” (41%), “intermediate” (42%), and even “high” (4%) classes. This observation highlights the sensitivity of quantitative transcriptomics in detecting HER2 signal in samples with low levels of the protein, thus adequately distinguishing between HER2-zero and HER2-low.

Furthermore, we performed ROC analysis using ERBB2 expression as the predictor and IHC-based HER2 classification as the reference (HER2-low being samples with IHC score of 1+ or 2+ with negative in situ hybridization and HER2-zero being samples with IHC score of 017) (Supplementary Fig. 2). The AUC values were ≥0.75 for both RNA-Seq and microarray datasets, indicating strong performance. Nonetheless, our study aims to establish an expression-based approach to classifying breast tumors, aligning more closely with the definitions of HER2-low and HER2-zero in Tarantino et al.18. In this context, while our “very low” class is not exclusive to IHC = 0 (Fig. 1B) and includes samples with minimal HER2 expression, we consider it equivalent to HER2-zero. Similarly, our ‘low’ class corresponds to HER2-low. Given this distinction, despite previous IHC-based studies reporting higher proportions of HER2-low status among ER-positive breast cancers19, our analysis finds transcriptomics-defined HER2-low to be more prevalent in ER-negative disease (Supplementary Figure 3).

For a subset of 324 samples that received neoadjuvant chemotherapy (NACT), we obtained scores for response to NACT in the form of residual cancer burden (RCB), as well as treatment information. 44% of samples were from patients that received chemotherapy in combination with targeted anti-HER2 therapy (Supplementary Tables 2, 3). To investigate whether ERBB2 expression classes modulate response to NACT in samples of patients either treated or untreated with anti-HER2, we compared the pathological complete response (pCR) rates across ERBB2 mRNA and HER2 IHC classes, stratifying by the receipt of anti-HER2 therapy (Fig. 2). As only patients with an IHC score of 2+ (plus positive ISH) or 3+ are offered targeted treatment, there is an inherent selection bias in treatment assignment. As such, there are no samples with “very high” expression class in the untreated group and no samples of “very low” expression class in the treated group. As expected, among both treated and untreated patients, pCR is more widely distributed across ERBB2 expression classes compared to IHC scores, with the highest proportions of pCR observed in treated patients' tumors classified as “very high”.

Fig. 2: Distribution of response to treatment stratified by receipt of anti-HER2 therapy.
figure 2

The proportion of NACT samples in each treatment response category (RCB), split by ERBB2-expression class (a) and HER2 IHC score (b). The left-hand side displays samples from patients that received chemotherapy and the right-hand side displays samples from patients that received combined therapy. Gray color represents pCR, and blue indicates residual disease (non-pCR). Labels denote the absolute number of samples in each bar segment. RCB: residual cancer burden; class: ERBB2-expression class; HER2 IHC: HER2 status determined by immunohistochemistry.

Using Fisher’s Exact Test, both treated and untreated groups showed significant (p < 0.05) association between ERBB2 expression classes and treatment response in the form of pCR versus residual disease. On the contrary, when examining the association between HER2 IHC scores and treatment response, it was only found significant in the treated group. This is a further indication that NACT could be beneficial for more patients with tumors lacking IHC 3+ score. Notably, although our results link ERBB2 mRNA levels to NACT response, other studies have reported conflicting evidence. In a cohort of patients with HR + /HER2-negative breast cancer20, IHC HER2 status did not correlate with RCB score. Similarly, Schettini et al.21 found no association between baseline ERBB2 mRNA and pathological outcomes after NACT. Thus, more research is needed to determine whether response to neoadjuvant chemotherapy is ERBB2 expression-dependent.

Taken together, our findings underscore the importance of quantifying ERBB2 mRNA in cases where HER2 expression is too low for standard IHC to provide reliable results. Here, transcriptomic measurements better capture the continuum of HER2 expression compared to IHC-based classification (Supplementary Fig. 4). This study has certain limitations, including the lack of external validation and limited RCB data, which may restrict broader applicability of the findings.

Moreover, the DESTINY trial results have spurred renewed interest in developing more sensitive HER2 assays to better capture this heterogeneity and inform treatment decisions22,23,24. Despite its cost and relative complexity, RNA-Seq remains indispensable when comprehensive transcriptome data are required as it enables the identification and quantification of both known and novel transcripts—including non-coding RNAs, single nucleotide variants, fusion genes, and splice variants13. Moreover, RNA-Seq is ideally suited for high-throughput clinical diagnostics25 as it allows pooling and simultaneous processing of dozens of samples. Our combined dataset of 3564 cross-platform, whole-transcriptome ERBB2 mRNA measurements from breast tumors thus provides a reference set for future studies employing this method.

Refinement of HER2-low tumor classification and integration of mRNA profiling into diagnostics could improve patient selection for anti-HER2 therapies and optimize treatment outcomes. Ongoing Phase 2 clinical trials systematically enrolling HER2-low patients6 have the potential to suggest refinements of the targeted treatment strategies.

Methods

Study population and sample selection

The Copenhagen Breast Cancer Genomics Study (CBCGS) population is described in refs. 26,27, while the sample selection is illustrated in Supplementary Fig. 1. This register-based study was conducted with approval of the Danish Data Protection Agency (jr. no.: 2012-58-0004) and Danish Breast Cancer Group (jr. no.: DBCG-2015-14). The study did not include any contact with patients nor use of biological material, and thus ethical approval, including the need to obtain informed consent, was explicitly waived by the Ethical Committee of the Capital Region of Denmark. The NACT cohort comprises two distinct treatment groups. A subset of patients received chemotherapy alone—encompassing both single-agent and multiagent regimens—while another subset, corresponding to HER2-positive cases, received chemotherapy in combination with targeted anti-HER2 therapy (Supplementary Table 2).

NGS library preparation and sequencing

Total RNA was isolated from fresh tumor biopsies as previously described27. For 2,610 tumor samples, gene expression was quantified using Human Genome U133 Plus 2.0 Array (Afymetrix, Santa Clara, CA, USA) while for 954 samples it was quantified using paired-ended RNA-Sequencing on an Illumina NovaSeq (2 × 150 bp) or HiSeq2500 (2 × 126 bp) platform.

Computational data preprocessing

Microarray and NGS data were processed by the computational pipelines described in ref. 27. Briefly, for microarray files, the raw intensity.CEL file was preprocessed by quantile normalization, and probe summaries were extracted via robust multi-array average (RMA) using the affy package28 in R v. 4.0.041. For RNA-seq, fastq files were mapped to the human reference genome (GRCh37.p13 for samples analyzed before 2021, GRCh38 for subsequent samples) using STAR aligner29 v. 2.7.2b and reads overlapping features were quantified using Subread package’s featureCounts v. 1.6.230. Subsequently, values for each sample were normalized by size factors determined by the median ratio of gene counts relative to geometric mean per gene using the DeSeq2 R package31.

Histopathology data

Clinical and sample metadata were retrieved from DCBG, including metadata on histopathology assessments. We used the guidelines of the ASCO/CAP 20187, same as all Danish hospitals for histopathology assessment of HER2. For the non-NACT samples, only the earliest registered genomic sample for each patient was retained. Supplementary Fig. 1. Samples flagged for issues such as low quality, containing a large amount of normal tissue or classified as CIT256 molecular subtype outliers were removed.

Computational data analysis

Histopathology metadata such as HER2 immunohistochemistry scoring, FISH ERBB2 scoring, HER2 final status, and residual cancer burden were matched with transcriptomics measured ERBB2 expression by patient ID. Only HER2 status data that could be uniquely assigned to a transcriptomic sample were retained. R was used to curate histopathology metadata, while statistical analysis and visualization were carried out using Python libraries pandas32, numpy33 and seaborn34. The ROC curve was created using Python library scikit-learn35, whereby the IHC-based definition of HER2-low (IHC = 1+ or 2 + /normal) and HER2-zero (IHC = 0)36 were used as ground truth labels, with transcriptomics-measured HER2 expression as the continuous predictor.