Abstract
Pathogenic coding variants in BRCA1, BRCA2 and PALB2 confer hereditary breast/ovarian cancer risk, yet these regions comprise less than 10% of the genomic footprint of these genes, leaving most sequence unexplored. We investigated the contribution of non-coding variation to hereditary breast cancer by analyzing intronic variants and 5′ upstream regions of BRCA1, BRCA2 and PALB2 in the BEACCON case–control study of over 11,000 participants. Full-gene sequencing showed that 46.3% of cases carried at least one rare non-coding variant. This was associated with a modest increase in breast cancer risk (OR = 1.2, p < 0.0001), most likely reflecting the presence of a small proportion of pathogenic variants within a larger background of predominantly neutral variation. Stronger enrichment was observed for triple-negative disease, particularly for BRCA1 (OR = 1.5, p = 0.0001). Tumor sequencing of 42 high-priority variants identified 11 (26.2%) with wild-type allele loss and high homologous recombination deficiency. Functional CRISPR/Cas9 knock-in assays in MCF10A cells confirmed that two deep intronic variants created aberrant splice sites, disrupted splicing and impacted transcript expression.
Similar content being viewed by others
Data availability
The sequencing data generated in this study are not publicly available due to ethical and privacy restrictions but are available from the corresponding author upon reasonable request and with appropriate ethical approval.
Code availability
Python script used for data analysis available at https://github.com/Qihong0925/rare-non-coding-variants-in-BRCA1-BRCA2-and-PALB2-.git. The Seqliner code is available separately from the R script at http://bioinformatics.petermac.org/seqliner/. All other publicly available code used during whole genome sequence data processing and variant calling are available via the links mentioned within the methods.
References
Apostolou, P. & Fostira, F. Hereditary breast cancer: the era of new susceptibility genes. BioMed. Res. Int. 2013, 1–11 (2013).
Shiovitz, S. & Korde, L. A. Genetics of breast cancer: a topic in evolution. Ann. Oncol. 26, 1291–1299 (2015).
Hoang, L. N. & Gilks, B. C. Hereditary breast and ovarian cancer syndrome: moving beyond BRCA1 and BRCA2. Adv. Anat. Pathol. 25, 85–95 (2018).
Thompson, E. R. et al. Exome sequencing identifies rare deleterious mutations in DNA repair genes FANCC and BLM as potential breast cancer susceptibility alleles. PLoS Genet. 8, e1002894 (2012).
Wilcox, N. et al. Exome sequencing identifies breast cancer susceptibility genes and defines the contribution of coding variants to breast cancer risk. Nat. Genet. 55, 1435–1439 (2023).
Li, N. et al. Investigation of monogenic causes of familial breast cancer: data from the BEACCON case-control study. Npj Breast Cancer 7, 76 (2021).
James, P. A. et al. Estimating the proportion of pathogenic variants from breast cancer case–control data: Application to calibration of ACMG/AMP variant classification criteria. Hum. Mutat. 43, 882–888 (2022).
Montalban, G. et al. Screening of BRCA1/2 deep intronic regions by targeted gene sequencing identifies the first germline BRCA1 variant causing pseudoexon activation in a patient with breast/ovarian cancer. J. Med. Genet. 56, 63–74 (2019).
Gulsuner, S. et al. Long-read DNA and cDNA sequencing identify cancer-predisposing deep intronic variation in tumor-suppressor genes. Genome Res. 34, 1825–1831 (2024).
Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Nature 578, 102–111 (2020).
Evans, D. G. R. et al. A dominantly inherited 5′ UTR variant causing methylation-associated silencing of BRCA1 as a cause of breast and ovarian cancer. Am. J. Hum. Genet. 103, 213–220 (2018).
Moles-Fernández, A. et al. Role of splicing regulatory elements and in silico tools usage in the identification of deep intronic splicing variants in hereditary breast/ovarian cancer genes. Cancers 13, 3341 (2021).
Walker, L. C. et al. Using the ACMG/AMP framework to capture evidence related to predicted and observed impact on splicing: recommendations from the ClinGen SVI splicing subgroup. Am. J. Hum. Genet. 110, 1046–1067 (2023).
Lim, B. W. X. et al. Somatic inactivation of breast cancer predisposition genes in tumors associated with pathogenic germline variants. JNCI J. Natl. Cancer Inst. 115, 181–189 (2023).
Canson, D. M. et al. SpliceAI-10k calculator for the prediction of pseudoexonization, intron retention, and exon deletion. Bioinformatics 39, btad179 (2023).
Breast Cancer Association Consortium Breast cancer risk genes—association analysis in more than 113,000 women. N. Engl. J. Med. 384, 428–439 (2021).
Wood, L. D. et al. The genomic landscapes of human breast and colorectal cancers. Science 318, 1108–1113 (2007).
Wai, H. A. et al. Blood RNA analysis can increase clinical diagnostic rate and resolve variants of uncertain significance. Genet. Med. 22, 1005–1014 (2020).
Devereux, L. et al. Cohort profile: lifepool. Int. J. Epidemiol. 54, dyaf044 (2025).
Jaganathan, K. et al. Predicting splicing from primary sequence with deep learning. Cell 176, 535–548.e24 (2019).
Subramanian, D. N. et al. Assessment of candidate high-grade serous ovarian carcinoma predisposition genes through integrated germline and tumour sequencing. Npj Genom. Med. 10, 1 (2025).
Subramanian, D. N. et al. Exome sequencing of familial high-grade serous ovarian carcinoma reveals heterogeneity for rare candidate susceptibility genes. Nat. Commun. 11, 1640 (2020).
Talevich, E., Shain, A. H., Botton, T. & Bastian, B. C. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput. Biol. 12, e1004873 (2016).
Sztupinszki, Z. et al. Migrating the SNP array-based homologous recombination deficiency measures to next generation sequencing data of breast cancer. Npj Breast Cancer 4, 16 (2018).
Birkbak, N. J. et al. Telomeric allelic imbalance indicates defective DNA repair and sensitivity to DNA-damaging agents. Cancer Discov. 2, 366–375 (2012).
Popova, T. et al. Ploidy and large-scale genomic instability consistently identify basal-like breast carcinomas with BRCA1/2 inactivation. Cancer Res. 72, 5454–5462 (2012).
Abkevich, V. et al. Patterns of genomic loss of heterozygosity predict homologous recombination repair defects in epithelial ovarian cancer. Br. J. Cancer 107, 1776–1782 (2012).
Guo, Q. et al. The mutational signatures of formalin fixation on the human genome. Nat. Commun. 13, 4487 (2022).
Koh, G., Degasperi, A., Zou, X., Momen, S. & Nik-Zainal, S. Mutational signatures: emerging concepts, caveats and clinical applications. Nat. Rev. Cancer 21, 619–637 (2021).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Acknowledgements
We thank all participants of the ViP and Lifepool studies for generously donating their DNA samples and clinical information. The work of D.C. was supported in part by funding from the Helen Lyons Foundation. This work was supported by the National Breast Cancer Foundation (IIRS-22-027; I.G.C. and P.A.J.).
Author information
Authors and Affiliations
Contributions
Q.Z. was responsible for formal data analysis, investigation, validation, and drafting of the original manuscript, and contributed to data curation, methodology, and visualization. In addition, Q.Z. undertook more than half of the tumor sample DNA collection and sequencing data processing, and was responsible for the design and construction of the mutant cell lines. M.Z. and N.L. contributed to primary data clean. E.M. was responsible for the collection of DNA from a subset of tumor samples. S.M. and L.D. coordinated patient contact for the collection of blood samples and performed subsequent sample processing. D.C. contributed to the review of data analysis results. D.C., A.B.S., and R.J.S. contributed to the study design. I.G.C. and P.A.J. were jointly responsible for project conceptualization, funding acquisition, supervision and methodology, with I.G.C. providing the resources for this work. D.C., A.B.S., I.G.C. and P.A.J. reviewed and edited the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhao, Q., Li, N., Marinovic, E. et al. Investigating the contribution of rare non-coding variants in BRCA1, BRCA2 and PALB2 to hereditary breast cancer. npj Breast Cancer (2026). https://doi.org/10.1038/s41523-026-00942-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41523-026-00942-z


