Preliminary evaluation of ShallowHRD performance compared to HRDetect in familial breast cancer tumors

Adel Jensen, Louise; Baekgaard, Caroline Hey; Larsen, Mie Bohnensack; Boonen, Susanne Eriksen; Bak Jylling, Anne Marie; Hikmat, Zainab; Hao, Qin; van Overeem Hansen, Thomas; Pedersen, Inge Søkilde; Larsen, Martin Jakob; Thomassen, Mads

doi:10.1038/s41598-025-14122-9

Published: 11 August 2025

Louise Adel Jensen^1,2,
Caroline Hey Baekgaard^1,3,
Mie Bohnensack Larsen¹,
Susanne Eriksen Boonen^1,3,
Anne Marie Bak Jylling^4,5,
Zainab Hikmat^1,6,
Qin Hao¹,
Thomas van Overeem Hansen^7,8,
Inge Søkilde Pedersen^9,10,
Martin Jakob Larsen^1,3 &
…
Mads Thomassen^1,2

Scientific Reports volume 15, Article number: 29442 (2025)

Abstract

Determining the Homologous Recombination Deficiency (HRD)-status of a malignant tumor is central in predicting patient response to specific treatments. Therefore, precise and cost-effective tools are needed for clinical implementation. HRDetect is widely regarded as a gold standard for determining HRD-status. ShallowHRD is a simpler algorithm, but it offers a more economical alternative that is optimized for Formalin-Fixed, Paraffin-Embedded (FFPE) tissue and is potentially useful for most breast cancer patients. Data from shallow whole-genome sequencing (1-5X) on FFPE tissue and whole-genome sequencing (50X, and additionally downscaled to 5X) on fresh frozen tissue from 19 patients were analyzed using ShallowHRD and compared to the HRD-status attained by HRDetect using Receiver Operating Characteristic (ROC) curve analysis. Further, Spearman rank correlation was calculated to estimate the correlation between ShallowHRD and HRDetect scores, as well as between the three ShallowHRD datasets. Compared to HRDetect, ShallowHRD displayed high specificity (85.7–100%) and a sensitivity of 80% in all data groups. The ROC curve analyses illustrated that ShallowHRD displayed an area under the curve statistically similar to what was previously reported for HRDetect in all three data sets. The Spearman correlation also indicated significant correlation between ShallowHRD and HRDetect scores for the three datasets (HRDetect vs. FFPE (1-5X) (ρ = 0.68, p = 0.0013), Fresh Frozen (5X) (ρ = 0.58, p = 0.0086), and Fresh Frozen (50X) (ρ = 0.50, p = 0.029)). The ShallowHRD analysis was of good quality in all data groups, and the ShallowHRD scores were similar across data groups. One sample was incorrectly labeled as HRD-negative by ShallowHRD, but it contained one variant of unknown significance (VUS) in RAD51D, requiring further investigation.
HRD-status from ShallowHRD correlated well with HRDetect output in this preliminary study, potentially making ShallowHRD an accurate, efficient, and more economical alternative for clinical use. However, further examination and validation in larger cohorts are required.

Introduction

The Homologous Recombination (HR) Repair pathway is crucial in repairing double-stranded DNA breaks, and several proteins, e.g., BRCA1, BRCA2, and RAD51, are central to the process^1,2. When pathway components are dysregulated or dysfunctional, it leads to HR Deficiency (HRD), which is associated with genomic instability and risk of cancer development^3,4. Despite not yet being part of routine clinical practice when treating breast cancer, determining the HRD-status of a tumor is an important predictive marker, as patients with a high level of HRD have been shown to respond better to treatments such as poly (ADP-ribose) polymerase (PARP) inhibitors and platinum-based chemotherapies than patients with a functional HR Repair Pathway in breast^5,6,7,8,9 and ovarian cancers^5,10,11. Previously, it has been shown that the prevalence of HRD varies among breast cancer tumors according to their PAM50 subtype, with 46–62% of triple-negative breast cancer cases displaying HRD, compared to 2–23% for non-triple-negative breast cancer^12,13. This highlights HRD as a clinically relevant biomarker for predicting treatment response in breast cancer.

Several tools have been developed to assess HRD-status using various HRD definitions and sequencing data types, such as panel sequencing and whole genome sequencing (WGS)¹⁴. This study compares two tools designed for HRD-status assessment using WGS data, HRDetect¹⁵ and ShallowHRD¹⁶, both of which have previously been trained on breast cancer cohorts^15,16. HRDetect has been tested on tumor cohorts from breast^8,15, ovarian^15,17, and pancreatic cancers^15,18, and it is often regarded as the best method to assess HRD-status due to the high precision and robustness of the tool. HRDetect uses several measurements of HRD, including loss of heterozygosity, single base substitutions, and larger structural changes^15,19. In contrast, ShallowHRD only considers larger structural changes, namely Large-scale Genomic Alterations (LGAs)¹⁶, which may result in a less precise assessment of the HRD-status. HRDetect relies on deep WGS (≥ 30X) of preferably fresh frozen tumor tissue, as paraffin-induced artifacts might affect the analysis. Furthermore, germline sequencing is required for somatic variant calling^19,20,21. Conversely, ShallowHRD relies on shallow tumor sequencing (~ 1X) and works on data from fresh frozen as well as Formalin-Fixed, Paraffin-Embedded (FFPE) tissue¹⁶.

Utilizing FFPE tissue offers a significant advantage in the clinical setting, as it is far more readily available than fresh frozen tumor tissue. Additionally, shallow sequencing would be preferable due to its lower cost (10–20 times cheaper, depending on WGS depth for HRDetect), as price can be a limiting factor. Guo et al. tested ShallowHRD with success in predicting ovarian cancer patients’ response to treatment with PARP inhibitors¹¹, and Sztupinszki et al. conducted a comparison of HRDetect to the HRD-score of The Cancer Genome Atlas and concluded that they perform similarly in the assessment of HRD-status in ovarian cancer¹⁷. However, to our knowledge, proper head-to-head assessment of ShallowHRD compared to HRDetect has yet to be performed in breast cancer patients.

Therefore, we aim to compare ShallowHRD results to previously reported HRD-status attained from HRDetect²² in breast cancer patients to assess the usability of ShallowHRD in a clinical setting.

Materials and methods

General workflow

FFPE and fresh frozen tumor tissue were sequenced using WGS at 1-5X and 50X, respectively, and the fresh frozen data was additionally bioinformatically downscaled to 5X. The 50X Fresh Frozen data was used for HRDetect, making up the truth set for comparison. ShallowHRD was used on the fresh frozen data at 50X, the downscaled Fresh Frozen data, and the shallow FFPE sequencing data. Figure 1 provides an overview of the workflow.

Patient material

For this study, 19 breast cancer patients were selected from a cohort of patients that has previously been analyzed using data from WGS (50X) of fresh frozen tissue in HRDetect²² and analyzed for BRCAness using an RNA-profiler²⁴. These 19 patients were selected based on the availability of FFPE tissue matching the previously analyzed frozen tissue. The inclusion criteria for the non-BRCA carriers were as previously described by Andersen et al., and 2 positive BRCA carriers from the original cohort were included²². Patient information can be found in Table S1, where germline variant information is also available²². Of the BRCA-positive patients, Sample 42 carried a known pathogenic germline BRCA1 variant (NM_007300.4:c.2475delC, p.(Asp825Glufs*21)), and Sample 47 carried a pathogenic germline BRCA2 variant (NM_000059.3:c.6490_6491delCA, p.(Gln2164Valfs*11)). The 16 remaining patients carried no known pathogenic variants in BRCA1 or BRCA2. However, two patients (Samples 10 and 37) had a hypermethylated BRCA1 promoter, and one (Sample 11) was a carrier of two variants: a somatic BRCA1 variant (NM_007300.4:c.3668T>A, p.(Leu1223His)) and a germline variant of unknown significance (VUS) in RAD51D (NM_002878.3:c.202G>A, p.(Gly68Ser)). The somatic BRCA1 VUS is considered likely benign, in part due to its low variant allele frequency in the original analysis²², but also due to its location within exon 11 of BRCA1 and outside functional domains, which according to ENIGMA guidelines²⁵ indicates a likely benign variant (BP1_Strong). These five samples were found to be HRD-positive using HRDetect; the results are available in Table S2²².

Whole genome sequencing

The fresh frozen tissue samples and matched blood samples were prepared using the Illumina TruSeq Nano protocol and sequenced on Illumina Novaseq 6000 with paired-end 2 × 150 bp. The mean sequencing coverage was 38.8X for the blood samples and 51.8X for tumor tissue²². The tumor tissue data was termed the Fresh Frozen (50X) group in the following. Data is presented in Table S3. The WGS 50X Fresh Frozen data were then downscaled to 5X using seqtk²⁶ on the FASTQ files.
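The downscaling step can be illustrated by computing the fraction of reads to retain. The following is a minimal sketch and not the authors' pipeline: the function names, file names, naming scheme, and seed are hypothetical, and the only seqtk usage assumed is the documented `seqtk sample -s<seed> <in.fq> <fraction>` form, with the same seed applied to both mate files so that read pairs stay matched. The commands are only built as strings, not executed.

```python
def downsample_fraction(current_depth: float, target_depth: float) -> float:
    """Fraction of reads to keep so that mean coverage drops to target_depth."""
    if current_depth <= 0 or target_depth <= 0:
        raise ValueError("depths must be positive")
    # Never request more reads than are available.
    return min(1.0, target_depth / current_depth)


def seqtk_commands(fastq_r1: str, fastq_r2: str, fraction: float, seed: int = 11) -> list[str]:
    """Build (not run) one seqtk command per mate file, sharing one seed."""
    commands = []
    for path in (fastq_r1, fastq_r2):
        out = path.replace(".fastq", ".5x.fastq")  # hypothetical naming scheme
        commands.append(f"seqtk sample -s{seed} {path} {fraction:.4f} > {out}")
    return commands


# Mean tumor coverage reported in the text is 51.8X; the target is 5X.
fraction = downsample_fraction(51.8, 5.0)
for command in seqtk_commands("tumor_R1.fastq", "tumor_R2.fastq", fraction):
    print(command)
```

In practice the fraction would presumably be computed per sample from its own coverage rather than from the cohort mean used here.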

The FFPE tissue samples were prepared using the ThruPLEX Tag-seq protocol from Takara Bio and sequenced on Illumina Novaseq 6000 with paired-end 2 × 150 bp. Sixteen of the 19 samples were sequenced twice at a shallow depth to check if the lower sequencing depth affected the results. Two of the samples (Samples 11 and 42) were not resequenced as their coverage was already above 5X after Run 1, while Sample 10 was not resequenced due to lack of material. The mean coverage was 3X after Run 1, and for samples undergoing merging, 5.96X after Run 2; coverage information can be found in Table S3. Data from Run 1 were used for the comparison to HRDetect. The WGS data was aligned to the human reference genome (GRCh37) using BWA-MEM²⁷.

Determination of HRD-status

The HRDetect model was applied as previously described²², and the results are regarded as the truth set for the comparisons conducted in this study. As part of the ShallowHRD R-script by Eeckhoutte et al.^16,28, the aligned reads were analyzed using the recommended 50 kb sliding windows with QDNAseq²⁹, which detects Copy Number Alterations (CNAs) and cut-offs for these. The cut-off calculation by ShallowHRD is based on the result of 100,000 simulations, and the cut-off value is termed the M-value. Another obtained value is the corrected Median Absolute Deviation (cMAD), measuring the level of fluctuations in the number of reads per sliding window. These two values indicate analysis quality: a cMAD value below 0.14 and an M-value in the interval 0.025–0.45 indicate good quality¹⁶. ShallowHRD also estimates the tumor content of the samples through the MAX2 value, where a value below 0.16 is considered low tumor content¹⁶.
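The quality criteria above can be expressed as a small check. This is an illustrative sketch only: the thresholds are those quoted in the text, while the names (`QualityReport`, `assess_quality`) are hypothetical and treating the M-value interval as inclusive at both ends is an assumption, since the text does not state how boundary values are handled.

```python
from dataclasses import dataclass

CMAD_MAX = 0.14               # cMAD must be below this value
M_MIN, M_MAX = 0.025, 0.45    # accepted M-value interval (inclusive: assumption)
MAX2_LOW_TUMOR = 0.16         # MAX2 below this value suggests low tumor content


@dataclass
class QualityReport:
    good_quality: bool
    low_tumor_content: bool


def assess_quality(cmad: float, m_value: float, max2: float) -> QualityReport:
    """Apply the three thresholds quoted in the text to one sample."""
    good = cmad < CMAD_MAX and M_MIN <= m_value <= M_MAX
    return QualityReport(good_quality=good, low_tumor_content=max2 < MAX2_LOW_TUMOR)


# Made-up values for illustration, not taken from Table S4.
print(assess_quality(cmad=0.09, m_value=0.21, max2=0.12))
```

Keeping the quality flag and the tumor-content flag separate mirrors the text, where low estimated tumor content did not by itself mark an analysis as poor quality.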

ShallowHRD determines HRD-status based on the CNAs by counting the number of LGAs, meaning segments larger than 3 Mb above the CNA cut-off value. As described by Eeckhoutte et al., samples with fewer than 15 LGAs are HRD-negative, while samples with 20 or more LGAs are HRD-positive. A sample with 15 to 19 LGAs is instead placed in the category Borderline for visual investigation¹⁶. The visual investigation was conducted by 3 individuals, of whom 1 was blinded to the HRDetect status of the samples, and none knew the assessments of the other investigators. All 3 investigators reached the same conclusions regarding the reclassification.
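The LGA thresholds can be written as a three-way rule. The sketch below only restates the cut-offs given in the text; the function name and category strings are hypothetical and this is not the ShallowHRD R implementation.

```python
def classify_lga(n_lga: int) -> str:
    """Map an LGA count to the three categories described in the text."""
    if n_lga < 0:
        raise ValueError("LGA count cannot be negative")
    if n_lga >= 20:
        return "HRD-positive"
    if n_lga >= 15:
        return "Borderline"  # 15 to 19 LGAs: referred to visual inspection
    return "HRD-negative"


# Made-up LGA counts for illustration.
for count in (8, 15, 19, 20, 27):
    print(count, classify_lga(count))
```

Borderline is kept as its own return value because, in the study, those samples are resolved by visual inspection rather than by the count alone.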

Comparison of HRD methods

The output of the ShallowHRD analyses was then compared to the HRDetect-status and was given a category: True Positive or True Negative, if the result correlated with the output from HRDetect, and False Positive or False Negative if it did not.
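The categorization against the truth set can be sketched as below. The function names are hypothetical and the example calls are made up for illustration; they are not the study's per-sample results, which are in Table S4.

```python
from collections import Counter


def categorize(shallowhrd_positive: bool, hrdetect_positive: bool) -> str:
    """Label one ShallowHRD call relative to the HRDetect truth set."""
    if shallowhrd_positive and hrdetect_positive:
        return "True Positive"
    if not shallowhrd_positive and not hrdetect_positive:
        return "True Negative"
    if shallowhrd_positive:
        return "False Positive"
    return "False Negative"


def tally(shallowhrd_calls, hrdetect_calls) -> Counter:
    """Count how many samples fall into each category."""
    if len(shallowhrd_calls) != len(hrdetect_calls):
        raise ValueError("both call lists must cover the same samples")
    return Counter(categorize(s, h) for s, h in zip(shallowhrd_calls, hrdetect_calls))


# Made-up calls for four samples (True = HRD-positive).
print(tally([True, False, True, False], [True, True, False, False]))
```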

Receiver operating characteristic (ROC) curve analysis was performed separately for each of the three datasets (Fresh Frozen 50X, Fresh Frozen 5X, and FFPE 1-5X). The area under the curve (AUC) values were used to assess the correlation between ShallowHRD and HRDetect. The statistical significance of the AUCs was calculated relative to HRDetect’s performance, which is assumed to have an AUC of 1.0, and to that of a random classifier, which corresponds to an AUC of 0.5. Furthermore, a previously estimated AUC for HRDetect of 0.9636 by Diossy et al.³⁰ was compared to the ShallowHRD AUCs of this study.
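The text does not state which test was used to compare an AUC with a reference value. One textbook option, shown here purely as an assumption, is a z-test using the Hanley and McNeil approximation of the AUC standard error. The function names are hypothetical, and the sketch is not claimed to reproduce the p-values in Table 1.

```python
import math


def hanley_mcneil_se(auc: float, n_pos: int, n_neg: int) -> float:
    """Approximate standard error of an AUC (Hanley and McNeil formula)."""
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc**2 / (1.0 + auc)
    variance = (
        auc * (1.0 - auc)
        + (n_pos - 1) * (q1 - auc**2)
        + (n_neg - 1) * (q2 - auc**2)
    ) / (n_pos * n_neg)
    return math.sqrt(max(variance, 0.0))  # guard against tiny negative rounding


def auc_z_test(auc: float, n_pos: int, n_neg: int, reference: float):
    """Return (standard error, two-sided p-value) for auc versus reference."""
    se = hanley_mcneil_se(auc, n_pos, n_neg)
    if se == 0.0:
        # Degenerate case (e.g. AUC of exactly 1.0): no variance to test against.
        return se, (1.0 if auc == reference else 0.0)
    z = (auc - reference) / se
    return se, math.erfc(abs(z) / math.sqrt(2.0))


# Illustrative AUC of 0.90 with 5 positives and 14 negatives, tested against 0.5.
print(auc_z_test(0.90, n_pos=5, n_neg=14, reference=0.5))
```

With only 19 samples, a normal approximation of this kind is rough, which is one reason a bootstrap estimate is a useful complement.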

AUCs, standard errors, and p-values were calculated based on the datasets generated in the study as well as after bootstrapping (1000 iterations) of these datasets. The analysis was performed using functions from Scikit-learn³¹ and scipy³² with the code available on GitHub³³.
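The AUC and its bootstrap standard error can be sketched without external libraries. This is a standard-library illustration, not the authors' Scikit-learn and scipy code from GitHub: the AUC is computed as the rank (Mann-Whitney) statistic, which equals the trapezoidal ROC area, and the function names, the seed, and the choice to skip resamples containing only one class are assumptions.

```python
import random


def roc_auc(labels, scores) -> float:
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for l, s in zip(labels, scores) if l]
    neg = [s for l, s in zip(labels, scores) if not l]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc(labels, scores, n_iter: int = 1000, seed: int = 0):
    """Return (mean AUC, standard error) over n_iter resamples with replacement."""
    if all(labels) or not any(labels):
        raise ValueError("both classes must be present")
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_iter:
        idx = [rng.randrange(n) for _ in range(n)]
        resampled = [labels[i] for i in idx]
        if all(resampled) or not any(resampled):
            continue  # AUC is undefined when a resample holds a single class
        aucs.append(roc_auc(resampled, [scores[i] for i in idx]))
    mean = sum(aucs) / len(aucs)
    se = (sum((a - mean) ** 2 for a in aucs) / (len(aucs) - 1)) ** 0.5
    return mean, se


# Made-up labels (1 = HRDetect-positive) and LGA scores for illustration.
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [27, 22, 12, 18, 9, 6, 3]
print(roc_auc(labels, scores), bootstrap_auc(labels, scores, n_iter=200))
```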

In addition to ROC curve analysis for evaluating ShallowHRD’s classification performance, Spearman rank correlation was used to assess score-level concordance. Correlation between ShallowHRD scores and HRDetect scores was calculated separately for each dataset (FFPE (1-5X), Fresh Frozen (5X), and Fresh Frozen (50X)) using the cor.test function from the R Stats Package³⁴. Further, to assess the reproducibility of ShallowHRD scores across input types and sequencing depths, pairwise Spearman correlations were also performed between each of the ShallowHRD datasets.
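Spearman's ρ is the Pearson correlation of the ranks, with tied values given their average rank. The sketch below shows the coefficient only, in standard-library Python rather than the R `cor.test` call used in the study; it does not compute the p-value, and the function names and example scores are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y) -> float:
    """Plain Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x, y) -> float:
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y):
        raise ValueError("inputs must have equal length")
    return pearson(average_ranks(x), average_ranks(y))


# Made-up LGA counts and HRDetect probabilities for five samples.
print(spearman_rho([5, 12, 18, 22, 30], [0.01, 0.20, 0.05, 0.90, 0.99]))
```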

Results

Quality

Before comparing ShallowHRD to HRDetect, we examined the quality of the samples, the simulations, and the estimated tumor content of the samples. Most samples were estimated to have sufficient tumor content. However, 4 samples were estimated to have low tumor content in the Fresh Frozen (50X) dataset, 8 in the Fresh Frozen (5X) dataset, and 4 in the FFPE (1-5X) set. Two of these samples were estimated by ShallowHRD to have a low tumor content in all datasets (Table S4). This correlates somewhat with previously attained data from Andersen et al., where the Battenberg output indicates a tumor content of above 20% in all samples (see Table S7); however, the number of substitutions found within the samples aligns well with what could be expected in tumor samples, and the data was of a quality usable in HRDetect analysis²².

When examining the ShallowHRD quality parameters, all samples, regardless of sequencing depth and tissue type, are of good quality, as the cMAD value is below 0.14 and the M-value is within the designated interval for all samples, see Table S4. Thus, it is estimated that the analysis was conducted on a dataset of reasonable quality.

ShallowHRD output

The LGA scores show slight variation across the data groups, as illustrated in Fig. 2A, providing a detailed breakdown of the scores for each sample. This visual representation allows for easy identification of the HRD-status, with ShallowHRD outputs (S11, S14, S21, S37) indicating a borderline HRD-status. Additionally, a boxplot displaying individual score distributions is available in Fig. S1, where the variation across the data groups was non-significant. As few completed ShallowHRD simulations were available in a single ShallowHRD analysis of Sample 10, the LGA score of Sample 10 is the average of ten ShallowHRD simulation runs (Fig. 2A). All individual scores were 20 or above, and the results of these separate analyses are available in Table S5.

The HRD-scores from the HRDetect analysis conducted by Andersen et al.²² are presented in Fig. 2B for comparison to the ShallowHRD score.

After resequencing and merging data for 16 samples, slight non-significant differences in LGA scores were identified. However, no category changes were detected (Fig. 2C and Table S4).

Comparison to HRDetect

When comparing the ShallowHRD scores to the HRDetect status, each result was categorized as True Negative, True Positive, False Negative, False Positive, or Borderline if the ShallowHRD score was between 15 and 19. The categorization result is represented in Fig. S2. Here, it is also worth noting that the categorization did not change between Run 1 and the Merged result of the FFPE tissue (Table S4).

It was apparent that a visual inspection of the borderline results was required to complete the categorization and comparison. The visual inspection recategorized the borderline samples into one of the other categories, visualized in Fig. 3A. The individual sample categories are available in Table S4, where it also becomes evident that the categories of the borderline samples after visual assessment are identical to the categorization of the sample in the other tissue datasets.

Slight differences between the tissues are detected in the True Negative and False Positive categories, where ShallowHRD incorrectly categorized two Fresh Frozen (50X) samples and one Fresh Frozen (5X) sample as HRD-positive. However, these differences are not significant (Table S4), and the ShallowHRD scores were highly correlated across the datasets. FFPE (1-5X) scores correlated strongly with those from Fresh Frozen (5X) (Spearman ρ = 0.79, p = 5.1 × 10⁻⁵) and Fresh Frozen (50X) (ρ = 0.79, p = 5.8 × 10⁻⁵). A strong correlation was also observed between Fresh Frozen (5X) and (50X) (ρ = 0.88, p = 5.1 × 10⁻⁷). These results suggest that ShallowHRD provides reproducible estimates of HRD-associated genomic instability across tissue types and sequencing depths. For all three tissue groups, Sample 11 is incorrectly categorized as HRD-negative. All of this brings the sensitivity of ShallowHRD compared to HRDetect to 80% for all tested tissue groups, while the specificity in Fresh Frozen (50X) is 85.7%, in Fresh Frozen (5X) 92.9%, and in the FFPE (1-5X) analysis 100%.
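The reported sensitivity and specificity follow from simple counts. In the sketch below the counts are derived from the text rather than read from Table S4: 5 of the 19 samples are HRDetect-positive (hence 14 negative), Sample 11 is the single false negative in every dataset, and the false positives number two, one, and zero for Fresh Frozen (50X), Fresh Frozen (5X), and FFPE (1-5X), respectively. The function and variable names are hypothetical.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Share of HRDetect-positive samples that ShallowHRD also calls positive."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Share of HRDetect-negative samples that ShallowHRD also calls negative."""
    return tn / (tn + fp)


# Counts derived from the text (5 positives, 14 negatives per dataset).
counts = {
    "Fresh Frozen (50X)": {"tp": 4, "fn": 1, "tn": 12, "fp": 2},
    "Fresh Frozen (5X)": {"tp": 4, "fn": 1, "tn": 13, "fp": 1},
    "FFPE (1-5X)": {"tp": 4, "fn": 1, "tn": 14, "fp": 0},
}

for dataset, c in counts.items():
    sens = 100 * sensitivity(c["tp"], c["fn"])
    spec = 100 * specificity(c["tn"], c["fp"])
    print(f"{dataset}: sensitivity {sens:.1f}%, specificity {spec:.1f}%")
```

With only five positive samples, each misclassified positive shifts sensitivity by 20 percentage points, which is why the text attributes the 80% figure largely to sample size.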

To assess the overall performance of ShallowHRD compared to HRDetect in determination of HRD-status, a ROC curve analysis was performed (Fig. 3B). AUC values and p-values for each data set, with and without bootstrapping, are presented in Table 1, where it becomes evident that ShallowHRD is significantly different from a random classifier in all data sets. The non-bootstrapped data sets are not statistically different from the assumed AUC of HRDetect (1.0) or the AUC achieved by Diossy et al.³⁰, while the bootstrapped datasets are significantly different in all comparisons, except for Fresh Frozen 5X against the Diossy et al.³⁰ AUC.

Table 1 P-values when comparing the ShallowHRD dataset AUCs to HRDetect (AUC 1.0) and any random classifier (AUC 0.5).

Spearman rank correlation analysis also indicated a strong positive correlation between HRDetect and ShallowHRD scores in FFPE (1-5X) samples (ρ = 0.68, p = 0.0013). Correlations were moderate but still statistically significant in Fresh Frozen (5X) (ρ = 0.58, p = 0.0086) and Fresh Frozen (50X) samples (ρ = 0.50, p = 0.029). These findings indicate general concordance between the two methods at the score level, particularly in the clinically relevant FFPE setting.

Discussion

The aim of our study was to compare ShallowHRD to HRDetect in breast cancer tissue. After visual inspection of genomic profiles, we found that ShallowHRD performed with high specificity and sensitivity in all data groups. However, slight differences between the data groups were detected, with the FFPE analysis outperforming the Fresh Frozen datasets when focusing on false positives. This may be attributable to improvements implemented in preparation kits in the time from sequencing of Fresh Frozen to FFPE tissue. The specificity values observed in our study align well with those previously reported, while the slightly lower sensitivity is likely influenced by the limited sample size¹⁶.

The ROC curve, AUC values, and statistical significance analyses illustrate that ShallowHRD performs significantly better than a random classifier. No statistical differences were found when comparing the ShallowHRD AUC to that of HRDetect from Diossy et al.³⁰ or the perfect classifier AUC. When bootstrapping the ShallowHRD output, only the Fresh Frozen 5X dataset produced AUC values statistically similar to those from Diossy et al.; all other bootstrapped values differed significantly.

In summary, ShallowHRD performs significantly better than a random classifier in its ability to determine HRD-status, and it displays AUC performance similar to HRDetect, in both FFPE and frozen tissue datasets. In the study by Davies et al., the sensitivity of HRDetect was reported to decrease from nearly 1.0 at 30X coverage to 0.86 at 10X coverage in fresh frozen tissue. Additionally, HRDetect was applied on an FFPE-derived sample with a known pathogenic BRCA1 variant and correctly predicted HRD, despite formalin-induced artifacts that prevented detection of a key mutational signature¹⁵. However, broader FFPE validation was not conducted. As such, given the 0.8 sensitivity achieved in this study and a comparable AUC, ShallowHRD may offer a more practical alternative when FFPE tissue is the only option available in the assessment of HRD-status in breast cancer, as previously demonstrated for ovarian cancer¹¹.

Further, the observed Spearman correlation between HRDetect and ShallowHRD scores, especially the strong correlation in FFPE samples, supports the hypothesis that ShallowHRD captures key features of HRD-associated genomic instability. While correlation does not imply equivalence or classification agreement, these results suggest that ShallowHRD and HRDetect scores track together in magnitude, which may be useful for further validation and calibration efforts.

In all three data groups, the HRD-positive Sample 11 was the only case misclassified as HRD-negative by ShallowHRD. This sample carried a germline VUS in RAD51D, alongside a somatic BRCA1 variant considered likely benign²². Structural variants have been well-characterized as a result of HRD and the subsequent genomic instability mediated by BRCA1 and BRCA2 dysfunctionality^35,36,37,38, and as a part of the HR pathway, RAD51D dysfunction would also lead to HRD and higher levels of genomic instability^39,40.

The identified RAD51D variant is currently classified as a VUS, even though minigene RNA data suggest that the variant affects splicing⁴¹. However, the findings from the ShallowHRD analysis could indicate a tendency towards a likely benign classification, thereby going against the findings of the study using HRDetect²². A potential reason for this disagreement between ShallowHRD and HRDetect could be a less structurally disruptive subtype of HRD resulting from the RAD51D variant.

Therefore, further studies into the specific RAD51D VUS as well as known RAD51D pathogenic and benign variants are warranted to clarify what causes the observed discrepancy between ShallowHRD and HRDetect. Understanding whether deleterious RAD51D variants lead to the same level of structural variants, such as the LGAs, as pathogenic BRCA1 and BRCA2 variants would help evaluate ShallowHRD’s performance. Such analyses could potentially indicate if gene-specific cut-offs are required. If RAD51D variants produce a different pattern of structural variants, this could explain why ShallowHRD fails to assess this sample accurately when compared to HRDetect’s more exhaustive analysis of HRD signatures.

This discrepancy also raises a broader question about how the performance characteristics of HRD classifiers, particularly their sensitivity and specificity, affect their clinical utility. HRD-status is useful for several clinical aspects, most notably as a predictive marker for response to PARP inhibitors, and potentially as supportive evidence in the future classification of VUS. Depending on the intended use, a higher sensitivity or specificity of a classifier would be favored. As a predictor of treatment response, a high level of sensitivity is needed to minimize the number of false negatives to avoid missing patients who would have benefited from the treatment. Conversely, for VUS classification, high specificity is presumed to be more important. If a VUS is present in an HR gene, a truly pathogenic VUS would be expected to lead to HRD and increased genomic instability. However, studies have shown that HRD is present in tumors without pathogenic variants^42,43. Therefore, the absence of HRD in a tumor carrying a VUS may support a benign classification, but only if the test has high specificity and reliably identifies true negatives. Ultimately, both sensitivity and specificity matter in each context, and the clinical utility of HRD testing depends on balancing these priorities. For example, overly low specificity in treatment prediction may lead to overtreatment, while overly low sensitivity in VUS classification could falsely support benign interpretation of truly pathogenic variants. These considerations underscore the importance of evaluating not only biological relevance but also methodological robustness in HRD assessment. As of publishing this paper, it is unclear which HRD algorithm is optimal for VUS classification and what strength HRD-status can provide for classification. However, the ENIGMA⁴⁴ consortium is currently conducting studies to define classification strength of HRD-status for both benign and pathogenic classification purposes.

In our study, ShallowHRD achieved a sensitivity of 80% and a specificity ranging from 85.7 to 100% across the three datasets. While the sensitivity may limit its immediate clinical utility as a standalone predictor of treatment response, the high specificity, particularly in the FFPE samples, supports its potential value in research settings, including as supportive evidence in VUS classification. However, these findings need further verification in larger cohorts.

Further limitations of our study should also be accounted for. First, the sample size is small (n = 19), limiting our findings’ statistical power and generalizability. The retrospective nature of the study and reliance on available matched FFPE and frozen tissue samples introduce potential selection bias, as tissue quality and availability may not reflect routine clinical populations. While bootstrapping was applied to explore robustness, no formal internal or external validation of ShallowHRD performance was conducted, and the risk of overfitting and type I or II statistical errors cannot be excluded.

Future work should aim to validate these promising preliminary findings in larger, prospectively collected cohorts with more BRCA1/2-deficient tumors, ideally across multiple institutions. Using additional independent methods to assess HRD, such as functional assays or clinical treatment response, could also be valuable in evaluating the clinical relevance. Ultimately, such studies are needed to determine if ShallowHRD can serve as a reliable, cost-effective diagnostic tool for HRD detection in routine clinical practice for breast cancer, as this preliminary study suggests.

Conclusions

The performance of ShallowHRD is, in this preliminary study, comparable to that of HRDetect, making it a potentially quicker and more economical alternative for assessing HRD-status due to lower sequencing depth of malignant tumor tissue and its ability to perform without paired sequencing of normal tissue. Furthermore, the ability of ShallowHRD to work on data from FFPE tissue makes it usable for HRD-status assessment for most patients with breast cancer. ShallowHRD differed from HRDetect in the assessment of one sample. Additional research into the cause of this difference is required, and to draw definitive conclusions on ShallowHRD sensitivity, examination and validation in larger, external cohorts are required.

In conclusion, this study indicates that using ShallowHRD to assess HRD-status of malignant breast tumors has substantial potential in a clinical setting.

Data availability

The output of ShallowHRD analyzed during the study is publicly available in the figshare repository, at https://doi.org/10.6084/m9.figshare.28323362. The WGS datasets generated and/or analyzed during the study can be made available by contacting the corresponding authors to receive permissions and sign data processing agreements.

References

Krejčí, L., Altmannová, V., Špírek, M. & Zhao, X. Homologous recombination and its regulation. Nucleic Acids Res. 40, 5795–5818. https://doi.org/10.1093/nar/gks270 (2012).
Zhao, W., Steinfeld, J. B., Liang, F., Chen, X., Maranon, D. G., Jian, C. et al. BRCA1–BARD1 promotes RAD51-mediated homologous DNA pairing. Nature 550, 360–365. https://doi.org/10.1038/nature24060 (2017).
He, L., Gao, F., Zhu, J., Xu, Q., Yu, Q., Yang, M. et al. Homologous recombination deficiency serves as a prognostic biomarker in clear cell renal cell carcinoma. Exp. Ther. Med. 26. https://doi.org/10.3892/etm.2023.12128 (2023).
Wagener-Ryczek, S., Merkelbach‐Bruse, S. & Siemanowski, J. Biomarkers for homologous recombination deficiency in cancer. J. Pers. Med. 11, 612. https://doi.org/10.3390/jpm11070612 (2021).
Bergstrom, E. N., Abbasi, A., Díaz-Gay, M., Galland, L., Lippman, S. M., Ladoire, S. et al. Deep learning predicts HRD and platinum response from histology slides in breast and ovarian cancer. medRxiv https://doi.org/10.1101/2023.02.23.23285869 (2023).
Bhattacharyya, A., Ear, U., Koller, B. H., Weichselbaum, R. R. & Bishop, D. K. The breast cancer susceptibility gene BRCA1 is required for subnuclear assembly of Rad51 and survival following treatment with the DNA cross-linking agent cisplatin. J. Biol. Chem. 275, 23899–23903. https://doi.org/10.1074/jbc.c000276200 (2000).
Valenza, C., Trapani, D., Gandini, S., Sposetti, C., Boscolo Bielo, L., Marra, A. et al. Platinum-based chemotherapy and PARP inhibitors for patients with a germline BRCA pathogenic variant and advanced breast cancer (LATER-BC): retrospective multicentric analysis of post-progression treatments. Eur. J. Cancer. 190, 112944. https://doi.org/10.1016/j.ejca.2023.112944 (2023).
Zhao, E. Y., Shen, Y., Pleasance, E., Kasaian, K., Leelakumari, S., Jones, M. et al. Homologous recombination deficiency and platinum-based therapy outcomes in advanced breast cancer. Clin. Cancer Res. 23, 7521–7530. https://doi.org/10.1158/1078-0432.CCR-17-1941 (2017).
Cortesi, L., Rugo, H. S. & Jackisch, C. An overview of PARP inhibitors for the treatment of breast cancer. Target. Oncol. 16, 255–282. https://doi.org/10.1007/s11523-021-00796-4 (2021).
Cunningham, J. M., Cicek, M. S., Larson, N. B., Davila, J., Wang, C., Larson, M. C. et al. Clinical characteristics of ovarian cancer classified by BRCA1, BRCA2 and RAD51C status. Sci. Rep. 4. https://doi.org/10.1038/srep04026 (2014).
Guo, Y., He, X., Tan, Y., Liu, J., Chen, H., Huang, Y. et al. ShallowHRD status acts as an effective prognostic predictor in ovarian cancer patients treated by poly (ADP-ribose) polymerase inhibitors (PARPis). J. Cancer Res. Clin. Oncol. 149, 15839–15844. https://doi.org/10.1007/s00432-023-05341-6 (2023).
Yndestad, S., Engebrethsen, C., Herencia-Ropero, A., Nikolaienko, O., Vintermyr, O. K., Lillestøl, R. et al. Homologous recombination deficiency across subtypes of primary breast cancer. JCO Precis. Oncol. https://doi.org/10.1200/po.23.00338 (2023).
Manié, É., Popova, T., Battistella, A., Tarabeux, J., Caux‐Moncoutier, V., Golmard, L. et al. Genomic hallmarks of homologous recombination deficiency in invasive breast carcinomas. Int. J. Cancer. 138, 891–900. https://doi.org/10.1002/ijc.29829 (2015).
Mark, L. R., Terp, S. K., Krarup, H. B., Thomassen, M., Pedersen, I. S. & Bøgsted, M. Homologous recombination deficiency detection algorithms: a systematic review. Cancers (Basel). 15, 5633. https://doi.org/10.3390/cancers15235633 (2023).
Davies, H., Glodzik, D., Morganella, S., Yates, L. R., Staaf, J., Zou, X. et al. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nat. Med. 23, 517–525. https://doi.org/10.1038/nm.4292 (2017).
Eeckhoutte, A., Houy, A., Manié, E., Reverdy, M., Bièche, I., Marangoni, E. et al. ShallowHRD: detection of homologous recombination deficiency from shallow whole genome sequencing. Bioinformatics 36, 3888–3889. https://doi.org/10.1093/bioinformatics/btaa261 (2020).
Sztupinszki, Z., Dióssy, M., Börcsök, J., Prósz, A., Cornelius, N., Kjeldsen, M. K. et al. Comparative assessment of diagnostic homologous recombination deficiency–associated mutational signatures in ovarian cancer. Clin. Cancer Res. 27, 5681–5687. https://doi.org/10.1158/1078-0432.ccr-21-0981 (2021).
Golan, T., O’Kane, G. M., Denroche, R. E., Raitses-Gurevich, M., Grant, R. C., Holter, S. et al. Genomic features and classification of homologous recombination deficient pancreatic ductal adenocarcinoma. Gastroenterology 160, 2119–2132.e9. https://doi.org/10.1053/j.gastro.2021.01.220 (2021).
Brown, T. J. & Reiss, K. A. PARP inhibitors in pancreatic cancer. Cancer J. 27, 465–475. https://doi.org/10.1097/ppo.0000000000000554 (2021).
Staaf, J., Glodzik, D., Bosch, A., Vallon-Christersson, J., Reuterswärd, C., Häkkinen, J. et al. Whole-genome sequencing of triple-negative breast cancers in a population-based clinical study. Nat. Med. 25, 1526–1533. https://doi.org/10.1038/s41591-019-0582-4 (2019).
Wattenberg, M. M. & Reiss, K. A. Determinants of homologous recombination deficiency in pancreatic cancer. Cancers (Basel). 13, 4716. https://doi.org/10.3390/cancers13184716 (2021).
Andersen, L. V. B., Larsen, M. J., Davies, H., Degasperi, A., Nielsen, H. R., Jensen, L. A. et al. Non-BRCA1/BRCA2 high-risk familial breast cancers are not associated with a high prevalence of BRCAness. Breast Cancer Res. 25, 69. https://doi.org/10.1186/s13058-023-01655-y (2023).
Thomassen, M. Created in BioRender. (2024). https://biorender.com/a66p512
Larsen, M. J., Thomassen, M., Tan, Q., Lænkholm, A.-V., Bak, M., Sørensen, K. P. et al. RNA profiling reveals familial aggregation of molecular subtypes in non-BRCA1/2 breast cancer families. BMC Med. Genomics. 7, 9. https://doi.org/10.1186/1755-8794-7-9 (2014).
Parsons, M. T., de la Hoya, M., Richardson, M. E., Tudini, E., Anderson, M., Berkofsky-Fessler, W. et al. Evidence-based recommendations for gene-specific ACMG/AMP variant classification from the ClinGen ENIGMA BRCA1 and BRCA2 variant curation expert panel. Am. J. Hum. Genet. 111, 2044–2058. https://doi.org/10.1016/j.ajhg.2024.07.013 (2024).
Li, H. seqtk. GitHub (2023). https://github.com/lh3/seqtk (accessed July 16, 2024).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760. https://doi.org/10.1093/bioinformatics/btp324 (2009).
Eeckhoutte, A. ShallowHRD. GitHub repository (2024). https://github.com/aeeckhou/shallowHRD
Scheinin, I., Sie, D., Bengtsson, H., van de Wiel, M. A., Olshen, A. B., van Thuijl, H. F. et al. DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly. Genome Res. 24, 2022–2032. https://doi.org/10.1101/gr.175141.114 (2014).
Diossy, M., Reiniger, L., Sztupinszki, Z., Krzystanek, M., Timms, K. M., Neff, C. et al. Breast cancer brain metastases show increased levels of genomic aberration-based homologous recombination deficiency scores relative to their corresponding primary tumors. Ann. Oncol. 29, 1948–1954. https://doi.org/10.1093/annonc/mdy216 (2018).
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D. et al. SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods. 17, 261–272. https://doi.org/10.1038/s41592-019-0686-2 (2020).
Adel, L. AUC_Boot-noBoot. GitHub repository (2025). https://github.com/LouiseAdel/AUC_Boot-noBoot (accessed April 28, 2025).
stats (version 3.6.2). cor.test. RDocumentation (n.d.). https://www.rdocumentation.org/packages/stats/versions/3.6.2/topics/cor.test (accessed June 20, 2025).
Lord, C. J. & Ashworth, A. BRCAness revisited. Nat. Rev. Cancer. 16, 110–120. https://doi.org/10.1038/nrc.2015.21 (2016).
Nik-Zainal, S., Davies, H., Staaf, J., Ramakrishna, M., Glodzik, D., Zou, X. et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 534, 47–54. https://doi.org/10.1038/nature17676 (2016).
Roy, R., Chun, J. & Powell, S. N. BRCA1 and BRCA2: Different roles in a common pathway of genome protection. Nat. Rev. Cancer. 12, 68–78. https://doi.org/10.1038/nrc3181 (2011).
Sigorski, D., Iżycka-Świeszewska, E. & Bodnar, L. Poly(ADP-Ribose) polymerase inhibitors in prostate cancer: Molecular mechanisms, and preclinical and clinical data. Target. Oncol. 15, 709–722. https://doi.org/10.1007/s11523-020-00756-4 (2020).
Torres-Esquius, S., Llop-Guevara, A., Gutiérrez-Enríquez, S., Romey, M., Teulé, À., Llort, G. et al. Prevalence of homologous recombination deficiency among patients with germline RAD51C/D breast or ovarian cancer. JAMA Netw. Open. 7, e247811. https://doi.org/10.1001/jamanetworkopen.2024.7811 (2024).
Reh, W. A., Nairn, R. S., Lowery, M. P. & Vásquez, K. M. The homologous recombination protein RAD51D protects the genome from large deletions. Nucleic Acids Res. 45, 1835–1847. https://doi.org/10.1093/nar/gkw1204 (2016).
Bueno-Martínez, E., Sanoguera-Miralles, L., Valenzuela-Palomo, A., Lorca, V., Gómez-Sanz, A., Carvalho, S. et al. RAD51D aberrant splicing in breast cancer: Identification of splicing regulatory elements and minigene-based evaluation of 53 DNA variants. Cancers (Basel). 13, 2845. https://doi.org/10.3390/cancers13112845 (2021).
Castroviejo-Bermejo, M., Cruz, C., Llop-Guevara, A., Gutiérrez-Enríquez, S., Ducy, M., Ibrahim, Y. H. et al. A RAD51 assay feasible in routine tumor samples calls PARP inhibitor response beyond BRCA mutation. EMBO Mol. Med. 10, e9172. https://doi.org/10.15252/emmm.201809172 (2018).
Ladan, M. M. & van Gent, D. C. Homologous recombination deficiency testing for BRCA-Like tumors: The road to clinical validation. Cancers (Basel). 13, 1004. https://doi.org/10.3390/cancers13051004 (2021).
Spurdle, A. B., Healey, S., Devereau, A., Hogervorst, F. B. L., Monteiro, A. N. A., Nathanson, K. L. et al. ENIGMA-evidence-based network for the interpretation of germline mutant alleles: An international initiative to evaluate risk and clinical significance associated with sequence variation in BRCA1 and BRCA2 genes. Hum. Mutat. 33, 2–7. https://doi.org/10.1002/humu.21628 (2012).


Acknowledgements

We sincerely thank our laboratory technicians for their assistance and the patients for their willingness to participate in this study.

Funding

The Region of Southern Denmark’s Highly Specialized Functions and Strategic Initiatives at Odense University Hospital Foundation (R129-A5358) supports the project. LAJ is supported by the Danish Cancer Society (Case no. R231-A13765).

Author information

Authors and Affiliations

Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
Louise Adel Jensen, Caroline Hey Baekgaard, Mie Bohnensack Larsen, Susanne Eriksen Boonen, Zainab Hikmat, Qin Hao, Martin Jakob Larsen & Mads Thomassen
Clinical Genome Center, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
Louise Adel Jensen & Mads Thomassen
Department of Clinical Research, University of Southern Denmark, Odense, Denmark
Caroline Hey Baekgaard, Susanne Eriksen Boonen & Martin Jakob Larsen
Department of Pathology, Odense University Hospital, Odense, Denmark
Anne Marie Bak Jylling
Research Unit of Pathology, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
Anne Marie Bak Jylling
Research Unit OPEN, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
Zainab Hikmat
Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
Thomas van Overeem Hansen
Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
Thomas van Overeem Hansen
Department of Molecular Diagnostics and Clinical Cancer Research Center, Aalborg University Hospital, Aalborg, Denmark
Inge Søkilde Pedersen
Department of Clinical Medicine, Aalborg University, Aalborg, Denmark
Inge Søkilde Pedersen

Authors

Louise Adel Jensen
Caroline Hey Baekgaard
Mie Bohnensack Larsen
Susanne Eriksen Boonen
Anne Marie Bak Jylling
Zainab Hikmat
Qin Hao
Thomas van Overeem Hansen
Inge Søkilde Pedersen
Martin Jakob Larsen
Mads Thomassen

Contributions

MT, QH, MBL, MJL, and CHB designed the study with input from SEB, ZH, and LAJ. Laboratory work was conducted in collaboration with AMBJ and planned by MBL and CHB. Data analysis was performed by CHB and LAJ with essential guidance and support from MJL and QH. CHB and LAJ performed the main data interpretation with support from ISP, TvOH, ZH and SEB. LAJ wrote the manuscript and prepared figures. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Louise Adel Jensen or Mads Thomassen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Institutional review board

The study was conducted according to the guidelines of the Declaration of Helsinki, and the Danish Ethical Committee System approved the study (The Regional Scientific Ethical Committees for Southern Denmark, case no. S-20220080, 11th of May 2023).

Informed consent

Informed consent was obtained from all subjects involved in the study using HRDetect. Patient consent regarding shallow sequencing and ShallowHRD was waived because this part of the work is considered a quality assessment and validation study, which, according to the approval from the Regional Scientific Ethical Committees for Southern Denmark, is exempt from the requirement to inform the patients.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (DOCX)

Supplementary Material 2 (DOCX)

Supplementary Material 3 (XLSX)

Supplementary Material 4 (XLSX)

Supplementary Material 5 (XLSX)

Supplementary Material 6 (XLSX)

Supplementary Material 7 (XLSX)

Supplementary Material 8 (XLSX)

Supplementary Material 9 (XLSX)

Supplementary Material 10 (DOCX)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.


About this article

Cite this article

Adel Jensen, L., Baekgaard, C.H., Larsen, M.B. et al. Preliminary evaluation of ShallowHRD performance compared to HRDetect in familial breast cancer tumors. Sci Rep 15, 29442 (2025). https://doi.org/10.1038/s41598-025-14122-9


Received: 17 January 2025
Accepted: 29 July 2025
Published: 11 August 2025
Version of record: 11 August 2025
DOI: https://doi.org/10.1038/s41598-025-14122-9