Abstract
Multigene panel tests (MGPTs) revolutionized the diagnosis of Lynch syndrome (LS), however noncoding pathogenic variants (PVs) can only be detected by complementary methods including whole genome sequencing (WGS). Here we present a DNA-, RNA- and tumor tissue-based WGS prioritization workflow for patients with a suspicion of LS where MGPT detected no LS-related PV. Among the 100 enrolled patients, MGPT detected 28 simple PVs and an additional 3 complex PVs. Among the 69 MGPT-negative patients, the lack of somatic MLH1 promoter methylation in a patient with a distinguished MLH1 allelic imbalance selected this sample for WGS. This returned a germline deep intronic MLH1 variant, with further functional studies confirming its’ pathogenicity. Interestingly, all three complex PVs and the MLH1 deep intronic PV were found to be recurrent at our center. Our straightforward and cost-effective prioritization workflow can optimally include WGS in the genetic diagnosis of LS.
Similar content being viewed by others
Background
Lynch syndrome (LS) originally described by Warthin1 is a frequent cancer predisposition condition resulting from germline pathogenic variants in MLH1, MSH2, MSH6, PMS2 and EPCAM genes affecting DNA mismatch repair. The growing availability of next-generation sequencing-based multigene panel testing (MGPT) revolutionized the genetic diagnosis of families with LS. In addition to single nucleotide variants (SNV), simple copy number variations including larger deletions and duplications (CNVs) and complex structural variations (SVs) might also serve as pathogenic variants (PVs) resulting in LS. In the case of MSH2 and PMS2, 20–25% of all pathogenic variants are CNVs and SVs, while these are less frequent in MLH1 ( ~10%) and MSH6 ( ~3%)2. Nevertheless, our research group has previously shown that germline 3’ deletions of EPCAM (formerly known as TACSTD1) are additional PVs in LS3.
Although MGPT has a high yield in detecting SNVs and CNVs, the validation of complex SVs is more challenging and often requires the multimodal application of several molecular genetic techniques and bioinformatic pipelines. The precise characterization of complex variants found in LS is necessary to enhance adequate nomenclature and to determine transcriptional relevance and pathogenicity. The confirmation of altered splicing or premature stop codon is necessary for optimal clinical interpretation of such variants as PVs.
Nevertheless, MGPTs cannot identify PVs located outside of the targeted exons, which usually account for the missing heritability in LS4. The detection of these variants requires the systematic analysis of the non-coding genome by targeted deep-intronic and long-read sequencing or whole genome sequencing (WGS) as confirmed in case series of families with high cancer predisposition, which were collected throughout variable time frames4,5,6. These studies clearly showed the robustness of the whole-gene/whole-genome-based approaches, however, a clear workflow regarding the precise role of these tools in the clinical genetic diagnostic setting is lacking.
Here, we present a comprehensive molecular genetic workflow, prospectively applying germline DNA, RNA and tumor tissue-based techniques for the diagnosis of LS. In a pilot study of 100 consecutive patients with colorectal or endometrial cancers fulfilling the testing criteria, MGPT detected 31 PVs, among which 3 were complex PVs, which were subjected to further characterization. In cases where MGPT failed to detect a PV, tumor tissue-based methylation assay targeting the MLH1 promoter as well as RNA-based analysis of allelic imbalance were conducted to select high-risk samples for WGS. In the sole sample from this cohort selected for further testing, WGS detected a deep intronic PV in MLH1 causing exonization. Interestingly, this novel variant and all of the detected complex PVs were found to be recurrent in apparently independent families diagnosed at our center, highlighting their relative importance in this population. Our results present a straightforward clinical laboratory genetic workflow leveraging multiple assays for the optimal diagnosis of LS.
Methods
Patients and samples
One hundred individuals from 99 independent families diagnosed with colorectal (CRC) or endometrial (EC) cancers and fulfilling testing criteria for LS were consecutively enrolled at the Department of Molecular Genetics, National Institute of Oncology between September 2021 and March 2023. Testing criteria adhered to valid National Comprehensive Cancer Network (NCCN) and Hungarian national guidelines and the modified Bethesda criteria7,8. Patients with mismatch-repair deficient/microsatellite instable cancers or conforming the modified Bethesda criteria were included7. Patients underwent genetic counseling and provided written informed consent. DNA was isolated from peripheral blood using Gentra DNA Blood Extraction Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The study was approved by the Scientific and Research Ethics Committee of the Medical Research Council of Hungary (ETT-TUKEB 53720-7/2019/EÜIG) and complied with national and international regulations including the Declaration of Helsinki. Genetic counseling was provided to all participants. All participants provided written informed consent. Family trees were edited using f-tree software (v4.0.3, https://www.holonic-systems.com/f-tree/en/).
Immunohistochemical evaluation of tumors
Immunohistochemical analyses were performed at the Department of Surgical and Molecular Pathology of the National Institute of Oncology. Briefly, 2-3 µm-thick formalin-fixed paraffin-embedded tissue sections were transferred to a 50 °C water bath (Tissue Flotation Bath TFB 45, Medite) and mounted on a glass slide, which underwent incubation for 1 hour in a drying oven at a temperature of 55 °C and were stored at 4 °C until staining. The BenchMark ULTRA system (Roche Tissue Diagnostics) was applied for fully automated immunohistochemical staining. In the case of MLH1, MSH2 and MSH6 ultraView DAB (Ventana Medical Systems, Cat. No.: 760-500, v.1.02.0018) method was used. After slide heating and deparaffinization at 72 °C for 4 minutes, cell conditioning was performed with ULTRA Cell Conditioning Solution (ULTRA CC1, Ventana Medical Systems, Cat. No.: 950-224) for 3 hours at 95 °C. Slides were incubated with one drop of anti-MLH1 (M1) mouse monoclonal antibody (1 μg/mL, Ventana Medial Systems, Cat. No.: 790-5091), anti-MSH2 (G219-1129) mouse monoclonal antibody (20 μg/mL, Ventana Medial Systems, Cat. No.: 790-5093) or anti-MSH6 (SP93) rabbit monoclonal antibody (1 μg/mL, Ventana Medial Systems, Cat. No.: 790-5092) with cover oil for 16 minutes. For background staining one drop of Hematoxylin II (Ventana Medial Systems, Cat. No.: 790-2208) with cover oil was applied for 12 minutes. One drop post-background stain using bluing reagent (Ventana Medial Systems, Cat. No.: 760-2037) with cover oil was applied for 12 minutes.
In the case of PMS2 U OptiView DAB IHC (Ventana Medical Systems, Cat. No.: 760-700, v.1.00.01.36) method was used. After slide heating and deparaffinization at 72 °C for 4 minutes, cell conditioning was performed with ULTRA CC1 for 580 minutes at 100 °C. After peroxidase inhibition with a 3.0% hydrogen peroxide solution, slides were incubated with one drop of anti-PMS2 (A16-4) mouse monoclonal antibody (1 μg/mL, Ventana Medial Systems, Cat. No.: 790-5094) for 32 min. One drop 0.04% hydrogen peroxide in phosphate buffer solution OptiView H2O2 was used for 4 minutes. OptiView HRP Multimer containing a 40 μg/mL monoclonal anti-HQ-labeled HRP tertiary mouse antibody in a buffer containing protein and 0.05% ProClin 300 preservative with cover oil was incubated for 4 minutes. For background staining one drop of Hematoxylin II with cover oil was applied for 12 min. One drop post-background stain of bluing reagent with cover oil was applied for 12 minutes.
Multigene panel testing (MGPT) and variant interpretation
Next-generation sequencing library preparation has been performed using the TruSight Hereditary Cancer Panel and sequencing was run on a MiSeq or NextSeq550DX instruments with appropriate reagent kits (Illumina Inc, San Diego, CA, USA). Genomic alignment and variant calling including SNVs, CNVs and SVs were performed in the DRAGEN Enrichment (version 4.2.4, Illumina Inc, San Diego, CA, USA). Variants correspond to the MANE Select transcript (version 1.3) of each gene (MLH1: NM_000249.4; MSH2: NM_000251.3; MSH6: NM_000179.3; PMS2: NM_000535.7; EPCAM: NM_002354.3). Clinical variant interpretation was performed according to the ACMG guidelines9,10 and variants were cross-checked in the InSight (http://insight-database.org/) and ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) databases (finalized October 31, 2023). All pathogenic and likely pathogenic variants were validated from an independent blood sample using Sanger sequencing (SNVs) or multiplex ligation-dependent probe amplification (MLPA in the case of CNVs). In the case of validating PMS2 SNVs, long-range PCR was used to specifically exclude amplification from the pseudogene.
Somatic MLH1 promoter methylation analysis
DNA was isolated from formalin-fixed paraffin-embedded CRC and EC tissues at the Department of Surgical and Molecular Pathology of the National Institute of Oncology by the Maxwell® RSC DNA FFPE Kit (Promega, Cat. No.: ASB1450), following the manufacturer’s recommendations. DNA samples were bisulfite converted using the EZ DNA Methylation‐Lightning kit (Zymo Research, Cat. No.: D5031) according to the manufacturer’s instructions. The MLH1 promoter region was amplified by touchdown PCR reaction (14 cycles with gradually decreasing annealing temperatures from 62°C to 55°C, followed by 26 cycles with annealing temperature of 55 °C11, Table S1A). Sanger sequencing was performed on an Applied Biosystems 3500 Genetic Analyzer (ThermoFisher Scentific, MA, USA).
Peripheral blood mononuclear cell isolation
Peripheral blood mononuclear cells (PBMCs) were isolated by density gradient centrifugation, as earlier reported12. Briefly, 9 ml whole blood was layered on 5 ml Pancoll solution (PanBiotech, Cat. No.: P04-601000) and was centrifuged (800 g, 20 min). PBMCs were washed twice with PBS (Corning, Cat. No.: 21-040-CV) and cryopreserved in a 90% FBS-10% DMSO solution (PanBiotech, Cat. No.: P60-36720100) in liquid nitrogen.
RNA isolation and cDNA synthesis
RNA was isolated (Qiagen, Cat. No.: 217004) from PBMC samples, according to the manufacturer’s instruction after centrifugation with QIAshredder (Qiagen, Cat. No.: 79654). Following RNA extraction, concentration was determined by Nanodrop (Cat.No.: ND-1000). Reverse transcription was performed by the application of ProtoScript® II First Strand cDNA Synthesis Kit (New England Biolabs, Ipswich, MA, USA).
Analysis of allelic imbalance
In patients selected for allelic imbalance analysis, RT-PCR amplification was performed on the cDNA and positions of germline transcribed heterozygote variants were assessed using Sanger sequencing of both gDNA and cDNA (Table S1B). The presence of a more than 50% decrease in electrophoretic peak intensity of either allele in comparison to the gDNA-based result was considered an allelic imbalance.
Nonsense-mediated mRNA decay (NMD) inhibition
NMD inhibition has been performed as described previously, with minor modifications13,14. Briefly, cells were thawed rapidly in a 37 °C water bath, centrifuged for 10 min at 300 g and underwent incubation for 120 hours in 10 ml PB-MAX Karyotyping Medium (Gibco, Cat. No.: 12557-021) at 37 °C in 5% CO2 atmosphere. Thereafter, cells were treated with 200 μL 10 mM (end concentration 200 μM) puromycin (Merck, Cat. No.: P4512) for 6 h at 37 °C to achieve translation inhibition. The same volume of vehicle (distilled water) was added to the negative controls. Following puromycin treatment, cells were harvested, centrifuged at 250 g for 5 min, resuspended in QIAzol (Qiagen) and stored at -80 °C until RNA isolation.
Whole genome sequencing
Whole genome libraries were prepared using the Illumina DNA PCR-Free Prep, Tagmentation kit (Illumina, USA), with a standard input of 400 ng of gDNA as described in the manufacturer’s guide. To remove free adapters from the final libraries, a cleaning step with AMPure XP magnetic beads (Beckman Coulter, USA) at a ratio of 1:1 was added. The concentration of WGS libraries was measured using the Qubit ssDNA Kit (Thermo Fisher Scientific, USA). The quality of next-generation sequencing (NGS) libraries was assessed using automated electrophoresis on a 2100 Bioanalyzer System (Agilent, USA). For normalization of prepared NGS to 2 nM, a library length of 450 bp was used. The generated libraries were further diluted and denatured according to the NovaSeq 6000 Denature and Dilute Libraries Guide (Illumina, USA), and sequenced using the S4 Reagent Kit v1.5 (300 cycles) (Illumina, USA) with the Standard Loading workflow. A sequencing run with a 2 × 151 cycles configuration was performed on the NovaSeq 6000 instrument (Illumina, USA).
Sequencing files in BCL format were converted into demultiplexed FASTQ-format files using bcl2fastq v2.20.0.422 with default settings and one barcode mismatch allowed. Quality control of the FASTQ files was done in FastQC. FASTQ files were further processed in the DRAGEN environment (Version 4.2.4). Illumina DRAGEN Multigenome Graph Reference hg38 (alt-masked_graph+cnv+hla+rna_v3) was used as the reference genome. VCF files were generated for SNV, CNV and SV and were filtered using Basespace Variant Interpreter (v2.17.0.60) with >10 Total Read Depth, >5 Alt Allele Depth, >0.2 variant read frequency. VCF sequence variation descriptions were converted into the HGVS format with Variant Validator (https://www.variantvalidator.org/service/validate/batch/). Variant call annotation was performed using the UCSC Variant Annotation Integrator (https://genome.ucsc.edu/cgi-bin/hgVai) using the MANE transcript and all available annotations. Additional variant annotation including splice effect predictions were performed in Ensembl Variant Effect Predictor (https://www.ensembl.org/Homo_sapiens/Tools/VEP/), where masked and raw SpliceAI, MaxEntScan, dbscSNV ADA and RF scores were applied. Further splice annotation was performed with CI-spliceAI (https://ci-spliceai.com/). Population-level frequencies of analyzed variants were queried from the Genome Aggregation Database (gnomAD Genomes 3.1.2 v2).
Molecular genetic characterization of complex PVs
Complex SVs underwent a detailed characterization including the application of MLPA and cDNA-based assays to determine the genomic architecture and transcriptional relevance, respectively. In the case of the MLH1 SV detected in patient #10, cDNA was amplified MLH1_C6-F and MLH1_C7-R (Table S1B) and the SV-specific PCR detecting the breakpoint was performed with primer pairs MLH1_int18_tp_AS and MLH1_int18_tp_S (Table S1C). Regarding the cDNA analysis of the MSH2 SV detected in patient #97, primer pairs MSH2_C7-F and MSH2_C7-R were used (Tables S1B and S1D).
Allelic imbalance regarding the MLH1 c.93 G > A variant in patient #4 was analyzed using cDNA primers MLH1_promoter_cDNS_F and MLH1_promoter_cDNS_R (Table S1E). MLH1 c.306+1222 A > G variant was confirmed from gDNA using primers MLH1_int03_F and MLH1_int03_R (Table S1E). cDNA-based functional characterization of the MLH1 c.306+1222 A > G variant in patient #4 was performed by primer pairs MLH1_C1-F and MLH1_C2-R, MLH1_c.-93G_allele_spec_F and MLH1_C1-R, MLH1_promoter_cDNS_F and MLH1_ex04-ex03_R, respectively (Tables S1B and S1E).
Results
Establishment of a multimodal molecular genetic diagnostic workflow in LS
The current NCCN guideline for LS testing recommends performing MGPT if testing criteria are met8. In cases where MGPT failed to detect a PV, a systematic screening is needed to prioritize samples for WGS. Availability of information regarding familial cancer background is highly variable and is irrelevant in the case of de novo PVs, therefore we opted for a prospective analysis of tumor tissue- and RNA-based methods (Fig. 1). If the reduced expression of MLH1 was confirmed in the pathological report, but the MLH1 promoter was confirmed to be unmethylated in the tumor tissue our workflow recommended performing WGS. Otherwise, if an allelic imbalance of a germline heterozygote marker is confirmed in a gene with corresponding decreased protein expression in the tumor our workflow also advises performing WGS. This algorithm provides a fast, easy and cost-effective prioritization of samples for WGS.
MGPT identifies complex PVs in LS genes
Thirty-one out of 100 investigated patients carried germline pathogenic or likely pathogenic variants in MLH1 (n = 8), MSH2 (n = 13), MSH6 (n = 5), PMS2 (n = 2) or EPCAM (n = 3) associated with LS (Fig. 1 and Table 1). Four out of these variants are novel SNVs (in patients #7 and #72) or SVs (in patients #10 and #97) with no previous records in the InSight or ClinVar databases.
In patient #10, we identified an SV affecting MLH1. The proband developed colorectal cancer (CRC) at the age of 22 years, with familial predisposition and molecular pathological analysis (decreased MLH1 and PMS2 protein expressions in both the proband and her mother’s CRC) both indicative of LS (Table 1 and Fig. 2A). Primary bioinformatic analysis performed on the DRAGEN SV pipeline signaled a potential complex alteration (Table S2) in MLH1. Both SV call and visual inspection of the BAM file reads suggested a 1563 base pair (bp) deletion (Fig. 2B). MLPA analysis (probe set: P003-D1 MLH1/MSH2) revealed a deletion of exon 18 and a duplication of exon 19 (Fig. 2C, Figure S1). Confirmatory MLPA analysis with a different MLPA probe set (probe set P248-B2 MLH1-MSH2 Confirmation) only corroborated the duplication of exon 19 (Fig. 2C, Figure S1). To further characterize the transcriptional consequence of this variant, RNA-based analyses have been performed. Two additional, shorter transcripts were detected on the cDNA level (Fig. 2D). Sanger sequencing of these amplicons revealed one of them to include the fusion exon between exons 18 and 19, which was imputed from the deletion observed in the NGS data, however, the other altered transcript lacked the whole exon 18, while exon 19 was present in its entirety (Fig. 2E). Based on these observations we hypothesized that the complex alteration consisted of a tandem duplication of a larger region stretching from the 3’ part of exon 18 and the neighboring regions containing the whole exon 19 and a partial deletion of the 5’ copy of this duplication including the 3’ part of exon 18 and the 5’ part of exon 19 (Fig. 2F). To confirm this hypothesis, a breakpoint-specific PCR has been designed with the forward primer binding to exon 19 and the reverse primer binding the interconnecting intron of exons 18 and 19 (Fig. 2F-G). Successful and specific amplification of the amplicon including the breakpoint in the proband and her relatives carrying this SV not only confirmed this hypothesis but also allowed the precise nomenclature of the variant as MLH1 (NM_000249.4) c.[2078_2172del; 2080_*+493dup] (Fig. 2H).
Pedigree of the proband (A). Visual representation of detected reads of exons 18 and 19 in the BAM file (B). MLPA analyses with default (P003-D1) and confirmation (P248-B2) probe sets, extended data can be found in Figure S1 (C). Electrophoretic separation of MLH1 cDNA amplicons on an Agilent Bioanalyzer DNA1000 chip (D). Arrows point to two additional cDNA transcripts. Note: Uncropped image is demonstrated in Figure S2. Visual representation of the architecture of the two additional cDNA transcripts detected with Sanger sequencing (E). Genomic architecture of the detected SV compared to the reference gDNA (F). Breakpoint-specific PCR primers are visualized as red arrows binding at sequence complementary regions. Primers oriented outwards in the reference genome get face each other only at the duplication breakpoint. Electrophoretic separation of breakpoint-specific PCR products on 1.5% agarose gel (G) performed on germline DNA sample of the proband (a), her mother (b), her sister (c) and two independent controls. Note: a, b and c individuals are highlighted also in Panel A. Note: Uncropped image is demonstrated in Figure S3. Chromatogram visualizing the breakpoint (vertical brown line) included in the breakpoint-specific PCR (H). PCR polymerase chain reaction, L ladder.
In patient #97 we identified an SV in MSH2 consisting of an insertion of a guanine and a duplication of the 5’ 114 bps from the insertion (NM_000251.3 c.2620_2621ins[G;2507_2620], Fig. 3A). MLPA also pointed out the SV (as a duplication of exon 15), since the probe hybridization sequence fell into the duplicated stretch (Fig. 3B and Figure S4). This variant was found in a woman with a prior diagnosis of endometrial cancer at the age of 58 years, while no familial cancer history was explorable (Fig. 3C). Immunohistochemical analysis of the tumor revealed a tumor-specific decrease in the expression of MSH2 and MSH6, corresponding to the germline PV. RNA-based analyses confirmed the same sequence context as experienced in the germline: the insertion was faithfully transcribed, without any interference with splicing (r.2620_2621ins[G;2507_2620]). However, since the deduced translation product contained a premature termination codon (p.Tyr874Cysfs*3), the altered transcript occurred at a lower abundance probably as a result of nonsense-mediated decay (NMD, Fig. 3D-F).
Visual representation of the partial duplication of exon 15, contributing to the SV in the BAM file observed in patient #97 (A). MLPA analysis (probe set #P003-D1) extended data can be found in Figure S4 (B). Pedigree of patient #98 (C). Electrophoretic separation of MSH2 cDNA amplicons on a 1.5% agarose gel (D). The arrow point to an additional cDNA transcript. Note: Uncropped image is demonstrated in Figure S5. Chromatogram of gDNA-based Sanger sequencing visualizing the 5’ start of the detected SV (E). Chromatogram of cDNA-based Sanger sequencing visualizing the 5’ start of the detected SV (F). bp base pair, EC endometrial cancer.
In addition to the detected SVs, we have found a third complex PV in patient #38 as a result of two rare SNVs within 4 base pairs in MSH2. The proband was diagnosed with MSH2- and MSH6-deficient colorectal cancer at the age of 33 years with more LS-related cancer occurrences in the pedigree (Fig. 4A). MGPT followed by Sanger sequencing validated two SNVs within 4 base pairs in the MSH2 gene (Fig. 4B) which were confirmed to be on the same allele (Fig. 4C). As HGVS nomenclature (version 21.0.2, https://hgvs-nomenclature.org/stable/) requires the independent description of such variants with the signal that they are in cis position, the adequate designation is MSH2 c.[2042 A > C;2045 C > T], p.[(Gln681Pro; Thr682Ile)]. We have found no previous record with this naming in the ClinVar or Insight databases, however, the alternative naming of c.2042_2045delinsCAAT revealed an earlier record with likely pathogenic classification. Additionally, functional testing of the c.2042 A > C variant altering codon 681 revealed a deleterious effect on protein function15.
Pedigree of patient #38 (A). Chromatogram of gDNA-based Sanger sequencing visualizing the complex PV MSH2 c.[2042 A > C;2045 C > T] (B). The arrow points to the heterozygous loci of c.2042 A > C and c.2045 C > T. Representative image of reads revealing the co-occurrence of c.2042 A > C and c.2045 C > T on the same allele in the BAM file (C). Yellow boxes represent the alternative C at the 2042 position, while blue boxes represent the alternative T at the 2045 position.
Somatic MLH1 promoter methylation and RNA-based analyses prioritize sample for WGS
As per our recommended workflow (Fig. 1), PV-negative cases were further studied to select samples for WGS. MLH1 promoter methylation is a frequent epigenetic mechanism responsible for decreased MLH1 expression in tumors. In our cohort, 45 out of 50 tumor tissues with decreased expression of MLH1 were available for testing for MLH1 promoter methylation, among which all but one sample exhibited the methylation in all 16 analyzed CpG positions. The tumor sample of patient #4 was unmethylated at the MLH1 promoter and thus was selected for WGS. Allelic imbalance in the RNA level was used as another prioritization method to select samples for WGS, as both pathogenic promoter or deep intronic variants resulting in frameshift proteins and NMD can cause the relative predominance of the reference allele. No presentation of allelic imbalance was observed among the 17 cases, where germline transcribed heterozygote markers were present in the adequate mismatch repair gene (i.e. the one coding the protein with decreased expression in the tumor). However, the sample of patient #4 already selected for WGS exhibited a significant allelic imbalance regarding the promoter variant MLH1 c.-93G > A, which was due to NMD, as inhibition of NMD diminished this effect (Fig. 5A).
Allele imbalance detected at the MLH1 c.-93G > A heterozygote marker in patient #4 (A). Note: results from the reverse sequencing are presented. Puromycin treatment almost restored the allelic ratios measured at the gDNA-level. Red arrows show the position of MLH1 c.-93G > A. Chromatogram of gDNA-based Sanger sequencing of the MLH1 c.306+1222 A > G variant (B). The arrow points to the heterozygous locus of c.306+1222 A > G. Electrophoretic separation from MLH1 cDNA amplicons incorporating exons 1-9 from patient #4 and controls (C). Puromycin (Puro) treatment inhibits NMD. Note: Uncropped image is demonstrated in Figure S6. Electrophoretic separation of MLH1 cDNA amplicons restricted from alleles containing the reference ’G’ base at MLH1 c.-93 position (D). This allowed the amplification to occur from the allele which carried the detected deep intronic variant, as the alternative ’G’ base at the c.306+1222 position was in phase with the reference ’G’ base at the c.-93 position in patient #4 (as highlighted in A). Note: Uncropped image is demonstrated in Figure S7. Chromatogram of cDNA-based Sanger sequencing restricted from wild type cDNA transcript highlighting the -93G > A position from patient #4 (E). Restriction for the wild-type cDNA transcript was allowed by the reference primer which was specific to the exon 3-4 boundary. Note: results from the reverse sequencing are presented and might be compared to those presented in (A). The red arrow shows the position of MLH1 c.-93G > A. Genomic architecture of altered splicing resulting from the deep intronic variant MLH1 c.306+1222 A > G (F). Pedigree of patient #4 (G). bp base pair, BrainT brain tumor, CRC colorectal cancer, L ladder, NMD nonsense-mediated decay, Puro puromycin.
WGS detects a deep intronic PV in MLH1
WGS analysis of the gDNA of patient #4 detected 12 variants in MLH1 and further filtering regarding putative splice alterations revealed a rare SNV c.306+1222 A > G located in intron 3 (Fig. 5B, Supplementary Data S1). This variant was predicted to activate a deep intronic cryptic splice donor site, resulting in the exonization of an intronic sequence. cDNA amplification of the MLH1 amplicon incorporating exons 1–9 revealed an additional, larger PCR product in NMD-inhibited cells, which was absent in controls (Fig. 5C). We confirmed the pathogenicity of this deep intronic variant in two ways. Both ways harnessed marker variants, the phase of which relative to the studied intronic variant was determined by the former molecular analyses (i.e. allelic imbalance studies). First, we performed allele-specific RT-PCR restricted from the intronic variant-carrying allele and revealed the exclusive amplification of an altered mRNA incorporating a 288 bp-long intronic sequence upstream of c.306+1222 A > G (Fig. 5D). Second, we applied a so-called tagging variant analysis, where RT-PCR amplicon was generated with a reverse primer specific to the exon 3–4 boundary (and therefore selecting for the wild-type mRNA). The sequencing chromatogram of this amplicon did not contain the tagging variant (which is in phase with c.306+1222 A > G), so it revealed that reference mRNA is derived exclusively from the reference gDNA allele (Fig. 5E). The main aberrant transcript was r.306 + 934_306+1221ins, but to a much lesser extent r.[301_305del;306 + 934_306+1221ins] was also present (Fig. 5F). The inserted intronic sequence causes frameshift and consequential premature termination codon, so elicits NMD mechanism, which correlates very well with the strong allelic imbalance detected in this patient. The confirmation of its transcriptional consequence allowed the correct genetic diagnosis in a family with multiple LS-associated cancer manifestations (Fig. 5G), where the proband was diagnosed with MLH1-deficient colorectal cancer (Table 1, Fig. 5G). The inclusion of WGS following a prioritization workflow elevated the detection rate of PVs resulting in LS from 31% to 32% (Fig. 1A).
Recurring complex PVs in LS
To determine if the detected complex PVs were present in independent families within the analyzed time frame, we examined if there were any recurrent PVs in other patients who underwent genetic counseling and testing at our center. The MSH2 SV found in patient #97 (c.2620_2621ins[G;2507_2620]) was detected in a patient (patient #A) with prostate cancer (Fig. 6A) and lack of MSH2 and MSH6 expression in the tumor tissue verified the pathogenic nature of this variant concerning this malignancy (Fig. 6B). Interestingly, this family exhibited an abundance of urinary tract cancers (Fig. 6C) which is in line with the recently reported higher risk of urinary tract cancers in the carriers of MSH2 PVs16. Since this partial exon duplication results in framsehift and premature stop codon (PVS1 – very strong), confirmed by the functional mRNA-based assay presented in Fig. 3D-F (PS3 – strong) and we have identified this variant in two patients with a phenotype highly specific to MSH2-associated LS (PP4 – supporting), the ACMG classification of this variant is pathogenic.
Visual representation of the partial duplication of MSH2 exon 15, contributing to the SV in the BAM file observed in patient #A as well as patient #97 (A). Note: Read depths of patient #97 and control are also demonstrated in Fig. 4A and are presented here for optimal comparison. Immunohistochemical analysis of the mismatch repair proteins in the prostate cancer biopsy of patient #A (B). 20× magnification. Pedigree of patient #A (C). Representative image of reads revealing the co-occurrence of MSH2 c.2042 A > C and c.2045 C > T on the same allele from the BAM file in patient #B (D). Yellow boxes represent the alternative C at the 2042 position, while blue boxes represent the alternative T at the 2045 position. Immunohistochemical analysis of the mismatch repair proteins in the pancreatic cancer specimen of patient #B (E). 20× magnification. Pedigree of patient #B (F). Electrophoretic separation of MLH1 c.[2078_2172del; 2080_*+493dup]-specific PCR (further elaborated in Fig. 2F–H) products on 1.5% agarose gel (G) performed on germline DNA sample of patients #C and #C2 and two independent controls. Note: patients #C and #C2 are highlighted also on Panel J. Note: Uncropped image is demonstrated in Figure S8. Chromatogram visualizing the breakpoint (vertical brown line) included in the breakpoint-specific PCR (H). MLPA analyses with default (P003-D1) and confirmation (P248-B2) probe sets (I). Pedigree of patient #C (J). bp base pair, CRC colorectal cancer, EC endometrial cancer, GC gastric cancer, KC kidney cancer, L DNA ladder, OC ovarian cancer, PanC pancreatic cancer, PC prostate cancer.
The MSH2 c.[2042 A > C;2045 C > T] was also identified in patient #B, where the two variants were also found on the same allele (Fig. 6D). Patient #B was diagnosed with pancreatic cancer at the age of 55 and immunohistochemical analyses on the tumor tissue revealed the pathogenic nature of this variant concerning this rare LS manifestation (Fig. 6E). The patient’s familial cancer history included 4 diagnosed CRCs and one ovarian cancer (Fig. 6F). As in vitro data strongly support the pathogenicity of MSH2 c.2042 A > C15 fulfilling the criteria PS3 (strong), the variant is absent from the studied gnomAD database (PM2—moderate) and we have identified it in two patients with a phenotype highly specific to MSH2-associated LS (PP4 – supporting), the ACMG classification of this variant is likely pathogenic.
Our research group has previously reported an LS family with a germline MLH1 deletion affecting the same region (MLH1 c.2078_2172del) as observed in patient #1017. We have re-evaluated the gDNA samples of this previous family (HFC100 in the original publication, patient #C in the current study) and have confirmed that they harbor the same complex SV c.[2078_2172del; 2080_*+493dup] as patient #10 (Fig. 6G-J and Table S3). Additionally, following the completion of patient accrual to our pilot study, we have detected this complex MLH1 SV in a third independent family (patient #D, Table S3). The proband was diagnosed with three primary LS-associated tumors (one endometrial cancer at the age of 55 years and two CRCs (at the ages of 54 and 59 years), immunohistochemical analysis was available from the latter, which confirmed the decreased expression of MLH1 and PMS2). As this SV affects multiple exons resulting in a frameshift mutation (PVS1—very strong), we have confirmed the functional consequence on the mRNA level (PS3 – strong) and have confirmed the occurrence of the variant in three independent families with a clinical diagnosis of LS, among which 3 CRCs from two families exhibited the highly specific phenotype (MLH1- and PMS2-deficient CRC), satisfying the PP4 criteria (supporting), the ACMG classification of this variant is pathogenic.
Although limited segregation analyses were available in the family of patient #10, they were unavailable to assess in the families of patients #38, #97, #A and #B. Additionally, it is important to note, that proper segregation analyses necessitate the investigation of multiple first- second- and third-degree family members to confirm cosegregation18. In cases, where these segregation analyses were not available, the pathogenicity of the detected variants was identified by fulfilling additional criteria of the ACMG guideline9 as specifically described above. Although cascade testing was offered to 137 first-degree family members of the 32 patients diagnosed with LS by MGPT, only 34 family members (24.8%) chose to undergo genetic testing. This correlates with larger studies finding the request for cascade testing to be between 10% and 30%19 and represent a limitation in assessing pathogenicity.
In summary, all of the complex PVs identified with MGPT were confirmed to be recurrent in our investigated population.
WGS analysis of a retrospective cohort of PV-negative patients reveals the recurrent nature of MLH1 c.306+1222 A > G
Additionally, we performed WGS in a retrospective cohort from our center with a high likelihood of Lynch syndrome, where previous genetic analyses found no PVs (Table S4). We found no SVs or CNV in the genes of MLH1, MSH2, MSH6, PMS2 and EPCAM, however, 1013 SNVs were called. Further filtering of the variants excluded frequent variants and finally computational splice-altering predictions were performed (Fig. 7A, Supplementary Data S2). This workflow returned the same MLH1 deep intronic variant previously found in patient #4 (MLH1 c.306+1222 A > G) in patient #W10 (Fig. 7B-C), who was first diagnosed with MLH1- and PMS2-deficient CRC at the age of 54 years. As we were able to confirm the damaging functional consequence of this variant in patient #4 (PS3 – strong), this variant is not present in gnomAD referenc database (PM2 – moderate) and both the affected patients’ phenotype are highly specific to MLH1-associated LS (PP4 – supporting), the ACMG classification of this variant is likely pathogenic.
Filtering strategy of SNV (A). Chromatogram of gDNA-based Sanger sequencing visualizing the MLH1 c.306+1222 A > G variant in patient #W10 (B). Pedigree of patient #W10 (C). AF allele frequency, CRC colorectal cancer, GC gastric cancer, KC kidney cancer, PC prostate cancer, SNV single nucleotide variant.
As we have identified the MLH1 c.306+1222 A > G variant in our retrospective cohort as well, we were able to confirm the recurrent nature of this variant in our population.
Discussion
The diagnosis of LS has rapidly evolved in the past decades. In 2004, the revised Bethesda guidelines advised microsatellite instability (MSI) testing of only high-risk CRCs7, while today all CRCs are recommended to be tested for MSI20. The revised Bethesda guidelines advised germline testing only if mismatch repair deficiency has been confirmed7, while the latest NCCN guideline does not require prior testing for MSI in young CRC patients for germline testing8 and advocates for universal germline testing argue for offering genetic testing for all CRC patients21,22,23. On the technical side, the wide-ranging availability of NGS-based MGPT allowed the expansion of patients eligible for germline testing, however, the missing heritability in LS can only be investigated by the concurrent analysis of both the coding and non-coding regions4. Our diagnostic center has previously confirmed the robustness of WGS in detecting noncoding PVs in families with familial adenomatous polyposis and the Carney complex6,24. However, the optimal place of WGS in the molecular genetic diagnostic workflow of LS has not been determined.
Therefore, we designed a straightforward diagnostic algorithm for LS. Following national and international guidelines8, the first step is the application of the NGS-based MGPT. If this test fails to detect a PV, the lack of somatic MLH1 promoter methylation in MLH1-deficient tumors suffices the need for WGS. Although there are examples of rendering patients with somatic MLH1 promoter methylation ineligible for LS germline testing4,25, the instances of LS-associated cancers with MLH1 promoter methylation25 convinced us not to exclude these samples from further analysis. Subsequent evaluation of samples utilized RNA-based identification of a gDNA-directed transcriptional alteration, a method which has previously been widely applied14,26,27. In particular, we tested if an allelic imbalance of a germline transcribed marker detected by the MGPT might provide evidence of NMD caused by a frameshift PV28, in which case WGS would also be advised. This prioritization algorithm can be safely regarded as a time- and cost-efficient workflow. We have recently demonstrated that the application of a pan-cancer hereditary MGPT to diagnose hereditary endocrine tumor syndromes is a cost-effective strategy29. Following MGPT, the cost of performing both the MLH1 promoter methylation analysis and the cDNA-based investigation of allelic imbalance is approximately 300 USD, while WGS followed by clinical variant annotation and confirmatory validation by an independent method is available between 3000 and 4000 USD30,31. Although inititatives to reduce sequencing costs by performing low-pass WGS might allow the detection of CNVs32,33, current diagnostic procedures necessitate 30-50X average read depth to allow adequate SNV calling.
During the 19-month prospective implementation period of this workflow, MGPT has identified 31 PVs (31%) in LS genes, which correlates well with results from different centers34,35,36. Among these PVs, 3 (9.7%) were complex variants. Molecular characterization of these complex PVs revealed the transcriptional consequences, while all of them were confirmed to be recurrent in our studied population. The two novel SVs have not been previously identified, however in the case of MSH2 c.[2042 A > C; 2045 C > T], c.2045 C > T has been previously detected in another patient with a clear germline MSH2 PV (c.1292 T > A), supporting the neutral effect of c.2045 C > T37. This is important as only c.2042 A > C can be regarded as a PV, which presumably first appeared on a haplotype containing c.2045 C > T as no occurrence of c.2042 A > C without c.2045 C > T has been observed.
The WGS-based detection of an MLH1 deep intronic PV resulting in a novel splice donor and causing the exonization of a 288bp-long intronic region in the only sample selected for WGS from our prospective cohort clearly shows the benefit of adding WGS to the diagnostic procedure. Interestingly, this very variant was also detected in our retrospective cohort in a patient with MLH1-deficient CRC and a clear familial history of LS-associated cancers, while it was absent in the gnomAD database (gnomAD Genomes 3.1.2 v2) reporting the results of 75,000 human genomes.
The recurrent nature of all the identified complex PVs as well as the detected MLH1 deep intronic PV greatly highlights the clinical importance of this workflow in the studied population. Our method also confirms the necessity of a multimodal approach in the genetic diagnostic workflow of LS, including genetic counseling, DNA and RNA-based genetic testing and expert histopathological diagnosis. This enables optimal variant classification and is a prerequisite to performing proper presymptomatic genetic testing in relatives, who can then benefit from personalized cancer screening protocols.
It is important to note the following limitations of this study. Primarily, although this is the first prospective analysis of the role of WGS in the diagnosis of LS, our 19-month timeframe allowed the inclusion of only 100 patients. Next, our algorithm is unable to leverage allelic imbalance assays in samples with no germline transcribed heterozygote markers and therefore, might still miss the diagnosis of all deep intronic PVs. Further, there are certain, extra rare cases of hereditary epigenetic variations resulting in Lynch syndrome (e.g. constitutional, primary or secondary methylation of the MLH1 promoter38,39) which might still be missed with this approach. However, as earlier reports confirmed that germline noncoding DNA variations are the most frequent cause of undiagnosed LS after MGPT4, we believe that our approach presents a clinically viable and cost-effective workflow to optimally provide molecular genetic diagnosis of LS. Moreover, the application of the gnomAD database as an universal healthy control population is suboptimal as it does not reflect the germline genetic background of the studied (Hungarian) population, however there is no large germline whole genome database currently available which might have been used in this population. Nevertheless, by the application of the gnomAD database we were able to filter out variants which might have been found to be frequent in only one, distinct population, which can help to robustly filter out benign variants40.
In conclusion, we designed a biomarker-driven prioritization workflow for the incorporation of WGS in the diagnosis of LS. By implementing this workflow on 100 consecutive patients diagnosed at our center, 3 complex PVs were identified out of 31 PVs with MGPT and 1 deep-intronic PV has been detected with WGS. Further analysis confirmed the recurrent nature of all these complex and deep-intronic PVs which underlines the clinical importance of this workflow in our studied population.
Data availability
All data generated or analyzed during this study are included in this published article [and its supplementary information files]. Previously unreported variants have been deposited in the Leiden Open Variation Database (LOVD, variant IDs: 0000959750, 0000959748, 0000959747 0000959744, 0000985142, 0000987870).
References
Lynch, H. T. Classics in oncology. Aldred Scott Warthin, M.D., Ph.D. (1866-1931). CA Cancer J. Clin. 35, 345–347, https://doi.org/10.3322/canjclin.35.6.345 (1985).
Pocza, T. et al. Germline Structural Variations in Cancer Predisposition Genes. Front Genet. 12, 634217, https://doi.org/10.3389/fgene.2021.634217 (2021).
Kovacs, M. E., Papp, J., Szentirmay, Z., Otto, S. & Olah, E. Deletions removing the last exon of TACSTD1 constitute a distinct class of mutations predisposing to Lynch syndrome. Hum. Mutat. 30, 197–203, https://doi.org/10.1002/humu.20942 (2009).
Te Paske, I. et al. Noncoding aberrations in mismatch repair genes underlie a substantial part of the missing heritability in Lynch syndrome. Gastroenterology 163, 1691–1694 e1697, https://doi.org/10.1053/j.gastro.2022.08.041 (2022).
Arnold, A. M. et al. Targeted deep-intronic sequencing in a cohort of unexplained cases of suspected Lynch syndrome. Eur. J. Hum. Genet. 28, 597–608, https://doi.org/10.1038/s41431-019-0536-9 (2020).
Bozsik, A. et al. Genome sequencing-based discovery of a novel deep intronic APC pathogenic variant causing exonization. Eur. J. Hum. Genet. 31, 841–845, https://doi.org/10.1038/s41431-023-01322-y (2023).
Umar, A. et al. Revised Bethesda Guidelines for hereditary nonpolyposis colorectal cancer (Lynch syndrome) and microsatellite instability. J. Natl. Cancer Inst. 96, 261–268, https://doi.org/10.1093/jnci/djh034 (2004).
Guidelines, N. C. C. N. NCCN Clinical Practice Guidelines in Oncology: Genetic/Familial High-Risk Assessment: Colorectal version 2.2023. (2023).
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–424, https://doi.org/10.1038/gim.2015.30 (2015).
Riggs, E. R. et al. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet. Med. 22, 245–257, https://doi.org/10.1038/s41436-019-0686-8 (2020).
Korbie, D. J. & Mattick, J. S. Touchdown PCR for increased specificity and sensitivity in PCR amplification. Nat. Protoc. 3, 1452–1456, https://doi.org/10.1038/nprot.2008.133 (2008).
Balog, J. Á. et al. Peripheral immunophenotyping reveals lymphocyte stimulation in healthy women living with hereditary breast and ovarian cancer syndrome. iScience 27, 109882, https://doi.org/10.1016/j.isci.2024.109882 (2024).
Morak, M. et al. Full-length transcript amplification and sequencing as universal method to test mRNA integrity and biallelic expression in mismatch repair genes. Eur. J. Hum. Genet. 27, 1808–1820, https://doi.org/10.1038/s41431-019-0472-8 (2019).
Andreutti-Zaugg, C., Scott, R. J. & Iggo, R. Inhibition of nonsense-mediated messenger RNA decay in clinical samples facilitates detection of human MSH2 mutations with an in vivo fusion protein assay and conventional techniques. Cancer Res. 57, 3288–3293 (1997).
Jia, X. et al. Massively parallel functional testing of MSH2 missense variants conferring Lynch syndrome risk. Am. J. Hum. Genet. 108, 163–175, https://doi.org/10.1016/j.ajhg.2020.12.003 (2021).
Dominguez-Valentin, M. et al. Mortality by age, gene and gender in carriers of pathogenic mismatch repair gene variants receiving surveillance for early cancer diagnosis and treatment: a report from the prospective Lynch syndrome database. EClinicalMedicine 58, 101909, https://doi.org/10.1016/j.eclinm.2023.101909 (2023).
Papp, J., Kovacs, M. E. & Olah, E. Germline MLH1 and MSH2 mutational spectrum including frequent large genomic aberrations in Hungarian hereditary non-polyposis colorectal cancer families: implications for genetic testing. World J. Gastroenterol. 13, 2727–2732, https://doi.org/10.3748/wjg.v13.i19.2727 (2007).
Jarvik, G. P. & Browning, B. L. Consideration of cosegregation in the pathogenicity classification of genomic variants. Am. J. Hum. Genet. 98, 1077–1081, https://doi.org/10.1016/j.ajhg.2016.04.003 (2016).
Schmidlen, T. J. et al. The impact of proband indication for genetic testing on the uptake of cascade testing among relatives. Front Genet. 13, 867226, https://doi.org/10.3389/fgene.2022.867226 (2022).
Rubenstein, J. H., Enns, R., Heidelbaugh, J. & Barkun, A. American Gastroenterological Association Institute Guideline on the Diagnosis and Management of Lynch Syndrome. Gastroenterology 149, 777–782, https://doi.org/10.1053/j.gastro.2015.07.036 (2015).
Jiang, W. et al. Universal germline testing among patients with colorectal cancer: clinical actionability and optimised panel. J. Med. Genet. 59, 370–376, https://doi.org/10.1136/jmedgenet-2020-107230 (2022).
Pearlman, R. et al. Prospective Statewide Study of Universal Screening for Hereditary Colorectal Cancer: The Ohio Colorectal Cancer Prevention Initiative. JCO Precis Oncol 5, https://doi.org/10.1200/PO.20.00525 (2021).
Esplin, E. D. et al. Universal germline genetic testing for hereditary cancer syndromes in patients with solid tumor cancer. JCO Precis Oncol. 6, e2100516, https://doi.org/10.1200/PO.21.00516 (2022).
Palla, S. et al. Whole genome sequencing resolves 10 years diagnostic odyssey in familiar myxoma. Sci. Rep. 13, 14658, https://doi.org/10.1038/s41598-023-41878-9 (2023).
Helderman, N. C. et al. MLH1 promotor hypermethylation in colorectal and endometrial carcinomas from patients with lynch syndrome. J. Mol. Diagn. 26, 106–114, https://doi.org/10.1016/j.jmoldx.2023.10.005 (2024).
Jakubowska, A. et al. Optimization of experimental conditions for RNA-based sequencing of MLH1 and MSH2 genes. Hum. Mutat. 17, 52–60 (2001).
Abed, A. A., Gunther, K., Kraus, C., Hohenberger, W. & Ballhausen, W. G. Mutation screening at the RNA level of the STK11/LKB1 gene in Peutz-Jeghers syndrome reveals complex splicing abnormalities and a novel mRNA isoform (STK11 c.597(insertion mark)598insIVS4). Hum. Mutat. 18, 397–410, https://doi.org/10.1002/humu.1211 (2001).
Kwok, C. T., Ward, R. L., Hawkins, N. J. & Hitchins, M. P. Detection of allelic imbalance in MLH1 expression by pyrosequencing serves as a tool for the identification of germline defects in Lynch syndrome. Fam. Cancer 9, 345–356, https://doi.org/10.1007/s10689-009-9314-0 (2010).
Patocs, A. et al. Cost-effectiveness of genetic testing of endocrine tumor patients using a comprehensive hereditary cancer gene panel. J. Clin. Endocrinol. Metab. 109, 3220–3233, https://doi.org/10.1210/clinem/dgae300 (2024).
Runheim, H. et al. The cost-effectiveness of whole genome sequencing in neurodevelopmental disorders. Sci. Rep. 13, 6904, https://doi.org/10.1038/s41598-023-33787-8 (2023).
Thangavelu, T. et al. Micro-costing of genetic diagnostics in acute leukemia in Sweden: from standard-of-care to whole-genome sequencing. J. Med. Econ. 27, 1053–1060, https://doi.org/10.1080/13696998.2024.2387515 (2024).
Christodoulou, E. et al. Combined low-pass whole genome and targeted sequencing in liquid biopsies for pediatric solid tumors. NPJ Precis Oncol. 7, 21, https://doi.org/10.1038/s41698-023-00357-0 (2023).
Di Cosimo, S. et al. Low-pass whole genome sequencing of circulating tumor cells to evaluate chromosomal instability in triple-negative breast cancer. Sci. Rep. 14, 20479, https://doi.org/10.1038/s41598-024-71378-3 (2024).
Kim, S. R. et al. Maximizing cancer prevention through genetic navigation for Lynch syndrome detection in women with newly diagnosed endometrial and nonserous/nonmucinous epithelial ovarian cancer. Cancer 127, 3082–3091, https://doi.org/10.1002/cncr.33625 (2021).
Pan, S. et al. Discordance between germline genetic findings and abnormal tumor immunohistochemistry staining of mismatch repair proteins in individuals with suspected Lynch syndrome. Front Oncol. 13, 1069467, https://doi.org/10.3389/fonc.2023.1069467 (2023).
Syed, H. et al. Referral, uptake, and outcome of genetic counseling and testing in patients with early-onset colorectal cancer. J. Natl. Compr. Canc Netw. 21, 1156–1163.e1155, https://doi.org/10.6004/jnccn.2023.7057 (2023).
Mangold, E. et al. Spectrum and frequencies of mutations in MSH2 and MLH1 identified in 1,721 German families suspected of hereditary nonpolyposis colorectal cancer. Int. J. cancer J. Int. du cancer 116, 692–702, https://doi.org/10.1002/ijc.20863 (2005).
Hitchins, M. P. et al. Dominantly inherited constitutional epigenetic silencing of MLH1 in a cancer-affected family is linked to a single nucleotide variant within the 5’UTR. Cancer cell 20, 200–213, https://doi.org/10.1016/j.ccr.2011.07.003 (2011).
Joo, J. E. et al. Identifying primary and secondary MLH1 epimutation carriers displaying low-level constitutional MLH1 methylation using droplet digital PCR and genome-wide DNA methylation profiling of colorectal cancers. Clin. Epigenetics 15, 95, https://doi.org/10.1186/s13148-023-01511-y (2023).
Gudmundsson, S. et al. Variant interpretation using population databases: Lessons from gnomAD. Hum. Mutat. 43, 1012–1030, https://doi.org/10.1002/humu.24309 (2022).
Acknowledgements
The project was implemented with the support from the National Research, Development and Innovation Fund of the Ministry of Culture and Innovation under the National Laboratories Program (National Tumor Biology Laboratory (2022-2.1.1-NL-2022-00010)) and the Hungarian Thematic Excellence Program (under project TKP2021-EGA-44) Grant Agreements with the National Research, Development and Innovation Office. We would like to acknowledge the financial support of the National Research, Development and Innovation Office under grant NKFIH-FK-138377. V.K.G. is the recipient of the János Bolyai Postdoctoral Scholarship (BO/00141/21) and the New National Excellence Program Bolyai+ Scholarship (ÚNKP-22-5-SE-25, ÚNKP-23-5-SE-16). H.B. is supported by the NRDI NKFI-FK135065, the New National Excellence Program of the Ministry of Human Capacities (ÚNKP-23-5-SE-4) and the Bolyai Research Fellowship of the Hungarian Academy of Sciences. We would like to acknowledge the professional expertise and help of Dr. Jan Radvanszky and Dr. Jaroslav Budis. We would like to acknowledge the professional support of Lajosne Franko, Gabriella Domokos, Maria Baloghne Kovacs and Judit Ferenczi.
Funding
Open access funding provided by National Institute of Oncology.
Author information
Authors and Affiliations
Contributions
V.K.G., A.B., A.P., and J.P. designed the project. V.K.G., H.B., and A.P. performed genetic counseling and were involved in data analysis. K.H., A.B., T.P., and J.P. performed genetic analyses and interpreted the results. I.V., T.S., and E.T. performed histopathological analyses. Ta.Se., D.R., and To.Sz. performed WGS which was analyzed by I.L. and J.P. E.O. and A.P. provided scientific supervision. V.K.G., A.B., and K.H. wrote the manuscript. All authors read and approved the final form of the manuscript.
Corresponding author
Ethics declarations
Competing interests
Ta.Se., D.R., and To.Sz. are affiliated with Geneton Ltd., but this did not affect the current study. All other authors declare no conflicts of interest.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Horti-Oravecz, K., Bozsik, A., Pócza, T. et al. Whole genome sequencing completes the molecular genetic testing workflow of patients with Lynch syndrome. npj Genom. Med. 10, 5 (2025). https://doi.org/10.1038/s41525-025-00461-z
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41525-025-00461-z









