Abstract
Joubert syndrome (JS) is a genetically heterogeneous neurodevelopmental ciliopathy. Despite exome sequencing (ES), several patients remain undiagnosed. This study aims to increase the diagnostic yield by uncovering cryptic variants through targeted ES reanalysis. We first focused on 26 patients in whom ES only disclosed heterozygous pathogenic coding variants in a JS gene. We reanalyzed raw ES data searching for copy number variants (CNVs) and intronic variants affecting splicing. We validated CNVs through real-time PCR or chromosomal microarray, and splicing variants through RT-PCR or minigenes. Cryptic variants were then searched in additional 44 ES-negative JS individuals. We identified cryptic “second hits” in 14 of 26 children (54%) and biallelic cryptic variants in 3 of 44 (7%), reaching a definite diagnosis in 17 of 70 (overall diagnostic gain 24%). We show that CNVs and intronic splicing variants are a common mutational mechanism in JS; more importantly, we demonstrate that a significant proportion of such variants can be disclosed simply through a focused reanalysis of available ES data, with a significantly increase of the diagnostic yield especially among patients previously found to carry heterozygous coding variants in the KIAA0586, CC2D2A and CPLANE1 genes.
Similar content being viewed by others
Introduction
Joubert syndrome (JS) is a genetically heterogeneous neurodevelopmental ciliopathy, diagnosed by the presence of a highly peculiar cerebellar and brainstem malformation, known as the “molar tooth sign” (MTS). The disease manifests early in life with hypotonia, developmental delay, oculomotor abnormalities, and respiratory pattern defects. Later signs include intellectual disability of variable severity, ataxia, and possible involvement of other organs such as the retina, kidneys, liver and skeleton [1].
More than 40 genetic forms of JS have been described, all following autosomal recessive inheritance, with the exceptions of X-linked OFD1 and dominant SUFU [2]. Yet, despite extensive sequencing of known genes through next-generation sequencing (NGS)-based targeted panels or exome sequencing (ES), distinct studies over the past few years have detected biallelic pathogenic variants in about 60–90% patients [3]. This leaves a substantial proportion of families without a definite diagnosis, hampering proper counseling, management, and access to prenatal and preimplantation diagnosis.
Among known genes, six “major genes” (CPLANE1, CEP290, AHI1, CC2D2A, TMEM67, and KIAA0586) account for most diagnosed cases, with the remaining genes representing rare or ultra-rare causes of JS [3]. The quest for novel genes is still open, but it is unlikely that these alone can explain all negative cases. Recent evidences increasingly show that hidden heritability can be at least partly explained by the occurrence of cryptic pathogenic variants in known genes [4]. The term “cryptic” refers to variants that are either not easily captured or even undetectable by conventional diagnostic NGS techniques and related bioinformatic pipelines, which are mainly designed to disclose single nucleotide variants (SNVs) and small indels within exons and canonical splice sites. Cryptic variants may include complex genomic rearrangements, intronic variants disrupting splicing, and variants in regulatory regions affecting gene expression [5]. Moreover, intragenic copy number variants (CNVs) involving exonic and/or intronic regions of a gene can also remain undetected by conventional diagnostic NGS algorithms, especially those adopted in the past years, likely contributing to the proportion of undiagnosed cases.
Cryptic variants in JS-related genes have occasionally been reported. A paradigmatic example is the serendipitous identification of the deep intronic c.2991+1655 A > G splicing variant in the CEP290 gene, which is a frequent cause of isolated Leber Congenital Amaurosis and also occurs in JS [6]. Few other reports have described the occasional detection of large deletions or intronic variants in JS-related genes such as KIAA0586, TMEM67, CPLANE1, TCTN2, CEP120, CEP290 and OFD1, which were validated by studies on patients’ RNA [7,8,9,10,11,12,13].
Currently, genome sequencing (GS) is the gold standard technique to identify cryptic variants, as it can sequence beyond coding regions, and is able to detect SNVs as well as CNVs and more complex rearrangements [14]. However, in many countries including Italy, GS has not yet entered routine diagnostics, due to the still elevated costs, high amount of data to be analyzed and stored, and difficulties in data interpretation.
In the present study we tackle the issue of missing heritability in JS, hypothesizing that: (1) a relevant proportion of JS patients lacking a genetic diagnosis could harbor cryptic variants in a known gene, and, (2) at least a subset of these variants could be easily and quickly identified through a focused reanalysis of available ES data. To address these hypotheses, we adopted a proof-of-principle strategy by focusing on a sub-cohort of patients in whom ES identified only a single heterozygous pathogenic variant in a JS-related gene.
Patients and methods
Patients’ selection
This project relied on a large cohort of over 550 JS probands of European descent (mainly Italian) recruited in the Valente Lab since 2003. This cohort underwent genetic testing over twenty years, including either direct Sanger sequencing of some JS genes, targeted sequencing of a large panel of ciliopathy genes (comprising the majority of JS known genes) or, more recently, ES. Ethical approval for genetic testing of JS patients is in place, and all families signed a written informed consent for genetic testing. Through these approaches, biallelic pathogenic SNVs were so far identified in 357 patients.
Among patients lacking a conclusive diagnosis, we focused on the 70 JS children who failed to receive a definite genetic diagnosis by ES. We did not include patients who remained undiagnosed by “old” targeted panel sequencing, as in our experience this approach featured incomplete coverage across coding sequences and a relatively low diagnostic accuracy. Notably, 26 out of 70 children (37%) carried a single heterozygous pathogenic variant in a known autosomal recessive JS gene, including either loss-of-function variants (frameshift, nonsense or variants affecting canonical splice sites) or missense variants classified as pathogenic or likely pathogenic according to ACMG criteria [15, 16]. No other pathogenic variants in causative or candidate genes were identified by ES in these cases.
These 26 patients were initially selected for this study, while the remaining 44 ES-negative children were analyzed in a second step. Patients with monoallelic or biallelic variants of unknown significance (VUS) identified after ES were considered as negative cases and included in this second step.
Of the 26 single heterozygous carriers, 22 had a pathogenic variant in a major JS gene. The commonest genes were CC2D2A (NM_001378615.1), with 8 patients carrying 7 truncating and one missense variant, and CPLANE1 (NM_023073.4), with 6 patients all carrying truncating variants. Four patients harbored the same heterozygous frameshift variant in KIAA0586 (NM_001244189.2), two carried heterozygous truncating variants in CEP290 (NM_025114.4), while the remaining six patients carried each a deleterious variant in either the AHI1 (NM_001134831.2), TMEM67 (NM_153704.6), INPP5E (NM_019892.6), CEP120 (NM_001375405.1), PIBF1 (NM_006346.4) and TCTN1 (NM_001082538.3) genes (Supplementary Table 1).
Statistical comparison of LOF allele frequencies
We sought to compare the frequencies of single heterozygous loss of function (LOF) variants in our JS cohort (ES-JS database) versus non-JS cohorts, including our ES-non-JS database, the Network for Italian Genomes - Exomes from Italy (NIG-ExIT; http://nigdb.cineca.it, accessed on April 2024) and gnomAD (https://gnomad.broadinstitute.org, v.3.1.2) databases. To this aim, we employed different approaches due to the different nature of the databases. In our ES-JS (n = 218) and ES-nonJS (n = 4328) cohorts and in the NIG-ExIT database (n = 1686), we derived the frequencies of single heterozygous LOF variants by referring to the total number of sequenced alleles in each group. Differently, in the gnomAD database, since each nucleotide is sequenced in a variable number of individuals, we calculated the frequencies of heterozygous LOF variants by referring to the weighted average of the times each single nucleotide harboring a LOF was sequenced in gnomAD, according to the following formula:
Where:
-
m is the total number of LOF alleles,
-
ni is the “Allele Number” for the LOF i-th allele,
-
fi is the frequency of the LOF i-th allele,
-
Fglobal is the final “weighted” global frequency of all LOF alleles for each gene.
After this process, the weighted frequencies of LOF alleles for the considered genes in gnomAD were statistically compared to the frequencies of single heterozygous LOF variants in our JS cohort using a chi-square test.
Identification of copy number variants
To search for in trans deletions or duplications, we exploited ExomeDepth [17] and CNVkit [18] algorithms along with a direct inspection of BAM files. Identified CNVs were verified either by quantitative real-time PCR (qRT-PCR) or chromosomal microarray analysis (Agilent Human Genome CGH Microarray Kit 4 × 180 k, Santa Clara, California, USA) on genomic DNA. Primers for qRT-PCR were designed within genomic regions flanking the predicted CNVs and are reported in Supplementary Table 2.
Identification of intronic variants
To search for intronic variants possibly affecting splicing, we reanalyzed raw ES data from FASTQ files to include deep intronic variants, since conventional ES secondary analysis is usually set to filter intronic variants lying beyond 20–30 nucleotides from the exon-intron junction. In particular, we extended the .bed file (Human Core Exome, Twist Bioscience, San Francisco, California, USA) target regions by adding 200 nucleotides upstream and downstream each interval, to retain as many intronic variants as possible, even if poorly covered. We then inspected the genes of interest, focusing on intronic variants which were either absent or had a frequency lower than 0.5% both in gnomAD and in our internal ES-non-JS database. Retained variants were tested in silico for their potential ability to affect splicing using an array of informatic tools including Human Splicing Finder [19], SpliceAI [20] and Pangolin [21]. Variants fulfilling the above criteria and predicted to potentially affect splicing were confirmed by Sanger sequencing, and their correct segregation (i.e., in trans with the known pathogenic coding variant) was verified by sequencing the parents.
Validation of the impact of intronic variants on splicing
To confirm the predicted impact of intronic variants on splicing, we followed two alternative strategies. In three patients, a fresh blood sample could be obtained. RNA was extracted from Tempus Blood RNA tubes (Applied Biosystem, Waltham, Massachusetts, USA) and then retrotranscribed to cDNA using the High-Capacity RNA-to-cDNA kit (Thermo Fisher Scientific, Waltham, Massachusetts, USA). A PCR was set up with specific primers designed within flanking exons (Supplementary Table 3), to amplify the region of interest. The PCR product was run on an agarose gel to detect the presence of two distinct bands. Individual bands were gel-excised and Sanger sequenced using the Big Dye Terminator chemistry (Thermo Fisher Scientific).
For the remaining patients, it was not possible to obtain a fresh sample for RNA extraction. To functionally validate these variants, minigene experiments were set up, as previously described [22]. Briefly, each genomic region containing the candidate splicing variant and flanking exons was PCR-amplified. The PCR product was cloned into a pGEM®-T Easy Vector (Promega, Madison, Wisconsin, USA) and transformed in the One-Shot TM TOP10 Chemically Competent E. coli (Invitrogen, Waltham, Massachusetts, USA) competent bacterial cells, following manufacturer’s protocol. Plasmids containing the wild-type or the mutated sequence were extracted using PureYield™ Plasmid Miniprep System (Promega) and then sub-cloned into pSPL3 vector (Life Sciences-Invitrogen, Waltham, Massachusetts, USA). The final pSPL3 plasmids containing the wild-type or the mutated sequence were transfected in HEK293-T cells using Lipofectamine 2000 (Thermo Fisher Scientific). After 24 h, RNA was extracted, retrotranscribed, a PCR was performed on cDNA using plasmid-specific primers and run on agarose gel as described above. Individual bands were excised and Sanger sequenced to assess the splicing defect at the RNA level. All cryptic variants confirmed to affect splicing were submitted to the Leiden Open Variation Database (LOVD) with the following individual accession numbers: #00449684, #00449685, #00449685, #00449687, #00449688, #00449689 and #00449690.
Results
Single heterozygous deleterious variants are strongly enriched in the JS cohort compared to control populations
For this “proof-of-principle” study, we initially focused on 26 JS individuals in whom ES had detected a single heterozygous pathogenic coding variant in a JS gene, as we reasoned that these patients had the highest chances to carry a cryptic variant in trans on the same gene.
To reinforce this hypothesis against the possibility that such single heterozygous variants could represent coincidental findings, we first compared the frequencies of heterozygous LOF alleles in five major genes (CPLANE1, CEP290, AHI1, CC2D2A, and KIAA0586) in our JS cohort with their respective frequencies in the Caucasian non-Finnish control population in gnomAD. We focused on LOF variants only, as the pathogenicity of missense variants would be much more difficult to assess. We also excluded TMEM67, since most variants found in this gene in JS patients are hypomorphic missense variants.
The frequency of single heterozygous LOF variants in our JS cohort ranged between 0.7% and 1.4%, while these variants were extremely rare in gnomAD, with frequencies ranging from 0.001 to 0.01%. For each gene, such difference was strongly statistically significant (p < 0.00001) (Fig. 1). Similarly, single heterozygous LOF variants in the tested genes were significantly less common in our internal diagnostic database after excluding JS cases (0.10–0.39%), as well as in the NIG-ExIT database, containing aggregated data from 1686 healthy subjects of Italian origin (0.09–0.21%). These results support the hypothesis that the heterozygous pathogenic variants detected in JS patients do not represent a coincidental occurrence reflecting a mere “carrier status” for a recessive condition, but are likely causative of the disease, in trans with a second cryptic variant.
Bioinformatic reanalysis of ES data disclosed potential cryptic “second hits” in over half patients
Since CNVs were not routinely detected in older bioinformatic pipelines, these variants may contribute to hidden heritability. Thus, we first searched for CNVs by combining bioinformatic tools with a manual re-inspection of BAM files focusing on the genes of interest, and detected large intragenic CNVs in six patients (Table 1). Four unrelated patients carried the same ~7 Kb intragenic deletion encompassing exons 8 to 10 in the KIAA0586 gene, already reported in literature [23,24,25]. The remaining two patients carried an already reported deletion of ~5 Kb, encompassing exons 18 to 20 of the AHI1 gene [26], and a duplication of ~88 Kb involving exons 17 to 49 of the CPLANE1 gene, respectively. All CNVs were confirmed by quantitative real time PCR or chromosomal microarray analysis on genomic DNA and were shown to be in trans with the coding variant. Patients’ RNA was not available to assess the impact of these CNVs at the transcriptional level. No additional CNVs were detected in the remaining patients.
Next, we reanalyzed ES data searching for rare intronic variants potentially affecting splicing and identified candidate variants in nine patients, including three variants in CPLANE1 (one recurrent in two unrelated patients), four in CC2D2A and one in CEP290 (Table 2). Three variants (in the CPLANE1 and CC2D2A genes) were within 20 bp from the exon-intron junction, but had not been reported in the former diagnostic report as they did not affect canonical splice sites. The remaining variants were deeper in the introns (from −23 up to +187) and had a coverage ranging from only 5 up to 124 reads. All variants were confirmed by Sanger sequencing and were demonstrated to be in trans with the coding variant.
No additional cryptic variants were detected in the remaining 11 patients (Supplementary Table 4).
Functional studies confirmed defective splicing for all but one candidate variants
Despite none of the intronic variants affected canonical donor or acceptor splice sites, they were predicted to alter splicing by in silico analyses, either by creating new acceptor splice sites, altering the branching points, or dysregulating splicing regulatory elements (Supplementary Table 5). We sought to validate these predictions by direct amplification and sequencing of patients’ RNA or, in the absence of such biological material, by the development of customized minigenes. Schematic results of in vitro assays are summarized in Fig. 2, while the resulting effect on proteins is reported in Table 2.
A Direct RNA assay, the variant causes skipping of exon 13; (B) Direct RNA assay, a 63 bp pseudo-exon is inserted between exons 15 and 16; (C) Minigene, the variant disrupts the acceptor splicing site of exon 24, causing the loss of 15 bp; (D) Minigene, the variant introduces a new acceptor splicing site upstream exon 17, causing a 16 bp intron retention; (E, F) Minigenes, the variants cause the skipping of exons 34 and 44, respectively; (G) Minigene, a 57 bp pseudo-exon is inserted between exons 9 and 10. CTR control, PT patient, WT wild-type, MUT mutated, b1, b2: pSPL3 synthetic exons; black pins: wild-type bands; red pins: mutated bands; white asterisks: unexpected bands probably due to an internal splice site present in both the wild-type and mutated constructs in minigenes experiments.
Notably, we were able to confirm a defective splicing for all tested variants but one, the CEP290 c.7130-160 T > G variant, for which both cDNA sequencing and minigene assay failed to detect overt splicing defects (Supplementary Fig. 1).
Variants c.1360-29 C > G in the CC2D2A gene and c.2747-161 A > G in the CPLANE1 gene were evaluated directly by sequencing patients’ RNA samples. The first variant was shown to cause skipping of exon 13, resulting in a prematurely truncated protein, while the second (recurring in three unrelated patients) introduced a 63 bp pseudo-exon in frame between exons 15 and 16, and is therefore expected to include 21 additional amino acids in the protein.
For all remaining variants, through the development of minigene plasmids, we were able to demonstrate the activation of new acceptor splice sites (for CC2D2A c.3015-12 T > G and CC2D2A c.2004-17 A > G), exon skipping (for CC2D2A c.4315-23 T > C and CPLANE1 c.8471-4_8471-3del) or the introduction of a pseudo-exon (for CPLANE1 c.1121+187 A > G). At the protein level, all these splicing defects resulted in premature protein truncation or, in the case of CC2D2A c.4315-23 T > C and CC2D2A c.3015-12 T > G, the in frame deletion of 41 and 5 amino acids, respectively.
Reanalysis of patients with negative ES
Lastly, we performed bioinformatic reanalysis of ES data in the remaining 44 negative patients and identified biallelic cryptic variants in three additional cases (Table 3). Two patients were homozygous for the CPLANE1 c.8471-4_8471-3del variant, while the third one carried the CPLANE1 c.2747-161 A > G deep intronic variant in the heterozygous state. In this patient, reanalysis further disclosed a heterozygous in frame deletion spanning ~1Kb, encompassing exons 7 and 8 of the CPLANE1 gene, in trans with the deep intronic variant.
Discussion
Here we demonstrate that CNVs and intronic variants affecting splicing are common mutational mechanisms in JS, and that at least a proportion of these variants can be safely identified through a focused reanalysis of available ES data. This approach, easily exportable to the diagnostic setting, can significantly increase the diagnostic yield of JS, providing substantial benefits for patients and families. A timely diagnosis of the underlying genetic defect would allow families to receive adequate genetic counseling for future pregnancies and gain access to prenatal and preimplantation diagnosis. Even more importantly, a genetic diagnosis is pivotal to provide affected children with a correct prognosis and establish the most appropriate management plan for prevention and treatment of potential complications. For instance, renal complications such as nephronophthisis, occurring in about one third of JS patients, usually present insidiously towards the end of the first decade and are potentially life-threatening in the absence of specific management and treatment. While clinical and laboratory indicators of renal involvement may not be clearly detectable at onset, the early identification of pathogenic variants in genes carrying a strong risk of renal involvement (such as CEP290) allow for the appropriate surveillance for renal issues since the first years of life, while mutations in genes that rarely associate to kidney defects (e.g., CPLANE1) can reassure families on a very low risk of renal failure [1, 2].
In the pre-NGS era, the genetic diagnosis of JS was extremely difficult due to the high number of disease-causing genes, many of which consisting of a large number of exons. The widespread adoption of NGS greatly improved the diagnostic rate in JS, yet a variable proportion of patients (10 to 35% in different studies) still remains undiagnosed [3]. Recent evidence suggests that cryptic pathogenic variants, which are not easily detectable with conventional diagnostic routine, may account for a relevant subset of allegedly “negative” cases, both in JS as well as in other inherited rare diseases [7,8,9,10,11,12,13]. Indeed, it has been reckoned that between 9 and 30% of causative variants in patients with rare genetic disorders might act through disruption of splicing [27], yet only 8.6% (24,976/289,000) of all variants reported in the Human Gene Mutation Database (HGMD) are splicing mutations [28], suggesting they may be extensively underestimated. In this line, a recent study performed deep targeted sequencing of the genomic region encompassing the ABCA4 gene in a cohort of 67 patients with retinal dystrophy carrying single coding pathogenetic variants, leading to the identification of nine distinct cryptic variants in 21 probands, with a diagnostic gain of 31% [29].
In the present study, ES reanalysis in 26 JS patients heterozygous for a pathogenic coding variant led to a diagnostic gain of 54% (14 out of 26), including intragenic CNVs in 6 patients (23%) and cryptic variants affecting splicing in 8 (31%). When considering the whole cohort of 70 ES-negative cases, a definite diagnosis was reached in 17 patients, resulting in an overall diagnostic yield of 24%. Notably, the vast majority of detected variants fell within three major JS-genes, namely KIAA0586, CC2D2A and CPLANE1, and some of them recurred in multiple unrelated patients, suggesting a founder effect [25]. It is worth noting that the identification of all variants reported in this study was obtained only through a reanalysis of available ES data, without performing additional wet-lab experiments, such as GS or RNA sequencing. This shows that a focused implementation of bioinformatic pipelines may represent a powerful approach adoptable in the diagnostic setting to disclose not only CNVs but also a subset of intronic variants, which will likely significantly increase the diagnostic yield, especially when a single pathogenic coding variant had already been identified. This could be particularly relevant for variants affecting splicing, for which targeted therapeutic strategies based on antisense oligonucleotides are being developed to effectively revert the splicing defect, at least in accessible tissues such as the retina. It is clearly emerging how this approach can hold a great potential for treating retinal dystrophy, a severe condition which occurs in about one third of JS patients. Several clinical trials have given promising results [30], among which the trial with the splicing-modulating antisense oligonucleotide Sepofarsen, which targets the common intronic splice-site variant c.2991+1655 A > G in the CEP290 gene [31, 32].
We acknowledge the fact that certain variants, such as the intragenic CNVs and the CPLANE1 c.8471-4_8471-3del splicing variant, should not be considered “cryptic” anymore, as they would have been detected in current diagnostic settings using up-to-date bioinformatics pipelines. Algorithms to detect CNVs from ES data were only recently implemented in most diagnostic labs, thanks to the development of some user-friendly bioinformatics tools, resulting in a significant increase of the diagnostic yield for several rare genetic disorders [33]. However, even in a recent past such variants were regularly missed, as appropriate CNV callers were lacking. Also the CPLANE1 splicing variant c.8471-4_8471-3del was not detected in the initial ES analysis, likely due to early limitations in correctly classifying splicing variants outside the canonical donor and acceptor sites. Thus, besides disclosing “true cryptic” variants, a targeted reanalysis of ES data may also highlight variants which the original analysis failed to detect, due to its limited sensitivity and robustness in correctly detecting and classifying certain types of variants. Indeed, in our cohort all these variants were disclosed in exomes that had been formerly analyzed some years ago. This highlights the importance of periodically re-analyzing old negative ES using updated tools, as this simple and fast strategy is likely to significantly reduce missing heritability, by detecting not only “true cryptic” variants, but also “former cryptic” variants missed by previous analyses. Similarly to coding variants, both CNVs and intronic variants also require validation through alternative techniques. While CNVs can be easily validated by real time PCR on genomic DNA, demonstration of the effect of putative splicing variants at the RNA level is crucial and requires RNA obtained from fresh blood or cell cultures. As obtaining these additional samples is not always possible, we confirm the minigene technique as a reliable, cost-effective, and relatively fast alternative strategy to validate the effect of splicing variants, in line with former evidences from the literature [34, 35]. In the present study, only one out of eight candidate intronic variants predicted to affect splicing could not be validated as pathogenic. This high success rate confirms the reliability of the proposed approach to identify a substantial subset of cryptic variants, suggesting this strategy can also be adopted for other rare disorders with hidden heritability.
Alternative strategies will be required to solve the remaining negative cases, whose second hits may be represented by deeper intronic variants or by structural variants affecting regulatory elements or disrupting topologically associated domains, which are truly undetectable by ES and whose disclosure requires distinct approaches such as GS, RNA sequencing, Hi-C, Optical Genome Mapping or long-read sequencing [36]. Another relevant contribution towards an increase in the diagnostic yield will likely come from improvements in the classification and functional validation of variants of unknown significance, especially missense hypomorphic variants, which we and others showed to occur in JS and other ciliopathies, and to bear a pathogenic impact despite low pathogenicity prediction scores [25, 37]. Finally, we would like to mention that negative genetic testing may also result from an incorrect recognition of the MTS, wrongly addressing patients affected by distinct rare neurodevelopmental disorders to JS gene panel testing [38]. Despite the significant progresses made by genetic technologies, a careful clinical assessment of patients still remains an essential step for a successful diagnostic process.
Data availability
The authors declare that all data generated or analyzed during this study and supporting the findings of the study are available within the paper and its supplementary information files.
References
Bachmann-Gagescu R, Dempsey JC, Bulgheroni S, Chen ML, D’Arrigo S, Glass IA, et al. Healthcare recommendations for Joubert syndrome. Am J Med Genet A. 2020;182:229–49.
Van De Weghe JC, Gomez A, Doherty D. The Joubert–Meckel–Nephronophthisis Spectrum of Ciliopathies. Annu Rev Genom Hum Genet. 2022;23:301–29.
Gana S, Serpieri V, Valente EM. Genotype–phenotype correlates in Joubert syndrome: a review. Am J Med Genet C Semin Med Genet. 2022;190:72–88.
French JD, Edwards SL. The role of noncoding variants in heritable disease. Trends Genet. 2020;36:880–91.
Spielmann M, Mundlos S. Looking beyond the genes: the role of non-coding variants in human disease. Hum Mol Genet. 2016;25:R157–65.
den Hollander AI, Koenekoop RK, Yzer S, Lopez I, Arends ML, Voesenek KEJ, et al. Mutations in the CEP290 (NPHP6) gene are a frequent cause of leber congenital amaurosis. Am J Hum Genet. 2006;79:556–61.
Shen Y, Lu C, Cheng T, Cao Z, Chen C, Ma X, et al. A novel 1.38-kb deletion combined with a single nucleotide variant in KIAA0586 as a cause of Joubert syndrome. BMC Med Genom. 2023;16:4.
Webb TR, Parfitt DA, Gardner JC, Martinez A, Bevilacqua D, Davidson AE, et al. Deep intronic mutation in OFD1, identified by targeted genomic next-generation sequencing, causes a severe form of X-linked retinitis pigmentosa (RP23). Hum Mol Genet. 2012;21:3647–54.
Fei H, Wu Y, Wang Y, Zhang J. Exome sequencing and RNA analysis identify two novel CPLANE1 variants causing Joubert syndrome. Mol Genet Genom Med. 2022;29:10.
Hiraide T, Shimizu K, Okumura Y, Miyamoto S, Nakashima M, Ogata T, et al. A deep intronic TCTN2 variant activating a cryptic exon predicted by SpliceRover in a patient with Joubert syndrome. J Hum Genet. 2023;68:499–505.
Kurolap A, Mory A, Simchoni S, Krajden Haratz K, Malinger G, Birnbaum R, et al. Upgrading an intronic TMEM67 variant of unknown significance to likely pathogenic through RNA studies and community data sharing. Prenat Diagn. 2022;42:1484–7.
Marshall AE, Lemire G, Liang Y, Davila J, Couse M, Boycott KM, et al. RNA sequencing reveals deep intronic CEP120 variant: a report of the diagnostic odyssey for two siblings with Joubert syndrome type 31. Am J Med Genet A. 2023;194:e63485.
Zhu H, Chen W, Ren H, Zhang Y, Niu Y, Wu D, et al. Non-classic splicing mutation in the CPLANE1 (C5orf42) gene cause Joubert syndrome in a fetus with severe craniocerebral dysplasia. Eur J Med Genet. 2021;64:104212.
Ellingford JM, Ahn JW, Bagnall RD, Baralle D, Barton S, Campbell C, et al. Recommendations for clinical interpretation of variants found in non-coding regions of the genome. Genome Med. 2022;14:73.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–24.
Durkie M, Cassidy EJ, Berry I, Owens M, Turnbull C, Scott RH, et al. ACGS best practice guidelines for variant classification in rare disease 2024. 12, Royal Devon University Healthcare NHS Foundation Trust, Exeter, EX2 5DW. 5. Division of Genetics and Epidemiology. 2024. Available from: https://www.acgs.uk.com
Plagnol V, Curtis J, Epstein M, Mok KY, Stebbings E, Grigoriadou S, et al. A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics. 2012;28:2747–54.
Talevich E, Shain AH, Botton T, Bastian BC. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput Biol. 2016;12:e1004873.
Desmet FO, Hamroun D, Lalande M, Collod-Béroud G, Claustres M, Béroud C. Human splicing finder: an online bioinformatics tool to predict splicing signals. Nucleic Acids Res. 2009;37:e67
Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF, Darbandi SF, Knowles D, Li YI, et al. Predicting splicing from primary sequence with deep learning. Cell. 2019;176:535–548.e24.
Zeng T, Li YI. Predicting RNA splicing from DNA sequence using Pangolin. Genome Biol. 2022;23:103.
Mancini C, Vaula G, Scalzitti L, Cavalieri S, Bertini E, Aiello C, et al. Megalencephalic leukoencephalopathy with subcortical cysts type 1 (MLC1) due to a homozygous deep intronic splicing mutation (c.895-226T>G) abrogated in vitro using an antisense morpholino oligonucleotide. Neurogenetics. 2012;13:205–14.
Sumathipala D, Strømme P, Gilissen C, Einarsen IH, Bjørndalen HJ, Server A, et al. Sudden death in epilepsy and ectopic neurohypophysis in Joubert syndrome 23 diagnosed using SNVs/indels and structural variants pipelines on WGS data: a case report. BMC Med Genet. 2020;21:96.
Malicdan MCV, Vilboux T, Stephen J, Maglic D, Mian L, Konzman D, et al. Mutations in human homologue of chicken talpid3 gene (KIAA0586) cause a hybrid ciliopathy with overlapping features of Jeune and Joubert syndromes. J Med Genet. 2015;52:830–9.
Serpieri V, Mortarini G, Loucks H, Biagini T, Micalizzi A, Palmieri I, et al. Recurrent, founder and hypomorphic variants contribute to the genetic landscape of Joubert syndrome. J Med Genet. 2023;60:885–93.
Zhang J, Wang L, Chen W, Duan J, Meng Y, Yang H, et al. Whole exome sequencing facilitated the diagnosis in four Chinese pediatric cases of Joubert syndrome related disorders. Am J Transl Res. 2022;14:5088–97.
Li Q, Wang Y, Pan Y, Wang J, Yu W, Wang X. Unraveling synonymous and deep intronic variants causing aberrant splicing in two genetically undiagnosed epilepsy families. BMC Med Genom. 2021;14:152.
Stenson PD, Mort M, Ball EV, Chapman M, Evans K, Azevedo L, et al. The Human Gene Mutation Database (HGMD®): optimizing its use in a clinical diagnostic or research setting. Hum Genet. 2020;139:1197–207.
Bauwens M, Garanto A, Sangermano R, Naessens S, Weisschuh N, de Zaeytijd J, et al. ABCA4-associated disease as a model for missing heritability in autosomal recessive disorders: novel noncoding splice, cis-regulatory, structural, and recurrent hypomorphic variants. Genet Med. 2019;21:1761–71.
Gemayel MC, Bhatwadekar AD, Ciulla T. RNA therapeutics for retinal diseases. Expert Opin Biol Ther. 2021;21:603–13.
Cideciyan AV, Jacobson SG, Drack AV, Ho AC, Charng J, Garafalo AV, et al. Effect of an intravitreal antisense oligonucleotide on vision in Leber congenital amaurosis due to a photoreceptor cilium defect. Nat Med. 2019;25:225–8.
Cideciyan AV, Jacobson SG, Ho AC, Swider M, Sumaroka A, Roman AJ, et al. Durable vision improvement after a single intravitreal treatment with antisense oligonucleotide in CEP290-LCA: replication in two eyes. Am J Ophthalmol Case Rep. 2023;32:101873.
Tilemis FN, Marinakis NM, Veltra D, Svingou M, Kekou K, Mitrakos A, et al. Germline CNV Detection through Whole-Exome Sequencing (WES) data analysis enhances resolution of rare genetic diseases. Genes. 2023;14:1490.
Okada E, Horinouchi T, Yamamura T, Aoto Y, Suzuki R, Ichikawa Y, et al. All reported non-canonical splice site variants in GLA cause aberrant splicing. Clin Exp Nephrol. 2023;27:737–46.
Weisschuh N, Buena-Atienza E, Wissinger B. Splicing mutations in inherited retinal diseases. Prog Retin Eye Res. 2021;80:100874.
Bronstein R, Capowski EE, Mehrotra S, Jansen AD, Navarro-Gomez D, Maher M, et al. A combined RNA-seq and whole genome sequencing approach for identification of non-coding pathogenic variants in single families. Hum Mol Genet. 2020;29:967–79.
Nikopoulos K, Cisarova K, Quinodoz M, Koskiniemi-Kuendig H, Miyake N, Farinelli P, et al. A frequent variant in the Japanese population determines quasi-Mendelian inheritance of rare retinal ciliopathy. Nat Commun. 2019;10:2884.
D’Abrusco F, Arrigoni F, Serpieri V, Romaniello R, Caputi C, Manti F, et al. Get Your Molar Tooth Right: Joubert Syndrome Misdiagnosis Unmasked by Whole-Exome Sequencing. Cerebellum. 2021;21:1144–50.
Acknowledgements
GZ and EBe are members of the European Reference Network for Rare Neurological Diseases.
Funding
This project was supported by Telethon Foundation Italy (GGP20070 to EMV), by the Italian Ministry of Health (Ricerca Finalizzata RF-2019-12369368 to EMV, RC 2023 line 1 to RB), and #NEXTGENERATIONEU (NGEU), funded by the Ministry of University and Research (MUR), National Recovery and Resilience Plan (NRRP), project MNESYS (PE0000006) – A Multiscale integrated approach to the study of the nervous system in health and disease (DN. 1553 11.10.2022 to EG).
Author information
Authors and Affiliations
Contributions
Conceptualization: FD, EMV, and EG; Data Collection: SG, RB, EBe, GZ, EBo, RB, RR, SS, VL, CC, FiM, SD, ADL, CG, JL, FeM, DPR, and FS; Formal Analysis: FD, VS, CMT, LC, and MB; Genetic and Molecular Investigation: FD; Writing-original draft: FD; Writing-review and editing: FD, JG, EG, and EMV; Supervision: EMV and EG
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This research study adheres to the principles in the Declaration of Helsinki and was approved by the medical ethical committee of the University of Pavia (Nr. 20210017314 dated 13/05/2021). Written informed consent was obtained from parents or legal representatives of all the enrolled patients for clinical testing and publication of genetic and clinical data.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
D’Abrusco, F., Serpieri, V., Taccagni, C.M. et al. Pathogenic cryptic variants detectable through exome data reanalysis significantly increase the diagnostic yield in Joubert syndrome. Eur J Hum Genet 33, 72–79 (2025). https://doi.org/10.1038/s41431-024-01703-x
Received:
Revised:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41431-024-01703-x
This article is cited by
-
A CSPP1 variant associated with metabolic dysfunction in Joubert syndrome: a case report
Journal of Medical Case Reports (2025)
-
Welcome to 2025 from EJHG
European Journal of Human Genetics (2025)




