Abstract
Inherited retinal dystrophies (IRDs) are a genetically diverse group of vision loss disorders with over 360 implicated genes. However, 30-50% of cases remain unresolved after panel-based clinical testing and may benefit from exome or genome sequencing for a genetic diagnosis. To manage the extensive and analytically demanding datasets generated by genome sequencing, we developed ReDGAP (Retinal Degeneration Genome Analysis Pipeline), a phenotype-guided, semi-automated genome analysis pipeline that integrates clinical phenotyping with flexible variant scoring to prioritize variants of interest (https://github.com/vincentlab-la/ReDGAP). The pipeline supports the joint analysis of multiple variant classes, using an evidence-weighted scoring system informed by in silico predictors. Validation in eleven previously solved IRD cases achieved a 100% re-identification rate. Application to five unsolved cases yielded diagnoses in four (80%), including intronic variants in CRB1 and HGSNAT, a tandem duplication in OAT, and a 5′UTR deletion affecting a retina-specific promoter of RPGRIP1. Functional validation confirmed transcript-level disruptions in three variants, while computational analysis demonstrated regulatory impact in the fourth. Integrating phenotypic data with broad variant analysis offers a tailored model for improving IRD diagnostics, enabling timely molecular diagnoses and informing eligibility for emerging gene-targeted therapies. This positions ReDGAP as a tailored, clinically relevant model for investigating rare diseases within the evolving landscape of precision health.
Similar content being viewed by others
Data availability
Data supporting the findings of this study are available within the paper and its Supplementary Information.
Code availability
The underlying code for this pipeline is available on Github and can be accessed via the link https://github.com/vincentlab-la/ReDGAP.
References
Nash, B. M., Wright, D. C., Grigg, J. R., Bennetts, B. & Jamieson, R. V. Retinal dystrophies, genomic applications in diagnosis and prospects for therapy. Transl. Pediatr. 4, 139–163 (2015).
Bravo-Gil, N. et al. Improving the management of inherited retinal dystrophies by targeted sequencing of a population-specific gene panel. Sci. Rep. 6, 23910 (2016).
Daiger, S. P., Rossiter, B. J. F., Greenberg, J., Christoffels, A. & Hide, W. Data services and software for identifying genes and mutations causing retinal degeneration. Invest. Ophthalmol. Vis. Sci. 39, S295 (1998).
Genetic testing for Retinal Dystrophy. Blueprint Genetics https://blueprintgenetics.com/tests/panels/ophthalmology/retinal-dystrophy-panel/.
Rehm, H. L. et al. ACMG clinical laboratory standards for next-generation sequencing. Genet. Med. J. Am. Coll. Med. Genet. 15, 733–747 (2013).
Mustafi, D., Hisama, F. M., Huey, J. & Chao, J. R. The Current State of Genetic Testing Platforms for Inherited Retinal Diseases. Ophthalmol. Retin. 6, 702–710 (2022).
Pontikos, N. et al. Genetic basis of inherited retinal disease in a molecularly characterized cohort of more than 3000 families from the United Kingdom. Ophthalmology 127, 1384–1394 (2020).
Karali, M. et al. Genetic epidemiology of inherited retinal diseases in a large patient cohort followed at a single center in Italy. Sci. Rep. 12, 20815 (2022).
Perea-Romero, I. et al. Genetic landscape of 6089 inherited retinal dystrophies affected cases in Spain and their therapeutic and extended epidemiological implications. Sci. Rep. 11, 1526 (2021).
Weisschuh, N. et al. Genetic architecture of inherited retinal degeneration in Germany: a large cohort study from a single diagnostic center over a 9-year period. Hum. Mutat. 41, 1514–1527 (2020).
Hussain, H. M. J. et al. Novel pathogenic mutations identified from whole-genome sequencing in unsolved cases of patients affected with inherited retinal diseases. Genes 14, 447 (2023).
González-del Pozo, M. et al. A comprehensive WGS-based pipeline for the identification of new candidate genes in inherited retinal dystrophies. npj Genom. Med 7, 1–15 (2022).
Liu, X. et al. Whole genome sequencing enables new genetic diagnosis for inherited retinal diseases by identifying pathogenic variants. Npj Genom. Med. 9, 1–9 (2024).
Zeuli, R. et al. Whole genome sequencing identifies elusive variants in genetically unsolved Italian inherited retinal disease patients. Hum. Genet. Genomics Adv. 5, 100314 (2024).
Fadaie, Z. et al. Whole genome sequencing and in vitro splice assays reveal genetic causes for inherited retinal diseases. npj Genom. Med 6, 1–11 (2021).
Ellingford, J. M. et al. Whole genome sequencing increases molecular diagnostic yield compared with current diagnostic testing for inherited retinal disease. Ophthalmology 123, 1143–1150 (2016).
Stranneheim, H. et al. Integration of whole genome sequencing into a healthcare setting: high diagnostic rates across multiple clinical entities in 3219 rare disease patients. Genome Med 13, 40 (2021).
McClements, M. E., Elsayed, M. E. A. A., Major, L., de la Camara, C. M.-F. & MacLaren, R. E. Gene therapies in clinical development to treat retinal disorders. Mol. Diagn. Ther. 28, 575–591 (2024).
Janssen Research & Development, LLC. Phase 3 Follow-up Study of AAV5-hRKp.RPGR for the Treatment of X-Linked Retinitis Pigmentosa Associated With Variants in the RPGR Gene. https://clinicaltrials.gov/study/NCT04794101 (2025).
Luxturna | European Medicines Agency (EMA). https://www.ema.europa.eu/en/medicines/human/EPAR/luxturna (EMA, 2018).
Yang, P. et al. Subretinal gene therapy drug AGTC-501 for XLRP phase 1/2 multicenter study (HORIZON): 24-month safety and efficacy results. Am. J. Ophthalmol. 271, 268–285 (2025).
Oh, R., Woo, S. J. & Joo, K. Whole genome sequencing for inherited retinal diseases in the Korean National Project of Bio Big Data. Graefes Arch. Clin. Exp. Ophthalmol. Albrecht Von. Graefes Arch. Klin. Exp. Ophthalmol. 262, 1351–1359 (2024).
Wieder, N. et al. The role of untranslated region variants in Mendelian disease: a review. Eur. J. Hum. Genet. 33, 1096–1105 (2025).
Pais, L. S. et al. seqr: a web-based analysis and collaboration tool for rare disease genomics. Hum. Mutat. 43, 698–707 (2022).
Cipriani, V. et al. An improved phenotype-driven tool for rare mendelian variant prioritization: benchmarking exomiser on real patient whole-exome data. Genes 11, 460 (2020).
van der Velde, K. J. et al. GAVIN: gene-aware variant interpretation for medical sequencing. Genome Biol. 18, 6 (2017).
Franklin. https://franklin.genoox.com/clinical-db/home.
Quinodoz, M. et al. RetiGene, a comprehensive gene atlas for inherited retinal diseases (IRDs). BioRxiv Prepr. Serv. Biol. 2025.06.08.653722 (2025) https://doi.org/10.1101/2025.06.08.653722.
Martin, A. R. et al. PanelApp crowdsources expert knowledge to establish consensus diagnostic gene panels. Nat. Genet. 51, 1560–1565 (2019).
Haider, N. B. et al. Mutation of a nuclear receptor gene, NR2E3, causes enhanced S cone syndrome, a disorder of retinal cell fate. Nat. Genet. 24, 127–131 (2000).
Newman, H. et al. Homozygosity for a recessive loss-of-function mutation of the nrl gene is associated with a variant of enhanced s-cone syndrome. Invest. Ophthalmol. Vis. Sci. 57, 5361–5371 (2016).
Mitchell, G. A. et al. An initiator codon mutation in ornithine-delta-aminotransferase causing gyrate atrophy of the choroid and retina. J. Clin. Invest. 81, 630–633 (1988).
Sajovic, J. et al. The role of vitamin A in retinal diseases. Int. J. Mol. Sci. 23, 1014 (2022).
Pollard, K. S., Hubisz, M. J., Rosenbloom, K. R. & Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20, 110–121 (2010).
Choi, Y., Sims, G. E., Murphy, S., Miller, J. R. & Chan, A. P. Predicting the functional effect of amino acid substitutions and indels. PloS One 7, e46688 (2012).
Cheng, J. et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 381, eadg7492 (2023).
Gao, H. et al. The landscape of tolerated genetic variation in humans and primates. Science 380, eabn8153 (2023).
Defize, L. H. & de Laat, S. W. Structural and functional aspects of epidermal growth factor (EGF) and its receptor. Prog. Brain Res. 69, 169–182 (1986).
Schubach, M., Maass, T., Nazaretyan, L., Röner, S. & Kircher, M. CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions. Nucleic Acids Res 52, D1143–D1154 (2024).
Ioannidis, N. M. et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am. J. Hum. Genet. 99, 877–885 (2016).
Kawaji, H., Kasukawa, T., Forrest, A., Carninci, P. & Hayashizaki, Y. The FANTOM5 collection, a data series underpinning mammalian transcriptome atlases in diverse cell types. Sci. Data 4, 170113 (2017).
Jaganathan, K. et al. Predicting expression-altering promoter mutations with deep learning. Science 389, eads7373 (2025).
Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081 (2009).
Niroula, A. & Vihinen, M. How good are pathogenicity predictors in detecting benign variants?. PLoS Comput. Biol. 15, e1006481 (2019).
Jian, X., Boerwinkle, E. & Liu, X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res. 42, 13534–13544 (2014).
Qian, X., Aceves, N. & Chen, R. Systematic identification of deep-intronic splicing variants in a large cohort of patients with inherited retinal diseases. Invest. Ophthalmol. Vis. Sci. 63, 1601–A0390 (2022).
Riepe et al. Benchmarking deep learning splice prediction tools using functional splice assays. Hum. Mutat. 42, 799–810 (2021).
Kong, S. W. et al. Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort. MedRxiv Prepr. Serv. Health Sci. 2024.05.22.24307756 (2024) https://doi.org/10.1101/2024.05.22.24307756.
Vihinen, M. Problems in variation interpretation guidelines and in their implementation in computational tools. Mol. Genet. Genom. Med. 8, e1206 (2020).
Hanany, M. & Sharon, D. Allele frequency analysis of variants reported to cause autosomal dominant inherited retinal diseases question the involvement of 19% of genes and 10% of reported pathogenic variants. J. Med. Genet. 56, 536–542 (2019).
Robinson, P. N. et al. The human phenotype ontology: a tool for annotating and analyzing human hereditary disease. Am. J. Hum. Genet. 83, 610–615 (2008).
Smith, C. L., Goldsmith, C.-A. W. & Eppig, J. T. The mammalian phenotype ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biol. 6, R7 (2005).
Li, J. M., Tavares, E., Duncan, J. L., Vincent, A. & Héon, E. Novel BBS1 deletion and BBS9 nonsense pathogenic variant in Bardet-Biedl syndrome. Ophthalmic Genet 46, 406–412 (2025).
Moran, J. et al. IFT80 mutations cause a novel complex ciliopathy phenotype with retinal degeneration. Clin. Genet. 94, 368–372 (2018).
de Bruijn, S. E. et al. Structural variants create new topological-associated domains and ectopic retinal enhancer-gene contact in dominant retinitis pigmentosa. Am. J. Hum. Genet. 107, 802–814 (2020).
Tabbarah, S. et al. COG5 variants lead to complex early onset retinal degeneration, upregulation of PERK and DNA damage. Sci. Rep. 10, 21269 (2020).
Nitoiu, A. et al. Defective IFT57 underlies a novel cause of Bardet-Biedl syndrome. Hum. Mol. Genet. 34, 1108–1122 (2025).
Vig, A. et al. DYNC2H1 hypomorphic or retina-predominant variants cause nonsyndromic retinal degeneration. Genet. Med. J. Am. Coll. Med. Genet. 22, 2041–2051 (2020).
Dvaladze, A. et al. Deep intronic variant in MVK as a cause for mevalonic aciduria initially presenting as non-syndromic retinitis pigmentosa. Clin. Genet. 102, 524–529 (2022).
Di Scipio, M. et al. Phenotype driven analysis of whole genome sequencing identifies deep intronic variants that cause retinal dystrophies by aberrant exonization. Invest. Ophthalmol. Vis. Sci. 61, 36 (2020).
Khan, K. et al. CEP76 impairment at the centrosome-cilium interface contributes to a spectrum of ciliopathies. Sci. Adv. 11, eadw3717 (2025).
Gao, K., Masuda, A., Matsuura, T. & Ohno, K. Human branch point consensus sequence is yUnAy. Nucleic Acids Res. 36, 2257–2267 (2008).
Magalhães, A. et al. Macular changes in a mucopolysaccharidosis type I patient with earlier systemic therapies. Case Rep. Ophthalmol. Med. 2021, 8866837 (2021).
Haer-Wigman, L. et al. Non-syndromic retinitis pigmentosa due to mutations in the mucopolysaccharidosis type IIIC gene, heparan-alpha-glucosaminide N-acetyltransferase (HGSNAT). Hum. Mol. Genet. 24, 3742–3751 (2015).
Beryozkin, A. et al. Retinal degeneration associated with RPGRIP1: a review of natural history, mutation spectrum, and genotype–phenotype correlation in 228 patients. Front. Cell Dev. Biol. 9, 746781 (2021).
Abugessaisa, I. et al. FANTOM enters 20th year: expansion of transcriptomic atlases and functional annotation of non-coding RNAs. Nucleic Acids Res. 49, D892–D898 (2021).
Meylan, P., Dreos, R., Ambrosini, G., Groux, R. & Bucher, P. EPD in 2020: enhanced data visualization and extension to ncRNA promoters. Nucleic Acids Res. 48, D65–D69 (2020).
Torii, K. et al. The structural abnormalities are deeply involved in the cause of RPGRIP1-related retinal dystrophy in japanese patients. Int. J. Mol. Sci. 24, 13678 (2023).
VCV000010602.22 - ClinVar - NCBI. https://www.ncbi.nlm.nih.gov/clinvar/variation/10602/.
Thuma, T. B. T., Procopio, R. A., Jimenez, H. J., Gunton, K. B. & Pulido, J. S. Hypomorphic variants in inherited retinal and ocular diseases: a review of the literature with clinical cases. Surv. Ophthalmol. 69, 337–348 (2024).
Zernant, J. et al. Frequent hypomorphic alleles account for a significant fraction of ABCA4 disease and distinguish it from age-related macular degeneration. J. Med. Genet. 54, 404–412 (2017).
Kremer, L. S. et al. Genetic diagnosis of Mendelian disorders via RNA sequencing. Nat. Commun. 8, 15824 (2017).
Pedersen, B. S. et al. Effective variant filtering and expected candidate variant yield in studies of rare human disease. npj Genom. Med. 6, 60 (2021).
Holtgrewe, M. et al. VarFish: comprehensive DNA variant analysis for diagnostics and research. Nucleic Acids Res. 48, W162–W169 (2020).
Hombach, D. et al. MutationDistiller: user-driven identification of pathogenic DNA variants. Nucleic Acids Res 47, W114–W120 (2019).
Geoffroy, V. et al. AnnotSV: an integrated tool for structural variations annotation. Bioinformatics 34, 3572–3574 (2018).
Sangermano, R. et al. Identification of a novel large multigene deletion and a frameshift indel in PDE6B as the underlying cause of early-onset recessive rod–cone degeneration. Mol. Case Stud. 8, a006247 (2022).
Miller, D. T. et al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am. J. Hum. Genet. 86, 749–764 (2010).
Danis, D. et al. SvAnna: efficient and accurate pathogenicity prediction of coding and regulatory structural variants in long-read genome sequencing. Genome Med. 14, 44 (2022).
Leman, R. et al. Assessment of branch point prediction tools to predict physiological branch points and their alteration by variants. BMC Genomics 21, 86 (2020).
Khan, M. et al. Resolving the dark matter of ABCA4 for 1054 Stargardt disease probands through integrated genomics and transcriptomics. Genet. Med. J. Am. Coll. Med. Genet. 22, 1235–1246 (2020).
Steele-Stallard, H. B. et al. Screening for duplications, deletions and a common intronic mutation detects 35% of second mutations in patients with USH2A monoallelic mutations on Sanger sequencing. Orphanet J. Rare Dis. 8, 122 (2013).
Rajkumar, V. & Dumpa, V. Lysosomal Storage Disease. in StatPearls (StatPearls Publishing, 2025).
De Boeck, K., Zolin, A., Cuppens, H., Olesen, H. V. & Viviani, L. The relative frequency of CFTR mutation classes in European patients with cystic fibrosis. J. Cyst. Fibros. J. Eur. Cyst. Fibros. Soc. 13, 403–409 (2014).
Ramaswami, G. & Geschwind, D. H. Genetics of autism spectrum disorder. Handb. Clin. Neurol. 147, 321–329 (2018).
Rodan, L. H., Stoler, J., Chen, E., Geleske, T. & Council on Genetics Genetic evaluation of the child with intellectual disability or global developmental delay: clinical report. Pediatrics 156, e2025072219 (2025).
Manickam, K. et al. Exome and genome sequencing for pediatric patients with congenital anomalies or intellectual disability: an evidence-based clinical guideline of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. J. Am. Coll. Med. Genet. 23, 2029–2037 (2021).
DRAGEN DNA Pipeline. https://support-docs.illumina.com/SW/DRAGEN_v38/Content/SW/DRAGEN/GPipelineIntro_fDG.htm.
Zhu, M. et al. Using ERDS to infer copy-number variants in high-coverage genomes. Am. J. Hum. Genet. 91, 408–421 (2012).
Abyzov, A., Urban, A. E., Snyder, M. & Gerstein, M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21, 974–984 (2011).
Gardner, E. J. et al. The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology. Genome Res. 27, 1916–1929 (2017).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Collins, R. L. et al. A structural variation reference for medical and population genetics. Nature 581, 444–451 (2020).
C Yuen, R. K. et al. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 20, 602–611 (2017).
MacDonald, J. R., Ziman, R., Yuen, R. K. C., Feuk, L. & Scherer, S. W. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 42, D986–D992 (2014).
Perez, G. et al. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res. 53, D1243–D1249 (2025).
Acknowledgements
We thank the patients and their families for their participation in this study, and Bhooma Thiruvahindrapuram for technical support. LA and MM were awarded a scholarship from the Vision Science Research Program (Department of Ophthalmology and Vision Sciences, University of Toronto). This study was funded by the Foundation Fighting Blindness, USA [CDCMM-0224-0873-HSC-(AV)], Department of Ophthalmology Research Funds, Hospital for Sick Children Toronto), The Henry Brent Chair in Innovative Pediatric Ophthalmology Research and the Azrieli Precision Child Health Platform for ARD. The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.
Author information
Authors and Affiliations
Contributions
Conceptualization: L.A., E.T., J.M.L., K.A., C.R.M., E.H., and A.V. Patient phenotyping: G.N., E.H., and A.V. Dry lab strategy: L.A., E.T., J.M.L., A.D., C.R.M., and A.V. Wet lab strategy: L.A., E.T., J.M.L., K.A., T.P., G.C., and A.V. Dry and wet lab experiments: L.A., M.M., C.A., G.A.-S., R.O., K.G.-S., and G.C. Investigation and analysis: L.A., E.T., J.M.L., K.A., M.M., C.E., G.A.-S., R.O., A.R.D., T.P., G.C., and A.V. Manuscript Draft writing: L.A., E.T., K.A, M.M., and A.V. Manuscript Review and Editing: All authors. Funding acquisition: E.H. and A.V.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Ahmed, L., Tavares, E., Li, J.M. et al. A novel phenotype-guided genome analysis pipeline for variant discovery. npj Genom. Med. (2026). https://doi.org/10.1038/s41525-026-00557-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41525-026-00557-0


