Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

A phenome-wide association study of methylated GC-rich repeats identifies a GCC repeat expansion in AFF3 associated with intellectual disability

Abstract

GC-rich tandem repeat expansions (TREs) are often associated with DNA methylation, gene silencing and folate-sensitive fragile sites, and underlie several congenital and late-onset disorders. Through a combination of DNA-methylation profiling and tandem repeat genotyping, we identified 24 methylated TREs and investigated their effects on human traits using phenome-wide association studies in 168,641 individuals from the UK Biobank, identifying 156 significant TRE–trait associations involving 17 different TREs. Of these, a GCC expansion in the promoter of AFF3 was associated with a 2.4-fold reduced probability of completing secondary education, an effect size comparable to several recurrent pathogenic microdeletions. In a cohort of 6,371 probands with neurodevelopmental problems of suspected genetic etiology, we observed a significant enrichment of AFF3 expansions compared with controls. With a population prevalence that is at least fivefold higher than the TRE that causes fragile X syndrome, AFF3 expansions represent a major cause of neurodevelopmental delay.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Confirmation of expanded GC-rich TRs as the underlying cause of rare gains of methylation at 24 loci in the human genome.
The alternative text for this image may have been generated using AI.
Fig. 2: Validation of GC-rich repeat expansions using long-read sequencing.
The alternative text for this image may have been generated using AI.
Fig. 3: Results of phenome-wide association analysis for expansions of six GC-rich tandem repeats.
The alternative text for this image may have been generated using AI.
Fig. 4: AFF3 expansions are enriched specifically in unsolved probands with ID in the 100kGP.
The alternative text for this image may have been generated using AI.
Fig. 5: PacBio HiFi GS data of two trios carrying AFF3 expansions.
The alternative text for this image may have been generated using AI.
Fig. 6: Relative effects of the AFF3 expansion on educational attainment compared with recurrent microdeletions/duplications.
The alternative text for this image may have been generated using AI.
Fig. 7: SNVs that associate with presence of the AFF3 repeat expansion are the same as those identified in previous GWASs of intelligence, cognitive ability and educational attainment.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

Project MinE DNA methylation data are available from the European Genome–phenome Archive via accession no. EGAS00001004587. Project MinE sequencing data are accessible to approved researchers via an online data request: https://www.projectmine.com/research/data-sharing. The GTEx DNA methylation data are available from https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE213478, the GTEx normalized expression level data from https://storage.googleapis.com/adult-gtex/bulk-gex/v8/rna-seq/GTEx_Analysis_2017-06-05_v8_RNASeQCv1.1.9_gene_tpm.gct.gz and the GTEx allele-specific expression data from https://www.google.com/url?q=https://console.cloud.google.com/storage/browser/fc-secure-ff8156a3-ddf3-42e4-9211-0fd89da62108/GTEx_Analysis_2017-06-05_v8_ASE_WASP_counts_by_subject/&sa=D&source=docs&ust=1678389406863768&usg=AOvVaw1McwIkcmsHEds7G5NSMcf6. The UKB Data Showcase is available from https://biobank.ctsu.ox.ac.uk/crystal/search.cgi. Access to the tandem repeat genotypes generated here will be provided by the UKB (https://www.ukbiobank.ac.uk). PacBio genome sequences will be available to approved researchers via the Genomics England research environment. Information on how to access the 100 kGP data by joining a Genomics England Clinical Interpretation Partnership is available online: https://www.genomicsengland.co.uk/join-a-gecip-domain. AoU Researcher Workbench is available from https://www.researchallofus.org/data-tools/workbench. Genotypes generated from AoU participants cannot be shared owing to the terms of the Data Access Agreement under which we accessed AoU GS data.

Code availability

Code utilized for the present study is available as follows: ExpansionHunter classifier, https://zenodo.org/doi/10.5281/zenodo.10821643 (ref. 76); Global ancestry assignment, https://zenodo.org/doi/10.5281/zenodo.10820994 (ref. 73); pbmm2, https://github.com/PacificBiosciences/pbmm2; TRGT and TRGT-denovo, https://zenodo.org/doi/10.5281/zenodo.10839644 (ref. 81); Repeat catalogs for TRGT, https://doi.org/10.5281/zenodo.7987365 (ref. 86).

References

  1. Kremer, E. J. et al. Mapping of DNA instability at the fragile X to a trinucleotide repeat sequence p(CCG)n. Science 252, 1711–1714 (1991).

    Article  CAS  PubMed  Google Scholar 

  2. Oberlé, I. et al. Instability of a 550-base pair DNA segment and abnormal methylation in fragile X syndrome. Science 252, 1097–1102 (1991).

    Article  PubMed  Google Scholar 

  3. La Spada, A. R., Wilson, E. M., Lubahn, D. B., Harding, A. E. & Fischbeck, K. H. Androgen receptor gene mutations in X-linked spinal and bulbar muscular atrophy. Nature 352, 77–79 (1991).

    Article  PubMed  Google Scholar 

  4. Depienne, C. & Mandel, J.-L. 30 years of repeat expansion disorders: What have we learned and what are the remaining challenges? Am. J. Hum. Genet. 108, 764–785 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Boivin, M. & Charlet-Berguerand, N. Trinucleotide CGG repeat diseases: an expanding field of polyglycine proteins? Front. Genet. 13, 843014 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Hagerman, R. J. et al. Fragile X syndrome. Nat. Rev. Dis. Prim. 3, 17065 (2017).

    Article  PubMed  Google Scholar 

  7. Hagerman, P. J. & Hagerman, R. J. The fragile-X premutation: a maturing perspective. Am. J. Hum. Genet. 74, 805–816 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Jacquemont, S. et al. Penetrance of the fragile X-associated tremor/ataxia syndrome in a premutation carrier population. JAMA 291, 460–469 (2004).

    Article  CAS  PubMed  Google Scholar 

  9. Murray, A., Webb, J., Grimley, S., Conway, G. & Jacobs, P. Studies of FRAXA and FRAXE in women with premature ovarian failure. J. Med. Genet. 35, 637–640 (1998).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Hagerman, R. J. et al. Intention tremor, parkinsonism, and generalized brain atrophy in male carriers of fragile X. Neurology 57, 127–130 (2001).

    Article  CAS  PubMed  Google Scholar 

  11. Knight, S. J. et al. Trinucleotide repeat amplification and hypermethylation of a CpG island in FRAXE mental retardation. Cell 74, 127–134 (1993).

    Article  CAS  PubMed  Google Scholar 

  12. Ritchie, R. J. et al. The cloning of FRAXF: trinucleotide repeat expansion and methylation at a third fragile site in distal Xqter. Hum. Mol. Genet. 3, 2115–2121 (1994).

    Article  CAS  PubMed  Google Scholar 

  13. DeJesus-Hernandez, M. et al. Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron 72, 245–256 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Renton, A. E. et al. A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD. Neuron 72, 257–268 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Deng, J. et al. Expansion of GGC repeat in GIPC1 is associated with oculopharyngodistal myopathy. Am. J. Hum. Genet. 106, 793–804 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. LaCroix, A. J. et al. GGC repeat expansion and exon 1 methylation of XYLT1 is a common pathogenic variant in Baratela–Scott syndrome. Am. J. Hum. Genet. 104, 35–44 (2019).

    Article  CAS  PubMed  Google Scholar 

  17. Lafrenière, R. G. et al. Unstable insertion in the 5′ flanking region of the cystatin B gene is the most common mutation in progressive myoclonus epilepsy type 1, EPM1. Nat. Genet. 15, 298–302 (1997).

    Article  PubMed  Google Scholar 

  18. Sone, J. et al. Long-read sequencing identifies GGC repeat expansions in NOTCH2NLC associated with neuronal intranuclear inclusion disease. Nat. Genet. 51, 1215–1221 (2019).

    Article  CAS  PubMed  Google Scholar 

  19. Garg, P. et al. A survey of rare epigenetic variation in 23,116 human genomes identifies disease-relevant epivariations and CGG expansions. Am. J. Hum. Genet. 107, 654–669 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Rajan-Babu, I.-S. et al. Genome-wide sequencing as a first-tier screening test for short tandem repeat expansions. Genome Med. 13, 126 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Dolzhenko, E. et al. Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res. 27, 1895–1903 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Dolzhenko, E. et al. REViewer: haplotype-resolved visualization of read alignments in and around tandem repeats. Genome Med. 14, 84 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  23. Sarafidou, T. et al. Folate-sensitive fragile site FRA10A is due to an expansion of a CGG repeat in a novel gene, FRA10AC1, encoding a nuclear protein. Genomics 84, 69–81 (2004).

    Article  CAS  PubMed  Google Scholar 

  24. Metsu, S. et al. FRA2A is a CGG repeat expansion associated with silencing of AFF3. PLoS Genet. 10, e1004242 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  25. Winnepenninckx, B. et al. CGG-repeat expansion in the DIP2B gene is associated with the fragile site FRA12A on chromosome 12q13.1. Am. J. Hum. Genet. 80, 221–231 (2007).

    Article  CAS  PubMed  Google Scholar 

  26. Debacker, K. et al. The molecular basis of the folate-sensitive fragile site FRA11A at 11q13. Cytogenet. Genome Res. 119, 9–14 (2007).

    Article  CAS  PubMed  Google Scholar 

  27. Jones, C. et al. Association of a chromosome deletion syndrome with a fragile site within the proto-oncogene CBL2. Nature 376, 145–149 (1995).

    Article  CAS  PubMed  Google Scholar 

  28. Trost, B. et al. Genome-wide detection of tandem DNA repeats that are expanded in autism. Nature 586, 80–86 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Metsu, S. et al. A CGG-repeat expansion mutation in ZNF713 causes FRA7A: association with autistic spectrum disorder in two families. Hum. Mutat. 35, 1295–1300 (2014).

    CAS  PubMed  Google Scholar 

  30. Nancarrow, J. K. et al. Implications of FRA16A structure for the mechanism of chromosomal fragile site genesis. Science 264, 1938–1941 (1994).

    Article  CAS  PubMed  Google Scholar 

  31. Kobayashi, H. et al. Expansion of intronic GGCCTG hexanucleotide repeat in NOP56 causes SCA36, a type of spinocerebellar ataxia accompanied by motor neuron involvement. Am. J. Hum. Genet. 89, 121–130 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Cortese, A. et al. A CCG expansion in ABCD3 causes oculopharyngodistal myopathy in individuals of European ancestry. Nat. Commun. 15, 6327 (2023).

    Article  Google Scholar 

  33. GTEx Consortium et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).

    Article  PubMed Central  Google Scholar 

  34. Oliva, M. et al. DNA methylation QTL mapping across diverse human tissues provides molecular links between genetic variation and complex traits. Nat. Genet. 55, 112–122 (2023).

    Article  CAS  PubMed  Google Scholar 

  35. Beck, J. et al. Large C9orf72 hexanucleotide repeat expansions are seen in multiple neurodegenerative syndromes and are more frequent than expected in the UK population. Am. J. Hum. Genet. 92, 345–353 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Majounie, E. et al. Frequency of the C9orf72 hexanucleotide repeat expansion in patients with amyotrophic lateral sclerosis and frontotemporal dementia: a cross-sectional study. Lancet Neurol. 11, 323–330 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Zanovello, M. et al. Unexpected frequency of the pathogenic AR CAG repeat expansion in the general population. Brain J. Neurol. https://doi.org/10.1093/brain/awad050 (2023).

  38. Masrori, P. & Van Damme, P. Amyotrophic lateral sclerosis: a clinical review. Eur. J. Neurol. 27, 1918–1929 (2020).

    Article  CAS  PubMed  Google Scholar 

  39. Loh, M. L. et al. Mutations in CBL occur frequently in juvenile myelomonocytic leukemia. Blood 114, 1859–1863 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. 100,000 Genomes Project Pilot Investigators et al. 100,000 Genomes Pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med. 385, 1868–1880 (2021).

    Article  Google Scholar 

  41. Okbay, A. et al. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nat. Genet. 54, 437–449 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Hop, P. J. et al. Genome-wide study of DNA methylation shows alterations in metabolic, inflammatory, and cholesterol pathways in ALS. Sci. Transl. Med. 14, eabj0264 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).

    Article  CAS  PubMed  Google Scholar 

  44. Crowley, J. J. et al. Common-variant associations with fragile X syndrome. Mol. Psychiatry 24, 338–344 (2019).

    Article  CAS  PubMed  Google Scholar 

  45. van Rheenen, W. et al. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis. Nat. Genet. 48, 1043–1048 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  46. Afshari, N. A. et al. Genome-wide association study identifies three novel loci in Fuchs endothelial corneal dystrophy. Nat. Commun. 8, 14898 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Nicolas, A. et al. Genome-wide analyses identify KIF5A as a novel ALS gene. Neuron 97, 1268–1283.e6 (2018).

    Article  CAS  PubMed Central  Google Scholar 

  48. Cossée, M. et al. Evolution of the Friedreich’s ataxia trinucleotide repeat expansion: founder effect and premutations. Proc. Natl Acad. Sci. USA 94, 7452–7457 (1997).

    Article  PubMed  PubMed Central  Google Scholar 

  49. Squitieri, F. et al. DNA haplotype analysis of Huntington disease reveals clues to the origins and mechanisms of CAG expansion and reasons for geographic variations of prevalence. Hum. Mol. Genet. 3, 2103–2114 (1994).

    Article  CAS  PubMed  Google Scholar 

  50. Bampi, G. B. et al. Haplotype study in SCA10 families provides further evidence for a common ancestral origin of the mutation. Neuromol. Med. 19, 501–509 (2017).

    Article  CAS  Google Scholar 

  51. Verkerk, A. J. et al. Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell 65, 905–914 (1991).

    Article  CAS  PubMed  Google Scholar 

  52. Bassett, A. S. et al. Clinical features of 78 adults with 22q11 deletion syndrome. Am. J. Med. Genet. A 138, 307–313 (2005).

    Article  PubMed  PubMed Central  Google Scholar 

  53. Bernier, R. et al. Clinical phenotype of the recurrent 1q21.1 copy-number variant. Genet. Med. 8, 341–349 (2016).

    Article  Google Scholar 

  54. Mefford, H. C. et al. Recurrent rearrangements of chromosome 1q21.1 and variable pediatric phenotypes. N. Engl. J. Med. 359, 1685–1699 (2008).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Sharp, A. J. et al. A recurrent 15q13.3 microdeletion syndrome associated with mental retardation and seizures. Nat. Genet. 40, 322–328 (2008).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. van Bon, B. W. M. et al. Further delineation of the 15q13 microdeletion and duplication syndromes: a clinical spectrum varying from non-pathogenic to a severe outcome. J. Med. Genet. 46, 511–523 (2009).

    Article  PubMed  Google Scholar 

  57. Hunter, J. et al. Epidemiology of fragile X syndrome: a systematic review and meta-analysis. Am. J. Med. Genet. 164A, 1648–1658 (2014).

    Article  PubMed  Google Scholar 

  58. Gillentine, M. A., Lupo, P. J., Stankiewicz, P. & Schaaf, C. P. An estimation of the prevalence of genomic disorders using chromosomal microarray data. J. Hum. Genet. 63, 795–801 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Nolin, S. L. et al. Familial transmission of the FMR1 CGG repeat. Am. J. Hum. Genet. 59, 1252–1261 (1996).

    CAS  PubMed  PubMed Central  Google Scholar 

  60. Voisin, N. et al. Variants in the degron of AFF3 are associated with intellectual disability, mesomelic dysplasia, horseshoe kidney, and epileptic encephalopathy. Am. J. Hum. Genet. 108, 857–873 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Rodriguez, C. M. et al. A native function for RAN translation and CGG repeats in regulating fragile X protein synthesis. Nat. Neurosci. 23, 386–397 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Green, K. M. et al. RAN translation at C9orf72-associated repeat expansions is selectively enhanced by the integrated stress response. Nat. Commun. 8, 2005 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  63. Eichler, E. E. et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nat. Rev. Genet. 11, 446–450 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. Garg, P. & Sharp, A. J. Screening for rare epigenetic variations in autism and schizophrenia. Hum. Mutat. 40, 952–961 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  66. Dolzhenko, E. et al. ExpansionHunter Denovo: a computational method for locating known and novel repeat expansions in short-read sequencing data. Genome Biol. 21, 102 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  67. Willems, T. et al. Genome-wide profiling of heritable and de novo STR variations. Nat. Methods 14, 590–592 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Mitsuhashi, S., Frith, M. C. & Matsumoto, N. Genome-wide survey of tandem repeats by nanopore sequencing shows that disease-associated repeats are more polymorphic in the general population. BMC Med. Genom. 14, 17 (2021).

    Article  CAS  Google Scholar 

  69. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  71. Simpson, J. T. et al. Detecting DNA cytosine methylation using nanopore sequencing. Nat. Methods 14, 407–410 (2017).

    Article  CAS  PubMed  Google Scholar 

  72. Garg, P. et al. A phenome-wide association study identifies effects of copy-number variation of VNTRs and multicopy genes on multiple human traits. Am. J. Hum. Genet. 109, 1065–1076 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Bharati, J. Global-Ancestry-Assignment (1.0). Zenodo https://doi.org/10.5281/zenodo.10820994 (2024).

  74. Dolzhenko, E. et al. ExpansionHunter: a sequence-graph-based tool to analyze variation in short tandem repeat regions. Bioinformatics 35, 4754–4756 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Ibañez, K. et al. Whole genome sequencing for the diagnosis of neurological repeat expansion disorders in the UK: a retrospective diagnostic accuracy and prospective clinical validation study. Lancet Neurol. 21, 234–245 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  76. Bharati, J. ExpansionHunter Classifier (1.0). Zenodo https://doi.org/10.5281/zenodo.10821643 (2024).

  77. World Health Organization. ICD-10 : international statistical classification of diseases and related health problems: Tenth revision 2nd edn (World Health Organization, 2004).

  78. Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021).

    Article  CAS  PubMed  Google Scholar 

  79. Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Dolzhenko, E. et al. Characterization and visualization of tandem repeats at genome scale. Nat Biotechnol. https://doi.org/10.1038/s41587-023-02057-3 (2024).

  81. Mokveld, T. & Dolzhenko, E. TRGT-0.4.0 and TRGT-denovo-0.1.0. Zenodo https://doi.org/10.5281/zenodo.10839644 (2024).

  82. Pedersen, B. S. & Quinlan, A. R. Mosdepth: quick coverage calculation for genomes and exomes. Bioinformatics 34, 867–868 (2018).

    Article  CAS  PubMed  Google Scholar 

  83. Wetzel, A. S. & Darbro, B. W. A comprehensive list of human microdeletion and microduplication syndromes. BMC Genom. Data 23, 82 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  84. Kendall, K. M. et al. Cognitive performance among carriers of pathogenic copy number variants: analysis of 152,000 UK Biobank subjects. Biol. Psychiatry 82, 103–110 (2017).

    Article  PubMed  Google Scholar 

  85. Kar, S. P. et al. Genome-wide analyses of 200,453 individuals yield new insights into the causes and consequences of clonal hematopoiesis. Nat. Genet. 54, 1155–1166 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  86. Dolzhenko, E. & English, A. Repeat catalogs for TRGT (0.0.1) [Data set]. Zenodo https://doi.org/10.5281/zenodo.7987365 (2023).

Download references

Acknowledgements

We thank K. Temple, University of Southampton, UK, for her support. This work was supported by the National Institutes of Health (NIH) grants (nos. AG075051, NS105781, HD103782 and NS120241 to A.J.S.), National Heart, Lung, and Blood Institute Biodata Catalyst fellowship (no. 5120339 to A.M.T.), funding from the Prinses Beatrix Spierfonds (grant no. W.OR20-08 to J.J.F.A.V.), funding from the European Research Council under the European Union’s Horizon 2020 research and innovation program (grant no. 772376—EScORIAL to J.V.) and funding from UK Research and Innovation (grant no. MR/S006753/1), Barts charity (grant no. MGU0569) and a Medical Research Council (MRC) Clinician Scientist award (no. MR/S006753/1; all to A.T.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. This work was supported in part through the computational resources and staff expertise provided by Scientific Computing at the Icahn School of Medicine at Mount Sinai and supported by the Clinical and Translational Science Awards (grant no. UL1TR004419) from the National Center for Advancing Translational Sciences. Research reported in this paper was supported by the Office of Research Infrastructure of the NIH under award no. S10OD026880. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. This research was made possible through access to the data and findings generated by the 100kGP. The 100kGP is managed by Genomics England Limited (a company wholly owned by the Department of Health and Social Care). The 100kGP is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the MRC have also funded research infrastructure. The 100kGP uses data provided by patients and collected by the NHS as part of their care and support. The AoU Research Program is supported by the NIH, office of the director: regional medical centers: 1 OT2 OD026549, 1 OT2 OD026554, 1 OT2 OD026557, 1 OT2 OD026556, 1 OT2 OD026550, 1 OT2 OD 026552, 1 OT2 OD026553, 1 OT2 OD026548, 1 OT2 OD026551, 1 OT2 OD026555 and IAA no. AOD 16037; federally qualified health centers: HHSN 263201600085U; data and research center: 5 U2C OD023196; biobank: 1 U24 OD023121; the participant center: U24 OD023176; participant technology systems center: 1 U24 OD023163; communications and engagement: 3 OT2 OD023205 and 3 OT2 OD023206; and community partners: 1 OT2 OD025277, 3 OT2 OD025315, 1 OT2 OD025337 and 1 OT2 OD025276. In addition, the AoU Research Program would not be possible without the partnership of its participants.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

B.J., P.G., J.J.F.A.V., K.I., W.L., M.S., T.M. and E.D. designed bioinformatics pipelines and performed data analyses. D.G., M.J., A.M.T., S.L.G., G.A., C.R. and M.B. performed data analyses. N.L. and K.L. contributed clinical information. H.H., B.P., J.V. and A.T. supervised and advised on the project. A.J.S. conceived the study, performed data analyses, supervised the project and drafted the manuscript. All authors reviewed and approved the final draft.

Corresponding author

Correspondence to Andrew J. Sharp.

Ethics declarations

Competing interests

PacBio provided research support for the HiFi sequencing performed in the present study. E.D. and T.M. are employees and shareholders of PacBio. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks David Nelson, Christel Depienne and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jadhav, B., Garg, P., van Vugt, J.J.F.A. et al. A phenome-wide association study of methylated GC-rich repeats identifies a GCC repeat expansion in AFF3 associated with intellectual disability. Nat Genet 56, 2322–2332 (2024). https://doi.org/10.1038/s41588-024-01917-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41588-024-01917-1

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing