Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Communications
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. nature communications
  3. articles
  4. article
Long-read sequencing of families reveals increased germline and postzygotic mutation rates in repetitive DNA
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 09 March 2026

Long-read sequencing of families reveals increased germline and postzygotic mutation rates in repetitive DNA

  • Michelle D. Noyes1,
  • Yang Sui  ORCID: orcid.org/0000-0001-7285-87331,
  • Youngjun Kwon  ORCID: orcid.org/0000-0002-5024-21341,
  • Nidhi Koundinya  ORCID: orcid.org/0009-0008-7155-12871,
  • Isaac Wong  ORCID: orcid.org/0000-0003-4877-57481,
  • Katherine M. Munson  ORCID: orcid.org/0000-0001-8413-64981,
  • Kendra Hoekzema1,
  • Jennifer Kordosky1,
  • Gage H. Garcia  ORCID: orcid.org/0009-0005-2383-722X1,
  • Jordan Knuth  ORCID: orcid.org/0009-0007-0176-70931,
  • Alexandra P. Lewis1 &
  • …
  • Evan E. Eichler  ORCID: orcid.org/0000-0002-8246-40141,2 

Nature Communications , Article number:  (2026) Cite this article

  • 2573 Accesses

  • 2 Citations

  • 13 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Autism spectrum disorders
  • DNA sequencing
  • Genomics
  • Population genetics

Abstract

Long-read sequencing improves sensitivity to discover variation in complex repetitive regions, assign parent-of-origin, and distinguish germline from postzygotic mutations. We applied Illumina, Oxford Nanopore Technologies, and PacBio sequencing to discover and validate de novo mutations in 73 children from 42 autism families (157 individuals). We assay 2.77 Gbp of the human genome, yielding on average 95 de novo mutations per transmission (87.5 single-nucleotide substitutions, 7.8 indels), with no significant difference in mutation rate or profile between probands and their unaffected siblings. Long reads increase de novo mutation discovery by 20-40% and double the mutations classified as early embryonic. The germline mutation rate is 1.30×10−8 substitutions/base pair/generation; the postzygotic rate is 0.23×10−8. These rates are significantly increased in repetitive DNA, where segmental duplication mutability is dependent on length and percent identity. Here, we show that enrichment in repeats occurs predominantly postzygotically, likely resulting from faulty DNA repair and interlocus gene conversion.

Similar content being viewed by others

Human de novo mutation rates from a four-generation pedigree reference

Article Open access 23 April 2025

Long-read whole-genome analysis of human single cells

Article Open access 24 August 2023

A mutation rate model at the basepair resolution identifies the mutagenic effect of polymerase III transcription

Article 30 November 2023

Data availability

The data used for analysis, including underlying sequence data, assemblies, and alignment files, are available to approved researchers in the SFARI Base under the accession number SFARI_DS0000104 (https://base.sfari.org/dataset/DS0000104) and through the National Institute of Mental Health Data Archive (NDA) under Collection ID 3780. Source data are provided with this paper.

Code availability

Code and scripts used for the analyses presented in this manuscript are available in GitHub at https://github.com/mdnoyes/denovo_calling69. Any additional information required to reanalyze the data reported in this work paper is available from the lead contact upon request.

References

  1. Sasani, T. A. et al. Large, three-generation human families reveal post-zygotic mosaicism and variability in germline mutation accumulation. eLife 8, e46922 (2019).

    Google Scholar 

  2. Jónsson, H. et al. Parental influence on human germline de novo mutations in 1,548 trios from Iceland. Nature 549, 519–522 (2017).

    Google Scholar 

  3. Goldmann, J. M. et al. Parent-of-origin-specific signatures of de novo mutations. Nat. Genet. 48, 935–939 (2016).

    Google Scholar 

  4. Kong, A. et al. Rate of de novo mutations and the importance of father’s age to disease risk. Nature 488, 471–475 (2012).

    Google Scholar 

  5. Kessler, M. D. et al. De novo mutations across 1,465 diverse genomes reveal mutational insights and reductions in the Amish founder population. Proc. Natl. Acad. Sci. USA 117, 2560–2569 (2020).

    Google Scholar 

  6. Gao, Z. et al. Overlooked roles of DNA damage and maternal age in generating human germline mutations. Proc. Natl. Acad. Sci. USA 116, 9491–9500 (2019).

    Google Scholar 

  7. de Manuel, M., Wu, F. L. & Przeworski, M. A paternal bias in germline mutation is widespread in amniotes and can arise independently of cell division numbers. eLife 11, e80008 (2022).

    Google Scholar 

  8. Hahn, M. W., Peña-Garcia, Y. & Wang, R. J. The ‘faulty male’ hypothesis for sex-biased mutation and disease. Curr. Biol. 33, R1166–R1172 (2023).

    Google Scholar 

  9. Ju, Y. S. et al. Somatic mutations reveal asymmetric cellular dynamics in the early human embryo. Nature 543, 714–718 (2017).

    Google Scholar 

  10. Lindsay, S. J., Rahbari, R., Kaplanis, J., Keane, T. & Hurles, M. E. Similarities and differences in patterns of germline mutation between mice and humans. Nat. Commun. 10, 4053 (2019).

    Google Scholar 

  11. Porubsky, D. et al. Human de novo mutation rates from a four-generation pedigree reference. Nature 643, 427–436 (2025).

    Google Scholar 

  12. Jonsson, H. et al. Differences between germline genomes of monozygotic twins. Nat. Genet. 53, 27–34 (2021).

    Google Scholar 

  13. Huang, A. Y. et al. Postzygotic single-nucleotide mosaicisms in whole-genome sequences of clinically unremarkable individuals. Cell Res. 24, 1311–1327 (2014).

    Google Scholar 

  14. Acuna-Hidalgo, R. et al. Post-zygotic point mutations are an underrecognized source of de novo genomic variation. Am. J. Hum. Genet. 97, 67–74 (2015).

    Google Scholar 

  15. Wright, C. F. et al. Clinically-relevant postzygotic mosaicism in parents and children with developmental disorders in trio exome sequencing data. Nat. Commun. 10, 2985 (2019).

    Google Scholar 

  16. Dou, Y. et al. Postzygotic single-nucleotide mosaicisms contribute to the etiology of autism spectrum disorder and autistic traits and the origin of mutations. Hum. Mutat. 38, 1002–1013 (2017).

    Google Scholar 

  17. Lim, E. T. et al. Rates, distribution and implications of postzygotic mosaic mutations in autism spectrum disorder. Nat. Neurosci. 20, 1217–1224 (2017).

    Google Scholar 

  18. Telenti, A. et al. Deep sequencing of 10,000 human genomes. Proc. Natl. Acad. Sci. USA 113, 11901–11906 (2016).

    Google Scholar 

  19. Turner, T. N. et al. Genomic patterns of de novo mutation in simplex autism. Cell 171, 710–722.e12 (2017).

    Google Scholar 

  20. Wilfert, A. B. et al. Recent ultra-rare inherited variants implicate new autism candidate risk genes. Nat. Genet. 53, 1125–1134 (2021).

    Google Scholar 

  21. Noyes, M. D. et al. Familial long-read sequencing increases yield of de novo mutations. Am. J. Hum. Genet. 109, 631–646 (2022).

    Google Scholar 

  22. Ebbert, M. T. W. et al. Systematic analysis of dark and camouflaged genes reveals disease-relevant genes hiding in plain sight. Genome Biol. 20, 97 (2019).

    Google Scholar 

  23. Porubsky, D. & Eichler, E. E. A 25-year odyssey of genomic technology advances and structural variant discovery. Cell 187, 1024–1037 (2024).

    Google Scholar 

  24. Holt, G. S. et al. Phasing of de novo mutations using a scaled-up multiple amplicon long-read sequencing approach. Hum. Mutat. 43, 1545–1556 (2022).

    Google Scholar 

  25. Kucuk, E. et al. Comprehensive de novo mutation discovery with HiFi long-read sequencing. Genome Med. 15, 34 (2023).

    Google Scholar 

  26. Fu, J. M. et al. Rare coding variation provides insight into the genetic architecture and phenotypic context of autism. Nat. Genet. 54, 1320–1331 (2022).

    Google Scholar 

  27. An, J.-Y. et al. Genome-wide de novo risk score implicates promoter variation in autism spectrum disorder. Science 362, eaat6576 (2018).

    Google Scholar 

  28. Sui, Y. et al. Using the linear references from the pangenome to discover missing autism variants. Nat. Commun. 17, 1681 (2026).

    Google Scholar 

  29. McLaren, W. et al. The Ensembl variant effect predictor. Genome Biol. 17, 122 (2016).

    Google Scholar 

  30. Seplyarskiy, V. et al. A mutation rate model at the basepair resolution identifies the mutagenic effect of polymerase III transcription. Nat. Genet. 55, 2235–2242 (2023).

    Google Scholar 

  31. Koch, Z., Li, A., Evans, D. S., Cummings, S. & Ideker, T. Somatic mutation as an explanation for epigenetic aging. Nat. Aging 5, 709–719 (2025).

    Google Scholar 

  32. Michaelson, J. J. et al. Whole-genome sequencing in autism identifies hot spots for de novo germline mutation. Cell 151, 1431–1442 (2012).

    Google Scholar 

  33. Besenbacher, S. et al. Multi-nucleotide de novo mutations in humans. PLOS Genet. 12, e1006315 (2016).

    Google Scholar 

  34. Ng, J. K. & Turner, T. N. HAT: de novo variant calling for highly accurate short-read and long-read sequencing data. Bioinformatics 40, btad775 (2024).

    Google Scholar 

  35. Mitchell, E. et al. Clonal dynamics of haematopoiesis across the human lifespan. Nature 606, 343–350 (2022).

    Google Scholar 

  36. Horebeek, L. V., Dubois, B. & Goris, A. Somatic variants: new kids on the block in human immunogenetics. Trends Genet. 35, 935–947 (2019).

    Google Scholar 

  37. Fabre, M. A. & Vassiliou, G. S. The lifelong natural history of clonal hematopoiesis and its links to myeloid neoplasia. Blood 143, 573–581 (2024).

    Google Scholar 

  38. Bernstein, N. et al. Analysis of somatic mutations in whole blood from 200,618 individuals identifies pervasive positive selection and novel drivers of clonal hematopoiesis. Nat. Genet. 56, 1147–1155 (2024).

    Google Scholar 

  39. Harland, C. et al. Frequency of mosaicism points towards mutation-prone early cleavage cell divisions in cattle. Preprint at https://doi.org/10.1101/079863 (2017).

  40. Shoag, J. E. et al. Direct measurement of the male germline mutation rate in individuals using sequential sperm samples. Nat. Commun. 16, 2546 (2025).

    Google Scholar 

  41. Kunisaki, J. et al. Sperm from infertile, oligozoospermic men have elevated mutation rates. Preprint at https://doi.org/10.1101/2024.08.22.24312232 (2024).

  42. Braude, P., Bolton, V. & Moore, S. Human gene expression first occurs between the four- and eight-cell stages of preimplantation development. Nature 332, 459–461 (1988).

    Google Scholar 

  43. Musson, R., Gąsior, Ł, Bisogno, S. & Ptak, G. E. DNA damage in preimplantation embryos and gametes: specification, clinical relevance and repair strategies. Hum. Reprod. Update 28, 376–399 (2022).

    Google Scholar 

  44. Park, S. et al. Clonal dynamics in early human embryogenesis inferred from somatic mutation. Nature 597, 393–397 (2021).

    Google Scholar 

  45. Rahbari, R. et al. Timing, rates and spectra of human germline mutation. Nat. Genet. 48, 126–133 (2016).

    Google Scholar 

  46. Kiktev, D. A., Sheng, Z., Lobachev, K. S. & Petes, T. D. GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 115, E7109–E7118 (2018).

    Google Scholar 

  47. Harpak, A., Lan, X., Gao, Z. & Pritchard, J. K. Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates. Proc. Natl. Acad. Sci. USA 114, 12779–12784 (2017).

    Google Scholar 

  48. Vollger, M. R. et al. Increased mutation and gene conversion within human segmental duplications. Nature 617, 325–334 (2023).

    Google Scholar 

  49. Palsson, G. et al. Complete human recombination maps. Nature 639, 700–707 (2025).

    Google Scholar 

  50. Harris, K. & Nielsen, R. Error-prone polymerase activity causes multinucleotide mutations in humans. Genome Res. 24, 1445–1454 (2014).

    Google Scholar 

  51. Nagylaki, T. & Petes, T. D. Intrachromosomal gene conversion and the maintenance of sequence homogeneity among repeated genes. Genetics 100, 315–337 (1982).

    Google Scholar 

  52. Hallast, P. et al. Assembly of 43 human Y chromosomes reveals extensive complexity and variation. Nature 621, 355–364 (2023).

    Google Scholar 

  53. Rhie, A. et al. The complete sequence of a human Y chromosome. Nature 621, 344–354 (2023).

    Google Scholar 

  54. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://doi.org/10.48550/arXiv.1303.3997 (2013).

  55. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).

    Google Scholar 

  56. Poplin, R. et al. Scaling accurate genetic variant discovery to tens of thousands of samples. Preprint at https://doi.org/10.1101/201178 (2018).

  57. Poplin, R. et al. A universal SNP and small-indel variant caller using deep neural networks. Nat. Biotechnol. 36, 983–987 (2018).

    Google Scholar 

  58. Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open-3.0. https://www.repeatmasker.org/ (1996).

  59. Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).

    Google Scholar 

  60. Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).

    Google Scholar 

  61. Hinrichs, A. S. et al. The UCSC genome browser database: update 2006. Nucleic Acids Res. 34, D590–D598 (2006).

    Google Scholar 

  62. Wang, T. et al. Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders. Nat. Commun. 11, 4932 (2020).

    Google Scholar 

  63. Satterstrom, F. K. et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell 180, 568–584.e23 (2020).

    Google Scholar 

  64. Liu, X., Li, C., Mou, C., Dong, Y. & Tu, Y. dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs. Genome Med. 12, 103 (2020).

    Google Scholar 

  65. Orenbuch, R. et al. Proteome-wide model for human disease genetics. Nat. Genet. 57, 3165–3174 (2025).

    Google Scholar 

  66. Zhao, Y. et al. A probabilistic graphical model for estimating selection coefficients of nonsynonymous variants from human population sequence data. Nat. Commun. 16, 4670 (2025).

    Google Scholar 

  67. Chen, S. et al. A genomic mutational constraint map using variation in 76,156 human genomes. Nature 625, 92–100 (2024).

    Google Scholar 

  68. Hennick, K. et al. Sex differences in the developing human cortex intersect with genetic risk of neurodevelopmental disorders. Preprint at https://doi.org/10.1101/2025.09.04.674293 (2025).

  69. Noyes, Michelle D. Long-read sequencing of trios reveals increased germline and postzygotic mutation rates in repetitive DNA. mdnoyes/denovo_calling https://doi.org/10.5281/zenodo.18736932 (2026).

  70. Helgason, A. et al. The Y-chromosome point mutation rate in humans. Nat. Genet. 47, 453–457 (2015).

  71. Xue, Y. et al. Human Y Chromosome Base-Substitution Mutation Rate Measured by Direct Sequencing in a Deep-Rooting Pedigree. Curr. Biol. 19, 1453–1457 (2009).

  72. Kuroki, Y. et al. Comparative analysis of chimpanzee and human Y chromosomes unveils complex evolutionary pathway. Nat. Genet. 38, 158–167 (2006).

Download references

Acknowledgements

This work was supported, in part, by the US National Institutes of Health (NIH R01MH101221 to E.E.E.) and the Simons Foundation (SFARI #810018EE to E.E.E.). E.E.E. is an investigator of the Howard Hughes Medical Institute. We thank Tonia Brown for assistance in editing this manuscript. This article is subject to HHMI’s Open Access to Publications policy. HHMI lab heads have previously granted a nonexclusive CC BY 4.0 license to the public and a sublicensable license to HHMI in their research articles. Pursuant to those licenses, the author-accepted manuscript of this article can be made freely available under a CC BY 4.0 license immediately upon publication.

Author information

Authors and Affiliations

  1. Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA

    Michelle D. Noyes, Yang Sui, Youngjun Kwon, Nidhi Koundinya, Isaac Wong, Katherine M. Munson, Kendra Hoekzema, Jennifer Kordosky, Gage H. Garcia, Jordan Knuth, Alexandra P. Lewis & Evan E. Eichler

  2. Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA

    Evan E. Eichler

Authors
  1. Michelle D. Noyes
    View author publications

    Search author on:PubMed Google Scholar

  2. Yang Sui
    View author publications

    Search author on:PubMed Google Scholar

  3. Youngjun Kwon
    View author publications

    Search author on:PubMed Google Scholar

  4. Nidhi Koundinya
    View author publications

    Search author on:PubMed Google Scholar

  5. Isaac Wong
    View author publications

    Search author on:PubMed Google Scholar

  6. Katherine M. Munson
    View author publications

    Search author on:PubMed Google Scholar

  7. Kendra Hoekzema
    View author publications

    Search author on:PubMed Google Scholar

  8. Jennifer Kordosky
    View author publications

    Search author on:PubMed Google Scholar

  9. Gage H. Garcia
    View author publications

    Search author on:PubMed Google Scholar

  10. Jordan Knuth
    View author publications

    Search author on:PubMed Google Scholar

  11. Alexandra P. Lewis
    View author publications

    Search author on:PubMed Google Scholar

  12. Evan E. Eichler
    View author publications

    Search author on:PubMed Google Scholar

Contributions

E.E.E. and M.D.N. conceptualized the study. K.M.M., K.H., J. Kordosky, G.H.G., J. Knuth, and A.P.L. generated the data. Y.S., Y.K., I.W., and N.K. performed data quality control. M.D.N. and Y.S. conducted the formal analysis. M.D.N. created the visualizations. M.D.N. developed the methodology. M.D.N. wrote the original draft. M.D.N., Y.S., and E.E.E. reviewed and edited the manuscript. E.E.E. supervised the study.

Corresponding author

Correspondence to Evan E. Eichler.

Ethics declarations

Competing interests

E.E.E. is a scientific advisory board (SAB) member of Variant Bio, Inc. All other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Ryan Yuen, who co-reviewed with Mahreen Khan; Jeremy Guez and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Data 1-5 (download XLSX )

Reporting Summary (download PDF )

Transparent Peer Review file (download PDF )

Source data

Source Data (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Noyes, M.D., Sui, Y., Kwon, Y. et al. Long-read sequencing of families reveals increased germline and postzygotic mutation rates in repetitive DNA. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70342-1

Download citation

  • Received: 11 August 2025

  • Accepted: 25 February 2026

  • Published: 09 March 2026

  • DOI: https://doi.org/10.1038/s41467-026-70342-1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Videos
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Editors
  • Journal Information
  • Open Access Fees and Funding
  • Calls for Papers
  • Editorial Values Statement
  • Journal Metrics
  • Editors' Highlights
  • Contact
  • Editorial policies
  • Top Articles

Publish with us

  • For authors
  • For Reviewers
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Communications (Nat Commun)

ISSN 2041-1723 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research