Abstract
Recent advancements in single-cell DNA methylation (scDNAm) sequencing technologies have enabled the profiling of epigenetic landscapes at unprecedented resolution, offering insights into cellular heterogeneity, differentiation and evolution. Trajectory inference, which orders cells along pseudotime, allows researchers to track genomics changes across continuous cell states and identify key loci exhibiting differential methylation. However, no methods currently exist to model methylation changes along pseudotime in scDNAm data. Here, we present a hierarchical Bayesian framework for scDNAm data analysis. Our method, named mist (methylation inference for single-cell along trajectory), models stage-specific biological variations, identifies genomic features with significant methylation changes along pseudotime, and performs Differential Methylation (DM) analysis across phenotypical groups. Simulations demonstrate its superior accuracy in detecting DM genes along pseudotime compared to existing methods. Applied to multi-omics datasets of mouse embryonic development and developing human brain, mist identifies key developmental regulators, whose methylation patterns align with lineage transitions. mist is publicly available as an R/Bioconductor package.
Similar content being viewed by others
Data availability
The data for simulations and real-data analyses are available at https://doi.org/10.5281/zenodo.1845194067. The mouse embryos data is available under accession codes GSE121708 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE12170829. The human brain data is available through https://cells.ucsc.edu/?ds=brain-epigenome49. Source data are provided with this paper.
Code availability
mist is publicly available as a R/Bioconductor package at: https://bioconductor.org/packages/mist/. The package includes detailed vignettes and example datasets to guide users through typical workflows. Source code is hosted at https://github.com/dxd429/mistand is released under the MIT License. The specific version of the code associated with this publication is archived in https://doi.org/10.5281/zenodo.1845194067.
References
Deaton, A. M. & Bird, A. CpG islands and the regulation of transcription. Genes Dev. 25, 1010–1022 (2011).
Dor, Y. & Cedar, H. Principles of {DNA} methylation and their implications for biology and medicine. Lancet 392, 777–786 (2018).
Jiang, S. & Guo, Y. Epigenetic clock: DNA methylation in aging. Stem Cells Int. 2020, 1047896 (2020).
Horvath, S. & Raj, K. {DNA} methylation-based biomarkers and the epigenetic clock theory of ageing. Nat. Rev. Genet. 19, 371–384 (2018).
Matzke, M. A. & Mosher, R. A. {RNA}-directed {DNA} methylation: an epigenetic pathway of increasing complexity. Nat. Rev. Genet. 15, 394–408 (2014).
Kulis, M. & Esteller, M. {DNA} methylation and cancer. Adv. Genet. 70, 27–56 (2010).
Klutstein, M., Nejman, D., Greenfield, R. & Cedar, H. Dna methylation in cancer and aging. Cancer Res. 76, 3446–3450 (2016).
Koch, A. et al. Analysis of {DNA} methylation in cancer: location revisited. Nat. Rev. Clin. Oncol. 15, 459–466 (2018).
Beck, S. & Rakyan, V. K. The methylome: approaches for global {DNA} methylation profiling. Trends Genet. TIG 24, 231–237 (2008).
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for {Bisulfite}-{Seq} applications. Bioinformatics 27, 1571–1572 (2011).
Hansen, K. D., Langmead, B. & Irizarry, R. A. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 13, 1–10 (2012).
Jaffe, A. E. et al. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int. J. Epidemiol. 41, 200 (2012).
Akalin, A. et al. Methylkit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol. 13, R87 (2012).
Feng, H., Conneely, K. N. and Wu, H. A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data. Nucleic Acids Res. 42(8), https://doi.org/10.1093/nar/gku154 (2014).
Sun, D. et al. {MOABS}: model based analysis of bisulfite sequencing data. Genome Biol. 15, R38 (2014).
Guo, H. et al. Single-cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing. Genome Res. 23, 2126–2135 (2013).
Guo, H. et al. The {DNA} methylation landscape of human early embryos. Nature 511, 606–610 (2014).
Li, Q. et al. Heterogeneous {DNA} methylation status in same-cell subpopulations of ovarian cancer tissues. Tumour Biol. J. Int. Soc. Oncodev. Biol. Med. 39, 1010428317701650 (2017).
Hou, Y. et al. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas. Cell Res. 26, 304–319 (2016).
He, Y. et al. Spatiotemporal {DNA} methylome dynamics of the developing mouse fetus. Nature 583, 752–759 (2020).
Yao, Z. et al. A transcriptomic and epigenomic cell atlas of the mouse primary motor cortex. Nature 598, 103–110 (2021).
Liu, H. et al. {DNA} methylation atlas of the mouse brain at single-cell resolution. Nature 598, 120–128 (2021).
Luo, C. et al. Single nucleus multi-omics identifies human cortical cell regulatory genome diversity. Cell Genom. 2, 100106,100107 (2022).
Zhang, Z. et al. Epigenomic diversity of cortical projection neurons in the mouse brain. Nature 598, 167–173 (2021).
Kapourani, C.-A. & Sanguinetti, G. Melissa: {Bayesian} clustering and imputation of single-cell methylomes. Genome Biol. 20, 61 (2019).
Angermueller, C., Lee, H. J., Reik, W. & Stegle, O. {DeepCpG}: accurate prediction of single-cell {DNA} methylation states using deep learning. Genome Biol. 18, 67 (2017).
Zou, L. S. et al. {BoostMe} accurately predicts {DNA} methylation values in whole-genome bisulfite sequencing of multiple human tissues. BMC Genom. 19, 390 (2018).
Farlik, M. et al. Single-cell {DNA} methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics. Cell Rep. 10, 1386–1397 (2015).
Argelaguet, R. et al. Multi-omics profiling of mouse gastrulation at single-cell resolution. Nature 576, 487–491 (2019).
de Souza, C. et al. Epiclomal: {Probabilistic} clustering of sparse single-cell {DNA} methylation data. PLoS Comput. Biol. 16, e1008270 (2020).
Johnson, K. C. et al. Single-cell multimodal glioma analyses identify epigenetic regulators of cellular plasticity and environmental stress response. Nat. Genet. 53, 1456–1468 (2021).
Luo, C. et al. Single-cell methylomes identify neuronal subtypes and regulatory elements in mammalian cortex. Science 357, 600–604 (2017).
Hernando-Herraez, I. et al. Ageing affects {DNA} methylation drift and transcriptional cell-to-cell variability in mouse muscle stem cells. Nat. Commun. 10, 4361 (2019).
Kapourani, C. A., Argelaguet, R., Sanguinetti, G. & Vallejos, C. A. scMET: Bayesian modeling of DNA methylation heterogeneity at single-cell resolution. Genome Biol. 22, 1–21 (2021).
Jones, P. A. & Taylor, S. M. Cellular differentiation, cytidine analogs and {DNA} methylation. Cell 20, 85–93 (1980).
Razin, A. et al. Variations in {DNA} methylation during mouse cell differentiation in vivo and in vitro. Proc. Natl. Acad. Sci. USA 81, 2275–2279 (1984).
Parry, A., Rulands, S. & Reik, W. Active turnover of {DNA} methylation during cell fate decisions. Nat. Rev. Genet. 22, 59–66 (2021).
Dolzhenko, E. & Smith, A. D. Using beta-binomial regression for high-precision differential methylation analysis in multifactor whole-genome bisulfite sequencing experiments. BMC Bioinform. 15, 215 (2014).
Amat, M., Guignard, G., Llor, N. & Bosch, J. Access to enantiopure 4-substituted 1,5-aminoalcohols from phenylglycinol-derived δ-lactams: synthesis of {Haliclona} alkaloids. J. Org. Chem. 79, 2792–2802 (2014).
Cao, J. et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature 566, 496–502 (2019).
Street, K. et al. Slingshot: Cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genom. 19, 1–16 (2018).
Chen, H. et al. Single-cell trajectories reconstruction, exploration and mapping of omics data with {STREAM}. Nat. Commun. 10, 1903 (2019).
Chng, S. C., Ho, L., Tian, J. & Reversade, B. ELABELA: a hormone essential for heart development signals via the apelin receptor. Dev. Cell 27, 672–680 (2013).
Ho, L. et al. ELABELA is an endogenous growth factor that sustains hESC self-renewal via the PI3K/AKT pathway. Cell Stem Cell 17, 435–447 (2015).
Pauli, A. et al. Toddler: an embryonic signal that promotes cell movement via Apelin receptors. Science 343. https://doi.org/10.1126/SCIENCE.1248636 (2014).
Iwabuchi, K. A. et al. ECAT11/L1td1 is enriched in ESCs and rapidly activated during iPSC generation, but it is dispensable for the maintenance and induction of pluripotency. PloS ONE 6, https://doi.org/10.1371/JOURNAL.PONE.0020461 (2011).
Rowe, J. H. et al. Patients with CD3G mutations reveal a role for human CD3γ in Treg diversity and suppressive function. Blood 131, 2335–2344 (2018).
Garcillán, B. et al. CD3G or CD3D knockdown in mature, but not immature, T lymphocytes similarly cripples the human TCRαβ complex. Front. Cell Dev. Biol. 9. https://doi.org/10.3389/FCELL.2021.608490 (2021).
Heffel, M. G. et al. Temporally distinct 3D multi-omic dynamics in the developing human brain. Nature 635. https://doi.org/10.1038/S41586-024-08030-7 (2024).
Rhein, C. et al. Association of a CAMK2A genetic variant with logical memory performance and hippocampal volume in the elderly. Brain Res. Bull. 161, 13–20 (2020).
Yasuda, R., Hayashi, Y. & Hell, J. W. CaMKII: a central molecular organizer of synaptic plasticity, learning and memory. Nat. Rev. Neurosci. 23, 666–682 (2022).
Lisman, J., Yasuda, R. & Raghavachari, S. Mechanisms of CaMKII action in long-term potentiation. Nat. Rev. Neurosci. 13, 169 (2012).
Thomson, R. E. et al. Fgf receptor 3 activation promotes selective growth and expansion of occipitotemporal cortex. Neural Dev. 4. https://doi.org/10.1186/1749-8104-4-4 (2009).
Huang, J. Y. et al. Enhanced FGFR3 activity in postmitotic principal neurons during brain development results in cortical dysplasia and axonal tract abnormality. Sci. Rep. 10. https://doi.org/10.1038/S41598-020-75537-0 dec (2020).
Song, D. & Li, J. J. {PseudotimeDE}: inference of differential gene expression along cell pseudotime with well-calibrated p-values from single-cell {RNA} sequencing data. Genome Biol. 22, 124 (2021).
den Berge, K. et al. Trajectory-based differential expression analysis for single-cell sequencing data. Nat. Commun. 11, 1201 (2020).
Neufeld, A., Dharamshi, A., Gao, L. L., Witten, D. & Bien, J. Discussion of “Data fission: splitting a single data point”. J. Am. Stat. Assoc. 120, 151–157 (2025).
Setty, M. et al. Characterization of cell fate probabilities in single-cell data with Palantir. Nat. Biotechnol. 37, 451–460 (2019).
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Wolf, F. A. et al. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 20, 1–9 (2019).
Haghverdi, L., Büttner, M., Wolf, F. A., Buettner, F. & Theis, F. J. Diffusion pseudotime robustly reconstructs lineage branching. Nat. methods 13, 845–848 (2016).
Lange, M. et al. CellRank for directed single-cell fate mapping. Nat. methods 19, 159–170 (2022).
Weiler, P., Lange, M., Klein, M., Pe’er, D. & Theis, F. CellRank 2: unified fate mapping in multiview single-cell data. Nat. Methods 21, 1196–1205 (2024).
Schiebinger, G. et al. Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming. Cell 176, 928–943.e22 (2019).
Carvalho, C. M., Polson, N. G. & Scott, J. G. The horseshoe estimator for sparse signals. Biometrika 97, 465–480 (2010).
Neal, R. M. Slice sampling. 31, 705–767. https://doi.org/10.1214/AOS/1056562461 (2003).
dxd429. dxd429/mistdata: Data_code_mist_manuscript. https://doi.org/10.5281/zenodo.18451940 (2026).
Acknowledgements
We thank Dr. Chongyuan Luo for insightful discussions and for publicly sharing the snm3C-seq data from his lab. This work was supported by grants from the National Institutes of Health awarded [NIGMS R35GM154862 to H.F.]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Funding to pay the Open Access publication charges for this article was provided by NIH [R35GM154862].
Author information
Authors and Affiliations
Contributions
L.Z. and H.F. conceived the study. D.D. and W.M. performed in-silico simulation studies and real data analysis. W.T. acquired and processed real scDNAm datasets. D.D., L.Z., and H.F. wrote the manuscript. H.W. supervised the research and led discussions. All authors helped edit the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Duan, D., Ma, W., Tang, W. et al. mist: a hierarchical Bayesian framework for detecting differential DNA methylation dynamics in single-cell data. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70523-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-70523-y


