Abstract
The emergence of single-cell transcriptomics has given us novel views of gene expression heterogeneity and cellular trajectories in development and disease at unprecedented resolution. However, an overarching theoretical framework to interpret single-cell gene expression data is lacking. Here we argue that dynamical systems theory can provide an interpretable, causal, and quantitative perspective to understand and analyze these enormously rich data sets, in addition to yielding potential benefits for health care.
Similar content being viewed by others
A cell as a dynamical system
The number of possible states that a human cell could occupy in a roughly 20,000-dimensional gene expression space is immeasurably large. Yet, although the precise number is debated, there are likely no more than a few hundred distinct cell types in the human body1. One explanation for this relatively small number comes from dynamical systems theory2. In this perspective, the network of gene regulatory interactions in a cell generates a dynamical system, i.e., one whose evolution in time can be described by a set of coupled ordinary differential equations involving time derivatives. Such a system has a finite number of stable states or “attractors” that correspond to discrete cell types. Transitions between these attractors constitute trajectories followed by cells during physiological development or under various perturbations3. This “dynamical paradigm”4, the quantitative basis of the field of systems biology, has yielded rich insights into cell behavior and cell fate choice5,6,7,8,9,10.
Single-cell biology in light of dynamical systems theory
The emergence of single-cell genomics methods over the past decade has provided extremely rich biological data sets, giving us unprecedented views of cell-to-cell heterogeneity, cellular developmental trajectories, and disease-associated pathology11,12,13, culminating in the assembly of a human cell atlas14. In addition to their biological richness, these data sets have the potential to revolutionize our understanding of disease pathology and the development of remedies15. However, a theoretical framework to understand these enormous data sets is still lacking. Importantly, is there a causal mechanism that explains the underlying data-generative process?
Dynamical systems theory, by explaining the layout of cellular states and trajectories in gene expression space, can serve as this much-needed quantitative framework and help us make sense of single-cell data. A major advantage of a dynamical formulation is that the rate equations involved are mechanistic, intuitive, and easily interpretable, with rate terms based on mass-action, Michaelis–Menten, or Hill-type biochemical kinetics16,17. This advantage cannot be overstated in the modern data analytics landscape where hard-to-interpret “black box” models are the norm. Dynamical systems theory is also simpler than alternative theoretical frameworks in that it makes fewer assumptions on how cells traverse the high-dimensional gene expression space. As such, the dynamical systems framework is more likely to describe the underlying generative process for single-cell gene expression data and can therefore be more predictive of cellular developmental trajectories. It gives a clear sense of the “arrow of time” in cell fate choice, given that the rate terms in the governing equations define a resultant vector for each cell with specific magnitude and direction in gene expression space. Conceptual innovations like RNA velocity18,19,20, which can be interpreted as gene-specific temporal rate terms, fit comfortably into a dynamical systems perspective21.
The dynamical systems framework would also explain common structural features of two-dimensional projections of single-cell gene expression data, which tend to show cells clustered by type in stable states corresponding to specific attractors of the underlying gene regulatory network. The dynamical systems formalism is fully compatible with the observation that cells occupy a lower-dimensional “manifold” in the ambient high-dimensional gene expression space, given the restriction of cell states and trajectories to specific attractors and optimal transitional paths between them22. It can also serve as a theoretical basis for the burgeoning field of gene regulatory network inference from single-cell data23, given that these networks are essentially complex dynamical systems. We should note that there is a long history of data-driven modeling of protein signaling networks dating back to before the dawn of the current single-cell biology ere, often based on Bayesian approaches24,25,26,27. Below, we lay out possible approaches to derive dynamical rate equations for gene regulation from single-cell transcriptomics data (schematic in Fig. 1).
Single-cell transcriptomics produces high-resolution snapshots of gene expression (a) but does not directly generate mechanistic predictive models of cellular states and transitions. A dynamical systems model, expressed as a set of coupled ordinary differential equations (ODEs) describing the evolution in time of n genes xi (i = 1 to n) (b), can potentially be derived from single-cell data using dynamical systems inference tools. The derived models can be used to map putative cell states as computed attractors on a gene expression landscape (c) and predict physiological developmental trajectories (solid blue arrows) as well as potential pathways of cell fate reprogramming (dashed red arrow). Panels (a, c) created in https://BioRender.com.
Deriving dynamical system equations
The most common criticism of the dynamical systems approach for modeling cell behavior is methodological rather than conceptual; namely, that it is only viable for small gene regulatory networks with known structure and previously characterized parameters28. However, there is an established field of data-driven dynamical system inference going back at least two decades29,30,31,32, which may be useful for the discovery of the governing rate equations underlying single-cell dynamics. As an example, an elegant mathematical approach called “sparse identification of nonlinear dynamics” (SINDy) has been proposed to derive interpretable governing equations of a dynamical system from time-course data33. Given a library of user-defined functions of the system variables, SINDy performs sparse regression to derive a minimal set of functions that capture the system dynamics. Thus, one can derive from data both the form of the rate equations for gene regulatory networks and the parameters therein. Successive tools have built on SINDy to specifically derive the governing equations of reactive systems34, model implicit dynamics and rational nonlinearities35, and systems with smaller, noisy data sets36.
One problem here is that, given the destructive nature of current single-cell assays, we only have static, single-time-point snapshots of gene expression across a population of cells. However, one can make use of reordered “pseudotime” gene expression values along a cellular trajectory as a proxy for time-course data37. By providing a library of biologically plausible functions of the mass action, Michaelis–Menten, or Hill form to SINDy, we can potentially derive rate equations for the temporal evolution of the transcriptome of cells along a trajectory. The predictive power of the rate equations can then be tested on a different trajectory that the model has not previously “seen”. This approach, by weighting the more significant regulatory interactions in the gene network, would also yield the structure of the network. The rate terms in the derived governing equations can then be used to generate a quantitative map of the much-invoked Waddington gene expression landscape, identifying attractors on the landscape as optimal states of cell occupancy, and transition paths between them (Fig. 1). Importantly, a quantitative derivation of the gene expression landscape using a dynamical systems approach does not require the modeled network to be a “gradient system” with a closed-form potential function, a condition that is not met by realistic gene regulatory networks. Even for non-gradient networks, an incremental “quasi-potential” (as opposed to a closed-form potential) can be defined along a cellular trajectory that, upon numerical integration, yields the surface of the gene expression landscape38. This deterministic quasi-potential shapes the “valleys” around attractors, thus imposing constraints on the extent of permissible noise for a cell to remain in a specific stable attractor state without switching to a different neighboring attractor.
Challenges in applying dynamical systems theory to single-cell transcriptomics
There are potential challenges in using the dynamical systems approach to analyze single-cell RNA-sequencing (scRNA-seq) data. Notably, these data sets are often sparse, and it may be difficult to distinguish true biological non-expression from “dropout” due to technical artifacts, which can complicate downstream analysis tasks. This problem can be addressed in part by specialized computation tools that have been developed to impute the expression of missing genes in single-cell data39,40,41. Secondly, a characteristic feature of single-cell data is the presence of cell-to-cell variability even among cells of the same type, arising from noisy gene expression. The dynamical systems approach can be modified to account for the stochasticity in gene regulatory networks arising from gene expression noise. The Fokker–Planck or Langevin approximations to the chemical master equation, or the Gillespie algorithm in case of low molecular copy number, explicitly model noise in the biochemical reactions underlying cellular dynamics42. Notably, recent extensions to the SINDy approach have been shown to be robust to the presence of sparsity and noise in the data36.
Alternative and complementary approaches to modeling single-cell gene expression dynamics
Other approaches have been proposed for predictive modeling of cellular dynamics without relying on explicit derivation of systems of ODEs describing gene regulation. Dynamo43 builds on the RNA velocity concept to reconstruct transcriptomic vector fields from sparse and noisy single-cell data, and successfully predicted optimal paths and potential regulators of cell state transitions, as well as the results of genetic perturbations. Another tool, Scribe44 derives causal gene regulatory networks by estimating the strength of regulator-target gene interactions. Scribe demonstrated limitations in using pseudotime for network inference, emphasizing the importance of temporal coupling between gene expression measurements. Velorama is another approach for gene regulatory network inference using graph neural networks and Granger causality45. Traditionally, dynamical systems analysis has been applied solely to gene expression data. In the light of recent developments in multi-omic technologies, this approach can be improved upon by using, for example, data from epigenomics assays. An interesting example of such data integration is CellOracle46, which combines single-cell ATAC-seq (chromatin accessibility) and scRNA-seq data to infer quantitative models of gene regulatory networks and simulate network perturbations.
Other broadly applicable approaches for dynamical system inference specifically take into account the risk of overfitting associated with very complex models when experimental data is limited. Sir Isaac47, for example, builds parsimonious coarse-grained models of network dynamics by using a combination of nonlinear differential equations with power-law kinetics, continuous time sigmoidal networks, and Bayesian inference. Another class of models uses machine learning to first reduce the dimensionality of the data into a smaller number of “latent dimensions” before deriving the governing equations of that reduced system48,49. In the context of gene regulatory network inference, it would be interesting to see if these inferred latent variables correspond to meaningful groups of genes and thus provide new biological insights. Other computational tools have focused on formulating potential-based approaches to model the gene expression landscape. Geometric models based on a gradient system assumption and the resulting potential function have been proposed to explore the underlying spatiotemporal dynamics of gene networks50. Very recently, biologically-constrained neural networks using the concept of a potential function have been used to infer two-dimensional gene expression landscapes51.
Practical implications
Dynamical systems theory, as a mechanistic and causal framework, can allow us to progress from the question of “What do cells do?” (alterations in gene expression, for example), to “How do they do it?” (rewiring of underlying signaling pathways and gene regulatory networks), to “Why do they do it?” (cells are driven by a quasi-potential that is progressively minimized along a developmental trajectory till they reach a stable attractor). The question of whether dynamical systems theory is an appropriate theoretical framework to understand and predict cell fate choice and cellular transitions is of more than academic interest. There are several practical consequences to choosing a particular quantitative basis to decipher cellular dynamics. Adoption of the dynamical systems formalism would allow for the application of powerful associated tools to the analysis of cellular trajectories in health and disease. For example, the prediction of critical state transitions in temporally evolving systems relies on the theory of bifurcations, a core tenet of dynamical systems theory52. Indeed, bifurcation theory has been applied fruitfully to single-cell data to demonstrate critical transitions in cell state during hematopoiesis and differentiation of pluripotent stem cells53,54. Additionally, the prediction of such “tipping points” in developmental trajectories would be extremely useful in the analysis of chemical- and drug-induced toxicity, where a central and unsettled question is whether dose-response curves are smooth and gradual, or whether they exhibit abrupt transitions at critical dose thresholds55,56. A mechanistic understanding of this conundrum is vital to drug development and the establishment of rigorous quantitative standards for chemical risk assessment in environmental and public health.
Data availability
No data sets were generated or analysed during the current study.
References
Jones, R. C. et al. The Tabula sapiens: a multiple-organ, single-cell transcriptomic atlas of humans. Science 376, eabl4896 (2022).
Strogatz, S. H. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering (Westview Press, 2014).
Huang, S., Eichler, G., Bar-Yam, Y. & Ingber, D. E. Cell fates as high-dimensional attractor states of a complex gene regulatory network. Phys. Rev. Lett. 94, 128701 (2005).
Tyson, J. J. & Novak, B. A dynamical paradigm for molecular cell biology. Trends Cell Biol. 30, 504–515 (2020).
Gómez-Schiavon, M., Montejano-Montelongo, I., Orozco-Ruiz, F. S. & Sotomayor-Vivas, C. The art of modeling gene regulatory circuits. npj Syst. Biol. Appl. 10, 60 (2024).
Ferrell, J. E. Jr & Machleder, E. M. The biochemical basis of an all-or-none cell fate switch in Xenopus oocytes. Science 280, 895–898 (1998).
Huang, C. Y. F. & Ferrell, J. E. Ultrasensitivity in the mitogen-activated protein kinase cascade. Proc. Natl Acad. Sci. USA 93, 10078–10083 (1996).
Tyson, J. J., Chen, K. C. & Novak, B. Sniffers, buzzers, toggles and blinkers: dynamics of regulatory and signaling pathways in the cell. Curr. Opin. Cell Biol. 15, 221–231 (2003).
Novák, B. & Tyson, J. J. A model for restriction point control of the mammalian cell cycle. J. Theor. Biol. 230, 563–579 (2004).
Tyson, J. J., Csikasz-Nagy, A. & Novak, B. The dynamics of cell cycle regulation. Bioessays 24, 1095–1109 (2002).
Griffiths, J. A., Scialdone, A. & Marioni, J. C. Using single-cell genomics to understand developmental processes and cell fate decisions. Mol. Syst. Biol. 14, 8046–8046 (2018).
Potter, S. S. Single-cell RNA sequencing for the study of development, physiology and disease. Nat. Rev. Nephrol. 14, 479–492 (2018).
Lim, B., Lin, Y. & Navin, N. Advancing cancer research and medicine with single-cell genomics. Cancer Cell 37, 456–470 (2020).
Yanai, I., Haas, S., Lippert, C. & Kretzmer, H. Cellular atlases are unlocking the mysteries of the human body. Nature 635, 553–555 (2024).
Rood, J. E., Maartens, A., Hupalowska, A., Teichmann, S. A. & Regev, A. Impact of the human cell atlas on medicine. Nat. Med. 28, 2486–2496 (2022).
Srinivasan, B. A guide to the Michaelis–Menten equation: steady state and beyond. FEBS J. 289, 6086–6098 (2022).
Frank, S. A. Input-output relations in biological systems: measurement, information and the Hill equation. Biol. Direct 8, 31 (2013).
Manno, G. L. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Bergen, V., Lange, M., Peidli, S., Wolf, F. A. & Theis, F. J. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 38, 1408–1414 (2020).
Bergen, V., Soldatov, R. A., Kharchenko, P. V. & Theis, F. J. RNA velocity-current challenges and future perspectives. Mol. Syst. Biol. 17, e10282 (2021).
Liu, R., Pisco, A. O., Braun, E., Linnarsson, S. & Zou, J. Dynamical systems model of RNA velocity improves inference of single-cell trajectory, pseudo-time and gene regulation. J. Mol. Biol. 434, 167606 (2022).
Moon, K. R. et al. Manifold learning-based methods for analyzing single-cell RNA-sequencing data. Curr. Opin. Syst. Biol. 7, 36–46 (2018).
Badia-i-Mompel, P. et al. Gene regulatory network inference in the era of single-cell multi-omics. Nat. Rev. Genet. https://doi.org/10.1038/s41576-023-00618-5 (2023).
Halasz, M., Kholodenko, B. N., Kolch, W. & Santra, T. Integrating network reconstruction with mechanistic modeling to predict cancer therapies. Sci. Signal. 9, ra114 (2016).
Rukhlenko, O. S. et al. Control of cell state transitions. Nature 609, 975–985 (2022).
Mukherjee, S. & Speed, T. P. Network inference using informative priors. Proc. Natl Acad. Sci. USA 105, 14313–14318 (2008).
Sachs, K., Perez, O., Pe’er, D., Lauffenburger, D. A. & Nolan, G. P. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308, 523–529 (2005).
Teschendorff, A. E. & Feinberg, A. P. Statistical mechanics meets single-cell biology. Nat. Rev. Genet. 22, 459–476 (2021).
McKinney, B. A. et al. Hybrid grammar-based approach to nonlinear dynamical system identification from biological time series. Phys. Rev. E 73, 021912 (2006).
Bongard, J. & Lipson, H. Automated reverse engineering of nonlinear dynamical systems. Proc. Natl Acad. Sci. USA 104, 9943–9948 (2007).
Timme, M. & Casadiego, J. Revealing networks from dynamics: an introduction. J. Phys. A Math. Theor. 47, 343001 (2014).
Casadiego, J., Nitzan, M., Hallerberg, S. & Timme, M. Model-free inference of direct network interactions from nonlinear collective dynamics. Nat. Commun. 8, 2192 (2017).
Brunton, S. L., Proctor, J. L., Kutz, J. N. & Bialek, W. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Natl Acad. Sci. USA 113, 3932–3937 (2016).
Hoffmann, M., Fröhner, C. & Noé, F. Reactive SINDy: discovering governing reactions from concentration data. J. Chem. Phys. 150, 025101 (2019).
Kaheman, K., Kutz, J. N. & Brunton, S. L. SINDy-PI: a robust algorithm for parallel implicit sparse identification of nonlinear dynamics. Proc. R. Soc. A 476, 20200279 (2020).
Fasel, U., Kutz, J. N., Brunton, B. W. & Brunton, S. L. Ensemble-SINDy: robust sparse model discovery in the low-data, high-noise limit, with active learning and control. Proc. R. Soc. A 478, 20210904 (2022).
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386 (2014).
Bhattacharya, S., Zhang, Q. & Andersen, M. E. A deterministic map of Waddington’s epigenetic landscape for cell fate specification. BMC Syst. Biol. 5, 85–85 (2011).
Van Dijk, D. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729.e27 (2018).
Li, W. V. & Li, J. J. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat. Commun. 9, 1–9 (2018).
Linderman, G. C. et al. Zero-preserving imputation of single-cell RNA-seq data. Nat. Commun. 13, 192 (2022).
Ullah, M. & Wolkenhauer, O. Stochastic approaches in systems biology. Wiley Interdiscip. Rev. Syst. Biol. Med. 2, 385–397 (2010).
Qiu, X., et al. Mapping transcriptomic vector fields of single cells. Cell 185, 690–711.e45 (2022).
Qiu, X. et al. Inferring causal gene regulatory networks from coupled single-cell expression dynamics using scribe. Cell Syst. 0, 265–274.e11 (2020).
Singh, R., Wu, A. P., Mudide, A. & Berger, B. Causal gene regulatory analysis with RNA velocity reveals an interplay between slow and fast transcription factors. Cell Syst. 15, 462–474.e5 (2024).
Kamimoto, K. et al. Dissecting cell identity via network inference and in silico gene perturbation. Nature 614, 742–751 (2023).
Daniels, B. C. & Nemenman, I. Automated adaptive inference of phenomenological dynamical models. Nat. Commun. 6, 8133 (2015).
Floryan, D. & Graham, M. D. Data-driven discovery of intrinsic dynamics. Nat. Mach. Intell. 4, 1113–1120 (2022).
Champion, K., Lusch, B., Kutz, J. N. & Brunton, S. L. Data-driven discovery of coordinates and governing equations. Proc. Natl Acad. Sci. USA 116, 22445–22451 (2019).
Rand, D. A., Raju, A., Sáez, M., Corson, F. & Siggia, E. D. Geometry of gene regulatory dynamics. Proc. Natl Acad. Sci. USA 118, e2109729118 (2021).
Howe, A. & Mani, M. Learning geometric models for developmental dynamics. Preprint at bioRxiv https://doi.org/10.1101/2024.09.21.614191 (2024).
Scheffer, M. et al. Early-warning signals for critical transitions. Nature 461, 53–59 (2009).
Freedman, S. L., Xu, B., Goyal, S. & Mani, M. A dynamical systems treatment of transcriptomic trajectories in hematopoiesis. Development 150, dev201280 (2023).
Sáez, M. et al. Statistically derived geometrical landscapes capture principles of decision-making dynamics during cell fate transitions. Cell Syst. 13, 12–28.e3 (2022).
Crump, K. S. Use of threshold and mode of action in risk assessment. Crit. Rev. Toxicol. 41, 637–650 (2011).
Rhomberg, L. R. et al. Linear low-dose extrapolation for noncancer heath effects is the exception, not the rule. Crit. Rev. Toxicol. 41, 1–21 (2011).
Acknowledgements
This work was partially supported by MSU AgBioResearch and the National Institute of Environmental Health Sciences of the National Institutes of Health (R01 ES031937 and P42 ES004911). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors gratefully acknowledge Qiang Zhang, Rory Conolly, Melvin Andersen, and Debajit Saha for discussions and feedback on the manuscript.
Author information
Authors and Affiliations
Contributions
S.B. and S.I. wrote the main manuscript text and designed Fig. 1.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Islam, S., Bhattacharya, S. Dynamical systems theory as an organizing principle for single-cell biology. npj Syst Biol Appl 11, 85 (2025). https://doi.org/10.1038/s41540-025-00565-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41540-025-00565-3