Abstract
Here we introduce a graph neural network architecture built on geometric vector perceptrons to predict the committor function directly from atomic coordinates, bypassing the need for hand-crafted collective variables. The method offers atom-level interpretability, pinpointing the key atomic players in complex transitions without relying on prior assumptions. Applied across diverse molecular systems, the method accurately infers the committor function and highlights the importance of each heavy atom in the transition mechanism. It also yields precise estimates of the rate constants for the underlying processes. The proposed approach assists in understanding and modeling complex dynamics, by enabling collective-variable-free learning and automated identification of physically meaningful reaction coordinates of complex molecular processes.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout


Similar content being viewed by others
Data availability
Due to the size of the dataset, all the data in this work are provided upon request. Nevertheless, the necessary configuration files for data reproducibility are available via Zenodo at https://doi.org/10.5281/zenodo.18259668 (ref. 49).
Code availability
All the codes generated in this work are available via Zenodo at https://doi.org/10.5281/zenodo.18259668 (ref. 49).
References
Peters, B. Reaction coordinates and mechanistic hypothesis tests. Annu. Rev. Phys. Chem. 67, 669–690 (2016).
E, W. & Vanden-Eijnden, E. Transition-path theory and path-finding algorithms for the study of rare events. Annu. Rev. Phys. Chem. 61, 391–420 (2010).
Onsager, L. Initial recombination of ions. Phys. Rev. 54, 554–557 (1938).
Berezhkovskii, A. & Szabo, A. One-dimensional reaction coordinates for diffusive activated rate processes in many dimensions. J. Chem. Phys. 122, 014503 (2005).
Roux, B. Transition rate theory, spectral analysis, and reactive paths. J. Chem. Phys. 156, 134111 (2022).
van Erp, T. S., Moroni, D. & Bolhuis, P. G. A novel path sampling method for the calculation of rate constants. J. Chem. Phys. 118, 7762–7774 (2003).
Best, R. B. & Hummer, G. Reaction coordinates and rates from transition paths. Proc. Natl Acad. Sci. USA. 102, 6732–6737 (2005).
Peters, B. & Trout, B. L. Obtaining reaction coordinates by likelihood maximization. J. Chem. Phys. 125, 054108 (2006).
Ma, A. & Dinner, A. R. Automatic method for identifying reaction coordinates in complex systems. J. Phys. Chem. B 109, 6769–6779 (2005).
Krivov, S. V. On reaction coordinate optimality. J. Chem. Theory Comput. 9, 135–146 (2013).
Rotskoff, G. M., Mitchell, A. R. & Vanden-Eijnden, E. Active importance sampling for variational objectives dominated by rare events: consequences for optimization and generalization. In Proc. 2nd Mathematical and Scientific Machine Learning Conference Vol. 145, (eds Bruna, J., Hesthaven, J. & Zdeborova, L.) 757–780 (PMLR, 2022).
Ribeiro, J. M. L., Bravo, P., Wang, Y. & Tiwary, P. Reweighted autoencoded variational Bayes for enhanced sampling (RAVE). J. Chem. Phys. 149, 072301 (2018).
Khoo, Y., Lu, J. & Ying, L. Solving for high-dimensional committor functions using artificial neural networks. Res. Math. Sci. 6, 1–13 (2019).
Lin, B. & Ren, W. A deep learning method for computing committor functions with adaptive sampling. SIAM J. Sci. Comput. 47, 1091–1114 (2025).
Prinz, J.-H. et al. Markov models of molecular kinetics: generation and validation. J. Chem. Phys. 134, 174105 (2011).
Chen, H., Roux, B. & Chipot, C. Discovering reaction pathways, slow variables, and committor probabilities with machine learning. J. Chem. Theory Comput. 19, 4414–4426 (2023).
Ansari, N., Rizzi, V. & Parrinello, M. Water regulates the residence time of benzamidine in trypsin. Nat. Commun. 13, 5438 (2022).
Bonati, L., Trizio, E., Rizzi, A. & Parrinello, M. A unified framework for machine learning collective variables for enhanced sampling simulations: mlcolvar. J. Chem. Phys. 159, 014801 (2023).
Zhang, J. et al. Descriptor-free collective variables from geometric graph neural networks. J. Chem. Theory Comput. 20, 10787–10797 (2024).
Pengmei, Z., Lorpaiboon, C., Guo, S. C., Weare, J. & Dinner, A. R. Using pretrained graph neural networks with token mixers as geometric featurizers for conformational dynamics. J. Chem. Phys. https://doi.org/10.1063/5.0244453 (2025).
Eismann, S. et al. Hierarchical, rotation-equivariant neural networks to select structural models of protein complexes. Proteins Struct. Funct. Bioinform. 89, 493–501 (2020).
Battaglia, P. W. et al. Relational inductive biases, deep learning, and graph networks. Preprint at https://arxiv.org/abs/1806.01261 (2018).
Jing, B., Eismann, S., Soni, P. N. & Dror, R. O. Equivariant graph neural networks for 3D macromolecular structure. In Proc. 38th International Conference on Machine Learning https://doi.org/10.48550/arXiv.2106.03843 (2021).
Townshend, R. J. L. et al. ATOM3D: tasks on molecules in three dimensions. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1) Vol. 1 https://openreview.net/forum?id=FkDZLpK1Ml2 (2021).
Megías, A. et al. Iterative variational learning of committor-consistent transition pathways using artificial neural networks. Nat. Comput. Sci. https://doi.org/10.1038/s43588-025-00828-3 (2025).
Ajaz, A. et al. Concerted vs stepwise mechanisms in dehydro-Diels–Alder reactions. J. Org. Chem. 76, 9320–9328 (2011).
Qiu, L., Pabit, S. A., Roitberg, A. E. & Hagen, S. J. Smaller and faster: the 20-residue trp-cage protein folds in 4 μs. J. Am. Chem. Soc. 124, 12952–12953 (2002).
Lindorff-Larsen, K., Piana, S., Dror, R. O. & Shaw, D. E. How fast-folding proteins fold. Science 334, 517–520 (2011).
Swenson, D. W. H., Prinz, J.-H., Noé, F., Chodera, J. D. & Bolhuis, P. G. OpenPathSampling: a Python framework for path sampling simulations. 1. Basics. J. Chem. Theory Comput. 15, 813–836 (2019).
Chen, H. et al. A companion guide to the string method with swarms of trajectories: characterization, performance, and pitfalls. J. Chem. Theory Comput. 18, 1406–1422 (2022).
Tiwary, P. & Berne, B. J. Spectral gap optimization of order parameters for sampling complex molecular systems. Proc. Natl Acad. Sci. USA 113, 2839–2844 (2016).
Bannwarth, C. et al. Extended tight-binding quantum chemistry methods. Wiley Interdiscip. Rev. Comput. Mol. Sci. 11, 1493 (2021).
Meuzelaar, H. et al. Folding dynamics of the trp-cage miniprotein: evidence for a native-like intermediate from combined time-resolved vibrational spectroscopy and molecular dynamics simulations. J. Phys. Chem. B 117, 11490–11501 (2013).
Lloyd, S. Least squares quantization in pcm. IEEE Trans. Inf. Theory 28, 129–137 (1982).
Schütte, C., Noé, F., Lu, J., Sarich, M. & Vanden-Eijnden, E. Markov state models based on milestoning. J. Chem. Phys 134, 204105 (2011).
Bicout, D. J. & Szabo, A. Electron transfer reaction dynamics in non-Debye solvents. J. Chem. Phys. 109, 2325–2338 (1998).
Donati, L., Hartmann, C. & Keller, B. G. Girsanov reweighting for path ensembles and Markov state models. J. Chem. Phys. 146, 244112 (2017).
Donati, L., Weber, M. & Keller, B. G. A review of Girsanov reweighting and of square root approximation for building molecular markov state models. J. Math. Phys. 63, 123306 (2022).
Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. In Proc. Thirteenth International Conference on Artificial Intelligence and Statistics (eds Teh, Y. W. & Titterington, M.) Vol. 9, 249–256 (PMLR, 2010); https://proceedings.mlr.press/v9/glorot10a.html
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://api.semanticscholar.org/CorpusID:6628106 (2014).
Phillips, J. C. et al. Scalable molecular dynamics on CPU and GPU architectures with NAMD. J. Chem. Phys. 153, 044130 (2020).
Fiorin, G. et al. Expanded functionality and portability for the Colvars library. J. Phys. Chem. B 128, 11108–11123 (2024).
Fu, H., Shao, X., Cai, W. & Chipot, C. Taming rugged free energy landscapes using an average force. Acc. Chem. Res. 52, 3254–3264 (2019).
Andersen, H. C. Rattle: a “velocity” version of the shake algorithm for molecular dynamics calculations. J. Comput. Phys. 52, 24–34 (1983).
MacKerell, A. D. J. et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B 102, 3586–3616 (1998).
Maier, J. A. et al. ff14sb: Improving the accuracy of protein side chain and backbone parameters from ff99sb. J. Chem. Theory Comput. 11, 3696–3713 (2015).
Fey, M. & Lenssen, J. E. Fast graph representation learning with pytorch geometric. Preprint at https://api.semanticscholar.org/CorpusID:70349949 (2019).
an Rossum, G. & Drake, F. L. Python 3 Reference Manual (CreateSpace, 2009).
GNN committor learning repository. Zenodo https://doi.org/10.5281/zenodo.18259668 (2025).
Juraszek, J. & Bolhuis, P. G. Rate constant and reaction coordinate of Trp-cage folding in explicit water. Biophys. J. 95, 4246–4257 (2008).
Acknowledgements
C.C. acknowledges the European Research Council (project 101097272 ‘MilliInMicro’), the Université de Lorraine through its Lorraine Université d’Excellence initiative, and the Région Grand-Est (project ‘Respire’). C.C. also acknowledges the Agence Nationale de la Recherche under France 2030 (contract ANR-22-PEBB-0009) for support in the context of the MAMABIO project (B-BEST PEPR). A.M. is thankful for the support provided by Universidad Politécnica de Madrid through the ‘Programa Propio de Investigación’ grant EST-PDI-25-C0ON11-36-6WBGZ8, as well as the hospitality of Université de Lorraine during his stay.
Author information
Authors and Affiliations
Contributions
C.C. conceived the idea, and designed and supervised the research. S.C.A. designed the neural network, developed the corresponding code and carried out the qGNN learnings. C.T., R.A.T. and C.G.C. performed the MD simulations. A.M. performed the shooting analysis of NANMA and worked on the theoretical framework. All the authors contributed in the writing of the manuscript and provided helpful discussions.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Computational Science thanks Han Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Kaitlin McCardle, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Trialanine conformational equilibrium.
Projection of learned committor q onto (ϕ1, ϕ3) (A), (ϕ1, ϕ2) (C) and (ϕ2, ϕ3) (D) space, from atomic coordinates of a biased trajectory of trialanine. Relative node sensitivity computed for a short biased simulation of 15 ns and a representation of the trialanine molecule where carbon atoms are represented in cyan, oxygen in red, nitrogen in blue and hydrogen in white (B).
Extended Data Fig. 2 Diels-Alder reaction.
Learned committor q, projected on (d1, d2)-space, from atomic coordinates of a biased trajectory of Diels–Alder system. The contour lines represent the free-energy surface (A). Relative node sensitivity analysis computed for a short biased simulation of 2 ns (B). A committor map learned using the variational committor network16 for this reaction is depicted for comparison (C). Furthermore, we show a sketch of the Diels–Alder system with the heavy atoms labelled as well as the distances used in the CV-projection. Carbon atoms are represented in cyan and hydrogen atoms in white(D).
Extended Data Fig. 3 Trp-cage reversible folding.
Learned committor q, projected on (RMSD, dend)-space,—where the RMSD is computed with respect to basin B—from atomic coordinates of an unbiased trajectory of Trp-cage system. (A). Relative node sensitivity computed for a short unbiased simulation of 50 ns (B). Sketch of Trp-cage protein with the three main distances between atoms (green spheres) d1, d2 and d3 (C). The projection of the committor onto the (d1,d2)-subspace (D). Conformations extracted from a clustering analysis in the vicinity of the separatrix, exhibiting distinct secondary-structure elements (E, I to IV). The Trp-cage representations are colour-coded by secondary structure: white for random coils, cyan for turns, yellow for β-sheets, and magenta for α-helices.
Supplementary information
Source data
Source Data Fig. 2
Raw data for Fig. 2.
Source Data Extended Data Fig. 1
Raw data for Extended Data Fig. 1.
Source Data Extended Data Fig. 2
Raw data for Extended Data Fig. 2.
Source Data Extended Data Fig. 3
Raw data for Extended Data Fig. 3.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Contreras Arredondo, S., Tang, C., Talmazan, R.A. et al. Learning the committor without collective variables. Nat Comput Sci (2026). https://doi.org/10.1038/s43588-026-00958-2
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s43588-026-00958-2


