Introduction

The main goal of molecular biophysics is to understand how proteins function. Protein behavior is governed by a rugged energy landscape featuring numerous valleys separated by barriers1. The valleys correspond to functionally important metastable conformations, with the deepest valley being the native structure. Transitions between conformations are critical for protein function1,2,3, such as enzymatic reactions, allostery, substrate binding, and protein-protein interaction.

With the development of AlphaFold4,5, the long-standing structure prediction problem has been solved, making the native structures of proteins readily available. The major challenge now is to identify the other functionally important conformations and understand the transition dynamics between them. Molecular dynamics (MD) simulation is a pivotal tool, as it can provide full atomic details—an advantage unmatched by experimental techniques. However, due to the large gap between the time scales of MD simulation (approaching microseconds) and functional processes (milliseconds to hours), MD simulation of functional processes has been infeasible except in a few special cases6,7.

To overcome this time scale challenge, intensive efforts have been focused on developing methods to enhance the sampling of functionally important regions of the conformational space, including two branches. One branch focuses on sampling important metastable conformations without addressing the transition dynamics between them8,9. The other branch focuses on sampling the transition dynamics between different metastable conformations10,11.

Enhanced sampling of important conformations consists of two components: accelerating conformational changes to effectively explore the conformational space and reweighting to extract the correct thermodynamics. The former is the bottleneck and the focus of the current work. Many methods, such as umbrella sampling, adaptive biasing force, and metadynamics12,13,14,15, use bias potentials on user-selected collective variables (CVs) to accelerate conformational changes. Their efficacy hinges on finding suitable CVs; without them, they provide no more acceleration than standard MD simulations8. Traditionally, CVs were chosen by user intuition, involving geometric parameters (e.g., radius of gyration), principal components, evolutionary correlations, and RMSD from reference structures8,16,17. In recent years, it has been increasingly clear that intuition is inadequate for identifying the suitable CVs8,18. Consequently, significant effort has been made to develop systematic methods for CV identification, yielding many innovative methods19. The predominant strategy was to use machine learning to extract slow modes from simulation data18,20,21,22. Despite these efforts, finding CVs that can effectively accelerate protein conformational changes remains a formidable challenge.

Among the methods for sampling transition dynamics, the transition path sampling (TPS) method is an important example10,11. It offers unique advantages over methods like milestoning, Markov state modeling, and string method23,24,25 because it generates natural reactive trajectories (NRTs)—unbiased MD trajectories that connect the reactant and product basins without any assumption or approximation, providing the full details of transition dynamics that mirror the physical reality. NRTs also achieve optimal efficiency by covering the entire transition period while skipping the prolonged waiting period in the reactant basin. The waiting period is orders of magnitude longer than the transition period, causing MD simulations to be trapped in the reactant basin without reaching the transition region, thus preventing simulation of functional processes. TPS generates NRTs by initiating MD trajectories from conformations on existing NRTs close to the transition state (TS). However, finding an initial NRT or TS conformations poses a formidable challenge, and TPS does not provide an effective solution for this, hindering applications of TPS to complex systems.

The solutions to the bottleneck challenges in both branches of enhanced sampling are provided by true reaction coordinates (tRCs), the few essential protein coordinates that fully determine the committor of any system conformation10,26,27. The committor (\({p}_{B}\)) is the probability that a trajectory initiated from a given system conformation, with momenta drawn from Boltzmann distribution, will reach the product state before the reactant state10,26,28,29. It precisely tracks the progression of a conformational change, marking the key states: \({p}_{B}=0\) for the reactant, \({p}_{B}=1\) for the product, and \({p}_{B}=0.5\) for the TS. tRCs are coordinates that can accurately predict the committor for any conformation, rendering all other system coordinates irrelevant.

tRCs are widely regarded as the optimal CVs for accelerating conformational changes8,9, as they are expected to not only provide efficient acceleration but also generate accelerated trajectories that follow the natural transition pathway. This is due to their role in energy activation—the critical step in protein conformational change where rare fluctuations channel energy into tRCs to propel the system over the activation barrier (i.e. the TS) to the product basin30,31. Applying bias potentials on tRCs maximizes energy transfer into tRCs, leading to highly accelerated barrier crossing. In contrast, if the CVs significantly deviate from the tRCs, the bias potential will miss the actual activation barrier, resulting in the infamous “hidden barrier”32,33 that prevents effective sampling.

Furthermore, tRCs provide an effective solution to the bottleneck problem in TPS. As we showed in Fig. 7a of ref. 34, bias applied to tRCs generates trajectories that pass through the full range of intermediate committor values \({p}_{B}\in [0.1,\,0.9]\), which marks the transition period. This provides an efficient means to obtain TS conformations, which are otherwise difficult to attain, thereby addressing the key limitation of TPS.

Given the importance of tRCs, identifying them in complex biomolecules has been a central challenge in chemical physics and molecular biophysics since the pioneering work by Du et al. and Chandler and colleagues in the late 1990s10,19,26,27,35,36,37,38,39. In recent years, we have made important progress by developing rigorous, physics-based methods—energy flow theory and the generalized work functional (GWF) method31,40,41,42,43. Using these methods, we have identified the tRCs for the flap opening process of HIV-1 protease (HIV-PR) in implicit solvent34, marking the first successful identification of tRCs in proteins. However, tRCs identification required NRTs, making them not useful in solving the bottlenecks in enhanced sampling.

In this work, we show that tRCs control both conformational changes and energy relaxation, enabling their computation from energy relaxation alone. Biasing tRCs in HIV-PR in explicit solvent accelerates flap opening and ligand unbinding, a process with an experimental lifetime of \(8.9\times {10}^{5}\) s, to 200 ps. The resulting trajectories, termed RC-uncovered trajectories, follow natural transition pathways and pass through TS conformations, enabling NRT generation via TPS. These findings provide an effective solution to the bottlenecks in both accelerating protein conformational changes and harvesting NRTs. In contrast, biased trajectories using a standard CV follow non-physical transition pathways, reinforcing the well-accepted assumption that tRCs are optimal CVs for enhanced sampling. Our method computes tRCs from a single protein structure, enabling predictive sampling. Applying it to PDZ allostery, a long-standing puzzle for over 20 years44, we observe previously unrecognized large-scale transient conformational changes at PDZ allosteric sites in NRTs for ligand dissociation. This discovery reveals an intuitive mechanism for PDZ allostery, where effectors affect ligand binding by interfering with these fleeting conformational changes.

Results

We use two methods to identify tRCs. First, potential energy flows (PEFs) through individual coordinates measure the importance of each coordinate in driving protein conformational changes, with the highest PEFs indicating the critical coordinates. Second, the generalized work functional (GWF) generates an orthonormal coordinate system, called singular coordinates (SCs), that disentangles tRCs from non-RCs by maximizing the PEFs through individual coordinates. Consequently, tRCs are identified as the SCs with the highest PEFs34,40,43.

Potential energy flow

The motion of a coordinate is governed by its equation of motion (EoM). For a coordinate \({q}_{i}\), its EoM is generated by the mechanical work done on \({q}_{i}\), given by: \(d{W}_{i}=-\frac{\partial U\left({{\bf{q}}}\right)}{\partial {q}_{i}}d{q}_{i}\)40. The EoM of \({q}_{i}\) (Hamiltonian equation) is: \({\dot{p}}_{i}=-\frac{\partial H}{\partial {q}_{i}}=-\frac{\partial K}{\partial {q}_{i}}-\frac{\partial U}{\partial {q}_{i}}\), where \({p}_{i}\) is the conjugate momentum of \({q}_{i}\), \(K({{\bf{q}}},\,\dot{{{\bf{q}}}})\) is the kinetic energy, and \(U({{\bf{q}}})\) is the potential energy of the system. Moving \(\frac{\partial K}{\partial {q}_{i}}\) to the left-hand side results in:

$${\dot{p}}_{i}+\frac{\partial K}{\partial {q}_{i}}=-\frac{\partial U\left({{\bf{q}}}\right)}{\partial {q}_{i}}=\frac{d{W}_{i}}{d{q}_{i}}$$
(1)

The left-hand side of Eq. (1) represents the total change in the motion of qi, meaning that dWi is the energy cost of the motion of qi, which we refer to as the PEF through qi. The PEF through qi during a finite period is given by the integration of \(d{W}_{i}\): \(\Delta {W}_{i}\left({t}_{1},{t}_{2}\right)={\int }_{{t}_{1}}^{{t}_{2}}d{W}_{i}=- {\int }_{{q}_{i}\left({t}_{1}\right)}^{{q}_{i}\left({t}_{2}\right)}\frac{\partial U\left({{\bf{q}}}\right)}{\partial {q}_{i}}d{q}_{i}\).

Intuitively, coordinates with a higher energy cost require more effort from the system and play a more significant role in dynamic processes. In protein conformational changes, which are activated processes, tRCs need to overcome the activation barrier, incurring the highest energy cost for their motion. Therefore, tRCs correspond to coordinates with the highest PEFs.

Generalized work functional

The remaining challenge in identifying tRCs is the entanglement between tRCs and non-RCs in general-purpose coordinate systems (e.g., internal coordinates), where many coordinates have components in both tRC and non-RC directions. To cleanly distinguish tRCs from non-RCs, we need to transform internal coordinates into an “ideal” coordinate system that disentangles them. Since PEFs measure the importance of individual coordinates, this “ideal” coordinate system should maximize the differences in PEFs, thereby maximally separating the most important coordinates (tRCs) from the unimportant ones (non-RCs). This can be achieved by maximizing the PEFs through individual coordinates in an orthogonal coordinate system.

Finding the “ideal” coordinate system starts with understanding how PEFs transform between different coordinate systems. The transformation of \(d{W}_{i}={F}_{i}d{q}_{i}\) in coordinates \({{\bf{q}}}\) to \(d{W}_{\alpha }={F}_{\alpha }d{r}_{\alpha }\) in coordinates \({{\bf{r}}}\), related by \(d{{\bf{r}}}={{\bf{A}}}\cdot d{{\bf{q}}}\) and \(d{{\bf{q}}}={{{\bf{A}}}}^{-1}\cdot d{{\bf{r}}}\), is given by the chain rule:

$$d{W}_{\alpha }=-\frac{\partial U}{\partial {r}_{\alpha }}d{r}_{\alpha }=- \sum _{i,k=1}^{N}\frac{\partial U}{\partial {q}_{i}}\frac{\partial {q}_{i}}{\partial {r}_{\alpha }}\frac{\partial {r}_{\alpha }}{\partial {q}_{k}}d{q}_{k}=\sum _{i,k=1}^{N}{A}_{\alpha i}\left({F}_{i}d{q}_{k}\right){A}_{k\alpha }^{-1}$$
(2)

Here, \({A}_{\alpha i}={A}_{i\alpha }^{-1}=\frac{\partial {r}_{\alpha }}{\partial {q}_{i}}=\frac{\partial {q}_{i}}{\partial {r}_{\alpha }}\) because A is an orthogonal matrix. Equation (2) introduces a conceptually new physical quantity \({F}_{i}d{q}_{k}\), which is not a mechanical work. Therefore, the coordinate transformation of \(d{W}_{i}\) requires generalizing the concept of mechanical work to incorporate quantities like \({F}_{i}d{q}_{k}\).

The GWF generalizes the concept of mechanical work. Its differential in coordinates \({{\bf{q}}}\) is defined as:

$$d{{\mathbb{W}}}_{q}={{\bf{F}}}\otimes {{\rm{d}}}{{\bf{q}}}$$
(3)

Here, denotes tensor product, making \(d{{\mathbb{W}}}_{q}\) an asymmetric tensor and \({F}_{i}d{q}_{k}\) its elements. The GWF in coordinates r and q are related by a similarity transformation:

$${{d}}{{\mathbb{W}}}_{r}={{\bf{A}}}\cdot {{d}}{{\mathbb{W}}}_{q}\cdot {{{\bf{A}}}}^{-1}$$
(4)

Therefore, the coordinate transformation of GWF does not introduce any new quantities, making it a self-contained fundamental concept, with mechanical work as its sub-concept. All the important mechanical quantities, such as dW = (dW1dW2, …, dWN) and dU, are encompassed in GWF:

$$d{{{\bf{W}}}}_{{{\rm{r}}}}={diag}({{\rm{d}}}{{\mathbb{W}}}_{{{\rm{r}}}})={diag}({{\bf{A}}}\cdot d{{\mathbb{W}}}_{q}\cdot {{{\bf{A}}}}^{-1})$$
(5)
$${dU}={Tr}(d{{\mathbb{W}}}_{r})={Tr}(d{{\mathbb{W}}}_{q})$$
(6)

where diag() denotes diagonal vector and Tr() denotes trace. Equation (5) further shows that \(d{\mathbb{W}}\) is the operator for transforming dW between different coordinate systems.

The singular value decomposition of GWF is:

$$d{\mathbb{W}}{\mathbb{=}}{{\bf{U}}}\cdot {{\mathbf{\Lambda }}}\cdot {{{\bf{V}}}}^{{{\rm{T}}}}$$
(7)

It decomposes \(d{\mathbb{W}}\) into its optimal basis tensors \({{{\bf{u}}}}_{{{\bf{i}}}}\bigotimes {{{\bf{v}}}}_{{{\rm{i}}}}\) as: \(d{\mathbb{W}}{\mathbb{=}}{\sum }_{i=1}^{N}{\lambda }_{i}{{{\bf{u}}}}_{{{\rm{i}}}}\bigotimes {{{\bf{v}}}}_{{{\rm{i}}}}\). Here, ui and vi are the i-th column vectors of the left and right singular matrices \({{\bf{U}}}\) and \({{\bf{V}}}\), respectively; λi is the i-th singular value. The optimality of \({{{\bf{u}}}}_{{{\rm{i}}}}\bigotimes {{{\bf{v}}}}_{{{\rm{i}}}}\) means that \({\lambda }_{i}{{{\bf{u}}}}_{{{\rm{i}}}}\bigotimes {{{\bf{v}}}}_{{{\rm{i}}}}\) represents the i-th largest contribution to \(d{\mathbb{W}}\), and the sum \({\sum }_{i=1}^{m\ll N}{\lambda }_{i}{{{\bf{u}}}}_{{{\rm{i}}}}\bigotimes {{{\bf{v}}}}_{{{\rm{i}}}}\) provides the optimal m-dimensional reduced description. Since \(d{\mathbb{W}}{\mathbb{=}}{{\bf{F}}}\bigotimes d{{\bf{q}}}\), ui and vi are the optimal basis vectors for the force and displacement spaces, respectively.

The collection of all \({{{\bf{u}}}}_{{{\rm{i}}}}\) forms an orthonormal coordinate system: \(d{{\bf{u}}}={{{\bf{U}}}}^{{{\rm{T}}}}\cdot d{{\bf{q}}}\), which we refer to as the SCs. \({F}_{i}=-\frac{\partial U}{\partial {s}_{i}}\) represents the force with the i-th largest impact on the system’s dynamics, and \(d{W}_{i}={F}_{i}d{s}_{i}={\lambda }_{i}({{{\bf{u}}}}_{{{\rm{i}}}}\cdot {{{\bf{v}}}}_{{{\rm{i}}}})\) is the i-th highest PEF in the system. Consequently, \({\sum }_{i=1}^{m\ll N}d{W}_{i}\) provides the optimal m-dimensional reduced description of \({dU}\), the generating function of all equations of motion in the system. This is the condition tRCs would meet if they exist in a protein. Therefore, the SCs provide the “ideal” coordinate system; the leading SCs are tRCs, as they represent the directions of forces with the highest impact on the system’s dynamics.

A hypothesis on activation and energy relaxation

In ref.34, we demonstrated that the leading SCs of the GWF calculated from an ensemble of NRTs for the flap-opening of HIV-PR in implicit solvent are tRCs, as they can determine the committor with high accuracy. This result confirms that the GWF method can effectively identify tRCs in proteins. In addition, applying bias potentials along the tRCs led to RC-uncovered trajectories that open the flaps within 4 ps—an acceleration of over 104-fold compared to MD34. This result supports the long-standing hypothesis that tRCs are the optimal CVs for enhanced sampling of protein conformational changes.

In our application of the GWF method to energy relaxation in myoglobin42, we observed a significant gap between the PEFs of the leading SCs and the others, resembling the PEF pattern of tRCs in ref. 34. (Fig. 2C in ref. 34). Energy relaxation and activation are closely related, as a system needs to relax into the stable basin after crossing the activation barrier. Drawing on Onsager’s regression hypothesis45, we propose that the coordinates essential for energy relaxation (i.e. its leading SCs) are identical to those governing activation (i.e. the tRCs).

Leading SCs of energy relaxation are the same as tRCs of conformational transition

To determine if the leading SCs of HIV-PR energy relaxation are also the tRCs for its activation (e.g. flap opening; Fig. 1), we simulated energy relaxation of ligand-free HIV-PR in implicit solvent, the same system as in ref. 34. To mimic energy relaxation after ligand binding or enzymatic reaction, we deposit excess kinetic energy into the active site of HIV-PR (details in the Methods) and generate an ensemble of N MD trajectories, each one 5 ps in length42. We then compute GWF and PEFs averaging over this ensemble.

$$\left\langle \Delta {\mathbb{W}}\left(0\to t\right)\right\rangle=\frac{1}{N}\sum _{\alpha=1}^{N}\Delta {\mathbb{W}}(0\to {t;}\, \alpha )$$
(8)
$$\left\langle \Delta {{W}}_{i}\left(0\to t\right)\right\rangle=\frac{1}{N}\sum _{\alpha=1}^{N}\Delta {W}_{i}(0\to {t;}\, \alpha )$$
(9)

Here, t=0 is the starting point of energy relaxation when the excess kinetic energy is deposited to the active site; \(\Delta {\mathbb{W}}\left(0\to {t;}\, \alpha \right)={\int }_{0}^{t}d{\mathbb{W}}\) and \(\Delta {{W}}_{i}\left(0\to {t;}\, \alpha \right)={\int }_{\!0}^{t}d{W}_{i}\) are GWF and PEF through qi along trajectory α, respectively.

Fig. 1: Representative structures of the closed and open states of DRV-bound HIV-PR protease.
Fig. 1: Representative structures of the closed and open states of DRV-bound HIV-PR protease.
Full size image

The two spheres denote the center of mass of the Cα atoms of residues Met46 to Lys55 in the two monomers; the flap distance df is the distance between them. The closed conformation (df= 1.03 nm) is from crystal structure (Protein Data Bank (PDB): 1T3R), and the open conformation (df = 2.73 nm) is from a natural reactive trajectory of DRV dissociation. Different structural segments are marked by different colors. Source data are provided as a Source Data file.

Figure 2a shows the inner product between the six leading SCs for the energy relaxation of HIV- PR in implicit solvent and the six tRCs of its flap opening we identified in ref. 34. Each SC (\({u}_{a}\)) or tRC (\({r}_{a}\)) is a linear combination of the backbone dihedrals (\({\chi }_{i}\)) of HIV-PR: \({u}_{a}={\sum }_{a=1}^{N}{U}_{{ia}}{\chi }_{i}\). All inner products are close to 1—confirming our hypothesis that the leading SCs of energy relaxation

Fig. 2: Singular vectors of energy relaxation of HIV-PR.
Fig. 2: Singular vectors of energy relaxation of HIV-PR.
Full size image

a Inner product between leading SCs (ua) of energy relaxation and tRCs (ra) of flap opening of HIV-PR in implicit solvent. b Potential energy flows of SCs of DRV-bound HIV-PR averaged over \({E}_{0}({u}_{0})\) (details in Methods). Red lines represent u0 to u6; gray lines represent the other SCs (i.e. non-RCs). Source data are provided as a Source Data file.

are the same as the tRCs for conformational transitions.

This result revealed a fundamental connection between activation and energy relaxation, which, in our understanding, stems from the need for optimal efficiency in protein function. Both processes rely on systematic protein dynamics driven by systematic PEFs through individual coordinates, which are absent during equilibrium fluctuations. The importance of a coordinate is determined by the magnitude of its PEF, which is an intrinsic physical property determined by the protein structure and fine-tuned through evolution. tRCs, with the highest PEFs, serve as optimal energy flow channels.

During activation, the protein delivers energy to the active site to perform its function. For optimal delivery, energy systematically flows from low-capacity channels (non-RCs) to high-capacity channels (tRCs). During energy relaxation, the protein dissipates excess energy from the active site to the rest of the protein and eventually to the solvent. For efficient dissipation, energy first flows into the tRCs for rapid removal and then disperses into the non-RCs. Thus, energy flows through the same channels in both activation and relaxation but in opposite directions. This dual role of tRCs allows us to compute them from energy relaxation as well as from activated processes.

tRCs provide effective enhanced sampling of conformational changes

A common assumption is that tRCs are the optimal CVs for enhanced sampling of protein conformational changes8. If we can compute tRCs from energy relaxation, it will be highly valuable for applications. To explore this possibility, we simulated energy relaxation in two systems, HIV-PR bound with DRV and MA/CA peptide in explicit solvent. Figure 2b shows the PEFs of SCs of energy relaxation of DRV-bound HIV-PR. The first seven SCs (\({u}_{0}\) to \({u}_{6}\); red lines in Fig. 2b) show much higher PEFs than the rest (gray lines in Fig. 2b), suggesting they are the tRCs. The same holds true for the SCs of MA/CA-bound HIV-PR. To distinguish, we will refer to them as tRCs from now on.

By applying bias potentials on the tRCs (details in Methods section and ref. 34), we generated RC- uncovered trajectories for flap opening and ligand unbinding of HIV-PR in ligand-free, DRV-bound, and MA/CA-bound states. The time evolution of the flap distance \({d}_{f}\) (Fig. 1a) along these trajectories is shown in Fig. 3. For all tRCs and across all systems, \({d}_{f}\) quickly increases to the range of 2.5 to 3 nm within 20 ps, corresponding to wide-open conformations in the literature46,47. Moreover, the ligand dislodges from the active site and is well on its way to exit the flaps within 200 ps (Fig. 4a,c; Supplementary video S1, S2).

Fig. 3: Efficiency of flap opening along RC-uncovered trajectories in ligand-free, DRV-bound and MA/CA-bound HIV-PR.
Fig. 3: Efficiency of flap opening along RC-uncovered trajectories in ligand-free, DRV-bound and MA/CA-bound HIV-PR.
Full size image

Solid lines are results of averaging over 30 trajectories. Each shaded region shows the range of df covered by trajectories generated from the corresponding SC. The upper bound is the maximum and the lower bound is the minimum df value reached at each time point among the 30 trajectories used in calculating the average. Each SC ui is represented by a different color, as indicated in the middle panel. Source data are provided as a Source Data file.

Fig. 4
Fig. 4
Full size image

Snapshots from RC-uncovered trajectories and natural reactive trajectories (NRTs) of flap opening and ligand unbinding in HIV-PR bound to darunavir (DRV) (a, b) and (MA/CA peptide) (c, d). The full trajectories are shown in Supplementary Videos S1 to S4, respectively. Source data are provided as a Source Data file.

To evaluate the efficiency of enhanced sampling by the tRCs, we compare the time scales of flap opening and ligand unbinding in RC-uncovered trajectories with those observed in other simulations and experiments47,48,49. Sadiq and Fabritiis conducted 461 MD trajectories, each 50 ns long, totaling 23 µs simulation time47. They observed four wide-open events in ligand-free HIV- PR, averaging 5.8 µs per event, in line with the 100 µs observed by NMR46. In comparison, RC-uncovered trajectories show wide-open flaps within 10 ps, at least a \(2.9\times {10}^{5}\)-fold acceleration compared to MD for ligand-free HIV-PR. For MA/CA-bound HIV-PR, Sadiq et al. conducted 100 MD simulations totaling 10 μs and observed only thermal fluctuations49. Thus, flap opening in this system requires much longer than 10 µs. In stark contrast, RC-uncovered trajectories demonstrate wide-open flaps within 20 ps, marking an acceleration far exceeding \({10}^{6}\)-fold. Additionally, while experimentally measured half-life of DRV unbinding is \(8.9\times {10}^{5}\) s50, RC-uncovered trajectories achieved this in 200 ps (Fig. 4a). Together, these results demonstrate the efficiency of RC-uncovered trajectories.

Reversible transitions along individual tRCs

Beyond sampling the transition paths, an important goal of enhanced sampling is to compute thermodynamic variables, especially free energy surfaces. This requires that the system undergo reversible transitions. The current system has multiple tRCs, with different tRCs opening the flaps in distinct ways, as shown in Fig. 6 in ref. 34. Therefore, it is crucial that the flaps can open and close reversibly along a single tRC so that, with proper application of bias potentials on multiple tRCs, a multi-dimensional free energy surface could be computed. Figure 5 shows time evolution of both \({u}_{0}\) and the flap distance along an RC-uncovered trajectory obtained from applying bias on \({u}_{0}\) of ligand-free HIV-PR, featuring multiple rounds of flap opening and closing. The high similarity in time dependence of both \(\Delta {u}_{0}(t)\) and \({d}_{f}(t)\) across different transition cycles demonstrates the robustness of transition reversibility.

Fig. 5: RC-uncovered trajectory of HIV-PR with reversible transitions.
Fig. 5: RC-uncovered trajectory of HIV-PR with reversible transitions.
Full size image

Time evolution of tRC u0 (blue) and flap distance df (red) along an RC-uncovered trajectory of u0 with multiple transitions between close and open states. Source data are provided as a Source Data file.

RC-uncovered trajectories follow natural transition pathways

A major advantage of RC-uncovered trajectories is that they follow natural transition pathways, the same pathways traced by NRTs. This is crucial for understanding protein functions, the main goal of MD simulations. A rigorous way to verify whether a trajectory follows natural pathways is through the committor. Along an NRT, the committor covers the full range of values: \({p}_{B}\in [0,\,1]\), with \({p}_{B}\in (0.1,\,0.9)\) marking barrier crossing. By contrast, a trajectory deviating from natural pathways will show an abrupt jump in committor values from 0 to 1 or do not manifest well-defined committor values, as it bypasses the actual activation barrier and TS. Instead, it explores a non-physical region of the conformational space, where committor values are not well-defined. Therefore, a progression through intermediate committor values is a clear signature that a trajectory follows natural transition pathways.

In Fig. 7a of ref. 34, RC-uncovered trajectories of all 6 tRCs span the full range of committor values, confirming that they follow natural transition pathways. Therefore, an RC-uncovered trajectory that follows natural pathways can validate the corresponding RC as a tRC. For HIV-PR in explicit solvent, calculating committor is prohibitively expensive. As a computationally efficient but rigorous alternative, we use the shooting move of TPS10 to show that RC-uncovered trajectories of the current systems pass through the TS region. The TS is the bottleneck along a natural transition pathway and represents the most critical intermediate committor value: \({p}_{B}\simeq 0.5\). It is difficult to envision a scenario where a trajectory reaches the TS but still diverges from the natural pathway.

In the shooting move10, we select a conformation R0 from an RC-uncovered trajectory and draw momenta \({{{\bf{p}}}}_{0}\) from Boltzmann distribution. We then launch a pair of MD trajectories from \({{{\bf{R}}}}_{0}\) with initial momenta \({{{\bf{p}}}}_{0}\) and \(-{{{\bf{p}}}}_{0}\), respectively. If these trajectories reach opposite basins, we leverage the time reversibility of classical mechanics to create a reactive trajectory by reversing the momenta along the trajectory that ends in the reactant basin and merging the two trajectories at \({{{\bf{R}}}}_{0}\)10. Since no bias is used, trajectories generated by the shooting move are NRTs, as demonstrated by Chandler and colleagues10,11.

The likelihood of successfully generating NRTs from a conformation \({{{\bf{R}}}}_{0}\) is determined by \(p\left({{\rm{RT}}}\,|\,{{{\bf{R}}}}_{0}\right)\), the probability that a dynamic trajectory passing through \({{{\bf{R}}}}_{0}\) is an NRT. This probability is related to the committor by \(p({{\rm{RT}}}\,|\,{{{\bf{R}}}}_{0})=2{p}_{B}({{{\bf{R}}}}_{0})\,\left(1-{p}_{B}({{{\bf{R}}}}_{0})\right)\)51,52, which reaches a maximum of 0.5 at \({p}_{B}\left({{{\bf{R}}}}_{0}\right)=\) 0.5, and drops to 0 at \({p}_{B}\left({{{\bf{R}}}}_{0}\right)=0\) or 1. Given this, the likelihood of generating NRTs via shooting is significant only when \({{{\bf{R}}}}_{0}\) is close to the TS—where the free energy difference between \({{{\bf{R}}}}_{0}\) and the TS is within the thermal energy \({{{\rm{k}}}}_{{{\rm{B}}}}{{\rm{T}}}\). Successfully generating NRTs through the shooting move provides objective validation that RC-uncovered trajectories pass through the TS, thereby confirming that they follow natural pathways.

We obtained NRTs for flap opening and complete ligand dissociation in DRV-bound and MA/CA-bound HIV-PRs (Fig. 4b, d; Supplementary videos S3, S4) by shooting from the respective RC-uncovered trajectories. To our knowledge, this is the first successful attainment of NRTs for ligand dissociation from HIV-PR. For computational efficiency, only five pairs of shooting trajectories were attempted per conformation. Consequently, conformations that successfully generate NRTs are likely close to the TS. These results demonstrate that the RC-uncovered trajectories pass through TS and follow the natural pathways, validating the tRCs computed from energy relaxation simulations.

Comparison with enhanced sampling by empirically derived CVs

tRCs are widely recognized as the optimal CVs for enhanced sampling of protein conformational changes. To test this, we simulate MA/CA dissociation from HIV-PR by applying bias potentials on commonly used empirical CVs and compare the results with NRT and RC-uncovered trajectories. The NRT provides the correct mechanism of this process because it is an unbiased MD trajectory that covers the entire transition period.

The empirical CV we use is the distance sl between the center of mass (CoM) of Cα atoms of MA/CA and the CoM of Cα atoms of active site residues (residues 24, 26, 27). We chose sl because it was employed in ref. 17, an extensive metadynamics-based bias-exchange simulation of the dissociation of a six-residue peptide from HIV-PR that has an accumulated simulation time of 1.6 µs.

We first present the results from steered MD simulation using the umbrella pulling method implemented in GROMACS, similar to how we applied bias to tRCs. The pulling strength on sl was adjusted to achieve dissociation in 1.4 ns. As shown in Fig. 6, the resulting trajectory is fundamentally different from the NRT and RC-uncovered trajectory (Fig. 4c,d).

Fig. 6: Non-physical features of enhanced sampling with empirical CVs.
Fig. 6: Non-physical features of enhanced sampling with empirical CVs.
Full size image

a Time evolution of sl and df along a steered MD (SMD) trajectory of MA/CA dissociaiton from HIV-PR in explicit solvent with sl as the CV. b Time evolution of sl and df along a metadynamics (metaD) trajectory. c Time evolution of df along an NRT and an RC-uncovered trajectory. (d) Snapshots from the steered MD trajectory in (a). e Snapshots from the metadynamics trajectory in (b). Source data are provided as a Source Data file.

In the steered MD trajectory (Fig. 6a), \({s}_{l}\) and the flap distance \({d}_{f}\) increase together, indicating that the ligand is actively pushing the flaps open, causing elastic distortion of the flaps (Fig. 6d). Moreover, \({d}_{f}\) only reaches 1.8 nm, just enough for the ligand to slip through. We consider it an elastic distortion rather than a true flap-opening because it involves only the bending of the flaps (Fig. 1). This behavior contradicts the established mechanism of HIV-PR function, which requires

full flap-opening.

By contrast, in both NRTs and RC-uncovered trajectories (Fig. 4), the flaps open to 2 nm as the ligand begins to dislodge (192 ns for the NRT, 7 ps for the RC-uncovered trajectory). Afterwards, the influx of water molecules into the active site drives ligand motion and further flap opening. Full ligand dissociation occurs when the flaps open to 3 nm (Fig. 6c). Furthermore, flap opening in NRT and RC-uncovered trajectory (Fig. 4) is driven by global, collective protein structural changes, causing the flaps to swivel open rather than just bend. The main difference between the NRT and RC-uncovered trajectory is the longer time scale and larger fluctuations in \({d}_{f}\) along the NRT (Fig. 6c).

These results demonstrate that the NRT and RC-uncovered trajectory follow the same transition pathway, which aligns with the established mechanism of ligand dissociation from HIV-PR and contrasts sharply with the non-physical behavior seen in the steered MD trajectory, a typical issue when applying bias on non-RC CVs. A more detailed discussion is provided in Supplementary Note 2 (Supplementary Fig. 2).

For fair comparison, we need to ensure that the non-physical features shown in Fig. 6d are due to the empirical CV, not the specifics of how the bias is applied. For this purpose, we simulated the same process using the same CV with the well-tempered metadynamics implemented in PLUMED253. Unlike steered MD, metadynamics applies bias potentials in a highly adaptive and flexible manner, minimizing the risk of artifacts from the non-adaptive bias protocol used in steered MD. Supplementary Fig. 4 shows the time evolution of \({d}_{f}\) and \({s}_{l}\) under varying strength of Gaussian bias. To achieve flap-opening within 2 ns, a Gaussian height at 10,000-fold the recommended value (0.2 \({{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\)) is needed. The resulting trajectory displays the same features as the steered MD simulation, evident in the time evolutions of \({s}_{l}\) and \({d}_{f}\) (Fig. 6b) and the snapshots (Fig. 6e). Notably, ref. 17 reported peptide dissociation with minimal flap opening, in line with our observations here. These results confirm that the non-physical characteristics of the MA/CA dissociation trajectories are the consequence of using \({s}_{l}\) as the CV.

Efficiency of enhanced sampling by tRCs

In the results above, the time scales for DRV dissociation in RC-uncovered trajectories, NRTs, and experiments differ by orders of magnitude. This is because they emphasize different phases of the same process. The experimental half-life corresponds to the average time for observing a DRV dissociation event in an extremely long MD trajectory containing many rounds of DRV binding and unbinding. In this context, NRTs represent a \({10}^{12}\)-fold acceleration over MD. This is enabled by the shooting move’s focus on the actual barrier crossing phase while bypassing the extended waiting time in the reactant basin, which is the determinant of the half-life of an activated process10,11. To bypass waiting, shooting move must start from a TS conformation—the least populated conformational state that requires a long waiting period to reach—which is provided by RC-uncovered trajectories. In contrast, enhanced sampling trajectories obtained with empirically derived CVs do not sample TS conformations. Finally, RC-uncovered trajectories are ~\({10}^{3}\)-fold shorter than NRTs, aggregating into a \({10}^{15}\) -fold acceleration over MD. This is achieved by reducing diffusive motions—the most time-consuming aspect of NRTs54.

The efficiency of RC-uncovered trajectories is enabled by an intriguing physical mechanism. The critical step of an activated process is energy activation, where rare fluctuations channel energy into the tRCs, enabling the system to cross the activation barrier30,31. The efficiency of enhanced sampling hinges upon effective transfer of energy into the tRCs to expedite activation. tRCs are optimal CVs because, in this case, the bias potentials directly inject energy into tRCs, thereby maximizing the efficiency of energy activation.

Predictive enhanced sampling by tRCs: PDZ domain allostery

Energy relaxation requires only a single protein structure, yet the resulting tRCs enable enhanced sampling of the protein’s inherent conformational transitions, underscoring the predictive capability of our approach. In our HIV-PR simulations, the only input was the flap-closed conformation, but we successfully obtained NRTs of the flap opening process and the ensemble of flap-open conformations. To further validate this predictive capability, we use the PDZ domain as a blind test, because it has only one known structure, and its allostery has been a challenging puzzle for over two decades.

PDZ domains are a large family of protein-ligand interaction modules55. They share the same canonical fold (Fig. 7) and function as organization centers in multi-protein signaling complexes. Two examples of PDZ allostery have been extensively studied. The first is a ~ 13-fold increase in ligand binding affinity when Cdc42 binds to the Par6 CRIB-PDZ (Fig. 7b)56 at the interface formed by \({\alpha }_{1}\)-helix and \({\beta }_{1}\)-sheet. The second example involves an extra \({\alpha }_{3}\)-helix appended to the C-terminal of PDZ3 (pink helix in Fig. 7c). Two studies found a consistent conclusion57: the presence of \({\alpha }_{3}\) increases PDZ3 ligand affinity by 21- to 120-fold, despite the lack of direct interactions between \({\alpha }_{3}\) and the ligand binding groove formed by \({\alpha }_{2}\) and \({\beta }_{2}\) (Fig. 7a)58.

Fig. 7: Structures of PDZ domains and allosteric sites.
Fig. 7: Structures of PDZ domains and allosteric sites.
Full size image

a Structure of PDZ2 domain (PDB: 3LNY) bound to 6-residue (EQVSAV) peptide. Different structural elements are labeled. d1: distance between the CoMs of α1 and upper β1, marking the opening of the α1-β1 cleft. d2 Distance between the CoMs of α2 and β2, characterizing the opening of the binding groove. b Structure of Par6 CRIB-PDZ domain (PDB: 1NF3) aligned to PDZ2 (White). The CRIB domain is colored Pink. c Structure of PDZ3 domain (PDB: 1TP5) bound to a 6-residue (KKETWV) peptide aligned to PDZ2 (White). The α3-helix is colored Pink.

The difficulty in understanding PDZ allostery is that crystal structures across different PDZ domains in both ligand-bound and ligand-free states are virtually identical (Fig. 7)55, challenging the conventional view that allosteric effectors modify ligand affinity by exploiting the structural differences between apo and holo states. It is difficult to understand how the perturbations introduced by effectors—Cdc42 or \({\alpha }_{3}\)—at distal sites are communicated to the binding groove and alter ligand affinity. Despite intensive efforts for over two decades, the mechanism of PDZ allostery remains elusive. This ongoing challenge is encapsulated by the title of a recent review: Allostery Frustrates the Experimentalist44.

The process underlying PDZ domain allostery is its ligand binding; it has been speculated that conformational changes during this process are responsible for allostery59,60. A solid understanding of ligand binding could resolve the PDZ puzzle unequivocally. However, the time scale for PDZ ligand unbinding—estimated to be 10 to 100 ms—is too slow for MD59. Consistent with this estimation, in an extensive MD simulation that totaled 0.5 ms, no ligand unbinding or discernable conformational change in PDZ2 was observed59. Therefore, ligand unbinding from PDZ2 provides an ideal blind test for our method’s predictive capability.

Mechanism of PDZ allostery from predictive enhanced sampling

Supplementary Fig. 5a shows the potential energy flows along SCs from energy relaxation of PDZ2 bound to an eight-residue (ENEQVSAV) RA-GEF2 peptide42. There is a clear gap between the first six SCs (\({u}_{0}\) to \({u}_{5}\)) and the rest, suggesting they are the tRCs. To distinguish, we refer to them as the tRCs from now on.

RC-uncovered trajectories show significant PDZ2 conformational changes and ligand dislocation from the binding groove. Supplementary Fig. 5b presents end structures along RC-uncovered trajectories of the six RCs. They show a consistent pattern: the ligand-binding site opens between \({\alpha }_{2}\) and \({\beta }_{2}\), while the Cdc42 binding face spanning over the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft significantly expands due to the large shift of \({\alpha }_{1}\) and the \({\alpha }_{1}\)-\({\beta }_{4}\) loop (Figs. 8, 9). These results strongly suggest that PDZ allostery is due to effectors interfering with conformational changes critical for ligand binding. To validate this hypothesis and pin down the allosteric mechanism, we generate NRTs (Fig. 8) for ligand unbinding by applying the shooting move on TS conformations from RC-uncovered trajectories10,11. Figure 9b shows example TS conformations, which are identified by their ability to successfully produce NRTs. They all show the opening of the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft and the binding groove, highlighting the critical importance of these changes to ligand unbinding. Across the TS ensemble, opening of the binding groove is virtually identical, whereas opening of the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft span a small range marked by the TS conformations in cyan and yellow in Fig. 9b.

Fig. 8: Snapshots from a natural trajectory for PDZ2 ligand unbinding.
Fig. 8: Snapshots from a natural trajectory for PDZ2 ligand unbinding.
Full size image

The crystal structure of PDZ2 is shown in White as reference to illustrate the transient nature of the conformational changes during ligand binding. Red arrows point to locations of large-scale conformational changes. Source data are provided as a Source Data file.

Fig. 9: Conformational dynamics of PDZ2 during ligand unbinding.
Fig. 9: Conformational dynamics of PDZ2 during ligand unbinding.
Full size image

a Time evolution of the three parameters (\({d}_{1},{d}_{2},{d}_{3}\)) we used to characterize PDZ2 conformational change along the trajectory shown in Fig. 8. d1 indicates the opening of the α1-β1 cleft, \({d}_{2}\) characterizes the opening of the binding groove, and d3 is the distance between ligand and the binding site. Definitions of d1 and d2 are given in Fig. 7. TS is placed at time 0 for easy comparison. b TS ensemble (Gray) harvested from RC-uncovered trajectories. The crystal structure is shown in Purple as comparison. TS conformations in Yellow and Cyan show the largest and smallest conformational changes among all the TS we harvested. The ligands are shown in cartoon representation. All the TS conformations are aligned with the crystal structure using the parts that do not show conformational changes. c Steric clashes in the TS conformation of the trajectory in Fig. 8. Crystal structure of PDZ3 is shown in White and TS is shown in Purple. Residues involved in the steric clashes are shown in spheres. Residues in α3 of PDZ3 are in Cyan. Residues Tyr36 and Lys38 of PDZ2 are in Yellow; Val30 and Arg31 are in Purple. Source data are provided as a Source Data file.

The NRT for ligand unbinding in Fig. 8 (Supplementary Video S5) shows a transient large-scale conformational change in PDZ2 that lasts 20 to 30 ns. This conformational change shares the same features shown in RC-uncovered trajectories: opening in both the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft and the binding groove. These two conformational changes are concerted. Figure 9a shows the time evolution of three CVs for characterizing PDZ2 conformational changes and ligand dissociation along the NRT in Fig. 8. The distance \({d}_{1}\) between the CoMs of \({\alpha }_{1}\) and upper \({\beta }_{1}\) marks the opening of the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft, \({d}_{2}\) between the CoMs of \({\alpha }_{2}\) and \({\beta }_{2}\) characterizes the opening of the binding groove, and \({d}_{3}\) between CoMs of the ligand and the binding groove delineates ligand dissociation. During ligand unbinding, the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft opens first, marked by the increase of \({d}_{1}\) from 8 \({{\text{\AA}}}\) in the crystal structure to 12 \({{\text{\AA}}}\) in the TS. After reaching the TS, \({d}_{1}\) starts to decrease while \({d}_{2}\) starts to increase from its value of 10 \({{\text{\AA}}}\) in the crystal structure until it reaches 16 \({{\text{\AA}}}\). The period of \({d}_{2}\) increase coincides with the dislodging of the ligand, which begins to dissociate rapidly from the binding groove once \({d}_{2}\) reaches 16 \({{\text{\AA}}}\). After the ligand fully dissociates, both the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft and the binding groove return to their crystal structure conformation in 10 ns. As shown in Fig. 8, the opening of the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft and the binding groove occur only during the barrier crossing process, leaving the PDZ2 conformation at the beginning and the end of the NRT effectively the same as the crystal structure. The transient duration of this critical conformational change explains why it has never been observed in experiments, demonstrating the unparalleled value of NRTs in providing mechanistic insights into protein function.

The \({\alpha }_{1}\)-\({\beta }_{1}\) cleft is the major component of the Cdc42 binding interface (Fig. 7b), thus Cdc42 binding will hinder its opening. As shown in Fig. 9c, opening of the \({\alpha }_{1}\)-\({\beta }_{1}\) cleft moves the \({\alpha }_{1}\)-\({\beta }_{3}\)-\({\beta }_{2}\) block toward \({\alpha }_{3}\), while opening the binding groove moves the \({\beta }_{2}\)-\({\beta }_{3}\) loop toward \({\alpha }_{3}\). The combined effects are the severe steric clashes in the TS (Fig. 9c): between α3 and two residues (Tyr36, Lys38) in \({\beta }_{3}\), and between \({\alpha }_{3}\) and two residues (Val30, Arg31) in the \({\beta }_{2}\)-\({\beta }_{3}\) loop. These results suggest that both Cdc42 and \({\alpha }_{3}\) modify ligand affinity to PDZ by interfering with the transient conformational changes during ligand unbinding, potentially slowing down the dissociation process. This interaction provides a straightforward and intuitive mechanism for PDZ allostery. Overall, our simulation results demonstrate the predictive capability of our method by enabling the detection of large-scale conformational changes critical for PDZ allostery that eluded intensive studies for two decades44.

Discussion

In this work, we discovered that tRCs control both activation and energy relaxation, revealing a surprising reciprocity between these processes despite their significant differences in energy, timescale, and motion. Energy dissipated in relaxation is on the order of thermal energy, whereas activation requires energy far beyond thermal levels. Additionally, while energy relaxation occurs within a few picoseconds, ligand unbinding in HIV-PR can take over 200 hours. Energy relaxation is also characterized by small-amplitude vibrational motions, in contrast to the large-scale conformational changes typical of activation. These differences highlight the complexity of the underlying mechanisms. Yet, both processes are governed by the same set of essential coordinates—the tRCs—demonstrating a fundamental unity between these seemingly disparate phenomena. If this reciprocity is confirmed across other proteins, it could lead to a generalization of the fluctuation-dissipation theorem in proteins, extending its applicability to macroscopic energy scales.

Our finding has significant applications in enhanced sampling, the primary method for bridging the time-scale gap between MD simulations and functionally important protein processes. The main challenge in enhanced sampling is identifying CVs that can effectively accelerate protein conformational changes. While tRCs are widely regarded as the optimal CVs, their identification previously required NRTs, which themselves required effective enhanced sampling, creating a paradox. Our discovery allows for the computation of tRCs from energy relaxation at minimal computational cost, enabling predictive and efficient enhanced sampling of protein conformational changes. This significantly broadens the range of protein functional processes accessible to MD simulations.

The GWF is a general, flexible method that can identify nonlinear tRCs through piecewise linearization, as detailed in ref. 43. In practice, for all processes we have studied, the tRCs have been linear within numerical error34,43. It remains to be seen whether this linearity is a general feature of tRCs in proteins and if there is a fundamental physical reason for it.

Methods

The workflow for computing tRCs and generating NRTs of protein conformational changes consists of six steps. (1) Simulate the energy relaxation of the input protein structure. (2) Compute GWF from the energy relaxation trajectories using Eq. (8). (3) Apply singular value decomposition to \(\langle \Delta {\mathbb{W}}\left(0\to t\right)\rangle\) and compute the PEFs of the singular coordinates. (4) Identify the leading singular coordinates as the tRCs, distinguished by a clear separation between their PEFs and those of the other singular coordinates. (5) Generate RC-uncovered trajectories by applying bias potentials to move the system along the tRCs. (6) Generate NRTs by applying the shooting move to selected conformations on the RC-uncovered trajectories.

All simulations are constant NVE and use the CHARMM36m force field and TIP3P water model61,62. For HIV-PR, the simulation system consists of 71,589 atoms from water and 3280 atoms from protein and ligand. For PDZ2, the simulation system consists of 22,101 atoms from water and 1482 atoms from protein and ligand. Simulations of energy relaxation and RC-uncovered trajectories are performed with GROMACS 2019.2 using CPUs, whereas natural trajectories are generated using GROMACS 2022 on GPUs63. All the bonds involving hydrogen atoms are constrained using the LINCS algorithm64. Time step used in all simulations is 1 fs.

Simulating energy relaxation

To simulate energy relaxation of ligand-bound HIV-PR and PDZ2 domain in explicit solvent, extra kinetic energy is deposited into the ligand to increase the temperature of each ligand coordinate by 400 K, the same as we did for myoglobin42. For energy relaxation of ligand-free HIV-PR in implicit solvent, extra kinetic energy was injected into atoms of active site residues (residues 25, 28, 29) instead. Afterwards, we run regular MD simulation of the system for 5 ps. This is to mimic the process of dissipating the excess energy at the active site after ligand binding or enzymatic reaction. We select 5 ps simulation time because we found that systematic energy flows in the system stop after 5 ps in our study of myoglobin42. To test if the SCs depend on the specific amount of excess energy deposited into the ligand, we also tried increasing the ligand temperature by 150 K and 800 K. The results are the same.

Clustering energy relaxation trajectories

For the GWF analysis, 8000 energy relaxation trajectories are divided into 6 clusters (\({E}_{0}\left({q}_{i}\right)\) to \({E}_{5}\left({q}_{i}\right)\)) using k-mean clustering. Visual inspection of PEFs suggests the presence of six clusters (Supplementary Fig. 1), a pattern consistently observed across all energy relaxation systems studied, including myoglobin42, HIV-PR and the PDZ2 domain. Clustering starts with seed trajectories identified by visual inspection. Each energy relaxation trajectory is assigned to the cluster with the closest seed trajectory, resulting in six initial clusters. Trajectories are then re-assigned to the cluster where their average distance to cluster’s trajectories is the shortest. This process repeats until cluster assignments no longer change. The GWF was computed using cluster \({E}_{0}\left({q}_{i}\right)\), which contains 3300 trajectories. More details are provided in Supplementary Note 1.

Generating RC-uncovered trajectories

To generate RC-uncovered trajectories, we apply bias potentials to tRCs \({u}_{a}\,(a={\mathrm{0,1}},..,6)\). Because of the periodicity of angles, we replace ua by \({Q}_{a}={\sum }_{i=1}^{{N}_{a}}{U}_{{ia}}\cos \left({\chi }_{i}-{\chi }_{i}^{*}\right)\)65, where \({U}_{{ia}}\) is an element of the left singular matrix of the GWF, \({N}_{a}\) is the number of coordinates included in the definition of \({Q}_{a}\), \({\chi }_{i}\) denotes a dihedral, \({\chi }_{i}^{*}={\chi }_{i}^{0}+c\cdot {U}_{{ia}}\) is the target value of \({\chi }_{i}\), \({\chi }_{i}^{0}\) is the value of \({\chi }_{i}\) in the starting structure, and c is the constant that we use to decide how many units of \({u}_{a}\) we want to move the system. The use of cosine function is to remove the discontinuity caused by the periodicity of angular coordinates. Because the GWF inevitably contains noise, we do not include all the backbone dihedrals in \({Q}_{a}\). Instead, we only include \({\chi }_{i}\) if \(\left|{U}_{{ia}}\right| > \,\varepsilon\) to remove dihedrals that are included in \({u}_{a}\) due to the noise in the GWF. In current calculations, we used \(\varepsilon=0.03\), resulting in \({N}_{a}\in \left(240,\,290\right)\) for different RCs of HIV-PR.

To apply the bias force gradually, we divide the interval between the minimum value \(-c{\sum }_{i}{U}_{{ia}}\) and the target value \(c{\sum }_{i}{U}_{{ia}}\) of \({Q}_{a}\) into 100 bins, with the n-th bin spanning the interval \(\left[{Q}_{a,n},\,{Q}_{a,n+1}\right)\). At any instant t, when the system configuration is at \({Q}_{a}\left(t\right)\in \left[{Q}_{a,n},\,{Q}_{a,{n}+1}\right)\), it feels an bias potential of the form \(V\left({Q}_{a}\right)=\frac{1}{2}k{\left({Q}_{a}\left(t\right)-{Q}_{a,n+1}\right)}^{2}\). In this way, the system always feels a gentle force pulling it towards the target value, with the center of the bias potential shifting adaptively as the system moves from one bin to another. Even though the relationship between \({Q}_{a}\) and \({u}_{a}\) is nonlinear, Fig. 5 shows that applying a bias potential to \({Q}_{a}\) moves \({u}_{a}\) continuously and efficiently, suggesting that the bias force on \({Q}_{a}\) is effectively translated into a force acting on \({u}_{a}\). Because \({Q}_{a}\) is a unitless variable, the spring constant \(k\) has a unit of \({{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\).

To achieve multiple rounds of reversible transitions, we will reverse the direction of the bias potential once the system reaches the terminal bins centered at \(\Delta {u}_{0}=0\) and 2.6 radians respectively, and has stabilized there for 50 ps. To balance the duration for flap opening and closing transitions and minimize the overall duration of each cycle, we empirically selected \(k=2000\,{{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\) for flap opening and \(k=500\,{{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\) for flap closing, respectively.

Generating NRTs

To generate NRTs using the shooting move10, we first identify a set of candidate TS conformations from an RC-uncovered trajectory based on visual inspection and intuition. From each candidate conformation, we launch 5 pairs of MD trajectories with opposite initial momenta. A natural reactive trajectory is successfully generated when a pair of MD trajectories launched from a candidate conformation reached opposite basins. This procedure also allows us to identify TS conformations by their success in generating natural trajectories.

Enhanced sampling with empirical CVs

For metadynamics simulation of MA/CA dissociation, we used the widely adopted well-tempered metadynamics implemented in PLUMED266. The parameters used in the simulations are: Gaussian height 0.2 \({{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\), Gaussian width 0.25 nm, and deposition of a Gaussian every 0.5 ps, all based on PLUMED2 recommendations and ref. 67. Additional Gaussian heights of 1, 100, 1000, and 2000 \({{\rm{kJ\; mo}}}{{{\rm{l}}}}^{-1}\) are also tested (Supplementary Fig. 4) to achieve MA/CA dissociation.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.