Introduction

Life depends on the light-driven fixation of CO2 during photosynthesis. Two major types of photosynthesis have been established during evolution. C3 photosynthesis, where primary carboxylation takes place in the Calvin-Benson cycle, is the typical pathway used by most plants. Higher efficiency in light-driven energy conversion is realized in C4 plants, where primary carbon fixation is spatially separated from carbon release to the Calvin-Benson cycle. Here, the rate limiting regeneration of the primary CO2 acceptor phosphoenolpyruvate (PEP) is catalyzed by the multi-domain protein pyruvate phosphate dikinase (PPDK)1. In addition to its essential role in C4 plants, PPDK is also found in various bacteria and protozoa, where the enzyme catalyzes the reverse reaction–ATP formation from AMP, pyrophosphate and PEP–during glycolysis2. The different directions of the catalytic reaction favored in plants and microorganisms underline that the enzyme is a reversible molecular machine. Bacterial PPDKs are known to form fully active homodimers, while for a number of plant PPDKs the active form was attributed to a homotetramer3,4,5. Ubiquitous distribution and low expression pattern6 preclude in vivo detection of C3-PPDK catalytic products against a background of other pyruvate/PEP converting enzymes such as pyruvate kinase or PEP carboxykinase2, leaving the physiological function of PPDK in C3 plants largely unknown. Though, cytoplasmic C3-PPDK is considered to play a role as secondary glycolytic enzyme based on its abundance in developing rice endosperm7.

Previous studies emphasize that PPDK has a modular structure consisting of distinct substrate binding domains for PEP/pyruvate at the C-terminus (PBD) and for nucleotides ATP/AMP at the N-terminus (NBD, aa 1–340) (Supplementary Fig. 1a). The NBD is structured in three subdomains: NBD1 (aa 1–111 and 197–243), NBD2 (aa 112–196), and NBD3 (aa 244–340). A central domain (CD, aa 380–515) catalyzing the phosphoryl group transfer from ATP to pyruvate–or from PEP to AMP–via a phospho-histidine intermediate mediates a catalytic cross-talk between the distant substrate binding domains (Supplementary Fig. 1b)8,9. Distinct conformational states resolved in crystal structures from maize and the non-plant organisms Clostridium symbiosum and Trypanosoma brucei suggest that the phosphoryl group transfer in the catalytic cycle is accompanied by a large swiveling motion of the CD from a position next to the NBD to a position facing the PBD (~110° and 45 Å)8,9,10,11. Similar single domain conformational re-arrangements have been observed for other proteins or protein assemblies, and domain swiveling is a common mechanism in enzyme catalysis, molecular transport, or electron transfer12,13,14,15. Still, the proposed translocation of the PPDK CD reflects one of the largest single domain movements observed in proteins yet. However, detailed insights into the molecular processes during the proposed swiveling motion, in particular on intermediate conformations of the CD and NBD, and potential driving forces behind the motion have remained elusive. Here, we co-crystallized the C4-PPDK from Flaveria trinervia and the related C3-PPDK from Flaveria pringlei with their natural product PEP and the substrate analogue 2′-Bromo-2′-deoxy-adenosine-5′-[(β, γ)-imido]triphosphate (2′-Br-dAppNHp). Both isoforms share 96% sequence identity–excluding the chloroplast transport peptide–and have a monomeric molecular weight of ~95 kDa. Using X-ray crystallography, we solved for the first time structures of ternary complexes of this multi-domain enzyme. The C4-PPDK structures (PDB 5JVJ and 5JVL) resolve intermediates of the NBD opening-closing motion with 5JVL providing the first structural model with bound nucleotide in the NBD. In addition, these structures substantiate that the catalytic histidine in the CD mediating phosphoryl group transfer between the active sites at the N- and C-terminus is positioned in close proximity to the bound PEP molecule. More importantly, the C3-PPDK structure from F. pringlei (PDB 5JVN) for the first time has trapped an intermediate state of the central domain, shedding light on sequential steps of the swiveling motion. The analysis of essential motions in available crystal structures and unrestrained molecular dynamics simulations reveal coupled motions of the CD and the NBD for non-phosphorylated PPDK. Extensive 1D and 2D potential of mean force (PMF) calculations of the CD motion also reveal the existence of distinct intermediate conformational states, resulting in sawtooth-like free energy profiles that are indicative of a Brownian ratchet mechanism biasing random thermal fluctuations. Furthermore, they suggest a tilting of the configurational free energy profiles depending on the binding state of the NBD and the phosphorylation state of the CD.

Results and Discussion

Overall structure of Flaveria PPDKs

The structure of the C4-isoform of PPDK from the flowering plant Flaveria trinervia was determined by molecular replacement at 2.9 Å resolution using the maize structure (PDB ID 1VBH)11 as a template. The structure (PDB 5JVJ) includes two monomers in the asymmetric unit (ASU) forming a dimer that corresponds to the previously described biological assembly of bacterial and maize PPDK8,11 with an overall well-defined electron density for the entire monomer A and for the PBD of monomer B. Parts of the NBD of monomer B revealed only poorly defined electron density, and direct tracing of monomer B in these regions was hampered. Yet, both monomers show electron density in the PBD for the co-crystallized substrate PEP. Besides, the NBD of monomer B exhibits additional density in both the mFo − DFc difference map and the feature enhanced maps (FEM, see methods section) probably reflecting a bound adenine nucleotide. The overall shape of this additional density is consistent with structural requirements and binding mode of adenine nucleotides in other nucleotide-binding proteins with the ATP-grasp fold16,17. In addition, this density complies with those observed when PPDK was crystallized in the presence of the nucleotide analogue 2′-Br-dAppNHp (see PDB 5JVL and Fig. 2c). However, since the molecular identity of the bound compound was not fully resolved at the present resolution, no compound was placed in this density in the deposited structure. Large parts of monomer B were successfully built using monomer A as a template by iterative manual model building and refinement. Yet, no conclusive electron density was found for residues 18–22, 47–65, 83–87, 101–106, 120–124, 163–166, 192–198 and 216–236. An overall root mean square deviation (RMSD) of 4.8 Å was calculated from a structural alignment of the individual monomers of the PPDK dimer in 5JVJ, indicating a substantial difference in their conformation. The main difference is found in the NBD of both monomers with the A monomer reflecting an open conformation and the B monomer reflecting a closed conformation of this domain. Overall, the orientation of NBD1 (aa 1–111 and 197–243) and NBD2 (aa 112–196) relative to NBD3 (aa 244–340) is greatly changed in the two monomer conformations (Fig. 1d–f). Superimposition of the NBD subdomains highlights that NBD1 is reoriented by a large motion of about 40° around a hinge region consisting of two short peptide linkers formed by residues 112–115 and 195–200 towards NBD3. At the same time, NBD2 is displaced by about 40° to accommodate for the new position of NBD1. The hinge motion of NBD1 and NBD2 results in the closing of the large cleft formed in the open configuration of the PPDK monomer (Fig. 1d–f). This cleft is no longer accessible in the closed form of the PPDK monomer. The presence of two structurally distinct conformations of the PPDK monomer (NBD open versus closed) within a single crystal structure may suggest that these structural asymmetries reflect functional asymmetries in substrate binding and/or catalytic turnover in the individual subunits of the PPDK dimer. A similar structural asymmetry in the monomer arrangement (open versus closed) was observed in the crystal structure of bacterial ATP-dependent DNA helicases and related to functional asymmetry in the mechanism of ATP hydrolysis in solution in each of the two subunits forming the functional dimer12,18.

Figure 2: Substrate binding sites of FtPPDK.
figure 2

(a) Semi-closed state of the PEP binding site (PDB 5JVL/A) with the catalytic H456 (yellow) pointing away from PEP. (b) Closed state of the PEP binding site (PDB 5JVL/C) revealing tight interactions between PEP and surrounding residues, including the catalytic H456 (yellow). (c) Closed state of the nucleotide binding site of 5JVL/D occupied with 2'-Br-dAppNHp. Interacting residues are highlighted.

Figure 1: Movement of the PPDK domains.
figure 1

(a–c) Movement of the CD (yellow). (a) FtPPDK in PBD-facing conformation (PDB 5JVL/C). (b) FpPPDK (PDB 5JVN) with CD in an intermediate position. (c) TbPPDK (PDB 2X0S)19 with CD in NBD-facing conformation. (d-f) Movement of the NBD (the three subdomains are depicted by three different greens). (d) FtPPDK in nucleotide-unbound state (PDB 5JVJ/A). (e) FtPPDK in semi-closed, nucleotide-bound state (PDB 5JVL/C). (f) FtPPDK in fully closed, nucleotide-bound state (PDB 5JVL/A).

When 2′-Br-dAppNHp was used for co-crystallization (PDB 5JVL), the crystal contains four monomers in the ASU arranged in terms of two dimers. The dimers correspond to the subunit arrangement in 5JVJ. Electron density maps (Supplementary Fig. 3) clearly indicate main chain connectivity and most of the side chain orientations for subunits A, C and D, whereas the majority of the NBD in chain B is not traceable. The overall structures of both FtPPDK crystal forms (Fig. 1a,d,e,f) are highly similar to PPDKs from C. symbiosum, T. brucei and Z. mays8,11,19 consisting of an N-terminal NBD (aa 1–340), a CD (aa 380–515) of β-strands with associated α-helices that contain the catalytic H456 in helix 20 involved in the phosphoryl group transfer between NBD and PBD, and a C-terminal TIM barrel containing the PBD (aa 535–874). For monomers C and D of 5JVL, the catalytic H456 is located in close position and appropriate orientation to the bound PEP substrate to mediate phosphoryl group transfer from the NBD to the PBD (Fig. 2a,b). Such a close contact has never been observed in previously reported PPDK structures, but has been resolved in Enzyme I of the E. coli Phosphoenolpyruvate:Sugar Phosphotransferase System (PTS), a bacterial carbohydrate import system bearing a homologous PEP binding domain20. For structural comparison of the catalytic sites in PTS Enzyme I and the PPDK a set of reference atoms was selected. While the catalytic histidine in the PTS Enzyme I structure is in the phosphorylated state, the corresponding histidine in the active site of PPDK structure 5JVL is in the non-phosphorylated form. On the contrary, PPDK substrate PEP carries a phosphoryl group, whereas PTS Enzyme I substrate inhibitor oxalate does not. Hence, atoms Nε2 of the catalytic histidine oxygen atoms O2 in oxalate and PEP in 5JVL were used for distance measurements and structural comparison of both enzymes at their PEP binding site. In the PTS Enzyme I structure, the distance between the phosphorylated Nε2 atom of the catalytic H189 and the oxygen atom O2 of the oxalate substrate is 4.9 Å. The distance between Nε2 of the catalytic H456 and O2 of the PEP substrate in chain C and D of the 2'-Br-dAppNHp-FtPPDK structure is similar (4.5 Å) (Fig. 2b), which emphasizes the close structural relationship of the newly described extreme conformation observed in 5JVL chains C and D and the PTS Enzyme I structure. This close spatial arrangement of the catalytic histidine and the phosphoryl group substrate in the PEP binding site enables efficient phosphoryl group transfer from PEP to H456 or H456 to pyruvate in PPDK as for PTS Enzyme I. The distance between the nucleophilic nitrogen atom Nε2 of H456 in the 2'-Br-dAppNHp-FtPPDK structure and the attacked phosphorus of PEP is around 3 Å (Fig. 2b). This relatively short distance is indicative for an associative in-line displacement of the phosphoric monoester21,22. Therefore, the structural conformations resolved in the 2'-Br-dAppNHp-FtPPDK structure visualize and verify for the first time the proposed phosphoryl group transfer mechanism in the PBD and the involvement of the catalytic histidine in the CD in this process.

Considering the overall CD, only small conformational changes between the plant structures from maize and Flaveria presented here and the non-plant structures from Clostridium and Trypanosoma are evident indicating that this domain primarily rotates as a rigid body. However, in contrast to the crystal structures of the non-plant PPDKs where the CD is close to the NBD, the CD of FtPPDK rests alongside the PBD (Fig. 1a) as observed for PPDK from maize11 and the Clostridium triple mutant R219E/E271R/S262D (PDB 2R82)9. The different conformations of the NBD observed in the Flaveria structures presented in this work cover the whole range of conformational intermediates observed with other members of the ATP grasp family such as synapsin23 or biotin carboxylase24 and reflect individual snapshots of the closing motion of this domain. These structural intermediates of PPDK have been previously proposed based on structures of homologous nucleotide binding enzymes25 but have not been experimentally verified until now.

Structural intermediate of the swiveling motion

In contrast to the C4 plant crystal structures from F. trinervia and maize, the structure of the C3-isoform of PPDK from F. pringlei (Fig. 1b) contains only one monomer in the ASU (PDB 5JVN). Remarkably, in this structure the CD is positioned in an intermediate state between the NBD and PBD, thereby enlarging the solvent accessibility of the PEP binding pocket. The CD position is not enforced by crystal contacts and therefore represents a genuine structural intermediate of the proposed swiveling mechanism. The entire domain is rather mobile resulting in higher B-factors in this region. Consequently, the electron density of several side chains in this region is not well defined. Concerning the substrate and cofactor binding site, density is visible for both ligands, PEP and 2'-Br-dAppNHp. Additional density in the conformational intermediate was observed in close proximity of the phosphoryl group of the PEP substrate. This additional density might reflect an alternative position of the PEP substrate in the PBD. However, the available resolution and data quality prevent modelling of the PEP substrate in those positions in the active site. Compared to the near-PBD positioned conformation of the CD in the C4-isoform (PDB 5JVL/C and 5JVL/D) the entire CD in the FpPPDK structure is rotated by 45° towards the NBD around a rotation axis formed by the linker region (Fig. 3). When compared to the PPDK structure from Clostridium symbiosum (CsPPDK, PDB 1KBL)10, the CD is rotated in the opposite direction and displaced by 52° towards the PBD. Focusing on the catalytic H456 in the CD, this residue is displaced by 17 Å(Cα-distance) when the F. pringlei conformational intermediate (5JVN) and the PBD-facing FtPPDK structures (e.g. 5JVL/C) are aligned, and by 24 Å in the opposite direction when the NBD-facing conformation observed in the CsPPDK is compared to the FpPPDK structure, respectively (Supplementary Fig. 7). Hence, compared to the two extreme conformations of the CD next to the NBD or PBD previously resolved, the CD in the F. pringlei conformational intermediate shows about 50% of the rotational and translational movement of the proposed complete swiveling motion. The CD intermediate identified in our crystallographic studies suggests that the swiveling motion of this domain in the catalytic cycle proceeds via at least one sub-step, starting from a position near either the PBD (Fig. 1a) or the NBD (Fig. 1c) via the intermediate conformation resolved in 5JVN to a position near the other substrate binding site. Similarly, stepped movements have been described e.g. for the F1 ATPase rotary molecular motor26,27,28,29.

Figure 3: Stepped movement of the CD.
figure 3

CDs of 5JVL/C (A, orange) and 5JVN (B, yellow). Helix 20 containing the catalytic H456 is depicted as cylinder with the Cα atom of H456 shown as red sphere. The rotational axis for the transitions between states A and B is depicted as a blue arrow. The distance between the Cα atoms of the catalytic His456 is shown as a dashed line.

To elucidate critical contacts mediating interactions between the CD or linker domain (LD) and the PBD, the FpPPDK conformational intermediate was analyzed with Cytoscape30 and RINalyzer31, which identified a salt bridge formed between E804 and R462, and non-polar patches at I864 in the PBD and L378 in the LD likely involved in hydrophobic interactions. Both interactions may restrict rotational freedom of the CD in this state, promoting the stepped motion of the CD. MD simulations confirm the presence of these interactions, although the salt bridge repeatedly forms and breaks (Supplementary Fig. 6f–h). Conservation analysis of these interactions by ConSurf32 revealed that only the non-polar patches are universally conserved among plant and non-plant PPDKs, whereas the identified salt bridge is unique to plant PPDKs, although non-plant enzymes might realize a corresponding interaction by an alternative salt bridge formed by R462 with an aspartate residue at position 801.

Conformational space spanned by PPDK structures

To investigate the conformational space spanned by all available PPDK structures including the three newly resolved ones, the structures were clustered with respect to structural similarity. The clustering reveals five conformational states (Fig. 4a; Supplementary Tab. 2). The FtPPDK structures described above constitute cluster II (PDB 5JVJ/B and 5JVL) and the FpPPDK structure cluster IV (PDB 5JVN) (Supplementary Tab. 1 and Supplementary Tab. 2), demonstrating that these structures populate hitherto uncharted regions of PPDK’s conformational space. In addition, PDB 5JVJ/A supplements cluster III.

Figure 4: Motions of PPDK.
figure 4

(a) Overview of all currently known crystal structures of the PPDK clustered according to Cα atom RMSD (Roman numerals), showing the CD near the PBD (left), between PBD and NBD (middle), and near the NBD (right) as well as the NBD in an open (top) or closed (bottom) conformation. Conformational states observed for the first time in this work are marked with an orange circle. (b) Representation of atomic displacements along the directions of the first two principal components (1st: gold; 2nd: silver) obtained from a PCA over the representative structures of each cluster depicted in panel (a). The amplitudes of the motions were scaled, and a cutoff for small displacements was applied for best graphical representation. (c) Cross-correlation maps of Cα atom fluctuations of the representative structures of each cluster (upper triangular) and from an aggregate MD simulation time of ~10 μs (lower triangular). The two axes on the left and at the bottom refer to residue indices according to the F. trinervia numbering. Positive correlations are indicated in blue, negative correlations in red (see color scale). Residues not resolved in all compared crystal structures are shown in black. Substructures of PPDK are labeled on the top and right axes. Correlated movements within domains are marked with squares, correlated movements between the domains are marked with boxes with round corners.

A principle component analysis in Cartesian coordinate space (PCA) across PPDK crystal structures corroborates the existence of two predominant motions: ~89% of the variance in the Cα atom coordinates can be explained by the first two principal components (PC) (Supplementary Fig. 4a). Atomic displacements along the component directions and a domain-wise index of the collectivity of motions (eq. 1, Supplementary Fig. 4b) reveal that the first PC predominantly characterizes a swiveling motion of the CD (Fig. 4b). The second PC characterizes a coordinated opening-closing motion of the NBD, with almost exclusively the first and second subdomain executing the movement (Fig. 4b and Supplementary Fig. 4b). Notably, the first PC also indicates a coupling of the swiveling motion of the CD with the opening-closing motion of the NBD as this PC leads to displacements in both domains (Fig. 4b). Cross-correlations of atomic fluctuations computed from the cluster-representative crystal structures agree very well with this result (Fig. 4c/upper triangle): They reveal, aside from positively correlated motions of the NBD, CD, and PBD themselves, weakly anti-correlated motions between the CD and the NBD (correlation coefficient down to −0.2), in agreement with the collective motions described by the first PC (Fig. 4b). Particularly, subdomains NBD1 and NBD2 move towards the CD if the latter swivels from the PBD to the NBD and vice versa.

To exclude any bias by the small number of available crystal structures, we further explored the conformational space of FtPPDK by molecular dynamics (MD) simulations of in total ~10 μs length (Supplementary Tab. 3). Cross-correlations of atomic fluctuations computed from the MD simulations yield a qualitatively and quantitatively highly similar result compared with correlated motions computed from the cluster-representative crystal structures (Fig. 4c), confirming the anti-correlated motions between the CD and the NBD (correlation coefficient down to −0.15).

Directionality of the swiveling motion

The above analyses suggest the FpPPDK structure (cluster IV, Fig. 4a) to be an intermediate of the swiveling motion of the CD and indicate coupled motions between the CD and the NBD. To corroborate these findings, we computed the configurational free energy (potential of mean force, PMF) using the distance between H456Cα–H565Cα(distanceCD–PBD, Supplementary Fig. 4c, numbering according to F. trinervia and F. pringlei, see also Supplementary Tab. 1) as a reaction coordinate. This reaction coordinate represents the swiveling motion very well (Supplementary Fig. 4d). PMFs were computed by umbrella sampling along distanceCD–PBD for three plausible transition paths obtained by targeted constrained geometric simulations33 between start/end states of FtPPDK from cluster III/cluster V, cluster III/cluster I, and cluster II/cluster I (Fig. 4a and Supplementary Fig. 5a). These transition paths were chosen such that the swiveling motion occurs in the presence of an open NBD (Fig. 5a/d), a closed NBD (Fig. 5c/f), or where the NBD closes when the CD approaches it and vice versa (Fig. 5b/e). While the opening-closing motion of the NBD was not restrained during the umbrella sampling, the observed sampling very well represents these paths (Fig. 5). DistanceCD–PBD was sampled in intervals of 1 Å, leading to a sufficient overlap of the sampling windows (Fig. 5).

Figure 5: 1D Potential of mean force of the swiveling motion of the CD with the distance H456Cα − H565 used as a reaction coordinate.
figure 5

Results for the non-phosphorylated state are depicted in (a–c), those for the state with phosphorylated H456 in (d–f). (a,d) depict results obtained with open NBD, (b,e) those with a simultaneous opening-closing of the NBD, and (c,f) with closed NBD. At the top, schematic representations of each state at the respective endpoints of a PMF are shown. In the middle row, sampled conformations are projected onto distanceCD–PBD and distanceNBD1 − NBD3 (Supplementary Fig. 4c), with each color representing one MD simulation with an umbrella potential applied at a given value of distanceCD–PBD. The PMFs are depicted in the bottom row. The diamonds show projections of PPDK crystal structures in conformational states marked by Roman numbers (Fig. 4a) onto the plane spanned by the two reaction coordinates (using for each organism the corresponding residues to evaluate the reaction coordinates (Supplementary Tab. 1))

The obtained PMFs (Fig. 5a,b,d,e) show several remarkable characteristics. First, the overall precision is high, with standard deviations for all points <0.15 kcal mol−1. Second, the free energy difference between conformational states III and I, or III and V (Fig. 4) is <2 kcal mol−1 (Fig. 5a,b,d,e), showing that the respective higher-energy conformational state is populated to ~2% at room temperature. Third, we identify a stable conformational intermediate of the CD swivelling motion, revealed by pronounced minima in the PMFs at a distanceCD–PBD ~30–35 Å (Fig. 5a,b,d,e), which is structurally highly similar to the intermediate state IV found in the FpPPDK crystal structure (located at a distanceCD–PBD ~25 Å) as shown by a Cα atom RMSD of 2.7 Å. Note that the FpPPDK crystal structure was not used for generating the transition path, hence, no information about conformational state IV entered the PMF calculations. The presence of the intermediate state leads to a sawtooth-like PMF. Fourth, the PMFs reveal for PPDK with non-phosphorylated H456 (Fig. 5a,b) that the correlated movement of the CD and the NBD results in state I being favored over III by 1.4 kcal mol−1 (Fig. 5b); in contrast, if the CD moves with the NBD remaining open, state V is disfavored over III by 2.1 kcal mol−1 (Fig. 5a). Therefore, movement of the non-phosphorylated CD from the PBD towards the NBD is exergonic only if it is coupled to a closing motion of the NBD, which is in line with the swiveling domain model8. This finding corroborates cross-correlations between these motions observed from crystal structures (Fig. 4c, upper triangle) and from structures obtained by MD simulations (Fig. 4c, lower triangle). Fifth, the PMFs suggest that the phosphorylation state of the CD influences the preferred direction of motion of this domain: When H456 is phosphorylated, state III becomes favorable over V by 1.5 kcal mol−1 (Fig. 5d) or is only slightly disfavored over I by 0.5 kcal mol−1 (Fig. 5e). Compared to the non-phosphorylated state, the sawtooth-like PMF is thus tilted towards III. Therefore, with phosphorylated H456, the movement of the CD from the NBD towards the PBD is exergonic or approximately isoenergetic. This result is also in line with the swiveling domain model8. In the phosphorylated state, coupled motions between the CD and the NBD have a smaller influence on the energetics of the conformational states. We speculate that the electrostatic repulsion between the phosphate group at H456 and reaction products, including adenosine monophosphate, still bound to or being in the vicinity of the NBD, fosters the CD movement towards the PBD instead.

Complex, stepped swiveling motion

PMFs calculated for the transition between conformational states I and II unexpectedly do not reveal a structural intermediate along the transition path and show a marked prevalence of state I (free energy difference >12 kcal mol−1), irrespective of the phosphorylation state of H456 (Fig. 5c,f). To provide an explanation for this observation and further details on the coupling between motions of the CD and the NBD, we computed a 2D PMF, using as reaction coordinates distanceCD–PBD and the distance between S215Cα–E272 (distanceNBD1−NBD3, Supplementary Fig. 4c, numbering according to F. trinervia and F. pringlei, see also Supplementary Tab. 1) for the non-phosphorylated PPDK. The reaction coordinate distanceNBD1−NBD3 represents the opening-closing motion of the NBD very well (Supplementary Fig. 4e). Reference points for umbrella sampling were generated using targeted constrained geometric simulations33 between start/end states of FtPPDK from cluster III/cluster I (Supplementary Fig. 5a), with further reference points added using the conformation after 6 ns of umbrella sampling as a starting point for the next interval. The sampling windows overlap well along the two reaction coordinates (Supplementary Fig. 5b–d), and the 2D PMFs are qualitatively indistinguishable irrespective whether only the first half, the second half, or the complete sampling time is used for their calculation (Supplementary Fig. 5e-g), strongly indicating converged results. The most prominent feature of the PMF is that conformational states III and I reside in or close to minima of the free energy landscape (Fig. 6). Furthermore, a shallow free energy minimum is identified at distanceCD–PBD 28 Å and distanceNBD1−NBD3 38 Å (marked by a star in Fig. 6), lying close to the structural intermediate from the FpPPDK crystal structure (conformational state IV, Fig. 4a). The transition path between conformational state III and IV runs such that distanceNBD1−NBD3 remains almost constant while distanceCD–PBD changes by ~15 Å. In contrast, between conformational state IV and I, the transition paths continues in a diagonal manner, revealing a correlation between the swiveling motion of the CD (change of distanceCD–PBD by ~25 Å) and the opening-closing motion of the NBD (change of distanceNBD1−NBD3 by ~10 Å). These characteristics agree very well with those found by the 1D PMFs, where distanceNBD1−NBD3 only starts to decrease once distanceCD–PBD >25 Å (Fig. 5b/e). Overall, both PMF calculations thus reveal a complex, stepped swiveling motion of the CD with varying degrees of coupling to the NBD motions and proceeding via conformational state IV. Moreover, the 2D PMF reveals that conformational state II is located in a region of elevated free energy (Fig. 6), providing an explanation why a transition path obtained by targeted simulations between conformational states II and I results in a downhill motion towards state I (Fig. 5c,f). Unrestrained MD simulations confirm this finding (see section Supporting Results in the SI, Supplementary Fig. 6). A plausible explanation for why this state is observed in PDB 5JVL may be that protein-protein interactions between the PBD of chain B and the CD of chain A occur as crystal contacts, which can stabilize the CD near the PBD despite a closed NBD.

Figure 6
figure 6

2D Potential of mean force, with the distanceCD–PBD between H456Cα – H565Cα and the distanceNBD1−NBD3 between S215Cα − E272Cα used as reaction coordinate of the swiveling motion of the CD and the opening-closing motion of the NBD, respectively, (Supplementary Fig. 4c) computed for H456 in the non-phosphorylated state. The diamonds show projections of PPDK crystal structures in conformational states marked by Roman numbers (Fig. 4a) onto the plane spanned by the two reaction coordinates (using for each organism the corresponding residues to evaluate the reaction coordinates (Supplementary Tab. 1)) (I: PDB 1DIK, 2DIK, 1GGO, 1JDE, 1KBL, 1KC7; IIA: 5JVJ/B, IIB: 5JVL/A, IIC: 5JVL/C, IID: 5JVL/D; IIIA: 2R82, IIIB: 1VBG, IIIC: 1VBH, IIID: 5JVJ/A; IV: 5JVN; V 2X0S). The star marks a shallow energy minimum close to conformation IV.

Effective driving force of the swiveling motion

The analysis of crystal structures, unrestrained MD simulations, and configurational free energy calculations consistently and independently revealed the existence of structural intermediates of the swiveling motions and coupled motions of the CD and the NBD for non-phosphorylated PPDK. The respective 1D PMF of PPDK with non-phosphorylated H456 is sawtooth-like (Fig. 5b), with barrier heights for the transition from state III to I between 1.5–2.5 kcal mol−1 (2.5–4.1 kT at T = 300 K). For the reverse transition with phosphorylated H456, the 1D PMF also displays a sawtooth-like character (Fig. 5e), with barrier heights of 1.8 kcal mol−1 (3 kT at T = 300 K) and the free energy profile being tilted in favor of state III compared to the non-phosphorylated PPDK. The sawtooth-like free energy profiles suggest that PPDK can exploit random thermal fluctuations for directional motion of its CD. Typically, this type of profile is indicative of a Brownian ratchet mechanism34, pioneered by Feynman35 and Huxley36. Barrier heights on the order of kT, as associated with the CD motion in the PPDK catalytic cycle, suggest that a Brownian ratchet biases fluctuations rather than rectifying them37. To drive the respective directional motions, a non-equilibrium situation needs to be created that relaxes towards equilibrium. For the non-phosphorylated PPDK, this situation is suggested to be created by binding of ATP to the NBD, as such binding leads to a closing of the NBD due to the progressive formation of ATP/NBD interactions. Transmitted via coupled motions, the NBD closing then leads to a preference for the CD to be close to the NBD, at least in the second half of the transition pathway between PBD and NBD. However, we are aware that biased Brownian ratchets and power stroke motors have the same phenomenological behavior and are difficult to distinguish experimentally37. Hence, we cannot exclude that the conformational changes observed in the NBD of our structural intermediates induce a strain in the enzyme that would directly drive the CD motion upon ATP binding, thus resulting in a power stroke mechanism38. Overall, this situation is similar to ATP synthase and ATP-dependent molecular machines such as myosin, kinesin, or chaperonin parts, where the stepped motions in the catalytic cycle are triggered by binding of the high energy substrates PEP or ATP, respectively27. For the phosphorylated PPDK, at least part of the non-equilibrium situation is suggested to arise from electrostatic repulsion between the phosphorylated H456 and the NBD. A similar situation was created by introducing appropriately charged residues into a mutant of non-phosphorylated CsPPDK, resulting in the CD being adjacent near the PBD (PDB ID 2R82)9. In all, our analyses suggest that both changes of the binding state of PPDK and of its molecular identity ((non)phosphorylation) contribute to the enzyme acting as a molecular switch with respect to the swiveling motion.

Moreover, our crystallographic and molecular simulation data are indicative that PPDK might employ a Brownian ratchet mechanism biasing thermal fluctuations in order to generate a net directional CD motion. The coupling of this motion to the open-close state of the NBD revealed by the distinct conformational states (Fig. 1a–c and Supplementary Fig. 5a) and substrate binding states resolved in our dimeric structural intermediate of the PPDK catalytic cycle further suggests that the enzyme might employ an alternate binding change mechanism similar to ATP synthase or bacterial ATP-dependent DNA helicases.

Methods

Expression and purification of recombinant FtPPDK/FpPPDK

Codon-optimized coding regions of PPDK from Flaveria trinervia (EMBL-ENA: X57141)39 or Flaveria pringlei (EMBL-ENA: X75516)40, stripped of the chloroplast transport sequence, were cloned into the multiple cloning site of the pET-16b vector (Novagen) including a histidine10 tag and a Tobacco Etch Virus (TEV) protease cleavage site. The plasmid was used to transform E. coli BL21 (DE3) cells (Agilent Technologies). Previously described expression and purification protocols11 were adapted for FtPPDK/FpPPDK. Transformed E. coli BL21 (DE3) cells (Agilent Technologies) were grown in 2YT medium (5 gL−1 NaCl, 10 gL−1 yeast extract, 16 gL−1 peptone) with ampicillin at 30 °C to OD600 = 0.8. Protein expression was induced by the addition of 0.1 mM isopropyl-β-D-thiogalactopyranosid (IPTG). Cells were harvested 18 after induction by centrifugation. After harvesting, cells were suspended in lysis buffer (50 mM Tris/HCl pH 7.5, 300 mM NaCl, 10 mM imidazole, 10 mM MgSO4, 10% (w/v) glycerol, 5 mM DTT, 0.002% (w/v) phenylmethanesulfonylfluoride) and disrupted using a cell disruptor (Constant Systems). PPDK was purified using a nickel affinity chromatography column (GE Healthcare) using purification (50 mM Tris/HCl pH 7.5, 300 mM NaCl, 10 mM MgSO4, 10% (w/v) glycerol, 5 DTT) and elution buffer (50 mM Tris/HCl pH 7.5, 300 mM NaCl, 500 mM imidazole, 10 mM MgSO4, 10% (w/v) glycerol, 5 mM DTT). The loaded column was washed with 50 mM and 150 mM imidazole, 500 mM imidazole were used for the elution of PPDK. A PD-10 desalting column (GE Healthcare) was used to exchange the elution buffer against purification buffer before enzymatic cleavage of the affinity tag via Tobacco Etch Virus Protease (TEV) over-night at room temperature. Cleaved PPDK was concentrated by ultrafiltration (30 kDA cutoff, Millipore) and the buffer was exchanged using a PD-10 column. For crystallization trials, crystallization buffer (10 mM Tris/HCl pH 7.5, 5 mM MgSO4) was used for the buffer exchange step, otherwise PPDK storage buffer (50 mM Tris/HCl pH 8, 10 mM MgCl2, 0.1 mM EDTA, 5 mM DTT). Monodispersity of the sample was verified via size exclusion chromatography (SEC) and dynamic light scattering (DLS) (Supplementary Fig. 2a). Activity of purified PPDKs was confirmed by a coupled-enzyme assay in the PEP-forming direction41.

Crystallization

Initial crystallization trials were performed in microbatch technique. FtPPDK in crystallization buffer at a concentration of 10 mg mL−1 was incubated at room temperature for 10 min with 20 mM phosphoenolpyruvate (PEP) and either 10 mM nicotinamide adenine dinucleotide (NADH) or 1.5 mM 2'-Br-dAppNHp. The protein solution was mixed with precipitant at a 1:1 ratio resulting in a final volume of 2 μL. The drop was sealed using mineral oil (Sigma Aldrich). The optimized composition of the precipitant solution included 17% (w/v) PEG 3350, 100 mM MOPS (pH 7) and 100 mM magnesium formate with either 10 mM nicotinamide adenine dinucleotide (NADH) or 1.5 mM 2'-Br-dAppNHp added to the protein solution 10 min prior of mixing with the precipitant. Crystals sized approx. 500 × 40 × 10 μm3 appeared within 24 h after incubation at 21 °C, were transferred into a cryoprotection buffer comprised of the precipitant solution supplemented with 20% ethylene glycol and cryo-cooled in liquid nitrogen.

FpPPDK was crystallized in sitting drop technique. FpPPDK in crystallization buffer at a concentration of 10 mg mL−1 was incubated for 10 min with 20 mM PEP and 2 mM 2'-Br-dAppNHp at room temperature. The optimized reservoir solution was composed of 85 mM HEPES pH 7.5, 17% (w/v) PEG 4000, 15% (w/v) glycerol and 5% (v/v) isopropanol and mixed at a 1:1 ratio with the protein solution in a final drop volume of 2 μL. Crystals sized approx. 200 × 50 × 50 μm3 appeared within 48 h after incubation at 12 °C. The crystals were flash-frozen in liquid nitrogen without adding any additional cryoprotectant.

Structure determination

X-ray diffraction data for FtPPDK were collected at the European Synchrotron Radiation Facility (ESRF), Grenoble at beamline BM30A at 0.9799 Å wavelength for 5JVJ with an oscillation range of 1° per image spanning a total range of 151° at 100 K. Data for the 2'-Br-dAppNHp-bound form was collected at 0.9123 Å with an oscillation range of 0.5° per image and 360° of total range. The data were integrated and scaled with XDS42 and data reduction was performed with Aimless43 from the CCP4 suite44. Initial phases for 5JVJ were obtained by molecular replacement (MR) with Z. mays PPDK11 as a template (PDB 1VBH, 79% sequence identity) using Phaser45. The initial model included two monomers in the asymmetric unit (ASU). For one monomer (chain B) only the PBD was correctly placed in the electron density, hence the misplaced CD and NBD were removed manually. This model was subjected to multiple rounds of manual model rebuilding and extension using Coot46 and refinement by phenix.refine47 using local non-crystallographic symmetry restraints to account for the obvious conformational differences between both monomers. Parts of chain B were gradually retrieved in this process. The structure of the 2'-Br-dAppNHp-bound FtPPDK (5JVL) was determined by MR using the coordinates of the previously determined 5JVJ/A structure. The resulting model contained four monomers in the ASU. Three monomers were extended to completeness, the fourth monomer (chain B) exhibited inconclusive density for the NBD. The atomic displacement parameters (ADPs) for both crystal forms were refined individually and were partly described as groups of translation, libration and screw-motion (TLS)48.

Data of the FpPPDK were collected at the ESRF beamline ID29 at a wavelength of 0.9762 Å with an oscillation range of 0.1° per image and 360° of total range. The reflection data were processed with XDS. Initial phases were obtained by MR with Phaser using the PBD and NBD of 5JVL/D as starting model. The resulting structure was subject to automated model-rebuilding using Bucaneer49 which recovered the CD followed by iterative rounds of manual model rebuilding using Coot and refinement with phenix.refine and REFMAC550. In all cases, ligands were not modeled into excess density indicated by the mFo − DFc map (where m represents the figure of merit, D the σ-A weighting factor and Fo and Fc the observed (experimental) and calculated (model) amplitudes respectively) until near-final refinement rounds to reduce model bias. To enhance sensitivity for weak sidechain features, Feature Enhanced Maps (FEM) were used in the model building process. For FEM, a 2mFo − DFc σ-A-weighted map is modified to strengthen weak signals if present. This includes calculation of omit maps, map randomization and map sharpening. The resulting map shows enhanced sensitivity for weak features and reduced model bias compared to 2mFo − DFc maps51. The structure was validated using tools provided by Coot and PHENIX, in particular MolProbity52. Figures were generated using PyMOL53. Final parameter of refinement and model stereochemistry of Flaveria PPDK structures 5JVJ, 5JVL, and 5JVN are summarized in Table 1.

Table 1 Data collection and refinement statistics.

Comparison of PPDK crystal structures

All currently available crystal structures of PPDK were obtained from the Protein Data Bank54 (PDB) (Supplementary Tab. 1). The structure of PDB 5JVL, chain B and NMR derived structure of the CD only (PDB 2FM4) was excluded from further analysis as the NBD has not been resolved there. A multiple sequence and structural alignment of the PPDK structures was generated with PROMALS3D55. Residues resolved and common to all structures were identified from the alignment and used for further analysis.

Cluster analysis of PPDK conformations

Cluster analysis on all PPDK crystal structures listed in Supplementary Tab. 1 was performed with the CPPTRAJ56 module of the AMBER suite of programs57 using the hierarchical agglomerative (bottom-up) algorithm. As a distance measure, the best-fit Cα atom root mean square deviation (RMSD) of all residues common in the PPDK structures with all domains resolved (Supplementary Tab. 1) was used. A maximal distance between all members of two clusters (complete linkage) of <4 Å was used as terminating criterion for the clustering. For subsequent analysis, only those crystal structures were used that are cluster representatives in order to avoid a bias towards the number of crystal structures in the same conformational state.

Principle component analysis

To describe the essential dynamics of PPDK, a principle component analysis (PCA) in Cartesian space was performed on a set of experimental structures (Supplementary Tab. 1) and snapshots of MD simulations (Supplementary Tab. 3) using CPPTRAJ56 in a similar manner as described in refs 58 and 59. In detail, the coordinate covariance matrix was calculated for all Cα atoms present in all crystal structures without missing domains. The structures were RMS-fit to the average structure of all cluster representative crystal structures using the 15% least fluctuating residues to remove global transitional and rotational motion prior to calculating the coordinate covariance matrix. The symmetric matrix is diagonalized by an orthogonal coordinate transformation, yielding the eigenvalues and eigenvectors (principle components). An eigenvalue corresponds to the mean square eigenvector coordinate fluctuation (the variance) and, hence, describes how much a principal component contributes to the total coordinate fluctuations60.

To analyze the locality or collectivity of motions for the domains of PPDK, the collectivity index κ described in ref. 61 was calculated (eq. 1)

with N being the number of atoms in the domain, and being the relative displacement of the principal component. All values of were scaled consistently such that . A value of κ = 1 indicates a mode of maximal collectivity, that is, all are identical. Conversely, if only one atom is affected by the mode, κ reaches the minimal value of .

Homology modeling of FtPPDK conformational states

As there was no crystal structure of FtPPDK available at the beginning of this project, homology models of conformational states I, III, IV, and V were generated. From each cluster (Fig. 4a), the crystal structures with the highest resolution and without mutations that interfere with the enzymatic activity were used as templates (C. symbiosum structure 1KBL and 1KC7 for state I; Z. mays structure 1VBH and 1VBG for state II; F. pringlei structure 5JVN for state IV; and T. brucei structure 2X0S for state V). The sequence alignment of the FtPPDK sequence and the respective template sequence(s) was generated by MAFFT62. The sequence identities are for conformational state I: 55%; conformational state III: 79%; conformational state IV: 96%; conformational state V: 54%. The homology models were generated using the program MODELLER63 in a multi-template approach, applying the dope loop model algorithm, and including ligands (if present in the crystal structure). The quality of our models was assessed for conformational state III, for which now PDB ID 5JVJ, chain A is available: for NBD: 1.16 Å, for CD including the linker domain: 1.13 Å, and for the PBD: 0.93 Å. Such structural deviations are close to the experimental uncertainty of the crystal structure.

Molecular dynamics simulations of FtPPDK

The crystal structure of FtPPDK (5JVL/C; conformational state II) and homology models of FtPPDK for conformational states I, III, IV and V served as input structures for MD simulations. Three independent replicates of MD simulations were performed for each system which are summarized in Supplementary Tab. 3. Co-crystallized water and ligands were removed. Hydrogen atoms were added using REDUCE64, flipping side chains of Asn, Gln, and His when appropriate. These model systems were placed in a truncated octahedral box of TIP3P water65 leaving a distance of at least 11 Å between the solute and the border of the box. Counter ions were added to neutralize the systems. All MD simulations were performed with the ff99SB force field66 using the Amber suite of programs57. Parameters for the phosphorylated histidine were obtained from ref. 67. Bonds containing hydrogen atoms were constrained using the SHAKE algorithm68, and long range interactions were treated by the particle mesh Ewald (PME) method69. A time step of 2 fs was used. The system was equilibrated by, first, applying harmonic restraints to solute atom positions with force constants of at least 5 kcal mol−1 Å−2 for 100 steps of steepest descent and 400 steps of conjugate gradient minimization. Second, the temperature of the system was raised from 100 K to 300 K in 50 ps of NVT-MD simulations. Third, 150 ps of NPT-MD simulations were performed to adjust the system density. Finally, the force constants of harmonic restraints were gradually reduced to zero during 250 ps of NVT-MD simulations. Production NVT-MD simulations were carried out at 300 K, using the Berendsen thermostat70 and a coupling constant of 0.5 ps. Three independent replicates of MD simulations were performed for each system by spawning production runs after the thermalization at temperatures of 299.9 K, 300.0 K, and 300.1 K respectively. The first 2 ns of each trajectory were omitted from subsequent analyses. All unrestrained MD simulations are listed in Supplementary Tab. 3.

Generation of transition paths

For the potential of mean force calculations, plausible pathways of the swiveling motion and opening-closing motion have been generated using targeted normal mode-based geometric simulations by the NMSim approach33. NMSim is a three-step protocol for multiscale modeling of protein conformational changes that incorporates information about preferred directions of protein motions into a geometric simulation algorithm. In the first step, the protein structure is coarse-grained by the software FIRST71 into rigid parts connected by flexible links. For this, an energy cut-off for including hydrogen bonds (and salt bridges) of −1 kcal mol−1 and a distance cutoff for including hydrophobic constraints of 0.35 Å were used. In the second step, low-frequency normal modes are computed by rigid cluster normal mode analysis (RCNMA) with a 10 Å distance cutoff for considering interactions between Cα-atoms. In the third step, a linear combination of the first 50 normal modes was used to bias backbone motions along the low-frequency normal modes, while the side chain motions were biased towards favored rotamer states, generating 500 conformations in 500 simulation cycles with a step size of 0.5 Å and side chain distortion of 0.3. Targeted NMSim calculations33 were performed between start / end states of cluster III / cluster V, cluster III / cluster I, and cluster II / cluster I, using homology models for cluster III, V, and I, and the cluster-representative crystal structure for cluster II.

Potential of mean force calculations

Free energy profiles of the swiveling motion of the CD (and the opening-closing motion of the NBD) were computed along the NMSim-generated transition paths by umbrella sampling72 followed by the Weighted Histogram Analysis Method (WHAM)73. As a reaction coordinate for analyzing FtPPDK’s swiveling motion, the distanceCD–PBD between H456Cα – H565Cα was used (Supplementary Fig. 4c), as it changes monotonously between the two endpoints and provides an intuitive measure for the progress of the swiveling movement of the CD. The opening-closing motion of the NBD of FtPPDK was analyzed along the reaction coordinate distanceNBD1−NBD3, measured between S215Cα–E272Cα (Supplementary Fig. 4c). 1D (2D) umbrella sampling MD simulations were performed along reaction coordinate(s) distanceCD–PBD between 10 Å and 52 Å (9 Å and 54 Å) (and distanceNBD1−NBD3 between 26 Å and 42 Å) in intervals of 1 Å, applying harmonic potentials with a force constant of 1 kcal mol−1 Å−2 to tether the conformations to the respective reference point. This resulted in 42 (782) umbrella sampling simulations, each 42.5 ns (9 ns) long, excluding the first 2.5 ns (1 ns) from the WHAM analysis, for the 1D (2D) PMF. Approximately Gaussian-shaped frequency distributions were obtained for each reference point along the reaction coordinate(s), with all such distributions well overlapping (Supplementary Fig. 5b-d). The latter is a prerequisite for the successful application of WHAM73 to extract a PMF from these distributions. Bootstrapping was applied to compute the standard deviations at the reference points.

Additional Information

Accession codes: Coordinates of dimeric and 2'-Br-dAppNHp-bound FtPPDK have been deposited in the Protein Data Bank under accession codes 5JVJ and 5JVL. Coordinates of FpPPDK have been deposited under the accession code 5JVN.

How to cite this article: Minges, A. et al. Structural intermediates and directionality of the swiveling motion of Pyruvate Phosphate Dikinase. Sci. Rep. 7, 45389; doi: 10.1038/srep45389 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.