AlphaFold as a prior: experimental structure determination conditioned on a pretrained neural network

Fadini, Alisia; Li, Minhuan; McCoy, Airlie J.; Banjara, Suresh; Okumura, Hiroki; Napier, Eve; Fontana, Pietro; Khan, Amir R.; Jovine, Luca; Terwilliger, Thomas C.; Read, Randy J.; Hekstra, Doeke R.; AlQuraishi, Mohammed

doi:10.1038/s41592-026-03047-4

Download PDF

Article
Open access
Published: 01 April 2026

AlphaFold as a prior: experimental structure determination conditioned on a pretrained neural network

Nature Methods volume 23, pages 785–795 (2026) Cite this article

12k Accesses
3 Citations
115 Altmetric
Metrics details

Subjects

This article has been updated

Abstract

Advances in machine learning have transformed structural biology, enabling swift and accurate prediction of protein structure from sequence. However, key challenges persist in modeling side-chain packing, condition-dependent conformational changes and biomolecular interactions, largely because of limited high-quality training data. At the same time, emerging experimental techniques such as cryo-electron microscopy (cryo-EM), cryo-electron tomography (cryo-ET) and high-throughput crystallography are generating vast amounts of structural information but converting these data into mechanistically interpretable atomic models often remains difficult. Here we show that integrating experimental measurements directly into protein structure prediction can overcome these limitations. We introduce ROCKET, an augmentation of AlphaFold2 that refines predicted structures using cryo-EM, cryo-ET and X-ray crystallography data. By optimizing structures in the space of coevolutionary embeddings rather than Cartesian coordinates, ROCKET captures biologically meaningful structural variation that is inaccessible to AlphaFold2 alone and to existing automated modeling approaches, especially when the signal-to-noise ratio is low. ROCKET enables scalable, automated model building without retraining and provides a general framework for integrating experimental observables with biomolecular machine learning.

End-to-end cryo-EM complex structure determination with high accuracy and ultra-fast speed

Article 24 June 2025

CryoAtom improves model building for cryo-EM

Article 14 November 2025

The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins

Article 21 June 2024

Main

Machine learning (ML) has revolutionized structural biology by enabling highly accurate protein structure prediction. Breakthrough models such as AlphaFold2 (AF2)¹, RoseTTAFold² and their descendants harness coevolutionary signals in large-scale sequence data to produce predictions with atomic-level precision and near-experimental accuracy^3,4. Despite this accomplishment, computational approaches still struggle to capture important properties such as side-chain packing, functional dynamics and large-scale molecular assembly^5,6,7,8,9.

The success of these ML models relies heavily on a vast collection of experimentally resolved structures, made possible by decades of effort by the structural biology community. However, high-quality ground-truth data that capture multiple functional states, biomolecular interactions and structural variations are scarce. Advances in high-throughput experimental techniques are beginning to provide such data. For example, modern crystallography beamlines and single-particle cryo-electron microscopy (cryo-EM) can now yield datasets under many perturbations, for example, for drug screening and tracking of structural changes during biochemical transformations^10,11, while cryo-electron tomography (cryo-ET) enables in situ observations of macromolecular complexes^12,13. These advances promise deeper insights into conformational flexibility and complex assembly.

A key bottleneck in processing these large datasets is the reconstruction of atomic models from experimental observations. ML methods have helped streamline this process by providing high-quality starting points for atomic model building^6,14,15. Standard refinement software^16,17,18 improves these starting points by optimizing experimental likelihoods in Cartesian coordinate space, complemented by pattern-matching-based model building to overcome local barriers^19,20,21. This combination struggles when the structural rearrangements presented by the data are large, such as switches in secondary structure, flips in flexible loops or shifts in relative domain orientations. Model building becomes particularly challenging, even for humans, below 4–5-Å resolution, where critical structural details such as side chains become indistinct. This difficulty is especially pronounced in the emerging field of cryo-ET, which currently tends to produce data at low resolution²². New priors that guide model building could, therefore, yield more accurate atomic models for low-resolution datasets and limit labor-intensive manual intervention when initial structures deviate from the experimental data.

We hypothesized that the implicit prior structural knowledge embedded in pretrained ML structure prediction methods could guide atomic model building from experimental data more efficiently than traditional geometric restraints²³. Historically, the integration of model and experiment was facilitated by the explicitly physics-based nature of structure prediction methods, which readily permitted the incorporation of experiment-derived potential functions^24,25. ‘Black-box’ ML models such as AF2, however, make this integration less straightforward. Recent efforts to adapt ML-based structure prediction to incorporate experimental constraints have done so either through fine-tuning of weights from existing architectures^26,27 or full training of a novel architecture, as exemplified by the ModelAngelo method for cryo-EM data²⁸. These strategies are promising but computationally expensive, data hungry and limited to the data modality they were trained on. Another approach, PredictAndBuild^6,29, iterates between predicting structure conditioned on a template and rebuilding the predicted structures on the basis of experimental data to yield the next template. This approach avoids modifying the structure prediction method but its decoupled prediction and rebuilding steps can work against each other and hinder convergence.

Ideally, existing ML methods could serve as implicit structural priors, without retraining, to accelerate and automate atomic model building. Indeed, the ColabDock framework incorporates crosslinking experimental restraints in protein complex structure prediction³⁰ and other contemporaneous studies have begun exploring this direction for cryo-EM and crystallography but their applicability to challenging modeling tasks remains unclear^31,32. Here we combine the high accuracy of pretrained AF2 structure prediction with guidance from experimental data in a way that can be flexibly adapted to different data modalities. We achieve this through ROCKET (refining OpenFold with crystallographic/cryo-EM likelihood targets), a framework that integrates OpenFold³³, a trainable reimplementation of AF2, with differentiable crystallographic and cryo-EM likelihood targets³⁴. ROCKET refines structures at inference time (no retraining), steering predictions toward experimentally supported conformations (Fig. 1a). Inspired by strategies for expanding AF2 to conformational sampling^{35,36,37,38,39}, we directly optimize the embedded multiple-sequence alignment (MSA) cluster profile^36,37 in AF2, transforming structure refinement to a data-guided search within AF2’s continuous representation of sequences.

**Fig. 1: Data-driven structure prediction refinement with ROCKET.**

Cryo-EM and crystallography are the predominant techniques for producing atomic-level insights into conformational heterogeneity, macromolecular interactions and functional structural rearrangements. Despite crystallography’s promise for high-resolution, high-throughput screening of experimental conditions^40,41,42 and drug candidates⁴³, ML applications in protein crystallography^34,44 lag behind cryo-EM^28,45,46,47. To bridge this gap, ROCKET integrates both X-ray and cryo-EM data directly into OpenFold’s inference process.

For both data modalities, we find that ROCKET is particularly valuable for the unsolved challenge of model building at low resolution⁴⁸. Existing software struggles at resolutions worse than 4–5 Å; ModelAngelo was not trained to work with maps past 4 Å (ref. ²⁸) and PredictAndBuild will not automatically rebuild with maps worse than 3.5 Å (ref. ⁶). We find that ROCKET’s inference-time refinement allows it to explore a wide conformational space and it remains robust for atomic modeling in noisy, low-resolution maps, sometimes outperforming expert manual refinement. As ROCKET relies on an input sequence for AF2 inference, it is complementary to other ML approaches designed to identify proteins in cryo-EM maps^28,49,50. Its capabilities render ROCKET a generalizable approach for integrating experimental data with ML-based biomolecular modeling.

Results

Method overview

AF2-based structure prediction begins with the construction of an MSA, typically through a search of sequence databases for proteins homologous to the query of interest. The resulting MSA comprises aligned sequences of identified homologs, from which one can infer a protein’s evolutionary history, including its patterns of residue–residue coevolution. AF2 transforms raw MSAs into an input representation suitable for neural network computation, known as the MSA cluster profile. Through many studies and ablations, it has been shown that the depth and diversity of the MSA and the statistical patterns found therein determine the geometry and quality of a structure predicted by AF2 (ref. ¹). Given this central role, we reasoned that direct optimization of the continuous space of MSA cluster profiles would provide the greatest lever for influencing AF2 predictions, a hypothesis supported by previous work on using evolutionary coupling restraints to build molecular replacement templates⁵¹, recent observations made by other groups^36,37 and our own experiments (Supplementary Fig. 1).

To operationalize this principle, ROCKET augments OpenFold with a new module that optimizes MSA cluster profiles to maximize agreement between predictions and experimental observables (Fig. 1b and Methods). Through an experimental target function that quantifies this agreement, ROCKET performs gradient descent in the space of MSA cluster profiles. During each descent step, ROCKET computes a forward pass through OpenFold, evaluates the target function and its derivatives and then updates the MSA cluster profile to increase the function’s agreement with model predictions. ROCKET currently provides two target functions, for crystallographic and cryo-EM data, and is extensible to other data modalities.

As OpenFold-predicted structures are generated in an arbitrary reference frame, ROCKET first performs molecular replacement⁵² or cryo-EM docking⁵³ to align the predicted model with the experimental data before starting the iterative refinement process. The resulting rototranslation is applied at every subsequent iteration to align the model. While initial AF2 predictions are less than perfect, previous observations indicate that they are usually sufficiently accurate for robust placement in the data^14,15,29,54. After iterative refinement through OpenFold is complete, a final local structure refinement is performed using phenix.refine¹⁶ to optimize local geometry and atomic displacement parameters. We investigate the impact of ablations of the ROCKET pipeline in Supplementary Fig. 5. Because of memory limitations, ROCKET currently only operates on one protein chain at a time and requires special handling for cases involving multiple chains (Methods).

Evaluation dataset and approach

We took a two-pronged approach to evaluating ROCKET’s effectiveness in guiding structure prediction with experimental data. In the first prong, we validated ROCKET’s model-building accuracy across a range of resolutions. We started by confirming that ROCKET could match the best established methods for high-resolution X-ray datasets. For such cases, ROCKET’s primary utility would be in streamlining experimental model building through integration with ML-based structure prediction. We then generated two reduced-resolution cryo-EM series—by either omitting or degrading experimental data—that allowed us to assess and develop confidence in ROCKET’s model building at progressively lower resolutions. In the second and more ambitious prong, we turned to scenarios at the frontier of experimental structural biology, where automated methods typically fail, cases in which manual human intervention is necessary and may not even be sufficient.

For the first prong, we started by identifying a diverse set of 27 high-resolution, single-chain X-ray crystallographic datasets and their corresponding deposited structures (Supplementary Fig. 2). All 27 structures were solved after the AF2 training date cutoff and at resolutions better than 3 Å (Supplementary Table 1), a regime where conventional methods, such as phenix.refine, perform well and hybrid methods, such as PredictAndBuild, perform extremely well²⁹. To then study ROCKET’s performance when large-scale structural changes are required from the starting AF2 prediction, we selected ligand-induced loop rearrangements in three human proteins: c-Abl kinase (PDB 3PYY), protein tyrosine phosphatase 1B (PDB 1NWL) and the serpin plasminogen activator inhibitor-1 (PDB 1JL5). At high resolution, we prioritized crystallography because, as noted earlier, its integration with ML-based methods remains underdeveloped³⁴.

Anticipating that, for cryo-EM, ROCKET’s added value would manifest at lower resolutions, we constructed reduced-resolution series where the accuracy of ROCKET models built from progressively weaker experimental signals could be objectively evaluated against a higher-resolution map that serves as the ground truth. As a first example, we chose the thiamine transporter SLC19A3 structure solved from a single-particle 3.1-Å map in the outward-open state (PDB 8S4U, EMD-19716)⁵⁵. We added noise separately to each high-resolution half-map until their Fourier shell correlation (FSC) curve matched that expected of lower-resolution maps at 6.0 Å, 8.0 Å and 10 Å (Methods). As a second example, we picked a time-resolved single-particle 20-s intermediate of the GroEL:GroES–ATP complex, solved at 2.7 Å (PDB 8BM1, EMD-16117)⁵⁶. We reprocessed progressively smaller subsets of the particle images to produce maps at resolutions of 2.9 Å, 4.9 Å and 6.8 Å. For these two series, we then tested how well ROCKET (which, in both cases, starts from an AF2 prediction that is substantially different in conformation) recovers the experimentally captured state from maps progressively decreasing in quality.

For the second prong, we tackled the following challenging model-building case studies: a 9.60-Å subtomogram average of Escherichia coli GroEL (PDB 8P4P), a 3.82-Å, crystallographic dataset of the human multidomain protease inhibitor HAI1 (PDB 5H7V), a 3.5-Å cryo-EM dataset of 2:2 complex of human protein phosphatase PPM1H and its substrate Rab8a, complicated by preferred particle orientation, and a time-resolved extrapolated dataset of class II photolyase bound to a thymine dimer, captured 10 μs after initiation of DNA repair (PDB 8OYA). Lastly, we highlight a case where ROCKET provided biological insight by supporting multimeric model building into a single-particle 8.6-Å map and where the resulting ROCKET structure was later validated against a higher-resolution 4.6-Å dataset.

ROCKET reliably models structural details across a wide resolution range

At high resolution, we expect deposited backbone and side-chain coordinates to serve as reliable ground truths for evaluation. To validate ROCKET’s performance on the 27 crystallographic structures in our first prong, we compared its refined models to those from the PDB-REDO database⁵⁷, which systematically rerefines and validates human-deposited coordinates. Figure 2a shows the Cα root-mean-squared deviation (r.m.s.d.) with respect to PDB-REDO models for original AF2 predictions and ROCKET-refined ones. ROCKET improves all AF2 predictions except one, bringing them closer to the experimentally determined structures. Focusing on the ten most difficult cases, where AF2 predictions deviated by more than 1 Å r.m.s.d. from the PDB-REDO models, ROCKET achieves substantial structural corrections (Fig. 2a, stars; mean r.m.s.d. drop of 0.47 Å), demonstrating robustness in challenging scenarios.

Although Cα r.m.s.d. is a convenient measure of overall model quality, it heavily penalizes complete models that retain less ordered regions when compared to reference models that omit those regions. For a more sensitive assessment, we used the real-space Pearson correlation coefficient (RSCC), which directly compares model-derived electron density maps to maps from experimental amplitudes (here, combined with phases from PDB-REDO models). As shown in Fig. 2a (middle), ROCKET substantially improves RSCC values for both backbones and side chains across all test cases. It is notable that optimizing MSA cluster profiles not only corrects secondary structures but also improves side-chain fit to the data (Supplementary Fig. 3). On this benchmark, ROCKET achieves RSCC accuracy comparable to human-deposited models and performs on par with PredictAndBuild and with ModelCraft, an advanced de novo automated model-building pipeline for crystallography and cryo-EM datasets⁵⁸ (Supplementary Fig. 4a). In further validation using crystallographic R_free factors⁵⁹ (Supplementary Fig. 4b), all but two models see improvements relative to phenix.refine alone, with ten models showing an R_free reduction of more than 3%. We note that ROCKET achieves performance on R_free comparable to both ModelCraft and to the more complex combination of PredictAndBuild + phenix.refine, while still maintaining, unlike the other two methods, full model sequence completeness (Supplementary Fig. 4c).

We found that performing optimization in the latent MSA profile embedding allows for more pronounced structural rearrangements than conventional refinement in Cartesian coordinate space; compared to phenix.refine, ROCKET is able to better recover experimental fold and backbone from AF2 predictions with larger initial r.m.s.d. to the deposited structure (Supplementary Fig. 5). For the high-resolution benchmark, internal data-driven iterative updates to AF2’s prediction (ROCKET) are comparable to external rebuilding iterations (PredictAndBuild) (Supplementary Fig. 6a). ROCKET, however, can perform challenging large-scale structural rearrangements, such as ligand-induced loop movements and peptide flips, that are not accessible to PredictAndBuild (Supplementary Fig. 6b–e). We highlight that these two strategies can complement each other (Supplementary Fig. 6f). Additionally, in cases where the initial prediction requires a large structural change, subsampling MSAs to generate alternative MSA profile embeddings can improve initial predictions and help overcome limitations of gradient descent (Supplementary Figs. 7 and 8).

After benchmarking ROCKET at high resolution, we evaluated its model-rebuilding performance at progressively lower resolutions. For the SLC19A3 thiamine transporter series, ROCKET recovers the outward-open conformation from an inward-open AF2 starting model even at 10 Å. The overall fold is recovered at all three target resolutions (6 Å, 8 Å and 10 Å), and is supported by the largely improved RSCC values for the final models (Fig. 2b). After full ROCKET refinement, we computed FSC curves between maps calculated from the refined models and the independent, original 2.9-Å map that was not used for refinement. In each case, the refined models agree with the high-resolution data well beyond the nominal resolution of the target maps; models refined against 6-Å and 8-Å data maintain FSC > 0.5 up to 3.56 Å and 3.66 Å, respectively. At 10 Å, ambiguity in rigid-body placement within the map reduces agreement relative to the other two cases. Nevertheless, the 10-Å model retains FSC > 0.5 up to 7.14 Å. These results indicate that ROCKET can capture high-resolution detail absent from the guiding maps, showing that the AF2 structural prior provides information complementary to the experimental signal.

We next examine the time-resolved GroEL:GroES–ATP series, whose results are summarized in Fig. 2c. We refine the GroEL subunit from the upper heptameric ring, where, compared to the starting AF2 prediction, the top domain needs to undergo a large rearrangement to complex with the respective GroES subunit. At 2.9 Å and 4.9 Å, ROCKET recovers the complexed conformation, improving RSCC against the deposited 2.7-Å map from 0.44 (AF2) to 0.72 (2.9-Å-refined ROCKET) and 0.64 (4.9-Å-refined ROCKET). This task is more challenging than the thiamine transporter above because the required torsion of the top domain weakens the gradient-based signal. The limitation becomes evident at 6.8 Å, where ROCKET improves placement of the lower domain but does not recover the full conformational change. Although the correct conformation would achieve a higher experimental log-likelihood gain (LLG; 1,583 versus 1,448 for the ROCKET model), gradient-descent refinement remains trapped in a local minimum.

Data-driven MSA profile optimization improves AF model confidence

Intriguingly, we noticed that, across our crystallographic datasets, ROCKET’s AF2-derived model confidence (measured as per-residue predicted local distance difference test (pLDDT)) is correlated with agreement with experimental data, as reflected in the positive correlation between final model confidence and RSCC (Supplementary Fig. 9). As MSAs may implicitly encode multiple conformations that a protein can adopt³⁵, we suggest that ROCKET’s data-guided optimization uses and then resolves this structural ambiguity, allowing it to reach different functional conformations. To more directly test this hypothesis, we performed a (negative) control experiment. AF2 includes built-in confidence metrics, which several studies have leveraged to explore conformational space, particularly in the context of protein design^36,60,61,62; we asked whether optimizing MSA cluster profiles with respect to AF2’s own confidence metrics may improve structure prediction without the need for experimental data. To implement this idea, we used ROCKET to optimize pLDDT, AF2’s primary confidence metric. We found that this approach fails to improve the correspondence of AF2 predictions with experiment in every one of the cases studied (Supplementary Fig. 10), indicating that experimental data provide new and orthogonal information, beyond AF2’s implicit scoring function, which is necessary for efficient sampling of functionally relevant conformations. This clarifies but does not contradict our first finding. Specifically, it indicates that pLDDT can identify highly preferred conformations but cannot, by itself, distinguish the experimentally observed state (and provide a useful gradient) without further experimental information.

ROCKET facilitates frontier model-building tasks

Cutting-edge structural biology techniques often generate data at the limits of available methodology, with partially automated workflows bottlenecked by human expertise. Examples include (1) cryo-ET, where missing-wedge artifacts and noise obscure atomic detail; (2) low-resolution crystallography, where conformational heterogeneity from flexible loops and domains blurs density and increases susceptibility to phase bias during refinement; (3) cryo-EM with a preferred orientation of particles, which results in an incomplete sampling of views that compromises the final three-dimensional (3D) reconstruction; or (4) the emerging field of time-resolved crystallography, where signals from short-lived intermediates must be disentangled from mixtures of activated and ground-state populations⁶³. To assess whether ROCKET can address these frontier problems, which remain solidly outside the purview of existing methods, we tackled four representative tasks summarized in Fig. 3.

We begin with two datasets that lie past the resolution limits for automated model building: a 9.60-Å subtomogram average of E. coli GroEL and a 3.82-Å human HAI1 crystallographic dataset, where domain motion prevented high-resolution diffraction (Fig. 3a,b). In the GroEL subtomogram average, the 9.60-Å map reveals two distinct conformations for the repeating subunit in the top and bottom heptameric rings. Despite the low resolution, ROCKET accurately recovers both conformations (Fig. 3a), yielding refined models with map correlations comparable to deposited structures (Supplementary Fig. 11). Moreover, for chain A, ROCKET explores a broad conformational space, deviating substantially from the AF2 prediction. Notably, in the top domain (residues 250–280; star in Supplementary Fig. 11), ROCKET achieves an average RSCC of 0.5, substantially higher than the 0.0 achieved by the AF2 model or even the 0.2 achieved by the deposited model, supporting a closer match to experimental data.

In well-resolved regions of HAI1, ROCKET successfully converges to a backbone conformation that more closely matches the experimental density than the AF2 prediction (Fig. 3b, regions i–iii). Particularly noteworthy is region iii (residues 310–330), where the density is highly noisy, making even manual model building difficult. In this segment, the deposited model appears to have an incorrect sequence register. ROCKET corrects this and improves AF2’s initial prediction to better align with the density map without introducing new geometric outliers. Conversely, in areas where the experimental map is poorly resolved, such as region iv, which is not modeled in the deposited structure, ROCKET refrains from forcing arbitrary changes and instead preserves the original AF2 prediction.

Next, we modeled the 2:2 heterotetrameric complex of the protein phosphatase PPM1H and its substrate Rab8a (Fig. 3c) using ROCKET from a 3.5-Å cryo-EM map affected by preferred orientation (Supplementary Fig. 12). This dimeric assembly represents the physiological state of the enzyme, which creates a more intricate interaction surface than a monomer can provide⁶⁴. AF3 (ref. ⁹) fails to produce a plausible complex (Supplementary Fig. 12), underscoring the need for map-guided modeling to reveal the correct physiological assembly. Compared to a docked AF2 prediction, the ROCKET model achieves a much higher real-space correlation coefficient (CC) with the cryo-EM map (Supplementary Fig. 12). Specifically, ROCKET captures a crucial conformational change in PPM1H, where the highly conserved flap domain (region i, top part, residues 304–414) moves toward Rab8a. This movement induces a change in the active site consistent with previously reported flexibility⁶⁵. Furthermore, ROCKET substantially improves the modeling of the flap domain–Rab8a interface (region ii), which is the primary determinant of substrate specificity⁶⁴. ROCKET also positions the β-motif (region i, bottom part) distant from Rab8a, suggesting that it is not directly involved in this interaction; both findings are consistent with biochemical evidence (Supplementary Fig. 13). We benchmarked our result against the de novo modeling tool ModelAngelo²⁸, as this 3.4-Å dataset falls within its applicable resolution range. Although ModelAngelo improved the model–map CC in the regions it built, its model suffered from low completeness (<20%). In contrast, ROCKET achieved the highest overall CC value while providing a complete model of the complex.

For our final frontier task, we used ROCKET to refine a time-resolved extrapolated dataset of class II photolyase bound to a thymine dimer (Fig. 3d), collected 10 μs after initiation of DNA repair (PDB 8OYA). Although the dataset has a nominal resolution of 2.10 Å, the effective resolution of the extrapolated data is estimated around 2.50–2.80 Å (Supplementary Table 1)⁶⁶. The crystals contain two independent protein–DNA complexes; chain A exhibits stronger time-resolved signal and consistently lower B factors (by 10–20 Å² in some regions) than chain B, likely because of crystal packing. We report ROCKET refinements for both chain A and the more challenging chain B. In both cases, ROCKET markedly improves model–map agreement (Fig. 3d). Per-residue RSCC values comparing AF2, ROCKET and deposited structures are plotted in Supplementary Fig. 14. In the helical segment spanning residues 200–214, ROCKET correctly rebuilds the secondary-structure element (region i), with remaining differences relative to the deposited model confined to solvent-exposed side chains. At the DNA-binding interface, ROCKET robustly corrects side-chain-level details (region ii), guiding AF2 toward the DNA-bound conformation. This happens without providing any explicit information about DNA during inference and is purely driven by protein-specific data. ROCKET also rebuilds a solvent-exposed segment of chain B where the density is markedly poorer than in chain A (Fig. 3d, region iii), recovering side-chain rotamers and a backbone shift that, where supported by density, match the depositing group’s choices. These results collectively highlight ROCKET’s utility for structural interpretation from heterogeneous or weak signals.

ROCKET accurately models the structure of a homomeric protein filament

Beyond the four frontier challenges we just described, we sought to apply ROCKET to an active project involving multimeric model building from a low-resolution cryo-EM map. Subsequent to our model building, we obtained a higher-resolution dataset that validated ROCKET’s structure.

Vertebrate reproduction depends on the zona pellucida (ZP), a specialized extracellular coat that surrounds the egg and, from amphibians to mammals, mediates sperm attachment, penetration and the block to polyspermy⁶⁷. The ZP is a 3D mesh of filaments assembled by glycoprotein subunits that share a ZP module, a polymerization element consisting of ZP-N and ZP-C domains separated by an interdomain linker (IDL)⁶⁸. Recently, the structure of a heteromeric egg coat filament was obtained from native ZP fragments that crystallized by reassembling into polymers⁶⁹. However, the paucity, highly heterogenous nature and often covalent crosslinking of ZP filaments has made it challenging to obtain detailed structural information for intact filaments.

We used ROCKET to aid model building into a 8.6-Å helical reconstruction map of ZPD⁷⁰, a noncrosslinked, homopolymeric glycoprotein of the avian egg coat. In Fig. 4a, we illustrate the docked AF2 prediction, in which a ZP module (chain A) engages with the ZP-C and ZP-N domains of the preceding and following subunits within the filament, respectively (chains B and C), as observed in the fish ZP heteropolymer⁶⁹. For visual clarity, we display a map postprocessed using EMReady2 (ref. ⁷¹), although we emphasize that ROCKET refinement was carried out against unmodified half-map data. In Fig. 4b, we compare the ROCKET-built model (purple) to a model whose initial coordinates also came from AF2 but that was independently built without ROCKET on the basis of the same low-resolution map (Methods). Importantly, our validation can also take advantage of a higher-resolution 4.6-Å map of ZPD obtained by single-particle analysis of a later dataset and a corresponding model also refined without ROCKET (light blue model and map). ROCKET’s model, refined solely against the 8.6-Å map, achieves a CC of 0.67 to the higher-resolution map compared to only 0.29 for the unrefined AF2 prediction. Beyond this overall agreement, ROCKET captures key architectural features that differ from the independently built 8.6-Å model but are validated by the higher-resolution dataset. These include (1) a much more accurately modeled ZP-N fg loop interacting with a complementary surface of the partner ZP-C; (2) conformational details in the IDL that stabilize subunit interactions within the filament; and (3) better-defined secondary structure in regions where the low-resolution model was ambiguous. Underscoring the importance of the full pipeline and consistent with the ablation study in Supplementary Fig. 5f, we show a broken disulfide from the unrelaxed ROCKET model prediction, which was resolved by a conventional refinement polishing step (inset iv).

**Fig. 4: ROCKET enables model building of a ZPD filament from low-resolution cryo-EM.**

This validation against independently collected higher-resolution data provides a strong demonstration of ROCKET’s ability to recover accurate atomic information, even under the challenging conditions of low-resolution cryo-EM. Biologically, the structure of native ZPD provides important evidence that all egg coat protein filaments share a common general organization, with the same interlocked subunit architecture first observed for urinary ZP module protein uromodulin⁷² but with helical twist parameters around −120° (ref. ⁶⁹).

Discussion

In developing ROCKET, we demonstrate that experimentally guided refinement of the embedded MSA profiles of AF2 enables efficient exploration of conformational space. Our results suggest that these embeddings provide access to paths along which the barrier for structural rearrangements is greatly reduced or eliminated, indicating that information about such rearrangements may be encoded in evolutionary statistics. Furthermore, we show examples where leveraging AF2 priors in combination with experimental constraints extends atomic model building into increasingly complex and dynamic structural datasets.

We anticipate that data-guided, inference-time optimization will prove broadly valuable across diverse atomic model-building scenarios. Beyond cryo-EM and crystallographic datasets, implemented here, our approach can be used for other, sparser data modalities, provided a likelihood-based target between data and prediction can be formulated. Extending ROCKET to handle multichain complexes or protein–ligand cofolding is straightforward within any AF-like framework, in principle requiring only a switch in the inference model used.

More ambitious directions may involve integrating generative models to account for conformational ensembles. Furthermore, given our success with MSA profile biasing, we propose that learning a mapping from experimental observables to a profile bias matrix could effectively condition single-shot structure prediction toward experimentally probed conformations. Such a mapping would translate experimental data, such as cryo-EM density maps, X-ray diffraction intensities or nuclear-magnetic-resonance-derived constraints, into a profile bias matrix that captures residue-level probabilities or pairwise constraints derived from experiment, effectively guiding the model toward relevant conformations without requiring exhaustive searches. By adopting amortized inference, which learns a reusable mapping to enable fast predictions for new inputs, we could further streamline the process by replacing the stepwise search (MSA subsampling and likelihood scoring; Supplementary Fig. 7) with a more efficient learned transformation.

To understand how ROCKET alters its coevolutionary representation during optimization, we examined the signals within the refined MSA profile that contribute to a specific structural transition: the prototypical conformational change of the activation loop of c-Abl kinase between its inactive (observed here) and active conformation. ROCKET’s prediction confidence increased notably during refinement (Fig. 5a), consistent with the hypothesis that ROCKET’s optimized MSA profile disambiguates multistate coevolutionary signals into a clear structural directive. To identify the coevolutionary features most impacted by ROCKET’s MSA profile optimization, we computed residue–residue correlations, in the form of mutual information (MI) matrices, from the initial and the final MSA profiles. The correlations in the final MSA profile are more dispersed than in the original (Fig. 5b). To understand the causal role of changes in the MSA profile, we first ranked residue pairs on the basis of the magnitude of change in their correlations implied by the MSA profile (by ∥ΔMI∥) and then, starting from the final ROCKET MSA profile, progressively muted the optimized signals at these high-ranking residue pairs by replacing them with their corresponding values from the untransformed MSA profile. This MI-guided muting strategy proved much more effective at reverting the structure back toward its incorrect starting conformation, inducing a flip with ~95 muted residues, relative to a random sampling baseline that required ~170 residues (Supplementary Fig. 5c). In contrast, muting residues on the basis of their rank by simpler statistics such as the position-specific scoring matrix entropy change or average absolute MI change did not outperform random selection (Supplementary Fig. 15a), indicating that changes to specific pairwise correlations drive ROCKET’s performance. Further analysis of the top-ranked pairs revealed that the signal is not uniformly distributed but is instead concentrated; the top 155 pairs in the ∥ΔMI∥ ranking involve fewer unique residues than 155 randomly selected pairs (Supplementary Fig 15b). As shown in Fig. 5d, the involved residues cluster at the interface ‘below’ the activation loop and at contact points for the active conformation of the loop (superposed from PDB 2GQG), while residues at the ‘back’ of the kinase are hardly affected. Nevertheless, nearly one third of the entire protein is involved in some of the residue–residue correlations that need to be muted to undo the activation loop’s conformational switch, reinforcing the challenge of engineering such transformations manually and motivating the development of learned conditioning approaches.

**Fig. 5: MI highlights distributed residue signals driving conformational transitions.**

While additional work is needed to fully realize these extensions, our study marks critical progress by showing that structure optimization in coevolutionary embeddings can overcome limitations of conventional refinement. Additionally, by introducing differentiable likelihood targets for cryo-EM and crystallography, which include a robust treatment of measurement error, we provide a framework that is well suited for training future predictive models. More broadly, our Bayesian approach to combining information learned by ML models with information obtained by direct observation establishes a foundation for continuous interaction between ML and experimental data. This interplay is critical for the interconnected goals of scaling experimental throughput and training ML models with enhanced functionality.

Our atomic model building is automated, requires no retraining and, for difficult cases, produces models of quality comparable or even superior to those created by expert human modelers. Despite this advance, certain limitations remain. First, it is always advisable to inspect refined models visually and to validate critical aspects of the model by complementary data when the experimental map is of low quality. For crystallographic cases, the current approximation of the atomic displacement parameters is derived empirically from model confidence and could be improved by incorporating considerations of density fit at the residue level. Additionally, because OpenFold is not explicitly aware of crystal contacts, ROCKET may struggle to converge to certain lattice-constrained conformations. We expect that this limitation can be addressed by extending ROCKET to use multichain models⁷³. We also noticed that ROCKET can fail to flip small loops (3–4 residues in length) that contain bulky side chains. We show examples of cases that are difficult for ROCKET in Supplementary Fig. 16. Perhaps most importantly, iterative backpropagation through OpenFold is memory intensive and limits the maximum size of the protein or domain that can be refined at once—about 500 residues on a 40-GB A100 GPU (Supplementary Fig. 17). We believe that this can be extended with further code optimization and the implementation of a learned mapping for single-shot MSA profile bias, mentioned above, could also help overcome this limit.

Naturally, assessing biological accuracy solely through the lens of how well the atomic model fits experimental maps has inherent limitations, as both cryo-EM and crystallography can introduce artifacts⁷⁴. Ultimately, multimodal information is essential for building a full picture of physiological protein states and functions. For this purpose, our approach is readily adaptable to alternative loss functions that combine multiple sources of experimental and computational data, supporting integrative modeling strategies for biological structure determination.

Methods

ROCKET algorithm and processing pipeline

ROCKET’s inputs and preprocessing steps are summarized in Algorithm 1, while the ROCKET inference-time optimization algorithm is summarized in Algorithm 2.

Inputs and preprocessing pipeline

For crystallographic datasets, a protein sequence and a reflection MTZ file (or CIF file) containing observed intensities and their uncertainties are required; for cryo-EM datasets, two half-maps are required. As outlined in Algorithm 1, to obtain an aligned reference model (x^ref), we use Phasertng⁷⁵ for molecular replacement with crystallographic datasets and a likelihood-based docking tool, EM_placement⁷⁶, with cryo-EM datasets. These determine the correct pose from an initial, unconditional AF2 model. These tools also estimate experimental data parameters for refinement, such as E_e and D_obs, which represent observed normalized amplitudes and an accounting for measurement error, respectively⁷⁷, and are described further below.

Algorithm 1: ROCKET Preprocessing

function PREPROCESSING (ProteinSeq, ReflectionMTZ, HalfMaps)

${{\bf{x}}}^{{\rm{initial}}},{{\rm{pLDDT}}}^{{\rm{ref}}},{{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}}=$ OpenFold_Inference(ProteinSeq)

pseudo B^ref = pLDDT_to_pseudo B(pLDDT^ref)

if ReflectionMTZ then

MR_solution, x^ref = Phasertng_MR(x^initial, ReflectionMTZ)

E_e, D_obs = Phasertng_Preprocess(ReflectionMTZ)

return ${{\bf{x}}}^{\mathrm{ref}},{E}_{e},{D}_{obs},\mathrm{pseudo}\,{B}^{\mathrm{ref}},{{\bf{m}}}_{0}^{\mathrm{cluster}\_\mathrm{profile}},$$\,\mathrm{Type}:\mathrm{Crystallographic}$

else if HalfMaps then

x^ref = CryoEM_Dock(x^initial, HalfMaps)

E_e, D_obs = CryoEM_ParameterEstimation(HalfMaps)

return ${{\bf{x}}}^{\mathrm{ref}},{E}_{{\rm{e}}},{D}_{\mathrm{obs}},{\mathrm{pseudo}}\,{B}^{\mathrm{ref}},{{\bf{m}}}_{0}^{\mathrm{cluster}\_{\mathrm{profile}}},\,{\mathrm{{Type}}\; :\; {\mathrm{Cryo}}\; -\; {\mathrm{EM}}}$

else

return Error: Missing required data (ReflectionMTZ or HalfMaps)

end if

end function

Refinement algorithm

ROCKET optimizes a linear bias (with scales w and offsets b) that it applies to the starting MSA cluster profile (${{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}}$) to maximize agreement between an OpenFold prediction (x^prediction) and experimental data (E_e). This agreement is quantified by a data LLG target, ${{\mathcal{L}}}_{\mathrm{LLG}}$, which is combined with an optional positional restraint, ${{\mathcal{L}}}_{L2}$, to yield an overall objective function ${\mathcal{L}}$, described below. The shapes of the w and b tensors match that of the MSA cluster profile tensor (number of MSA clusters × number of residues × 23)¹.

Algorithm 2: ROCKET Refinement

function REFINEMENT (E_e, D_obs, N_iter, x^ref, pseudo B^ref, ${{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}}$, lr_mul, lr_add, ω_L2)

${\bf{w}}={\rm{ones}}\_{\rm{like}}({{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}})$ ⊳ Initialize multiplicative bias

${\bf{b}}={\rm{zeros}}\_{\rm{like}}({{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}})$ ⊳ Initialize additive bias

optimizer = adam({w: lr_mul, b: lr_add})

for iter = 1…N_iter do

${{\bf{m}}}^{{\rm{cluster}}\_{\rm{profile}}}={\bf{w}}\odot {{\bf{m}}}_{0}^{{\rm{cluster}}\_{\rm{profile}}}+{\bf{b}}$

x^prediction, pLDDT = OpenFold_Inference(m^{cluster_profile})

pseudo B = pLDDT_to_pseudo B(pLDDT)

x^array = Weighted_Kabsch(x^prediction, x^ref, pseudo B^ref)

x^RBR = Rigid_Body_Refinement(x^array, pseudo B, E_e, D_obs)

${{\mathcal{L}}}_{\mathrm{LLG}}=\,\mathrm{LLG}\,({{\bf{x}}}^{\mathrm{RBR}},\,\mathrm{pseudo}\,B\,,{E}_{{\rm{e}}},{D}_{\mathrm{obs}})$ ⊳ LLG Targets

${{\mathcal{L}}}_{\mathrm{L2}}=\,\mathrm{Weighted}\_\mathrm{L2}\,({{\bf{x}}}^{\mathrm{RBR}},{{\bf{x}}}^{\mathrm{ref}},\mathrm{pseudo}\,{B}^{\mathrm{ref}})$ ⊳ Weighted L2 loss

${\mathcal{L}}={{\mathcal{L}}}_{\mathrm{LLG}}+{\omega }_{\mathrm{L2}}\cdot {{\mathcal{L}}}_{\mathrm{L2}}$

${\mathcal{L}}.\,\mathrm{backward()}$

optimizer.step()

end for

return x^RBR, w, b

end function

During each ROCKET iteration, we apply a linear bias to the initial cluster profile matrix and then run OpenFold inference to obtain a new prediction, along with its pLDDT confidence values. For crystallography, our current implementation estimates atomic B factors at every iteration from pLDDT confidence scores using a previously established heuristic², without explicitly refining them. Specifically, we convert pLDDT values into equivalent r.m.s.d. values, using the empirical relation from Baek et al.², and then to corresponding pseudo B factors:

$$\,\mathrm{pseudo}\,B\,=\frac{8{\pi }^{2}}{3}{\left[1.5\times {e}^{4\times (0.7-0.01\times \mathrm{pLDDT})}\right]}^{2}$$

(1)

At every iteration, we align the newly predicted model to the reference model using a weighted Kabsch alignment (with weights determined using the pseudo B factors as described in Eq. 16), followed by rigid-body refinement, elaborated later. We then compute the experimental LLG using the aligned coordinates and the pseudo B factors.

In practice, ROCKET runs in two phases: an ‘adventurous’ phase 1 and a ‘fine-tuning’ phase 2. We apply different learning rates for the multiplicative (w) and additive (b) components of our MSA profile bias and these rates vary across phases. In phase 1, we use higher learning rates (by default, lr_mul = 1.0 and lr_add = 0.05) and a default low-resolution cutoff of 3 Å. We also incorporate a weighted Cα mean squared distance between the reference model and latest prediction as an L₂ regularization term that quantifies initial model confidence. The weights are computed with the same scheme used for the weighted Kabsch alignment, discussed further below.

$${{\mathcal{L}}}_{L2}=w(\,\mathrm{pseudo}\,B)\cdot {\left({{\bf{x}}}^{\mathrm{array}}-{{\bf{x}}}^{\mathrm{ref}}\right)}^{2}$$

(2)

with the confidence-based weights w(pseudo B) defined in Eq. 16 and the final loss for backpropagation:

$${\mathcal{L}}={{\mathcal{L}}}_{\mathrm{LLG}}+{\omega }_{L2}{{\mathcal{L}}}_{L2}$$

(3)

By default, the L₂ loss weight is ω_L2 = 10⁻¹¹. We run phase 1 for three independent traces, each consisting of 100 iterations, and select the model with the best LLG score to proceed to phase 2. The aim of phase 2 is to further fine-tune the structure. By default, we set phase 2 to run for 500 iterations with both lr_mul and lr_add to 10⁻³ and remove the L₂ loss term (ω_L2 = 0). An early stop occurs if the LLG score does not improve by more than 0.1 for 50 consecutive iterations. Furthermore, Supplementary Fig. 19 illustrates the efficacy of phase 1 for avoiding local optima through the example of the c-Abl kinase crystallographic dataset (PDB 3PYY).

The pseudo B approximation can limit accuracy by not capturing finer structural details. Moreover, geometric validation indicates that outputs from the iterative optimization have more bond outliers and steric clashes than stricter refinement protocols typically allow (Supplementary Fig. 5f). To address these limitations, we append a short standard local refinement step using phenix.refine¹⁶ after iterative OpenFold inference. Analogous to AMBER relaxation in standard AF2 pipelines¹, this step further optimizes geometry (Supplementary Fig. 5) and displacement parameters, polishing the final model’s overall quality.

Implementation

ROCKET is implemented in PyTorch 1.12.1 as an extension of the OpenFold system. It currently uses the monomer version of OpenFold with AF2 model_1 weights to maintain consistency with AF2’s data splits. This allows for prediction and refinement of crystallographic datasets containing a single chain in the asymmetric unit or one domain (at a time) in a cryo-EM complex.

Crystallographic LLG targets

For crystallographic datasets, we use the LLG on intensity (LLGI) target introduced in a previous study⁷⁷, which can be expressed as follows, for acentric and centric reflections:

$$\begin{array}{l}{{\rm{LLGI}}}_{{\rm{a}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})={\rm{ln}}\,[\frac{{p}_{{\rm{a}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})}{{p}_{{\rm{a}}}({E}_{{\rm{e}}})}]\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,={\rm{ln}}\,[{p}_{{\rm{a}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})]-{\rm{ln}}\,[{p}_{{\rm{a}}}({E}_{{\rm{e}}})]\end{array}$$

(4)

$$\begin{array}{l}{{\rm{LLGI}}}_{{\rm{c}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})={\rm{ln}}\,[\frac{{p}_{{\rm{c}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})}{{p}_{{\rm{c}}}({E}_{{\rm{e}}})}]\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,={\rm{ln}}\,[{p}_{{\rm{c}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})]-{\rm{ln}}\,[{p}_{{\rm{c}}}({E}_{{\rm{e}}})]\end{array}$$

(5)

with ${p}_{{\rm{a}}}\left({E}_{{\rm{e}}}\right)=2{E}_{{\rm{e}}}\exp \left(-{E}_{{\rm{e}}}^{2}\right)$, ${p}_{{\rm{c}}}\left({E}_{{\rm{e}}}\right)={\left(\frac{2}{\pi }\right)}^{1/2}\exp \left(-\frac{{E}_{{\rm{e}}}^{2}}{2}\right)$ and

$$\begin{array}{l}{p}_{{\rm{a}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})=\frac{2{E}_{{\rm{e}}}}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2}}\exp \left[-\frac{{E}_{{\rm{e}}}^{2}+{({D}_{\mathrm{obs}}{\sigma }_{{\rm{A}}}{E}_{{\rm{C}}})}^{2}}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2}}\right]\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\times {I}_{0}\left(\frac{2{D}_{\mathrm{obs}}{\sigma }_{{\rm{A}}}{E}_{{\rm{e}}}{E}_{{\rm{C}}}}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2}}\right)\end{array}$$

(6)

$$\begin{array}{l}{p}_{{\rm{c}}}({E}_{{\rm{e}}};{E}_{{\rm{C}}})={\left[\frac{2}{\pi (1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2})}\right]}^{1/2}\exp \left[-\frac{{E}_{{\rm{e}}}^{2}+{({D}_{\mathrm{obs}}{\sigma }_{{\rm{A}}}{E}_{{\rm{C}}})}^{2}}{2(1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2})}\right]\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\times \cosh \left(\frac{{D}_{\mathrm{obs}}{\sigma }_{{\rm{A}}}{E}_{{\rm{e}}}{E}_{{\rm{C}}}}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{{\rm{A}}}^{2}}\right)\end{array}$$

(7)

where p(x; y) denotes the conditional probability of x given y.

As defined previously⁷⁷, E_e is the ‘effective’ observed normalized amplitude, E_C is the normalized structure factor amplitude calculated from the predicted model in a differentiable manner using SFCalculator with solvent correction³⁴, D_obs encodes the reduction in correlation between true and effective normalized structure factors arising from experimental error and σ_A is a resolution-dependent factor that encodes the reduction in correlation between the true and calculated normalized structure factors arising from model error.

We refine σ_A in resolution bins⁷⁸ at every ROCKET iteration using the Newton–Raphson optimization method⁷⁹. The LLG of the observed effective amplitudes, E_e, given the calculated amplitudes, E_C, is maximized by refining σ_A. The derivative of the LLG with respect to σ_A for each resolution bin is given by

$$\frac{\partial \mathrm{LLG}}{\partial {\sigma }_{{\rm{A}}}}=\mathop{\sum }\limits_{i}\frac{\partial \,{\text{ln}}\,p\left({E}_{{\rm{e}},i};{E}_{{\rm{C}},i},{\sigma }_{{\rm{A}}}\right)}{\partial {\sigma }_{{\rm{A}}}}$$

(8)

where i is an index over observations. The second derivative is given by

$$\frac{{\partial }^{2}\,\mathrm{LLG}}{\partial {\sigma }_{{\rm{A}}}^{2}}=\mathop{\sum }\limits_{i}\frac{{\partial }^{2}\,{\text{ln}}\,p\left({E}_{{\rm{e}},i};{E}_{{\rm{C}},i},{\sigma }_{{\rm{A}}}\right)}{\partial {\sigma }_{{\rm{A}}}^{2}}$$

(9)

We obtain the updated value of σ_A using a Newton step:

$${\sigma }_{{\rm{A}}}^{\mathrm{new}}={\sigma }_{{\rm{A}}}- \left( {\frac{{\partial }^{2}{{\mathcal{L}}}_{\mathrm{LLG}}}{\partial {\sigma }_{{\rm{A}}}^{2}}}\right)^{-1}\frac{\partial {{\mathcal{L}}}_{\mathrm{LLG}}}{\partial {\sigma }_{{\rm{A}}}}$$

(10)

The value of σ_A is constrained within [0.015, 0.99] to maintain physical relevance and stability during refinement. For all LLG and σ_A estimates we use the working set of reflections (we find that using the working set for σ_A refinement does not lead to any meaningful overfitting with ROCKET; Supplementary Fig. 18). We keep a test set of reflections for final R_free calculation after conventional refinement.

LLG target and noise modeling for cryo-EM

For cryo-EM data, we follow the method outlined previously⁷⁶ to dock the initial prediction into the experimental map. ROCKET works with a sphere surrounding the model, with contributions from any other fixed components within that sphere being accounted for. We model the signal and noise in Fourier space to account for directional and resolution-dependent variations in data quality. The signal is derived from correlations between Fourier terms of the experimental half-maps, while the noise is deduced from their differences⁵³.

For a single Fourier term, the LLG is given by

$$\begin{array}{l}{\mathrm{LLG}}_{\mathrm{cryo}}=\frac{2\cdot {D}_{\mathrm{obs}}\cdot {\sigma }_{A}\cdot {E}_{{\rm{e}}}\cdot {E}_{\mathrm{calc}}\cdot \cos (\Delta \phi )}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{A}^{2}}\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,-\frac{{D}_{\mathrm{obs}}^{2}{\sigma }_{A}^{2}({E}_{{\rm{e}}}^{2}+{E}_{\mathrm{calc}}^{2})}{1-{D}_{\mathrm{obs}}^{2}{\sigma }_{A}^{2}}-\text{ln}\,\left(1-{D}_{\mathrm{obs}}^{2}{\sigma }_{A}^{2}\right)\end{array}$$

(11)

where E_e and E_calc are the normalized amplitudes of the observed and calculated Fourier terms, = ϕ_calc − ϕ_obs is the phase difference between these Fourier terms and D_obs and σ_A are analogous to their crystallographic counterparts.

We compute σ_A for each resolution bin as described previously⁵³, using observed normalized amplitudes (E_e), calculated normalized amplitudes (E_calc) and the phase difference Δϕ = ϕ_calc − ϕ_obs.

For the cryo-EM cases, which are not affected by phase bias, we update residue-level B factors at each iteration using a conversion that is informed by local RSCC to the experimental map.

Cryo-EM map resolution degradation

For testing, it is invaluable to have low-resolution cryo-EM maps for which the ground truth is known from a corresponding higher-resolution map. Simple Fourier filtering can remove high-resolution information but the resulting data are much higher in quality at the new resolution limit than one would encounter with real data.

One approach, used for instance in testing the Q-score algorithm⁸⁰, is to make reconstructions with progressively fewer particle images. This approach was used to generate lower-resolution versions of the GroEL:GroES–ATP complex, using cryoSPARC⁸¹ to reprocess data obtained from EMPIAR-11481. Subsets of particles used in the original reconstruction were selected randomly: a reconstruction with C₇ symmetry using 45,174 particles yielded a map at 2.9-Å resolution, while maps at the lower resolutions of 4.9 Å and 6.8 Å were produced using 310 and 247 particles, respectively.

A second approach, in which independent complex random noise is added to the half-map Fourier terms, is much faster and allows finer control of resolution limits. Implementing this approach required first defining targets for the FSC curves that would be expected at different resolution limits. This was achieved by devising a functional form for an equation that could fit a wide variety of FSC curves in the EM Data Bank. The underlying idea for the functional form is that the variation of FSC with resolution is controlled by the relative size of the signal and noise powers at zero scattering angle and the difference in how those powers fall off with resolution.

$${\rm{FSC}}=\frac{r}{r+\exp (\Delta B| s{| }^{2}/4)}$$

(12)

where r is the ratio between the signal and noise powers at s = 0 and ΔB is the difference in the falloff B factors for signal and noise.

FSC curves were downloaded for all the EM Data Bank entries where half-map data and author-supplied FSC curves were deposited in 2024. These were ordered by nominal resolution and every tenth entry was taken, yielding 458 cases. Good fits were obtained for most FSC curves by optimizing the values of r and ΔB; the best-fit resolution (s_max = 1/d_min) was defined as the point where the fitted FSC curve was equal to 0.143. Then functional forms for r and ΔB in terms of s_max were fitted.

$$r=34.4680\,\exp (3.00533{s}_{\max }+4.27895{s}_{\max }^{2})$$

(13)

$$\Delta B=17.1158+12.0213/{s}_{\max }+21.3225/{s}_{\max }^{2}$$

(14)

To match the target FSC curve in a particular case, the Fourier terms are divided into resolution shells. In each shell, the signal and noise powers are determined and then the amount of noise required to match the desired FSC is computed and added to the half-map Fourier terms. Finally, the half-maps are computed using only the Fourier terms to s_max.

Weighted Kabsch alignment and rigid-body refinement

As stated above, we use the initial prediction from OpenFold to run Phasertng (or EM_placement for cryo-EM) for molecular replacement. For every iteration, we align the OpenFold model to the reference molecular replacement solution before computing the relevant LLG score. This alignment is performed by first solving the following minimization problem with the Kabsch algorithm⁸²:

$$L(C)=\frac{1}{2}\mathop{\sum }\limits_{i=1}^{n}{w}_{i}{\left\Vert {{\bf{x}}}_{i}^{{\rm{ref}}}-C{{\bf{x}}}_{i}^{{\rm{prediction}}}\right\Vert }^{2}$$

(15)

where C denotes the rotation–translation matrix, x^ref and ${{\bf{x}}}_{i}^{{\rm{prediction}}}$ are corresponding atomic coordinates of atom i in the reference and moving model, respectively, and w_i denotes positional weights. Only Cα atoms are included in the alignment and their weights are determined empirically from the pseudo B values of the reference structure (the first, unconditioned OpenFold prediction). Specifically, for each residue,

$$w(\,\mathrm{pseudo}\,B)=\left\{\begin{array}{ll}1.0 & \mathrm{pseudo}\,B\,\le 11.5\\ 1.0-0.5\times \frac{\,\mathrm{pseudo}\,B\,-11.5}{40.0-11.5} & 40.0\ge \,\mathrm{pseudo}\,B\, > 11.5\\ 0.5\exp (-\sqrt{\,\mathrm{pseudo}\,B\,-40.0}) & \mathrm{pseudo}\,B\, > 40.0\end{array}\right.$$

(16)

Translation vectors are determined by the vector difference of weighted center of mass of Cα atoms in the reference and moving models; then, the rotation matrices are estimated with Kabsch algorithm. Once the alignment is completed, a subsequent rigid-body refinement is performed through gradient optimization of the LLG target:

$${C}^{* }=\arg \mathop{\max }\limits_{C}\mathrm{LLG}({E}_{{\rm{o}}},{\bf{x}},C)$$

(17)

where x represents the model coordinates after Kabsch alignment. When the predicted aligned error matrix from AF2 can be used automatically to split proteins into domains, ROCKET performs domain-specific alignments to make the best use of gradient information from the LLG.

Multichain dataset handling

ROCKET can readily handle monomeric protein predictions in its current form. We also demonstrated refinement of crystallographic datasets that contain two chains in the asymmetric unit for the kinase datasets (PDB 3PYY in Supplementary Fig. 6 and PDB 7DT2 in Supplementary Fig. 16). For these cases, we included the second chain present in the asymmetric unit as a fixed Fourier contribution in the likelihood calculation but excluded it from refinement.

General multichain refinement requires integrating ROCKET with a multimeric model such as OpenFold-Multimer. We present a first demonstration on ZPD (Fig. 4), where chains A–C are predicted and refined simultaneously using OpenFold-Multimer weights at inference time. ZPD is a relatively small multimeric complex, allowing this demonstration within ROCKET’s current memory constraints.

ZPD sample preparation and structure determination

Chicken ZPD was prepared following an established procedure⁷⁰, except that a HEPES buffer (H0887, Sigma) supplemented with protease inhibitors (protease inhibitor cocktail set III, EDTA-free; Calbiochem) was used. On the basis of densitometry using BSA bands as standards (Supplementary Fig. 20a), the yield of ZPD was estimated at ~0.35 mg from ~70 mg wet weight of chicken egg coat. ZPD was notably concentrated in the crude ZPD preparation (reaching ~0.70 mg ml⁻¹), albeit with residual contamination of ZP1 and ZP3 (Supplementary Fig. 20).

Data processing was carried out using cryoSPARC v.4.7.0. Following patch motion and patch contrast transfer function estimation, filaments were initially traced in a subset of 2,000 micrographs. Particles were extracted with a box size of 512 pixels and Fourier-cropped to 128 pixels for two-dimensional (2D) classification, generating initial templates for subsequent template-based filament tracing. After several rounds of tracing using well-defined classes, a total of 1,113,548 particles were extracted and subjected to iterative 2D classification to remove low-quality particles. Ab initio reconstruction was then performed, followed by heterogeneous and homogeneous refinement. To further resolve particle heterogeneity, 3D classification with five classes was conducted using a principal component analysis-based approach. From this, a final dataset of 317,745 unbinned particles was selected. Helical parameters were determined by indexing the power spectrum⁸³ generated from cryoSPARC’s ‘average power spectra’ job and compared to power spectra of potential models using SPIDER⁸⁴. Initial helical twist 126.8° and rise of 69.7-Å values obtained from this analysis were used as input for helical refinement in cryoSPARC, which refined them to a twist of 120.8° and a rise of 71.4 Å. A maximum symmetry order of 3 was applied during reconstruction, resulting in a map with a nominal resolution of 8.6 Å. A single-particle processing strategy was applied to the second, higher-resolution dataset. A total of 2,031,564 particles were extracted with a box size of 512 pixels and Fourier-cropped to 128 pixels. Iterative 2D classification, combined with heterogeneous refinement including multiple noise classes, was performed to clean the dataset. Selection of good classes after 3D classification resulted in a final subset of 498,339 unbinned particles, which were further subjected to homogeneous and nonuniform refinement. A focused mask was then applied to the central region of the map for local refinement, producing a reconstruction with a nominal resolution of 4.6-Å resolution. Data collection statistics for the two ZPD datasets are reported in Supplementary Tables 2 and 3.

For non-ROCKET modeling of ZPD, a local installation of AF3 (ref. ⁹) was used to predict a minimal filament fragment encompassing a full protein chain and two half ones (as previously done for uromodulin⁷², as well as the ZP1–ZP3 complex⁶⁹). The top-scoring model (ranking score 0.8) was then fitted into a version of the ZPD low-resolution map postprocessed by EMReady2 (ref. ⁷¹) through rigid-body docking in Chimera⁸⁵, followed by flexible fitting with Namdinator⁸⁶, trimming of nonresolved terminal residues and fitting of clear N-glycan densities in Coot⁸⁷. The resulting model was finally subjected to positional real-space refinement using noncrystallographic symmetry (NCS) constraints and increased nonbonded interaction weights, followed by atomic displacement parameter refinement against the unsharpened map with phenix.real_space_refine⁸⁸. The coordinates refined against the low-resolution data were used as a starting point for extending the model with an additional EGF and ZP-N domain fragment from a fourth ZPD subunit; after rigid-body docking into the medium-resolution map and manual editing of N termini and glycans, the resulting seven-domain model was flexibly fitted using the cryo_em_minimizer script of the cg2all package⁸⁹. Refinement was initially performed with phenix.real_space_refine as described above; subsequently, the model was refined against the medium-resolution half-maps using the REFMAC Servalcat task of CCP-EM Doppio^90,91 and applying global NCS restraints, ProSMART⁹²-generated self-restraints and an increased weight of nonbonded interactions (‘vdwr 2’). Model geometry and map-fitting parameters were computed using the comprehensive validation tool of PHENIX⁹³. Helical indexing of the medium-resolution single-particle map of ZPD, performed with HI3D⁹⁴, yielded helical parameters of twist = 115° and rise = 71 Å.

Data visualization

Visualization of PDB structures and experimental densities was performed with PyMOL (Schrödinger)⁹⁵ and Moorhen⁹⁶. Structure validation was performed with MolProbity⁹⁷.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Datasets are made available for tutorials and further experiment on Zenodo (https://doi.org/10.5281/zenodo.15084557)⁹⁸. Cryo-EM maps and atomic coordinates of the ZPD filament have been deposited in EMDB and PDB with accession codes 28YJ and EMD-56971, respectively.

Code availability

All ROCKET code and target functions are available from GitHub (https://github.com/alisiafadini/ROCKET) and the v.0.1.0 used for the paper is archived on Zenodo (https://doi.org/10.5281/zenodo.15084557)⁹⁸. Documentation and tutorials can be found online (https://rocket-9.gitbook.io/rocket-docs).

Change history

28 April 2026
This paper was originally published under a standard Springer Nature license (© The Author(s), under exclusive licence to Springer Nature America, Inc.). It is now available as an open-access paper under an CC-BY 4.0 licence, © The Author(s).

References

Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article CAS PubMed PubMed Central Google Scholar
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jumper, J. et al. Applying and improving AlphaFold at CASP14. Proteins 89, 1711–1721 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pereira, J. et al. High-accuracy protein structure prediction in CASP14. Proteins 89, 1687–1699 (2021).
Article CAS PubMed Google Scholar
Karelina, M., Noh, J. J. & Dror, R. O. How accurately can one predict drug binding modes using AlphaFold models? eLife 12, RP89386 (2023).
Article CAS PubMed PubMed Central Google Scholar
Terwilliger, T. C. et al. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat. Methods 21, 110–116 (2024).
Article CAS PubMed Google Scholar
He, X.-H., Li, J.-R., Shen, S.-Y. & Xu, H. E. AlphaFold3 versus experimental structures: assessment of the accuracy in ligand-bound G protein-coupled receptors. Acta Pharmacol. Sin. 46, 1111–1122 (2025).
Article CAS PubMed Google Scholar
Buttenschoen, M., Morris, G. M. & Deane, C. M. PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chem. Sci. 15, 3130–3139 (2024).
Article CAS PubMed Google Scholar
Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).
Article CAS PubMed PubMed Central Google Scholar
Brändén, G. & Neutze, R. Advances and challenges in time-resolved macromolecular crystallography. Science 373, eaba0954 (2021).
Article PubMed Google Scholar
Hekstra, D. R. Emerging time-resolved X-ray diffraction approaches for protein dynamics. Annu. Rev. Biophys. 52, 255–274 (2023).
Article CAS PubMed PubMed Central Google Scholar
Turk, M. & Baumeister, W. The promise and the challenges of cryo-electron tomography. FEBS Lett 594, 3243–3261 (2020).
Article CAS PubMed Google Scholar
Nogales, E. & Mahamid, J. Bridging structural and cell biology with cryo-electron microscopy. Nature 628, 47–56 (2024).
Article CAS PubMed PubMed Central Google Scholar
McCoy, A. J., Sammito, M. D. & Read, R. J. Implications of AlphaFold2 for crystallographic phasing by molecular replacement. Acta Crystallogr. D 78, 1–13 (2022).
Article CAS Google Scholar
Millán, C. et al. Assessing the utility of CASP14 models for molecular replacement. Proteins 89, 1752–1769 (2021).
Article PubMed PubMed Central Google Scholar
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in PHENIX. Acta Crystallogr. D 75, 861–877 (2019).
Article CAS Google Scholar
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. D 53, 240–255 (1997).
Article CAS PubMed Google Scholar
Yamashita, K., Wojdyr, M., Long, F., Nicholls, R. A. & Murshudov, G. N. GEMMI and Servalcat restrain REFMAC5. Acta Crystallogr. D 79, 368–373 (2023).
Article CAS Google Scholar
Langer, G. G., Cohen, S. X., Lamzin, V. S. & Perrakis, A. Automated macromolecular model building for X-ray crystallography using ARP/wARP v.7. Nat. Protoc. 3, 1171–1179 (2008).
Article CAS PubMed PubMed Central Google Scholar
Terwilliger, T. C. Improving macromolecular atomic models at moderate resolution by automated iterative model building, statistical density modification and refinement. Acta Crystallogr. D 59, 1174–1182 (2003).
Article PubMed PubMed Central Google Scholar
Terwilliger, T. C. et al. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard. Acta Crystallogr. D 64, 61–69 (2008).
Article CAS PubMed Google Scholar
The wwPDB Consortium. EMDB—the Electron Microscopy Data Bank. Nucleic Acids Res. 52, D456–D465 (2024).
Article Google Scholar
Headd, J. J. et al. Use of knowledge-based restraints in phenix.refine to improve macromolecular refinement at low resolution. Acta Crystallogr. D 68, 381–390 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liwo, A., Lee, J., Ripoll, D. R., Pillardy, J. & Scheraga, H. A. Protein structure prediction by global optimization of a potential energy function. Proc. Natl Acad. Sci. USA 96, 5482–5485 (1999).
Article CAS PubMed PubMed Central Google Scholar
Raman, S. et al. Structure prediction for CASP8 with all-atom refinement using Rosetta. Proteins 77, 89–99 (2009).
Article CAS PubMed PubMed Central Google Scholar
Stahl, K., Graziadei, A., Dau, T., Brock, O. & Rappsilber, J. Protein structure prediction with in-cell photo-crosslinking mass spectrometry and deep learning. Nat. Biotechnol. 41, 1810–1819 (2023).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. Distance-AF improves predicted protein structure models by AlphaFold2 with user-specified distance constraints. Commun. Biol. 8, 1392 (2025).
Article CAS PubMed PubMed Central Google Scholar
Jamali, K. et al. Automated model building and protein identification in cryo-EM maps. Nature 628, 450–457 (2024).
Article CAS PubMed PubMed Central Google Scholar
Terwilliger, T. C. et al. Accelerating crystal structure determination with iterative AlphaFold prediction. Acta Crystallogr. D 79, 234–244 (2023).
Article CAS Google Scholar
Feng, S. et al. Integrated structure prediction of protein–protein docking with experimental restraints using ColabDock. Nat. Mach. Intell. 6, 924–935 (2024).
Article Google Scholar
Levy, A. et al. Solving inverse problems in protein space using diffusion-based priors. Preprint at https://arxiv.org/abs/2406.04239 (2024).
Maddipatla, A. et al. Inverse problems with experiment-guided AlphaFold. Preprint at https://arxiv.org/abs/2502.09372 (2025).
Ahdritz, G. et al. OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization. Nat. Methods 21, 1514–1524 (2024).
Article CAS PubMed PubMed Central Google Scholar
Li, M., Dalton, K. & Hekstra, D. SFCalculator: connecting deep generative models and crystallography. Preprint at bioRxiv https://doi.org/10.1101/2025.01.12.632630 (2025).
Wayment-Steele, H. K. et al. Predicting multiple conformations via sequence clustering and AlphaFold2. Nature 625, 832–839 (2024).
Article CAS PubMed Google Scholar
Bryant, P. & Noé, F. Improved protein complex prediction with AlphaFold-multimer by denoising the MSA profile. PLoS Comput. Biol. 20, e1012253 (2024).
Article CAS PubMed PubMed Central Google Scholar
Xie, T., Song, Z. & Huang, J. Conditioned protein structure prediction. PRX Life 2, 043001 (2024).
Article Google Scholar
Monteiro da Silva, G., Cui, J. Y., Dalgarno, D. C., Lisi, G. P. & Rubenstein, B. M. High-throughput prediction of protein conformational distributions with subsampled AlphaFold2. Nat. Commun. 15, 2464 (2024).
Article CAS PubMed PubMed Central Google Scholar
Stein, R. A. & Mchaourab, H. S. SPEACH_af: sampling protein ensembles and conformational heterogeneity with AlphaFold2. PLoS Comput. Biol. 18, e1010483 (2022).
Article CAS PubMed PubMed Central Google Scholar
Budziszewski, G. R., Snell, M. E., Wright, T. R., Lynch, M. L. & Bowman, S. E. J. High-throughput screening to obtain crystal hits for protein crystallography. J. Vis. Exp. 193, e65211 (2023).
Google Scholar
Boby, M. L. et al. Open science discovery of potent noncovalent SARS-CoV-2 main protease inhibitors. Science 382, eabo7201 (2023).
Günther, S. et al. X-ray screening identifies active site and allosteric inhibitors of SARS-CoV-2 main protease. Science 372, 642–646 (2021).
Article PubMed PubMed Central Google Scholar
Fearon, D. et al. Accelerating drug discovery with high-throughput crystallographic fragment screening and structural enablement. Appl. Res. 4, e202400192 (2025).
Article CAS Google Scholar
Dalton, K. M., Greisman, J. & Hekstra, D. A unifying Bayesian framework for merging X-ray diffraction data. Nat. Commun. 13, 7764 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhong, E. D., Bepler, T., Berger, B. & Davis, J. H. CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks. Nat. Methods 18, 176–185 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kimanius, D. et al. Data-driven regularization lowers the size barrier of cryo-EM structure determination. Nat. Methods 21, 1216–1221 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wang, X. et al. CryoREAD: de novo structure modeling for nucleic acids in cryo-EM maps using deep learning. Nat. Methods 20, 1739–1747 (2023).
Article CAS PubMed PubMed Central Google Scholar
Croll, T. I. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr. D 74, 519–530 (2018).
Article CAS Google Scholar
Wang, X., Zhu, H., Terashi, G., Taluja, M. & Kihara, D. DiffModeler: large macromolecular structure modeling for cryo-EM maps using a diffusion model. Nat. Methods 21, 2307–2317 (2024).
Article CAS PubMed PubMed Central Google Scholar
He, J., Lin, P., Chen, J., Cao, H. & Huang, S.-Y. Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly. Nat. Commun. 13, 4066 (2022).
Article CAS PubMed Google Scholar
Sjodt, M. et al. Structure of the peptidoglycan polymerase RodA resolved by evolutionary coupling analysis. Nature 556, 118–121 (2018).
Article CAS PubMed PubMed Central Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Article CAS PubMed PubMed Central Google Scholar
Read, R. J., Millán, C., McCoy, A. J. & Terwilliger, T. C. Likelihood-based signal and noise analysis for docking of models into cryo-EM maps. Acta Crystallogr. D 79, 271–280 (2023).
Article CAS Google Scholar
Wang, W., Gong, Z. & Hendrickson, W. A. AlphaFold-guided molecular replacement for solving challenging crystal structures. Acta Crystallogr. D 81, 4–21 (2025).
Article CAS Google Scholar
Gabriel, F. et al. Structural basis of thiamine transport and drug recognition by SLC19A3. Nat. Commun. 15, 8542 (2024).
Article PubMed PubMed Central Google Scholar
Torino, S., Dhurandhar, M., Stroobants, A., Claessens, R. & Efremov, R. G. Time-resolved cryo-EM using a combination of droplet microfluidics with on-demand jetting. Nat. Methods 20, 1400–1408 (2023).
Article CAS PubMed Google Scholar
van Beusekom, B. et al. Homology-based hydrogen bond information improves crystallographic structures in the PDB. Protein Sci 27, 798–808 (2018).
Article PubMed Google Scholar
Bond, P. S. & Cowtan, K. D. ModelCraft: an advanced automated model-building pipeline using Buccaneer. Acta Crystallogr. D 78, 1090–1098 (2022).
Article CAS Google Scholar
Brünger, A. T. Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature 355, 472–475 (1992).
Article PubMed Google Scholar
Jendrusch, M. et al. AlphaDesign: a de novo protein design framework based on AlphaFold. Mol. Syst. Biol. 21, 1166–1189 (2025).
Article PubMed PubMed Central Google Scholar
Wicky, B. I. M. et al. Hallucinating symmetric protein assemblies. Science 378, 56–61 (2022).
Article CAS PubMed PubMed Central Google Scholar
Frank, C. et al. Scalable protein design using optimization in a relaxed sequence space. Science 386, 439–445 (2024).
Article CAS PubMed PubMed Central Google Scholar
De Zitter, E., Coquelle, N., Oeser, P., Barends, T. R. M. & Colletier, J.-P. Xtrapol8 enables automatic elucidation of low-occupancy intermediate-states in crystallographic studies. Commun. Biol. 5, 640 (2022).
Article PubMed PubMed Central Google Scholar
Waschbüsch, D. et al. Structural basis for the specificity of PPM1H phosphatase for Rab GTPases. EMBO Rep. 22, e52675 (2021).
Article PubMed PubMed Central Google Scholar
Pullen, K. E. et al. An alternate conformation and a third metal in PstP/Ppp, the M. tuberculosis PP2C-family Ser/Thr protein phosphatase. Structure 12, 1947–1954 (2004).
Article CAS PubMed Google Scholar
Christou, N.-E. et al. Time-resolved crystallography captures light-driven DNA repair. Science 382, 1015–1020 (2023).
Article CAS PubMed Google Scholar
Litscher, E. S. & Wassarman, P. M. The mammalian egg’s zona pellucida, fertilization, and fertility. Curr. Top. Dev. Biol. 162, 207–258 (2025).
Article CAS PubMed Google Scholar
Bokhove, M. & Jovine, L. Structure of zona pellucida module proteins. Curr. Top. Dev. Biol. 130, 413–442 (2018).
Article CAS PubMed Google Scholar
Nishio, S. et al. ZP2 cleavage blocks polyspermy by modulating the architecture of the egg coat. Cell 187, 1440–1459 (2024).
Article CAS PubMed PubMed Central Google Scholar
Okumura, H. et al. A newly identified zona pellucida glycoprotein, ZPD, and dimeric ZP1 of chicken egg envelope are involved in sperm activation on sperm–egg interaction. Biochem. J. 384, 191–199 (2004).
Article CAS PubMed PubMed Central Google Scholar
Cao, H., Li, T., Chen, J., He, J. & Huang, S.-Y. Improving cryo-EM maps by resolution-dependent and heterogeneity-aware deep learning. Preprint at bioRxiv https://doi.org/10.1101/2025.09.03.674102 (2025).
Stsiapanava, A. et al. Cryo-EM structure of native human uromodulin, a zona pellucida module polymer. EMBO J 39, e106807 (2020).
Article CAS PubMed PubMed Central Google Scholar
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. Preprint at bioRxiv https://doi.org/10.1101/2021.10.04.463034 (2021).
Zheng, H., Hou, J., Zimmerman, M. D., Wlodawer, A. & Minor, W. The future of crystallography in drug discovery. Expert Opin. Drug Discov 9, 125–137 (2014).
Article CAS PubMed Google Scholar
McCoy, A. J. et al. Phasertng: directed acyclic graphs for crystallographic phasing. Acta Crystallogr. D 77, 1–10 (2021).
Article CAS Google Scholar
Millán, C., McCoy, A. J., Terwilliger, T. C. & Read, R. J. Likelihood-based docking of models into cryo-EM maps. Acta Crystallogr. D 79, 281–289 (2023).
Article Google Scholar
Read, R. J. & McCoy, A. J. A log-likelihood-gain intensity target for crystallographic phasing that accounts for experimental error. Acta Crystallogr. D 72, 375–387 (2016).
Article CAS Google Scholar
Read, R. J. Structure-factor probabilities for related structures. Acta Crystallogr. A 46, 900–912 (1990).
Article Google Scholar
Fletcher, R. Practical Methods of Optimization 2nd edn (John Wiley & Sons, 1987).
Pintilie, G. et al. Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat. Methods 17, 328–334 (2020).
Article CAS PubMed PubMed Central Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Google Scholar
Lawrence, J., Bernal, J. & Witzgall, C. A purely algebraic justification of the Kabsch–Umeyama algorithm. J. Res. Natl Inst. Stand. Technol. 124, 1–6 (2019).
Article PubMed PubMed Central Google Scholar
Klug, A., Crick, F. H. C. & Wyckoff, H. W. Diffraction by helical structures. Acta Crystallogr 11, 199–213 (1958).
Article CAS Google Scholar
Frank, J. et al. SPIDER and WEB: processing and visualization of images in 3D electron microscopy and related fields. J. Struct. Biol. 116, 190–199 (1996).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Kidmose, R. T. et al. Namdinator—automatic molecular dynamics flexible fitting of structural models into cryo-EM and crystallography experimental maps. IUCrJ 6, 526–531 (2019).
Article CAS PubMed PubMed Central Google Scholar
Casañal, A., Lohkamp, B. & Emsley, P. Current developments in Coot for macromolecular model building of electron cryo-microscopy and crystallographic data. Protein Sci 29, 1069–1078 (2020).
Article PubMed PubMed Central Google Scholar
Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D 74, 531–544 (2018).
Article CAS Google Scholar
Heo, L. & Feig, M. One bead per residue can describe all-atom protein structures. Structure 32, 97–111.e6 (2024).
Article CAS PubMed Google Scholar
Yamashita, K., Palmer, C. M., Burnley, T. & Murshudov, G. N. Cryo-EM single-particle structure refinement and map calculation using Servalcat. Acta Crystallogr. D 77, 1282–1291 (2021).
Article CAS Google Scholar
Burnley, T., Iadanza, M., Joseph, A., Palmer, C. & Winn, M. CCP-EM 2.0: software tools for efficient, accurate and reproducible management and automation of cryo-EM processing from images to structures. Acta Crystallogr. A 79, C17 (2023).
Article Google Scholar
Kovalevskiy, O., Nicholls, R. A., Long, F., Carlon, A. & Murshudov, G. N. Overview of refinement procedures within REFMAC5: utilizing data from different sources. Acta Crystallogr. D 74, 215–227 (2018).
Article CAS Google Scholar
Afonine, P. V. et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr. D 74, 814–840 (2018).
Article CAS Google Scholar
Sun, C., Gonzalez, B. & Jiang, W. Helical indexing in real space. Sci. Rep. 12, 8162 (2022).
Article CAS PubMed PubMed Central Google Scholar
PyMOL v.1.8 (Schrödinger, LLC, 2015).
Sanchez, F. Moorhen: a web browser molecular graphics program based on the Coot desktop program. Zenodo https://doi.org/10.5281/zenodo.14923759 (2025).
Williams, C. J. et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci 27, 293–315 (2018).
Article CAS PubMed Google Scholar
Fadini, A. & Li, M. AlphaFold as a prior: experimental structure determination conditioned on a pretrained neural network. Zenodo https://doi.org/10.5281/zenodo.15084557 (2025).

Download references

Acknowledgements

We acknowledge the following funding: Wellcome Trust (grant number 209407/Z/17/Z to R.J.R.), National Institutes of Health National Institute of General Medical Sciences (P01-GM063210 to R.J.R. and T.C.T.; R35-GM150546 to M.A.; DP2-GM141000 to D.R.H.), Biotechnology and Biological Sciences Research Council (grant number BB/Y009398/1 to R.J.R.), Knut and Alice Wallenberg Foundation (grant 2018.0042 to L.J.), Swedish Research Council (grant 2024-05336 to L.J.), a graduate fellowship from the Eric and Wendy Schmidt Center to M.H.L. and Research Ireland grant 20/FFP-A/8446 to A.R.K. ZPD data were collected at the SciLifeLab Cryo-EM Infrastructure at Umeå University (funded by the Knut and Alice Wallenberg, Family Erling Persson and Kempe Foundations, Swedish Foundation for Strategic Research and Swedish Research Council (2021-00271)). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. We would also like to thank K. M. Dalton, R. Gaudet, T. J. Lane, S. Eddy, S. Petti and L. Sandblad for insight and discussions, C. Floristean for help with the OpenFold codebase and T. Shaikh for help with ZPD helical reconstruction.

Author information

These authors contributed equally: Alisia Fadini, Minhuan Li.

Authors and Affiliations

Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
Alisia Fadini, Airlie J. McCoy & Randy J. Read
Department of Systems Biology, Columbia University, New York, NY, USA
Alisia Fadini & Mohammed AlQuraishi
John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
Minhuan Li & Doeke R. Hekstra
Center for Computational Mathematics, Flatiron Institute, New York, NY, USA
Minhuan Li
Department of Chemistry, Umeå University, Umea, Sweden
Suresh Banjara
Department of Applied Biological Chemistry, Faculty of Agriculture, Meijo University, Nagoya, Japan
Hiroki Okumura
School of Biochemistry and Immunology, Trinity College Dublin, Dublin, Ireland
Eve Napier & Amir R. Khan
Department of Immunology, St. Jude Children’s Research Hospital, Memphis, TN, USA
Pietro Fontana
Department of Medicine, Huddinge, Karolinska Institutet, Stockholm, Sweden
Luca Jovine
New Mexico Consortium, Los Alamos, NM, USA
Thomas C. Terwilliger
Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
Doeke R. Hekstra

Authors

Alisia Fadini
View author publications
Search author on:PubMed Google Scholar
Minhuan Li
View author publications
Search author on:PubMed Google Scholar
Airlie J. McCoy
View author publications
Search author on:PubMed Google Scholar
Suresh Banjara
View author publications
Search author on:PubMed Google Scholar
Hiroki Okumura
View author publications
Search author on:PubMed Google Scholar
Eve Napier
View author publications
Search author on:PubMed Google Scholar
Pietro Fontana
View author publications
Search author on:PubMed Google Scholar
Amir R. Khan
View author publications
Search author on:PubMed Google Scholar
Luca Jovine
View author publications
Search author on:PubMed Google Scholar
Thomas C. Terwilliger
View author publications
Search author on:PubMed Google Scholar
Randy J. Read
View author publications
Search author on:PubMed Google Scholar
Doeke R. Hekstra
View author publications
Search author on:PubMed Google Scholar
Mohammed AlQuraishi
View author publications
Search author on:PubMed Google Scholar

Contributions

A.F., M.L., T.C.T., A.J.M., R.J.R., M.A. and D.R.H. contributed to the main ideas and concepts in the work. A.F. and M.L. developed and implemented ROCKET. A.F., M.L., R.J.R., A.J.M. and D.R.H. carried out the analyses with public data. S.B., H.O. and L.J contributed the data, experiments and analysis on ZPD. E.N., P.F. and A.R.K. contributed the data and experiments on the PPM1H complex. All authors contributed to the writing of the paper.

Corresponding authors

Correspondence to Randy J. Read, Doeke R. Hekstra or Mohammed AlQuraishi.

Ethics declarations

Competing interests

M.A. is a member of the scientific advisory boards of Cyrus Biotechnology, Deep Forest Sciences, Nabla Bio, Oracle Therapeutics and Achira. The other authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks James Fraser and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Arunima Singh, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Supplementary Text, Figures 1–20, Tables 1–3 and References.

Reporting Summary (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fadini, A., Li, M., McCoy, A.J. et al. AlphaFold as a prior: experimental structure determination conditioned on a pretrained neural network. Nat Methods 23, 785–795 (2026). https://doi.org/10.1038/s41592-026-03047-4

Download citation

Received: 13 November 2025
Accepted: 25 February 2026
Published: 01 April 2026
Version of record: 01 April 2026
Issue date: April 2026
DOI: https://doi.org/10.1038/s41592-026-03047-4