AI meets physics in computational structure-based drug discovery for GPCRs

Michino, Mayako; Vendome, Jeremie; Kufareva, Irina

doi:10.1038/s44386-025-00019-0

Download PDF

Review
Open access
Published: 03 July 2025

AI meets physics in computational structure-based drug discovery for GPCRs

Mayako Michino¹^nAff4,
Jeremie Vendome² &
Irina Kufareva³

npj Drug Discovery volume 2, Article number: 16 (2025) Cite this article

4311 Accesses
1 Citations
Metrics details

Subjects

Abstract

G protein-coupled receptors (GPCRs) are a prominent class of therapeutic targets for which structure-based drug discovery (SBDD) has traditionally been challenging to apply. However, recent artificial intelligence (AI)-powered breakthroughs have opened new avenues. Here, we discuss the impact of computational models on hit discovery and lead optimization for GPCRs. We also provide best practices for generating and validating predictive models for prospective use.

G protein-coupled receptors: structure- and function-based drug discovery

Article Open access 08 January 2021

G protein-coupled receptors (GPCRs): advances in structures, mechanisms and drug discovery

Article Open access 10 April 2024

Characterizing conformational states in GPCR structures using machine learning

Article Open access 11 January 2024

Introduction

G protein-coupled receptors (GPCRs) are a prominent class of therapeutic targets, with nearly a third of the FDA-approved drugs targeting members of this protein family^1,2,3. Structure-based drug discovery (SBDD) uses the three-dimensional (3D) structure of protein targets to rationally identify and optimize compounds in preclinical drug discovery⁴. The SBDD process consists of four key phases (Fig. 1): 1) receptor modeling, where a 3D model of the target receptor is built or selected, 2) modeling of ligand-bound receptor complex(es), where ligand pose is generated together with receptor conformations suitable for ligand binding, 3) hit identification, where a starting-point chemical matter, referred to as ‘hits’, is discovered, and 4) hit-to-lead and lead optimization, where the hit or lead compounds are optimized for potency and drug-like properties. In this review, we discuss the recent innovation in artificial intelligence- (AI) and physics-based computational methodologies that advance SBDD for GPCRs. The review is organized in four sections, each covering the key phase of SBDD.

**Fig. 1: Overview of computational methods used in the key phases of structure-based drug discovery.**

AI-based prediction of GPCR structures

An accurate three-dimensional structure of the target protein in a relevant functional state is a central component and a critical prerequisite for structure-based drug discovery⁴. However, for GPCRs, high-resolution experimental structures have historically been scarce⁵. Until only a few years ago, challenges of experimental structure determination and accurate structure prediction largely precluded structure-based drug discovery efforts for this target class.

Since 2020, AI approaches have led to truly breakthrough advances in protein structure prediction, as demonstrated in the community-wide blind competition CASP14 (14^th biannual Critical Assessment of Structure Prediction⁶) and as recognized by a Nobel prize in Chemistry⁷. Deep-learning based methods like AlphaFold2 (AF2)⁸ and RoseTTAFold⁹ consistently deliver structural predictions approaching experimental accuracy⁶. These AI-based structure prediction algorithms are trained on known experimental structures deposited in the Protein Data Bank (PDB)¹⁰, and thus would not have been successful for GPCRs without the preceding explosion in the number of experimental GPCR structures in the PDB. However, in contrast to conventional homology modeling, AI methods do not directly depend on structures of homologs with high sequence identity.

As of March 2025, experimentally determined structures have been solved for about a quarter of the GPCR superfamily (235 out of ~800 GPCRs)^11,12, but AF2 models are available for all superfamily members, including its largest subfamily, Class A (674 receptors, Fig. 2). High prediction confidence (average TM domain pLDDT >90) is featured not only by models of receptors with medium-to-high sequence identity to known structures (>35% identity in transmembrane (TM) domain) but also by the majority of receptors with only distant homology in the PDB (Fig. 2a). For class A GPCRs, the pLDDT scores of the TM orthosteric pocket are nearly as high as that of the TM domain (Fig. 2b), albeit slightly more variable, suggesting an overall confidence in predicted AF2 models around the ligand binding site. Importantly, a large fraction of AF2 models are in good agreement with previously available and subsequently solved experimental structures, showing root mean square deviation (RMSD) of <2 Å in both the TM domain backbone and the orthosteric pocket side chains, across all ranges of homology to known structures available at the time of model training (Fig. 2a, b).

**Fig. 2: Overview of confidence and geometric accuracy of the off-the-shelf class A GPCR AF2 models.**

Several studies have systematically examined the geometric accuracy of GPCR models predicted by AF2 and RoseTTAFold. By examining 29 GPCRs for which the structures were released after the publication of the AF2 database in 2021, He et al. established that AF2 achieves a TM domain Cα RMSD accuracy of ~1 Å¹³. However, AF2 models showed limitations in the extracellular loop (ECL)-TM domain assembly and at the transducer interface, as well as in the sidechain conformations of the orthosteric ligand binding site, so that ligands failed to dock in native-like poses¹³. Lee et al. compared the AF2 and RoseTTAFold predicted models to the experimental structures for 73 GPCRs, and found that AF2 tends to be slightly more accurate than RoseTTAFold, while both performed better than conventional homology modeling method for receptors with no good templates¹⁴. These findings are in agreement with the experience of labs specializing in determining experimental structures of GPCRs¹⁵, and with the general evaluation of AF2 predictions across different protein families. Furthermore, despite the reputation of AF2 models having ‘near-experimental accuracy’, Terwilliger et al. concluded that the mean error in predicted models is higher than the experimental error in the determined structures¹⁶. For example, for high-confidence residues (pLDDT > 90), AF2 models had a mean prediction error of 0.6 Å Ca RMSD, vs 0.3 Å Cα RMSD for experimental structures. The side chains in moderate-to-high confidence regions (pLDDT > 70) of AF2 models had 10% (vs 6% in experimental structures) of residues with an error over 2 Å, and 20% (vs 2% in experimental structures) with conformations substantially different (>1.5 Å RMSD) from the experimental density maps¹⁶.

Aside from geometric accuracy, a critical aspect of AI-generated structural models is their physical validity which concerns both the covalent bonds (e.g. bond lengths, angles, and shapes of aromatic rings) and the non-bonded intramolecular interactions (e.g. steric clashes). In the case of AF2, non-physical contacts and geometries are often present in initial models but are sufficiently mild to be removed by model relaxation within the prediction routine⁸.

One major limitation of AF2 is its inability to directly model functionally distinct conformational states of the target protein¹⁷. GPCRs undergo a large conformational change upon agonist binding and thus can adopt at least two distinct states, inactive and active; however, the models often represent only one state that is biased by the experimental structures in the training database. By analyzing the predicted conformation of TM6 and TM7, indicative of receptor activation state, He et al. concluded that AF2 tended to produce an “average” conformation for class A and an active-like conformation for class B1 GPCRs, consistent with the activation state distribution of the available structures in the PDB at the time of their analyses (55% inactive/37% active for class A, 70% active for class B1)¹³. That said, conformational variation and local uncertainties in AF2 ensembles are often consistent with intrinsic protein dynamics¹⁸. Accordingly, for select GPCRs with a sufficient number of conformationally diverse templates in the PDB training set, AF2 can produce a conformational ensemble that spans part of the receptor conformational spectrum¹⁹.

To enable reproducible generation of state-specific GPCR models by AF2, the Feig group developed an extension, AlphaFold-MultiState, that uses activation state-annotated template GPCR databases^20,21. The generated models show an excellent agreement with both previously available and subsequently solved experimental structures of GPCRs in respective states (Fig. 2c–f). Other groups have reported the generation of functionally relevant conformational state ensembles by modifying and reducing the depth of the input multiple-sequence alignments^{22,23,24,25,26}.

In the years since the release of AF2, the computational community has developed a number of alternative implementations designed to improve AF2 scalability, accessibility, and applicability. Spearheaded by the AlQuraishi group, OpenFold is a GPU-memory-efficient reproduction of AF2 that enables retraining on a new dataset²⁷. MassiveFold has been developed to parallelize AF2 to significantly reduce the compute time²⁸. Finally, to address the limitations of single-conformation predictors, Microsoft Research has recently presented BioEmu²⁹, a scalable generative deep learning model for generating protein equilibrium ensembles. While these developments are exciting, they have not yet been applied specifically to GPCR structure prediction.

Prediction of GPCR-ligand complex geometries

Another critical prerequisite for both structure-based hit identification and lead optimization is an accurate structural model for not just the receptor, but for the complex between the receptor and the relevant ligand. A model showing near-native ligand pose within the receptor binding pocket, and forming receptor-ligand interactions similar to those observed experimentally, can be instrumental for rationalizing and predicting structure-activity relationships (SAR) in a compound series, optimizing compound potency, and discovering new ligands with different scaffolds. In benchmark studies, the accuracy of predicted ligand poses is typically assessed relative to an experimental structure of the same complex by the RMSD of ligand heavy atoms after optimally superimposing the receptor binding pocket or TM domain (Fig. 3a–c), whereas receptor-ligand interactions can be evaluated by comparing the experimentally observed and predicted interatomic distances for all ligand-receptor atom pairs. For any given set of receptor-ligand complex models, ligand RMSD from the experimental structure and fraction of correctly predicted contacts are usually only in a loose agreement with each other (Fig. 3d, e). Therefore, it is convenient to combine these two metrics and to assess the result in the context of the variation of corresponding parameters observed across pairs of experimental high-resolution structures of identical composition complexes in the PDB (Fig. 3d, e)^30,31. The percentile within that distribution is a quantitative expression of the geometrical ‘correctness’ of the given model.

**Fig. 3: Variation of geometric accuracy (‘correctness’) for computationally predicted models of GPCR-ligand complexes.**

Prediction of receptor-ligand complex geometry conventionally involves docking of the ligand into the binding pocket of the receptor, by flexibly sampling the possible ligand conformations within the rigid receptor binding pocket, followed by scoring and ranking of the resulting poses. The success of this docking approach strongly depends on the accuracy of the binding pocket and the compatibility of its shape with the binding of the given ligand. It also depends on the type of ligands being docked, e.g. ligands with many rotatable bonds, such as peptides, are more challenging. Keeping the receptor rigid during docking helps computational efficiency but makes it impossible to capture any significant rearrangement of the receptor pocket conformation that may occur upon binding of the ligand, and to account for the so-called induced fit effect.

With the overall improvement in receptor model accuracy achieved by AF2 and RoseTTAFold, there was an expectation that the accuracy of ligand pose prediction by docking would also improve. However, in practice, the resulting impact turned out to be less straightforward. Karelina et al examined the docking accuracy of 54 ligands to unrefined, non-state-specific AF2 models of 17 class A and 1 class B GPCRs³². They found that despite the improved binding pocket accuracy, the fraction of correctly predicted ligand binding poses (ligand RMSD ≤ 2.0 Å relative to experimental structure) was not significantly higher for AF2 models compared to traditional homology models. By contrast, Lee et al. evaluated the success rate of docking to AF2 models for 38 agonist and 32 antagonist ligands across 33 GPCRs spanning classes A, B1, C, and F³³. They found that by considering the relevant functional state of the receptor, and by incorporating receptor side-chain flexibility, both the binding site prediction accuracy and the ligand pose prediction accuracy (ligand RMSD < 2.5 Å relative to experimental structure) are improved for AF2 models compared to homology models. This result underlines the importance of receptor refinement and induced fit modeling in ligand recognition. Despite the observed ligand pose prediction accuracy improvements in AF2 models compared to conventional homology models, both studies showed that the accuracy is much higher when docking to experimentally determined structures^32,33.

Conducted in 2008, 2010 (Fig. 3d), 2013, and 2021 (Fig. 3e), GPCR Dock is a series of community-wide assessments of blind structure prediction for GPCR-ligand complexes^30,31,34,35. The latest round of GPCR Dock was conducted in 2021, soon after the release of AphaFold2, and challenged the participants with predicting the structures for five target complexes: two with small molecule ligands (apelin receptor (APJ) with Cmpd6 and GPR139 with JNJ-63533054) and three with peptides (κ-type opioid receptor (OPRK) with dynorphin, neuropeptide Y receptor type 1 (NPY1R) with NPY, and neuromedin U receptor type 2 (NMUR2) with NMU25)³⁵. Across the board, the majority of predictions (including the most accurate ones) capitalized on AF2 models, even for those receptors that had experimental structures in the PDB at the time of the assessment³⁵. Consistent with findings of Karelina et al. ³² and Lee et al. ³³, the highest achieved prediction accuracy for small-molecule targets (which required ligand docking) was in the same range as in prior, pre-AF2 assessments (Fig. 4a); however, encouragingly, this accuracy was achieved even for GPR139, for which no closely homologous structures were available. By contrast with this modest improvement for the small-molecule targets, the prediction accuracy for the peptide targets (traditionally much more challenging for docking) improved unexpectedly and dramatically (Fig. 4b). For two out of three peptide targets (Y₁ and NMU2), the most accurate predictions were generated by AlphaFold-Multimer³⁶ which allowed the complexes to be co-folded with the use of AI-informed distograms. For the third peptide target, the κ receptor, the peptide might have been too short for AF2-Multimer to capture its co-evolution patterns with the receptor; consequently, the best predictions were less accurate (but still by far exceeded the best peptide results from GPCR Dock 2010, Fig. 4b) and were generated by peptide modeling based on homology rather than by AF2-Multimer³⁵.

**Fig. 4: The distribution of “correctness” (geometric accuracy) across the computational models of GPCR-ligand complexes in the GPCR Dock assessments 2010, 2013, and 2021.**

GPCR Dock 2021 has demonstrated that the co-folding approach implemented in AF2-Multimer³⁶ largely solves the challenges of complex geometry prediction for large natural peptides, including receptor induced fit and ligand flexibility. On the other hand, the problems remained unsolved for small molecules. Thus, the computational community has since been seeking a comparable solution that would accurately predict the ligand pose and the associated pocket conformational rearrangements for receptor complexes with small-molecule compounds.

In 2023-24, several methods have been published for “end-to-end” prediction of protein complexes with small-molecule compounds and non-protein biomolecules (ions, nucleic acids), such as DiffusionProteinLigand³⁷, NeuralPlexer³⁸, RosettaFold-AllAtom³⁹, and most recently, AlphaFold3 (AF3)^40,41,42,43. The new methods rely on diffusion-based co-folding and are designed to address the prior limitation where the inaccuracy (or an inadequate conformation) of the receptor binding site impacted ligand docking. Considering the restricted Terms of Use of AF3⁴⁴, many academic and industry groups strived to implement their own versions^42,45, including ChaiDiscovery Chai-1⁴⁶, Iambic Therapeutics NeuralPlexer 2 and 3 beta⁴⁷, Ligo Biosciences AlphaFold3⁴⁸, Baidu HelixFold3⁴⁹, ByteDance Protenix⁵⁰, Umol⁵¹, and Boltz-1⁵².

The accuracy of receptor-ligand complex geometry predictions varies widely between the listed co-folding methods and is also dependent on the evaluation benchmark. For example, on the PoseBusters benchmark⁵³, the authors of both AF3 and Chai-1 report success (pocket-aligned ligand RMSD within 2 Å of the experimental structure) for ~75% of the cases, whereas the publicly available NeuralPLexer and the proprietary NeuralPLexer 3-beta are claimed to achieve ~55% and ~97% success rate, respectively⁵⁴. However, on the CASP15 dataset, the reported success rate for both Chai-1 and Boltz-1 is only ~35% and 43%, respectively⁵². Another variable aspect is the physical validity of the predictions. Diffusion-based methods often generate non-physical models violating bond length and bond angle limits and featuring inter- and intramolecular steric clashes^53,55, likely as a result of overfitting to particular data subsets in the training set. Unlike in AF2 models, these non-physical violations are not always readily removed by relaxation⁵³. Furthermore, these new AI-based methods often struggle to preserve the specified stereochemistry and protonation state of the input ligand⁵⁴. Nonetheless, graph-convolutional neural networks have been shown to be successful in generating physically valid and stereo-aware low energy conformers for small molecule compounds in isolation⁵⁶, suggesting that the difficulties with physical validity in protein-ligand complex predictions are technical rather than conceptual. Consistent with these concepts, our attempts to reproduce the structures of selected GPCR Dock complexes via diffusion co-folding had mixed success due to the non-physical intra-receptor and ligand-receptor interactions (Fig. 5a,c), and variable geometric accuracy of the predictions (Fig. 5b,d,e–j). More accurate predictions were generated for the D₃ receptor, which has many homologous and analogous structures in the PDB, compared to GPR139, which has few or no homologous and analogous structures (Fig. 5i, j). This is in agreement with the reported bias of the tested methods towards known ligands and complexes⁵⁷.

**Fig. 5: Models predicted by NeuralPlexer, Chai-1, Boltz-1, and AlphaFold3 for the dopamine D₃ receptor-eticlopride and GPR139-JNJ complexes.**

Considering the enduring limitations of end-to-end AI-based complex structure predictors, physics-based methods remain essential for the prediction of GPCR complexes with small molecule ligands, including the modeling of induced fit. Such methods include ensemble docking and scoring^58,59 and the ‘IFD-MD’ methodology which combines protein structure prediction, ligand-based pharmacophore docking, and rigid receptor docking with explicit solvent molecular dynamics (MD) simulations^60,61. Studies also reported successful attempts at ligand pose prediction and optimization by combining traditional force-field based scoring with deep learning⁶², suggesting that synergy between AI and physics may be the optimal path forward.

A key lesson from GPCR Dock 2021 was not only in the advancements in computational models but also in the increased variability in the quality of experimental structures. The cryo-EM resolution revolution⁶³ led to a surge of structures for highly dynamic and conformationally variable GPCR complexes that completely redefined our notion of “high resolution”. We found that in many cases, experimental structures of the same receptor-ligand complex are as distinct from each other (in terms of ligand RMSD and ligand-receptor contacts), or even more distinct, than they are from the best computationally generated models³⁵. In some cases, the differences can be attributed to variations in intracellular effectors^64,65,66, but in other cases, they are hardly rationalizable and may simply reflect the dynamics of the complex^67,68,69 or the resolution limits of the experiment⁷⁰. In any case, it appears that the accuracy of the best modern computational predictions for GPCR-ligand complex geometries is well within the accuracy of modern experimental structures. A caveat for structure-based drug discovery is that neither predicted models nor experimental structures may be directly usable in hit discovery or lead optimization without careful evaluation of their performance in respective applications. This highlights the importance of model optimization, selection, and validation, as outlined in the following sections.

Ligand discovery based on GPCR structural models

In the hit identification phase of the drug discovery process, the goal is to find compounds with novel scaffolds that have measurable potency and desired efficacy towards the target of interest. Computational approaches to this hit-finding process involve screening of a large library of small molecules either by ligand-based or structure-based methods. Ligand-based approaches use 2D or 3D pharmacophores representing known compounds, which often leads to limited variation in scaffolds and chemotypes of the identified new chemical matter, whereas structure-based virtual ligand screening (VLS) relies on one or more 3D models of the target receptor binding pocket and can often find novel and diverse chemotypes. Top-scoring molecules from the VLS campaign are triaged to a final small and diverse set of predicted active compounds (‘actives’) that are experimentally evaluated. The success rates of VLS campaigns (defined as the fraction of predicted actives that are confirmed experimentally) vary substantially depending on the 3D models used. For GPCRs, VLS campaigns using high-resolution experimental structures have reported hit rates as high as ~60%, and nanomolar potencies^{71,72,73,74,75,76,77,78}, whereas the success rates for computationally predicted structural models have traditionally shown lower hit rates and weaker affinities^79,80,81.

A widely accepted approach to structure/model selection for prospective VLS, and to gauging the expected success rate, is to evaluate its performance in retrospective VLS where the model is challenged with discriminating a small number of known actives (agonists or antagonists) from a much larger set of decoys, consisting of unrelated compounds with similar physicochemical properties (size, charge, flexibility) but dissimilar structures to the actives^{82,83,84,85,86}. The Database of Useful Decoys—enhanced (DUDe)⁸⁷ is often used to generate property-matched decoys sets for given sets of active compounds. The VLS performance of the model—reflective of its ability to score known actives better than the decoys—is not only considered predictive of the success rate in prospective screens but also has been reported to correlate with the ability of the model to dock known actives in a geometrically correct pose⁸⁸. A caveat to this approach is a possible bias of retrospectively validated model towards similar compound scaffolds and chemotypes in prospective VLS⁸⁸, which motivates occasional inclusion of new, distinct, and non-retrospectively validated models in hit discovery campaigns⁸⁹.

Given the improved geometrical accuracy of GPCR-ligand complexes predicted with AF2, there is an expectation that such models would also have improved success rates in structure-based hit discovery campaigns. However, studies reported mixed results. Diaz-Rovira et al. established that unrefined AF2 models are not optimal for VLS because they often represent the apo state of the receptor with a collapsed binding site⁹⁰. Zhang et al. subsequently showed that VLS performance can be considerably improved with refined compared to unrefined AF2 models⁹¹. On the other hand, Lyu et al. compared the performance of a large-scale VLS using an unrefined AF2 model and an experimental structure for the 5-HT_2A receptor, and found that despite poor retrospective VLS performance and notable side chain rotamer changes in the binding site, the AF2 model was still quite effective in identifying novel ligands prospectively⁸⁹. Surprisingly, the hit rates were comparable between the experimental structure and the AF2 model, even though there was no overlap in scaffold between the hit sets. In another study, Diaz-Holguin et al. compared the VLS performance of an AF2 model and a traditional homology model for the TA₁ trace amine receptor, and found that the hit rate was two-fold higher with the AF2 model than the homology model⁹². Altogether, these studies support the utility of AI-generated GPCR structural models for hit discovery efforts for select targets, while emphasizing the role of model refinement and performance assessment.

An integral but rarely explicitly acknowledged component of the VLS process is the scoring function used to rank the compounds in the screening library⁹³. Over the years, numerous scoring functions based on physics-informed force fields have been developed^{94,95,96,97,98,99}. These scoring functions take into account physical parameters such as shape complementarity (van der Waals interactions), charge complementarity, hydrogen bonding, and conformational strain. Force-field-based scoring functions have performed well in VLS when using high-resolution experimental structures of the target receptor⁹³, but their performance rapidly degrades with minimal/subtle pocket conformational inaccuracies⁸⁸. In keeping with the AI revolution, a number of deep-learning-based scoring functions have been recently introduced, such as EquiScore¹⁰⁰, AtomNet^101,102, RTMScore¹⁰³, RTCNN^104,105, IGModel¹⁰⁶, and RosettaVS¹⁰⁷. Respective studies almost invariably report superior VLS performance of such functions on various benchmarks, compared to physics-based methods, due to higher tolerance to conformational variation and inaccuracies; however, the functions also frequently suffer from overtraining and inability to penalize non-physical interactions (e.g. assess compound conformational strain and steric clashes)⁸⁸. In the GPCR drug discovery realm, the application of such functions is in its early days, and their prospective evaluation is still pending; however, retrospective studies hint at the possibility that they may compensate for minor geometric inaccuracies in the computational models and thus overcome some of the challenges in model-based hit discovery^88,106.

To illustrate a broader relationship between the ‘correctness’ (geometric accuracy) of computationally predicted models and their VLS performance when assessed with an AI-based scoring function, RTCNN^88,104,105, we evaluated experimental structures and representative models across a range of accuracy levels from GPCR Dock assessments 2010 and 2021, for the dopamine D₃ receptor and GPR139, respectively (Fig. 3d, e). For the former, model correctness was measured in comparison with chain B of a 2.9Å X-ray structure, PDB 3pbl¹⁰⁸; for the latter, in comparison with a 3.22 Å cryo-EM structure, PDB 7vuh⁶⁴. The models in the 2010 assessment were generated by homology with existing structures of aminergic receptors (only β₂-adrenoceptor and β₁-adrenoceptor at that time), whereas the 2021 assessment GPR139 models were almost exclusively built by AF2, which is understandable considering the lack of homologous structures in the PDB at the time³⁵.

For D₃ receptor, as expected, at least one of the experimental structure chains (B) and the highest-accuracy assessment models (e.g. 8004-1 and 5084-3) showed robust discrimination of 14 active compounds from the Astra series (mostly eticlopride analogs, Supplemental Data 1) from 1505 property-matched decoys from DUDe by RTCNN (Fig. 6a, b). Unexpectedly, for GPR139, almost all tested models showed excellent discrimination (ROC AUC >75-80% Fig. 6c, d). A closer inspection revealed that known GPR139 agonists share a very tight SAR and an invariable di-peptide linkage as the key scaffold (Supplemental Data 2), recognized by the polar residues in the binding pocket of not only accurate but also geometrically incorrect computational models. Because these features were entirely absent from the DUDe-generated decoys (selected so that their Tanimoto distance (TD) from the known actives exceeded 0.5), the retrospective VLS task turned out ‘too easy’ in the case of GPR139. By replacing the DUDe-generated decoy set by a set of ChEMBL compounds at 0.1 < TD < 0.5 without known activity at GPR139, the retrospective VLS task became more challenging, leading to overall deterioration of the structure and model VLS performance but also to more pronounced differences among experimental structures, most accurate models, and less accurate models (Fig. 6e, f). These examples demonstrate that the relationship between model correctness and its predictive capacity in VLS is not straightforward and may require the use of carefully selected target-specific compound benchmarks. They also show that experimental structures (especially relatively low resolution cryo-EM structures) can widely vary in their VLS predictive capacity and that, conversely, some models may turn out quite predictive despite the apparently low geometric accuracy.

Fig. 6: The relationship between the VLS performance and the ‘correctness’ (geometric accuracy) of experimental structures and computationally predicted models from GPCR Dock assessments 2010 and 2021.

Beyond structure prediction and scoring of predicted receptor-ligand complexes, AI is being increasingly used to address the growing need for screening ultra-large (billion-size) virtual and combinatorial compound databases^85,109, via AI-accelerated virtual screening platforms such as MolPAL¹¹⁰, Active Learning Glide¹¹¹, DeepDock^112,113, OpenVS¹⁰⁷, GigaScreen¹¹⁴, and CP framework¹¹⁵. At a high level, this approach involves docking and scoring of a small fraction of the database and subsequent training of a target-specific neural net (NN) to predict compound binding scores from 2D structure alone. The NN is then applied to the entire database (which is orders of magnitude faster than evaluating the same compound by docking), a small fraction of top-scoring compounds is re-evaluated by docking and scoring again, and the process is repeated until convergence is reached or until the NN starts showing signs of overtraining. The success of such AI-accelerated pipeline in retrospective and prospective ligand discovery for the dopamine D₄ receptor¹¹¹ and voltage-gated sodium channel Na_V1.7¹⁰⁷, respectively, suggests its promise for membrane proteins; however, we have yet to see its prospective application to GPCRs.

Finally, there are growing reports of using generative AI for ligand discovery, which circumvents the challenges of library screening and complements structure-based VLS in hit discovery applications^{116,117,118,119,120,121,122,123,124}. For example, Powers et al. reported using an SE(3) equivariant graph neural networks to directly build compounds in the binding pockets¹²⁵. Although tested only on experimental structures, the method was demonstrated to produce better-scoring ligands than conventional VLS for a wide range of targets including GPCRs, and also generated compounds with better drug-likeness and predicted PK properties¹²⁵. Similarly, using the dopamine D₂ receptor as their case study, Thomas et al. showed that the generated molecules not only have better predicted affinity but also occupy a different physicochemical space compared to known D₂ actives¹²⁶. In-depth review of the use of generative AI for ligand discovery is outside of the scope of the present review and is described elsewhere^127,128.

In summary, AI-based structural models of GPCRs can be as effective as experimental structures in enabling hit discovery campaigns, especially when combined with AI-powered scoring functions. While selecting structural models for prospective VLS is challenging, certain best practices can increase the chances of success, like using an ensemble docking approach and including models that perform well in retrospective VLS on appropriate benchmarks (Fig. 7). Furthermore, as the accuracy of the recent experimental (mostly cryo-EM) GPCR structures has become more variable, their use for hit discovery may require the same level of refinement and selection as for computationally generated models.

**Fig. 7: Best practices for generating a model ensemble for prospective structure-based hit discovery.**

Hit-to-lead and lead optimization based on GPCR structural models

In the hit-to-lead (H2L) and lead optimization (LO) phases of the drug discovery process, the goal is to improve the drug-like properties (such as on-target potency, selectivity over off-targets, and pharmacokinetic properties) of the hit or lead compound by exploring modifications around its scaffold. This can be guided by computational prediction of potency or binding affinity of hit/lead analogs, which is often done by building and assessing the structural models of their complexes with the target. The underlying models must have sufficient sensitivity and accuracy to discriminate between highly similar ligands with small differences in potency, sometimes below one log unit¹²⁹. Arguably, this is a more challenging task than discriminating binders from non-binders in the hit discovery phase; for example, even the best AI-powered VLS scoring functions could not accurately rank order the binding affinity of actives in the same chemical series^88,93.

Alchemical free-energy perturbation (FEP) is a well-validated physics-based computational method for structure-based prediction of binding free energies^130,131. In relative binding FEP (RB-FEP) calculations, changes in binding free energy upon small modification of the ligand are calculated by simulating the corresponding alchemical transformation of one molecule into the other, both in solvent and in the binding pocket¹³¹, and by MD-based conformational sampling of the complex at different stages of the alchemical transformation path. Implementations of the free-energy perturbation method include GROMACS^132,133, YANK¹³⁴, and Schrödinger FEP + ¹³⁵. Retrospective benchmarking of FEP+ using crystal structures of protein-ligand complexes across multiple target classes reported RB-FEP accuracy close to experimental reproducibility, with an average mean unsigned error (MUE) of 0.90–0.98 kcal/mol, root mean square error (RMSE) of 1.11–1.25 kcal/mol, and correlation coefficient (R²) of 0.37–0.84 (see Fig. 8a for an example)^136,137. These error ranges correspond to only 0.66-0.72 and 0.73–0.92 log units of binding affinity, respectively, suggesting the promise of the FEP+ methodology for structure-based lead optimization¹³⁸. It should be noted, however, that in prospective applications, the reported error of FEP with respect to experiment is typically higher¹³⁹. Cited reasons include pose uncertainties and unaccounted-for pocket induced-fit effects with some of the new compounds, as well as force field limitation for certain types of compound chemistry¹³⁹.

**Fig. 8: Variation of FEP+ predictive accuracy across experimental structures and models of dopamine D₃ receptor and GPR139.**

Recently, several groups have attempted to tackle the task of relative binding energy prediction using machine learning^{140,141,142,143,144}. However, so far, these efforts have had limited success. The predictions tend to be relatively inaccurate, with MUE of log₁₀ potency prediction exceeding 1 in at least half of the cases¹⁴⁵, and are poorly generalizable^146,147. AI-based models that directly integrate physical knowledge (e.g. PBCNet¹⁴⁸ or IGModel¹⁰⁶) may have a higher prediction accuracy and also allow for better interpretability. However, even for these models, the accuracy is markedly lower than for FEP + ¹⁴⁸, possibly because they disregard the pocket conformational changes upon compound analog binding. Consequently, in the field of binding affinity prediction, physics-based approaches remain the standard.

Success of FEP calculations is tightly coupled to the accuracy of the initial protein-ligand complex structure and its ability to capture relevant interactions and their energetic contributions during simulation. GPCR-specific complexities include the dynamics of extracellular loops and the uncertainties of water positioning and displacement in the orthosteric pocket¹⁴⁹. Nevertheless, FEP+ calculations were shown to be accurate, both retrospectively and prospectively, when applied to experimental structures of several Class A GPCRs: adenosine A_2A receptor, β₁-adrenoceptor, CXCR4, δ-opioid receptor, and OX₂ orexin receptor^150,151. Furthermore, proof-of-concept studies showed that accurate FEP+ predictions can also be obtained using homology models refined by IFD-MD or MD simulation: for the adenosine A_2A receptor, the FEP+ performance on such a model was slightly degraded compared to that on experimental structure, but still predictive (MUE = 1.50 vs 0.81 kcal/mol, R² of 0.47 vs 0.39)¹⁵², while for the mosquito neuropeptide Y-like receptor 7 (NPYLR7), FEP+ calculations showed retrospective MUE of 1.29 kcal/mol, R² of 0.57, and allowed to prospectively improve the functional efficacy of the lead compound¹⁵³. Similarly, Xu et al. demonstrated the applicability of FEP+ to IFD-MD-generated models for 14 diverse protein targets (although no GPCRs) (Fig. 8b)¹⁵⁴. Recently, Coskun et al. extended the proof-of-concept to IFD-MD-refined AF2 models, for the somatostatin SST₄ (SSTR4)⁶¹. On a set of 64 congeneric ligands with a wide range (>3 log units) of activities, their final receptor-ligand complex model showed high accuracy (pairwise RMSE of 1.00 kcal/mol, R² of 0.54). These successes indicate that FEP+ possesses a degree of tolerance to structural inaccuracies, likely via its sampling of the target and ligand conformational space by MD, which makes it applicable to computational models built by both conventional homology and AI-based methods.

An important lesson of the study by Xu et al. is that computationally generated models with relatively large geometric deviations from experimental structures can still be predictive; for example, one model with ligand RMSD of 3.1 Å relative to the experimental structure achieved FEP + RMSE of 1.08 kcal/mol and R² of 0.63¹⁵⁴. Moreover, for 9 out of the 18 models, the accuracy of FEP+ predictions was either similar or better than for the corresponding experimental structures. That said, for models with ligand RMSD > 2.5Å, accuracy, as measured by RMSE or by R², was generally degraded compared to experimental structures, whereas for models with ligand RMSD < 2.5 Å, it was either improved or reduced, without any obvious trend (Fig. 8b). The authors also emphasized the risk of using insufficient or inadequate retrospective activity datasets in model validation by RB-FEP, whereby the use of sparse, unevenly distributed, or narrow-range activity data may lead to a misleadingly high retrospective RB-FEP accuracy even for models that are too geometrically incorrect to allow prospective success¹⁵⁴.

In addition to RB-FEP, absolute binding FEP (AB-FEP) can be used to assess the models for prospective use in ligand optimization¹⁵⁵. In contrast to RB-FEP where only the modified region between two ligands is sampled alchemically, AB-FEP alchemically couples/decouples a single ligand in its entirety, simulating its complete apparition/disparition in the binding pocket and the solvent. AB-FEP therefore, does not require a congeneric compound series or an activity dataset. When starting from an accurate model, AB-FEP is expected to closely predict the experimental binding energy of the ligand, or possibly generate a more negative number, with the difference interpreted as protein reorganization energy between the apo and the ligand-bound state¹⁵⁵. Models with AB-FEP prediction significantly less favorable than the experimental binding affinity of the compound are likely incorrect. Therefore, when more than one model with similarly high RB-FEP performance is available, AB-FEP can help prioritize models for prospective applications⁶¹.

To further investigate the relationship between complex geometry and FEP predictive accuracy for experimental structures, homology models, and AF2 models of GPCRs, we applied both RB-FEP and AB-FEP (FEP + ¹³⁵) to 21 GPCR Dock 2010 models of eticlopride-bound D₃ receptor and 11 GPCR Dock 2021 models of JNJ-63533054-bound GPR139, along with two X-ray (for D₃) and six cryo-EM (for GPR139) structures of the respective complexes (Figs. 3d, e and 6a, c, e). As described previously, these two sets of models were selected to span a large range of geometric ‘correctness’ relative to the known experimental structures. For D₃ receptor, relative binding free energies were calculated for a series of 11 eticlopride analogs with potencies ranging from 0.1 nM to 426 nM^108,156, whereas for GPR139, the study involved 21 Janssen glycine benzamides (same series as JNJ-63533054) with EC₅₀ values from 24 nM to 1700 nM¹⁵⁷.

The results for the two receptors and their respective sets of models were in sharp contrast. For the D₃-eticlopride set, the two X-ray structures (chains A and B of PDB 3pbl) showed good retrospective accuracy in RB-FEP (RMSE = 1.3 kcal/mol and R² = 0.66, Fig. 8c), but the accuracy degraded rapidly and significantly in the models (RMSE ≥ 2.4 kcal/mol and R² < 0.52), even those with relatively close positioning of the ligand and globally similar pocket conformations (Fig. 8c). The most geometrically accurate of the tested models (model 5084-3: ligand RMSD of 1.2 Å and receptor pocket RMSD of only 1.6 Å) showed RB-FEP RMSE as high as 2.8 kcal/mol and an R² of only 0.09 (Fig. 8c). This illustrates that geometric accuracy is not directly correlated with RB-FEP performance, and that small structural details can significantly impact FEP accuracy, hence justifying the need for additional refinement to improve performance as in the SSTR4 study⁶¹.

In contrast to the D₃-eticlopride models, all eleven computationally generated models of the GPR139 - JNJ-63533054 complex yielded reasonably accurate retrospective RB-FEP predictions that were similar to the six cryo-EM structures, as measured by RMSE (1.18–1.42 kcal/mol, Fig. 8d). This was in spite of a wide range of geometric correctness of the models (ligand RMSD of 2–9.5 Å and receptor pocket RMSD of 2–6.9 Å compared to the most predictive cryo-EM structure, PDB 7vuj). Notably, four models with poses completely different from the cryo-EM structures (ligand RMSD > 8 Å) still showed relatively accurate RB-FEP predictions, likely exemplifying false positive models and illustrating the potential risks of RB-FEP retrospective assessment. Consistent with the cautionary note from Xu et al.¹⁵⁴, this outcome could have been anticipated given the narrow activity range and sparseness of the activity dataset used (EC₅₀ between 24 nM and 440 nM, only 1.3 log units, for 18 out of the 21 ligands, Fig. 8f and Supplemental Data 2). This narrow and sparse activity distribution, and the uncertainty of the retrospective RB-FEP validation of the models, were also reflected by the poor correlation coefficients observed for all models (R² from 0 to 0.17 for all complex structures assessed, including the cryo-EM structures). By contrast, the retrospective dataset for D₃-eticlopride had better distributed activities covering a broader range (EC₅₀ from 0.1 nM to 440 nM, or 3.6 log units, Fig. 8e and Supplemental Data 1). AB-FEP calculations showed that two of the GPR139 RB-FEP false positive models had very poor predicted ΔG_binding for JNJ-63533054 (−6.3 and −6.6 kcal/mol compared to −13.2 kcal/mol for the best cryo-EM structure, Fig. 8h), supporting the complementary role of AB-FEP in validating receptor-ligand complex models. However, for the two remaining geometrically incorrect models, AB-FEP-predicted ΔG_binding values were close to the experimentally measured potency and only slightly weaker than ΔG_binding for the cryo-EM structures. In such cases, recognizing the limitation of the available ligand activity dataset should invite caution and encourage gathering of more data for more reliable validation.

In summary, FEP can achieve accurate predictions and guide potency optimization within ligand series not only for high-resolution experimental structures but also for some computationally predicted models of GPCRs. That said, there are inherent challenges and caveats that we emphasized in this section. A notable limitation is the amount of SAR known for the series of interest, as illustrated with the GPR139 study above. However, even in cases where the available SAR is insufficient to identify a single model for prospective use, it can help prioritize several models that can further be filtered as more SAR data is accumulated. In Fig. 9 and Table 1, we have gathered best practices and lessons learned to address the above challenges, and to generate and validate predictive models for FEP, both from the literature and from our experience in real-world drug discovery projects (Table 1, Fig. 9). We share a general workflow that uses an ensemble and trial-and-error approach for model selection and validation, to account for the unpredictable effects of small structural details on FEP accuracy.

**Fig. 9: Best practices for the generation and validation of an FEP-enabling model for lead optimization.**

Table 1 Summary of lessons learned and corresponding best practices, for the generation and validation of GPCR models for the purpose of prospective use with FEP

Full size table

Conclusion

The AI revolution opened new avenues for structure-based drug discovery for GPCRs by dramatically increasing the availability of high-quality structural models. The AI-based co-folding methods also have a potential to solve the decades-long challenge in modeling the induced-fit effect and to deliver accurate structural models for receptor complexes with small molecules and peptides–a critical prerequisite for most drug discovery applications. However, these models still show a wide range of geometric accuracy and physical validity. On the other hand, advances in physics-based computational chemistry, including new induced-fit docking methodologies and highly accurate FEP implementations, have led to notable successes in application to GPCRs, and AI-generated structural models have considerably extended the domain of applicability of these methods. Importantly, the geometric accuracy of computational models is not always correlated with their performance in hit discovery and ligand optimization applications. Predictive models can be identified via retrospective assessments in respective applications using adequate benchmarks. Beyond structure prediction, scoring functions and VLS acceleration by score extrapolation exemplify current and future areas of method development at the interface between AI and physics, poised for application in GPCR drug discovery.

Data availability

No datasets were generated in this study. The sets of receptor structural models analyzed in Figs. 2, 3, 4, 5, 6, 8 originate from studies cited in respective figure legends and are published as part of those studies. The sets of chemical compounds used for the analyses in Figs. 6 and 8 originate from studies cited in the text as well as the ChEMBL database (release ChEMBL34, March 2024); the final compound sets for D3 receptor and GPR139 are provided as Supplemental Data 1 and 2.

Abbreviations

GPCR:: G protein-coupled receptors
AI:: artificial intelligence
SBDD:: structure-based drug discovery/design
PDB:: protein data bank
TM:: transmembrane
AF2:: AlphaFold2
AF3:: AlphaFold3
pLDDT:: predicted local distance difference test
RMSD:: root mean square deviation
ECL:: extracellular loop
ICL:: intracellular loop
SAR:: structure-activity relationship
MD:: molecular dynamics
IFD:: induced-fit docking
VLS:: virtual ligand/library screening
TD:: Tanimoto distance
ROC:: receiver-operating characteristic curve
AUC:: area under the curve
DUDe:: database of useful decoys – enhanced
NN:: neural net
PK:: pharmacokinetics
cryo-EM:: cryogenic electron microscopy
H2L:: hit-to-lead optimization
LO:: lead optimization
FEP:: free-energy perturbation
RB-FEP:: relative binding FEP
MUE:: mean unsigned error
RMSE:: root mean square error
AB-FEP:: absolute binding FEP.

References

Oprea, T. I. et al. Unexplored therapeutic opportunities in the human genome. Nat. Rev. Drug Discov. 17, 317–332 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hauser, A. S., Attwood, M. M., Rask-Andersen, M., Schioth, H. B. & Gloriam, D. E. Trends in GPCR drug discovery: new agents, targets and indications. Nat. Rev. Drug Discov. 16, 829–842 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lorente, J. S. et al. GPCR drug discovery: new agents, targets and indications. Nat. Rev. Drug Discov. 16, 829-842 (2025).
Kuhn, P., Wilson, K., Patch, M. G. & Stevens, R. C. The genesis of high-throughput structure-based drug discovery using protein crystallography. Curr. Opin. Chem. Biol. 6, 704–710 (2002).
Article CAS PubMed Google Scholar
Stevens, R. C. et al. The GPCR Network: a large-scale collaboration to determine human GPCR structure and function. Nat. Rev. Drug Discov. 12, 25–34 (2013).
Article CAS PubMed Google Scholar
Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K. & Moult, J. Critical assessment of methods of protein structure prediction (CASP)-Round XIV. Proteins 89, 1607–1617 (2021).
Article CAS PubMed PubMed Central Google Scholar
NobelPrize.org. Nobel Prize Outreach. They cracked the code for proteins’ amazing structures, <https://www.nobelprize.org/prizes/chemistry/2024/press-release/> (2024).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article CAS PubMed PubMed Central Google Scholar
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021).
Article CAS PubMed PubMed Central Google Scholar
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Article CAS PubMed PubMed Central Google Scholar
Isberg, V. et al. GPCRdb: an information system for G protein-coupled receptors. Nucleic Acids Res. 44, D356–D364 (2016).
Article CAS PubMed Google Scholar
Herrera, L. P. T. et al. GPCRdb in 2025: adding odorant receptors, data mapper, structure similarity search and models of physiological ligand complexes. Nucleic Acids Res. 53, D425-D435 (2024).
He, X. H. et al. AlphaFold2 versus experimental structures: evaluation on G protein-coupled receptors. Acta Pharm. Sin. 44, 1–7 (2023).
Article CAS Google Scholar
Lee, C., Su, B. H. & Tseng, Y. J. Comparative studies of AlphaFold, RoseTTAFold and Modeller: a case study involving the use of G-protein-coupled receptors. Brief Bioinform. 23, bbac308 (2022).
Callaway, E. What’s next for AlphaFold and the AI protein-folding revolution. Nature 604, 234–238 (2022).
Article CAS PubMed Google Scholar
Terwilliger, T. C. et al. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat. Methods 21, 110–116 (2024).
Article CAS PubMed Google Scholar
Borkakoti, N. & Thornton, J. M. AlphaFold2 protein structure prediction: Implications for drug discovery. Curr. Opin. Struct. Biol. 78, 102526 (2023).
Article CAS PubMed PubMed Central Google Scholar
Guo, H. B. et al. AlphaFold2 models indicate that protein sequence determines both structure and dynamics. Sci. Rep. 12, 10696 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pinheiro, I. D. M. et al. Noncanonical roles of chemokine regions in CCR9 activation revealed by structural modeling and mutational mapping. bioRxiv https://doi.org/10.1101/2024.06.04.596985 (2024).
Heo, L. & Feig, M. Multi-state modeling of G-protein coupled receptors at experimental accuracy. Proteins 90, 1873–1885 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pandy-Szekeres, G. et al. GPCRdb in 2023: state-specific structure models using AlphaFold2 and new ligand resources. Nucleic Acids Res. 51, D395–D402 (2023).
Article CAS PubMed Google Scholar
Wayment-Steele, H. K. et al. Predicting multiple conformations via sequence clustering and AlphaFold2. Nature 625, 832–839 (2024).
Article CAS PubMed Google Scholar
Del Alamo, D., Sala, D., McHaourab, H. S. & Meiler, J. Sampling alternative conformational states of transporters and receptors with AlphaFold2. Elife 11, e75751 (2022).
Sala, D., Hildebrand, P. W. & Meiler, J. Biasing AlphaFold2 to predict GPCRs and kinases with user-defined functional or structural properties. Front. Mol. Biosci. 10, 1121962 (2023).
Article CAS PubMed PubMed Central Google Scholar
Bryant, P. & Noe, F. Structure prediction of alternative protein conformations. Nat. Commun. 15, 7328 (2024).
Article CAS PubMed PubMed Central Google Scholar
Rustamov, K. R. & Baev, A. Y. MSA clustering enhances AF-Multimer’s ability to predict conformational landscapes of protein-protein interactions. Bioinform. Adv. 5, vbae197 (2025).
Article PubMed Google Scholar
Ahdritz, G. et al. OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization. Nat. Methods 21, 1514–1524 (2024).
Article CAS PubMed PubMed Central Google Scholar
Raouraoua, N. et al. MassiveFold: unveiling AlphaFold’s hidden potential with optimized and parallelized massive sampling. Nat. Comput. Sci. 4, 824–828 (2024).
Article PubMed PubMed Central Google Scholar
Lewis, S. et al. Scalable emulation of protein equilibrium ensembles with generative deep learning. bioRxiv https://doi.org/10.1101/2024.12.05.626885 (2024).
Kufareva, I. et al. Status of GPCR modeling and docking as reflected by community-wide GPCR Dock 2010 assessment. Structure 19, 1108–1126 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kufareva, I., Katritch, V., Participants of, G. D., Stevens, R. C. & Abagyan, R. Advances in GPCR modeling evaluated by the GPCR Dock 2013 assessment: meeting new challenges. Structure 22, 1120–1139 (2014).
Article CAS PubMed PubMed Central Google Scholar
Karelina, M., Noh, J. J. & Dror, R. O. How accurately can one predict drug binding modes using AlphaFold models? Elife 12 (2023).
Lee, S. et al. Evaluating GPCR modeling and docking strategies in the era of deep learning-based protein structure prediction. Comput. Struct. Biotechnol. J. 21, 158–167 (2023).
Article CAS PubMed Google Scholar
Michino, M. et al. Community-wide assessment of GPCR structure modelling and ligand docking: GPCR Dock 2008. Nat. Rev. Drug Discov. 8, 455–463 (2009).
Article CAS PubMed PubMed Central Google Scholar
Chitsazi, R. et al. The 4th GPCR Dock: assessment of blind predictions for GPCR-ligand complexes in the era of AlphaFold. bioRxiv https://doi.org/10.1101/2025.04.18.647407 (2025).
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. bioRxiv https://doi.org/10.1101/2021.10.04.463034 (2022).
Nakata, S., Mori, Y. & Tanaka, S. End-to-end protein-ligand complex structure generation with diffusion-based generative models. BMC Bioinforma. 24, 233 (2023).
Article CAS Google Scholar
Qiao, Z., Nie, W., Vahdat, A., Miller, T. F. & Anandkumar, A. State-specific protein–ligand complex structure prediction with a multiscale deep generative model. Nat. Mach. Intell. 6, 195–208 (2024).
Article Google Scholar
Krishna, R. et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 384, eadl2528 (2024).
Article CAS PubMed Google Scholar
Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).
Article CAS PubMed PubMed Central Google Scholar
Editorial. AlphaFold3 - why did Nature publish it without its code? Nature https://doi.org/10.1038/d41586-024-01463-0 (2024).
Callaway, E. AI protein-prediction tool AlphaFold3 is now more open. Nature https://doi.org/10.1038/d41586-024-03708-4 (2024).
Article PubMed PubMed Central Google Scholar
Google DeepMind. AlphaFold3, <https://github.com/google-deepmind/alphafold3> (2024).
Google DeepMind. AlphaFold3 License, <https://github.com/google-deepmind/alphafold3?tab=License-1-ov-file> (2024).
Callaway, E. Who will make AlphaFold3 open source? Scientists race to crack AI model. Nature https://doi.org/10.1038/d41586-024-01555-x (2024).
Boitreaud, J. et al. Chai-1: Decoding the molecular interactions of life. bioRxiv https://doi.org/10.1101/2024.10.10.615955 (2024).
Iambic Therapeutics. Transforming computational drug discovery with NeuralPLexer2, <https://www.iambic.ai/post/np2> (2024).
Ligo Biosciences. Open source implementation of AlphaFold3, <https://github.com/Ligo-Biosciences/AlphaFold3>.
Liu, L. et al. Technical Report of HelixFold3 for Biomolecular Structure Prediction. arXiv https://doi.org/10.48550/arXiv.2408.16975 (2024).
ByteDance AML AI4Science Team. A trainable PyTorch reproduction of AlphaFold 3 https://github.com/bytedance/Protenix/blob/main/Protenix_Technical_Report.pdf (2024).
Bryant, P., Kelkar, A., Guljas, A., Clementi, C. & Noe, F. Structure prediction of protein-ligand complexes from sequence information with Umol. Nat. Commun. 15, 4536 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wohlwend, J. et al. Boltz-1 democratizing biomolecular interaction modeling. bioRxiv https://doi.org/10.1101/2024.11.19.624167 (2024).
Buttenschoen, M., Morris, G. M. & Deane, C. M. PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chem. Sci. 15, 3130–3139 (2024).
Article CAS PubMed Google Scholar
Iambic Therapeutics. Previewing NeuralPLexer3: Towards fully AI-enabled Structure-Based Drug Discovery, <https://www.iambic.ai/post/np3-preview> (2024).
Masters, M. R., Mahmoud, A. H. & Lill, M. A. Do Deep Learning Models for Co-Folding Learn the Physics of Protein-Ligand Interactions? bioRxiv https://doi.org/10.1101/2024.06.03.597219 (2024).
Raush, E., Abagyan, R. & Totrov, M. Efficient generation of conformer ensembles using internal coordinates and a generative directional graph convolution neural network. J. Chem. Theory Comput. 20, 4054–4063 (2024).
Article CAS PubMed Google Scholar
He, X. H., Li, J. R., Shen, S. Y. & Xu, H. E. AlphaFold3 versus experimental structures: assessment of the accuracy in ligand-bound G protein-coupled receptors. Acta Pharmacol. Sin. 46, 1111-1122 (2024).
Totrov, M. & Abagyan, R. Flexible ligand docking to multiple receptor conformations: a practical alternative. Curr. Opin. Struct. Biol. 18, 178–184 (2008).
Article CAS PubMed PubMed Central Google Scholar
Amaro, R. E. et al. Ensemble docking in drug discovery. Biophys. J. 114, 2271–2278 (2018).
Article CAS PubMed PubMed Central Google Scholar
Miller, E. B. et al. Reliable and accurate solution to the induced fit docking problem for protein-ligand binding. J. Chem. Theory Comput. 17, 2630–2639 (2021).
Article CAS PubMed Google Scholar
Coskun, D. et al. Using AlphaFold and experimental structures for the prediction of the structure and binding affinities of GPCR complexes via induced fit docking and free energy perturbation. J. Chem. Theory Comput. 20, 477–489 (2024).
Article CAS PubMed Google Scholar
Wang, Z. et al. A fully differentiable ligand pose optimization framework guided by deep learning and a traditional scoring function. Brief. Bioinform. 24, bbac520 (2023).
Article PubMed Google Scholar
de Oliveira, T. M., van Beek, L., Shilliday, F., Debreczeni, J. E. & Phillips, C. Cryo-EM: the resolution revolution and drug discovery. SLAS Discov. 26, 17–31 (2021).
Article PubMed Google Scholar
Zhou, Y. et al. Molecular insights into ligand recognition and G protein coupling of the neuromodulatory orphan receptor GPR139. Cell Res. 32, 210–213 (2022).
Article CAS PubMed Google Scholar
You, C. et al. Structural insights into the peptide selectivity and activation of human neuromedin U receptors. Nat. Commun. 13, 2045 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhao, W. et al. Ligand recognition and activation of neuromedin U receptor 2. Nat. Commun. 13, 7955 (2022).
Article CAS PubMed PubMed Central Google Scholar
Arroyo-Urea, S. et al. A bitopic agonist bound to the dopamine 3 receptor reveals a selectivity site. Nat. Commun. 15, 7759 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Structures of the entire human opioid receptor family. Cell 186, 413–427 e417 (2023).
Article CAS PubMed Google Scholar
Chen, Y., Chen, B., Wu, T., Zhou, F. & Xu, F. Cryo-EM structure of human kappa-opioid receptor-Gi complex bound to an endogenous agonist dynorphin A. Protein Cell 14, 464–468 (2023).
CAS PubMed Google Scholar
Lopez-Balastegui, M. et al. Relevance of G protein-coupled receptor (GPCR) dynamics for receptor activation, signalling bias and allosteric modulation. Br. J. Pharmacol. (2024).
Irwin, J. J. & Shoichet, B. K. Docking screens for novel ligands conferring new biology. J. Med. Chem. 59, 4103–4120 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ballante, F., Kooistra, A. J., Kampen, S., de Graaf, C. & Carlsson, J. Structure-based virtual screening for ligands of G protein-coupled receptors: What can molecular docking do for you?. Pharm. Rev. 73, 527–565 (2021).
Article PubMed Google Scholar
Carlsson, J. & Luttens, A. Structure-based virtual screening of vast chemical space as a starting point for drug discovery. Curr. Opin. Struct. Biol. 87, 102829 (2024).
Article CAS PubMed Google Scholar
Liu, F. et al. Large library docking identifies positive allosteric modulators of the calcium-sensing receptor. Science 385, eado1868 (2024).
Article CAS PubMed Google Scholar
Patel, N. et al. Structure-based discovery of potent and selective melatonin receptor agonists. Elife 9, e53779 (2020).
Zheng, Z. et al. Structure-based discovery of new antagonist and biased agonist chemotypes for the kappa opioid receptor. J. Med. Chem. 60, 3070–3081 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lane, J. R. et al. Structure-based ligand discovery targeting orthosteric and allosteric pockets of dopamine receptors. Mol. Pharm. 84, 794–807 (2013).
Article CAS Google Scholar
Sadybekov, A. A. et al. Synthon-based ligand discovery in virtual libraries of over 11 billion compounds. Nature 601, 452–459 (2022).
Article CAS PubMed Google Scholar
Kaplan, A. L. et al. Bespoke library docking for 5-HT2A receptor agonists with antidepressant activity. Nature 610, 582–591 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bender, B. J. et al. Structure-based discovery of a NPFF1R antagonist with analgesic activity. bioRxiv https://doi.org/10.1101/2023.10.25.564029 (2023).
Smith, S. T. et al. Discovery of protease-activated receptor 4 (PAR4)-tethered ligand antagonists using ultralarge virtual screening. ACS Pharm. Transl. Sci. 7, 1086–1100 (2024).
Article CAS Google Scholar
Katritch, V., Rueda, M., Lam, P. C., Yeager, M. & Abagyan, R. GPCR 3D homology models for ligand screening: lessons learned from blind predictions of adenosine A2a receptor complex. Proteins 78, 197–211 (2010).
Article CAS PubMed PubMed Central Google Scholar
Katritch, V., Rueda, M. & Abagyan, R. Ligand-guided receptor optimization. Methods Mol. Biol. 857, 189–205 (2012).
Article CAS PubMed Google Scholar
Bender, B. J. et al. A practical guide to large-scale docking. Nat. Protoc. 16, 4799–4832 (2021).
Article CAS PubMed PubMed Central Google Scholar
Sadybekov, A. V. & Katritch, V. Computational approaches streamlining drug discovery. Nature 616, 673–685 (2023).
Article CAS PubMed Google Scholar
Liu, F. et al. Small vs. large library docking for positive allosteric modulators of the calcium sensing receptor. bioRxiv https://doi.org/10.1101/2023.12.27.573448 (2024).
Mysinger, M. M., Carchia, M., Irwin, J. J. & Shoichet, B. K. Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking. J. Med. Chem. 55, 6582–6594 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dawson, J. R. D. et al. Molecular determinants of antagonist interactions with chemokine receptors CCR2 and CCR5. bioRxiv https://doi.org/10.1101/2023.11.15.567150 (2023).
Lyu, J. et al. AlphaFold2 structures guide prospective ligand discovery. Science 384, eadn6354 (2024).
Article CAS PubMed PubMed Central Google Scholar
Diaz-Rovira, A. M. et al. Are deep learning structural models sufficiently accurate for virtual screening? Application of docking algorithms to AlphaFold2 predicted structures. J. Chem. Inf. Model 63, 1668–1674 (2023).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Benchmarking refined and unrefined AlphaFold2 structures for hit discovery. J. Chem. Inf. Model 63, 1656–1667 (2023).
Article CAS PubMed Google Scholar
Diaz-Holguin, A. et al. AlphaFold accelerated discovery of psychotropic agonists targeting the trace amine-associated receptor 1. Sci. Adv. 10, eadn1524 (2024).
Article CAS PubMed PubMed Central Google Scholar
Su, M. et al. Comparative assessment of scoring functions: the CASF-2016 update. J. Chem. Inf. Model 59, 895–913 (2019).
Article CAS PubMed Google Scholar
Neves, M. A., Totrov, M. & Abagyan, R. Docking and scoring with ICM: the benchmarking results and strategies for improvement. J. Comput Aided Mol. Des. 26, 675–686 (2012).
Article CAS PubMed PubMed Central Google Scholar
Friesner, R. A. et al. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 47, 1739–1749 (2004).
Article CAS PubMed Google Scholar
Halgren, T. A. et al. Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. J. Med. Chem. 47, 1750–1759 (2004).
Article CAS PubMed Google Scholar
Coleman, R. G., Carchia, M., Sterling, T., Irwin, J. J. & Shoichet, B. K. Ligand pose and orientational sampling in molecular docking. PLoS One 8, e75992 (2013).
Article CAS PubMed PubMed Central Google Scholar
Jones, G., Willett, P., Glen, R. C., Leach, A. R. & Taylor, R. Development and validation of a genetic algorithm for flexible docking. J. Mol. Biol. 267, 727–748 (1997).
Article CAS PubMed Google Scholar
Park, H., Zhou, G., Baek, M., Baker, D. & DiMaio, F. Force Field Optimization Guided by Small Molecule Crystal Lattice Data Enables Consistent Sub-Angstrom Protein-Ligand Docking. J. Chem. Theory Comput. 17, 2000–2010 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cao, D. et al. EquiScore: a generic protein-ligand interaction scoring method integrating physical prior knowledge with data augmentation modeling. bioRxiv https://doi.org/10.1101/2023.06.18.545464 (2023).
Stafford, K. A., Anderson, B. M., Sorenson, J. & van den Bedem, H. AtomNet PoseRanker: enriching ligand pose quality for dynamic proteins in virtual high-throughput screens. J. Chem. Inf. Model 62, 1178–1189 (2022).
Article CAS PubMed PubMed Central Google Scholar
Atomwise, A. P. AI is a viable alternative to high throughput screening: a 318-target study. Sci. Rep. 14, 7526 (2024).
Article Google Scholar
Shen, C. et al. Boosting protein-ligand binding pose prediction and virtual screening based on residue-atom distance likelihood potential and graph transformer. J. Med Chem. 65, 10691–10706 (2022).
Article CAS PubMed Google Scholar
Raush, E. & Totrov, M. RTCNN Performance (CASF 2016 pose rank benchmark). Molsoft ICM User Group Meeting https://doi.org/10.6084/m9.figshare.24309496.v1 (2023).
Totrov, M. New developments in ICM: neural networks and beyond in Molsoft ICM User Group Meeting. (San Diego, CA, 2023).
Wang, Z. et al. A new paradigm for applying deep learning to protein-ligand interaction prediction. Brief. Bioinform. 25, bbae145 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhou, G. et al. An artificial intelligence accelerated virtual screening platform for drug discovery. Nat. Commun. 15, 7761 (2024).
Article CAS PubMed PubMed Central Google Scholar
Chien, E. Y. et al. Structure of the human dopamine D3 receptor in complex with a D2/D3 selective antagonist. Science 330, 1091–1095 (2010).
Article CAS PubMed PubMed Central Google Scholar
Lyu, J. et al. Ultra-large library docking for discovering new chemotypes. Nature 566, 224–229 (2019).
Article CAS PubMed PubMed Central Google Scholar
Graff, D. E., Shakhnovich, E. I. & Coley, C. W. Accelerating high-throughput virtual screening through molecular pool-based active learning. Chem. Sci. 12, 7866–7881 (2021).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. Efficient exploration of chemical space with docking and deep learning. J. Chem. Theory Comput. 17, 7106–7119 (2021).
Article CAS PubMed Google Scholar
Gentile, F. et al. Deep docking: a deep learning platform for augmentation of structure based drug discovery. ACS Cent. Sci. 6, 939–949 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gentile, F. et al. Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking. Nat. Protoc. 17, 672–697 (2022).
Article CAS PubMed Google Scholar
Raush, E. Highlights of recent ICM developments: GPU acceleration, its applications and more in Molsoft ICM User Group Meeting. (San Diego, CA, 2023).
Luttens, A. et al. Rapid traversal of vast chemical space using machine learning-guided docking screens. Nat. Comput. Sci. 5, 301–312 (2025).
Grechishnikova, D. Transformer neural network for protein-specific de novo drug generation as a machine translation problem. Sci. Rep. 11, 321 (2021).
Article CAS PubMed PubMed Central Google Scholar
Qian, H., Lin, C., Zhao, D., Tu, S. & Xu, L. AlphaDrug: protein target specific de novo molecular generation. PNAS Nexus 1, pgac227 (2022).
Article PubMed PubMed Central Google Scholar
Bernatavicius, A. et al. AlphaFold meets de novo drug design: leveraging structural protein information in multitarget molecular generative models. J. Chem. Inf. Model 64, 8113–8122 (2024).
Article CAS PubMed PubMed Central Google Scholar
Atz, K. et al. Prospective de novo drug design with deep interactome learning. Nat. Commun. 15, 3408 (2024).
Article CAS PubMed PubMed Central Google Scholar
Schneuing, A. et al. Structure-based drug design with equivariant diffusion models. Nat. Comput. Sci. 4, 899–909 (2024).
Peng, X. et al. Pocket2Mol: efficient molecular sampling based on 3D protein pockets. arXiv https://doi.org/10.48550/arXiv.2205.07249 (2022).
Zhang, O. et al. ResGen is a pocket-aware 3D molecular generation model based on parallel multiscale modelling. Nat. Mach. Intell. 5, 1020–1030 (2023).
Article Google Scholar
Jiang, Y. et al. PocketFlow is a data-and-knowledge-driven structure-based molecular generative model. Nat. Mach. Intell. 6, 326–337 (2024).
Article Google Scholar
Zhung, W., Kim, H. & Kim, W. Y. 3D molecular generative framework for interaction-guided drug design. Nat. Commun. 15, 2688 (2024).
Article CAS PubMed PubMed Central Google Scholar
Powers, A. S. et al. Geometric deep learning for structure-based ligand design. ACS Cent. Sci. 9, 2257–2267 (2023).
Article CAS PubMed PubMed Central Google Scholar
Thomas, M., Smith, R. T., O’Boyle, N. M., de Graaf, C. & Bender, A. Comparison of structure- and ligand-based scoring functions for deep generative models: a GPCR case study. J. Cheminform. 13, 39 (2021).
Article CAS PubMed PubMed Central Google Scholar
Grisoni, F. et al. Combining generative artificial intelligence and on-chip synthesis for de novo drug design. Sci. Adv. 7, eabg3338 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ivanenkov, Y. et al. The Hitchhiker’s guide to deep learning driven generative chemistry. ACS Med. Chem. Lett. 14, 901–915 (2023).
Article CAS PubMed PubMed Central Google Scholar
Cournia, Z., Allen, B. & Sherman, W. Relative binding free energy calculations in drug discovery: recent advances and practical considerations. J. Chem. Inf. Model 57, 2911–2937 (2017).
Article CAS PubMed Google Scholar
Straatsma, T. & McCammon, J. Computational alchemy. Annu. Rev. Phys. Chem. 43, 407–435 (1992).
Article CAS Google Scholar
York, D. M. Modern alchemical free energy methods for drug discovery explained. ACS Phys. Chem. Au 3, 478–491 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gapsys, V. et al. Large scale relative protein ligand binding affinities using non-equilibrium alchemy. Chem. Sci. 11, 1140–1152 (2019).
Article PubMed PubMed Central Google Scholar
Abraham, M. J. et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1-2, 19–25 (2015).
Article Google Scholar
Wang, K., Chodera, J. D., Yang, Y. & Shirts, M. R. Identifying ligand binding sites and poses using GPU-accelerated Hamiltonian replica exchange molecular dynamics. J. Comput. Aided Mol. Des. 27, 989–1007 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. et al. Accurate and reliable prediction of relative ligand binding potency in prospective drug discovery by way of a modern free-energy calculation protocol and force field. J. Am. Chem. Soc. 137, 2695–2703 (2015).
Article CAS PubMed Google Scholar
Abel, R., Wang, L., Harder, E. D., Berne, B. J. & Friesner, R. A. Advancing drug discovery through enhanced free energy calculations. Acc. Chem. Res. 50, 1625–1632 (2017).
Article CAS PubMed Google Scholar
Ross, G. A. et al. The maximal and current accuracy of rigorous protein-ligand binding free energy calculations. Commun. Chem. 6, 222 (2023).
Article PubMed PubMed Central Google Scholar
Kuhn, B. et al. Prospective evaluation of free energy calculations for the prioritization of cathepsin L inhibitors. J. Med Chem. 60, 2485–2497 (2017).
Article CAS PubMed Google Scholar
Schindler, C. E. M. et al. Large-scale assessment of binding free energy calculations in active drug discovery projects. J. Chem. Inf. Model 60, 5457–5474 (2020).
Article CAS PubMed Google Scholar
Wang, D. D., Wu, W. & Wang, R. Structure-based, deep-learning models for protein-ligand binding affinity prediction. J. Cheminform. 16, 2 (2024).
Article PubMed PubMed Central Google Scholar
Liu, X. et al. Binding affinity prediction: from conventional to machine learning-based approaches. arXiv https://doi.org/10.48550/arXiv.2410.00709 (2024).
Zhang, Y., Li, S., Meng, K. & Sun, S. Machine Learning for Sequence and Structure-Based Protein-Ligand Interaction Prediction. J. Chem. Inf. Model 64, 1456–1472 (2024).
Article CAS PubMed Google Scholar
Karlov, D. S., Sosnin, S., Fedorov, M. V. & Popov, P. graphDelta: MPNN scoring function for the affinity prediction of protein-ligand complexes. ACS Omega 5, 5150–5159 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mqawass, G. & Popov, P. graphLambda: fusion graph neural networks for binding affinity prediction. J. Chem. Inf. Model 64, 2323–2330 (2024).
Article CAS PubMed Google Scholar
Jones, D. et al. Improved protein-ligand binding affinity prediction with structure-based deep fusion inference. J. Chem. Inf. Model 61, 1583–1592 (2021).
Article CAS PubMed Google Scholar
McNutt, A. T. & Koes, D. R. Improving DeltaDeltaG predictions with a multitask convolutional siamese network. J. Chem. Inf. Model 62, 1819–1829 (2022).
Article CAS PubMed PubMed Central Google Scholar
Mohamed Abdul Cader, J., Newton, M. A. H., Rahman, J., Mohamed Abdul Cader, A. J. & Sattar, A. Ensembling methods for protein-ligand binding affinity prediction. Sci. Rep. 14, 24447 (2024).
Article CAS PubMed PubMed Central Google Scholar
Yu, J. & Zheng, M. Efficient prediction of relative ligand binding affinity in drug discovery. Nat. Comput Sci. 3, 829–830 (2023).
Article Google Scholar
Mason, J. S. et al. High end GPCR design: crafted ligand design and druggability analysis using protein structure, lipophilic hotspots and explicit water networks. LID - 23. In Silico Pharmacol. (2013).
Lenselink, E. B. et al. Predicting Binding Affinities for GPCR Ligands Using Free-Energy Perturbation. ACS Omega 1, 293–304 (2016).
Article CAS PubMed PubMed Central Google Scholar
Deflorian, F. et al. Accurate Prediction of GPCR Ligand Binding Affinity with Free Energy Perturbation. J. Chem. Inf. Model 60, 5563–5579 (2020).
Article CAS PubMed Google Scholar
Cappel, D. et al. Relative Binding Free Energy Calculations Applied to Protein Homology Models. J. Chem. Inf. Model 56, 2388–2400 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zeledon, E. V. et al. Next-generation neuropeptide Y receptor small-molecule agonists inhibit mosquito-biting behavior. Parasit. Vectors 17, 276 (2024).
Article CAS PubMed PubMed Central Google Scholar
Xu, T. et al. Induced-fit docking enables accurate free energy perturbation calculations in homology models. J. Chem. Theory Comput 18, 5710–5724 (2022).
Article CAS PubMed Google Scholar
Fajer, M., Borrelli, K., Abel, R. & Wang, L. Quantitatively accounting for protein reorganization in computer-aided drug design. J. Chem. Theory Comput. 19, 3080–3090 (2023).
Article CAS PubMed Google Scholar
Shaik, A. B. et al. Structure activity relationships for a series of eticlopride-based dopamine D(2)/D(3) receptor bitopic ligands. J. Med. Chem. 64, 15313–15333 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dvorak, C. A. et al. Identification and SAR of glycine benzamides as potent agonists for the GPR139 receptor. ACS Med. Chem. Lett. 6, 1015–1018 (2015).
Article CAS PubMed PubMed Central Google Scholar
Varadi, M. et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res 50, D439–D444 (2022).
Article CAS PubMed Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/ (2021).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York. ISBN 978-3-319-24277-4. https://cran.r-project.org/web/packages/ggplot2/citation.html (2016).
Abagyan, R. & Totrov, M. Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. J. Mol. Biol. 235, 983–1002 (1994).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to Drs. Mike Gilson and Adrian Jinich (UC San Diego) for valuable discussions, to Dr. Sofia Endzhievskaya and Mr. Timothy Liu in the Kufareva lab (UC San Diego) for help with software deployment, and to Dr. Ed Miller (Schrödinger) for reading the manuscript and providing valuable feedback. This work was supported by NIH grants R21 AI149369, R21 AI156662, R01 AI161880, and R01 GM136202 (to I.K.). The Sanders Tri-Institutional Therapeutics Discovery Institute (TDI) is a 501(c)(3) organization and receives financial support from its parent institutes (Memorial Sloan Kettering Cancer Center, The Rockefeller University, and Weill Cornell Medicine) and from a generous contribution from Lewis Sanders and other philanthropic sources.

Author information

Mayako Michino
Present address: Manas AI Inc., New York, NY, USA

Authors and Affiliations

Sanders Tri-Institutional Therapeutics Discovery Institute, New York, NY, USA
Mayako Michino
Schrödinger, Inc., New York, NY, USA
Jeremie Vendome
Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Irina Kufareva

Authors

Mayako Michino
View author publications
Search author on:PubMed Google Scholar
Jeremie Vendome
View author publications
Search author on:PubMed Google Scholar
Irina Kufareva
View author publications
Search author on:PubMed Google Scholar

Contributions

M.M., J.V., and I.K. conducted the review of the literature and wrote, revised, and reviewed the main manuscript text. I.K. and J.V. performed data analyses. I.K., M.M., and J.V. prepared Figure 1, I.K. and M.M. prepared Figs. 2, 3, 5, I.K. prepared Figs. 4, 6, 7, and J.V. prepared Figs. 8, 9. J.V. and M.M. prepared Table 1.

Corresponding authors

Correspondence to Mayako Michino or Irina Kufareva.

Ethics declarations

Competing interests

I.K. is an Editorial Board Member for npj Drug Discovery. She was not part of a peer review process or decision-making of the manuscript. The other two authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supp Data 1

Supp Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Michino, M., Vendome, J. & Kufareva, I. AI meets physics in computational structure-based drug discovery for GPCRs. npj Drug Discov. 2, 16 (2025). https://doi.org/10.1038/s44386-025-00019-0

Download citation

Received: 18 December 2024
Accepted: 05 June 2025
Published: 03 July 2025
DOI: https://doi.org/10.1038/s44386-025-00019-0