GōMartini 3: From large conformational changes in proteins to environmental bias corrections

Souza, Paulo C. T.; Borges-Araújo, Luís; Brasnett, Christopher; Moreira, Rodrigo A.; Grünewald, Fabian; Park, Peter; Wang, Liguo; Razmazma, Hafez; Borges-Araújo, Ana C.; Cofas-Vargas, Luis Fernando; Monticelli, Luca; Mera-Adasme, Raúl; Melo, Manuel N.; Wu, Sangwook; Marrink, Siewert J.; Poma, Adolfo B.; Thallmair, Sebastian

doi:10.1038/s41467-025-58719-0

Download PDF

Article
Open access
Published: 30 April 2025

GōMartini 3: From large conformational changes in proteins to environmental bias corrections

Nature Communications volume 16, Article number: 4051 (2025) Cite this article

12k Accesses
44 Citations
38 Altmetric
Metrics details

Subjects

Abstract

Coarse-grained modeling has become an important tool to supplement experimental measurements, allowing access to spatio-temporal scales beyond all-atom based approaches. The GōMartini model combines structure- and physics-based coarse-grained approaches, balancing computational efficiency and accurate representation of protein dynamics with the capabilities of studying proteins in different biological environments. This paper introduces an enhanced GōMartini model, which combines a virtual-site implementation of Gō models with Martini 3. The implementation has been extensively tested by the community since the release of the reparametrized version of Martini. This work demonstrates the capabilities of the model in diverse case studies, ranging from protein-membrane binding to protein-ligand interactions and AFM force profile calculations. The model is also versatile, as it can address recent inaccuracies reported in the Martini protein model. Lastly, the paper discusses the advantages, limitations, and future perspectives of the Martini 3 protein model and its combination with Gō models.

Martini 3: a general purpose force field for coarse-grained molecular dynamics

Article 29 March 2021

Martini3-IDP: improved Martini 3 force field for disordered proteins

Article Open access 24 March 2025

Rescaling protein-protein interactions improves Martini 3 for flexible proteins in solution

Article Open access 05 August 2024

Introduction

The understanding of how proteins fold and perform their functions selectively, efficiently, and modulated by interactions with other biomolecules depends on the knowledge of their structure and dynamics. Despite tremendous progress in the experimental field^1,2, molecular modeling techniques have conquered their own space as an important and complementary set of approaches to study proteins^2,3. In particular, given the limitations in obtaining experimental high-resolution atomistic details from short to long time scales, all-atom (AA) molecular dynamics (MD) simulations have become widely used to study protein dynamics and even folding of simple systems^4,5. Because of the high computational costs associated with AA MD, these are usually limited to studying phenomena occurring on time scales of 1–100 µs (depending on the protein and/or system size)^6,7. Thus, a broad range of important biological phenomena remains out of reach. For instance, these include the long-range motion of protein domains as well as induced-fit mechanisms involving protein-ligand or protein-protein interactions. Systems involving transmembrane or peripheral membrane proteins can be even more challenging, as protein dynamics in a lipid bilayer environment can be slowed down, coupled to membrane fluctuations, and possibly dependent on lipid composition^8,9. AA approaches also struggle with interpreting single-molecule force spectroscopy (SMFS) data, requiring extensive sampling of non-equilibrium pulling processes in nanomechanical studies^10,11,12. Similarly, in the case of disordered proteins or domains, integrating small-angle x-ray scattering (SAXS) with MD data requires the determination of whole ensembles of conformations which can be difficult to obtain with AA methods^13,14. Enhanced sampling methods and GPU parallel computing help to reach longer timescales and better sampling with AA approaches^15,16,17. However, when considering biological length- and timescales, they are still limited to rather local processes.

One attractive alternative to AA protein models is the use of coarse-grained (CG) approaches. CG models are simplified representations which, due to a reduction of explicit degrees of freedom and a smoother interaction landscape, offer a substantial simulation speed-up. As a result, CG models can reach length- and timescales which are orders of magnitude larger than AA models. CG approaches offer a wide range of resolutions and strategies to define their interactions^18,19,20. For instance, structure-based protein models, like Gō-type models^21,22,23, define their interactions based on a known and usually folded structure. The potential energy (U_Gō) in the Gō model for proteins is constructed based on the native structure of the protein as follows, U_Gō = \({\sum }_{i < j}^{{NC}}V({\sigma }_{{ij}},\varepsilon )\), where NC denotes the set of native contacts and \(V({\sigma }_{{ij}},\varepsilon )\) is a Lennard-Jones (LJ) 12-6 potential. \({\sigma }_{{ij}}\) is given for each NC and depends on the distance between specific NC pairs (i.e., \({\sigma }_{{ij}}{=r}_{{ij}}/{2}^{1/6}\)). The \(\varepsilon\) represents the energy scale of the NCs and is usually uniform for all pairs. Gō-like models are a useful tool for modeling conformational changes in proteins such as folding processes, thermal unfolding, or mechanical response^24,25,26. However, environmental effects are usually neglected¹⁸. On the contrary, physics-based models, such as the well-known Martini force field^27,28,29, can be used to model protein dimerization and aggregation as well as interactions with lipid bilayers and other biomolecules^8,9,18,30. With each protein residue being represented by 1–5 beads, Martini still retains chemical specificity, because the beads are parametrized using experimental thermodynamic data such as partitioning free energies of small compounds between polar and apolar environments. While bonded potentials are parametrized and validated using atomistic and experimental data^27,28,29,30, traditional Martini protein models exhibit limitations in accurately representing stably folded proteins, often relying on a harmonic elastic network (EN) to maintain structural stability³¹. Although the dynamic accuracy of ENs can be improved via neural network-based structure predictions³², extensive tests have also shown the combination of Martini with ENs may also contribute to inaccurate protein−protein interactions^31,33. The observed stickiness of proteins^34,35,36 in Martini 2 may also affect their accessible conformational ensemble.

A possible way to keep a good compromise between high computational performance, accurate protein dynamics, and reliable interactions with the environment is the combination of structure-based and physics-based coarse-grained approaches. A recent example for this is the combination of Gō and Martini 2 models, called GōMartini³⁷. Several studies have shown that GōMartini models can be parametrized to reproduce protein flexibility from atomistic benchmark simulations^37,38,39. In addition, GōMartini has also shown great potential to study the nanomechanical stability of proteins^33,40,41. However, it has limitations in reproducing longer-range conformational changes³⁷. In addition, the model inherited parts of the stickiness limitations of Martini 2³³.

Here, we present the virtual-site implementation of an enhanced GōMartini model, which can be combined with the latest iteration of Martini, together with a diverse set of applications and comparisons to EN models and the previous GōMartini 2 model. Moreover, the fully reparameterized Martini 3 model for proteins is presented, pointing to which improvements in the model may enable more accurate predictions of protein packing and protein interactions^42,43. The GōMartini implementation together with Martini 3 has already shown that it can capture subtle changes in protein dynamics caused by interactions with membranes⁴⁴, single point mutations⁴⁵ and mechanostability⁴⁶. We also show how the virtual-site implementation can be used to implement an environmental bias to correct recently described inaccuracies of the model, such as underestimated dimensions of intrinsically disordered proteins (IDPs)^47,48 and low hydrophobicity of certain amphiphilic small peptides⁴⁹. The paper is structured as follows: first, we discuss the Martini 3 protein model, followed by the changes in the enhanced GōMartini 3 model as well as the improved virtual-site implementation facilitating high parallelization. We further demonstrate the power of the GōMartini 3 model using four case studies: (i) binding of a Pleckstrin homology (PH) domain to PI(4,5)P₂-enriched membranes, (ii) binding of benzene to T4 lysozyme, (iii) an allosteric pathway in Cu,Zn superoxide dismutase, and (iv) AFM-SMFS force profile calculations for the case of protein complexes such as antigen:antibody and dockerin:cohesin systems. Next, we give a perspective on how GōMartini 3 models can be further optimized and, moreover, how virtual Gō particles can be used to introduce environmental bias corrections through changes in the interaction with water beads. In the final section, we discuss the advantages, limitations, and future prospects of the approach and the overall development of the Martini 3 protein model.

Results

The Martini 3 Protein Model

The Martini 3 protein model is the natural evolution of the previous Martini 2 iteration²⁹, which now leverages the improvements introduced with the Martini 3 force field. However, it can still be considered as a prototype model, just like the current Martini 3 lipids, since the model has not been fully updated with the current parametrization rules. In particular, the core of the protein model is still based on a single particle backbone (BB), which is placed at the center of mass (and not the center of geometry) of the N, Cα, Cβ, and O atoms of the underlying atomistic backbone, and to which 1-5 side chain (SC) beads may be attached. This is in line with the original implementation of the Martini protein force field²⁸ and differs from the ELNEDIN model³¹, where the BB bead was placed at the position of the Cα atom. To connect consecutive amino acid residues, a harmonic bond or constraint, depending on the secondary structure, is placed between their BB beads. Angle and dihedral potentials are then placed over 3 or 4 consecutive BB beads, respectively, to define the secondary structure-dependent backbone torsion behavior. This set of BB bonded parameters – composed of the bond lengths, angles, dihedral angles, and their respective force constants for each of the secondary structure motifs – was inherited from the original implementation of the Martini protein force field²⁸. These were parameterized from a representative set of ~ 2000 proteins from the protein data bank (PDB), on which the Define Secondary Structure of Proteins (DSSP) algorithm⁵⁰ was used to determine the secondary structure motif associated with each residue.

In the original implementation of the Martini protein force field, the BB particle type depended on the secondary structure motif associated with a respective residue²⁸. When free in solution or in a coil or bend it was represented by a highly polar P5 bead; BB particles in beta strands or turns were represented by intermediately polar Nda beads with hydrogen-bonding capacities; and in helices by intermediately polar N0 beads without hydrogen-bonding capacities, with the C- and N-termini of a helix represented by intermediately polar Na and Nd beads with hydrogen-bond acceptor (Na) and donor (Nd) capacities, respectively. This choice was made to better represent the inter-backbone hydrogen bond character of each residue when present in a specific motif — i.e., the hydrogen bonds established within a helix would reduce the polar character of the amide backbone group²⁸. In contrast, thereto, all backbone beads are represented by polar P2 beads in Martini 3, regardless of the underlying secondary structure motif. The exceptions to this rule are charged terminal backbone beads — which are represented by charged Q5 beads with hard ion features — and GLY, ALA, VAL, and PRO residues. These four amino acids use different bead types to better represent slight differences in chemical group polarity and size. The GLY backbone is mapped as a smaller SP1 bead to represent the loss of the side chain (but keeping similar polarity compared to the default P2 backbone), while the PRO backbone is mapped as a smaller SP2a, due to the lack of hydrogen-donor capabilities. ALA and VAL are mapped as smaller SP2 beads to avoid overmapping issues, which could be caused by their side chain particles being mapped quite close to the backbone³³.

The side chains have been completely revisited for the Martini 3 protein model, following the modified parameterization guidelines established with the Martini 3 release and making use of the larger number of additional bead types and sizes specific for mappings finer than 4-to-1⁴³. The side chain models were parameterized from their backbone-less analogs and calibrated considering their molecular volume, partitioning behavior, solvent properties, and miscibility trends. The mapping and bead assignments of the Martini 3 protein model are shown in Fig. 1. The parameterization of the side chain analogs is described in detail elsewhere^43,51. To illustrate the quality of the current side-chain models, partitioning free energies of their analogs in three different water/oil systems are compared with experimental data in Supplementary Table 1 and Supplementary Fig. 1.

**Fig. 1: Mapping and bead chemical types of the Martini 3 protein model.**

Like in previous Martini protein models, the side chains are connected to the BB bead via harmonic bonds or linear constraints, and their dynamics are controlled by two angles spanning the first SC bead, the BB bead, and the BB bead of the two neighboring residues (-BB BB SC1 and + BB BB SC1). In the Martini 3 protein model, the use of the side chain dihedral corrections (side chain fix – scFix)⁵² becomes standard for protein models with defined secondary and tertiary structures. The scFix restrains the torsional flexibility of the side chains by adding dihedral potentials spanning the first SC and BB beads of consecutive residues (SC1 BB + BB + SC1, SC1 BB + BB + + BB, and SC1 BB -BB --BB) and thus preventing unrealistically high side chain flexibility⁵². The dihedral equilibrium angles are set at the value obtained from the atomistic reference structure used to build the Martini model.

The Martini 3 protein model still requires the use of a tertiary structure bias, such as an EN or Gō-like model, to maintain the native folded structure of proteins. The current EN implementation for Martini 3 applies harmonic bonds between BB beads based on a cutoff criterion and with a single force constant for all bonds. Standard values of 0.85 nm and 700 kJ/(mol nm²) are recommended for the upper distance cutoff and force constant, respectively. The value of the recommended force constant was slightly increased in relation to 500 kJ/(mol nm²) commonly used in Martini 2, as in certain scenarios, this value could be too low, inducing an increased level of stickiness³³. Apart from the distance cutoff, a residue pair must be separated by at least two residues for an EN bond to be applied between them. For instance, residue i can be bound to residue i + 3, which corresponds to a sequence distance of k = 3. While the current EN successfully maintains folded structures, it also prohibits studies which may involve conformational changes or unfolding due to the unbreakable harmonic bonds which are used to build the network.

Martini 3 protein models have been validated against a large variety of systems, including test cases reported in the main publication⁴³ and a series of spin-off studies published in separate works^{42,53,54,55,56}. Examples of performed validations are: aggregation levels of soluble proteins in water and polyleucine helices in lipid bilayers, dimerization-free energies of transmembrane (TM) peptides, binding of ions⁴³ and small molecules to proteins⁴², biomolecular condensates in different ion concentrations⁵⁷, and lipid interactions with transmembrane and peripheral proteins^53,54,55,56.

Enhanced GōMartini Model With Improved Implementation

The improved implementation of the Martini Gō-like model relies on the use of virtual interaction sites, which are constructed using the position of the BB bead as a reference. The virtual interaction sites are solely used to define the interactions within the Gō-like model, which are encoded as Lennard-Jones (LJ) potentials between virtual site pairs. By default, these particles do not interact with any other beads in the system. As an extended feature, the use of virtual sites allows changing specific interactions between proteins (BB particles) and other beads in the Martini interaction table without compromising the integrity of the original force field. For example, we show that a secondary structure-specific water bias can be applied to improve the properties of IDPs.

The major advantage of using virtual interaction sites is that it enables the use of non-bonded cutoffs as implemented in GROMACS. In the original 2017 implementation of the GōMartini model³⁷, the LJ potentials were defined in GROMACS as pair potentials within the protein topology. These pair potentials are treated internally like bonded potentials, and consequently, no cutoff is applied to them. This is not problematic as long as the minimum of the potential is close to or below 1 nm, and the distance of the connected beads stays in the region of this minimum position. However, because the GōMartini model aims to allow for more conformational flexibility — including the dissociation of some of the native contacts — the lack of a cutoff can severely restrict the applicability of the model. One of these restrictions was the incompatibility of the original implementation with increasing parallelization due to the specificities of the domain decomposition implementation in GROMACS. In practice, the parallelization of a simulation with a moderately-sized transmembrane protein, such as the light-harvesting complex II, embedded in a small membrane patch with a system size of ~ 19.700 CG beads³⁸ was restricted to about 10 processors. Our implementation based on virtual interaction sites circumvents this limitation at the minor cost of describing the BB of each amino acid by two CG beads instead of one. Considering the overall number of CG beads present in a typical system, the proportion of BB beads is usually a few percent. Thus, doubling the number of BB beads only slightly increases the total number of CG beads in the system. Note that while oligomeric proteins such as dimers, trimers, etc., can be modeled using the virtual-site implementation presented here, crowded environments such as the cytosol of a cell or oligomers composed of hundreds of monomers cannot be handled.

Besides the improved implementation, the Gō-like model itself was also modified to improve the structure and dynamics of the protein models. In the following, we describe the features adapted in the GōMartini 3 model. While the contact map calculation remains unaltered from the original implementation – defined by residue overlap (OV) and restricted chemical structural units (rCSU) criteria³⁷ – contacts in the contact map are now only included in the GōMartini 3 model, if they are within a certain distance range in the reference structure. We used a range between 0.3 – 1.1 nm. The lower boundary was chosen to avoid regions with excessively high bead density. These can create artifacts due to increased interactions with the surroundings, especially with other high bead density regions³³. The upper boundary was set to the non-bonded cutoff used in Martini simulations. Thus, only contacts which have their minimum position within the non-bonded cutoff are included in the model. Note that the underlying distance is measured between the BB beads of the Martini protein model. Thus, contacts are rarely excluded based on the lower boundary, while a few contacts are usually excluded due to the upper boundary.

The minimum sequence distance of the original model is k = 3³⁷. In our implementation, we used a minimum graph distance of k = 4, since at k = 3 the relative positions of BB particles can still be largely defined by bonded terms. Note that in the graph distance space of Martinize2 – the tool for automatic Martini protein topology generation⁵⁸ where we implemented the enhanced GōMartini 3 model –, are not only sequential BB–BB bonds considered, but also disulfide bridges. In any case, we recommend k = 3 in cases where the protein flexibility in loops is too high because no dihedrals are defined there.

Furthermore, regular non-bonded interactions – i.e., the Martini bead-bead interactions – between pairs of BB beads are excluded in the enhanced GōMartini 3 model if the amino acids have a contact according to the contact map. The reason for this choice is that the minimum position r_min of the sum of two LJ potentials, one from GōMartini and another one from the regular non-bonded Martini 3 interactions, is effectively at the larger r_min of the individual LJ potentials if the depths (ε) of the potentials are comparable (see Supplementary Fig. 2 and Supplementary Notes 2). As long as the distance between the BB beads in the reference structure is larger than the r_min of the regular non-bonded LJ potential, this does not impact the protein structure. However, if the reference structure has a shorter r_min the protein structure gets distorted. Excluding regular non-bonded interactions between BB beads connected in the GōMartini 3 model avoids this distortion. Overall, adding these exclusions is an upgrade relative to the previous GōMartini 2 implementation, as it improves protein packing in regions involving backbone-backbone interactions, such as beta-sheets (see Supplementary Fig. 10 and Supplementary Discussion 5).

Case Studies: Showing The Advantages Of GōMartini 3

In the first case study, we investigated the high membrane binding affinity of the PH domain of the phospholipase C PLCδ1. It is a peripheral membrane protein and representative of the phosphoinositol phosphate (PIP) binding family of PH domains. The PLCδ1 PH domain discussed here favorably binds PI(4,5)P₂^59,60,61. Figure 2A shows the PCLδ1 PH domain with a PI(4,5)P₂ lipid in its crystal structure binding pocket embedded in a POPC bilayer. Unbiased MD simulations starting from the membrane-bound structure confirm the high affinity of the PLCδ1 PH domain to a single PI(4,5)P₂ lipid (Supplementary Fig. 3), which has also been shown previously based on atomistic as well as CG simulations^{44,53,59,60,61}. We used three different structural bias models. Besides the GōMartini model, two different EN models with a cutoff distance of 0.8 nm were used. One was described by bonds of type 6 and a force constant of 500 kJ/(mol nm²), hereafter EN6, while the other one had bonds of type 1 and a force constant 700 kJ/(mol nm²) (hereafter EN1; for details of the models see Supplementary Methods 1). All three models confirm the strong binding of the PLCδ1 PH domain to PI(4,5)P₂. In the case of GōMartini and the EN1, the protein unbinds in one replica each, but it is able to find the PI(4,5)P₂ lipid again and re-binds to it (Supplementary Fig. 3).

**Fig. 2: PI(4,5)P₂ binding of the PLCδ1 PH domain studied at the CG Martini level.**

In order to quantify the binding affinity of the PLCδ1 PH domain, we calculated the potential of mean force (PMF) for the protein-lipid unbinding following the protocol of Naughton et al.⁶². Figure 2C depicts the corresponding PMFs along the distance between the lipid head group and the protein center of mass obtained with the three structural bias models. All three models confirm the strong affinity of the PLCδ1 PH domain to the PI(4,5)P₂ head group with a minimum at a protein-membrane distance of d ≈ 1.30–1.33 nm. The GōMartini model exhibits a minimum distance of d = 1.31 nm, while it is slightly shifted for the EN models (EN1: d = 1.30 nm, EN6: d = 1.33 nm). Overall, the three models agree on the location of the PMF minimum. The PMFs exhibit further differences between the three models. First, the error bars are larger in the case of GōMartini. This is expected because it contains fewer bonds than the EN models and because the LJ potential allows for more flexibility so that, in extreme cases, contacts can dissociate completely. As this increases the accessible conformational space, more sampling time is required to achieve the same level of error. Second, the depth of the minimum differs between the models. While the PMFs of the GōMartini model and the EN1 model are the same within the error bars around the global minimum (minima at − 26.3 ± 1.4 kJ/mol and − 26.7 ± 0.9 kJ/mol, respectively), the PMF of the EN6 has the highest minimum at − 21.1 ± 0.6 kJ/mol. In order to better understand the effect of bond type (and exclusions) on binding affinity, an additional PMF calculation was performed using bond type 1 with an EN force constant of 500 kJ/(mol nm²) (instead of 700 kJ/(mol nm²)). Excluding LJ interactions (bond type 1) lowers the PMF minimum by about 15 kJ/(mol nm²) (Supplementary Fig. 4). This result confirms an observation reported recently that EN models can overestimate the aggregation between proteins if the force constant is too low (here 500 kJ/(mol nm²)) and the non-bonded interactions in the network are excluded³³, due to a high bead density which can result in an overestimation of the interaction energy. This problem can be substantially increased when combined with the Martini 2 model (Supplementary Fig. 5 and Supplementary Discussion 1).

A characteristic of several PH domains is the existence of two binding modes to PIP lipids: a tightly bound structure corresponding to the crystal structure binding pocket and a loosely bound structure⁵⁹. Two different PIP interaction sites are known for PH domains: the canonical C-site and an alternative A-site which is the less common binding site. For the tightly as well as the loosely bound structure, both orientations have been detected⁵⁹. The PLCδ1 PH domain studied here preferentially orients its C-site towards the membrane at shorter and longer PI(4,5)P₂ protein distance⁵⁹. Figure 2C shows that the three models differ also in the binding strength of the loosely bound structure. While the GōMartini model exhibits the highest stabilization, the EN models show a reduced stabilization by more than 50%. To better understand the changes in orientation between the tightly and loosely bound structures, we analyzed the angles between two α-helices - α-helix_15,24 and α-helix_115,129 - and the membrane normal z for two windows of the umbrella sampling depicted in Fig. 2E. For the tightly bound structure, the probability distributions of the angles show a good agreement (upper panels, protein-membrane distance d = 1.3 nm). This changes at the loosely bound structure (lower panels, d = 2.1 nm). Here, the probability distributions of the α-helix_15,24-z angle differs between the three models. The GōMartini model stabilizes two orientations at 45° and 145°. Also, EN6 stabilizes two orientations (75° and 145°), while for the commonly used elastic network model EN1, only one orientation similar to the tightly bound structure is observed. This suggests that the GōMartini model, as well as the EN6 model, allow the protein to better adjust to the loosely bound structure, which stabilizes the interaction with the membrane.

Engineered mutants of T4 lysozyme are known as important benchmark systems to investigate ligand binding⁶³. In particular, the L99A mutant is a well-studied case^64,65,66, in which the mutation creates a small artificial cavity that can accommodate benzene and indole derivatives^67,68,69. It is our second test case for small molecule binding. Recently, we showed that the Martini 3 force field can accurately predict the L99A T4 lysozyme ligand-binding pocket and at least four binding pathways⁴². In addition, a nearly quantitative agreement of the binding free energy was obtained for nine different systems, including different ligands and the double mutant L99A/M102Q. Given the high similarity of apo and holo states of mutants of T4 lysozyme, which presents a ΔRMSD of 0.2 Å, the system was modeled using the EN approach⁴². However, recent atomistic studies using τ-Random Acceleration MD simulations indicated that maybe such a rigid CG approach was not fully adequate⁷⁰. In particular, it seems that ligand dissociations can involve intermediate metastable protein conformations, which can possibly impact dissociation pathways and rates⁷⁰. Our main hypothesis for this discrepancy was the limited flexibility of the EN approach, which possibly suppressed the small and local conformational changes necessary to open the binding pathways in the intermediate metastable states.

In order to verify this idea, we repeated the Martini 3 MD simulations involving benzene binding to L99A T4 lysozyme using our enhanced GōMartini approach. The main results are presented in Fig. 3. A total sampling of 0.9 ms per system was used here, with the GōMartini model calibrated to show an overall flexibility similar to the EN model. Distribution of the average protein backbone RMSF indicates that the GōMartini model was even slightly less flexible (see Fig. 3A) than the EN model. However, comparing the RMSF per residue (Fig. 3B) shows a slightly different pattern of flexibility, with the GōMartini model showing more rigid helical regions, but a slightly more flexible region around the L99A T4 lysozyme benzene pocket (C-terminal domain on the bottom of the structures displayed in Fig. 3B and D). This increased flexibility in the pocket seems to indeed have an impact on ligand binding, with clear local minima being observed in the PMF profile obtained with GōMartini (Fig. 3C). These are not observed with the EN model. It is worth mentioning that the binding free energy of the global minimum is almost identical between both models. The additional local minima observed with the GōMartini model are located at distances of 0.5 and 0.9 nm from the main pocket (located at ~ 0.2 nm in the PMF). Free energy estimates based on ligand densities indicate that the local minimum around 0.5 nm is located in a pre-pocket near the dissociation pathways between helices CD and DG, which also seems to be the most populated metastable intermediate for benzene observed in atomistic τ-Random Acceleration MD simulations⁷⁰. Although the EN model shows a local minimum at 0.65 nm, the free energy for this minimum is less than 5 kJ/mol, which reduces its significance compared to the deeper minima found in the GōMartini model. This result strongly suggests that the GōMartini approach can better capture subtle conformational fluctuations of the protein that are involved in induced-fit binding mechanisms. Further simulations using Gō potentials with the Martini 2 model indicate this version of Martini is not suitable for such applications (Supplementary Fig. 6 and Supplementary Discussion 2).

**Fig. 3: Unbiased simulations of ligand binding to L99A T4 lysozyme with elastic network and GōMartini models.**

The third test case is copper-zinc superoxide dismutase (SOD1), an example for allosteric path detection. SOD1 is a critical enzyme responsible for catalyzing the conversion of superoxide anions into hydrogen peroxide and molecular oxygen^71,72. It has gained significant attention due to its connection with amyotrophic lateral sclerosis (ALS), a neurodegenerative disorder⁷³. Over 100 different mutations in the SOD1 gene have been identified as causes for familial variants of the disease⁷⁴, but the precise molecular mechanisms underlying their pathogenicity remain a subject of active debate within the scientific community. It has been proposed that protein aggregation⁷², aberrant pro-oxidant catalysis⁷⁵, and metal dyshomeostasis⁷⁶ may be involved in the pathogenic mechanism. The proposed pathogenic processes listed have in common that they have been linked to the loss of Zn(II) ions from the holoprotein^{77,78,79,80,81,82,83}. Surprisingly, only a subset of ALS-linked SOD1 mutations occurs close to the metal site⁸⁴, raising questions about the molecular mechanisms involved in Zn(II) loss.

In its active dimeric form, each SOD1 monomer contains a Cu(I)/(II) ion, critical for catalytic function, and a Zn(II) ion, primarily serving a structural role. Close to these metal ions and the active site is the electrostatic loop (EL), which is known to be destabilized in several ALS-linked mutants^71,85. We have recently shown that a combination of the virtual site GōMartini approach and the Martini 3 model can provide insights into how subtle structural perturbations in SOD1, induced by mutations such as G93A, located 4 nm away from the catalytic site, might occur. These perturbations could increase the likelihood of the EL detaching from its native position and exposing the metal sites to water. Through extensive 480 μs CG MD simulations for both wild-type and G93A mutant SOD1, an allosteric pathway was identified explaining how the distant G93A mutation affects the EL⁴⁵. Here, we revisit this system to investigate whether similar results can be obtained using simpler EN models. Figures 4A and 4B reveal that overall flexibility trends in the GōMartini and EN models are comparable for wild-type SOD1. However, the GōMartini model exhibits reduced flexibility in the β-barrel core compared to the EN model, while the EL region displays the opposite trend. Strikingly, flexibility comparisons between wild-type and G93A mutant (Figs. 4C and 4D) demonstrate that the GōMartini model presents a more complex profile of the RMSF difference with increased stabilization around the mutation site and higher flexibility in multiple loops, including the EL. To elucidate the allosteric pathway through which these changes happen, we found that there were differences in residue-residue distance distributions connecting the mutation site and the EL when G93A mutant and wild-type are compared when using the GōMartini model (Fig. 4F). In contrast, our EN model failed to identify any differences connecting the mutation site and the EL. These results highlight the superior capability of Gō models in capturing subtle structural dynamic changes. Moreover, they suggest that the GōMartini approach has a promising potential to study long-range alterations in dynamics induced by single point mutations, even for ones introducing subtle molecular modifications such as the addition of a single methyl group, as exemplified by the G93A mutation of SOD1.

**Fig. 4: Comparison of the effect of elastic network and Gō models in the allosteric pathways of SOD1.**

Our fourth test case is an antigen:nanobody complex which we use to probe mechanical stability. The mechanical stability of proteins in enveloped viruses is of great relevance for virus-cell interaction⁸⁶ as emphasized for instance in studies on the SARS-CoV-2 spike (S) protein⁸⁷. SMFS experiments played a key role in unveiling this relevance and in enhancing our understanding of the molecular evolution of the SARS-CoV-2 variants⁸⁸. The key region of the S protein that is associated with cellular recognition is the so-called receptor binding domain (RBD), and in particular, this protein domain has presented key mutations in each of the variants of concern that enhanced the binding affinity of the entire S protein to the cellular receptor.

Here, we employed the GōMartini 3 approach for probing the interaction of an RBD-nanobody at a lower pulling speed than typically accessible by AA MD simulations, and furthermore, we avoided to apply position restraints on the RBD as they do not correspond to typical AFM-SMFS protocols. We have selected the GōMartini 3 approach due to its enhanced efficiency compared to its predecessor, GōMartini 2, which is implemented on the Martini 2 framework. This improvement makes it well-suited for studying significant conformational changes in proteins, particularly in nanomechanical and folding simulations (see Supplementary Fig. 7–10 and Supplementary Discussion 3–5). The GōMartini steered MD (SMD) simulations were conducted under similar conditions (without restraints), and we only fixed the position of one residue in the RBD, whereas the pulling residue was part of the nanobody. We calibrated the strength of the LJ potential (ε_LJ) in the GōMartini model following the AA SMD studies by Nguyen and Li⁸⁹. The GōMartini model was applied for both proteins as well as to define the protein complex interface. Note that in the AA reference study, position restraint potentials along all BB atoms in the RBD were applied, and a very high pulling speed was employed compared to the SMFS experiments. In this regard, the GōMartini SMD simulations reproduced quite well the average value of the rupture force, F_max, using the same pulling speed. Restraining the positions of certain groups of atoms is not equivalent to AFM-SMFS experiments and it is only a convenient way to avoid the protein unfolding in AA MD simulations. Thus, the use of a larger MD simulation box is recommended to capture the full dissociation process. Such simulations have a high cost in AA MD, and thus, large protein complexes undergoing conformational changes still suffer from limited sampling in SMD. We removed all artificial position restraints and performed the same study using the GōMartini model. The next step was to assess the impact of these artificial position restraints on the nanomechanics of protein complexes. Our average rupture force is about ~300 pN below the value reported by Nguyen and Li (F_max ~ 925 pN) (see Table 1 and Fig. 5A).

Table 1 Nanomechanical characterization of the RBD:H11-H4 complexes with and without position restraints at different pulling speed (v_pull) in SMD. The number of replicas is given next to F_max values

Full size table

**Fig. 5: Nanomechanics of the RBD:H11-H4 complex studied by GōMartini simulations at different pulling speeds.**

In fact, we showed how restraints have a negative impact on the mechanical stability of the protein complex, as they will overstabilize the protein complex which then will not give comparable results with SMFS experiments. In a second study by Golcuk et al.⁹⁰, position restraints were applied on a smaller number of non-hydrogen atoms located at the RBD:H11-H4 interface. This resulted in the rupture force being similar to the one obtained from our unrestrained GōMartini simulations at a lower pulling speed (see Fig. 5B), which is still about four orders of magnitude larger than the typical pulling speed in SMFS experiments (~ 10⁻⁹ nm/ps)⁸⁸. Note that in the AA SMD studies, a handful of replicas were performed whereas at the CG Martini level of resolution a large number of replicas can be run at the same computational effort as the few AA SMD replicas.

The nanomechanical characterization pulls at constant speed one side of the complex (i.e., nanobody) while the RBD remains anchored in space and this process perturbs the bound conformation of the H11-H4 nanobody starting from the pulling direction involving residues around LYS-128 (see Fig. 5C, D). The analysis of the protein chain at F_max revealed the stretching of the receptor binding module (residues 424-495), which is the region that is mostly in contact with the ACE2 receptor, such that part of the RBD is perturbed by the nanobody before dissociation. We identified several hydrophobic interactions as the most relevant ones for the buildup of F_max: VAL-483/SER-57, GLU-484/LYS-52, PHE-486/LEU-106, TYR-489/SER-103, TYR-489/TYR-104, and PHE-490/VAL-102 (see Fig. 5E). An additional nanomechanical study was performed with the dockerin:cohesin protein complex system, with the results displayed at the Supplementary Results 2. Our nanomechanical profiles captured the two most prominent dissociation pathways observed in by previous all-atom SMD simulation⁹¹.

Perspectives: How To Improve GōMartini And The Protein Model

The first perspective case focuses on improving contact maps and the strength of interactions. The combination of a Gō-like network with the Martini CG force field can be effectively employed to capture conformational changes. However, the choice of parameters to build the network is not obvious. To address this question, we explored the possibility of improving the key parameters of the GōMartini model: the strength of the interactions (ε_LJ) and the contact map. A convenient possibility is to fine-tune these parameters based on AA MD simulations instead of a single experimental structure. Nonetheless, it is worth noting that the initial GōMartini model can still exhibit a bias due to its starting configuration, thereby directing the simulations toward the native conformation. A common issue encountered in this context pertains to the definition of unnecessary Gō bonds within loop regions, mainly attributed to the tightly packed nature of these regions in the crystallographic and cryo-EM structures. Consequently, the native contacts may underestimate the flexibility of loop regions. Employing dynamical contact analysis of the AA MD simulations could distinguish between stable and transient contacts within the protein structure on the timescale of the AA simulation.

As a first exploration of how GōMartini parameters could be refined considering a dynamic contact analysis, several benchmark studies, including soluble (as per previous work by Poma et al.^37,39,40) and transmembrane proteins ranging from 76 to 4160 residues, were explored here. We initially focused on optimizing the effective depth of the LJ potential (ε_LJ) while preserving all Gō potentials, aiming to bring the standard GōMartini model in closer agreement with the protein dynamics observed in AA simulations, particularly in terms of RMSF. Notably, the optimal ε_LJ value exhibited significant variation across different systems, highlighting the importance of tailoring ε_LJ values to individual protein models rather than employing a uniform value across all systems.

As shown in Fig. 6, we computed the RMSF for the Cα atoms and BB beads in the AA and CG models, respectively, for three benchmark proteins: titin I-band (1TIT), glycoside hydrolase (3W0K), and the transmembrane domain of Ist2. Upon comparison of the original GōMartini model (blue lines) with the CHARMM36m AA reference (black), it becomes evident that although using specific ε_LJ values calibrated for each protein significantly improved the original GōMartini models, they still fail to capture the dynamics of several loop regions observed in AA simulations. Indeed, RMSF analysis indicates that these regions remain relatively rigid in standard GōMartini models compared to AA simulations. Note that the choice of atomistic force fields is not restricted to CHARMM36m, but it can be any preferred force field of the user or experimental data providing insights into protein flexibility.

**Fig. 6: Improving GōMartini to match AA models.**

To overcome this challenge and improve the accuracy of the GōMartini models, we further optimized them by checking the contact frequencies around each residue throughout an AA MD simulation (see Supplementary Methods 5 for more details). The results of this optimization are summarized in Supplementary Table 2. Figure 6 shows a good agreement between the modified GōMartini model (red lines) and the CHARMM36m AA reference (black), which demonstrates that the modified GōMartini model accurately captures characteristic fluctuations across most residues and is flexible enough to mimic the flexibility observed in AA MD simulations. It is important to highlight that, in the case of ubiquitin, cohesin, and aquaporin (see Supplementary Fig. 11), the method did not substantially improve the flexibility of the protein. Overall, it appears that investigations of systems featuring substantial conformational transitions require fine-tuning of Gō networks, shown here, and the possibility of flexible secondary structure, not included in current models.

We explored the option to use the Gō virtual sites to add water bias in IDPs and biomolecular condensates as a second perspective case. While the Martini 3 protein model has already greatly improved upon the previous iteration, two years after release, room for improvement has been identified regarding some specific aspects of the model. Studies have reported that Martini 3 underestimates the radius of gyration (R_g) of IDPs and multidomain proteins in solution when compared to experimental SAXS data^47,48. Simultaneously, it has also been shown recently that the behavior of transmembrane domains might be unstable in Martini 3, specifically in the case of transmembrane alpha-helical peptide insertion^49,92,93. For both cases, scaling of protein-water interactions, which was a common mitigation strategy employed in Martini 2, was suggested to resolve the issues. Thomasen et al.⁴⁷ found that increasing protein-water interactions by 10% results in improved agreement with SAXS data for IDPs and multidomain proteins, while Cabezudo et al.⁹² found that reducing protein-water interactions by 10% resulted in the correct insertion of transmembrane peptides. However, scaling interactions has the major downside of impacting all pair interactions that were altered, and not just the ones responsible for the unintended model behavior — i.e., scaling the P2-W pair interaction impacts not only protein BB-water interactions but also the interactions involving any other molecule containing P2 beads. This causes major transferability issues for the model and, as such, should be avoided if possible.

Although the virtual interaction sites built on top of BB beads are typically used only to define interactions to other sites for tertiary structure preservation, they additionally offer the possibility to effectively modify interactions between BB beads and other Martini beads in a site-specific manner. For example, by defining an interaction between a BB virtual site and water beads, it is possible to effectively increase the strength of the interaction between protein backbones and water. The strength of the resulting non-bonded interaction will be the sum of the P2-W and virtual site-W interactions. As the interaction is defined only between the virtual site and water beads, the increase in the strength of this interaction is restricted to P2 beads in the protein backbone only, and no other molecules are affected. Further, this approach is sufficiently versatile that it can be applied in only specific residues of proteins, such as transmembrane or disordered domains.

To showcase this, we tested the R_g of select IDPs using the enhanced GōMartini implementation. We used the set of IDPs from Thomasen et al.⁴⁷ to validate our approach against an existing one. The IDPs were coarse-grained, and virtual Gō sites included on the BB particles. An additional LJ interaction between the virtual Gō sites and water beads was added, with ε = 0.5 kJ/mol, which, when summed to the already existing BB-water interaction roughly corresponds to a 10% increase of the interaction. Although this value seems the same as proposed by Thomasen et al.^47,48, it is only applied to backbone-water interactions, while previous approaches applied the changes to the whole protein.

To further improve the model, we also developed a refined set of bonded parameters for backbones and side chains using AA simulations, as these are not implemented for coiled structures in Martinize2 (Supplementary Fig. 12). The addition of either component individually goes some way to improving the radius of gyration of the target set of IDPs, reducing the mean absolute error with respect to the experimental reference across the set from 1.35 nm to 1.25 nm in the case of the additional bonded parameters, and to 0.36 nm with the addition of the Gō site dedicated to water interactions (Supplementary Figs. 13 and 14). However, as we show in Fig. 7A, B, the combination of these extra parameters together further improved the R_g of the benchmark set with respect to the experimentally measured values, with a final mean absolute error of 0.28 nm.

**Fig. 7: Improving IDP global dimensions and condensation using Martini 3 with GōMartini model-based interaction rescaling.**

To additionally validate the use of virtual sites in improving the behavior of Martini IDPs, we carried out simulations of a known phase separating IDP. Recent work from Dzuricky et al. designed an artificial IDP that would demonstrate liquid-liquid phase separation based on an octapeptide repeating unit⁹⁴. In Fig. 7C, we show a phase-separated system of this protein after 5 µs, where the optimized IDP model above has been used for the WT20 construct (ie., 20 octapeptide repeats using the primary so-called ‘wild-type’ sequence). In contrast, Fig. 7D shows that without the optimization of the IDP described above, the phase separation of the model is visibly different, being more compact. This system in fact, does not form a liquid-like condensate but a solid-like aggregate, as evidenced by the analysis of the incoherent scattering curves in Supplementary Fig. 15.

As a further example of how increasing protein BB-water interactions can aid recapturing experimental behavior, we simulated a system of two FL dipeptides linked by a disulfide moiety. This system was shown to undergo liquid-liquid phase separation in the recent work of Abbas et al.⁹⁵. Figure 7E shows that native Martini 3 could capture the condensate formation with a coexisting dense and a dilute phase. However, the resulting condensate was too dry (~ 10% water weight content) compared with the experimental data (~62%). To alleviate this problem, again, we introduced virtual Gō sites on the BB beads, carrying an additional interaction with the water beads. As shown in Fig. 7E, increasing the BB-water interactions increases the water content of the condensate without affecting the phase separation. With an 8% increase in the strength of this interaction (corresponding to ε = 0.3464 kJ/mol), the water weight content already is above 50%, much closer to the experimental findings. Overall, these results demonstrate that beyond universal rescaling, the strength of the BB-water interaction can be fine-tuned to better reproduce the properties of biomolecular condensates.

In the third perspective case, we employ the Gō virtual sites to add water bias in TM helices and beta-sheet peptides. As mentioned in the previous section, there have been reports of issues surrounding the transmembrane insertion of some helical peptides using Martini 3^49,92,93,96. The solution previously proposed to overcome these issues was again to apply a rescaling of peptide-water interactions, similar to what has been done for IDPs. Here, we have also tested our GōMartini implementation, changing only the BB-water interactions. Four WALP α-helices – 16, 19, 23, and 27 residues in length, termed WALP16, WALP19, WALP23, and WALP27 – were coarse-grained using our GōMartini implementation and simulated embedded in a dimirystoylphosphatidylcholine (DMPC) membrane, as done by Spinti et al.⁴⁹. To facilitate the observation of WALP ejection from the membrane, a temperature of 310 K was used, instead of the 300 K used by Spinti et al.⁴⁹. A second set of these systems was run where an additional LJ interaction between the virtual Gō sites in helical residues and water beads were included, with ε = −1.0 kJ/mol, so effectively reducing the interactions with water to improve peptide insertion. The reduced LJ interaction substantially stabilized the transmembrane conformation of the four WALPs and reduced the TM peptide ejection in comparison to the control simulations (Fig. 8A, B and Supplementary Fig. 16).

**Fig. 8: Improving transmembrane peptide insertion and beta-sheet aggregation.**

Given the need to rescale BB-water interactions with both helical and coil protein segments, we aimed to assess whether beta-strand segments could also benefit from the rescaling of their BB-W interactions. To do so, we tested how the aggregation of RAD16-I is currently performing with Martini 3. RAD16-I is a synthetic amphipathic peptide that adopts a beta-strand structure in solution. It has been shown that RAD16-I associates to form a very stable beta-sheet in solution, even culminating in the formation of nanofibers with increasing peptide concentration^97,98. We assembled a simple system containing two copies of RAD16-I and followed their aggregation into a 2-strand beta sheet over the course of the simulation (Fig. 8D, E). As expected, with a GōMartini 3 model applied between the two chains, we can accurately reproduce the 2-strand beta sheet assembly. However, in the absence of any bias, we do not observe stable sheet assembly. To mitigate this, we applied an additional LJ interaction between the virtual Gō sites in strand residues and water beads with ε = − 0.5 kJ/mol. This reduction of BB-W interactions was sufficient to obtain sheet assembly similar to the one observed with the interchain Gō model. Note that while the fluctuations of the interface are lower for the interchain Gō model compared to the water bias correction, it requires a priori knowledge of the interface. Thus, the water bias correction is more generally applicable, still capturing the majority of the interface contacts.

Discussion

Despite attempts to explore multiple-state conformations states⁹⁹ or to enhance the accuracy of Martini protein models through approaches such as polarizable^29,100, titratable^101,102, and even possibly foldable¹⁰⁰ versions, the combination of standard Martini with a bias, such as an EN or a Gō model, remains the most attractive and useful option due to its computational performance and compatibility with large libraries of Martini models.

The virtual-site implementation of the enhanced Gō-like model with Martini 3 was initially introduced as a proof of concept in the works involving SOD1⁴⁵ and light-harvesting complex II³⁸, being officially adopted as the Gō approach tested during Martini 3 development. Since the release of Martini 3, the model has been recommended and included in our tutorials¹⁰³ and thus been extensively tested by the modeling community. However, the key features of the approach and the underlying Martini 3 protein model had not been presented until now. The main goal of this work was to finally detail all the advantages of the current implementation in relation to ENs and the previous GōMartini 2 implementation. Concerning the advantages in relation to ENs, we clearly show here how the improved conformational flexibility — stemming from the use of asymmetric potentials with finite dissociation energies — can be used to study long-range allosteric changes in proteins, protein-small molecule binding, and protein-membrane binding. These kinds of applications are still scarce within the Martini community, and we foresee more studies in the future showing the benefits of the more accurate protein flexibility introduced by GōMartini 3. However, it is important to highlight that this approach is currently not suitable for systems consisting of hundreds of copies of the same protein, given that the same Gō bonds which stabilize certain folded states will also wrongly impact protein-protein interactions. To circumvent the impact on protein-protein interactions, a unique protein model for each monomer is required. For small oligomers such as dimers, trimers, etc., this strategy is easily applicable whereas it becomes impracticable for systems containing hundreds of monomers. While unique protein models for hundreds of copies will remedy the impact on protein-protein interactions, the number of protein-protein interface combinations may possibly explode beyond what can be handled by this approach. For instance, a simulation box with 100 copies of the same protein, with each protein with two protein-protein interfaces, each with 10 contacts, would need to have a total of 10*(100!/(100-2)!) = 99,000 interface contacts to be defined. Thus, simulations of crowded membranes⁸ and even future cell-simulations¹⁰⁴ may still rely on simple EN approaches. Note that a work building on the pre-release of the GōMartini 3 model presented here tackled this challenge and provided a solution for systems with many copies of the same protein, such as amyloid fibrils¹⁰⁵.

In the Supplementary Discussion, we provide a comparison between GōMartini 2 and GōMartini 3 for protein-lipid binding (Supplementary Discussion 1), protein-ligand binding (Supplementary Discussion 2), a nanomechanics test case (Supplementary Discussion 4), and β-hairpin folding (Supplementary Discussion 5). The results show clear improvements of the GōMartini 3 model, such as no unspecific ligand binding and no overstabilization of protein-protein and protein-lipid contacts. These improvements are nevertheless intertwined with the improvements of the Martini 3 force field. The lower RMSD of the folded β-hairpin of 0.17 nm (GōMartini 3) compared to 0.24 nm obtained with GōMartini 2 is particularly due to the improvements in the model, namely the exclusion of the regular non-bonded interactions for the Gō-like bonds. Although nanomechanics studies have been performed before with GōMartini 2, our results here reinforce its accuracy and show some advantages in reproducing conformational transitions for simulations mimicking AFM profiles. The GōMartini 2 implementation by Poma et al. captured the unfolding profile of the I27 domain of titin, type I cohesin domain, and ubiquitin with experimental forces equal to 204 pN, 480 pN, 230 pN, respectively³⁷. Although GōMartini 2 correctly reproduced the expected trends, the forces were twice as large due to the speed of the SMD simulation (e.g., ~ 10^-3nm/ps). In this regard, the current virtual site implementation is more convenient as it allows for the full integration of GROMACS¹⁰⁶ and OpenMM¹⁰⁷ parallelization, and thus, one can use lower pulling speeds with the SMD protocol of ~ 10^-5nm/ps, which is significantly closer to the pulling speed of SMFS experiments of ~ 10^-9nm/ps, without compromising computational cost.

In addition to the gain in computational performance and numerical stability in relation to the previous GōMartini 2, the use of virtual sites provides the flexibility to introduce corrections to the backbone-water interactions. In contrast to recently published approaches, we suggest using only water interaction biases in relation to the backbone beads. One of the key reasons is the overall quality of the water/oil partitioning estimates of side chains (Supplementary Fig. 1 and Supplementary Table 1), which does not show any particular trend of being too hydrophilic or too hydrophobic, with average errors below ~ 3 kJ/mol. On the other hand, the hydrophobicity of the protein backbone has always been under debate, as its capability of forming internal hydrogen bonds in secondary structure motifs may affect its partitioning to different environments. This idea was one of the main assumptions of the original Martini 2 implementation²⁸ and was recently also incorporated in the SPICA CG model^108,109. Although some of our results presented here may point out that the Martini 3 model could benefit from the same approach, a broader view of peptides and proteins in different contexts indicates that water biases dependent on the secondary structure are not general. As a temporary solution, based on our preliminary findings, we suggest increasing the interactions between backbone-water for intrinsically disordered motifs by 1.0 kJ/mol while decreasing the interactions by 1.0 kJ/mol for simple helical transmembrane peptides. Intrachain β-sheets can be stabilized with standard GōMartini 3 interactions, while interchain interactions could benefit from reduced water interactions of 0.5 kJ/mol and/or the use of standard Gō-like bonds. These solutions are not generic and need to be used with caution. For instance, soluble globular proteins with high helical content, such as lysozyme³⁴, may aggregate too much if we consider the correction indicated for transmembrane proteins in this work. In other cases, such as protein or small disordered peptide dimers, the solution may be beneficial, although a direct application of an interchain Gō-like network can be more accurate. Indeed, in the case of protein complexes, several studies^39,40 reported the need to model the complexes with additional interchain Gō-like bonds at the interface either using the GōMartini approach or alternative Gō models (i.e., OLIVES¹¹⁰). The combined representation of structure-based models at the interface of protein complexes led to the capture of large conformational changes under nanomechanical probing^39,40, as it is studied by SMFS. Many aspects of the current protein model are being revisited now, including further improvements in side chain self-interaction, improved backbone torsions and side chain rotamers, and more detailed backbone models. These improvements take advantage of the specific features of Martini 3, including different bead sizes and labels.

One additional aspect touched by our work is the possibility of refining GōMartini parameters, i.e., the depth of the potentials and contact map. We show that the use of specific potential depths ε_LJ and improvements in the contact map based on multiple reference structures obtained for instance, from atomistic simulations, can greatly improve the overall flexibility of the models. Automatic refinement of these parameters may be possible via approaches based on particle swarm optimization strategies such as CGCompiler¹¹¹ and SwarmCG¹¹². Similar ideas can be developed considering AI-based approaches instead of atomistic MD references, such as the recent implementation of ENs using AlphaFold confidence scores³². It is promising to further expand this kind of approach in the future, because it may be the key for the simultaneous representation of multiple conformational states. One recent implementation in this direction involves replacing the single-basin Gō model with a multiple-basin Gō model¹¹³ or at least a double-well potential. This modification may allow large conformational changes, enabling a more accurate representation of the transitions between stable folded states of proteins.

A last important remark regarding the virtual site approach is related to our view for future protein model development in the Martini force field: development of Martini protein models and bias approaches, such as GōMartini and EN models, should be decoupled. In previous iterations, these two aspects were interconnected in a way that biases in secondary and tertiary structure were fully integrated in the model, even affecting bead types and mapping^28,31. Such integration blocked further development, as any attempts to change the model would need to involve developing both the core model and the bias. For instance, improving protein flexibility to allow secondary structure changes in Martini 2 would depend on dramatically changing fundamental aspects of the model as bead types depend on secondary structure, and ENs were of paramount importance for beta-sheet stability. Therefore, we advocate for a complete decoupling of the two developmental pathways. While the protein models should follow the typical building block rules and validation of Martini models, the structural biases should always come as an additional experimental/theoretical potential applied on top of the model, used to bias the simulated ensembles. This approach guarantees that the Martini protein model can independently evolve, with further improvements hopefully resulting in the use of less/weaker biases. Although the ultimate aspiration remains the creation of a bias-free Martini protein model, we recognize the enduring importance of approaches such as GōMartini as fundamental tools for accurately modeling proteins within the Martini universe.

Methods

General Workflow To Set Up The GōMartini Model

In order to build a GōMartini 3 model for a protein, a two-step procedure has to be followed. First, a contact map specifying the OV and rCSU contacts can be obtained from the web server http://pomalab.ippt.pan.pl/GoContactMap/^39,40 (which is replacing the previous one¹¹⁴: http://info.ifpan.edu.pl/~rcsu/rcsu/) or via the ContactMapGenerator program available at https://github.com/Martini-Force-Field-Initiative/GoMartini/tree/main/ContactMapGenerator, using the default settings^114,115. Subsequently, Martinize2 can be used to obtain the CG coordinates and topology files from the atomistic reference structure and the contact map¹¹⁶. Box 1 summarizes this workflow to set up the GōMartini 3 models. In order to activate the GōMartini model for the structure bias in Martinize2, the ‘-go’ flag has to be specified. In addition, the contact map file can be provided after the ‘-go’ flag. Note that the format of the contact map has to adhere to the specifications outlined in the Supplementary Notes 2. The Gō model can further be fine-tuned by adjusting the biasing strength (‘-go-eps’), the upper and lower cut-off distance (‘-go-up’ and ‘-go-low’), as well as the residue distance (‘-go-res-dist’). If these flags are omitted, Martinize2 uses the default values described in the Results section. The default value for the potential depth is still the one recommended in the original GōMartini 2 implementation: ε_LJ = 9.414 kJ/mol. In contrast to the previous implementation, Martinize2 utilizes the graph residue distance instead of the sequence distance, which are the same for almost all cases except those where amino acids are connected through the side chains (e.g., in the case of disulfide bridges). Aside from the definition of the intra-molecule parameters, Martinize2 writes the atom types and non-bonded interactions required to run a simulation with the GōMartini model. The file name of these files is preceded with the molecule name (by default ‘molecule_0’), which is also used in the naming of the virtual sites. Utilizing the ‘-go-moltype’ flag the name can be adjusted such that when martinizing multiple different proteins, the Gō definitions are compatible. More details including an example and tips and tricks are provided on the Martinize2 github page (https://github.com/marrink-lab/vermouth-martinize). The enhanced GōMartini 3 model using virtual sites is also implemented in MAD, the Martini Database¹¹⁷, which also includes the option to manually remove or add Gō interactions, allowing the user to include, for instance, additional experimental information which may correct issues originating from the reference structure.

Adding Virtual Site Water Bias

As previously mentioned, the virtual site approach underlying the enhanced GōMartini 3 model can also be used to specifically fine-tune interactions between Martini beads. For instance, it can be used to modify the strength of the protein-water interaction. Using the ‘-water-bias’ flag in Martinize2 allows to automatically generate the BB virtual sites and non-bonded interaction parameters required for the water bias. Whereas the water bias can be combined with the GōMartini model, it does not require it. For example, adding water bias with an EN is also possible. The water bias can be added depending on the secondary structure element, with its strength defined using the ‘-water-bias-eps’ flag. The water bias defines the depth of the LJ potential between the virtual site and the water bead. The values can be positive (to effectively increase the water-BB virtual site interactions) or negative (decreasing water-BB virtual site interaction). The default value for the strength of the water bias in Martinize2 is zero, so the user needs to define it. Some suggestions are presented in the perspective section of the Results. Finally, Martinize2 also supports the definition of intrinsically disordered regions (IDRs) using the ‘-id-regions’ flag. A special water bias can be defined for these regions with the previous flag using ‘idr’ instead of a secondary structure assignment. Defining IDRs is useful when a protein contains both a folded and a disordered domain because Martinize2 might still detect some (transient) secondary structure in the disordered domain.

Martini 3 IDPs From Sequence

Lastly, Martini 3 CG protein models for IDPs can also be generated directly from a sequence fasta file using Polyply¹¹⁸. As a fully disordered protein has no reference structure, a contact map is not necessary for this method of generation. Instead, Polyply gen_params automatically generates the molecule itp file from the sequence already including the virtual sites for the addition of an appropriate water bias and the automatic addition of additional backbone dihedral potentials. As Polyply should not be generally used to generate topologies of folded proteins, these parameters are added automatically (i.e., without any additional flag specifications) when Polyply is used with the Martini 3.0.0 force field library. The system coordinates (e.g., of a solvated IDP) can subsequently be generated using Polyply gen_coords (Box 1). The Polyply-generated files can be used directly, only externally requiring a further definition of the extra interactions between the BB virtual sites and water as in the case of Martinize2 and described in more detail in the previous section. An example of how to use Polyply to generate and use these parameters is available on the PolyplyGitHub wiki. The Polyply route is especially convenient for high-throughput simulations of many different disordered proteins.

General Simulation Settings

All simulations were performed with the program package GROMACS¹⁰⁶ (versions 2018.x to version 2023.x). Settings for the CG simulations follow the updated set of Martini run parameters¹¹⁹ using a time step of 20 fs. Specifically, the Verlet neighbor search algorithm was used to update the neighbor list, with a cutoff of 1.1 nm for the non-bonded interactions. Coulombic interactions were treated using reaction-field electrostatics with a dielectric constant of 15. The Parrinello–Rahman barostat¹²⁰ (coupling parameter of 12.0 ps) and the velocity-rescaling thermostat¹²¹ (coupling parameter of 1.0 ps) were used to maintain pressure and temperature, respectively. More technical details about the system setups, simulation settings, and analysis for each specific test case are given in the Supplementary Methods.

Martini Models And System Setup

All simulations were performed using the open-beta¹²² or more recent development versions of Martini 3 force field⁴³, with the protein models generated by Martinize²⁹ (for open-beta test cases), Martinize2⁵⁸ (for folded proteins simulated with the final Martini 3 release) or Polyply¹¹⁸ (for IDPs simulated with the final Martini 3 release). Except for IDP and biomolecular condensate systems, bonded parameters are still dependent on the secondary structure, which is calculated by the DSSP approach⁵⁰ using an atomistic reference structure. In addition, the side-chain dihedral corrections scFix⁵² are included for all secondary structure elements. The contact maps for the GōMartini models were generated using the contact map approach proposed by Cieplak and co-workers¹¹⁴ (http://info.ifpan.edu.pl/~rcsu/rcsu/) or the recent implementation from the Poma group^39,40 (http://pomalab.ippt.pan.pl/GoContactMap/). Besides the enhanced implementation of the GōMartini model, we also employed two different EN models to maintain the structural protein scaffold in the case of the PH domain, T4 lysozyme, and SOD1. Slightly modified versions of these two EN settings are typically used in combination with the CG force field Martini^28,29,31,33. Here, we want to focus solely on the impact of the structural bias models. Thus, all other bonded parameters of the protein models are unchanged between the Gō-like and the EN models. The EN models are set up based on a distance cutoff criterion between the BB beads in the CG reference structure of the protein. Harmonic potentials are used to constrain the protein flexibility and to maintain the protein structure. These can be applied in two different ways: (i) either the non-bonded interactions between the BB beads connected by a harmonic potential are excluded, or (ii) the harmonic potentials act on top of the non-bonded interactions. In the first case, the bond type corresponds to a regular chemical bond, i.e., bond type 1 in GROMACS. In the second case, bond type 6 is used in GROMACS. Here, we used the GROMACS bond types to distinguish between the different settings, namely EN type 1 and EN type 6, respectively. See more details about the protein models used in each specific test case in the Supplementary Methods.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Data generated in this study, including system setups, force field parameters, and GROMACS input files have been deposited on Zenodo and are publicly available: https://doi.org/10.5281/zenodo.11198318. The PDB accession codes in this work are given as follows: 5XRN, 4IU3, 1UBQ, 1TIT, 1AOH, 3W0K, 4CSK, IST2, 1MAI, 181L, 2C9V, 6ZH9, 1GB1 and 6QMB. Source data are provided as a Source Data file. Source data are provided in this paper.

Code availability

The code used to generate the protein models is available at github.com/marrink-lab/vermouth-martinize, pomalab.ippt.pan.pl/GoContactMap/, github.com/Martini-Force-Field-Initiative/GoMartini/tree/main/ContactMapGenerator, and https://github.com/marrink-lab/polyply_1.0/wiki/Tutorial:-Martini-3-IDPs---proteins.

References

Grimaldo, M., Roosen-Runge, F., Zhang, F., Schreiber, F. & Seydel, T. Dynamics of proteins in solution. Q. Rev. Biophys. 52, https://doi.org/10.1017/S0033583519000027 (2019).
Bonomi, M., Heller, G. T., Camilloni, C. & Vendruscolo, M. Principles of protein structural ensemble determination. Curr. Opin. Struct. Biol. 42, 106–116 (2017).
Article CAS PubMed Google Scholar
Papaleo, E. et al. The role of protein loops and linkers in conformational dynamics and allostery. Chem. Rev. 116, 6391–6423 (2016).
Article CAS PubMed Google Scholar
Freddolino, P. L., Harrison, C. B., Liu, Y. & Schulten, K. Challenges in protein folding simulations: Timescale, representation, and analysis. Nat. Phys. 6, 751–758 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chung, H. S., Piana-Agostinetti, S., Shaw, D. E. & Eaton, W. A. Structural origin of slow diffusion in protein folding. Science 349, 1504–1510 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Beck, T. L., Carloni, P. & Asthagiri, D. N. All-atom biomolecular simulation in the exascale Era. J. Chem. Theory Comput. 20, 1777–1782 (2024).
Article CAS PubMed Google Scholar
Biriukov, D. & Vácha, R. Pathways to a shiny future: Building the foundation for computational physical chemistry and biophysics in 2050. ACS Phys. Chem. Au 4, 302–313 (2024).
Marrink, S. J. et al. Computational modeling of realistic cell membranes. Chem. Rev. 119, 6184–6226 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Corradi, V. et al. Emerging diversity in lipid-protein interactions. Chem. Rev. 119, 5775–5848 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bernardi, R. C. et al. Mechanisms of nanonewton mechanostability in a protein complex revealed by molecular dynamics simulations and single-molecule force spectroscopy. J. Am. Chem. Soc. 141, 14752–14763 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gomes, D. E. B., Melo, M. C. R., Gomes, P. S. F. C. & Bernardi, R. C. Bridging the gap between in vitro and in silico single-molecule force spectroscopy. Preprint at https://doi.org/10.1101/2022.07.14.500151 (2022).
Lostao, A., Lim, K., Pallarés, M. C., Ptak, A. & Marcuello, C. Recent advances in sensing the inter-biomolecular interactions at the nanoscale - A comprehensive review of AFM-based force spectroscopy. Int. J. Biol. Macromol. 238, 124089 (2023).
Article CAS PubMed Google Scholar
Henriques, J., Arleth, L., Lindorff-Larsen, K. & Skepö, M. On the calculation of SAXS profiles of folded and intrinsically disordered proteins from computer simulations. J. Mol. Biol. 430, 2521–2539 (2018).
Article CAS PubMed Google Scholar
Pesce, F. & Lindorff-Larsen, K. Refining conformational ensembles of flexible proteins against small-angle x-ray scattering data. Biophys. J. 120, 5124–5135 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Páll, S. et al. Heterogeneous parallelization and acceleration of molecular dynamics simulations in GROMACS. J. Chem. Phys. 153, 134110 (2020).
Article ADS PubMed Google Scholar
Phillips, J. C. et al. Scalable molecular dynamics on CPU and GPU architectures with NAMD. J. Chem. Phys. 153, 044130 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lee, T.-S. et al. GPU-Accelerated molecular dynamics and free energy methods in amber18: Performance enhancements and new features. J. Chem. Inf. Model. 58, 2043–2050 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Kmiecik, S. et al. Coarse-grained protein models and their applications. Chem. Rev. 116, 7898–7936 (2016).
Article CAS PubMed Google Scholar
Ingólfsson, H. I. et al. The power of coarse graining in biomolecular simulations. Wiley Interdiscip. Rev. Comput. Mol. Sci. 4, 225–248 (2014).
Article PubMed Google Scholar
Borges-Araújo, L. et al. Pragmatic coarse-graining of proteins: Models and applications. J. Chem. Theory Comput. 19, 7112–7135 (2023).
Article PubMed Google Scholar
Taketomi, H., Ueda, Y. & Gō, N. Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effect of specific amino acid sequence represented by specific inter-unit interactions. Int. J. Pept. Protein Res. 7, 445–459 (1975).
Article CAS PubMed Google Scholar
Go, N. & Taketomi, H. Studies on protein folding, unfolding and fluctuations by computer simulation. IV. Hydrophobic interactions. Int. J. Pept. Protein Res. 13, 447–461 (1979).
Article CAS PubMed Google Scholar
Cofas-Vargas, L. F., Moreira, R. A., Poblete, S., Chwastyk, M. & Poma, A. B. The go-martini approach: Revisiting the concept of contact maps and the modelling of protein complexes. Acta Phys. Pol. A 145, S9–S9 (2024).
Article ADS CAS Google Scholar
Poma, A. B., Thu, T. T. M., Tri, L. T. M., Nguyen, H. L. & Li, M. S. Nanomechanical stability of Aβ tetramers and fibril-like structures: Molecular dynamics simulations. J. Phys. Chem. B 125, 7628–7637 (2021).
Article CAS PubMed PubMed Central Google Scholar
Poma, A. B., Guzman, H. V., Li, M. S. & Theodorakis, P. E. Mechanical and thermodynamic properties of Aβ, Aβ, and α-synuclein fibrils: a coarse-grained method to complement experimental studies. Beilstein J. Nanotechnol. 10, 500–513 (2019).
Article CAS PubMed PubMed Central Google Scholar
Poma, A. B., Chwastyk, M. & Cieplak, M. Elastic moduli of biological fibers in a coarse-grained model: crystalline cellulose and β-amyloids. Phys. Chem. Chem. Phys. 19, 28195–28206 (2017).
Article CAS PubMed Google Scholar
Marrink, S. J., Risselada, H. J., Yefimov, S., Tieleman, D. P. & de Vries, A. H. The MARTINI force field: coarse grained model for biomolecular simulations. J. Phys. Chem. B 111, 7812–7824 (2007).
Article CAS PubMed Google Scholar
Monticelli, L. et al. The MARTINI coarse-grained force field: Extension to proteins. J. Chem. Theory Comput. 4, 819–834 (2008).
Article CAS PubMed Google Scholar
de Jong, D. H. et al. Improved parameters for the martini coarse-grained protein force field. J. Chem. Theory Comput. 9, 687–697 (2013).
Article PubMed Google Scholar
Marrink, S. J. & Tieleman, D. P. Perspective on the Martini model. Chem. Soc. Rev. 42, 6801–6822 (2013).
Article CAS PubMed Google Scholar
Periole, X., Cavalli, M., Marrink, S.-J. & Ceruso, M. A. Combining an elastic network with a coarse-grained molecular force field: Structure, dynamics, and intermolecular recognition. J. Chem. Theory Comput. 5, 2531–2543 (2009).
Article CAS PubMed Google Scholar
Jussupow, A. & Kaila, V. R. I. Effective molecular dynamics from neural network-based structure prediction models. J. Chem. Theory Comput. 19, 1965–1975 (2023).
Article CAS PubMed PubMed Central Google Scholar
Alessandri, R. et al. Pitfalls of the Martini model. J. Chem. Theory Comput. 15, 5448–5460 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stark, A. C., Andrews, C. T. & Elcock, A. H. Toward optimized potential functions for protein-protein interactions in aqueous solutions: osmotic second virial coefficient calculations using the MARTINI coarse-grained force field. J. Chem. Theory Comput. 9, 4176–4185 (2013).
Majumder, A. & Straub, J. E. Addressing the excessive aggregation of membrane proteins in the MARTINI model. J. Chem. Theory Comput. 17, 2513–2521 (2021).
Article CAS PubMed PubMed Central Google Scholar
Javanainen, M., Martinez-Seara, H. & Vattulainen, I. Excessive aggregation of membrane proteins in the Martini model. PLoS ONE 12, e0187936 (2017).
Article PubMed PubMed Central Google Scholar
Poma, A. B., Cieplak, M. & Theodorakis, P. E. Combining the MARTINI and structure-based coarse-grained approaches for the molecular dynamics studies of conformational transitions in proteins. J. Chem. Theory Comput. 13, 1366–1374 (2017).
Article CAS PubMed Google Scholar
Thallmair, S., Vainikka, P. A. & Marrink, S. J. Lipid fingerprints and cofactor dynamics of light-harvesting complex II in different membranes. Biophys. J. 116, 1446–1455 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Mahmood, M. I., Poma, A. B. & Okazaki, K.-I. Optimizing Gō-MARTINI coarse-grained model for F-BAR protein on lipid membrane. Front. Mol. Biosci. 8, 619381 (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, Z. et al. Mapping mechanostable pulling geometries of a therapeutic anticalin/CTLA-4 protein complex. Nano Lett. 22, 179–187 (2022).
Article ADS PubMed Google Scholar
Cofas-Vargas, L. F. et al. Nanomechanical footprint of SARS-CoV-2 variants in complex with a potent nanobody by molecular simulations. Nanoscale 16, 18824–18834 (2024).
Souza, P. C. T. et al. Protein-ligand binding with the coarse-grained Martini model. Nat. Commun. 11, 3714 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Souza, P. C. T. et al. Martini 3: a general purpose force field for coarse-grained molecular dynamics. Nat. Methods 18, 382–388 (2021).
Article CAS PubMed Google Scholar
Thallmair, V. et al. Two cooperative binding sites sensitize PI(4,5)P recognition by the tubby domain. Sci. Adv. 8, eabp9471 (2022).
Article CAS PubMed PubMed Central Google Scholar
Souza, P. C. T., Thallmair, S., Marrink, S. J. & Mera-Adasme, R. An allosteric pathway in copper, Zinc superoxide dismutase unravels the molecular mechanism of the G93A amyotrophic lateral sclerosis-linked mutation. J. Phys. Chem. Lett. 10, 7740–7744 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gomes, P. S. F. C., Forrester, M., Pace, M., Gomes, D. E. B. & Bernardi, R. C. May the force be with you: The role of hyper-mechanostability of the bone sialoprotein binding protein during early stages of infections. Front. Chem. 11, 1107427 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Thomasen, F. E., Pesce, F., Roesgaard, M. A., Tesei, G. & Lindorff-Larsen, K. Improving martini 3 for disordered and multidomain proteins. J. Chem. Theory Comput. 18, 2033–2041 (2022).
Article CAS PubMed Google Scholar
Thomasen, F. E. et al. Rescaling protein-protein interactions improves Martini 3 for flexible proteins in solution. Nat. Commun. 15, 6645 (2024).
Article CAS PubMed PubMed Central Google Scholar
Spinti, J. K., Neiva Nunes, F. & Melo, M. N. Room for improvement in the initial martini 3 parameterization of peptide interactions. Chem. Phys. Lett. 819, 140436 (2023).
Article CAS Google Scholar
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).
Article CAS PubMed Google Scholar
Alessandri, R. et al. Martini 3 coarse‐grained force field: Small molecules. Adv. Theory Simul. 5, 2100391 (2022).
Article CAS Google Scholar
Herzog, F. A., Braun, L., Schoen, I. & Vogel, V. Improved side chain dynamics in MARTINI simulations of protein-lipid interfaces. J. Chem. Theory Comput 12, 2446–2458 (2016).
Article CAS PubMed Google Scholar
Borges-Araújo, L., Souza, P. C. T., Fernandes, F. & Melo, M. N. Improved parameterization of phosphatidylinositide lipid headgroups for the martini 3 coarse-grain force field. J. Chem. Theory Comput. 18, 357–373 (2022).
Article PubMed Google Scholar
Borges-Araújo, L. et al. Martini 3 coarse-grained force field for cholesterol. J. Chem. Theory Comput. 19, 7387–7404 (2023).
Article PubMed Google Scholar
Grünewald, F. et al. Martini 3 coarse-grained force field for carbohydrates. J. Chem. Theory Comput. 18, 7555–7569 (2022).
Article PubMed PubMed Central Google Scholar
Koukos, P. I. et al. Martini 3 force field parameters for protein lipidation post-translational modifications. J. Chem. Theory Comput. 19, 8901–8918 (2023).
Tsanai, M., Frederix, P. W. J. M., Schroer, C. F. E., Souza, P. C. T. & Marrink, S. J. Coacervate formation studied by explicit solvent coarse-grain molecular dynamics with the Martini model. Chem. Sci. 12, 8521–8530 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kroon, P. C. et al. Martinize2 and Vermouth: Unified framework for topology generation. eLife 12, RP90627 (2023).
Naughton, F. B., Kalli, A. C. & Sansom, M. S. P. Modes of Interaction of Pleckstrin Homology Domains with Membranes: Toward a Computational Biochemistry of Membrane Recognition. J. Mol. Biol. 430, 372–388 (2018).
Article CAS PubMed Google Scholar
Rebecchi, M., Peterson, A. & McLaughlin, S. Phosphoinositide-specific phospholipase C-delta 1 binds with high affinity to phospholipid vesicles containing phosphatidylinositol 4,5-bisphosphate. Biochemistry 31, 12742–12747 (1992).
Article CAS PubMed Google Scholar
Uekama, N., Sugita, T., Okada, M., Yagisawa, H. & Tuzi, S. Phosphatidylserine induces functional and structural alterations of the membrane-associated pleckstrin homology domain of phospholipase C-delta1. FEBS J. 274, 177–187 (2007).
Article CAS PubMed Google Scholar
Naughton, F. B., Kalli, A. C. & Sansom, M. S. P. Association of peripheral membrane proteins with membranes: Free energy of binding of GRP1 PH domain with phosphatidylinositol phosphate-containing model bilayers. J. Phys. Chem. Lett. 7, 1219–1224 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mobley, D. L. & Gilson, M. K. Predicting binding free energies: Frontiers and benchmarks. Annu. Rev. Biophys. 46, 531–558 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mondal, J., Ahalawat, N., Pandit, S., Kay, L. E. & Vallurupalli, P. Atomic resolution mechanism of ligand binding to a solvent inaccessible cavity in T4 lysozyme. PLoS Comput. Biol. 14, e1006180 (2018).
Article ADS PubMed PubMed Central Google Scholar
Nunes-Alves, A., Zuckerman, D. M. & Arantes, G. M. Escape of a small molecule from inside T4 Lysozyme by multiple pathways. Biophys. J. 114, 1058–1066 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Rydzewski, J. & Valsson, O. Finding multiple reaction pathways of ligand unbinding. J. Chem. Phys. 150, 221101 (2019).
Article ADS PubMed Google Scholar
Liu, L., Baase, W. A., Michael, M. M. & Matthews, B. W. Use of stabilizing mutations to engineer a charged group within a ligand-binding hydrophobic cavity in T4 lysozyme. Biochemistry 48, 8842–8851 (2009).
Article CAS PubMed Google Scholar
Morton, A. & Matthews, B. W. Specificity of ligand binding in a buried nonpolar cavity of T4 lysozyme: linkage of dynamics and structural plasticity. Biochemistry 34, 8576–8588 (1995).
Article CAS PubMed Google Scholar
Capelli, R., Carloni, P. & Parrinello, M. Exhaustive search of ligand binding pathways via volume-based metadynamics. J. Phys. Chem. Lett. 10, 3495–3499 (2019).
Article CAS PubMed Google Scholar
Nunes-Alves, A., Kokh, D. B. & Wade, R. C. Ligand unbinding mechanisms and kinetics for T4 lysozyme mutants from τRAMD simulations. Curr. Res. Struct. Biol. 3, 106–111 (2021).
Article CAS PubMed PubMed Central Google Scholar
Strange, R. W. et al. Variable metallation of human superoxide dismutase: atomic resolution crystal structures of Cu-Zn, Zn-Zn and as-isolated wild-type enzymes. J. Mol. Biol. 356, 1152–1162 (2006).
Article CAS PubMed Google Scholar
Wright, G. S. A., Antonyuk, S. V. & Hasnain, S. S. The biophysics of superoxide dismutase-1 and amyotrophic lateral sclerosis. Q. Rev. Biophys. 52, e12 (2019).
Article CAS PubMed Google Scholar
Trist, B., Hilton, J. B., Crouch, P. J., Hare, D. J. & Double, K. L. Superoxide dismutase 1 in health and disease: How a front-line antioxidant becomes neurotoxic. Angew. Chem. Int. Ed. Engl. 60, 9215–9246 (2020).
ALSoD. https://alsod.ac.uk/ (2015).
Crow, J. P., Sampson, J. B., Zhuang, Y., Thompson, J. A. & Beckman, J. S. Decreased zinc affinity of amyotrophic lateral sclerosis-associated superoxide dismutase mutants leads to enhanced catalysis of tyrosine nitration by peroxynitrite. J. Neurochem. 69, 1936–1944 (2002).
Article Google Scholar
Kim, J. et al. Accumulation of labile zinc in neurons and astrocytes in the spinal cords of G93A SOD-1 transgenic mice. Neurobiol. Dis. 34, 221–229 (2009).
Article CAS PubMed Google Scholar
Fujisawa, T. et al. A novel monoclonal antibody reveals a conformational alteration shared by amyotrophic lateral sclerosis-linked SOD1 mutants. Ann. Neurol. 72, 739–749 (2012).
Article CAS PubMed Google Scholar
Baziyar, P., Seyedalipour, B. & Hosseinkhani, S. Zinc binding loop mutations of hSOD1 promote amyloid fibrils under physiological conditions: Implications for initiation of amyotrophic lateral sclerosis. Biochimie 199, 170–181 (2022).
Article CAS PubMed Google Scholar
Das, B. et al. A Zn-dependent structural transition of SOD1 modulates its ability to undergo phase separation. EMBO J. 42, e111185 (2022).
Sannigrahi, A. et al. The metal cofactor zinc and interacting membranes modulate SOD1 conformation-aggregation landscape in an in vitro ALS model. Elife 10, (2021).
Nagao, C., Kuroi, K., Wakabayashi, T. & Nakabayashi, T. Pro-oxidant activity of an ALS-linked SOD1 mutant in Zn-deficient form. Molecules 25, https://doi.org/10.3390/molecules25163600 (2020).
Homma, K. et al. Genome-wide siRNA screening reveals that DCAF4-mediated ubiquitination of optineurin stimulates autophagic degradation of Cu,Zn-superoxide dismutase. J. Biol. Chem. 295, 3148–3158 (2020).
Article CAS PubMed PubMed Central Google Scholar
Homma, K. et al. SOD1 as a molecular switch for initiating the homeostatic ER stress response under zinc deficiency. Mol. Cell 52, 75–86 (2013).
Article CAS PubMed Google Scholar
Hayward, L. J. et al. Decreased metallation and activity in subsets of mutant superoxide dismutases associated with familial amyotrophic lateral sclerosis. J. Biol. Chem. 277, 15923–15931 (2002).
Article CAS PubMed Google Scholar
Molnar, K. S. et al. A common property of amyotrophic lateral sclerosis-associated variants: destabilization of the copper/zinc superoxide dismutase electrostatic loop. J. Biol. Chem. 284, 30965–30973 (2009).
Article CAS PubMed PubMed Central Google Scholar
Bauer, M. S. et al. A tethered ligand assay to probe SARS-CoV-2:ACE2 interactions. Proc. Natl. Acad. Sci. USA 119, e2114397119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ray, A. et al. Single-molecule investigation of the binding interface stability of SARS-CoV-2 variants with ACE2. ACS Nanoscience Au 4, 136–145 (2024).
Koehler, M. et al. Molecular insights into receptor binding energetics and neutralization of SARS-CoV-2 variants. Nat. Commun. 12, 6977 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Nguyen, H. & Li, M. S. Antibody-nanobody combination increases their neutralizing activity against SARS-CoV-2 and nanobody H11-H4 is effective against Alpha, Kappa and Delta variants. Sci. Rep. 12, 9701 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Golcuk, M. et al. SARS-CoV-2 Delta variant decreases nanobody binding and ACE2 blocking effectivity. J. Chem. Inf. Model. 62, 2490–2498 (2022).
Article CAS PubMed PubMed Central Google Scholar
Schoeler, C. et al. Ultrastable cellulosome-adhesion complex tightens under load. Nat. Commun. 5, 5635 (2014).
Article ADS CAS PubMed Google Scholar
Claveras Cabezudo, A., Athanasiou, C., Tsengenes, A. & Wade, R. C. Scaling protein-water interactions in the martini 3 coarse-grained force field to simulate transmembrane helix dimers in different lipid environments. J. Chem. Theory Comput. 19, 2109–2119 (2023).
Article CAS PubMed Google Scholar
Valério, M. et al. Parainfluenza fusion peptide promotes membrane fusion by assembling into oligomeric porelike structures. ACS Chem. Biol. 17, 1831–1843 (2022).
Article PubMed PubMed Central Google Scholar
Dzuricky, M., Rogers, B. A., Shahid, A., Cremer, P. S. & Chilkoti, A. De novo engineering of intracellular condensates using artificial disordered proteins. Nat. Chem. 12, 814–825 (2020).
Article CAS PubMed PubMed Central Google Scholar
Abbas, M., Lipiński, W. P., Nakashima, K. K., Huck, W. T. S. & Spruijt, E. A short peptide synthon for liquid-liquid phase separation. Nat. Chem. 13, 1046–1054 (2021).
Article CAS PubMed Google Scholar
van Hilten, N., Stroh, K. S. & Risselada, H. J. Efficient quantification of lipid packing defect sensing by amphipathic peptides: Comparing martini 2 and 3 with CHARMM36. J. Chem. Theory Comput. 18, 4503–4514 (2022).
Article PubMed PubMed Central Google Scholar
Yokoi, H., Kinoshita, T. & Zhang, S. Dynamic reassembly of peptide RADA16 nanofiber scaffold. Proc. Natl. Acad. Sci. USA 102, 8414–8419 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Bowerman, C. J. & Nilsson, B. L. Self-assembly of amphipathic β-sheet peptides: insights and applications. Biopolymers 98, 169–184 (2012).
Article CAS PubMed Google Scholar
Plazinski, W., Lutsyk, V. & Plazinska, A. Exploring free energies of specific protein conformations using the martini force field. J. Chem. Theory Comput. 20, 2273–2283 (2024).
Article CAS PubMed PubMed Central Google Scholar
Sahoo, A., Lee, P.-Y. & Matysiak, S. Transferable and polarizable coarse grained model for proteins─ProMPT. J. Chem. Theory Comput. 18, 5046–5055 (2022).
Article CAS PubMed Google Scholar
Chiariello, M. G., Grünewald, F., Zarmiento-Garcia, R. & Marrink, S. J. pH-Dependent conformational switch impacts stability of the PsbS dimer. J. Phys. Chem. Lett. 14, 905–911 (2023).
Article CAS PubMed PubMed Central Google Scholar
Grünewald, F. et al. Titratable martini model for constant pH simulations. J. Chem. Phys. 153, 024118 (2020).
Article PubMed Google Scholar
Alessandri, R. et al. in A Practical Guide to Recent Advances in Multiscale Modeling and Simulation of Biomolecules. 1–34 (AIP Publishing, 2023).
Stevens, J. A. et al. Molecular dynamics simulation of an entire cell. Front. Chem. 11, 1106495 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Korshunova, K. et al. Martini 3 OliGo̅mers: A scalable approach for multimers and fibrils in GROMACS. J. Chem. Theory Comput. 20, 7635–7645 (2024).
Article CAS PubMed PubMed Central Google Scholar
Abraham, M. J. et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1-2, 19–25 (2015).
Article ADS Google Scholar
MacCallum, J. L. et al. An implementation of the Martini coarse-grained force field in OpenMM. Biophys. J. 122, 2864–2870 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Kawamoto, S. et al. SPICA Force field for proteins and peptides. J. Chem. Theory Comput. 18, 3204–3217 (2022).
Article CAS PubMed Google Scholar
Yamada, T. et al. Improved protein model in SPICA force field. J. Chem. Theory Comput. 19, 8967–8977 (2023).
Article CAS PubMed Google Scholar
Pedersen, K. B. et al. OLIVES: A Go̅-like model for stabilizing protein structure via hydrogen bonding native contacts in the martini 3 coarse-grained force field. J. Chem. Theory Comput. 20, 8049–8070 (2024).
Stroh, K. S., Souza, P. C. T., Monticelli, L. & Risselada, H. J. CGCompiler: Automated coarse-grained molecule parametrization via noise-resistant mixed-variable optimization. J. Chem. Theory Comput. 19, 8384–8400 (2023).
Article CAS PubMed PubMed Central Google Scholar
Empereur-Mot, C. et al. Automatic optimization of lipid models in the Martini force field using SwarmCG. J. Chem. Inf. Model. 63, 3827–3838 (2023).
Article CAS PubMed PubMed Central Google Scholar
Yang, S. & Song, C. Switching Go̅-Martini for Investigating Protein Conformational Transitions and Associated Protein–Lipid Interactions. J. Chem. Theory Comput. 6, 2618–2629 (2024).
Wołek, K., Gómez-Sicilia, À. & Cieplak, M. Determination of contact maps in proteins: A combination of structural and chemical approaches. J. Chem. Phys. 143, 243105 (2015).
Article ADS PubMed Google Scholar
Sobolev, V., Wade, R. C., Vriend, G. & Edelman, M. Molecular docking using surface complementarity. Proteins 25, 120–129 (1996).
Article CAS PubMed Google Scholar
martinize 2. https://github.com/marrink-lab/vermouth-martinize.
Hilpert, C. et al. Facilitating CG simulations with MAD: The MArtini database server. J. Chem. Inf. Model. 63, 702–710 (2023).
Article CAS PubMed Google Scholar
Grünewald, F. et al. Polyply; a python suite for facilitating simulations of macromolecules and nanomaterials. Nat. Commun. 13, 68 (2022).
Article ADS PubMed PubMed Central Google Scholar
de Jong, D. H., Baoukina, S., Ingólfsson, H. I. & Marrink, S. J. Martini straight: Boosting performance using a shorter cutoff and GPUs. Comput. Phys. Commun. 199, 1–7 (2016).
Article ADS MathSciNet Google Scholar
Parrinello, M. & Rahman, A. Polymorphic transitions in single crystals: A new molecular dynamics method. J. Appl. Phys. 52, 7182–7190 (1981).
Article ADS CAS Google Scholar
Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 126, 014101 (2007).
Article ADS PubMed Google Scholar
Souza, P. C. T. & Marrink, S. J. Martini 3 open-beta version. http://cgmartini.nl/index.php/martini3beta (2013).
Quemener, E. & Corvellec, M. SIDUS—the solution for extreme deduplication of an operating system. Linux J. https://doi.org/10.5555/2555789.2555792.

Download references

Acknowledgements

This work was granted access to the HPC resources of IDRIS and TGCC under the allocations 2022-A0120713456 (P.C.T.S) and 2023-A0140713456 (P.C.T.S.) made by GENCI. We also acknowledge the support of the Center Blaise Pascal’s IT test platform at ENS de Lyon (Lyon, France) for the computer facilities. The platform operates the SIDUS solution developed by Emmanuel Quemener¹²³. We thank the Center for Information Technology of the University of Groningen for providing access to the Peregrine high-performance computing cluster. We also acknowledge the National Computing Facilities Foundation of The Netherlands Organization for Scientific Research (NWO) for providing computing time. L.B.A. and P.C.T.S. would like to thank the support of the French National Center for Scientific Research (CNRS) and the funding from research collaboration agreements with PharmCADD. S.J.M. received funding from the European Research Council (ERC) through an ERC Advanced grant “COMP-MICR-CROW-MEM”. S.T. acknowledges the support from the European Commission via a Marie Skłodowska-Curie Actions individual fellowship (MicroMod-PSII, grant agreement 748895), the Center for Multiscale Modeling in Life Sciences (CMMS), the Alfons und Gertrud Kassel Foundation, and the Dr. Rolf M. Schwiete Foundation. L.M. acknowledges funding by the Institut National de la Santé et de la Recherche Médicale (INSERM), CC-IN2P3 (https://cc.in2p3.fr), and the French supercomputing centers CINES and TGCC, supported by Grand Equipement National de Calcul Intensif (GENCI), for computing resources and services (grants number A0120710138, A0140710138). M.N.M. acknowledges Fundação para a Ciência e a Tecnologia for fellowship CEECIND/04124/2017/CP1428/CT0008. A.B.P and R.A.M. acknowledge Marek Cieplak for sharing the source code of the OV + rCSU contact map in Fortran. P.P. acknowledges FAPESP support (grant 2019/26557-8). A.B.P. acknowledges financial support from the National Science Center, Poland, under grant 2022/45/B/NZ1/02519 and gratefully acknowledges Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2023/016519. R.M.-A. thanks, ANID-Chile, for financial support under FONDECYT N. 1200200.

Author information

Authors and Affiliations

Laboratoire de Biologie et Modélisation de la Cellule, CNRS, UMR 5239, Inserm, U1293, Université Claude Bernard Lyon 1, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, Lyon, France
Paulo C. T. Souza & Luís Borges-Araújo
Centre Blaise Pascal de Simulation et de Modélisation Numérique, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, Lyon, France
Paulo C. T. Souza & Luís Borges-Araújo
Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 7, Groningen, The Netherlands
Christopher Brasnett, Peter Park, Liguo Wang & Siewert J. Marrink
NEIKER, Basque Research and Technology Alliance (BRTA), Parque Científico y Tecnológico de Bizkaia, P812, Derio, Spain
Rodrigo A. Moreira
Heidelberg Institute for Theoretical Studies (HITS), Schloss-Wolfsbrunnenweg 35, Heidelberg, Germany
Fabian Grünewald
Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
Peter Park
Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, Lyon, France
Hafez Razmazma & Luca Monticelli
Institut des Biomolecules Max Mousseron, UMR5247, CNRS, Université De Montpellier, ENSCM, 1919 Route de Mende, Montpellier, Cedex, France
Hafez Razmazma
Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
Ana C. Borges-Araújo & Manuel N. Melo
Biosystems and Soft Matter Division, Institute of Fundamental Technological Research, Polish Academy of Sciences, ul. Pawińskiego 5B, 02-106, Warsaw, Poland
Luis Fernando Cofas-Vargas & Adolfo B. Poma
Departamento de Química, Facultad de Ciencias, Universidad de Tarapacá, Arica, Chile
Raúl Mera-Adasme
PharmCADD, Busan, Republic of Korea
Sangwook Wu
Department of Physics, Pukyong National University, Busan, Republic of Korea
Sangwook Wu
Frankfurt Institute for Advanced Studies, Ruth-Moufang-Straße 1, Frankfurt am Main, Germany
Sebastian Thallmair

Authors

Paulo C. T. Souza
View author publications
Search author on:PubMed Google Scholar
Luís Borges-Araújo
View author publications
Search author on:PubMed Google Scholar
Christopher Brasnett
View author publications
Search author on:PubMed Google Scholar
Rodrigo A. Moreira
View author publications
Search author on:PubMed Google Scholar
Fabian Grünewald
View author publications
Search author on:PubMed Google Scholar
Peter Park
View author publications
Search author on:PubMed Google Scholar
Liguo Wang
View author publications
Search author on:PubMed Google Scholar
Hafez Razmazma
View author publications
Search author on:PubMed Google Scholar
Ana C. Borges-Araújo
View author publications
Search author on:PubMed Google Scholar
Luis Fernando Cofas-Vargas
View author publications
Search author on:PubMed Google Scholar
Luca Monticelli
View author publications
Search author on:PubMed Google Scholar
Raúl Mera-Adasme
View author publications
Search author on:PubMed Google Scholar
Manuel N. Melo
View author publications
Search author on:PubMed Google Scholar
Sangwook Wu
View author publications
Search author on:PubMed Google Scholar
Siewert J. Marrink
View author publications
Search author on:PubMed Google Scholar
Adolfo B. Poma
View author publications
Search author on:PubMed Google Scholar
Sebastian Thallmair
View author publications
Search author on:PubMed Google Scholar

Contributions

P.C.T.S. and S.T. conceived and initiated the project, with later contributions into the design of the project byfrom S.J.M. and A.B.P. L.B.A., F.G., and S.T. wrote the codes. L.B.A. and P.C.T.S. reviewed and detailed the Martini 3 protein model, including the compilation of the partitioning free energies and SASA of the side-chain analogs. S.T. contributed to the studies involving the PH domain of PLCδ1. P.C.T.S. contributed to the studies involving the T4-lysozyme and SOD1, with additional contributions of R.M.A. and S.T. for the SOD1 work. R.A.M., L.F.C.V., and A.B.P. contributed to the nanomechanical studies. H.R., L.M., and P.C.T.S. contributed to the flexibility improvement benchmarks via refinements in the contact map and strength of interactions. C.B., L.W., P.P., and P.C.T.S contributed to the IDP and biomolecular condensate studies. L.B.A., A.C.B.A., S.W., M.N.M., and P.C.T.S. contributed to the transmembrane peptide studies. P.C.T.S. and S.T. wrote the manuscript with contributions from all authors. All authors discussed the results, revised the manuscript, and approved the final version of the manuscript. S.J.M., S.W., P.C.T.S., and A.B.P. provided most of the financial and computational resources of the project, with contributions from L.M. and S.T.

Corresponding authors

Correspondence to Paulo C. T. Souza, Siewert J. Marrink, Adolfo B. Poma or Sebastian Thallmair.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Robin Corey, and the other anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Souza, P.C.T., Borges-Araújo, L., Brasnett, C. et al. GōMartini 3: From large conformational changes in proteins to environmental bias corrections. Nat Commun 16, 4051 (2025). https://doi.org/10.1038/s41467-025-58719-0

Download citation

Received: 17 April 2024
Accepted: 26 March 2025
Published: 30 April 2025
Version of record: 30 April 2025
DOI: https://doi.org/10.1038/s41467-025-58719-0

This article is cited by

Oxygen depletion in biomolecular condensates is dominated by macromolecular density
- Ankush Garg
- Christopher Brasnett
- Magnus Kjaergaard
Nature Communications (2026)
Charged peptides enriched in aromatic residues decelerate condensate ageing driven by cross-β-sheet formation
- Ignacio Sanchez-Burgos
- Andres R. Tejedor
- Jorge R. Espinosa
Nature Communications (2025)
An integrative modelling approach to the mitochondrial cristae
- Chelsea M. Brown
- Marieke S. S. Westendorp
- Tsjerk A. Wassenaar
Communications Biology (2025)