Abstract
Cross-linking mass spectrometry (XL-MS) is a powerful technology for mapping protein-protein interactions (PPIs) at the systems level. While bivalent cross-links are effective for defining protein interactions and structures, multivalent cross-links offer enhanced spatial resolution to facilitate characterization of heterogeneous protein complexes. However, their identification remains challenging due to fragmentation complexity and the vast expansion of database search space. Here, we present tris-succinimidyl trioxane (TSTO), a novel trioxane-based, MS-cleavable homotrifunctional cross-linker capable of targeting three proximal lysines simultaneously. TSTO’s unique MS-cleavability enables concurrent release of cross-linked peptide constituents during collision-induced dissociation, permitting their unambiguous identification. The TSTO-based XL-MS platform is effective for mapping cellular networks from intact cells and tissues, illustrating its versatility for complex biological systems. Trimeric interactions captured by TSTO reveal structural details inaccessible to bifunctional reagents, enhancing modeling accuracy and precision. Furthermore, this development opens a new avenue for designing multifunctional MS-cleavable cross-linkers to further advance structural systems biology.
Similar content being viewed by others
Introduction
Cross-linking mass spectrometry (XL-MS) has emerged as a transformative technology for interactomics and structural proteomics1,2,3,4. By covalently linking proximal amino acid residues within or between proteins, XL-MS provides unique insights into the architecture of protein complexes and the spatial arrangement of individual protein domains. Due to various innovations, this methodology has revolutionized structural biology by complementing traditional structural techniques and offering a more holistic view of protein assemblies in their native environments4,5,6,7,8. Over the years, XL-MS has significantly evolved with the incorporation of various functionalities and targeting groups in cross-linker design, broadening its applications and enhancing the scope of XL-MS studies. In particular, diverse MS-cleavable cross-linking reagents have been developed to facilitate the detection and identification of cross-linked peptides1,3,4, enabling proteome-wide analyses of cellular networks to elucidate their structural organization from intact cells, subcellular organelles, tissues and clinical samples.
Up to now, the majority of cross-linking reagents are bifunctional (carrying two reactive groups), and current XL-MS analyses have focused on the identification of cross-links between two residues to infer pair-wise interactions. While bivalent cross-links are effective for PPI mapping, multivalent cross-links can offer enhanced spatial resolution and provide additional distance restraints to facilitate structural analysis of protein complexes. Furthermore, these higher-order cross-links can uncover interaction sites and conformations that are not easily detectable with binary cross-linkers. The resulting multimeric interactions can lead to the discovery of new functional insights and regulatory mechanisms within protein complexes, especially for those dynamic and heterogeneous ones. Although multivalent cross-links can be formed by bifunctional cross-linkers, the difficulty of their identification has rendered them largely invisible. Due to the additional expansion of the search space associated with each successive peptide, traditional non-cleavable cross-linking reagents are incapable of this task. As a result, MS-cleavable cross-linking reagents are critical for alleviating this issue by physically separating cross-linked constituents to facilitate their individual identifications1,9,10. During the course of our study, a homotetrafunctional MS-cleavable cross-linking reagent utilizing four NHS ester-targeting groups (aka Bisby) was reported11. However, the additional bonds required for cross-linking each successive peptide complicate cross-link fragmentation and identification. Therefore, new approaches are needed to permit effective identification of multivalent cross-links, especially for proteome-wide PPI profiling.
To this end, we aimed to design an MS-cleavable trifunctional cross-linker capable of simultaneously targeting three residues, while more importantly streamlining peptide identification through a core structure that enables synchronous release of all cross-linked peptides in a single step. This will allow us to expand the detection of protein connectivity and provide additional restraint information for improved structural elucidation at the systems level. Here, we present the design, synthesis and characterization of a novel membrane-permeable, MS-cleavable homotrifunctional cross-linker, TSTO (tris-succinimidyl trioxane), to enable simultaneous capture and identification of trimeric PPIs. XL-MS analysis of human 26S proteasomes has demonstrated that TSTO is effective in cross-linking protein complexes and that the resulting cross-linked peptides display unique and predicable fragmentation during collision-induced dissociation (CID), enabling their simplified and accurate identification using multistage mass spectrometry (MSn). Importantly, we have captured trimeric interactions to better define interfaces between proteasome subcomplexes. Moreover, TSTO has been successfully applied for in vivo cross-linking of HEK293 cells and mouse heart tissues, demonstrating its applicability in elucidating cellular networks. Apart from binary interactions, trimeric interactions captured by TSTO cross-linking facilitated the characterization of protein oligomers and enhanced structural modeling with improved accuracy and precision. In summary, we show that TSTO can uncover multimeric interactions to yield more detailed PPI networks, advancing our understanding of cellular processes and biological function.
Results
Developing a Trioxane-based MS-cleavable Homotrifunctional Cross-linker TSTO
In order to capture trimeric interactions, we designed a trioxane-based cross-linker TSTO and accomplished its synthesis through five steps (Fig. 1a). As shown, TSTO carries a unique symmetrical structure comprising three NHS esters connected via a central trioxane group12,13 (Fig. 1a), permitting concurrent cross-linking between three lysine residues to form a trivalent cross-link among three individual peptides (aka tripeptide tri-link, [α,β,γ]) (Fig. 1b, Type I). In comparison to a traditional cross-link between two individual peptides, accurate identification of a tri-link would be much more challenging due to further expansion of the database search space (n3). Therefore, the design of the central trioxane is crucial as it carries three equal MS-cleavable bonds that are weaker than peptide bonds and can be cleaved in parallel using CID to simultaneously release all three cross-linker arms in a single step. This leads to physical separation of the three cross-linked peptide constituents, yielding three fragment ions during MS2 analysis that can be subjected to MS3 sequencing (Fig. 1b, Type I). As shown, trioxane cleavage results in an identical and defined aldehyde remnant (AR) on each peptide constituent, allowing for their unambiguous identification. In addition, this minimizes the total number of MS2 fragments, simplifying ion selection for subsequent MS3 analysis. In addition to tripeptide TSTO cross-links, tri-links can be formed between two peptides, linking one lysine in one peptide (α) and two lysines in another peptide (β) (aka dipeptide tri-link, [α-β2]) (Fig. 1b, Type II). For dipeptide tri-links, two fragment ions would be observed in MS2, one corresponding to a peptide carrying a single AR-modified lysine and the other representing a peptide carrying two AR-modified lysines. Finally, TSTO cross-linking can yield traditional cross-links in which two lysines from two different peptides are cross-linked while the third NHS ester of TSTO is hydrolyzed (aka, dipeptide bi-link, [α-β]) (Fig. 1b, Type III). MS2 analysis of a dipeptide bi-link would yield two fragment ions corresponding to peptides each carrying a single AR-modified lysine, while the hydrolyzed arm is released as a neutral loss. In addition to TSTO inter-links formed by two or three peptides as described above, monomeric cross-linked species containing a single peptide including ‘intra-links’ and ‘dead-ends’ can occur due to the hydrolysis of one or two NHS esters of TSTO. Notably, TSTO inter-linked peptides are the most structurally informative products for PPI mapping, and thus the focus of our analysis. Owing to their unique MS-cleavability, TSTO cross-linked peptides can be identified using the same LC-MSn workflow that has been previously established for our sulfoxide-containing MS-cleavable cross-linkers1,4,14.
a TSTO synthesis. b TSTO cross-link products and their expected fragmentation during CID analysis. For tripeptide tri-links (Type I), MS2 yields three fragment peptides with single AR-modified lysines. Dipeptide tri-links (Type II) produce two fragment peptides, one with a single AR-modified lysine and one with two. Finally, dipeptide bi-links (Type III) result in two fragment peptides, each with a single AR-modified lysine. MS3 analyses of these fragments enable unambiguous identification of TSTO tri- and bi-links.
Characterization of TSTO Cross-linked Synthetic Peptide by MSn Analysis
We first characterized TSTO cross-linking on the synthetic peptide Ac-SR8 (Ac-SAKAYEHR). Under our experimental conditions, three Ac-SR8 cross-linked products were detected: dead-end modified Ac-SR8 (αDN), inter-linked Ac-SR8 homodimer [α-α], and Ac-SR8 homotrimer [α, α, α]. MS2 analysis of dead-end modified Ac-SR8 (m/z 667.31632+) yielded two dominant ions (m/z 542.26562+, 551.27082+) (Supplementary Fig. 1a). MS3 peptide fragment analysis determined these ions to be AR-modified Ac-SR8, with the lower mass ion corresponding to an AR moiety undergoing water loss (namely AR*), resulting in the detection of an ion doublet (αAR and αAR*) with mass difference (Δ) of 18.02 Da (Supplementary Fig. 1b, c). Two differently-charged species of inter-linked Ac-SR8 homodimer [α–α] were detected (m/z 773.70713+, 580.53194+), each fragmenting into dominant ions corresponding to αAR and αAR* during MS2 analysis (Supplementary Fig. 2a, b). Similarly, MS2 analysis of the Ac-SR8 homotrimer tri-link [α, α, α] (m/z 826.65084+) yielded three dominant ions (αAR*2+/αAR2+/αAR1+) (Supplementary Fig. 2c). While MS3 analyses of both αAR and αAR* resulted in their unambiguous identification (Supplementary Fig. 1b, c), selecting AR*-modified peptides for sequencing would be preferred due to the AR moiety’s propensity for dehydration.
Characterization of TSTO Cross-linked BSA by MSn Analysis
To characterize TSTO cross-linking in proteins, we performed XL-MS analysis on the model protein BSA, focusing on TSTO inter-linked peptides. As a result, all three types (I-III) of TSTO inter-linked peptides were identified by LC MSn, each displaying the characteristic MS2 fragmentation as expected. This is illustrated by MSn analyses of representative TSTO cross-linked BSA peptides (Supplementary Fig. 3). For a tripeptide tri-link [α, β, γ] (m/z 795.71206+), its MS2 analysis yielded three sets of dominant ions corresponding to αAR/αAR*, βAR/βAR*, and γAR/γAR* fragments (Supplementary Fig. 3a). As shown, MS3 analyses of the three cross-link fragments αAR* (m/z 638.31302+), βAR* (m/z 773.8743+), and γAR* (m/z 947.9347+) identified a tripeptide TSTO tri-link among BSA lysines K228, K374, and K498 (Supplementary Fig. 3b). MS2 analysis of a dipeptide tri-link [α-β2] (m/z 1054.75304+) resulted in two sets of dominant ion species: αAR/αAR*, and β2AR/βAR_AR*/β2AR* (Supplementary Fig. 3c). The detection of a fragment triplet with 18 Da increments indicates that peptide β carries two modified lysines, whereas peptide α only contains a single modified lysine. MS3 analyses of αAR* and β2AR* identified their sequences as 372LAKAR*EYEATLEECCAK386 and 490TPVSEKAR*VTKAR*CCTESLVNR507, respectively (Supplementary Fig. 2d), signifying a dipeptide tri-link [BSA:K374 - BSA:K495, K498]. Finally, for a dipeptide bi-link [α-β] (m/z 992.70174+), MS2 fragmentation produced two dominant ion pairs: αAR/αAR*, and βAR/βAR* (Supplementary Fig. 3e); MS3 analyses of αAR* and βAR* determined a cross-link between BSA:K117 and BSA:K489 (Supplementary Fig. 3f).
In total, 823 redundant cross-link spectra matches (CSMs) were identified, corresponding to 167 unique ones (Supplementary Data 1). Of these, 21 were tripeptide tri-links, 24 were dipeptide tri-links, and 122 were dipeptide bi-links. Overall, tri-links contributed ~ 27% (45/167) of the total unique CSMs. Breaking down tripeptide and dipeptide tri-links into their respective constituent residue pairs, a combined total of 118 K-K pairs were identified, with 37 being contributed by both TSTO tri- and bi-links, whereas 50 and 31 were unique contributed by TSTO bi-links and tri-links, respectively.
Considering the spacer arm length of TSTO (∼ 14 Å), lysine residues with Cα-Cα distance ≤ 35 Å were expected to be preferentially cross-linked. When mapped to the high-resolution crystal structure of BSA (PDB:4F5S) (Supplementary Fig. 4a), the overall mapped distance median was 21.1 Å with a satisfaction rate of cross-links under ≤ 35 Å of 90%. Taken together, these results demonstrate that TSTO is effective for protein cross-linking, and the resulting cross-linked peptides exhibit unique MS2 fragmentation patterns that are both predictable and reliable for unambiguous identification by LC MSn analysis.
TSTO XL-MS Analysis of the 26S proteasome
To explore TSTO’s capability in XL-MS analysis of protein complexes, we performed TSTO cross-linking of affinity-purified human 26S proteasomes. Similar to BSA, all three types of TSTO inter-linked peptides of the 26S proteasome were detected (Fig. 2). This was exemplified by MSn analyses of a representative tripeptide tri-link [α, β, γ] (m/z 998.93545+) among Rpt1:K116, Rpt4:K206, and Rpt3:K238 (Fig. 2a, b), a dipeptide tri-link [α-β2] (m/z 1209.09614+) [Rpn12:K281-Rpn3:K455, K461] (Fig. 2c, d), and a dipeptide bi-link [α-β] (m/z 773.41804+) between Rpt4:K72 and Rpt6:K222 (Fig. 2e, f). Cumulatively, TSTO-based XL-MS analysis of the 26S proteasome resulted in a total of 808 unique CSMs (Supplementary Data 2). Of these, 41.3% were tri-links, indicating that TSTO can efficiently cross-link three spatially proximal residues within the proteasome. To assess the accuracy of TSTO cross-links, we mapped them to a high-resolution 26S proteasome structure (PDB: 7QY715). As a result, all of the mapped tri-links and bi-links have ~ 90% satisfaction rates (≤35 Å), supporting their validity (Supplementary Fig. 4). To understand whether the introduction of a supplementary reactive group is beneficial for describing protein topologies, we constructed a TSTO XL-PPI map of the 26S proteasome comprising 92 edges (Fig. 3a). Compared to previous XL-MS studies using bifunctional linkers16,17, 31 additional inter-subunit PPIs were identified, with most of them at interaction interfaces between proteasomal subcomplexes. To better understand the benefits of TSTO over conventional reagents, we next examined the interactions captured by tri-links. Among the 35 trimeric interactions within the proteasome, 13 described multimeric interactions among the six ATPase subunits (Rpt1-6), illustrating their proximity within the 19S base subcomplex (Fig. 3b). In addition, the connectivity between the 19S and 20S subcomplexes was described by three trimeric interactions, including one among Rpt4, α1 and α7, and one between Rpt1 and α4 (Fig. 3c, d). Apart from trimeric interactions involving three proteins, TSTO was able to concurrently place two distant residues of one protein (α4: K27, K166) to another (Rpt1:K418), due to their proximity in the three-dimensional structure. Another tri-link involving Rpt6 and α3 exemplifies the capability of TSTO to capture interactions involving adjacent residues (Rpt6:K397, K402) that may be missed by bifunctional linkers due to the tendency to form intra-links (loop-links). Moreover, TSTO cross-links placed a small proteasome subunit Dss1 in proximity to Rpn3, Rpn6, and Rpn7 within the 19S lid (Fig. 3e and Supplementary Fig. 6), which has not been reported by previous XL-MS studies of the human 26S proteasome16,17. Nonetheless, these interactions were supported by the two human 26S structures (PDB: 6MSB18 and 7QY715) as all cross-links between Dss1 and Rpn3, Rpn6, and Rpn7 have Cα-Cα distances ≤26 Å, falling below the distance threshold (Fig. 3E and Supplementary Fig. 5). Collectively, these results have demonstrated that TSTO is effective in capturing multimeric interactions of proteasome complexes, providing three-dimensional contacts for the first time to support the spatial organization of these proximal subunits.
a MS2 fragmentation of a tripeptide tri-link [α, β, γ] (m/z 998.93545+) yielded a series of dominant ion doublets corresponding to αAR/αAR*, βAR/βAR*, and γAR/γAR* peptides. b MS3 analyses of the cross-link fragments αAR* (m/z 652.37622+), βAR* (m/z 895.47292+), and γAR* (m/z 922.98332+) identified them as 110YIINVKAR*QFAK120, 198VVSSSIVDKAR*YIGESAR213, and 230VVGSEFVQK AR*YLGEGPR245, signifying a tripeptide tri-link among Rpt1:K116, Rpt4:K206, and Rpt3:K238. c MS2 fragmentation of a dipeptide tri-link [α-β2] (m/z 1209.09614+) yielded two dominant sets of ions—a doublet representing αAR/αAR* and a triplet representing β2AR/βAR_AR*/β2AR* peptides. d MS3 analyses of the cross-link fragments αAR* (m/z 761.40222+) and β2AR* (m/z 1086.86122+) identified their sequences as 278AYEKAR*ILFTEATR289 and 444DGVIEASINHEKAR*GYVQSKAR*EMIDIYSTR470 respectively, signifying a dipeptide tri-link among Rpn12:K281, Rpn3:K455, and Rpn3:K461. e MS2 fragmentation of a dipeptide bi-link [α-β] (m/z 773.41804+) resulted in two dominant ion doublets αAR/αAR* and βAR/βAR*. f MS3 analyses of the cross-link fragments αAR* (m/z 591.83562+) and βAR* (m/z 878.96642+) identified their sequences as 69FIVKAR*ATNGPR78 and 214VSGSELVQKAR*FIGEGAR229, signifying a pair-wise cross-link between Rpt4:K72 and Rpt6:K222. Note: AR: aldehyde remnant moiety; AR*: aldehyde remnant moiety after water loss (i.e., AR-H2O). Source data are provided as a Source Data file.
a XL-PPI map of 26S cross-links. In total, 32 subunits were identified participating in inter-subunit interactions—19 from the 19S subcomplexes and 13 from the 20S subcomplex. 19S lid subunits shown in blue, 19S base subunits in purple, 20S α ring subunits in yellow, and 20S β ring subunits shown in green. Red lines represent interactions that have been captured as tri-links; gray lines represent other cross-links. Mapping of trimeric interactions to a high-resolution 26S proteasome structure: (b) lining the inner pore between Rpt subunits of the AAA-ATPase ring (19S base subcomplex), (c, d) Interactions describing connectivity of 19S base subcomplex subunits to the 20S, and (e) Dss1 with 19S lid subunits Rpn3 and Rpn7.
Integrative structure modeling using synthetic and experimental XL-MS data
To determine the value of trivalent versus bivalent cross-links, we applied integrative structure modeling to compute the structure of the proteasome base subcomplex (subunits Rpt1–Rpt6 and Rpn2). First, we generated synthetic datasets for the proteasome base subcomplex, replicating the experimental distance distribution of the proteasome TSTO dataset (Supplementary Data 2) to systematically compare trifunctional and bifunctional cross-linkers. Subsets of 20, 30, 40, 60, and 120 cross-links were randomly sampled based on the distance probability distribution, with at least six replicates created for each subset size. For this analysis, a cross-linking site was defined as the set of residues bridged by a single cross-linker. To illustrate the uniqueness of trifunctional linkages, we have focused our analysis on the comparison of trivalent and bivalent cross-links for integrative modeling. For each trivalent cross-link, a corresponding bivalent cross-link utilizing two of the three cross-linked lysines was used to ensure that the synthetic data reflects the spatial restraints of each cross-linker, providing a robust foundation for structural modeling.
Ensembles of the proteasome subcomplex configurations that satisfy the input information (i.e., the model) were found by exhaustive Monte Carlo sampling guided by the scoring function, starting with random initial configurations of the rigid components. The models computed using trivalent cross-links were generally more accurate and precise than the ensembles computed using bivalent ones. The accuracy is defined as the average Cα root-mean-square deviation (RMSD) between the cryo-EM structure (PDB ID: 5GJR) and each of the structures in the ensemble, while the precision is defined as the average RMSD between all solutions in the ensemble. Increasing the number of both trivalent and bivalent cross-links used for modeling improved model accuracy and precision, with diminishing returns observed as the number of cross-links increased from 40 to 120. For datasets containing bivalent cross-links, the average model accuracy plateaued at ~ 9.8 Å, while for trivalent cross-links, the plateau occurred at ~ 8.7 Å. The trivalent cross-linking data consistently produced better model accuracies, with average RMSDs of 19.7, 14.1, 9.7, 8.8, and 8.7 Å for datasets with 20, 30, 40, 60, and 120 cross-links, respectively. In contrast, the bivalent cross-linking data resulted in higher average RMSDs of 25.3, 19.0, 13.3, 9.7, and 9.8 Å for the same subset sizes. A similar trend was observed for the cluster precisions (Fig. 4). Furthermore, we computed the structure of the base subcomplex of the 26S human proteasome using integrative modeling with experimentally derived DSSO and TSTO cross-linking datasets. The DSSO dataset included 83 cross-links within the base subcomplex17. The TSTO dataset comprised 18 trivalent and 143 bivalent cross-links; from the TSTO data, 65 bivalent cross-links were randomly selected (five replicates) to ensure that the total number of cross-linking sites was consistent between the DSSO and TSTO datasets. Models computed using the TSTO dataset were considerably more accurate than models computed with the DSSO dataset (15.3 Å vs 33.2 Å). These results highlight TSTO’s ability to provide richer structural information, such as the spatial positioning of three protein regions, which is critical for generating accurate structural models.
a Model accuracy and (b) cluster precision were assessed as a function of the number of cross-links, using six replicates per condition with synthetic bifunctional and trifunctional cross-linker data. Black lines indicate mean values for each subset. c Model accuracy and (d) cluster precision were compared for models generated using TSTO (18 trivalent + 65 bivalent) and DSSO (83 bivalent) cross-linking data. For experimental TSTO data in (b) and (d), five datasets were generated, each combining all 18 trivalent with 65 randomly selected (from 143 available) bivalent cross-links. Source data are provided as a Source Data file.
In Vivo TSTO Cross-linking of the HEK293 Cells
We next investigated whether TSTO was applicable for system-wide XL-MS analysis to delineate cellular networks. To this end, we performed TSTO cross-linking of HEK293 cells stably expressing HBTH-tagged CSN2, a subunit of the COP9 signalosome. To evaluate TSTO in-cell cross-linking efficiency, immunoblot analysis was carried out to probe the formation of CSN2-containing cross-linked products, which were represented by high molecular weight protein bands. Based on the formation of CSN2-containing oligomer and its increased abundance with increasing cross-linker concentration, TSTO was determined to be membrane-permeable and suited for in-cell cross-linking (Supplementary Fig. 7). The general TSTO-based in vivo XL-MS workflow is illustrated in Fig. 5, in which cross-linked peptides were subjected to two-dimensional peptide separation prior to LC-MSn analysis19. Across two biological replicates, we identified a total of 9079 unique CSMs (Supplementary Data 3), of which 32.3% were tri-links. As shown, TSTO tri-links increased the total PPI yield by ~ 27% compared to bi-link data alone (Supplementary Fig. 8a). Altogether, TSTO in-cell cross-linking yielded an XL-proteome of 1512 proteins containing 1242 PPIs (Supplementary Fig. 8b). These results indicate that TSTO cross-linking of intact cells is effective, and the presence of tri-links remains abundant in increasingly complex systems. Importantly, accurate identification of TSTO cross-links at the systems level was achieved using LC-MSn.
To examine the validity of the TSTO cross-links, we first mapped them to available high-resolution structures of protein complexes identified here (Supplementary Fig. 8c)20. In total, 1790 K-K linkages were mapped across 539 CORUM complexes and 95% of them were considered satisfactory (≤35 Å). Next, we performed gene ontology (GO) analysis and confirmed that the TSTO XL-proteome covers a wide range of molecular functions, biological processes, and cellular components (Supplementary Fig. 9a). Compared to BioGRID and BioPlex databases, 48% of the TSTO XL-PPIs were known and 52% were novel. The STRING scores were found for 501 of the XL-PPIs, and ~ 70% were determined to be above 0.8 (Supplementary Fig. 9b), indicating high-confidence interactions19,20. Overall, among the 1242 inter-protein PPIs identified within this TSTO in vivo dataset, roughly one-third were novel when compared to an aggregate of recent systems-level cross-linking studies19,21,22,23,24. This suggests that while most XL-PPIs are supported, TSTO provides information that was not detected by bifunctional MS-cleavable reagents. Finally, to estimate the dynamic range of the XL proteome captured by TSTO cross-linking, we plotted the abundance distribution of the identified cross-linked proteins based on their copy numbers as determined by shotgun proteomics25 (Supplementary Fig. 10). Compared to the MS-proteome, the TSTO XL-proteome was shifted towards higher abundance proteins, however, it is comparable to previous proteome-wide XL-MS studies19,20,26. Nevertheless, TSTO is capable of targeting cellular proteins across all cellular compartments and capturing interactions among proteins spanning five orders of magnitude. Collectively, these results demonstrate that TSTO is effective for global profiling of endogenous PPIs from their cellular environments.
Multimeric Interactions of In Vivo Protein Complexes
Importantly, TSTO XL-MS has enabled the identification of multimeric interactions within various protein complexes. One well-represented complex is the 80S ribosome machinery. Specifically, extensive interactions describing subunit proximities were identified, particularly those between the 40S or 60S subcomplexes. This allows for an in-depth description of ribosomal PPIs with 3-D contacts (Supplementary Fig. 11). Interestingly, trimeric interactions were also found between 80S and several putative ribosome-binding partners, providing structural insights underlying their functional relevance in protein synthesis. Of these, the most frequent was SERBP1 (SERPINE mRNA-binding protein 1), which was shown to interact with 60S ribosome subunits RPL7A, RPL27, and RPL34 through several tri-links. While eukaryotic SERBP1 (Stm1 in yeast) has been associated with dormant ribosomes due to its role in clamping 40S and 60S subunits together to prevent mRNA access, it has been shown to primarily contact 40S subunits27,28,29. Interestingly, TSTO identified ribosome-SERBP1 cross-links spanning the 80S, including distant 60S subunits. In current high-resolution structures of ribosome-bound SERBP1, the majority of SERBP1 is unresolved, likely buried within the 80S. Together, the results suggest that the interaction of SERBP1 with ribosomal proteins is extensive, contacting various 40S and 60S subunits to inactivate 80S machinery. Another trivalent cross-link was identified between 40S ribosome subunits and UBAP2L (Ubiquitin-associated protein 2-like). While UBAP2L is a known RNA-binding protein (RBP) that may associate with ribosomes to facilitate protein synthesis30, its function and roles remain poorly characterized. TSTO identified a trimeric interaction between UBAP2L and neighboring 40S subunits RPS7 and RPS27, triangulating its position near the small ribosomal complex and correlating well with the role of the 40S in initial binding and reading of mRNA.
One notable aspect is TSTO’s unique ability to define trimers, especially homomeric trimers (Supplementary Data 3). This has been challenging for bifunctional linkers due to the difficulty in identifying multimeric interactions. For instance, the trimer of nucleoside-disphosphate kinase B (NME2) was detected due to the identification of a TSTO tri-link connecting three identical sequences containing NME2:K100. Interestingly, NME2 is known to form homohexameric structures comprising two stacked homotrimers. When mapped to a high-resolution structure of NME2 (PDB:8PYW), the loop regions containing the K100 residues of each homotrimer were found to localize along the axis of the hexamer, within 9.5 Å of one another (Fig. 6a)31. Similarly, a homotri-link was identified for the 10 kDa mitochondrial heat shock protein (HSPE1) through its K54 and K56 residues, suggesting an oligomeric complex. Indeed, HSPE1 has been shown to form a homoheptameric ring within the human mitochondrial chaperonin ‘football’ complex (PDB: 4PJ1)32, in which the longest distance spanned between any pair of HSPE:K54 or HSPE:K56 residues within neighboring triplets was 20 Å (Fig. 6b). In addition, homotrimeric TSTO cross-links were identified from proteins that are known to assemble into oligomeric complexes but currently lack high-resolution structures. For instance, ATPase family AAA domain-containing protein 3 A (ATAD3A) was shown to oligomerize through a specific residue (K262). Using AlphaFold333, we predicted the structure of an ATAD3A trimer at high confidence (90 > plDDT > 70) and mapped all possible K262 interactions satisfactorily (<18.5 Å) (Fig. 6c), exemplifying the capability of TSTO to facilitate structural modeling of protein oligomers.
a Tri-link through K100 residues of three NME2 proteins, mapped to a high-resolution structure (PDB: 8PYW). b Trimeric cross-links involving residues K54 and K56 of three HSPE1 proteins mapped to a high-resolution structure (PDB:4PJ1). c Tri-link through K262 of three ATAD3a proteins, mapped to an AlphaFold3-generated model for an ATAD3a trimer.
In Vivo TSTO Cross-linking of Mouse Heart Tissue
To illustrate the feasibility of TSTO in another biological context, we performed in-tissue cross-linking to identify cross-linked peptides using the same workflow for in-cell XL-MS as established above. Here, TSTO cross-linking of intact mouse hearts resulted in the identification of a total of 4823 CSMs using LC MSn (Supplementary Data 4). Among them, 191 unique trivalent and 861 unique bivalent cross-links were identified, providing a snapshot of protein interactions within the heart. The most abundant XL-PPIs involved proteins that are major structural and functional components of cardiac tissue (Supplementary Fig. 12), including actin, myosin, tropomyosin, and ATP synthase subunits. This observation correlates with the known role of these proteins in heart muscle function, particularly in the sarcomere—the contractile unit of muscle fibers.
To further explore protein interactions within the cardiac muscle, we generated an XL-PPI map focusing on the interactions of cardiac actins with myosins, tropomyosins, and troponins, all of which are key regulators of muscle contraction (Supplementary Fig. 13a). Myosin, a motor protein, binds actin and its function is regulated by tropomyosin and troponin. Tropomyosin, a coiled-coil protein, runs along actin filaments and controls myosin access to actin-binding sites, while the troponin complex (composed of troponin C, T, and I) binds actin and tropomyosin to modulate contraction by controlling myosin-binding site exposure. Consistent with these molecular functions, our cross-linking data revealed interactions among these proteins, with the most direct connections observed between myosin and actin, while tropomyosin and troponins were cross-linked with both actin and to each other (Supplementary Fig. 13a). Notably, trivalent cross-links were identified not only within myosins and tropomyosins but also between troponin and actin, as well as between actin and tropomyosin (Supplementary Data 4). In addition, we found that tropomyosin was cross-linked to an ATP synthase subunit. While this interaction is not typically associated with sarcomere function, it could suggest a potential regulatory link between contractile and energy-producing processes, possibly reflecting coordinated control of ATP synthesis and muscle contraction in cardiac cells. Further Gene Ontology (GO) enrichment analysis of heart-specific proteins captured by TSTO cross-linking revealed a significant representation of pathways associated with cellular energy metabolism, particularly those involved in ATP synthesis and mitochondrial function, which are critical for maintaining cardiac contractility and overall heart function (Supplementary Fig. 13b–d). Taken together, our results have shown the feasibility of TSTO for in-tissue cross-linking, expanding its applicability for different biological samples.
Discussion
In this work, we have designed, synthesized, and characterized a novel trioxane-based, MS-cleavable, membrane-permeable homotrifunctional cross-linker, TSTO, to dissect multimeric protein interactions. This cross-linker enables simultaneous cross-linking of up to three proteins, allowing for more in-depth PPI analysis and providing additional restraints to advance structural analysis of protein assemblies. We have demonstrated that all types of TSTO cross-linked peptides display unique and predictable CID-induced fragmentation and can be unambiguously identified using LC MSn analysis. TSTO’s distinct ability to concurrently release all three cross-link arms and leave an identical remnant on each cross-linked residue establishes it a brand-new class of MS-cleavable reagent. In addition, this distinctive feature minimizes the number of theoretical MS fragments corresponding to each peptide constituent, simplifying ion selection for subsequent MS3 analysis. While MS2-based data acquisitions have become popular for analyzing MS-cleavable dipeptide cross-links, we envision that MS3-based approaches would be preferred for the identification of tripeptide cross-links. The co-fragmentation of three peptides within a single spectrum heavily convolutes database searching, impeding identification of higher-order cross-linked species by MS2-based approaches. Moreover, the central trioxane presents an innovative core structure for developing multifunctional MS-cleavable cross-linkers as one arm can be replaced with other functional groups—such as enrichment, isobaric, or reporter tags—to enable cross-link purification, quantitation, or further improve cross-link detection and identification. The NHS ester groups within TSTO can also be replaced by other reactive chemistries to target specific or non-specific amino acids21,34,35,36, expanding the coverage of XL proteomes. As with TSTO, the cleavage of the trioxane within the mass spectrometer would release any additional functional groups, preventing their impact on cross-linked peptide identification. Thus, the development of TSTO presented here opens a new direction for designing diverse cross-linkers to further advance XL-MS technologies.
Compared to cross-linking reagents that target two residues, TSTO enhances the localization of interacting proteins by simultaneously targeting three residues, triangulating a multi-point attachment that provides greater spatial resolution and allows more precise mapping of protein interfaces and interaction sites. This is particularly useful for modeling multimeric interactions, especially for protein complexes that can exist in different compositional states. As bifunctional cross-linkers can only provide spatial restraints between two residues, it can be difficult to correctly assign them to individual compositional forms without reiterative modeling. Therefore, trivalent cross-links offer an additional restraint to help facilitate characterization of multi-protein interactions. As shown with cross-linking of the 26S proteasome, TSTO has identified trimeric interactions bridging subunits of different proteasome subcomplexes that accurately describe their positioning within the 26S holocomplex—such as those between the 19S lid and base, and the 19S base and 20S subcomplexes. In addition, the tri-links have accurately described the orientation of multi-subunit interactions within their respective subcomplexes, such as the localization of Dss1 to the outer 19S lid and the close proximities of solvent-accessible residues of the ATPase ring subunits lining the central pore of the 19S base. Moreover, integrative structural modeling of a proteasome subcomplex has demonstrated that trivalent cross-links are advantageous in providing additional spatial information to facilitate structure modeling with increased precision and accuracy. This analysis has established a solid foundation for us to better characterize the structural organization of cellular networks in the future.
Similarly, TSTO has identified trimeric interactions from in vivo XL-MS analyses, for instance, triangulating the positions of ribosome-interacting proteins relative to the larger 80S ribosomal machinery, as well as various subunit interfaces within and between its 40S and 60S subcomplexes. In addition to revealing the identities of heteromeric interactions, TSTO expands the capability of XL-MS to identify homomeric ones. While bifunctional cross-linkers can reveal homodimer formation by identifying cross-linked peptides with overlapping sequences, TSTO is capable of placing homodimeric interactions in the three-dimensional context of multiprotein assemblies by capturing a third residue, providing a clearer picture of how individual dimers are oriented as part of a larger protein complex. Furthermore, TSTO can unravel homotrimeric interactions—as confirmed for NME2 and HSPE1—which can be used in conjunction with structure prediction such as AlphaFold to model unknown structures.
While TSTO is highly effective for in-cell cross-linking at lower concentrations than bifunctional cross-linkers20,26,37, the heterogeneity of TSTO cross-links appears to impact the number of unique PPIs obtained for proteome-wide analysis. Based on our data, 62% of all inter-protein PPIs derived from trivalent cross-links were covered by bivalent ones. This is most likely attributed to the reduced number of unique interactions identified in the current dataset. However, it is noted that each interaction was identified with higher confidence as they were supported by an increased number of cross-links. In comparison to in-cell TSTO XL-MS analysis, the overall coverage of in-tissue XL-PPIs revealed by TSTO was less extensive, more likely due to the higher complexity and lower recovery of tissue proteins and cross-linked peptides. To enhance protein coverage and increase cross-link identification for proteome-wide experiments, fractionation of cross-linked protein complexes can be coupled with peptide separation to improve analysis sensitivity and dynamic range. Although cross-link enrichment through incorporation of affinity tags such as biotin, azide, or phosphonic acid have been successful with previous bifunctional cross-linkers, this approach is likely impractical for TSTO due to the central positioning of its trioxane functional group. However, recent developments in polyclonal antibodies that can recognize MS-cleavable cross-linkers38 suggest that similar affinity purification strategies could be adapted to enrich TSTO cross-links and increase their MS detectability. Moreover, while MS3-based analysis allows accurate cross-link identification, it is less sensitive than MS2 analysis. Thus, MS2-based workflows could be explored to further increase the sensitivity of TSTO cross-link identification. This will require software development to enable the identification of three cross-linked peptides from a single MS2 spectrum. Due to the unique MS-cleavability of TSTO, we anticipate MS2-based identification of TSTO cross-links would be highly feasible compared to the identification of tri-links formed by conventional bifunctional cross-linkers. Given its unique ability to form trivalent cross-links and its robustness in cross-link identification, TSTO represents a promising cross-linker that can provide additional spatial restraints to significantly enhance our understanding of protein modules and their organization in complex biological systems.
Beyond improving structural characterization and expanding the scope of detectable interactions, the TSTO-based XL-MS platform can be coupled with isotope-based quantitative strategies to enable the interrogation of how multimeric protein interactions fluctuate in response to biological stimuli, disease states, or drug treatments, providing a deeper functional understanding of cellular organization. The isotopic labels can be introduced into TSTO cross-links by SILAC labeling of lysines39, chemical labeling of cross-linked peptides with isobaric reagents (e.g., TMT)40, or coding isotopic labels in the linker design.
In summary, we have successfully established and employed the TSTO-based XL-MS platform to capture trimeric interactions of protein complexes and cellular networks from intact cells and tissues, yielding structural details that cannot be easily obtained by existing reagents. The information obtained can be combined with AlphaFold prediction and/or integrative structural modeling to enhance the characterization of cellular networks in future studies. Therefore, the TSTO-based XL-MS platform represents a highly promising approach for advancing XL-MS technology towards systems structural biology in vivo.
Materials and methods
Ethical statement
Animal experiments were performed according to the protocol approved by the Institutional Animal Care and Use Committee (IACUC) of the University of California, Irvine. Heart tissue used in the experiment were obtained from adult (20-week-old) C57BL/6 wildtype mice. Mice were maintained at constant temperature (23 °C) and humidity, and housed at 12 h light/12 h dark cycles and with free access to food and water.
Experimental design
This study did not involve randomization or blinding; no data or mice were excluded from the analysis.
TSTO synthesis
Starting with delta-valerolactone, (1) acid-catalyzed ring opening in methanol provided a methyl ester, which was (2) converted to an aldehyde via Swern oxidation in an 85% yield over two steps. (3) The resulting aldehyde was treated with 10 mol% zinc-(II) chloride, affording a symmetric trioxane tris-ester. (4) Hydrolysis of the tris-ester afforded a tris-acid, which was (5) converted to TSTO by ester formation with NHS and TFAA.
TSTO cross-linking of synthetic peptide Ac-SR8
Synthetic peptide Ac-SR8 was cross-linked at a concentration of 1 mM with 1 mM TSTO in anhydrous DMSO or pH 7.4 phosphate buffered saline. The reaction was carried out at RT with gentle vortexing, with aliquots of the original reaction volume being removed and quenched with ammonium bicarbonate at 5, 15, 30, and 60 min. After dilution by 1:500, the resulting cross-linked peptide mixtures were directly injected for MS analyses.
TSTO cross-linking of BSA
BSA was cross-linked at 20 mM with 1 mM TSTO in pH 7.4 phosphate-buffered saline. The reaction was carried out for 1 hr at RT, followed by quenching with ammonium bicarbonate. The protein was then reduced with tris(2-carboxyethyl)phosphine (TCEP) for 30 min at RT and alkylated with iodoacetamide in the dark at RT for 30 min. Cross-linked proteins were then digested in 8 M urea buffer using LysC for 4 hrs at 37 °C, followed by trypsin digestion at 37 °C overnight after diluting the urea concentration to <1.5 M. The resulting peptide mixtures were extracted and desalted using C18 tips (Agilent) prior to MS analyses.
Affinity purification and TSTO cross-linking of human 26S proteasomes
A stable 293HBTH-Rpn1 cell line41 was first grown to confluency. After native cell lysis, human 26S proteasomes were purified from the clarified lysate by binding to streptavidin–sepharose resin. Bead-bound proteasomes were cross-linked on-bead in PBS buffer (pH 7.5) with 0.75 mM TSTO for 1 h at room temperature. After quenching the cross-linking reaction using ammonium bicarbonate, the proteins were reduced with TCEP for 30 min at RT and alkylated with iodoacetamide in the dark at RT for 30 min. Cross-linked proteins were then digested in 8 M urea buffer using LysC for 4 hrs at 37 °C, followed by trypsin digestion at 37 °C overnight after diluting the urea concentration to <1.5 M. The resulting peptide mixtures were extracted and desalted using C18 tips (Agilent) prior to MS analyses.
Optimization of in vivo TSTO cross-linking by immunoblot analysis
A stable 293HBTH-CSN2 cell line42 was first grown to confluency, washed with PBS, and then gently pelleted. To determine the optimal cross-linking conditions, intact cells were cross-linked at various cross-linking concentrations ranging from 0.5 to 3 mM, at room temperature or 37 °C. Clarified lysates from each condition were separated by SDS-PAGE and transferred onto a PVDF membrane and stained using amido black. After rinsing off the dye and blocking using 5% milk in TBST, the membrane-bound proteins were incubated with streptavidin-HRP to monitor the oligomerization of HBTH-CSN2 in response to cross-linking conditions. Based on these results, in vivo cross-linking of intact HEK293 cells was performed at 1 mM.
In vivo TSTO cross-linking of human cells
Intact 293HBTH-CSN2 cells were cross-linked using 1 mM TSTO in PBS buffer (pH 7.4) for 1 h with rotation at room temperature. Afterwards, the cross-linking reaction was quenched using excess ammonium bicarbonate (50 mM) for 10 min. Cells were spun down, washed again with PBS, and needle-lysed on ice in denaturing buffer (8 M urea, 50 mM Tris-HCl pH 7.5). Lysate was clarified by centrifugation at 21,000 × g for 15 min in 4 °C, and the resulting supernatant was transferred to EMD Millipore 30,000 NMWL Microcon centrifugal tubes for FASP digestion similarly as described43. Briefly, proteins atop the filter were reduced using 10 mM TCEP for 20 min at RT, alkylated using 20 mM iodoacetamide in the dark at RT for 20 min, and then digested in 8 M urea buffer using LysC for 4 h at 37 °C followed by trypsin digestion at 37 °C overnight after urea dilution to 1.5 M. The resulting cross-linked peptide mixtures were then spun through the filter and desalted using Waters C18 Sep-Pak cartridges.
In vivo TSTO cross-linking of mouse heart tissue
Freshly excised mouse hearts were sliced into 1-2 mm cubes and washed in cold PBS to remove blood. The cubed cardiac tissue was incubated in 3 mL of 1 mM TSTO in PBS for 1 hr at RT with rotation. The cross-linking reaction was then quenched using 20 mM ammonium bicarbonate for an additional 15 min rotation at RT. The tubes were then gently centrifuged, and the tissues were washed using cold PBS twice. The tissues were then flash frozen and manually cryopulverized before lysis using a BioRuptor in denaturing buffer (8 M urea, 50 mM Tris-HCl pH 7.5). 5 cycles consisting of 30 s sonication followed by 30 s rest were used to lyse the cryopulverized tissue. Lysate was clarified by centrifugation at 21,000 × g for 15 min in 4 °C, and the resulting supernatant was transferred to EMD Millipore 30,000 NMWL Microcon centrifugal tubes for FASP digestion as described above. Following digestion with LysC and trypsin, the resulting cross-linked peptide mixtures were then spun through the filter and desalted using Waters C18 Sep-Pak cartridges.
SEC-HpHt enrichment of cross-linked peptides
Peptide separation by SEC was performed similarly as described44. Briefly, dried peptides were reconstituted in SEC mobile phase (0.1% formic acid and 30% ACN) and separated on a Superdex Peptide PC 3.2/30 column (300 × 3.2 mm) at a flow rate of 50 µL/min, monitored at 215, 254 and 280 nm UV absorbance. Two-minute fractions were collected, and only fractions 24 and 26 containing the most cross-linked peptides were collected. SEC-separated fractions were then vacuum dried and resuspended in ammonium water (pH 10). For high-pH separation, HpHt tips were prepared as described19. Pipette tips (200 μL) were first blocked with a layer of C8 membrane (Empore 3 M), then filled with 5 mg of C18 solid phase (3 μm, Durashell, Phenomenex). The tips were then washed sequentially with 90 μL of methanol, 90 μL of ACN and 90 μL of ammonia water (pH 10). After loading onto the tip, the peptides were washed once with 90 μL ammonia water and eluted using increasing percentage of ACN in ammonia water (6, 9, 12, 15, 18, 21, 25, 30, 35, and 50%). The 25, 30, 35 and 50% fractions were then combined with 6, 9, 12 and 21% fractions, respectively. The final SEC-HpHt fractions were vacuum dried and stored at − 80 oC before MS analysis.
LC-MSn analysis
LC-MSn analysis of cross-linked peptides was performed using an UltiMate 3000 UPLC (Thermo Fisher Scientific) liquid chromatograph coupled online to an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific). Peptides were separated by reverse-phase on a 50 cm × 75 μm I.D. Acclaim® PepMap RSLC column using gradients of 4% to 25% acetonitrile at a flow rate of 300 nL/min (solvent A: 100% H2O, 0.1% formic acid; solvent B: 100% acetonitrile, 0.1% formic acid) prior to MSn analysis. For each MSn acquisition, duty cycles consisted of one full Fourier transform scan mass spectrum (375–1500 m/z, resolution of 60,000 at m/z 400) followed by data-dependent MS2 and MS3 acquired at top speed in the Orbitrap and linear ion trap, respectively. Ions detected in MS1 with 4 + or greater charge were selected and subjected to CID fragmentation (NCE 23%) in MS2, and resulting ions were detected in the Orbitrap (resolution 30,000). Ions observed in MS2 spectra with charge 2 + or greater were selected and fragmented in MS3 using CID (NCE 35%) and detected in the linear ion trap in ‘Rapid’ mode. Ions selected for MS3 were either based on abundance (top 4 or 5 most intense ions in MS2) or targeted based on doublets with mass difference pairs (Δ = 18.02 Da) corresponding to cross-linker remnant moiety water loss. For 26S proteasome cross-links, each acquisition was 200 min and ion selection for MS3 was based on top intensity. For SEC-separated F24 fractions, the entire HpHt fraction was injected; for F26 fractions, half of the HpHt fraction was injected for each LC-MSn analysis. Each acquisition was 120 or 150 min, and both top intensity and mass difference-targeted methods were used for selecting ions for MS3 analysis.
Identification of TSTO Cross-links by MSn
Spectrometric data were extracted from.raw files using PAVA45. Extracted MS3 spectra were subjected to protein database searching via Batch-Tag within a developmental version of Protein Prospector (v. 6.3.5, University of California, San Francisco) against a randomly concatenated SwissProt database consisting of 20,418 human proteins and their corresponding decoys. Mass tolerances for parent ions and fragment ions were set as ± 15 ppm and 0.6 Da, respectively. Trypsin was set as the enzyme with three maximum missed cleavages allowed. Cysteine carbamidomethylation was selected as a constant modification, while protein N-terminal acetylation, methionine oxidation, and N-terminal conversion of glutamine to pyroglutamic acid were selected as variable modifications. Two additional defined variable modifications on uncleaved lysines and free protein N-termini were selected: AR (C5H6O2, 98.04 Da) and AR* (C5H4O, 80.03 Da), corresponding to remnant moieties for cleaved TSTO. MSn data (monoisotopic masses and charges of parent ions and corresponding fragment ions and MS3 database search results were integrated via in-house software xl-Tools39 to automatically generate, summarize and validate identified cross-linked peptide pairs. Experimental FDR calculated using a target-decoy approach was determined to be 0.04% at the cross-link level. Using a minimum peptide length of 5 residues for individual cross-linked peptides, the separation of inter/intra FDR calculation resulted in experimental FDRs of 0.2% and 0.0%, respectively.
Integrative modeling using TSTO data
Integrative modeling was used to demonstrate the effectiveness of the TSTO cross-linker in structural modeling by computing the structure of the 26S human proteasome base subcomplex (subunits Rpt1–Rpt6 and Rpn215) based on data from trifunctional and bifunctional cross-linkers. Our protocol proceeded through the standard four stages:
-
(1) Gathering information: Input information consisted of AlphaFold2-predicted structures46 for the subunits (Rpt1–Rpt6 and Rpn2), 642 TSTO cross-links (XLs) identified in this study, 281 DSSO cross-links reported by Yu et al.16, and synthetically generated cross-linking datasets. Synthetic cross-linking datasets were generated based on the cryo-EM structure of the human 26S proteasome (PDB ID: 5GJR47). Cross-linking probabilities were assigned to lysine residue pairs with Cα–Cα distances below 30 Å using a skewed normal distribution, with parameters (location: 8.0, scale: 14.8, skewness: 7.6) derived from fitting the experimental distance distribution of trivalent cross-links. Monte Carlo sampling, guided by the assigned probabilities and the Metropolis acceptance criterion, was used to select residue pairs and triplets for constructing the synthetic datasets.
-
(2) Representing the subunits and translating data into spatial restraints: Relying on the AlphaFold2 predictions of the proteasome base subcomplex, we represented each subunit as a rigid body using a 1-residue-per-bead representation. Spatial restraints for cross-linked residues were applied as upper bounds on the distances between residue pairs. TSTO trivalent cross-links were represented by a set of three distance restraints between all residue pairs, ensuring that distances spanned by all residue pairs are simultaneously within the cross-linker distance threshold. TSTO and DSSO bivalent cross-links were modeled using a single upper-bound distance restraint. Furthermore, excluded volume and sequence connectivity restraints were imposed on all components.
-
(3) Sampling: Structural models were computed using Replica Exchange Gibbs sampling, based on the Metropolis Monte Carlo (MC) algorithm. Each MC step consisted of a series of random transformations (i.e., rotations and/or translations) of the rigid bodies.
-
(4) Analysis and validation of the structural models of the 26 s proteasome followed the previously published five steps46,48,49.
To benchmark the four-stage protocol described here and assess the utility of the TSTO cross-linker for integrative structure modeling, we computed the distribution of the accuracy for the structural ensembles obtained by integrative modeling. The accuracy is defined as the mean of Cα RMSD between the EM structure and each of the structures in the ensemble. In addition, we computed the precision of each ensemble of models; the precision is defined as the average RMSD between all solutions in the ensemble.
The integrative structure modeling protocol [i.e., stages (ii), (iii), and (iv)] was scripted using the Python Modeling Interface (PMI) package, a library for modeling macromolecular complexes based on our open-source Integrative Modeling Platform (IMP) package version 2.20. All input files, scripts, and output files are available at [GitHub link].
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Unless otherwise stated, all data supporting the results of this study can be found in the article, supplementary, and source data files. The mass spectrometry data generated in this study have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD054551 and PXD063825. Input files, scripts and data generated during integrative structure modeling are archived at Zenodo with accession code 15367945. In addition, the following published structures were used for cross-link mapping or structural modeling: 4F5S7QY75GJR8PYW4PJ1 Source data are provided in this paper.
Code availability
The Python Modeling Interface (PMI) package is available at [GitHub link].
References
Yu, C. & Huang, L. Cross-linking mass spectrometry: An emerging technology for interactomics and structural biology. Anal. Chem. 90, 144–165 (2018).
O’Reilly, F. J. & Rappsilber, J. Cross-linking mass spectrometry: methods and applications in structural, molecular and systems biology. Nat. Struct. Mol. Biol. 25, 1000–1008 (2018).
Piersimoni, L., Kastritis, P. L., Arlt, C. & Sinz, A. Cross-linking mass spectrometry for investigating protein conformations and protein-protein Interactions horizontal line A method for all seasons. Chem. Rev. 122, 7500–7531 (2022).
Yu, C. & Huang, L. New advances in cross-linking mass spectrometry toward structural systems biology. Curr. Opin. Chem. Biol. 76, 102357 (2023).
Tang, X., Wippel, H. H., Chavez, J. D. & Bruce, J. E. Crosslinking mass spectrometry: A link between structural biology and systems biology. Protein Sci. 30, 773–784 (2021).
Britt, H. M., Cragnolini, T. & Thalassinos, K. Integration of mass spectrometry data for structural biology. Chem. Rev. 122, 7952–7986 (2022).
Graziadei, A. & Rappsilber, J. Leveraging crosslinking mass spectrometry in structural and cell biology. Structure 30, 37–54 (2022).
Lee, K. & O’Reilly, F. J. Cross-linking mass spectrometry for mapping protein complex topologies in situ. Essays Biochem. 67, 215–228 (2023).
Sinz, A. Divide and conquer: cleavable cross-linkers to study protein conformation and protein-protein interactions. Anal. Bioanal. Chem. 409, 33–44 (2017).
Matzinger, M. & Mechtler, K. Cleavable cross-linkers and mass spectrometry for the ultimate task of profiling protein-protein interaction networks in vivo. J. Proteome Res. 20, 78–93 (2021).
Mohr, J. P., Caudal, A., Tian, R. & Bruce, J. E. Multidimensional cross-linking and real-time informatics for multiprotein interaction studies. J. Proteome Res. 23, 107–116 (2024).
Fan, X., Rodriguez-Escrich, C., Sayalero, S. & Pericas, M. A. Paraldehyde as an acetaldehyde precursor in asymmetric Michael reactions promoted by site-isolated incompatible catalysts. Chemistry 19, 10814–10817 (2013).
Brewer, P. J., di Meane, E. A., Vargha, G. M., Brown, R. J. & Milton, M. J. Gaseous reference standards of formaldehyde from trioxane. Talanta 108, 83–87 (2013).
Kao, A. et al. Development of a novel cross-linking strategy for fast and accurate identification of cross-linked peptides of protein complexes. Mol. Cell Proteomics 10, M110.002212 (2011).
Lee, D. et al. Molecular mechanism for activation of the 26S proteasome by ZFAND5. Mol. Cell 83, 2959–2975 (2023).
Wang, X. et al. Molecular details underlying dynamic structures and regulation of the human 26S proteasome. Mol. Cell Proteom. 16, 840–854 (2017).
Yu, C., Wang, X. & Huang, L. Developing a targeted quantitative strategy for sulfoxide-containing MS-cleavable cross-linked peptides to probe conformational dynamics of protein complexes. Anal. Chem. 94, 4390–4398 (2022).
Iacobucci, C. et al. First community-wide, comparative cross-linking mass spectrometry study. Anal. Chem. 91, 6953–6961 (2019).
Jiao, F. et al. Two-dimensional fractionation method for proteome-wide cross-linking mass spectrometry analysis. Anal. Chem. 94, 4236–4242 (2022).
Wheat, A. et al. Protein interaction landscapes revealed by advanced in vivo cross-linking-mass spectrometry. Proc. Natl. Acad. Sci. USA 118, https://doi.org/10.1073/pnas.2023360118 (2021).
Jiao, F. et al. Exploring an alternative cysteine-reactive chemistry to enable proteome-wide PPI analysis by cross-linking mass spectrometry. Anal. Chem. 95, 2532–2539 (2023).
Yugandhar, K. et al. MaXLinker: Proteome-wide cross-link identifications with high specificity and sensitivity. Mol. Cell Proteom. 19, 554–568 (2020).
Liu, F., Lossl, P., Rabbitts, B. M., Balaban, R. S. & Heck, A. J. R. The interactome of intact mitochondria by cross-linking mass spectrometry provides evidence for coexisting respiratory supercomplexes. Mol. Cell Proteom. 17, 216–232 (2018).
Bartolec, T. K. et al. Cross-linking mass spectrometry discovers, evaluates, and corroborates structures and protein-protein interactions in the human cell. Proc. Natl. Acad. Sci. USA 120, e2219418120 (2023).
Bekker-Jensen, D. B. et al. An optimized shotgun strategy for the rapid generation of comprehensive human proteomes. Cell Syst. 4, 587–599 (2017).
Jiang, P. L. et al. A membrane-permeable and immobilized metal affinity chromatography (IMAC) enrichable cross-linking reagent to advance in vivo cross-linking mass spectrometry. Angew. Chem. Int. Ed. Engl. 61, e202113937 (2022).
Ben-Shem, A. et al. The structure of the eukaryotic ribosome at 3.0 A resolution. Science 334, 1524–1529 (2011).
Du, M., Li, X., Dong, W. & Zeng, F. Implication of Stm1 in the protection of eIF5A, eEF2 and tRNA through dormant ribosomes. Front. Mol. Biosci. 11, 1395220 (2024).
Wells, J. N. et al. Structure and function of yeast Lso2 and human CCDC124 bound to hibernating ribosomes. PLoS Biol. 18, e3000780 (2020).
Luo, E. C. et al. Large-scale tethered function assays identify factors that regulate mRNA stability and translation. Nat. Struct. Mol. Biol. 27, 989–1000 (2020).
Shannon, A. et al. An exonuclease-resistant chain-terminating nucleotide analogue targeting the SARS-CoV-2 replicase complex. Nucleic Acids Res. 52, 1325–1340 (2024).
Nisemblat, S., Yaniv, O., Parnas, A., Frolow, F. & Azem, A. Crystal structure of the human mitochondrial chaperonin symmetrical football complex. Proc. Natl. Acad. Sci. USA 112, 6044–6049 (2015).
Abramson, J., et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493-500 (2024).
Gutierrez, C. B. et al. Developing an acidic residue reactive and sulfoxide-containing MS-cleavable homobifunctional cross-linker for probing protein-protein Interactions. Anal. Chem. 88, 8315–8322 (2016).
Gutierrez, C. B. et al. Development of a novel sulfoxide-containing MS-cleavable homobifunctional cysteine-reactive cross-linker for studying protein-protein interactions. Anal. Chem. 90, 7600–7607 (2018).
Gutierrez, C. et al. Enabling photoactivated cross-linking mass spectrometric analysis of protein complexes by novel MS-cleavable cross-linkers. Mol. Cell Proteom. 20, 100084 (2021).
Chavez, J. D. et al. Systems structural biology measurements by in vivo cross-linking with mass spectrometry. Nat. Protoc. 14, 2318–2343 (2019).
Singh, J., Ponnaiyan, S., Gieselmann, V. & Winter, D. Generation of antibodies targeting cleavable cross-linkers. Anal. Chem. 93, 3762–3769 (2021).
Yu, C. et al. Probing H2O2-mediated structural dynamics of the human 26S proteasome using quantitative cross-linking mass spectrometry (QXL-MS). Mol. Cell Proteom. 18, 954–967 (2019).
Yu, C. et al. Developing a multiplexed quantitative cross-linking mass spectrometry platform for comparative structural analysis of protein complexes. Anal. Chem. 88, 10301–10308 (2016).
Lin, D. W. et al. Microhomology-based CRISPR tagging tools for protein tracking, purification, and depletion. J. Biol. Chem. 294, 10877–10885 (2019).
Fang, L. et al. Mapping the protein interaction network of the human COP9 signalosome complex using a label-free QTAX strategy. Mol. Cell Proteom. 11, 138–147 (2012).
Wisniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359–362 (2009).
Leitner, A. et al. Expanding the chemical cross-linking toolbox by the use of multiple proteases and enrichment by size exclusion chromatography. Mol. Cell Proteomics 11, https://doi.org/10.1074/mcp.M111.014126 (2012).
Guan, S., Price, J. C., Prusiner, S. B., Ghaemmaghami, S. & Burlingame, A. L. A data processing pipeline for mammalian proteome dynamics studies using stable isotope metabolic labeling. Mol. Cell Proteomics 10, https://doi.org/10.1074/mcp.M111.010728 (2011).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Huang, X., Luan, B., Wu, J. & Shi, Y. An atomic structure of the human 26S proteasome. Nat. Struct. Mol. Biol. 23, 778–785 (2016).
Saltzberg, D. J. et al. Using integrative modeling platform to compute, validate, and archive a model of a protein complex structure. Protein Sci. 30, 250–261 (2021).
Viswanath, S., Chemmama, I. E., Cimermancic, P. & Sali, A. Assessing exhaustiveness of stochastic sampling for integrative modeling of macromolecular structures. Biophys. J. 113, 2344–2353 (2017).
Acknowledgements
We wish to thank Prof. A.L. Burlingame, Drs. Peter Baker and Robert Chalkley for their support of Protein Prospector, and the Huang lab members, especially Paul Morenkov and Fenglong Jiao, for their help. This work was supported by National Institutes of Health grants R35GM145249 to L.H., R01CA290875 to A.W. and L.H., R35GM151256 to I.E., and R01DK136940 and R01DK121146 to Q.Y.
Author information
Authors and Affiliations
Contributions
L.H. conceived the study and directed the research. C.Y., X.W., E.N., S.R., and L.H. designed the experiments. C.Y. performed XL-MS experiments, data acquisition and analyses. X.W. purified and cross-linked proteasome complexes, E.N. and S.R. performed cross-linker synthesis, S.G.B. and I.E. performed integrative structural modeling and AlphaFold prediction and wrote the structural modeling section. X.S. and Q.Y. performed mouse dissections for tissue extraction. C.Y. and L.H. wrote the manuscript with contributions from other authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Maria Gaczynska, Karl Mechtler, Pawel Osmulski, and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yu, C., Novitsky, E., Balasubramani, S.G. et al. Trioxane-based MS-cleavable cross-linking mass spectrometry for profiling multimeric interactions of cellular networks. Nat Commun 16, 5585 (2025). https://doi.org/10.1038/s41467-025-60642-3
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-60642-3








