Characterizing and engineering post-translational modifications with high-throughput cell-free expression

Wong, Derek A.; Shaver, Zachary M.; Cabezas, Maria D.; Daniel-Ivad, Martin; Warfel, Katherine F.; Prasanna, Deepali V.; Sobol, Sarah E.; Fernandez, Regina; Tobias, Fernando; Filip, Szymon K.; Hulbert, Sophia W.; Faull, Peter; Nicol, Robert; DeLisa, Matthew P.; Balskus, Emily P.; Karim, Ashty S.; Jewett, Michael C.

doi:10.1038/s41467-025-60526-6

Download PDF

Article
Open access
Published: 05 August 2025

Characterizing and engineering post-translational modifications with high-throughput cell-free expression

Nature Communications volume 16, Article number: 7215 (2025) Cite this article

14k Accesses
9 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Post-translational modifications (PTMs) are important for the stability and function of many therapeutic proteins and peptides. Current methods for studying and engineering PTMs are often limited by low-throughput experimental techniques. Here we describe a generalizable, in vitro workflow coupling cell-free gene expression (CFE) with AlphaLISA for the rapid expression and testing of PTM installing proteins. We apply our workflow to two representative classes of peptide and protein therapeutics: ribosomally synthesized and post-translationally modified peptides (RiPPs) and glycoproteins. First, we demonstrate how our workflow can be used to characterize the binding activity of RiPP recognition elements, an important first step in RiPP biosynthesis, and be integrated into a biodiscovery pipeline for computationally predicted RiPP products. Then, we adapt our workflow to study and engineer oligosaccharyltransferases (OSTs) involved in protein glycan coupling technology, leading to the identification of mutant OSTs and sites within a model vaccine carrier protein that enable high efficiency production of glycosylated proteins. We expect that our workflow will accelerate design-build-test-learn cycles for engineering PTMs.

Simplifying the detection and monitoring of protein glycosylation during in vitro glycoengineering

Article Open access 11 January 2023

De novo design of ribosomally synthesized and post-translationally modified peptides

Article 07 January 2025

Mimicked synthetic ribosomal protein complex for benchmarking crosslinking mass spectrometry workflows

Article Open access 08 July 2022

Introduction

Protein- and peptide-based biologics play an important role in treating and preventing a wide variety of illnesses. Currently, about 30% of all new US Food and Drug Administration (FDA) approved therapeutics entering the clinical setting are protein biologics¹. Common protein-based therapeutics include antibodies², blood coagulants^3,4, and vaccines^5,6, among others⁷. Peptide drugs continue to mature as important options for treating microbial infection⁸ and diabetes⁹, among other conditions¹⁰, with over 800 new peptide therapeutics either in clinical development or undergoing preclinical studies¹¹. Understanding how to design and produce protein- and peptide-based therapeutics with optimal characteristics continues to be a major focus in biological research.

For many biologics, post-translational modifications (PTMs) are important for stability and activity. Examples of PTMs include cyclization¹², methylation¹³, β-hydroxylation¹⁴, glycosylation¹⁵, and sulfation¹⁶, among many others¹⁷. Unfortunately, workflows for studying PTMs are often low throughput. For example, studies screening libraries of PTM installing enzymes or protein substrates often require overexpression of each variant in individual strains and labor-intensive protein purification steps. These methods are then coupled with low-throughput analytical methods such as mass spectrometry^18,19,20, Western blotting²¹, or ELISA²², which are often time intensive or involve complex data analysis. Additionally, techniques used to directly measure interactions between PTM installing enzymes and their substrates, such as fluorescence polarization²³, co-crystallization of the substrate in the enzyme active site^24,25, and isothermal titration calorimetry (ITC)²⁶, often limit studies to single digits or tens of variants.

Advances in cell-free gene expression (CFE) systems²⁷ have enabled the parallelized expression of proteins and peptides in hours, which can facilitate the rapid characterization and engineering of PTMs. CFE systems^28,29,30 use transcription and translation machinery, rather than living cells, supplemented with additional cofactors, energy sources, salts, and a DNA template to produce a desired protein. CFE systems have been successfully applied to a variety of high-throughput bioengineering applications, such as engineering transcription factors^31,32, constructing metabolic^33,34,35,36 and glycosylation pathways^37,38,39, and studying the substrate promiscuity of various PTM installing enzymes^40,41,42. However, many of these applications rely on liquid chromatography and mass spectrometry-based approaches or the ability to connect the targeted protein function with a visual output such as superfolder green fluorescent protein (sfGFP) production. Matching the throughput of CFE, AlphaLISA⁴³ is an in-solution, bead-based assay version of ELISA that is amenable to acoustic liquid handling robots and small (1–2 μL) reaction sizes in 384- or 1,536-well plate formats and has previously been used with cell-free systems to assess protein-protein interactions^44,45,46. By requiring only liquid transfer and incubation steps, AlphaLISA facilitates the analysis of hundreds to thousands of reactions in hours.

Here, we describe a general in vitro, plate-based platform for characterizing and engineering PTMs using CFE and AlphaLISA which we apply to both (i) ribosomally synthesized and post-translationally modified peptides (RiPPs) and (ii) glycoproteins. To begin, we show that our workflow can be used to detect interactions between RiPP recognition elements (RREs) and their native precursor peptides, a key first step in the biosynthesis of many RiPP products⁴⁷. We then characterize peptide residues important for RRE binding and assess RRE binding of computationally predicted RiPP products. By modifying the CFE portion of the workflow, we then directly measure the enzymatic attachment of glycans onto proteins. From a library of 285 unique enzyme variants, we identify 7 high-performing mutants, including a single mutant with a 1.7-fold improvement of glycosylation with a clinically relevant glycan. Finally, we systematically characterize accessible sites within an FDA approved carrier protein for protein glycosylation. We expect that our workflow will accelerate the characterization and engineering of PTMs important for protein- and peptide-based therapeutics.

Results

A cell-free AlphaLISA-based workflow can detect RRE-peptide interactions

The goal of our work was to develop a robust, high-throughput, and generalizable workflow that expedites the ability to characterize and engineer PTMs on peptides and proteins. Key to this development was the optimized integration of CFE and AlphaLISA, as well as the ability to study different classes of PTMs.

We chose to first apply our workflow to RiPPs (e.g., lanthipeptides^48,49, thiopeptides^50,51,52) due to growing interest in their use as antimicrobial therapeutics^{53,54,55,56,57,58,59}. While mature RiPPs vary in amino acid composition, RiPPs originate as a precursor peptide typically composed of an N-terminal leader sequence and C-terminal core sequence⁶⁰. Tailoring enzymes encoded within the same biosynthetic gene cluster (BGC) as the precursor peptide recognize a portion of the leader sequence and install PTMs on the core sequence, producing the mature RiPP⁶⁰. In around 65% of RiPP classes produced in prokaryotes, the recognition of the leader sequence by tailoring enzymes is facilitated by a standalone protein or portion of a fusion protein containing a RiPP precursor peptide recognition element (RRE)^47,61. In the absence of the RRE, individual reactions catalyzed by the tailoring enzymes often suffer from slow kinetics and low conversion rates⁶². Yet, despite their importance in catalyzing RiPP formation, current methods for studying interactions between RREs and their peptide substrate are low-throughput (e.g. fluorescence polarization^61,63,64 and co-crystallization²⁴).

To begin, we selected a panel of 13 RREs from a range of RiPP classes. We initially assessed their expression in PUREfrex via incorporation of FluoroTect^TM Green_Lys fluorescently labeled lysine (Supplementary Fig. 1). For 9 of these proteins, we tested the native sequence as well as fusion proteins in which the predicted RRE domain was fused to maltose-binding protein (MBP) due to their size and/or origin from a radical S-adenosyl-L-methionine (SAM) enzyme that could potentially make expression difficult. While some of the full-length constructs did produce soluble protein, we generally saw better expression when constructs were fused to MBP. We next tested the functionality of these MBP-tagged RRE proteins in an AlphaLISA assay with each of their respective peptide substrates (Fig. 1a). To do so, we expressed RRE fusion proteins and N-terminally sFLAG-tagged peptide substrates in individual PUREfrex reactions. We then assayed for RRE-peptide recognition by mixing an RRE protein-expressing PUREfrex reaction and the corresponding peptide substrate-expressing reaction with anti-FLAG donor beads and anti-MBP acceptor beads. Only in instances in which the RRE binds the peptide will the acceptor and donor bead be brought within close enough proximity to produce a chemiluminescent signal. A cross-titration of four different RRE-peptide pairs (PqqD, TbiB1, HcaF, TbtF) across multiple dilutions revealed a clear binding pattern consistent with RRE-peptide engagement (Fig. 1b–e), which we do not observe when assaying MBP only with the respective peptides (Supplementary Fig. 2).

**Fig. 1: A cell-free plate-based assay for detecting RRE-peptide interactions.**

Mapping RRE-peptide binding landscapes informs design

We next asked whether we could characterize a peptide-binding landscape to inform the design of a synthetic peptide capable of binding to a naturally occurring RRE. To do this, we chose the RRE domain of TbtF, the cyclodehydratase involved in thiomuracin^65,66,67,68 biosynthesis, and its leader sequence of TbtA (Fig. 2a). Mutating residues L(-32), L(−29), M(-27), D(-26), and F(-24) within the leader sequence of TbtA to an alanine was previously shown using fluorescence polarization to reduce binding affinity to TbtF⁶⁶. By creating an alanine positional scanning library, we demonstrated that our method could achieve similar results to those using fluorescence polarization as evidenced by a >100-fold decrease in AlphaLISA signal compared to the wild-type peptide sequence for all noted mutations. We also found that the mutation D(-30)A resulted in a >100-fold decrease in AlphaLISA signal. By using CFE combined with AlphaLISA, we characterized the peptide-binding landscape within hours without conventional cloning, transformation, expression, and purification workflows normally required for fluorescence polarization competition assays.

**Fig. 2: Cell-free workflow identifies peptide residues important for binding by TbtF.**

We then used TbtA’s peptide-binding landscape to design a synthetic peptide capable of binding to TbtF. We started with a synthetic peptide sequence the same length as the leader sequence of TbtA that does not bind to TbtF (Fig. 2b; peptide variant 2), using the first 40 amino acids of sfGFP with a G(-18)T mutation to ensure all residues in the region of interest differed from the wild-type TbtA leader sequence. We then created peptide variants by replacing residues in the synthetic peptide with residues identified from the alanine scan as important for binding by TbtF, starting with the six residues (L(-32), D(-30), L(-29), M(-27), D(-26), and F(-24)) that when mutated to an alanine resulted in the greatest decrease in AlphaLISA signal. We were unable to detect binding interactions between this engineered peptide variant (peptide variant 3) and TbtF. Next, we created peptide variants 4–10 by adding individually, or in combination, residues L(-34), P(-28), and M(-22), which in our initial screen also appeared to slightly reduce binding affinity to TbtF when mutated to an alanine. Adding both P(-28) and M(-22) (peptide variant 9) to peptide variant 3 enabled weak binding by TbtF, with ~25% AlphaLISA signal of the wild-type TbtA leader sequence. Further addition of residues resulted in a synthetic peptide (peptide variant 12) that is 40% identical to the leader sequence of TbtA (L(-34), N(-33), L(-32), D(-30), L(-29), P(-28), M(-27), D(-26), F(-24), E(-23), and M(-22)) and exhibits binding to TbtF (AlphaLISA signal) that is approximately equal to that observed with the wild-type TbtA leader sequence peptide. Interestingly, adding residues D(-20) and S(-31) (peptide variant 14) increased the signal further to ~2-fold higher than that observed with the wild-type TbtA leader sequence. These results highlight our assay’s ability to rapidly identify specific residues involved in RRE-peptide binding interactions and design peptide sequences with the minimum number of residues required for RRE engagement.

Screening computationally identified RRE-peptide pairs

We next wanted to show how our workflow could characterize RRE binding for BGCs computationally predicted via AntiSMASH⁶⁹. Successful heterologous expression of computationally predicted RiPP products in vivo can be a challenge due to the inability to precisely control expression timing and yield, as well as the absence of necessary cofactors⁷⁰. We chose to study lasso peptides due to their unique lariat structure which imparts the molecule with a wide range of beneficial characteristics, such as heat and protease stability^71,72, and because they have been successfully expressed before in cell-free systems⁷³. Additionally, lasso peptides have displayed a variety of bioactivities, including antimicrobial activity^{74,75,76,77,78,79}. Biosynthetically, lasso peptide BGCs typically encode (i) a precursor peptide, (ii) an RRE and (iii) a protease, or a fusion protein encoding both the RRE and protease, as well as (iv) a cyclase⁷¹. In all reported lasso peptide BGCs, RREs are important for guiding the protease to the precursor peptide substrate and in some cases are also required for cyclization by the cyclase^{26,80,81,82,83,84}.

To begin, we used AntiSMASH⁶⁹ to identify a total of 2,574 lasso peptide BGCs from a collection of 39,311 diverse genomes (Fig. 3a; Supplementary Table 2). Of these, 1,882 BGCs were predicted to contain all essential lasso peptide biosynthetic enzymes (Supplementary Table 3). We compared the identified BGCs to known lasso peptides by constructing a sequence similarity network of the predicted core peptide sequences and annotating known sequences within the resulting network. Sequences that matched computationally predicted but not experimentally verified sequences reported in the literature were maintained in the dataset while those that had been characterized experimentally at the time of our analysis were removed; in doing so, we reasoned that our workflow could help validate predictions generated by others in the field. From the remaining predicted BGCs, 47 were selected for study from 32 unique genera based on their potential for antibiotic activity (Supplementary Data 1).

**Fig. 3: Computationally guided screen of lasso peptide RREs.**

Of the 47 predicted lasso peptide BGCs, 5 were predicted to contain more than one precursor peptide and/or RRE, bringing the total number of predictions to 57 unique precursor peptides and 52 unique RREs. We applied our cell-free workflow to screen all 57 predicted precursor peptides with their associated predicted RREs (Fig. 3b, c; Supplementary Fig. 6). To account for potential differences in expression levels in the PUREfrex reactions as well as the fact that RREs reported in literature have a range of binding affinities, we tested each peptide-RRE pair at multiple concentrations. In instances where multiple RREs or precursor peptides were predicted in the same BGC, we screened all pairwise combinations. In total, we screened 72 different RRE-peptide pairs, 42 RRE-peptide pairs from clusters with a single predicted RRE and peptide pair (Fig. 3b), and 30 different combinations of RRE and peptides from clusters with multiple predicted genes for each (Fig. 3c; Supplementary Fig. 6).

Our initial screen yielded clear binding patterns for 27 of the 42 individual RRE-peptide pairs and 24 of the 30 RRE-peptide combinations from larger clusters (Fig. 3b, c; Supplementary Fig. 6). A subsequent validation experiment assaying all RRE-peptide pairs in biological triplicate at the dilution condition that yielded the highest AlphaLISA signal confirmed the results of our screen, with RRE-peptide pairs that produced higher AlphaLISA signal in our initial screen generally producing higher AlphaLISA signal in the validation experiment (Supplementary Fig. 7). Notably, in addition to identifying functional RREs and peptide pairs, our methodology enables the rapid testing of more complex BGCs. For example, 44B1-1 can bind to both precursor peptides identified in the BGC while 44B1-2 can only bind to the second precursor peptide (Fig. 3c). Similar behavior emerged in BGC 46 in which predicted RREs bound to both, only one, or neither of the precursor peptides (Fig. 3c).

Using the results from our large-scale RRE screen, we prioritized BGCs identified as “hits” for complete biosynthesis of a mature lasso peptide in vitro. To do so, we expressed precursor peptides in PUREfrex reactions and purified each related tailoring enzyme heterologously expressed in Escherichia coli. Small scale (10 μL) reactions were assembled by combining precursor peptides and purified tailoring enzymes and analyzed via matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) MS after overnight incubation at 37 °C. By testing 24 clusters, we successfully produced one peptide with the topology of a lasso peptide from BGC 24 (Supplementary Figs. 10 and 11). Subsequent characterization experiments confirmed that the production of this lasso peptide, Las24, is time dependent (Supplementary Fig. 12), that all proteins in the predicted BGC are necessary for maturation (Supplementary Fig. 13), that the sequence of the molecule matches the expected structure (Supplementary Fig. 14), that the molecule is resistant to carboxypeptidase (a common confirmation of threaded topology) (Supplementary Fig. 15), and that there is limited to no interaction of Las24 biosynthetic components with those from other lasso peptide BGCs (Supplementary Fig. 16). During our work, King et al. reported the heterologous production of this lasso peptide (termed Las-1010) in E. coli from the same biosynthetic cluster⁸⁵. Las-1010, which was previously bioinformatically identified⁸⁶ but not experimentally characterized, was found by King et al. to exhibit weak antibacterial activity against some bacterial strains⁸⁵. Taken together, we showed that the integration of CFE and AlphaLISA allows for rapid prototyping of RiPP BGCs by detecting RRE-peptide interactions, mapping RRE-peptide binding landscapes, and screening computationally identified RRE-peptide pairs.

A cell-free AlphaLISA-based workflow for prototyping in vitro glycosylation reactions

We next showed the generalizability of our cell-free AlphaLISA workflow by exploring an important PTM in protein biologics, namely glycosylation. To show this, we chose to study the activity of oligosaccharyltransferases (OSTs) using bacterial glycans relevant to conjugate vaccine production. Conjugate vaccines, composed of a pathogen-specific polysaccharide antigen (e.g., O-antigen polysaccharide or capsular polysaccharide (CPS)) linked to an immunogenic carrier protein, are a promising strategy to protect against bacterial infections⁸⁷. Both the glycan and carrier protein play an important role in developing long-lasting immunity⁸⁸.

A current challenge with manufacturing conjugate vaccines is the reliance on a multi-step process in which the glycan is isolated from the targeted pathogenic bacteria and chemically conjugated to a separately produced carrier protein⁸⁹. A related limitation is the lack of control over the site of glycan attachment using traditional chemical conjugation techniques⁹⁰. To address these challenges, recent work in the fields of glycobiology and synthetic biology has developed both cell^{91,92,93,94,95,96} and cell-free^97,98,99,100 based methods for producing conjugate vaccines using OSTs to site-specifically transfer glycans onto a carrier protein, called protein glycan coupling technology^101,102.

We modified our workflow used with RiPPs to characterize glycosylation of carrier proteins using OSTs. We first express our carrier protein in a standard CFE reaction and our OST in a CFE reaction supplemented with nanodiscs, which act as membrane mimics into which membrane-bound proteins can express solubly¹⁰³. We next mix CFE-expressed OST, CFE-expressed carrier protein, and a crude membrane fraction enriched with bacterial glycan to assemble an in vitro glycosylation (IVG) reaction (Supplementary Fig. 18a). The crude membrane fraction is produced from E. coli cells expressing a single biosynthetic pathway encoding a single pathogen-specific O-antigen or capsular polysaccharide. Glycosylation with that polysaccharide can then be detected with AlphaLISA (Fig. 4a).

**Fig. 4: CFE and AlphaLISA can be combined to prototype in vitro glycosylation reactions.**

As a model system to demonstrate this workflow, we selected the capsular polysaccharide from Streptococcus pneumoniae serotype 4 (CPS4) as the pathogen glycan. The CPS4 glycan is composed of the repeating tetrasaccharide unit PyrGal-ManNAc-FucNAc-GalNAc (PyrGal: pyruvate attached to galactose; ManNAc: N-acetylmannosamine; FucNAc: N-acetylfucosamine; GalNAc: N-acetylgalactosamine) and is important for conjugate vaccine protection against pneumococcal infection¹⁰⁴. Other groups have previously shown that the CPS4 glycan can be synthesized in strains of E. coli via recombinant expression of the CPS4 biosynthetic pathway and attached to proteins using the OST PglB from Campylobacter jejuni (CjPglB)^104,105,106. We began by overexpressing the CPS4 glycan in E. coli cells, harvesting and lysing the cells, and concentrating the membrane vesicles containing CPS4 via ultracentrifugation to produce CPS4-enriched crude membrane fractions (CMFs). Following verification of the presence of the CPS4 glycan in our CMF with an anti-CPS4 dot blot (Supplementary Fig. 19), we then showed that we can perform glycosylation in IVG reactions using the CPS4 glycan. Using a 6xHis tag on the carrier protein, we performed Western blot analysis to confirm transfer of the targeted bacterial glycan onto the protein, with the banding pattern above the aglycosylated protein corresponding to transfer of different chain lengths of the bacterial glycan (Supplementary Fig. 18b). IVG reactions using CMF prepared from cells without CPS4 overexpression confirmed that only CPS4 is being transferred onto the carrier protein in our system, while an anti-CPS4 Western blot confirmed the identity of the CPS4 glycan on our glycoconjugates (Supplementary Fig. 20).

We then asked whether we could adopt our cell-free AlphaLISA workflow to detect glycosylation. We hypothesized that we would be able to distinguish between glycosylated and aglycosylated proteins by incorporating anti-glycan serum antibodies into the AlphaLISA reaction and using Protein A AlphaLISA donor beads and anti-6xHis AlphaLISA acceptor beads. Indeed, when we prepared IVG reactions using an acceptor protein containing a sequon (a short sequence of amino acids) that can (DQNAT) or cannot (AQNAT) be glycosylated by CjPglB and analyzed the reactions using AlphaLISA, we observed a distinct binding pattern only when we use the protein containing DQNAT, confirming our ability to discriminate between glycosylated and aglycosylated samples (Fig. 4b, c).

Cell-free workflow enables engineering of CjPglB for increased transfer efficiency of CPS from S. pneumoniae serotype 4

We next sought to determine if we could use our workflow to identify OST variants that have improved glycan transfer efficiency. While CjPglB has demonstrated glycan substrate promiscuity, the efficiency with which it can glycosylate acceptor proteins with different glycans varies widely^91,92,93,95. To address this challenge, recent work has demonstrated that mutating PglB can lead to improvements in glycosylation efficiency^94,107. When we tested two previously identified CjPglB mutants, we observed improvements in glycosylation efficiency with the CPS4 glycan (Supplementary Fig. 18b). To improve glycosylation efficiency further, we designed a mutant library of CjPglB to test via AlphaLISA.

We identified 15 CjPglB residues for site saturation mutagenesis: 9 residues (Y77, S80, S196, N311, Y462, G476, G477, H479, and K522) based on their predicted location within 4 angstroms of where the innermost sugar of the native CjPglB glycan sits within the enzyme’s active site, 3 residues (Q287, L288, and K289) within external loop 5 (EL5) that have previously been shown to be highly mutatable, and 3 additional residues (D475, K478, and L480) located in a flexible loop located directly above the nitrogen atom of the amide group on the acceptor protein where the glycan is covalently linked^107,108 (Fig. 5a, b). Our library of CjPglB mutants contains each of these 15 residues individually mutated to all 19 other amino acids, resulting in a set of 285 unique single mutant CjPglB constructs along with the wild-type sequence.

**Fig. 5: Cell-free workflow identifies high-efficiency CjPglB mutants.**

Using our cell-free workflow, we rapidly expressed the complete mutant library and assayed for activity (Fig. 5c). Ten CjPglB mutants (S80V, S80T, Q287K, N311I, N311V, N311M, L480A, L480W, and L480R) produced higher AlphaLISA signal than the WT CjPglB construct. Most sites were inflexible to mutation and produced no hits, whereas the sites S80, N311, and L480 produced multiple high-signal mutants. Control reactions that contained all reaction components except the CPS4 antiserum produced AlphaLISA signal equivalent to background (Supplementary Fig. 21a), and duplicate measurements for each CjPglB mutant were consistent (Supplementary Fig. 21b).

To validate the results of our screen, we performed Western blot analysis of IVG reactions glycosylating the clinically relevant carrier protein Haemophilus influenzae protein D (PD) with CPS4 using each of the seven highest-signal mutants (S80V, S80T, Q287K, N311I, N311V, N311M, and L480R) and compared the transfer efficiency to WT CjPglB (Supplementary Fig. 22). Each mutant produced a higher transfer efficiency than the WT enzyme, with the PglB^Q287K variant raising the transfer efficiency by 38% (~1.7x) to an efficiency of 91%. An anti-CPS4 Western blot confirmed transfer of the CPS4 glycan using two top CjPglB hits (Supplementary Fig. 20).

Cell-free workflow enables rapid identification of sites accessible for glycosylation in in vitro glycosylation reactions

With an efficient OST in hand, we next asked at which locations throughout a model vaccine carrier protein we could attach the bacterial polysaccharide. Conventional technologies to produce conjugate vaccines use chemical methods to randomly conjugate glycans to a carrier protein⁸⁹, which can be inefficient due to accessibility of the glycan attachment site and reduce vaccine immunogenicity^109,110. In comparison, enzymatic production of conjugate vaccines using OSTs enables site-specific glycosylation of a carrier protein precisely at the synthetically inserted sequon, which could be exploited to achieve high levels of protein expression as well as efficient and efficacious glycosylation^111,112,113. However, thus far, cell-free approaches to produce conjugate vaccines have typically relied on placing the sequon at the C-terminus of carrier proteins^97,98,99, with limited exceptions⁹⁷.

We sought to discover which sites within a clinically relevant carrier protein can be efficiently, enzymatically glycosylated in vitro. We therefore used our workflow to screen a comprehensive library of carrier protein constructs containing a sequon placed between every pair of amino acids throughout the carrier protein PD (Fig. 6a)^114,115.

**Fig. 6: Sequon scanning of *H. influenzae* protein D.**

Our library contained 328 unique PD sequences in which the glycosylation sequon “DQNAT,” surrounded by short linkers, was placed between every two amino acids in the carrier protein, beginning with an N-terminal sequon placement and ending with a C-terminal sequon placement. We then applied our cell-free workflow to this library by synthesizing each construct in a CFE reaction, combining each synthesized carrier with CPS4 glycan and CjPglB^Q287K—the high efficiency mutant identified in the OST mutagenesis screen—and assessing glycosylation of each carrier protein construct in parallel in 1-µL AlphaLISA reactions using conditions optimized to detect low glycosylation levels (Fig. 6b; Supplementary Fig. 23).

Our screen identified three sections of PD that were amenable to glycosylation in IVG reactions: 32 sites at the N-terminal end of the carrier sequence, a stretch of ~20 internal sites, and 40 sites at the C-terminal end of the carrier sequence (Fig. 6b). Mapping AlphaLISA signal to a crystal structure of PD reveals one 3-dimensional section of the carrier protein that is able to be glycosylated (Fig. 6c)¹¹⁶. In total, 94 sequon positions showed statistically significant signal above a negative control containing the sequon “DQLAT” (Fig. 6b; Supplementary Figs. 24a and 25), with top performing variants producing signal >100x above background. Triplicate measurements for each sequon variant were consistent with each other, and all negative control variants had signal equivalent to background (Supplementary Figs. 24b and 25). A selection of sequon variants with high and low AlphaLISA signal were validated via Western blot, confirming the accuracy of our workflow for comparing glycosylation efficiency of different carrier protein constructs (Supplementary Fig. 26).

Discussion

In this work, we established an integrated workflow for expressing and characterizing proteins involved in PTM installation. This workflow uniquely combines methods for cell-free DNA assembly and amplification, cell-free gene expression, and binding characterization via AlphaLISA. We show that the platform is generalizable, fast (steps are carried out in hours), and readily scalable to 384- or 1536-well plates without the need for time intensive protein purification or cell-based cloning techniques. Moreover, the platform is designed with automation in mind, with each step consisting of simple liquid handling and temperature incubation steps. We showed the utility of the platform for characterizing the activity of RREs involved in RiPP biosynthesis as well as towards engineering systems for efficient conjugate vaccine production, including protein engineering to increase the efficiency of a glycan-installing enzyme.

Through our work characterizing the binding activity of TbtF to TbtA, we found that our methodology can within hours of obtaining DNA samples recapitulate findings obtained using traditional approaches that take days to weeks to perform. Looking forward, we can use our workflow for more advanced PTM engineering strategies. For example, recent efforts have created novel RiPP products by engineering peptide substrates to contain leader sequences recognized by tailoring enzymes from multiple classes of RiPPs¹¹⁷. In doing so, a peptide substrate was modified with RRE-dependent tailoring enzymes from two different BGCs. Creating more complex systems with even greater numbers of RRE-dependent modifications will require an understanding of appropriate design rules for enabling recognition of the precursor peptide by the desired tailoring enzymes. Using the information gained by mutational scanning, we were able to systematically produce a synthetic peptide with only 40% identity to the wild-type peptide that exhibits AlphaLISA binding signal on par with the wild-type peptide. Understanding the minimal set of amino acid residues required for recognition will be important for engineering increasingly complex molecules. Our workflow provides a method for understanding and prototyping these requirements.

Additionally, by coupling our workflow with computational prediction tools, we demonstrated how our platform can screen for natural product BGCs likely to function in an in vitro setting. While we were able to produce Las-1010⁸⁵ (Las24), many of the clusters we detected binding activity for in our screen did not produce mature lasso peptides. We hypothesize that this could be due to a number of reasons, including that the in vitro reaction environment may lack other important components natively present in in vivo systems, such as auxiliary genes as has been demonstrated for other natural products^118,119. Advances in developing cell-free lysates from non-E. coli-based organisms, such as Streptomyces^{120,121,122,123}, could be incorporated into future studies as a method for testing expression systems that contain native auxiliary factors. Despite this, by prioritizing BGCs with a demonstrated functional first step (RRE binding), our methods can be used as is to narrow down the number of proteins needed to be expressed and purified for attempts at in vitro reconstitution. Furthermore, we note that recent advances in deep learning models^124,125 and protein language models¹²⁶ have accelerated our ability to predict the substrate promiscuity of RiPP biosynthetic enzymes. Key to these computational tools and future Artificial Intelligence (AI) models is the ability to rapidly generate training datasets and validate any resulting predictions. Our workflow can be interfaced with these computational tools.

We also demonstrated how our workflow can be used to engineer both enzymes and substrates used in glycoprotein synthesis systems. With our platform, we rapidly assessed 285 unique mutants of CjPglB to enable efficient production of glycoproteins with the CPS from S. pneumoniae serotype 4, a major cause of pneumonia in disadvantaged communities¹²⁷. Importantly, our screen uncovered beneficial mutations in both undiscovered sites and sites that had been identified in previous CjPglB mutagenesis experiments¹⁰⁸ for other unrelated glycans, demonstrating the significance of a fast, high-throughput method to discover mutations unique for transferring each pathogen glycan of interest⁹⁴. Because the identity of all tested mutants, including both low- and high-performing mutations, is known at the time of assay, we believe our workflow could be readily interfaced with machine-learning guided strategies^128,129,130 to more rapidly engineer oligosaccharyltransferases.

To highlight the future potential for making and optimizing conjugate vaccines, we used our workflow to rapidly assess the glycosylation of 328 unique variants of the carrier protein PD in vitro to discover high efficiency glycosylation sites throughout the carrier protein. Over a quarter of all sites produced AlphaLISA signal significantly higher than a negative control. Mapping sites that produced high AlphaLISA signal to a crystal structure of PD revealed one section of the protein that was highly glycosylated, suggesting that steric effects may play a role in determining the ability to glycosylate unique sequon positions¹³¹ or that placing sequons in other locations of the protein may lead to low protein expression in IVG reactions.

One limitation of our work is that the results we obtain are semi-quantitative. With our current platform design, we can compare the relative binding affinity of different RRE-peptide pairs or glycosylation efficiency of specific OST mutants but are unable to provide exact quantitative measurements of these phenomena (e.g., k_d, % glycosylation, etc.). Thus, we suggest that our method can be integrated with more traditional assays by first using our workflow as a screening tool to down-select specific protein variants from larger libraries for follow-up experiments with smaller numbers of samples.

In sum, we developed a versatile, rapid, and robust cell-free platform for characterizing and engineering PTMs. We expect that this platform can be applied to other classes of PTMs and will accelerate the design and production of biologics with complex PTMs and improved therapeutic properties.

Methods

DNA design and preparation for RiPPs

For the initial screen of known RRE’s, gene constructs were ordered from Twist Biosciences (synthesized into pJL1 backbone between NdeI and SalI restriction sites). Briefly, sequences were retrieved from literature or Uniprot and codon optimized using the IDT Codon Optimization Tool. For full length RRE constructs, a codon optimized sequence for a Twin-Strep tag and PAS11 linker were added to the N-terminus of the nucleotide sequence. MBP-fusion RRE constructs were constructed by replacing either the C-terminus (for proteins in which the RRE domain was predicted to occur in the N-terminus) or N-terminus (for proteins in which the RRE domain was predicted to occur in the C-terminus) portion of the sequence with codon optimized sequences for MBP and a GS7 linker. For precursor peptide sequences, sequences encoding either the full-length precursor or leader sequence were fused to an N-terminal sFLAG tag and GS7 linker.

For all peptide sequences used in AlphaLISA based assays, an N-terminal sFLAG tag and GS7 linker were incorporated into the design. For sequences utilized in the AlphaLISA alanine scan workflow, each alanine variant peptide was constructed by replacing the corresponding wild-type codon with “GCC”. To construct synthetic sfGFP peptides, the first 40 amino acids of sfGFP (with a G23T mutation) was first codon optimized. Each variant was then constructed by replacing the appropriate wild-type codon with the codon corresponding to the desired residue change. All peptide sequences were ordered as eBlocks with overhang to a linearized pJL1 backbone for use in Gibson Assembly reactions.

For all computationally predicted lasso peptide proteases and cyclases, the predicted gene sequences were codon optimized using the IDT Codon Optimization Tool. At the N-terminus of each sequence, maltose binding protein (MBP) and a short linker were incorporated to enable soluble expression and detection via AlphaLISA based assays. All genes were synthesized by Twist Biosciences either in pJL1 (for expression in PUREfrex) or in a modified pET vector (for in vivo expression). The corresponding (untagged) precursor sequences were also synthesized by Twist Biosciences in pJL1 for use in assembling complete lasso peptide BGCs.

DNA templates for expression in PUREfrex were prepared either in plasmid form using ZymoPURE II Plasmid Midiprep Kit (Zymo Research) or as linear expression templates (LETs). For LETs, eBlocks were inserted into pJL1 using Gibson Assembly with a linearized pJL1 backbone. Following Gibson Assembly, each reaction was then diluted 10x in nuclease free water. 1 μL of diluted Gibson Assembly reaction was then used in a 50 μL PCR reaction using Q5 Hot Start High-Fidelity DNA Polymerase (New England Biolabs).

All nucleotide sequences used in the RiPPs portion of this study are provided in the Supplementary Information or in the Supplementary Data 1 File.

FluoroTect^TM gel

PUREfrex 2.1 (Gene Frontier) reactions were assembled according to manufacturer instructions using 1 μL of unpurified template LET and 0.5 μL of FluoroTect^TM (Promega) per 10 μL reaction. Following incubation at 37 °C for 6 h, samples were centrifuged at 12,000 x g for 10 min at 4 °C. 3 μL of supernatant was then mixed with 1 μL of 40 μg/mL RNase A and incubated at 37 °C for 10 min. Following incubation, 1 μL of 1 M DTT, 2.5 μL of 4X Protein Sample Loading Buffer for Western Blots (Li-COR Biosciences), and 2.5 μL of water were added to each sample and the samples were then incubated at 70 °C for 10 min. Samples were then loaded on a NuPAGE 4–12% Bis-Tris Protein Gel and run for 40 min at 200 V in MES Running Buffer. For comparison, a lane was loaded with BenchMark fluorescent protein standard (Thermo Fisher Scientific). The resulting gel was then imaged using both the 600 and 700 fluorescent channel on a LICOR Odyssey Fc (Li-COR Biosciences).

AlphaLISA reactions for RiPPs

PUREfrex 2.1 (Gene Frontier) reactions were assembled according to manufacturer instructions. Briefly, 1 μL of the unpurified LET reaction—encoding for the precursor peptide or RRE—was added as a template per 10 μL PUREfrex reaction. Reactions were then incubated at 37 °C for 5 h. After incubation, these samples were then diluted in a buffer consisting of 50 mM HEPES pH 7.4, 150 mM NaCl, 1 mg/mL BSA, and 0.015% v/v Triton X-100. Following dilution, an Echo 525 acoustic liquid handler was used to dispense 0.5 μL of diluted RRE, 0.5 μL of diluted peptide, and 0.5 μL of blank buffer from a 384-well polypropylene 2.0 Plus Source microplate (Labcyte) using the 384PP_Plus_GPSA fluid type into a ProxiPlate-384 Plus, White 384-shallow well destination microplate (Revvity). The plate was then sealed and equilibrated at room temperature for 1 h. Next, anti-FLAG Alpha Donor beads (Perkin Elmer) were used to immobilize the sFLAG tagged peptides and anti-Maltose-Binding (MBP) AlphaLISA acceptor beads were used to immobilize the MBP-tagged RREs. 0.5 μL of acceptor and donor beads diluted in buffer were added to each reaction to a final concentration of 0.08 mg/mL and 0.02 mg/mL donor and acceptor beads, respectively. Reactions were then equilibrated an additional hour at room temperature in the dark. For analysis, reactions were incubated for 10 min in a Tecan Infinite M1000 Pro (using Tecan i-control v. 3.9.1.0) plate reader at room temperature and then chemiluminescence signal was read using the AlphaLISA filter with an excitation time of 100 ms, an integration time of 300 ms, and a settle time of 20 ms. Results were visualized using Prism version 9.5.1 (GraphPad).

Computational prediction of lasso peptide BGCs

A diverse collection of 39,311 publicly available genomes (2020 April) spanning soil bacteria, metagenomes and extremophiles were analyzed using AntiSMASH 5.1.2 identifying 315,876 biosynthetic gene clusters (Supplementary Table 2). A total of 2,574 lasso peptide clusters were identified, and from this set we then performed an additional filtering step to identify 1,882 BGCs which contained a complete collection of essential biosynthetic enzymes (Supplementary Table 3). Specifically, we note that predictions using AntiSMASH rely on identifying clusters in which the predicted components include homology to PF13471 and a proximal asparagine synthetase, micJ25, or mcJC. Therefore, there is a possibility that clusters identified by AntiSMASH are missing essential enzymes (which we did observe) and we did not include these incomplete clusters in our follow-up analysis. To further prioritize these BGCs, a sequence similarity network^132,133 was used to group identified precursor peptides with a collection of known lasso peptide sequences. Peptide sequences that did not group with known sequences were considered novel and were nominated for further investigation. Subsequent filtering of the remaining novel BGCs included selecting BGCs based on a core peptide length of 17–27 amino acids and whether the mature lasso peptide is predicted to carry a positive charge at a neutral pH. Calculation of the predicted isoelectric point of the predicted core peptides used Thermo Fisher Scientific’s peptide analysis tool (https://www.thermofisher.com/us/en/home/life-science/protein-biology/peptides-proteins/custom-peptide-synthesis-services/peptide-analyzing-tool.html). This narrowed the selection to 202 BGCs, of which 47 were chosen. A total of 210 genes were synthesized by Twist Bioscience. All amino acid sequences and metadata for the 47 selected BGCs are provided in the Supplementary Data 1 File.

In vivo expression and purification of lasso peptide tailoring enzymes

For computationally predicted MBP-RREs and MBP-proteases, constructs of the target protein in pET.BCS.RBSU.NS backbone were transformed into BL21 Star (DE3) cells, plated on LB agar plates containing 100 μg/mL carbenicillin, and incubated at 37 °C. Single colonies were cultured in 50 mL of LB containing 100 μg/mL carbenicillin at 37 °C and 250 RPM. After overnight incubation, 20 mL of the overnight culture were used to inoculate 1 L of LB supplemented with 2 g/L glucose and 100 μg/mL carbenicillin. Cells were grown at 37 °C and 250 RPM and induced for protein production at OD₆₀₀ 0.6-0.8 with 500 μL of 1 M IPTG. Four hours post induction, cells were harvested via centrifugation at 5000 x g for 10 min at 4 °C and flash frozen in liquid nitrogen.

After thawing on ice, cell pellets were resuspended in lysis buffer composed of 50 mM Tris-HCl pH 7.4, 500 mM NaCl, 2.5 % (v/v) glycerol, and 0.1% Triton X-100. For cell pellets used to overexpress RREs and cyclases, the lysis buffer also contained 6 mM PMSF, 100 μM Leupeptin, and 100 μM E64. Cell suspensions were then supplemented with 1 mg/mL lysozyme and lysed via sonication using a Qsonica sonicator at 50% amplitude for 2 min with 10 s on 10 s off cycles. Following sonication, insoluble debris were removed via centrifugation at 14,000 x g for 30 min at 4 °C. Per 1 L of cell culture, 5 mL of amylose resin was equilibrated with 5 to 10 column volumes of wash buffer (50 mM Tris HCl, 500 mM NaCl, 2.5 % (v/v) glycerol, pH 7.4) in a 50 mL conical tube and mixed via inversion. Resin was separated from wash buffer by spinning at 2,000 x g for 2 min at 4 °C and the supernatant was then poured off. Equilibration was repeated for a total of 4 times with fresh equilibration buffer. Following the last equilibration, the cleared cell lysis supernatant was added to the resin and incubated for 2 h at 4 °C with constant agitation on a shake table. Following incubation on the resin, the resin was washed once with 5 column volumes of lysis buffer followed by 5 column volumes of wash buffer four times. For the last wash, the resuspended resin was loaded in a 25 mL gravity flow column and drained via gravity flow. For elution, 15 mL of elution buffer (50 mM Tris HCl, 300 mM NaCl, 10 mM maltose, 2.5% (v/v) glycerol, pH 7.4) was added to the gravity flow column and collected. Samples were then buffer exchanged into storage buffer (50 mM HEPES, 300 mM NaCl, 0.5 mM TCEP, 2.5% (v/v) glycerol, pH 7.5) using amicon spin filters (50 kDa MWCO) by spinning at 4,500 x g for 10–15 min. Samples were then aliquoted, flash frozen, and stored at -80 °C until use. Total protein concentration of each purified sample was determined using a Bradford assay (Biorad). Percent purity of each sample was determined by running diluted aliquots of each purified protein on a 4–12% Bis-Tris gel and staining with Optiblot Blue (Abcam). After destaining, each gel was imaged using the 700 fluorescent channel on a LICOR Odyssey Fc (Li-COR Biosciences, USA) and percent purity was determined via densitometry using Licor Image Studio Lite (v. 5.2.5). Final concentrations of each protein were then calculated by multiplying the total protein content by the percent purity.

Computationally identified cyclases were expressed and purified according to the process outlined above for computationally identified RREs except for transforming into BL21 Star (DE3) cells already transformed with pG-KJE8. LB agar and media for cell growth were supplemented with 20 μg/mL chloramphenicol in addition to 100 μg/mL carbenicillin. At inoculation, LB was supplemented with 2 g/L glucose, 100 μg/mL carbenicillin, 20 μg/mL chloramphenicol, and 2 ng/mL anhydrotetracycline per 1 L of media for induction of folding chaperones.

In vitro enzymatic assembly of lasso peptide BGCs

PUREfrex 2.1 (Gene Frontier) reactions to express the precursor peptide were assembled according to manufacturer instructions using 1 μL of 200 ng/μL plasmid (pJL1 backbone encoding precursor peptide of interest) per 10 μL reaction and incubated at 37 °C for at least 5 h. Purified proteins were buffer exchanged using Zeba Micro Spin Desalting Columns (7 K MWCO) into synthetase buffer (50 mM Tris-HCl pH 7.5, 125 mM NaCl, 20 mM MgCl₂). 10 μL reactions were then assembled using 5 μL of PUREfrex reaction, and the appropriate volume of each individual purified enzyme or buffer such that both the RRE and protease were at a final concentration of 10 μM and the cyclase was at a final concentration of 1 μM. Reactions were supplemented to a final concentration of 10 mM DTT and 5 mM ATP and incubated at 37 °C for varying lengths of time. For analysis, samples were desalted using Pierce C18 spin tips (10 μL bed), spotted on a MALDI target plate using 50% saturated CHCA matrix in 80% ACN with 0.1% TFA, and analyzed using a Bruker RapiFlex MALDI-TOF mass spectrometer (flexControl v. 4.0) in reflector positive mode at Northwestern University’s Integrated Molecular Structure Education and Research Center (IMSERC). MALDI-TOF data were analyzed using flexAnalysis v. 4.9 (Bruker).

LC-MS/MS

Reactions were assembled as described above at a scale of 300 μL, desalted using Pierce C18 spin tips, and concentrated to 25 μL using a SpeedVac Vaccuum concentrator system. Samples were then injected on a 1290 Infinity II UHPLC System (Agilent Technologies Inc., Santa Clara, California, USA) onto a Poroshell 120 EC-C18 column (1.9 μm, 50 × 2.1 mm) (Phenomenex, Torrance, California, USA) for C-18 chromatography which was maintained at 45 °C with a constant flow rate at 0.500 ml/min, using a gradient of mobile phase A (water, 0.1 % formic acid) and mobile phase B (100% acetonitrile, 0.1% formic acid). The gradient program was as follows: 0–1 min, 2% B; 1–11 min, 10 – 40% B; 11–12 min, 40–90% B; 12–14 min, hold 90% B; 3 min hold at 10% B. “Targeted MS/MS” in positive ion mode acquisition was conducted on the samples on an Agilent 6545 quadrupole time-of-flight mass spectrometer equipped with a JetStream ionization source. The source conditions were as follows: Gas Temperature, 325 °C; Drying Gas flow, 13 L/ min; Nebulizer, 35 psi; Sheath Gas Temperature, 275 °C; Sheath Gas Flow, 12 L/ min; VCap, 4000 V; Fragmentor, 175 V; Skimmer, 65 V; and Oct 1 RF, 750 V. The acquisition rate in Auto MS/MS mode was 8 spectra/second, from m/z 100 – 1700 m/z range for MS1 and 3 spectra/second for MS/MS. A ramped collision energy was utilized with a slope and offset of 3.1 and 1, respectively for +2 ions, and a slope and offset of 3.6 and -4.8, respectively for ≥3 ions, and utilizing m/z 121.05087300 and m/z 922.00979800 in positive ion mode as reference masses which is introduced into the ion source by a separate nebulizer and the flow was maintained by an isocratic pump. Additionally, a Targeted Mass Table was created to acquire data on the cyclic peptides (Supplementary Table 5).

Carboxypeptidase treatment of lasso peptides

Assembled reactions (20 μL scale) were desalted using Pierce C18 spin column and eluted into 20 μL of acetonitrile. After solvent removal under vacuum, reactions were resuspended in a solution containing carboxypeptidase Y at 50 ng/μL in 1X PBS (10 μL) and incubated at room temperature overnight. The mixtures were evaporated to dryness and resuspended in 3 μL saturated α-Cyano-4-hydroxycinnamic acid (CHCA) matrix solution in TFA (trifluoroacetic acid). Samples were then spotted on a matrix assisted laser desorption/ionization (MALDI) plate and analyzed using a Bruker RapiFlex MALDI-TOF mass spectrometer (flexControl v. 4.0) in reflector positive mode at Northwestern University’s Integrated Molecular Structure Education and Research Center (IMSERC). MALDI-TOF data were analyzed using flexAnalysis v. 4.9 (Bruker).

DNA design and preparation for glycosylation

For sequences used in the PglB mutant screen, the wild-type sequence for PglB was retrieved from Uniprot (Q5HTX9) and codon optimized using the IDT Codon Optimization Tool. A codon optimized linker and c-myc tag were appended to the C-terminus of the sequence. Each single variant sequence was then created using the codon optimized wild-type sequence as the template and replacing the respective codon with the most prevalent codon for the replacement amino acid. All protein sequences were ordered as eBlocks with overhang to a linearized pJL1 backbone for use in Gibson Assembly reactions.

The wild-type sequence for Haemophilus influenzae protein D was retrieved from Uniprot (Q06282) and codon optimized using the IDT Codon Optimization Tool. Codon optimized linkers, a StrepII tag, and a 6xHis tag were appended to the C-terminus of the PD sequence. Each sequon variant was created by inserting the DNA sequence “AGAGCAGGAGGTGACCAGAACGCTACACGCGCAACCACA” (AA sequence: “RAGGDQNATRATT”) between each codon in the wild-type PD sequence. Eleven negative controls were added by instead inserting the DNA sequence “AGAGCAGGAGGTGACCAGTTGGCTACACGCGCAACCACA” (AA sequence: “RAGGDQLATRATT”), in which the asparagine in the sequon is replaced with a leucine that is not glycosylated. All sequon variants were ordered as eBlocks with overhang to linearized pJL1 backbone for use in Gibson Assembly reactions.

The cell-free library generation for the PglB mutant screen was prepared as follows: (1) each backbone gBlock was amplified in a 50 µL PCR reaction with 0.1 ng template added; (2) PCR products were cleaned with a Clean and Concentrate kit (Zymo); (3) cleaned PCR product was diluted to 6.7 ng/µL with nuclease-free water; (4) eBlocks were mixed with their respective pair of backbone gBlocks for a final concentration of 1.5 ng/µL of each component in a 5 µL Gibson reaction; (5) 4 µL of Gibson product was added to a 16 µL rolling circle amplification (RCA) reaction using phi29-XT polymerase (NEB); (6) the completed RCA reaction was diluted 1:1 with the addition of 20 µL nuclease-free water. All PCR reactions used Q5 Hot Start DNA polymerase (NEB). The diluted RCA product serves as a template for expression of each PglB mutant in CFE. To express sfGFP carrier protein, 200 µL CFE reactions were prepared containing 13.3 ng/µL plasmid encoding the carrier.

The cell-free library generation for the PD sequon walking experiment was prepared using the same workflow as the PglB mutant screen, but with the following exceptions: (1) sequon variant eBlocks and backbone gBlocks were added to 5 µL Gibson reactions for a final concentration of 8 µM of each sequence; (2) Gibson reactions were diluted 6x in nuclease-free water; (3) 1 µL diluted Gibson product was added to 9 µL PCR reactions to generate linear expression templates of each sequon variant. Expression of each sequon variant was performed by adding 1 µL of linear expression template to a 4 µL CFE reaction.

All nucleotide sequences used in the glycosylation portion of this study are provided in the Supplementary Information or in the Supplementary Data 1 File.

Cell extract preparation

Extract from BL21 Star^TM (DE3) cells was prepared based on previous reports^134,135,136. Briefly, an overnight culture was used to inoculate a culture of 2 x YTPG at the 10 L scale (target optical density at 600 nm (OD₆₀₀) = 0.06-0.08) in a Sartorius Stedim BIOSTAT Cplus bioreactor. The culture was then incubated at 37 °C with agitation set to 250 RPM. Once the culture reached OD₆₀₀ = 0.6, the cells were induced for T7 RNA polymerase expression by adding IPTG to a final concentration of 0.5 mM. At OD₆₀₀ = 3.0, the cells were harvested and centrifuged at 8,000 x g for 5 min. The resulting cell pellet was then collected and washed 3x with 25 mL of S30 buffer (10 mM Tris acetate pH 8.2, 14 mM magnesium acetate, and 60 mM potassium acetate) by resuspending in cycles of 15 s vortexing and 15 s on ice. In between each wash step, cells were pelleted via centrifugation at 10,000 x g for 2 min and the supernatant was poured off. After the final wash step, the supernatant was poured off, the mass of the cell pellet was recorded, and the cell pellets were flash frozen and stored at -80 °C. For lysate preparation, the cell pellets were thawed on ice for 1 hr. Next, 1 mL of S30 buffer per gram of cell pellet was added to each tube. The cells were then resuspended via vortexing, again in cycles of 15 s vortexing and 15 s on ice. After resuspension, the cells were then lysed via homogenization using a single pass through an Avestin EmulsiFlex-B15 homogenizer between 20,000–25,000 psig. Following homogenization, the lysed sample was centrifuged at 12,000 x g for 10 min at 4 °C. The supernatant was then collected and centrifuged again at 12,000 x g for 10 min at 4 °C. Following the final centrifugation, the supernatant was pooled, aliquoted, flash frozen, and stored at -80 °C until use.

For extracts enriched with CjPglB^Q287K and CPS from S. pneumoniae serotype 4 and derived from Hobby strain¹⁰⁵, the above directions provided for BL21 Star^TM (DE3) cells were followed with the following changes: Prior to growing the overnight cultures, electrocompetent Hobby cells were transformed with pSF-CjPglB^Q287K-LpxE-KanR and pB-4¹⁰⁴ and plated on LB agar plates containing 50 mg/mL Kanamycin and 20 mg/mL of tetracycline. During each cell growth phase, the cultures were also supplemented with 50 mg/mL Kanamycin and 20 μg/mL of tetracycline. At OD₆₀₀ = 0.6-0.8, the culture was supplemented with 0.1% w/v arabinose in addition to 0.5 mM IPTG to induce for CjPglB^Q287K and CPS4 expression respectively, and the incubator was turned down to 220 RPM and 30 °C. Additionally, the supernatant from the first 12,000 x g centrifugation spin was collected and underwent runoff by wrapping the tubes in aluminum foil and incubating at 37 °C and 250 RPM for 1 h. Following runoff, the tubes were centrifuged at 10,000 x g at 4 °C for 10 min and the supernatant was collected, mixed, and aliquoted before flash freezing and storing at -80 °C until use.

Crude membrane fraction

For producing crude membrane fraction, Hobby strain cells were transformed with pB-4¹⁰⁴ and plated on LB agar plates containing 20 μg/mL of tetracycline. A single colony was then used to inoculate a 50 mL overnight culture of LB supplemented with 20 μg/mL of tetracycline. The next morning, 1 L of 2xYTPG supplemented with 20 μg/mL of tetracycline was inoculated with a target starting OD₆₀₀ = 0.06-0.08. The culture was the incubated at 37 °C with agitation set to 250 RPM. At OD₆₀₀ = 0.6-0.8, the culture was supplemented with 0.5 mM IPTG and the culture was then incubated overnight at 30 °C and agitation of 220 RPM. The next morning, the cells were harvested via centrifugation at 8,000 x g for 5 min at 4 °C. After pouring off the supernatant, 1 mL per gram of cell pellet of resuspension buffer (50 mM Tris HCl, pH 7.5, 25 mM NaCl) was added to the pellets. The cells were then resuspended via vortexing in cycles of 15 s vortexing and 15 s on ice. After the cells were fully resuspended, the sample was lysed via homogenization using a single pass through an Avestin EmulsiFlex-B15 homogenizer between 20,000–25,000 psig. Following lysis, the sample was then centrifuged at 12,000 x g at 4 °C for 30 min. The supernatant was then ultracentrifuged at 100,000 x g at 4 °C for 1 h to pellet the membrane vesicles. Following ultracentrifugation, the supernatant was poured off and 0.2 mL/gram original cell pellet of resuspension buffer (50 mM Tris HCl, pH 7.5, 25 mM NaCl, 1% w/v DDM) was added to the pellet before incubating overnight at 4 °C on a shake table. The next morning, the samples were pipette mixed to ensure complete resuspension of the pellet and the samples were incubated at room temperature for 30 min. Finally, the samples were spun at 16,000 x g for 1 h at 4 °C and the supernatant was mixed, aliquoted, flash frozen, and stored at -80 °C until use.

Nanodisc supplemented CFE reactions

In vitro expression of each WT or mutant PglB construct was performed by adding 1 µL of diluted RCA product to a 4 µL BL21 Star^TM (DE3) CFE reaction supplemented with 66.7 µM MSP1E3D1 POPC nanodiscs (Cube Biotech). CFE reactions were carried out using the PANOx-SP reaction, with reaction formulations previously described^45,137,138. Briefly, reactions were assembled with the following final concentrations: 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 1.2 mM ATP, 0.85 mM GTP, 0.85 mM UTP, 0.85 mM CTP, 34 μg/mL folinic acid, 0.17 mg/mL tRNA, 0.4 mM nicotinamide adenine dinucleotide (NAD), 0.27 mM coenzyme A (CoA), 4 mM oxalic acid, 1 mM putrescine, 1.5 mM spermidine, 57 mM HEPES pH 7.2, 2 mM of each of the 20 standard amino acids, 33 mM phosphoenolpyruvate (PEP) and 30% v/v cell extract. All reactions were incubated at 30 °C overnight.

In vitro glycosylation reactions

In the PglB mutagenesis screen, in vitro glycosylation reactions were performed by combining 0.4 µL unpurified sfGFP acceptor, 1 µL unpurified PglB mutant, and 3 µL S. pneumoniae CPS4 crude membrane fraction in a 5 µL reaction volume containing 0.1% w/v DDM (Anatrace), 1% w/v Ficoll 400 (Sigma), 10 mM manganese chloride (Sigma), and 50 mM HEPES (Sigma).

In the sequon walking experiment, in vitro glycosylation reactions were performed by combining 1 µL unpurified sequon variant and 2.5 µL enriched extract containing CjPglB^Q287K and S. pneumoniae CPS4 in a 5 µL reaction volume containing 0.1% w/v DDM, 1% w/v Ficoll 400, 10 mM manganese chloride, and 50 mM HEPES.

AlphaLISA reactions for glycoconjugates

Completed in vitro glycosylation reactions were diluted in a buffer consisting of 50 mM HEPES pH 7.4, 150 mM NaCl, 1 mg/mL BSA, and 0.015% v/v Triton X-100. All glycoconjugate AlphaLISA experiments were performed with 1 µL reaction volumes with a 0.08 mg/mL final concentration of Protein A donor beads and 0.02 mg/mL final concentration of anti-6xHis acceptor beads, which immobilize the S. pneumoniae CPS4 antiserum and the 6xHis-tagged glycoconjugates, respectively. Following dilution, an Echo 525 acoustic liquid handler was used to dispense 0.25 µL diluted in vitro glycosylation product, 0.25 µL S. pneumoniae CPS4 antiserum, 0.25 µL blank buffer, and 0.125 µL anti-6xHis acceptor beads diluted in buffer from a 384-well polypropylene 2.0 Plus Source microplate using the 384PP_Plus_GPSA fluid type into an AlphaPlate 1536-well destination microplate (Revvity). The plate was sealed and equilibrated for 1 h at room temperature. Following incubation, 0.125 µL of Protein A donor beads diluted in buffer were transferred to each reaction. Reactions were equilibrated for an additional hour at room temperature in the dark. For analysis, reactions were incubated for 10 min in a Biotek Synergy Neo2 plate reader at room temperature, and chemiluminescent signal was read using the AlphaLISA filter with an excitation time of 100 ms, an integration time of 300 ms, and a settle time of 20 ms. For the PD sequon walking experiment, replicate AlphaLISA reactions were performed on separate plates. Signal for each plate was normalized using the formula: \({Normalized\; signal}=\frac{({Raw\; signal}-{Mean\; neg}.{control\; signal})}{{Mean\; pos}.{control\; signal}}\). Results were visualized using Prism version 10.3.1 (GraphPad).

Western blotting

Samples were loaded on a 4–12% Bis-Tris gel and run with either MOPS SDS or MES SDS buffer for 45 min at 200 V. A semidry transfer cell was then used to transfer the samples to Immobilon-P-poly(vinylidene difluoride) PVDF 0.45 μm membranes at 80 mA per blot for 45 min. After transferring, the membranes were blocked for 30 min at room temperature in Intercept Blocking Buffer (Licor) with gentle shaking. Following blocking, the blots were briefly rinsed with 1x PBST and then probed for 1 h at room temperature with gentle shaking using one of the following antibodies diluted into Intercept Blocking Buffer with 0.2% Tween20: anti-6xHis (Abcam, ab1187) at 1:7500 dilution, type 4 pneumococcal antiserum (Cedarlane, 16747(SS)) at 1:1000 dilution, or anti-myc (Abcam, ab9106) at 1:1000 dilution. Following primary incubation, membranes were rinsed twice with 1x PBST followed by 3 five min washes in 1x PBST at room temperature with gentle shaking. Following washing, the blots were probed for 1 h at room temperature with gentle shaking using a fluorescent goat, anti-rabbit antibody GAR-680RD (Licor, 926-68071) at a dilution of 1:10,000 in Intercept Blocking Buffer with 0.2% Tween20 and 0.1% SDS. Then, the membranes were washed as described earlier. Finally, the blots were imaged with either a Licor Odyssey Fc or an Azure 600 imager and analyzed by densitometry using Licor Image Studio Lite (v. 5.2.5). The fluorescence background was subtracted from each membrane before assessing densitometry. All uncropped dot blots and Western blots are provided (Supplementary Figs. 27–33).

Statistics and reproducibility

All sample sizes, error bars, and statistical tests are defined in Figure legends. No statistical method was used to predetermine sample size. No data were excluded from the analyses. The experiments were not randomized, and researchers were not blinded to the experimental conditions. Statistical analyses were performed using Excel version 16.95.4 and GraphPad Prism version 10.3.1.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data are provided with this paper (data split between four source data files, with the relevant file noted in each figure caption). All sequences and accession codes for proteins used throughout this study are included in the Supplementary Information or in Supplementary Data 1. Protein structures used in this work include a homology model of C. jejuni PglB⁹⁴ and PDB ID: 8CWP. The LC-MS/MS data generated in this study has been deposited in the Zenodo repository under https://doi.org/10.5281/zenodo.15385022. Source data are provided with this paper.

References

Anselmo, A. C., Gokarn, Y. & Mitragotri, S. Non-invasive delivery strategies for biologics. Nat. Rev. Drug Discov. 18, 19–40 (2019).
Article CAS PubMed Google Scholar
Sharma, P., Joshi, R. V., Pritchard, R., Xu, K. & Eicher, M. A. Therapeutic antibodies in medicine. Molecules 28, 6438 (2023).
Bray, G. et al. A multicenter study of recombinant factor VIII (recombinate): safety, efficacy, and inhibitor risk in previously untreated patients with hemophilia A. The recombinate study group. Blood 83, 2428–2435 (1994).
CAS PubMed Google Scholar
Roth, D. A. et al. Human recombinant factor IX: safety and efficacy studies in hemophilia B patients previously treated with plasma-derived factor IX concentrates. Blood 98, 3600–3606 (2001).
Article CAS PubMed Google Scholar
Crosnier, J. et al. Randomised placebo-controlled trial of hepatitis B surface antigen vaccine in French haemodialysis units: I, medical staff. Lancet 317, 455–459 (1981).
Article Google Scholar
Essink, B. et al. Pivotal phase 3 randomized clinical trial of the safety, tolerability, and immunogenicity of 20-valent pneumococcal conjugate vaccine in adults aged ≥18 years. Clin. Infect. Dis. 75, 390–398 (2022).
Article PubMed Google Scholar
Leader, B., Baca, Q. J. & Golan, D. E. Protein therapeutics: a summary and pharmacological classification. Nat. Rev. Drug Discov. 7, 21–39 (2008).
Article CAS PubMed Google Scholar
Chen, C. H. & Lu, T. K. Development and challenges of antimicrobial peptides for therapeutic applications.Antibiotics 9, 24 (2020).
Mathieu, C., Gillard, P. & Benhalima, K. Insulin analogues in type 1 diabetes mellitus: getting better all the time. Nat. Rev. Endocrinol. 13, 385–399 (2017).
Article CAS PubMed Google Scholar
Wang, L. et al. Therapeutic peptides: current applications and future directions. Signal Transduct. Target. Ther. 7, 48 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rossino, G. et al. Peptides as therapeutic agents: challenges and opportunities in the green transition era.Molecules 28, 7165 (2023).
Joo, S. H. Cyclic peptides as therapeutic agents and biochemical tools. Biomol. Ther. 20, 19–26 (2012).
Article CAS Google Scholar
Chatterjee, J., Rechenmacher, F. & Kessler, H. N-methylation of peptides and proteins: an important element for modulating biological functions. Angew. Chem. Int. Ed. 52, 254–269 (2013).
Article CAS Google Scholar
Hansson, K. & Stenflo, J. Post-translational modifications in proteins involved in blood coagulation. J. Thromb. Haemost. 3, 2633–2648 (2005).
Article CAS PubMed Google Scholar
Liu, L. Antibody glycosylation and its impact on the pharmacokinetics and pharmacodynamics of monoclonal antibodies and FC-fusion proteins. J. Pharm. Sci. 104, 1866–1884 (2015).
Article CAS PubMed Google Scholar
Stone, M. J., Chuang, S., Hou, X., Shoham, M. & Zhu, J. Z. Tyrosine sulfation: an increasingly recognised post-translational modification of secreted proteins. N. Biotechnol. 25, 299–317 (2009).
Article CAS PubMed Google Scholar
Walsh, G. Post-translational modifications of protein biopharmaceuticals. Drug Discov. Today 15, 773–780 (2010).
Article CAS PubMed Google Scholar
Doll, S. & Burlingame, A. L. Mass spectrometry-based detection and assignment of protein posttranslational modifications. ACS Chem. Biol. 10, 63–71 (2015).
Article CAS PubMed Google Scholar
Zhang, L. et al. Analysis of monoclonal antibody sequence and post-translational modifications by time-controlled proteolysis and tandem mass spectrometry*. Mol. Cell. Proteom. 15, 1479–1488 (2016).
Article CAS Google Scholar
Larsen, M. R., Trelle, M. B., Thingholm, T. E. & Jensen, O. N. Analysis of posttranslational modifications of proteins by tandem mass spectrometry. BioTechniques 40, 790–798 (2006).
Article CAS PubMed Google Scholar
Pillai-Kastoori, L., Schutz-Geschwender, A. R. & Harford, J. A. A systematic approach to quantitative western blot analysis. Anal. Biochem. 593, 113608 (2020).
Article CAS PubMed Google Scholar
Tong, Q.-H., Tao, T., Xie, L.-Q. & Lu, H.-J. ELISA–PLA: A novel hybrid platform for the rapid, highly sensitive and specific quantification of proteins and post-translational modifications. Biosens. Bioelectron. 80, 385–391 (2016).
Article CAS PubMed Google Scholar
Ongpipattanakul, C. & Nair, S. K. Molecular basis for autocatalytic backbone N-methylation in RiPP natural product biosynthesis. ACS Chem. Biol. 13, 2989–2999 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chekan, J. R., Ongpipattanakul, C. & Nair, S. K. Steric complementarity directs sequence promiscuous leader binding in RiPP biosynthesis. Proc. Natl. Acad. Sci. USA 116, 24049–24055 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Napiórkowska, M., Boilevin, J., Darbre, T., Reymond, J. L. & Locher, K. P. Structure of bacterial oligosaccharyltransferase PglB bound to a reactive LLO and an inhibitory peptide. Sci. Rep. 8, 16297 (2018).
Article ADS PubMed PubMed Central Google Scholar
Zhu, S. et al. The B1 protein guides the biosynthesis of a lasso peptide. Sci. Rep. 6, 35604 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Hunt, A. C. et al. Cell-free gene expression: methods and applications. Chem. Rev. 125, 91–149 (2025).
Article CAS PubMed Google Scholar
Silverman, A. D., Karim, A. S. & Jewett, M. C. Cell-free gene expression: an expanded repertoire of applications. Nat. Rev. Genet. 21, 151–170 (2020).
Article CAS PubMed Google Scholar
Carlson, E. D., Gan, R., Hodgman, C. E. & Jewett, M. C. Cell-free protein synthesis: applications come of age. Biotechnol. Adv. 30, 1185–1194 (2012).
Article CAS PubMed Google Scholar
Garenne, D. et al. Cell-free gene expression. Nat. Rev. Methods Prim. 1, 49 (2021).
Article CAS Google Scholar
Ekas, H. M. et al. Engineering a PbrR-based biosensor for cell-free detection of lead at the legal limit. ACS Synth. Biol. 13, 3003–3012 (2024).
Article CAS PubMed Google Scholar
Ekas, H. M. et al. An automated cell-free workflow for transcription factor engineering. ACS Synth. Biol. 13, 3389–3399 (2024).
Article CAS PubMed PubMed Central Google Scholar
Dudley, Q. M., Karim, A. S., Nash, C. J. & Jewett, M. C. In vitro prototyping of limonene biosynthesis using cell-free protein synthesis. Metab. Eng. 61, 251–260 (2020).
Article CAS PubMed Google Scholar
Karim, A. S. et al. In vitro prototyping and rapid optimization of biosynthetic enzymes for cell design. Nat. Chem. Biol. 16, 912–919 (2020).
Article CAS PubMed Google Scholar
Liew, F. E. et al. Carbon-negative production of acetone and isopropanol by gas fermentation at industrial pilot scale. Nat. Biotechnol. 40, 335–344 (2022).
Article CAS PubMed Google Scholar
Karim, A. S. & Jewett, M. C. A cell-free framework for rapid biosynthetic pathway prototyping and enzyme discovery. Metab. Eng. 36, 116–126 (2016).
Article CAS PubMed Google Scholar
Kightlinger, W. et al. A cell-free biosynthesis platform for modular construction of protein glycosylation pathways. Nat. Commun. 10, 5404 (2019).
Article ADS PubMed PubMed Central Google Scholar
Thames, A. H. et al. GlycoCAP: A cell-free, bacterial glycosylation platform for building clickable azido-sialoglycoproteins. ACS Synth. Biol. 12, 1264–1274 (2023).
Article CAS PubMed PubMed Central Google Scholar
Lin, L. et al. Sequential glycosylation of proteins with substrate-specific N-glycosyltransferases. ACS Cent. Sci. 6, 144–154 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kightlinger, W. et al. Design of glycosylation sites by rapid synthesis and analysis of glycosyltransferases. Nat. Chem. Biol. 14, 627–635 (2018).
Article CAS PubMed Google Scholar
Liu, W.-Q. et al. Cell-free biosynthesis and engineering of ribosomally synthesized lanthipeptides. Nat. Commun. 15, 4336 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Lin, L., Kightlinger, W., Warfel, K. F., Jewett, M. C. & Mrksich, M. Using high-throughput experiments to screen n-glycosyltransferases with altered specificities. ACS Synth. Biol. 13, 1290–1302 (2024).
Article CAS PubMed Google Scholar
Beaudet, L. et al. AlphaLISA immunoassays: the no-wash alternative to ELISAs for research and drug discovery. Nat. Methods 5, an8–an9 (2008).
Article CAS Google Scholar
Hunt, A. C. et al. Multivalent designed proteins neutralize SARS-CoV-2 variants of concern and confer protection against infection in mice. Sci. Transl. Med. 14, eabn1252 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hunt, A. C. et al. A rapid cell-free expression and screening platform for antibody discovery. Nat. Commun. 14, 3897 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
DeWinter, M. A. et al. Point-of-care peptide hormone production enabled by cell-free protein synthesis. ACS Synth. Biol. 12, 1216–1226 (2023).
Article CAS PubMed Google Scholar
Kloosterman, A. M., Shelton, K. E., van Wezel, G. P., Medema, M. H. & Mitchell, D. A. RRE-Finder: a genome-mining tool for class-independent RiPP discovery. mSystems 5, e00267–00220 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. J. et al. Activity of gut-derived nisin-like lantibiotics against human gut pathogens and commensals. ACS Chem. Biol. 19, 357–369 (2024).
Article CAS PubMed PubMed Central Google Scholar
Repka, L. M., Chekan, J. R., Nair, S. K. & van der Donk, W. A. Mechanistic understanding of lanthipeptide biosynthetic enzymes. Chem. Rev. 117, 5457–5520 (2017).
Article CAS PubMed PubMed Central Google Scholar
Schwalen, C. J., Hudson, G. A., Kille, B. & Mitchell, D. A. Bioinformatic expansion and discovery of thiopeptide antibiotics. J. Am. Chem. Soc. 140, 9494–9501 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rice, A. J. et al. Enzymatic pyridine aromatization during thiopeptide biosynthesis. J. Am. Chem. Soc. 144, 21116–21124 (2022).
Article CAS PubMed PubMed Central Google Scholar
Vinogradov, A. A. et al. De novo discovery of thiopeptide pseudo-natural products acting as potent and selective tnik kinase inhibitors. J. Am. Chem. Soc. 144, 20332–20341 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ayikpoe, R. S. et al. A scalable platform to discover antimicrobials of ribosomal origin. Nat. Commun. 13, 6135 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Hudson, G. A. & Mitchell, D. A. RiPP antibiotics: biosynthesis and engineering potential. Curr. Opin. Microbiol. 45, 61–69 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fu, Y., Jaarsma, A. H. & Kuipers, O. P. Antiviral activities and applications of ribosomally synthesized and post-translationally modified peptides (RiPPs). Cell. Mol. Life Sci. 78, 3921–3940 (2021).
Article CAS PubMed PubMed Central Google Scholar
Shin, J. M. et al. Biomedical applications of nisin. J. Appl Microbiol 120, 1449–1465 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chiumento, S. et al. Ruminococcin C, a promising antibiotic produced by a human gut symbiont. Sci. Adv. 5, eaaw9969 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Pelton, J. M. et al. Cheminformatics-guided cell-free exploration of peptide natural products. J. Am. Chem. Soc. 146, 8016–8030 (2024).
Article CAS PubMed PubMed Central Google Scholar
Rice, A. J. et al. Cell-free synthetic biology for natural product biosynthesis and discovery. Chem. Soc. Rev. 54, 4314−4352 (2025).
Arnison, P. G. et al. Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat. Prod. Rep. 30, 108–160 (2013).
Article CAS PubMed PubMed Central Google Scholar
Burkhart, B. J., Hudson, G. A., Dunbar, K. L. & Mitchell, D. A. A prevalent peptide-binding domain guides ribosomal natural product biosynthesis. Nat. Chem. Biol. 11, 564–570 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kretsch, A. M. et al. Peptidase activation by a leader peptide-bound RiPP recognition element. Biochemistry 62, 956–967 (2023).
Article CAS PubMed Google Scholar
Dunbar, K. L., Tietz, J. I., Cox, C. L., Burkhart, B. J. & Mitchell, D. A. Identification of an auxiliary leader peptide-binding protein required for azoline formation in ribosomal natural products. J. Am. Chem. Soc. 137, 7672–7677 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shelton, K. E. & Mitchell, D. A. Bioinformatic prediction and experimental validation of RiPP recognition elements. Methods Enzymol. 679, 191–233 (2023).
Article CAS PubMed Google Scholar
Hudson, G. A., Zhang, Z., Tietz, J. I., Mitchell, D. A. & van der Donk, W. A. In vitro biosynthesis of the core scaffold of the thiopeptide thiomuracin. J. Am. Chem. Soc. 137, 16012–16015 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. et al. Biosynthetic timing and substrate specificity for the thiopeptide thiomuracin. J. Am. Chem. Soc. 138, 15511–15514 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fleming, S. R. et al. Flexizyme-enabled benchtop biosynthesis of thiopeptides. J. Am. Chem. Soc. 141, 758–762 (2019).
Article CAS PubMed PubMed Central Google Scholar
Cogan, D. P. et al. Structural insights into enzymatic [4+2] aza-cycloaddition in thiopeptide antibiotic biosynthesis. Proc. Natl. Acad. Sci. USA 114, 12928–12933 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Blin, K. et al. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res. 47, W81–W87 (2019).
Article CAS PubMed PubMed Central Google Scholar
Montalbán-López, M. et al. New developments in RiPP discovery, enzymology and engineering. Nat. Prod. Rep. 38, 130–239 (2021).
Article PubMed Google Scholar
Hegemann, J. D., Zimmermann, M., Xie, X. & Marahiel, M. A. Lasso peptides: an intriguing class of bacterial natural products. Acc. Chem. Res. 48, 1909–1919 (2015).
Article CAS PubMed Google Scholar
Cheng, C. & Hua, Z.-C. Lasso peptides: heterologous production and potential medical application. Front. Bioeng. Biotechnol. https://doi.org/10.3389/fbioe.2020.571165 (2020).
Si, Y., Kretsch, A. M., Daigh, L. M., Burk, M. J. & Mitchell, D. A. Cell-free biosynthesis to evaluate lasso peptide formation and enzyme–substrate tolerance. J. Am. Chem. Soc. 143, 5917–5927 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tan, S., Moore, G. & Nodwell, J. Put a bow on it: knotted antibiotics take center stage. Antibiotics 8, 117 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kuznedelov, K. et al. The antibacterial threaded-lasso peptide capistruin inhibits bacterial RNA polymerase. J. Mol. Biol. 412, 842–848 (2011).
Article CAS PubMed PubMed Central Google Scholar
Metelev, M. et al. Acinetodin and klebsidin, RNA polymerase targeting lasso peptides produced by human isolates of Acinetobacter gyllenbergii and Klebsiella pneumoniae. ACS Chem. Biol. 12, 814–824 (2017).
Article CAS PubMed Google Scholar
Metelev, M. et al. Structure, bioactivity, and resistance mechanism of streptomonomicin, an unusual lasso peptide from an understudied halophilic actinomycete. Chem. Biol. 22, 241–250 (2015).
Article CAS PubMed PubMed Central Google Scholar
Cheung-Lee, W. L., Parry, M. E., Jaramillo Cartagena, A., Darst, S. A. & Link, A. J. Discovery and structure of the antimicrobial lasso peptide citrocin. J. Biol. Chem. 294, 6822–6830 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tan, S., Ludwig, K. C., Müller, A., Schneider, T. & Nodwell, J. R. The lasso peptide siamycin-I targets lipid ii at the gram-positive cell surface. ACS Chem. Biol. 14, 966–974 (2019).
Article CAS PubMed Google Scholar
Koos, J. D. & Link, A. J. Heterologous and in vitro reconstitution of fuscanodin, a lasso peptide from Thermobifida fusca. J. Am. Chem. Soc. 141, 928–935 (2019).
Article CAS PubMed Google Scholar
DiCaprio, A. J., Firouzbakht, A., Hudson, G. A. & Mitchell, D. A. Enzymatic reconstitution and biosynthetic investigation of the lasso peptide fusilassin. J. Am. Chem. Soc. 141, 290–297 (2019).
Article CAS PubMed Google Scholar
Sánchez-Hidalgo, M., Martín, J. & Genilloud, O. Identification and heterologous expression of the biosynthetic gene cluster encoding the lasso peptide humidimycin, a caspofungin activity potentiator. Antibiotics 9, 67 (2020).
Article PubMed PubMed Central Google Scholar
Cheung, W. L., Chen, M. Y., Maksimov, M. O. & Link, A. J. Lasso peptide biosynthetic protein larb1 binds both leader and core peptide regions of the precursor protein LarA. ACS Cent. Sci. 2, 702–709 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yan, K.-P. et al. Dissecting the maturation steps of the lasso peptide microcin J25 in vitro.ChemBioChem 13, 1046–1052 (2012).
Article CAS PubMed Google Scholar
King, A. M. et al. Systematic mining of the human microbiome identifies antimicrobial peptides with diverse activity spectra. Nat. Microbiol. 8, 2420–2434 (2023).
Article CAS PubMed Google Scholar
Tietz, J. I. et al. A new genome-mining tool redefines the lasso peptide biosynthetic landscape. Nat. Chem. Biol. 13, 470–478 (2017).
Article CAS PubMed PubMed Central Google Scholar
Goldblatt, D. Conjugate vaccines. Clin. Exp. Immunol. 119, 1–3 (2000).
Article CAS PubMed PubMed Central Google Scholar
Anderluh, M. et al. Recent advances on smart glycoconjugate vaccines in infections and cancer. FEBS J. 289, 4251–4303 (2022).
Article CAS PubMed Google Scholar
Micoli, F. et al. Glycoconjugate vaccines: current approaches towards faster vaccine design. Expert Rev. Vaccines 18, 881–895 (2019).
Article CAS PubMed Google Scholar
Hulbert, S. W., Desai, P., Jewett, M. C., DeLisa, M. P. & Williams, A. J. Glycovaccinology: The design and engineering of carbohydrate-based vaccine components. Biotechnol. Adv. 68, 108234 (2023).
Article CAS PubMed Google Scholar
Feldman, M. F. et al. Engineering N-linked protein glycosylation with diverse O antigen lipopolysaccharide structures in Escherichia coli. Proc. Natl. Acad. Sci. USA 102, 3016–3021 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Wacker, M. et al. Prevention of Staphylococcus aureus infections by glycoprotein vaccines synthesized in Escherichia coli. J. Infect. Dis. 209, 1551–1561 (2014).
Article CAS PubMed Google Scholar
Garcia-Quintanilla, F., Iwashkiw, J. A., Price, N. L., Stratilo, C. & Feldman, M. F. Production of a recombinant vaccine candidate against Burkholderia pseudomallei exploiting the bacterial N-glycosylation machinery. Front. Microbiol. 5, 381(2014).
Ihssen, J. et al. Increased efficiency of Campylobacter jejuni N-oligosaccharyltransferase PglB by structure-guided engineering. Open Biol. 5, 140227 (2015).
Article PubMed PubMed Central Google Scholar
Ravenscroft, N. et al. Characterization and immunogenicity of a Shigella flexneri 2a O-antigen bioconjugate vaccine candidate. Glycobiology 29, 669–680 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ravenscroft, N. et al. Purification and characterization of a Shigella conjugate vaccine, produced by glycoengineering Escherichia coli. Glycobiology 26, 51–62 (2015).
PubMed Google Scholar
Stark, J. C. et al. On-demand biomanufacturing of protective conjugate vaccines. Sci. Adv. 7, eabe9444 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Warfel, K. F. et al. A low-cost, thermostable, cell-free protein synthesis platform for on-demand production of conjugate vaccines. ACS Synth. Biol. 12, 95–107 (2023).
Article CAS PubMed Google Scholar
Williams A. J. et al. A low-cost recombinant glycoconjugate vaccine confers immunogenicity and protection against enterotoxigenic Escherichia coli infections in mice. Front. Mol. Biosci. 10, 1085887 (2023).
Jaroentomeechai, T. et al. Single-pot glycoprotein biosynthesis using a cell-free transcription-translation system enriched with glycosylation machinery. Nat. Commun. 9, 2686 (2018).
Article ADS PubMed PubMed Central Google Scholar
Terra, V. S. et al. Recent developments in bacterial protein glycan coupling technology and glycoconjugate vaccine design. J. Med. Microbiol. 61, 919–926 (2012).
Article CAS PubMed Google Scholar
Langdon, R. H., Cuccui, J. & Wren, B. W. N-linked Glycosylation in bacteria: an unexpected application. Future Microbiol. 4, 401–412 (2009).
Article CAS PubMed Google Scholar
Schoborg, J. A. et al. A cell-free platform for rapid synthesis and testing of active oligosaccharyltransferases. Biotechnol. Bioeng. 115, 739–750 (2018).
Article CAS PubMed Google Scholar
Kay, E. J., Yates, L. E., Terra, V. S., Cuccui, J. & Wren, B. W. Recombinant expression of Streptococcus pneumoniae capsular polysaccharides in Escherichia coli. Open Biol. 6, 150243 (2016).
Article PubMed PubMed Central Google Scholar
Kay, E. J. et al. Engineering a suite of E.coli strains for enhanced expression of bacterial polysaccharides and glycoconjugate vaccines. Micro. Cell Fact. 21, 66 (2022).
Article CAS Google Scholar
Reglinski, M. et al. A recombinant conjugated pneumococcal vaccine that protects against murine infections with a similar efficacy to Prevnar-13. NPJ Vaccines 3, 53 (2018).
Article PubMed PubMed Central Google Scholar
Ihssen, J. et al. Structural insights from random mutagenesis of Campylobacter jejunioligosaccharyltransferase PglB. BMC Biotechnol. 12, 67 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ihnken, L. A. et al. Mutated pglb Oligosaccharyltransferase Enzymes. USA Patent. https://patents.google.com/patent/WO2021028303A1/en (2020).
Fairman, J. et al. Non-clinical immunological comparison of a next-generation 24-valent pneumococcal conjugate vaccine (VAX-24) using site-specific carrier protein conjugation to the current standard of care (PCV13 and PPV23). Vaccine 39, 3197–3206 (2021).
Article CAS PubMed Google Scholar
Frasch, C. E. Preparation of bacterial polysaccharide–protein conjugates: analytical and manufacturing challenges. Vaccine 27, 6468–6470 (2009).
Article CAS PubMed Google Scholar
Kay, E., Cuccui, J. & Wren, B. W. Recent advances in the production of recombinant glycoconjugate vaccines. npj Vaccines 4, 16 (2019).
Article PubMed PubMed Central Google Scholar
Wassil, J. et al. A phase 2, randomized, blinded, dose-finding, controlled clinical trial to evaluate the safety, tolerability, and immunogenicity of a 24-valent pneumococcal conjugate vaccine (VAX-24) in healthy adults 65 years and older. Vaccine 42, 126124 (2024).
Article CAS PubMed Google Scholar
Stefanetti, G. et al. Sugar-protein connectivity impacts on the immunogenicity of site-selective salmonella o-antigen glycoconjugate vaccines. Angew. Chem. Int. Ed. Engl. 54, 13198–13203 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, M. et al. Shotgun scanning glycomutagenesis: a simple and efficient strategy for constructing and characterizing neoglycoproteins. Proc. Natl. Acad. Sci. USA 118, e2107440118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Techner, J. M. et al. High-throughput synthesis and analysis of intact glycoproteins using SAMDI-MS. Anal. Chem. 92, 1963–1971 (2020).
Article CAS PubMed Google Scholar
Jones, S. P. et al. Vaccine target and carrier molecule nontypeable Haemophilus influenzae protein D dimerizes like the close Escherichia coli GlpQ homolog but unlike other known homolog dimers. Proteins Struct. Funct. Bioinforma. 91, 161–170 (2023).
Article CAS Google Scholar
Burkhart, B. J., Kakkar, N., Hudson, G. A., van der Donk, W. A. & Mitchell, D. A. Chimeric leader peptides for the generation of non-natural hybrid RiPP products. ACS Cent. Sci. 3, 629–638 (2017).
Article CAS PubMed PubMed Central Google Scholar
Garcie, C. et al. The bacterial stress-responsive Hsp90 chaperone (HtpG) is required for the production of the genotoxin colibactin and the siderophore yersiniabactin in Escherichia coli. J. Infect. Dis. 214, 916–924 (2016).
Article CAS PubMed Google Scholar
Washio, K., Lim, S. P., Roongsawang, N. & Morikawa, M. Identification and characterization of the genes responsible for the production of the cyclic lipopeptide arthrofactin by Pseudomonas sp. MIS38. Biosci. Biotechnol. Biochem. 74, 992–999 (2010).
Article CAS PubMed Google Scholar
Xu, H., Liu, W. Q. & Li, J. A streptomyces-based cell-free protein synthesis system for high-level protein expression. Methods Mol. Biol. 2433, 89–103 (2022).
Article CAS PubMed Google Scholar
Des Soye, B. J., Davidson, S. R., Weinstock, M. T., Gibson, D. G. & Jewett, M. C. Establishing a high-yielding cell-free protein synthesis platform derived from Vibrio natriegens. ACS Synth. Biol. 7, 2245–2255 (2018).
Article CAS PubMed Google Scholar
Li, J., Wang, H. & Jewett, M. C. Expanding the palette of Streptomyces-based cell-free protein synthesis systems with enhanced yields. Biochem. Eng. J. 130, 29–33 (2018).
Article CAS Google Scholar
Moore, S. J. et al. A Streptomyces venezuelae cell-free toolkit for synthetic biology. ACS Synth. Biol. 10, 402–411 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vinogradov, A. A., Chang, J. S., Onaka, H., Goto, Y. & Suga, H. Accurate models of substrate preferences of post-translational modification enzymes from a combination of mrna display and deep learning. ACS Cent. Sci. 8, 814–824 (2022).
Article CAS PubMed PubMed Central Google Scholar
Chang, J. S., Vinogradov, A. A., Zhang, Y., Goto, Y. & Suga, H. Deep learning-driven library design for the de novo discovery of bioactive thiopeptides. ACS Cent. Sci. 9, 2150–2160 (2023).
Article CAS PubMed PubMed Central Google Scholar
Mi, X., Barrett, S. E., Mitchell, D. A. & Shukla, D. LassoESM: A tailored language model for enhanced lasso peptide property prediction. bioRxiv https://doi.org/10.1101/2024.10.25.620295 (2024).
Kobayashi, M. et al. Use of 21-Valent pneumococcal conjugate vaccine among u.S. Adults: recommendations of the advisory committee on immunization practices - United States, 2024. Morb. Mortal. Wkly. Rep. 73, 793–798 (2024).
Article Google Scholar
Kouba, P. et al. Machine learning-guided protein engineering. ACS Catal. 13, 13863–13895 (2023).
Article CAS PubMed PubMed Central Google Scholar
Notin, P., Rollins, N., Gal, Y., Sander, C. & Marks, D. Machine learning for functional protein design. Nat. Biotechnol. 42, 216–228 (2024).
Article CAS PubMed Google Scholar
Landwehr, G. M. et al. Accelerated enzyme engineering by machine-learning guided cell-free expression. Nat. Commun. 16, 865 (2025).
Article CAS PubMed PubMed Central Google Scholar
Ramírez, A. S. et al. Molecular basis for glycan recognition and reaction priming of eukaryotic oligosaccharyltransferase. Nat. Commun. 13, 7296 (2022).
Article ADS PubMed PubMed Central Google Scholar
Zallot, R., Oberg, N. & Gerlt, J. A. The EFI Web resource for genomic enzymology tools: leveraging protein, genome, and metagenome databases to discover novel enzymes and metabolic pathways. Biochemistry 58, 4169–4182 (2019).
Article CAS PubMed Google Scholar
Oberg, N., Zallot, R. & Gerlt, J. A. EFI-EST, EFI-GNT, and EFI-CGFP: Enzyme function initiative (EFI) web resource for genomic enzymology tools. J. Mol. Biol. 435, 168018 (2023).
Article CAS PubMed PubMed Central Google Scholar
Hershewe, J. M. et al. Improving cell-free glycoprotein synthesis by characterizing and enriching native membrane vesicles. Nat. Commun. 12, 2363 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Kwon, Y. C. & Jewett, M. C. High-throughput preparation methods of crude extract for robust cell-free protein synthesis. Sci. Rep. 5, 8663 (2015).
Article CAS PubMed PubMed Central Google Scholar
Stark, J. C. et al. Rapid biosynthesis of glycoprotein therapeutics and vaccines from freeze-dried bacterial cell lysates. Nat. Protoc. 18, 2374–2398 (2023).
Article CAS PubMed Google Scholar
Jewett, M. C. & Swartz, J. R. Mimicking the Escherichia coli cytoplasmic environment activates long-lived and efficient cell-free protein synthesis. Biotechnol. Bioeng. 86, 19–26 (2004).
Article CAS PubMed Google Scholar
Jewett, M. C., Calhoun, K. A., Voloshin, A., Wuu, J. J. & Swartz, J. R. An integrated cell-free metabolic platform for protein production and synthetic biology. Mol. Syst. Biol. 4, 220 (2008).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to thank Rui Gan, Jonathan Bogart, and Thuy Aziz for helpful discussions. This work was supported by the National Institutes of Health (NIH) 1U19AI142780-01 (R.N., E.P.B., and M.C.J.), DTRA (HDTRA1−20-1-0004) (M.C.J.), the National Science Foundation (CBET - 1936789) (M.C.J.), and DARPA (W911NF-23-2-0039) (M.P.D., A.S.K., and M.C.J.). D.A.W. acknowledges support from the National Science Foundation Graduate Research Fellowship under grant number DGE-1842165. Z.M.S. acknowledges support from the National Science Foundation National Research Traineeship under grant number 2021900. M.D. acknowledges support from the Canadian Institutes of Health Research Postdoctoral Fellowship under grant no. MFE-176575. This work made use of the IMSERC MS facility at Northwestern University, which has received support from the Soft and Hybrid Nanotechnology Experimental (SHyNE) Resource (NSF ECCS-2025633), the State of Illinois, and the International Institute for Nanotechnology (IIN).

Author information

These authors contributed equally: Derek A. Wong, Zachary M. Shaver.

Authors and Affiliations

Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA
Derek A. Wong, Maria D. Cabezas, Katherine F. Warfel, Deepali V. Prasanna, Sarah E. Sobol, Regina Fernandez, Ashty S. Karim & Michael C. Jewett
Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, 60208, USA
Derek A. Wong, Zachary M. Shaver, Maria D. Cabezas, Katherine F. Warfel, Deepali V. Prasanna, Sarah E. Sobol, Regina Fernandez, Ashty S. Karim & Michael C. Jewett
Center for Synthetic Biology, Northwestern University, Evanston, IL, 60208, USA
Derek A. Wong, Zachary M. Shaver, Maria D. Cabezas, Katherine F. Warfel, Deepali V. Prasanna, Sarah E. Sobol, Regina Fernandez, Ashty S. Karim & Michael C. Jewett
Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, 60208, USA
Zachary M. Shaver
Medical Scientist Training Program, Northwestern University, Evanston, IL, 60208, USA
Zachary M. Shaver
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Martin Daniel-Ivad, Robert Nicol & Emily P. Balskus
Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, 02138, USA
Martin Daniel-Ivad & Emily P. Balskus
Department of Chemistry, Northwestern University, Evanston, IL, 60208, USA
Fernando Tobias
Integrated Molecular Structure Education and Research Center (IMSERC), Northwestern University, Evanston, IL, 60208, USA
Fernando Tobias
Proteomics Center of Excellence, Northwestern University, Chicago, IL, 60611, USA
Szymon K. Filip & Peter Faull
Biochemistry, Molecular and Cell Biology (BMCB) Program, Cornell University, Ithaca, NY, 14853, USA
Sophia W. Hulbert & Matthew P. DeLisa
Robert Frederick Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, 14853, USA
Matthew P. DeLisa
Cornell Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
Matthew P. DeLisa
Howard Hughes Medical Institute, Harvard University, Cambridge, MA, 02138, USA
Emily P. Balskus
Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
Michael C. Jewett

Authors

Derek A. Wong
View author publications
Search author on:PubMed Google Scholar
Zachary M. Shaver
View author publications
Search author on:PubMed Google Scholar
Maria D. Cabezas
View author publications
Search author on:PubMed Google Scholar
Martin Daniel-Ivad
View author publications
Search author on:PubMed Google Scholar
Katherine F. Warfel
View author publications
Search author on:PubMed Google Scholar
Deepali V. Prasanna
View author publications
Search author on:PubMed Google Scholar
Sarah E. Sobol
View author publications
Search author on:PubMed Google Scholar
Regina Fernandez
View author publications
Search author on:PubMed Google Scholar
Fernando Tobias
View author publications
Search author on:PubMed Google Scholar
Szymon K. Filip
View author publications
Search author on:PubMed Google Scholar
Sophia W. Hulbert
View author publications
Search author on:PubMed Google Scholar
Peter Faull
View author publications
Search author on:PubMed Google Scholar
Robert Nicol
View author publications
Search author on:PubMed Google Scholar
Matthew P. DeLisa
View author publications
Search author on:PubMed Google Scholar
Emily P. Balskus
View author publications
Search author on:PubMed Google Scholar
Ashty S. Karim
View author publications
Search author on:PubMed Google Scholar
Michael C. Jewett
View author publications
Search author on:PubMed Google Scholar

Contributions

D.A.W. designed research, performed experiments, performed MALDI-MS on reactions, analyzed data, and wrote the paper. Z.M.S. designed research, performed experiments, analyzed data, and wrote the paper. M.D.C. designed research, performed experiments, performed MALDI-MS on reactions, analyzed data, and edited the paper. M.D. computationally identified all lasso peptide BGCs and wrote the paper. K.F.W. performed experiments. D.V.P. performed experiments. S.E.S. performed experiments. R.F. performed experiments. F.T. optimized liquid chromatography and mass spectrometry parameters. S.K.F. analyzed proteomics data. S.W.H. designed research. P.F. supervised research. R.N. supervised research and edited the paper. M.P.D. supervised research and edited the paper. E.P.B. supervised research and edited the paper. A.S.K. supervised research, analyzed data, and edited the paper. M.C.J. designed and directed research, analyzed data, and wrote the paper.

Corresponding authors

Correspondence to Emily P. Balskus, Ashty S. Karim or Michael C. Jewett.

Ethics declarations

Competing interests

M.C.J. and M.P.D. have a financial interest in National Resilience and Gauntlet Bio. M.C.J. also has a financial interest in Stemloop Inc. and Synolo Therapeutics. M.C.J.’s interests are reviewed and managed by Northwestern University and Stanford University in accordance with their competing interest policies. M.P.D.s interests are reviewed and managed by Cornell University. All other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jesko Koehnke, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Information

Supplementary Data 1

Reporting Summary

Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Wong, D.A., Shaver, Z.M., Cabezas, M.D. et al. Characterizing and engineering post-translational modifications with high-throughput cell-free expression. Nat Commun 16, 7215 (2025). https://doi.org/10.1038/s41467-025-60526-6

Download citation

Received: 15 March 2024
Accepted: 28 May 2025
Published: 05 August 2025
Version of record: 05 August 2025
DOI: https://doi.org/10.1038/s41467-025-60526-6