Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Communications
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. nature communications
  3. articles
  4. article
Identification and engineering of highly functional potyviral proteases in cells using co-evolutionary models
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 26 February 2026

Identification and engineering of highly functional potyviral proteases in cells using co-evolutionary models

  • Medel B. Lim Suan Jr1 na1,
  • Cheyenne Ziegler  ORCID: orcid.org/0000-0002-6379-31622 na1,
  • Zain Syed  ORCID: orcid.org/0009-0006-4450-33062,
  • Arjun Sai Yedavalli2,
  • Jaimahesh Nagineni  ORCID: orcid.org/0009-0003-2518-16372,
  • Rodrigo Raposo2,
  • Ajay Tunikipati2,
  • Jaideep Kaur1,
  • Faruck Morcos  ORCID: orcid.org/0000-0001-6208-15611,2,3,4 &
  • …
  • P. C. Dave P. Dingal  ORCID: orcid.org/0000-0002-1052-25371,2,3 

Nature Communications , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Coevolution
  • Machine learning
  • Protein engineering
  • Protein function predictions
  • Synthetic biology

Abstract

Efficiency and substrate specificity of proteases in the Potyviridae family have not been comprehensively profiled. Here we develop a model that learns co-evolutionary features to accurately predict and experimentally validate protease performance at single amino-acid resolution. We identify and engineer several proteases that perform better than the commercially available tobacco etch virus protease. To demonstrate the resolving power of our methods, we engineer protease crosstalk to selectively trigger a synthetic cell-death program in human cells.

Data availability

The data generated in this study have been deposited in the Zenodo database under accession code https://doi.org/10.5281/zenodo.15039890. The overview of aligned Potyviridae sequences and the list of plasmids (Supplementary Data 2) used in this study are provided as Supplementary Data. The plasmid sequences and maps for all proteases, the 7-, GS-flanked 7-, and 20-amino acid substrate of TEVp, and the H2B-sfGFP reporter used here are available on Addgene: https://www.addgene.org/Dave_Dingal/. Source data are provided with this paper. The protein structural data used in this study are available in the PDB database under accession code 1LVM. The protein family profile HMM used in this study are available in the InterPro database under accession code PF00863. Source data are provided with this paper.

Code availability

To facilitate the testing of thousands of potyviral proteases against peptide targets by multiple laboratories, we created an interactive web application for ProSSpeC (https://coevolutionary.org/prosspec/). Code is available at https://github.com/morcoslab/ProSSpeC and archived on Zenodo with https://doi.org/10.5281/zenodo.1832102535.

References

  1. Neurath, H. & Walsh, K. A. Role of proteolytic enzymes in biological regulation (a review). Proc. Natl. Acad. Sci. USA. 73, 3825–3832 (1976).

    Google Scholar 

  2. Kapust, R. B. & Waugh, D. S. Controlled intracellular processing of fusion proteins by TEV protease. Protein Expr. Purif. 19, 312–318 (2000).

    Google Scholar 

  3. Rawlings, N. D. & Salvesen, G. Handbook of Proteolytic Enzymes, 1–3 (Academic Press, 2013).

  4. Xie, M. & Fussenegger, M. Designing cell function: assembly of synthetic gene circuits for cell biology applications. Nat. Rev. Mol. Cell Biol. 19, 507–525 (2018).

    Google Scholar 

  5. Fernandez-Rodriguez, J. & Voigt, C. A. Post-translational control of genetic circuits using potyvirus proteases. Nucleic Acids Res. 44, 6493–6502 (2016).

    Google Scholar 

  6. Fink, T. et al. Design of fast proteolysis-based signaling and logic circuits in mammalian cells. Nat. Chem. Biol. 15, 115–122 (2018).

    Google Scholar 

  7. Sanchez, M. I. & Ting, A. Y. Directed evolution improves the catalytic efficiency of TEV protease. Nat. Methods 17, 167–174 (2019).

    Google Scholar 

  8. Carrington, J. C. & Dougherty, W. G. Small nuclear inclusion protein encoded by a plant potyvirus genome is a protease. J. Virol. 61, 2540–2548 (1987).

    Google Scholar 

  9. Morcos, F. et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proc. Natl. Acad. Sci. USA. 108, E1293–E1301 (2011).

    Google Scholar 

  10. Dos Santos, R. N., Morcos, F., Jana, B., Andricopulo, A. D. & Onuchic, J. N. Dimeric interactions and complex formation using direct coevolutionary couplings. Sci. Rep. 5, 1–10 (2015).

    Google Scholar 

  11. Morcos, F., Jana, B., Hwa, T. & Onuchic, J. N. Coevolutionary signals across protein lineages help capture multiple protein conformations. Proc. Natl. Acad. Sci. USA. 110, 20533–20538 (2013).

    Google Scholar 

  12. Cheng, R. R., Morcos, F., Levine, H. & Onuchic, J. N. Toward rationally redesigning bacterial two-component signaling systems using coevolutionary information. Proc. Natl. Acad. Sci. USA. 111, E563–E571 (2014).

  13. Jiang, X. L., Dimas, R. P., Chan, C. T. Y. & Morcos, F. Coevolutionary methods enable robust design of modular repressors by reestablishing intra-protein interactions. Nat. Commun. 12, 5592 (2021).

  14. Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).

    Google Scholar 

  15. Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).

    Google Scholar 

  16. Savinov, A., Swanson, S., Keating, A. E. & Li, G.-W. High-throughput discovery of inhibitory protein fragments with AlphaFold. Proc. Natl. Acad. Sci. USA. 122, e2322412122 (2025).

    Google Scholar 

  17. Zhou, Q. et al. Global pairwise RNA interaction landscapes reveal core features of protein recognition. Nat. Commun. 9, 2511 (2018).

  18. Dimas, R. P., Jiang, X. L., De La Paz, J. A., Morcos, F. & Chan, C. T. Y. Engineering repressors with coevolutionary cues facilitates toggle switches with a master reset. Nucleic Acids Res. 47, 5449–5463 (2019).

  19. Kipniss, N. H. et al. Engineering cell sensing and responses using a GPCR-coupled CRISPR-Cas system. Nat. Commun. 2017 8, 1–10 (2017).

    Google Scholar 

  20. Kapust, R. B. et al. Tobacco etch virus protease: mechanism of autolysis and rational design of stable mutants with wild-type catalytic proficiency. Protein Eng. 14, 993–1000 (2001).

    Google Scholar 

  21. Xia, S. et al. Synthetic protein circuits for programmable control of mammalian cell death. Cell 187, 2785–2800.e16 (2024).

    Google Scholar 

  22. Gao, X. J., Chong, L. S., Kim, M. S. & Elowitz, M. B. Programmable protein circuits in living cells. Science 361, 1252–1258 (2018).

    Google Scholar 

  23. Goh, C. J. & Hahn, Y. Analysis of proteolytic processing sites in potyvirus polyproteins revealed differential amino acid preferences of NIa-pro protease in each of seven cleavage sites. PLoS ONE 16, e0245853 (2021).

  24. Kapust, R. B., Toözseór, J., Copeland, T. D. & Waugh, D. S. The P1′ specificity of tobacco etch virus protease. Biochem. Biophys. Res. Commun. 294, 949–955 (2002).

  25. Huber, L. et al. Data-driven protease engineering by DNA-recording and epistasis-aware machine learning. Nat. Commun. 16, 5466 (2025).

  26. Beaumont, L. P., Mehalko, J., Johnson, A., Wall, V. E. & Esposito, D. Unexpected tobacco etch virus (TEV) protease cleavage of recombinant human proteins. Protein Expr. Purif. 220, 106488 (2024).

  27. Song, J. et al. PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites. PLoS ONE 7, e50300 (2012).

  28. Song, J. et al. IProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites. Brief. Bioinform. 20, 638–658 (2019).

  29. Song, J. et al. PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy. Bioinformatics 34, 684–687 (2018).

  30. Gasteiger, E. et al. ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 31, 3784–3788 (2003).

  31. Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).

    Google Scholar 

  32. Koutra, D., Shah, N., Vogelstein, J. T., Gallagher, B. & Faloutsos, C. DELTACON: principled massive-graph similarity function with attribution. ACM Trans. Knowl. Discov. Data 10, 1–43 (2016).

  33. Kumar, S. et al. MEGA12: molecular evolutionary genetic analysis version 12 for adaptive and green computing. Mol. Biol. Evol. 41, msae263 (2024).

  34. Feng, S. et al. Bright split red fluorescent proteins for the visualization of endogenous proteins and synapses. Commun. Biol. 2, 1–12 (2019).

    Google Scholar 

  35. Lim Suan, M. B. et al. Identification and engineering of highly functional potyviral proteases in cells using co-evolutionary models. ProSSpeC. https://doi.org/10.5281/zenodo.18321025 (2025).

Download references

Acknowledgements

We thank members of the Dingal lab and Morcos lab for their advice, expertise, and discussions. We thank Elliott Joe, Ahmed Adookkattil, and Shashwat Singh for data analysis support. We also thank the UTD Flow Cytometry Core for infrastructure and support. We acknowledge the UTD Office of Information Technology Cyberinfrastructure Research Computing for providing high-performance computing and services. M.B.L. is supported by the UTD Eugene McDermott Graduate Fellowship. This research was supported by a UTD Startup Fund and National Institutes of Health-NIGMS awards to the labs of P.C.D.P.D. (R35GM150967) and of F.M. (R35GM133631). F.M. acknowledges support from the National Science Foundation (MCB-1943442).

Author information

Author notes
  1. These authors contributed equally: Medel B. Lim Suan Jr, Cheyenne Ziegler.

Authors and Affiliations

  1. Department of Bioengineering, The University of Texas at Dallas, Richardson, TX, USA

    Medel B. Lim Suan Jr, Jaideep Kaur, Faruck Morcos & P. C. Dave P. Dingal

  2. Department of Biological Sciences, The University of Texas at Dallas, Richardson, TX, USA

    Cheyenne Ziegler, Zain Syed, Arjun Sai Yedavalli, Jaimahesh Nagineni, Rodrigo Raposo, Ajay Tunikipati, Faruck Morcos & P. C. Dave P. Dingal

  3. Center for Systems Biology, The University of Texas at Dallas, Richardson, TX, USA

    Faruck Morcos & P. C. Dave P. Dingal

  4. Department of Physics, The University of Texas at Dallas, Richardson, TX, USA

    Faruck Morcos

Authors
  1. Medel B. Lim Suan Jr
    View author publications

    Search author on:PubMed Google Scholar

  2. Cheyenne Ziegler
    View author publications

    Search author on:PubMed Google Scholar

  3. Zain Syed
    View author publications

    Search author on:PubMed Google Scholar

  4. Arjun Sai Yedavalli
    View author publications

    Search author on:PubMed Google Scholar

  5. Jaimahesh Nagineni
    View author publications

    Search author on:PubMed Google Scholar

  6. Rodrigo Raposo
    View author publications

    Search author on:PubMed Google Scholar

  7. Ajay Tunikipati
    View author publications

    Search author on:PubMed Google Scholar

  8. Jaideep Kaur
    View author publications

    Search author on:PubMed Google Scholar

  9. Faruck Morcos
    View author publications

    Search author on:PubMed Google Scholar

  10. P. C. Dave P. Dingal
    View author publications

    Search author on:PubMed Google Scholar

Contributions

M.B.L., Z.S., A.S.Y., A.T., R.R., J.N., and J.K. performed experiments. M.B.L. and P.C.D.P.D. analyzed experimental results. Computational modeling and full stack development of the ProSSpeC web app performed by C.Z. Conceptual planning and resources provided by P.C.D.P.D. and F.M. M.B.L., C.Z., F.M., and P.C.D.P.D. authored and edited this manuscript, including illustrations. The final version of this manuscript is approved by all authors.

Corresponding authors

Correspondence to Faruck Morcos or P. C. Dave P. Dingal.

Ethics declarations

Competing interests

The Board of Regents of The University of Texas System have filed a pending patent application on behalf of co-inventors P.C.D.P.D., F.M., C.Z., and M.B.L. of the engineered proteases described (US Provisional Application No. 63/885,099). The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Zhongyue Yang and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lim Suan, M.B., Ziegler, C., Syed, Z. et al. Identification and engineering of highly functional potyviral proteases in cells using co-evolutionary models. Nat Commun (2026). https://doi.org/10.1038/s41467-026-69961-5

Download citation

  • Received: 16 May 2025

  • Accepted: 13 February 2026

  • Published: 26 February 2026

  • DOI: https://doi.org/10.1038/s41467-026-69961-5

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Videos
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Editors
  • Journal Information
  • Open Access Fees and Funding
  • Calls for Papers
  • Editorial Values Statement
  • Journal Metrics
  • Editors' Highlights
  • Contact
  • Editorial policies
  • Top Articles

Publish with us

  • For authors
  • For Reviewers
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Communications (Nat Commun)

ISSN 2041-1723 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing