Abstract
Specific protein–protein interactions are crucial in signaling networks and for the assembly of multi-protein complexes, and represent a challenging goal for protein design. Optimizing interaction specificity requires both positive design, the stabilization of a desired interaction, and negative design, the destabilization of undesired interactions. Currently, no automated protein-design algorithms use explicit negative design to guide a sequence search. We describe a multi-state framework for engineering specificity that selects sequences maximizing the transfer free energy of a protein from a target conformation to a set of undesired competitor conformations. To test the multi-state framework, we engineered coiled-coil interfaces that direct the formation of either homodimers or heterodimers. The algorithm identified three specificity motifs that have not been observed in naturally occurring coiled coils. In all cases, experimental results confirm the predicted specificities.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Accession codes
References
Harbury, P.B., Plecs, J.J., Tidor, B., Alber, T. & Kim, P.S. High-resolution protein design with backbone freedom. Science 282, 1462–1467 (1998).
Nauli, S., Kuhlman, B. & Baker, D. Computer-based redesign of a protein folding pathway. Nat. Struct. Biol. 8, 602–605 (2001).
Dahiyat, B.I. & Mayo, S.L. De novo protein design: fully automated sequence selection. Science 278, 82–87 (1997).
Pabo, C. Molecular technology. Designing proteins and peptides. Nature 301, 200 (1983).
Ponder, J.W. & Richards, F.M. Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. J. Mol. Biol. 193, 775–791 (1987).
Kuhlman, B. & Baker, D. Native protein sequences are close to optimal for their structures. Proc. Natl. Acad. Sci. USA 97, 10383–10388 (2000).
Gordon, D.B., Marshall, S.A. & Mayo, S.L. Energy functions for protein design. Curr. Opin. Struct. Biol. 9, 509–513 (1999).
Hellinga, H.W. & Richards, F.M. Optimal sequence selection in proteins of known structure by simulated evolution. Proc. Natl. Acad. Sci. USA 91, 5803–5807 (1994).
Desjarlais, J.R. & Handel, T.M. De novo design of the hydrophobic cores of proteins. Protein Sci. 4, 2006–2018 (1995).
Lazar, G.A., Desjarlais, J.R. & Handel, T.M. De novo design of the hydrophobic core of ubiquitin. Protein Sci. 6, 1167–1178 (1997).
Malakauskas, S.M. & Mayo, S.L. Design, structure and stability of a hyperthermophilic protein variant. Nat. Struct. Biol. 5, 470–475 (1998).
Marshall, S.A. & Mayo, S.L. Achieving stability and conformational specificity in designed proteins via binary patterning. J. Mol. Biol. 305, 619–631 (2001).
Marvin, J.S. & Hellinga, H.W. Conversion of a maltose receptor into a zinc biosensor by computational design. Proc. Natl. Acad. Sci. USA 98, 4955–4960 (2001).
Wilson, C., Mace, J.E. & Agard, D.A. Computational method for the design of enzymes with altered substrate specificity. J. Mol. Biol. 220, 495–506 (1991).
Shimaoka, M. et al. Computational design of an integrin I domain stabilized in the open high affinity conformation. Nat. Struct. Biol. 7, 674–678 (2000).
O'Shea, E.K., Klemm, J.D., Kim, P.S. & Alber, T. X-ray structure of the GCN4 leucine zipper, a two-stranded, parallel coiled coil. Science 254, 539–544 (1991).
Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
O'Shea, E.K., Rutkowski, R., Stafford, W.F. III & Kim, P.S. Preferential heterodimer formation by isolated leucine zippers from Fos and Jun. Science 245, 646–648 (1989).
O'Shea, E.K., Lumb, K.J. & Kim, P.S. Peptide Velcro: design of a heterodimeric coiled coil. Curr. Biol. 3, 658–667 (1993).
Jorgensen, W.L. & Tiradorives, J. The OPLS potential functions for proteins: energy minimizations for crystals of cyclic-peptides and crambin. J. Am. Chem. Soc. 110, 1666–1671 (1988).
Brooks, B.R. et al. CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J. Comput. Chem. 4, 187–217 (1983).
Koehl, P. & Levitt, M. De novo protein design. II. Plasticity in sequence space. J. Mol. Biol. 293, 1183–1193 (1999).
Janin, J. Quantifying biological specificity: the statistical mechanics of molecular recognition. Proteins 25, 438–445 (1996).
Ghirlanda, G., Lear, J.D., Lombardi, A. & DeGrado, W.F. From synthetic coiled coils to functional proteins: automated design of a receptor for the calmodulin-binding domain of calcineurin. J. Mol. Biol. 281, 379–391 (1998).
Hellinga, H.W. Rational protein design: combining theory and experiment. Proc. Natl. Acad. Sci. USA 94, 10015–10017 (1997).
Hellinga, H.W. Construction of a blue copper analogue through iterative rational protein design cycles demonstrates principles of molecular recognition in metal center formation. J. Am. Chem. Soc. 120, 10055–10066 (1998).
Yue, K. et al. A test of lattice protein-folding algorithms. Proc. Natl. Acad. Sci. USA 92, 325–329 (1995).
Deutsch, J.M. & Kurosky, T. New algorithm for protein design. Phys. Rev. Lett. 76, 323–326 (1996).
Irback, A., Peterson, C., Potthast, F. & Sandelin, E. Monte Carlo procedure for protein design. Phys. Rev. E 58, R5249–R5252 (1998).
Dima, R.I., Banavar, J.R., Cieplak, M. & Maritan, A. Statistical mechanics of protein-like heteropolymers. Proc. Natl. Acad. Sci. USA 96, 4904–4907 (1999).
Banavar, J.R. et al. Structure-based design of model proteins. Proteins 31, 10–20 (1998).
Rossi, A., Maritan, A. & Micheletti, C. A novel iterative strategy for protein design. J. Chem. Phys. 112, 2050–2055 (2000).
Koehl, P. & Levitt, M. De novo protein design. I. In search of stability and specificity. J. Mol. Biol. 293, 1161–1181 (1999).
Jones, S. & Thornton, J.M. Principles of protein-protein interactions. Proc. Natl. Acad. Sci. USA 93, 13–20 (1996).
Buckle, A.M., Schreiber, G. & Fersht, A.R. Protein-protein recognition: crystal structural analysis of a barnase-barstar complex at 2.0-Å resolution. Biochemistry 33, 8878–8889 (1994).
Shoichet, B.K., Baase, W.A., Kuroki, R. & Matthews, B.W. A relationship between protein stability and protein function. Proc. Natl. Acad. Sci. USA 92, 452–456 (1995).
Derewenda, U. et al. An unusual buried polar cluster in a family of fungal lipases. Nat. Struct. Biol. 1, 36–47 (1994).
Warshel, A. & Aqvist, J. Electrostatic energy and macromolecular function. Annu. Rev. Biophys. Biophys. Chem. 20, 267–298 (1991).
Lumb, K.J. & Kim, P.S. A buried polar interaction imparts structural uniqueness in a designed heterodimeric coiled coil. Biochemistry 34, 8642–8648 (1995).
Bolon, D.N. & Mayo, S.L. Polar residues in the protein core of Escherichia coli thioredoxin are important for fold specificity. Biochemistry 40, 10047–10053 (2001).
Desjarlais, J.R. & Handel, T.M. Side-chain and backbone flexibility in protein core design. J. Mol. Biol. 290, 305–318 (1999).
Keating, A.E., Malashkevich, V.N., Tidor, B. & Kim, P.S. Side-chain repacking calculations for predicting structures and stabilities of heterodimeric coiled coils. Proc. Natl. Acad. Sci. USA 98, 14825–14830 (2001).
Jencks, W.P. Catalysis in Chemistry and Enzymology (Dover, New York; 1987).
Harbury, P.B., Tidor, B. & Kim, P.S. Repacking protein cores with backbone freedom: structure prediction for coiled coils. Proc. Natl. Acad. Sci. USA 92, 8408–8412 (1995).
Dunbrack, R.L. & Karplus, M. Backbone-dependent rotamer library for proteins. J. Mol. Biol. 230, 543–574 (1993).
Koehl, P. & Delarue, M. Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. J. Mol. Biol. 239, 249–275 (1994).
Mendes, J., Soares, C.M. & Carrondo, M.A. Improvement of side-chain modeling in proteins with the self-consistent mean field theory method based on an analysis of the factors influencing prediction. Biopolymers 50, 111–131 (1999).
Sitkoff, D., Sharp, K.A. & Honig, B. Accurate calculation of hydration free energies using macroscopic solvent models. J. Phys. Chem. 98, 1978–1988 (1994).
Havranek, J.J. & Harbury, P.B. Tanford-Kirkwood electrostatics for protein modeling. Proc. Natl. Acad. Sci. USA 96, 11145–11150 (1999).
Street, A.G. & Mayo, S.L. Pairwise calculation of protein solvent-accessible surface areas. Fold. Des. 3, 253–258 (1998).
Dahiyat, B.I. & Mayo, S.L. Probing the role of packing specificity in protein design. Proc. Natl. Acad. Sci. USA 94, 10172–10177 (1997).
Fauchère, J.-L. & Pliska, V. Hydrophobic parameters-π of amino-acid side-chains from the partitioning of N-acetyl-amino-acid amides. Eur. J. Med. Chem. 18, 369–375 (1983).
Wimley, W.C., Creamer, T.P. & White, S.H. Solvation energies of amino acid side chains and backbone in a family of host-guest pentapeptides. Biochemistry 35, 5109–5124 (1996).
Lacroix, E., Viguera, A.R. & Serrano, L. Elucidating the folding problem of α-helices: local motifs; long-range electrostatics; ionic-strength dependence and prediction of NMR parameters. J. Mol. Biol. 284, 173–191 (1998).
Mitchell, M. An Introduction to Genetic Algorithms (MIT Press, Cambridge, Massachusetts; 1996).
Kleid, D.G. et al. Cloned viral protein vaccine for foot-and-mouth disease: responses in cattle and swine. Science 214, 1125–1129 (1991).
Kunkel, T.A., Bebenek, K. & McClary, J. Efficient site-directed mutagenesis using uracil-containing DNA. Methods Enzymol. 204, 125–139 (1991).
Edelhoch, H. Spectroscopic determination of tryptophan and tyrosine in proteins. Biochemistry 6, 1948–1954 (1967).
Schagger, H. & von Jagow, G. Tricine-sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the separation of proteins in the range from 1 to 100 kDa. Anal. Biochem. 166, 368–379 (1987).
Lu, M. et al. Helix capping in the GCN4 leucine zipper. J. Mol. Biol. 288, 743–752 (1999).
Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. Graph. 14, 33–38 (1996).
Acknowledgements
We thank F.E. Boas, J.A. Silverman, D.R. Halpin, S.J. Wrenn and R.L. Baldwin for stimulating conversations and criticism during the course of this work and for comments on the manuscript. We also acknowledge helpful suggestions from the anonymous referees. This research was funded by a Searle scholar grant to P.B.H. from the Chicago Community Trust.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
About this article
Cite this article
Havranek, J., Harbury, P. Automated design of specificity in molecular recognition. Nat Struct Mol Biol 10, 45–52 (2003). https://doi.org/10.1038/nsb877
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/nsb877
This article is cited by
-
Taking the Myc out of cancer: toward therapeutic strategies to directly inhibit c-Myc
Molecular Cancer (2021)
-
A rational blueprint for the design of chemically-controlled protein switches
Nature Communications (2021)
-
Conceptual Constructive Models and Abstraction-as-Aggregation
Philosophia (2021)
-
DIP/Dpr interactions and the evolutionary design of specificity in protein families
Nature Communications (2020)
-
The IKK-binding domain of NEMO is an irregular coiled coil with a dynamic binding interface
Scientific Reports (2019)


