Introduction

In the complex realm of microorganisms, viruses stand out as enigmatic entities, existing at the intersection of life and non-life. Viruses consist of a minimal genome containing the necessary information for replication, which is encapsulated in a protein shell called a capsid, and in some cases, also a lipid envelope derived from the host cell membrane, studded with viral glycoproteins that mediate host recognition and entry.

Viral infection begins with a multistep entry process, where the virus first attaches to host cell surface receptors and penetrates the cell through membrane fusion or endocytosis (Fig. 1). Virions that enter endosomes escape in response to environmental signals such as pH changes or enzyme activation. Once inside the cytosol, the viral capsid undergoes uncoating, releasing its genetic material for replication. Subsequent stages in the life cycle include the synthesis of viral components, followed by virion assembly and release. As obligate intracellular parasites, viruses rely on the host cell’s machinery for propagation. Understanding which host factors are recruited throughout the viral life cycle is crucial, as these interactions drive viral pathogenicity and can inform therapeutic strategies to combat emerging threats. However, studying these mechanisms using live viruses presents significant challenges. Viral entry often involves multiple receptors and attachment factors, making it difficult to isolate specific interactions. Moreover, modifying viral surface properties to study these processes can be inherently limited by strict biosafety regulations, which further restrict research accessibility when handling highly pathogenic viruses.

Fig. 1: Simplified viral life cycle.
figure 1

The majority of viruses—whether enveloped or nonenveloped—depend on the host cell’s endocytic pathways for entry (left)77,163,164,165,166. Some viruses bypass complex entry mechanisms by directly fusing their envelope with the plasma membrane, allowing their capsids to reach the cytosol (middle). Endocytosed viruses undergo a multistep process that moves them from the cell surface to specific intracellular sites where their genetic material is released. Initially, viruses attach to cellular receptors, which may be proteins, lipids, or carbohydrates, typically in a specific and multivalent fashion. These interactions trigger cellular signaling pathways, leading to virus uptake through various endocytic mechanisms, primarily macropinocytosis and clathrin-mediated endocytosis. Following vesicular trafficking, host cell cues—such as pH changes or proteolytic cleavage—induce conformational changes in the virus, facilitating penetration into the cytosol. Enveloped viruses achieve this through membrane fusion, while nonenveloped viruses disrupt the vacuolar membrane through pore formation or lysis. The precise site of penetration varies depending on the virus and host cell type, occurring at the plasma membrane, early or late endosomes, macropinosomes, or the endoplasmic reticulum. After cytosolic entry, RNA viruses typically replicate in the cytoplasm, whereas most DNA viruses continue their journey to the nucleus. The entry process concludes with controlled uncoating of the viral genome, marking the start of replication (right). Newly synthesized viral components are then assembled into progeny virions, which are subsequently released from the host cell via budding or lysis.

To address these challenges, cages derived from nonviral proteins are a promising class of tools for studying viruses. These hollow particles are well-defined structures that can mimic the architecture of non-enveloped viruses, eliminating the need to work with live, infectious agents. Their versatility allows them to be engineered to imitate specific viral functions in controlled settings1,2, providing a platform to dissect virus-host interactions and cellular responses without the complexity associated with studying whole viruses. Additionally, the low inherent toxicity of proteins and their inability to replicate make them safer and more accessible for exploratory research in laboratory settings.

In this perspective, engineering efforts aimed at mimicking various stages of the viral life cycle using artificial protein cages are briefly reviewed. While numerous examples of virus-like particles have been described in the literature, this article focuses exclusively on scaffolds derived from nonviral proteins, such as ferritin, lumazine synthase, and computationally designed architectures (Fig. 2). Through engineering to exhibit virus-like behaviors, these artificial systems offer valuable insights into viral processes1 and present new opportunities to develop innovative medical and biotechnological solutions to address current and future health challenges2.

Fig. 2: Nonviral protein cages: structural diversity and design principles.
figure 2

Surface representations of different protein cages varying in size, symmetry, and the number of constituent capsomers, derived from either naturally occurring (ad) or computationally designed scaffolds (ei). a AaLS-wt (PDB 5MPP), a 60-subunit self-assembling cage, served as a template for multiple directed evolution campaigns. b NC-4 (PDB 7A4J) and c AaLS-13 (PDB 5MQ7) are evolved variants of AaLS-wt, optimized to encapsulate their own encoding mRNA55 and toxic proteins167, respectively, with both exhibiting increased diameters. d A variant of human heavy-chain ferritin (PDB 2CEI) with positively charged surface residues was encapsulated within AaLS-13, forming matryoshka-like assemblies168. e OP (PDB 6FDB) was engineered by introducing positive charges onto the lumenal surface of the computationally designed O3-33 cage43. Owing to its positively charged lumen and porous architecture, OP can be efficiently loaded in vitro with a desired oligonucleotide44 or anionic surfactants, forming a hydrophobic core that enables the sequestration of non-polar molecules45. f I3-01 is a 60-subunit icosahedral cage composed exclusively of trimeric subunits. Its variant, mi3, was created by mutating surface-exposed cysteines (Cys76 and Cys100) to alanine169. The SpyTag/SpyCatcher technology170 has been used to encapsulate both proteinaceous and non-proteinaceous cargo within mi3 cages13. g I53-50 (PDB 7SGE) is a 120-subunit icosahedral assembly comprising trimeric (blue) and pentameric (gray) components171. I53-50 has been engineered to encapsulate nucleic acids52 as well as drug-loaded synthetic polymers63. h O42.1 is a 30 nm porous octahedral structure consisting of six tetramers (gray) and 12 dimeric Fc subunits (blue). i Closure of the O42.1 cage’s pores with pH-responsive trimeric plugs (yellow) resulted in the O432 variant. Modifying the interior-facing residues of the trimeric plug to introduce internal charge enabled the packaging of nucleic acid or protein cargo172. All structures were visualized using ChimeraX173,174. O42.1 and O432 were modeled from density maps deposited in the Electron Microscopy Data Bank175 (EMD-29602).

Binding, internalization, and cargo release

While human ferritin exhibits low-affinity binding to transferrin receptor-1, which mediates its internalization3, nonviral protein cages (NVPCs) typically lack inherent membrane-binding capabilities. However, surface engineering can enhance their affinity for cellular membranes. For instance, the trans-activator of transcription (Tat) peptide from the human immunodeficiency virus (HIV)4 is widely employed to facilitate the cellular uptake of protein cages. Its ability to penetrate cell membranes arises from its positively charged, arginine- and lysine-rich sequence, which interacts with negatively charged cell-surface molecules such as heparin sulfate. This enables Tat-fused proteins to cross cellular barriers in an endocytosis-independent manner5,6. In contrast, the tripeptide arginine-glycine-aspartic acid (RGD), found in the adenovirus penton base protein7, is critical for receptor-mediated viral entry as it interacts with αv integrins on host cells, triggering endocytosis8. To enhance cell binding and uptake, RGD motifs have been incorporated into protein cages such as lumazine synthase9, ferritin10,11,12, and a a variant of the computationally designed I3-01 cage, known as mi313. While RGD peptides are effective in targeting integrins, full-length antibodies14,15,16,17, antibody fragments18,19, and mimics20 have also been used to direct protein cages to alternative receptors.

Antibody-binding domains (ABDs)21, such as those derived from protein A22,23, are often employed to display antibodies on the surface of NVPCs24,25,26,27,28,29,30,31. These domains specifically bind to the Fc region of antibodies, ensuring proper orientation of the Fab regions for receptor recognition. Using this method, variants of lumazine synthase have been successfully targeted to cells overexpressing CD4432 or the human epidermal growth factor receptor 232,33, with minimal uptake observed in receptor-negative cells. However, the non-covalent nature of ABD-antibody interactions can result in loss of targeting specificity in the presence of competing immunoglobulins, restricting this approach to in vitro applications33. Improved targetability in competitive environments could potentially be achieved through covalent linkage of antibodies to the cage or ensuring full antibody occupancy on the cage surface, thereby reducing the likelihood of antibody exchange. However, such strategies may lead to challenges, such as antibodies crosslinking multiple nanoparticles, potentially causing precipitation.

Recent advances in computational protein design have made it possible to precisely orient antibodies in non-natural geometric assemblies. By grafting residues from protein A onto dimeric, trimeric, tetrameric, or pentameric helical repeat proteins, researchers have constructed dihedral, tetrahedral, octahedral, and icosahedral architectures that display 2, 6, 12, and 30 protein A-binding antibodies per cage, respectively34. These antibody nanocages have been successfully targeted to cells overexpressing the epidermal growth factor receptor. However, their performance in the presence of competing immunoglobulins remains uncertain. Furthermore, after internalization, these engineered protein cages tend to become trapped within endosomes, potentially limiting their functional efficacy33,34.

Once internalized via endocytosis, viruses must escape the endolysosomal pathway to avoid degradation, deliver their cargo, and successfully infect the host. Although no universal escape mechanism exists, a common feature is the exploitation of endosomal acidification. As the endosome matures, its pH gradually decreases due to the activity of ATP-dependent proton pumps35. Endosomal acidification serves as a crucial trigger for conformational changes in viral proteins, facilitating their interaction with the endosomal membrane36. A prominent example is the influenza A virus, where pH-driven changes in hemagglutinin enable membrane fusion and genome release37,38. Additionally, the influenza M2 protein forms a proton channel that allows protons to flow into the virus interior, further promoting uncoating39,40.

In contrast to their enveloped counterparts, non-enveloped viruses cannot fuse with the endosomal membrane. To gain access to the cytoplasm, they disrupt endosomes through lysis or pore formation processes that remain incompletely understood41. Mimicking these mechanisms using membrane-less protein cages presents a significant challenge. Although a general strategy for achieving endosomal escape has yet to be established, the pH-responsive behavior of viruses has inspired engineering efforts. For example, researchers designed trimeric “plug” elements to seal large pores in octahedral antibody nanoparticles42. These plugs remain closed at neutral pH but dissociate in vitro at a pH of 6.7—higher than the typical endosomal pH, which ranges from 6.5 (early endosome) to 5.5 (late endosome).

In studies on an engineered variant of the computationally designed O3-33 cage43, known as OP44,45,46, researchers attributed cytoplasmic siRNA delivery to a hexahistidine tag on each of the cage’s 24 subunits44. Hypothetically, protonation of the histidine residues induces an osmotic imbalance, leading to water influx and endosomal swelling—a phenomenon known as the proton sponge effect47. This swelling can destabilize the endosomal membrane, ultimately releasing the cargo into the cytoplasm. Alternatively, protonated histidine residues may directly induce endosomal membrane permeabilization48. Engineering such pH-responsive features in artificial cages may bring us closer to replicating intracellular cargo release and enhances our understanding of viral entry processes following endosomal priming, including membrane fusion, uncoating, and genome release.

Nucleic acid packaging

After successful replication, the virus progresses to the late stages of its life cycle, including assembly and release from the host cell. These processes ensure the formation of infectious progeny and their dissemination. During assembly, the newly synthesized viral genome is efficiently encapsulated within the viral capsid. Some viruses achieve remarkable packaging efficiency by utilizing relatively simple recognition motifs49. Many capsid proteins are positively charged, allowing them to bind tightly to the negatively charged phosphate backbone of nucleic acids. Inspired by these natural mechanisms, artificial RNA encapsulation systems using NVPCs have been developed1. By incorporating positively charged residues on the lumenal surfaces of these cages, particles capable of encapsulating endogenous nucleic acids in cellular environments were created44,46,50,51. In the case of stable, porous cages, such as OP, unwanted bacterial RNA guests can be removed using a combination of RNase A treatment to degrade contaminant RNA and high-ionic-strength buffers to weaken electrostatic interactions44. The resulting empty cages can then be loaded in vitro with desired synthetic oligonucleotides. To achieve selective genome packaging in vivo, researchers have employed directed evolution strategies.

In 2017, David Baker’s laboratory—led by the 2024 Nobel Prize laureate in Chemistry—used the computationally designed two-component cage I53-50 as a starting point for evolution52. A library of protein cages with variably charged interior surfaces was generated and selected for genome packaging and cargo protection. After several rounds of evolution, a nucleocapsid (I53-50-v4) with a positively charged inner surface was generated. These evolved protein cages were capable of packaging one full-length genome for every 11 particles. Similarly, in 2018, Hilvert and coworkers utilized the wild-type enzyme lumazine synthase from the hyperthermophilic organism Aquifex aeolicus (AaLS), which naturally forms 60-subunit dodecahedral cages, as a scaffold. By applying circular permutation to the AaLS monomers, the N- and C-termini were repositioned from the exterior to the interior surface53. The λN+ peptide, derived from bacteriophage lambda, was genetically introduced to the newly relocated N-terminus54. This peptide binds the boxB RNA hairpin structure through its arginine-rich domain. By flanking the capsid protein mRNA sequence with boxB tags, assemblies capable of binding their own encoding mRNA were generated and subjected to directed evolution. After several rounds of mutagenesis and screening, the best variant, NC-4, emerged as icosahedrally symmetric closed shells that encapsulated an average of two copies of its coding mRNA per particle55. Remarkably, these cages protected their cargo from nuclease degradation for days. This efficient packaging was achieved through the co-evolution of packaging signals within the RNA genome, mirroring a hallmark of viral evolution55. Structurally, NC-4 adopted an architecture resembling that of T = 4 viruses and featured 3D-domain swaps, which have been reported to enhance the stability of certain viral capsids56,57. NC-4 provides compelling evidence supporting the hypothesis that viruses may have emerged from preexisting cellular components58,59.

Despite their ability to encapsulate their own genetic material with high precision, some viruses package foreign genomes. This process, known as viral reassortment, is particularly prevalent among RNA viruses and occurs when two or more viruses of the same genera with segmented genomes, such as the influenza virus, co-infect a cell and exchange genetic material60. Reassortment of gene segments in influenza viruses can result in significant antigenic changes to the hemagglutinin and neuraminidase proteins, potentially creating chimeric viruses with an increased risk of zoonotic spillover61. The 2009 H1N1 pandemic is a notable example of such a reassortment event62. Understanding this process is vital for improving pandemic preparedness.

Artificial nucleocapsids offer modular and controllable systems to study the fundamental mechanisms underlying viral reassortment. Encapsulation of foreign RNA sequences in vitro has been demonstrated using the nucleocapsids I53-50-v4 and NC-463,64. For I53-50-v4, preventing cage assembly in the cytoplasm of E. coli was achieved by separately producing the pentameric and trimeric components. This approach required a single point mutation to enhance the solubility of the pentameric subunits. Single-stranded RNA molecules of varying sizes were encapsulated simply by mixing the cage components with negatively charged RNA cargo63. Similarly, the assembly of NC-4 cages in cells was inhibited by fusing each monomer to a maltose-binding protein64. Selective proteolytic cleavage of this steric block in the presence of up to 2600 nucleotides of foreign RNA triggered capsid assembly.

The nucleocapsids assembled in vitro exhibited the same structural characteristics as those formed in vivo and efficiently protected their cargo from nuclease degradation. Although replicating genome packaging within a physiologically relevant environment can be challenging, the ability to precisely control cargo composition and encapsulation offers significant advantages for studying viral packaging mechanisms. For instance, it could allow assessment of the encapsulation efficiency of genomes with different packaging signals, aligning with studies using virus-derived proteins65,66,67,68,69,70.

In host cells, viruses can hijack existing biomolecular condensates or even induce new ones to support their life cycle71. For example, the replication and assembly of influenza virus are facilitated by its genome—eight single-stranded, negative-sense RNA segments—undergoing liquid-liquid phase separation72,73,74. Since these biomolecular condensates are fragile and difficult to observe and analyze75, novel characterization methods are highly sought after.

In a recent study, human ferritin and mi3, were engineered to interact with liquid-liquid phase-separated proteins76. Encapsulating the condensates with a stabilizing layer of cages hardened them and prevented coalescence, enabling detailed structural visualization using cryoET. By adjusting the condensate-to-cage ratio, researchers could significantly vary the size of the assembled condensates. Once the manipulation of these membrane-less organelles within cells becomes possible, it could open new avenues for exploring phase-separated viral genomes, such as that of influenza.

Virion release

Virus release from host cells can occur through various mechanisms, including lysis, exocytosis, and budding77. Most enveloped viruses, such as HIV, acquire their envelope via the budding process78. The HIV-1 Gag protein plays a pivotal role in this pathway79. The N-terminal domain of Gag recruits phosphatidylinositol (4,5) bisphosphate at the host plasma membrane80, while the C-terminal p6 region facilitates interactions with the host ESCRT machinery to mediate the final membrane fission step required for release81.

This understanding has been leveraged to engineer artificial NVPCs capable of directing their own release from eukaryotic cells. For instance, the first six amino acids of the HIV-1 Gag protein and its p6 region were genetically fused to the N- and C-termini, respectively, of the computationally designed protein cages I3-01 and O3-3382,83. Expression of these constructs in human embryonic kidney (HEK) 293T cells resulted in the production of extracellular vesicles of varying sizes containing multiple copies of the designed nanocages. Cryo-electron tomography revealed that most of the nanocages were membrane-associated.

In contrast to these nonviral systems, HIV-1 capsids are released individually from the plasma membrane84. This variation in the number of proteinaceous capsids per vesicle highlights the significant influence of factors such as particle size and the surface density of Gag and p6 proteins on the budding process. Adapting this strategy to various scaffolds presents an opportunity to study how these factors affect membrane curvature and the recruitment of host machinery. To gain structural insights into virus-host interactions, cryo-electron tomography provides high-resolution 3D visualizations of macromolecular complexes in their native cellular environment85,86. Unlike polymorphic viruses such as influenza, which are challenging to image due to their variability, protein cages offer a consistent size and high contrast, making them a reliable alternative for structural studies in cells while reducing variability and enhancing data quality.

Lipid envelope coating opens new possibilities for creating innovative types of pseudotyped particles based on nonviral capsids. Traditional pseudotyping involves producing viral vectors with envelope proteins from a different virus to either limit or broaden the range of cell types they can infect, referred to as their tropism87. To generate pseudotyped NVPCs, the engineered I3-01 scaffold was co-expressed with a beta-lactamase fusion protein designed for encapsulation within the cages, along with the vesicular stomatitis virus glycoprotein (VSV-G)83. VSV-G, a trimeric spike protein, is commonly used for pseudotyping as it facilitates receptor binding and membrane fusion88. All three proteins, encoded on separate plasmids, were efficiently co-released in extracellular vesicles from HEK 293T producer cells. After harvesting, the uptake of particles by HeLa cells was assessed using a beta-lactamase colorimetric activity assay, confirming successful cargo release into the host cytoplasm, likely due to VSV-G-mediated membrane fusion at either the plasma or endosomal membrane.

VSV-G-pseudotyped particles are ideal for studying membrane fusion mechanisms, and extending this approach to other spike proteins could enable functional investigations without requiring intact pathogens. Traditional pseudotyping methods allowed virus studies in BSL-2 labs instead of higher-containment BSL-3 or BSL-4 facilities89, which are associated with high costs, limited access, and strict biosafety protocols. These constraints slow research, reduce throughput, and increase logistical complexity. Pseudotyped NVPCs offer a safer alternative, potentially enabling research in BSL-1 labs and broadening participation in viral mechanism studies.

Immune modulation

The viral envelope plays an important role in modulating the immune response against enveloped viruses90,91. By masking underlying viral proteins, the envelope reduces the immune system’s ability to recognize and neutralize the virus. This immune evasion strategy can also be adapted to nonviral systems, as demonstrated by the enveloped I3-01 particles, where the acquired membrane shielded the protein cage from antibody recognition83. Surrounding protein-based compartments with host-derived components appears to be a general mechanism for reducing immunogenicity and appearing less foreign to the immune system. For instance, the previously described AaLS variant displaying ABDs was less immunogenic than its parent scaffold lacking these binding peptides33. This reduction in immunogenicity was attributed to the cage’s recruitment of circulating immunoglobulins, forming a protective shield of host proteins that masked the underlying particle.

Glycans on viral glycoproteins further contribute to immune evasion by mimicking host molecules and obscuring key antigenic sites, thereby preventing detection by neutralizing antibodies92,93,94. Artificial protein cages can be similarly shielded through glycoengineering platforms. For example, an AaLS variant and the computationally designed I53-50-v4 were successfully glucosylated in the cytoplasm of E. coli through co-expression with the asparagine N-glucosyltransferase from Actinobacillus pleuropneumoniae95. As demonstrated with the AP205 bacteriophage, the installed N-linked glucose can serve as a primer for sugar chain elongation95. A particularly promising immune-stealthing modification involves linear polysaccharides composed of α-2,8-linked sialic acid, for which enzymatic pathways have already been engineered96. Alternatively, cell-free biosynthesis platforms offer modular approaches for modifying proteins with diverse oligosaccharides97,98. Moreover, the eukaryotic glycosylation machinery can be exploited to replicate specific viral glycosylation patterns, a key factor in the development of protein-based vaccines.

Training the immune system to recognize pathogens remains one of the most effective strategies to combat infectious diseases. While traditional vaccines using live-attenuated or inactivated viruses have been successful, they pose challenges related to safety99,100. Safer alternatives involve using isolated pathogen antigens to elicit robust immune responses. However, antigen fragments alone are often weakly immunogenic, even when combined with adjuvants101. To enhance immunogenicity, antigen fragments can be multivalently displayed on scaffolding proteins. Protein-based particles improve antigen delivery to lymph nodes—key sites for immune activation—and promote efficient uptake by antigen-presenting cells102,103. Additionally, presenting antigens on nanoparticles enhances B-cell activation by effectively cross-linking B-cell receptors, leading to stronger antibody production102,103.

Although naturally occurring protein cages such as ferritin104 and AaLS105 have been widely used in vaccine development, computational design has greatly expanded the range of available antigen display platforms106. These artificial scaffolds exhibit diverse symmetries and sizes, often exceeding those of natural cages, enabling precise control over antigen number, placement, and spacing107. Additionally, some architectures incorporate two distinct protein components, including a trimeric unit ideal for presenting trimeric antigens, which distinguishes them from homomeric scaffolds like ferritin and AaLS. This feature enables the formation of particles displaying multiple antigens simply by mixing independently produced scaffolding proteins and antigenic fragments. These mosaic designs are under development to create vaccines targeting viruses with overlapping seasons108 or to protect against heterotypic viruses109,110 and diverse coronavirus subgenera111. In contrast, assembling similar constructs with homomeric cages requires additional post-production conjugation techniques, such as SpyTag/SpyCatcher technologies112,113,114.

The two-component icosahedral I53-50 cage is widely used as a vaccine carrier due to its high valency, efficient production, and capacity to display diverse antigens, including the respiratory syncytial virus fusion protein115, influenza hemagglutinin109, and the SARS-CoV-2 spike116 and receptor-binding domain117. However, when presenting heavily glycosylated antigens such as the HIV-1 envelope protein, issues with assembly and stability were observed, likely due to excessive crowding on the cage surface118. Interestingly, using an equimolar amount of the trimeric antigen for immunization, displaying it on a tetrahedral cage produced a more stable construct that elicited an immune response comparable to that of I53-50, despite presenting five times fewer trimers (four versus 20)118. This may be due to the greater epitope accessibility offered by tetrahedral symmetry compared to icosahedral symmetry34.

In a separate study aimed at developing a livestock vaccine for East Coast fever, a deadly disease caused by Theileria parva, researchers compared the immune responses elicited by displaying a parasitic antigen on I53-50 and two other icosahedral cages119. Notably, I53-50 induced a stronger immune response than the other scaffolds. However, one of the alternative cages exclusively triggered an IgG1 response with no detectable IgG2, highlighting differences in immune polarization among these designs.

While these studies provide valuable insights into the immunological characteristics of computationally designed cages, further comparative assessments are necessary to fully determine the advantages of different platforms. Systematic testing of diverse building block arrangements120 could clarify how factors like particle size, antigen valency and spacing, as well as surface properties, affect vaccine potency, guiding the design of more effective formulations.

Future vaccine development will benefit from advancements in computational tools capable of predicting viral glycoprotein mutations that may compromise existing vaccines121. Given that nucleocapsids directly link genotype and phenotype122, it should theoretically be possible to randomize the sequence of displayed viral spikes and select for variants that acquire resistance to neutralizing antibodies in vivo, offering a valuable approach for training prediction tools. Early detection of unusual mutations associated with zoonotic spillovers or emerging pathogens through viral surveillance will also play a crucial role in pandemic prevention123. The ability to design artificial protein cages that display viral glycoproteins with precise geometries presents exciting opportunities. Such constructs could incorporate predicted mutations linked to increased public health risks, enhancing pandemic preparedness by enabling ready-to-use vaccines or providing model systems for a deeper understanding of viral pathogenesis.

Model systems for pathogenicity studies

A significant concern in modern virology is the presence of polybasic cleavage sites (PCSs) in the spike proteins of many phylogenetically distant viruses124. These sequences, usually rich in basic amino acids like arginine or lysine, are targeted and cleaved by host cell proteases, a process essential for virus activation. PCSs are particularly notable in SARS-CoV-2 and MERS-CoV, where they enhance replication in the respiratory tract and human transmissibility125,126,127,128,129. Similarly, PCSs in the hemagglutinin proteins of highly pathogenic avian influenza viruses (e.g., H5 and H7 subtypes) are associated with increased virulence and organ tropism, raising significant concerns about zoonotic transmission130,131,132,133,134,135.

Cleavage of PCSs by furin generates a C-terminal R/KXXR/K motif, known as the C-end rule (CendR) motif, where R and K represent arginine and lysine, and X denotes any amino acid. This motif binds neuropilin (NRP) receptors136, which are transmembrane proteins involved in signaling, angiogenesis, and immune modulation137,138,139. NRPs facilitate cellular entry for viruses bearing CendR motifs140,141,142,143, including SARS-CoV-2144,145; however, the broader role of NRP-CendR interactions in viral pathogenicity remains unclear. Critical questions include whether NRPs facilitate the entry of all PCS-containing viruses (e.g., Ebola), if additional host factors contribute to entry of viruses with CendR peptides, how PCS-mediated internalization impacts viral trafficking, and whether PCSs reflect convergent evolution or influence zoonotic transitions.

Inserting or removing PCSs from viruses could help address these questions, but the process is not straightforward. Removing or altering PCSs may impair viral replication, rendering the virus non-viable, while inserting PCSs could result in unintended outcomes, such as gain-of-function, potentially creating new harmful strains. These concerns underscore the ethical and safety challenges of handling pathogenic viruses. Developing alternative approaches is essential for responsible research. Nonviral self-assembling protein cages offer a promising solution for studying PCSs mechanisms while minimizing risks. By designing protein cages that display PCS-containing spikes or CendR motifs, associated viral entry mechanisms could be investigated in a controlled and safe manner.

Exploring viral processes through imaging techniques can also be significantly simplified using protein cages. They can be engineered to incorporate fluorophores or contrast agents at specific sites146,147 or to encapsulate imaging modalities such as fluorescent proteins148. These labeled particles enable tracking via confocal microscopy to study processes such as endocytosis, trafficking, and intracellular sorting. Proteomics profiling of cage-positive cells isolated through laser microdissection can identify specific host factors involved in the endocytosis of virus-mimicking particles149. Combining this with subcellular fractionation allows for spatial identification of proteins associated with key compartments, such as surface receptors, endosomal markers, or nuclear proteins150,151. Functional validation of these factors can be achieved using small-molecule inhibitors152, RNA interference, or CRISPR-based gene knockouts.

Protein cages are useful scaffolds for cryo-electron microscopy studies due to their uniform size and symmetrical structure, which streamline image processing and enhance the precision of 3D reconstruction153,154. This makes them particularly suitable for examining structural changes resulting from the incorporation of PCSs into viral glycoproteins. For instance, cages engineered to display a mutated hemagglutinin protein with an introduced PCS could reveal structural differences that explain the enhanced virulence of H5 or H7 influenza subtypes in poultry. However, NVPCs are not enveloped and are smaller than influenza viruses, which may limit their ability to fully mimic native viruses. Harnessing pseudotyped nonviral cages with an envelope could overcome these limitations by providing a more virus-like platform. Integrating structural data with phenotypic insights from cage entry assays could establish a comprehensive genotype-structure-phenotype relationship, deepening our understanding of how PCSs in spike proteins influence viral virulence.

Conclusion

Historically, many pandemic-causing infectious diseases originated from animals. The emergence of such life-threatening pathogens is facilitated by urbarnization155, which increases animal-human contact, and global warming156,157,158, which promotes the spread of wildlife and virus vectors like mosquitoes to new geographical areas. Understanding how these pathogens evolve is crucial for developing effective treatments and preventive measures. NVPCs, with their exceptional engineerability, provide a remarkable platform for mimicking and studying viral structure and function. Importantly, these systems eliminate the safety concerns associated with live pathogens or pseudotyped viruses, which require at least biosafety level 2 facilities. By enabling rapid, safe investigations of viral glycoproteins and their mutations, NVPCs can facilitate the early identification of variants with pandemic potential, as well as the host factors that promote their entry.

Despite significant progress, critical questions about viral assembly remain unanswered. How are segmented genomes packaged with one copy of each segment per particle? What roles do viral and host condensates play in this process? Which host factors influence genome packaging efficiency and fidelity? NVPCs offer a unique opportunity to dissect these complex mechanisms by enabling controlled experiments that would be challenging with live viruses. Addressing these questions will not only deepen our understanding of viral biology but also improve our ability to intervene at later stages of the viral lifecycle, thus limiting further spread. Democratizing the use of NVPCs as versatile tools for studying viruses will advance both fundamental research and global preparedness for future pandemics.

The Coalition for Epidemic Preparedness Innovations (CEPI) has identified 25 virus families that are most likely to produce the next deadly viral threat. Leveraging NVPCs to gain insights into these virus families, as well as utilizing them as vaccine platforms, could play a pivotal role in mitigating the impact of future pandemics on global public health. Unlike the lipids used in mRNA vaccines, these particles can be produced through entirely bioprocess-based methods, reducing carbon emissions, chemical waste, and environmental impact. This approach also lowers production costs, making treatment and immunization more affordable and accessible. While large-scale production of NVPCs can benefit from the expertise gained with other protein therapeutics, much remains to be learned to ensure proper cage assembly and purity, while also meeting good manufacturing practice requirements. Establishing efficient manufacturing methods could enable rapid initial production and scalable outputs, key elements in achieving CEPI’s ambitious “100 Days Mission” to respond to and contain future pandemics within 100 days159.

The end of the infectious disease crisis can be accelerated through vaccine equity. The rise of resistant pathogens underscores the global failure to share vaccines equitably. Ensuring global recovery requires that the majority of the population, particularly the world’s poorest and most vulnerable, have access to sufficient vaccine doses. The temperature stability exhibited by some NVPCs46,118,160 could substantially contribute to efficient worldwide distribution by alleviating the challenges of cold-chain storage and transport.

While vaccines prevent infections before they occur, they don’t treat existing infections or reduce symptom severity and illness duration like antiviral drugs161. If NVPCs can fully replicate the viral entry process, they could serve as novel delivery platforms for drugs targeting virus-specific components to reduce infectivity and alleviate disease symptoms (Fig. 3). For instance, nucleic acid therapeutics could be used to interfere with the viral genome, while small molecules could inhibit key viral enzymes, such as reverse transcriptase or polymerase. Additionally, competing with the same host factors as the target virus to enter cells and release antiviral payloads could further enhance the efficacy of this strategy. However, to fully realize the potential of NVPCs in molecular delivery, several critical questions remain: do they bind to proteins in the bloodstream? How long do they circulate? Where do they accumulate? How immunogenic are they? Systematic in vivo experiments are essential to answer these questions. Understanding how factors such as size, shape, symmetry, and surface charge affect their behavior in the body will enable scientists to leverage the engineering toolbox at their disposal to continually improve NVPC design162.

Fig. 3: Next-generation antiviral cages.
figure 3

By mimicking the early stages of the viral life cycle, NVPCs can be leveraged to develop antiviral strategies that both treat infected patients and prevent further viral spread. First, NVPCs compete for receptor binding, blocking viral entry. After cellular uptake via receptor-mediated endocytosis, engineered cages escape the endosome and release their payload (here depicted as an siRNA molecule). The delivery of nucleic acids targeting viral mRNA or host entry factors provides a final antiviral effect, significantly reducing viral infectivity. This figure was inspired by a template from BioRender176,177.

NVPCs are safe, malleable particles that resemble viruses and can be easily modified to mimic their functions, offering insights into viral virulence and enhancing therapeutic strategies against emerging threats. This perspective introduces NVPCs to an audience that may be unfamiliar with the technology, with the goal of inspiring cell biologists and virologists to incorporate them into their research. Additionally, the insights gained from studying viruses could benefit the protein engineering community, aiding in the development of NVPC-based delivery platforms to tackle other major health challenges. By fostering collaboration between these communities in cutting-edge multidisciplinary efforts, we can improve our ability to respond quickly to future pandemics and other global health crises.