Introduction

Nα-terminal (Nt) modifications are key determinants of protein fate, influencing folding, activity, localization, and stability across all domains of life1,2,3. Among these, Nt-formylmethionine (Nt-fMet) is a hallmark of bacterial and organellar translation, in which it initiates protein synthesis and orchestrates early co-translational processing4,5,6,7. Beyond initiation, Nt-fMet contributes to translational fidelity, co-translational folding, protein stability, the assembly of multiprotein complexes, and membrane integrity8,9,10,11.

For decades, Nt-fMet-directed protein synthesis in eukaryotic cytosol was considered absent or negligible, largely due to the presumed lack of a cytosolic methionyl-transfer RNA (tRNA) formyltransferase (FMT) and technical barriers to detection5,12. However, recent advances in Nt-proteomics13 and the development of pan-fMet antibodies have revealed robust synthesis of fMet-bearing proteins on cytosolic ribosomes in both yeast and human cells14,15,16,17. These discoveries uncovered a ribosome quality control (RQC) mechanism directed by Nt-fMet (the fMet-RQC pathway)18 and a proteolytic degradation route targeting Nt-fMet (the fMet/N-degron pathway)14,17.

In parallel, stress-induced cytosolic fMet-protein synthesis, along with the fMet-RQC and fMet/N-degron pathways, implies an endogenous pool of N-formylated peptides (formyl peptides) within eukaryotic cells19. These peptides, previously attributed solely to bacterial or mitochondrial origins20,21,22, may activate formyl peptide receptors (FPRs), thereby modulating innate stress responses and cellular signaling19.

This Review highlights recent advances in cytosolic fMet-protein synthesis and its regulation through the fMet-RQC and fMet/N-degron pathways. Topics specific to bacterial and organellar translation, FPR signaling, and therapeutic implications are discussed in detail elsewhere6,7,21,22,23.

Protein Nt-formylation

Mechanistic diversity of protein formylation

Protein formylation arises via two distinct processes. Nε-lysine formylation is a non-enzymatic, damage-associated adduct, in which reactive metabolites (for example, 3′-formylphosphate from oxidative DNA damage or formaldehyde) modify lysine side chains24,25 (Fig. 1a). Low-level Nε-formyllysine is detected on histones and can compete with acetylation, but its in vivo roles remain unresolved25.

Fig. 1: Eukaryotic cytosolic fMet-protein synthesis and regulatory features of formyltransferase.
Fig. 1: Eukaryotic cytosolic fMet-protein synthesis and regulatory features of formyltransferase.The alternative text for this image may have been generated using AI.
Full size image

a Chemical conversion of ε-lysine to Nε-formyllysine by formylphosphate derived from DNA or formaldehyde. To date, no dedicated Nε-deformylase has been reported in vivo. b Canonical pathway of Nt-Met formylation. Met-tRNAf is converted to fMet-tRNAf by formyltransferase (FMT) using 10-fTHF as a one-carbon donor. IF2 then delivers fMet-tRNAf to the small subunit ribosome. Following translation initiation, PDF deformylase removes the formyl group. c Domain organizations of bacterial (Escherichia coli), yeast (Saccharomyces cerevisiae), and human (Homo sapiens) FMTs and the related GARF homolog. ScFmt1 and HsMTFMT contain a mitochondrial targeting sequence (MTS). d Recognition of initiator Met-tRNAf by EcFMT. Left: EctRNAf with key determinants for formylation. Right: the EcFMT catalytic pocket highlighting contacts between R42 and the tRNA C3:G70 base pair, and between N301 and the tRNA D arm. e Substrate specificity of EcFMT. EcFMT efficiently recognizes its cognate E. coli Met-tRNAf and, to a lesser but detectable extent, the eukaryotic S. cerevisiae Met-tRNAi. f Dual localization and stress-dependent activation of ScFmt1. ScFmt1 is constitutively mitochondrial, generating fMet-tRNAmt for mitochondrial translation. Upon nutrient depletion or cold stress, Gcn2-dependent phosphorylation retains ScFmt1 in the cytosol, where it utilizes 10-fTHF to produce cytosolic fMet-tRNAi for non-canonical initiation in the cytosol. GARFT, glycinamide ribonucleotide formyltransferase; MTFMT, mitochondrial FMT; PDF, peptide deformylase; THF, tetrahydrofolate; tRNA, transfer RNA.

By contrast, Nt-fMet is generated pre-translationally when FMT transfers a formyl group from 10-formyltetrahydrofolate (10-fTHF) onto the α-amine of initiator Met-tRNA, yielding fMet-tRNA for initiation6,7 (Fig. 1b). Alternative one-carbon donors such as 10-formyldihydrofolate also support FMT activity, linking formylation to folate metabolism and antifolate sensitivity26. For clarity, initiator tRNAs are, hereafter, denoted tRNAf (bacteria), tRNAmt (mitochondria), and tRNAi (cytosol).

Catalytic mechanism and structural features of FMT

FMT transiently formylates the Nt-α-amine, thereby licensing downstream processing events: peptide deformylase (PDF) removes the formyl group before sequence-directed Nt-Met excision and/or subsequent modification such as Nt-acetylation1,2. FMT shares functional hallmarks with other folate-dependent FMTs, including glycinamide ribonucleotide formyltransferase, such as substrate-induced conformational changes and formyl transfer via an activated carbon intermediate27 (Fig. 1c).

Bacterial FMTs comprise an N-terminal Rossmann-like catalytic domain and a C-terminal β-barrel tRNA-binding domain27. In Escherichia coli, a 16-residue insertion loop docks onto the initiator tRNA acceptor stem at the hallmark C1·A72 mismatch, positioning the methionylated 3′ end into the catalytic cleft through an induced-fit mechanism27,28 (Fig. 1d). Efficient substrate recognition relies on conserved tRNA identity elements (G2–C71, C3–G70, G4–C69, and A73) and a conserved arginine in the insertion loop (for example, E. coli FMT Arg42)29,30 (Fig. 1d).

These structural features are conserved in both E. coli Met-tRNAf and Saccharomyces cerevisiae initiator Met-tRNAi, enabling cross-species recognition. Consistently, ectopic expression of E. coli FMT in yeast results in efficient formylation of the endogenous Met-tRNAi and implies fMet-initiated translation in the cytosol of yeast cells31 (Fig. 1e).

FMT substrate compatibility and compartmental specificity

Despite conserved overall folds, FMT substrate preferences differ by lineage and compartment. In budding yeast, the mitochondrial ScFmt1 stringently recognizes initiator-type identity in the acceptor stem (notably the non-Watson–Crick 1–72 signature) and, accordingly, does not act on the cytosolic Met-tRNAi, which carries eukaryotic initiator features instead of the bacterial-type C1•A72 mismatch32.

Under stress (for example, cold shock or nutrient starvation), a fraction of ScFmt1 relocalizes to the cytosol, enabling conditional Nt-formylation outside the organelle14,18. In metazoan mitochondria, in which a single Met-tRNAmt services both initiation and elongation, MTFMT relies more on the methionyl moiety and limited stem cues than on bacterial-type identity elements33. Pathogenic MTFMT variants impair Nt-formylation and mitochondrial translation, causing Leigh syndrome with oxidative phosphorylation (OXPHOS) defects22,34,35. Thus, initiator tRNA sequence, subcellular localization, and lineage-specific FMT recognition shape the spatial and temporal dynamics of Nt-formylation across eukaryotes and their organelles6,22,36.

FMT conservation and functional divergence

Formylation of Met-tRNAf enhances bacterial initiation by favoring high-affinity binding to initiation factor IF2, excluding elongation factor EF-Tu, and stabilizing P-site loading on the ribosome6,7,37 (Fig. 1b). In E. coli, loss of FMT slows growth and compromises initiation fidelity, whereas loss or inactivation of PDF is lethal or severely deleterious, underscoring the essential nature of the formylation–deformylation cycle7,11,38.

A parallel mechanism operates in mitochondria: the mitochondrial IF2 (mtIF2) preferentially binds fMet-tRNAmt39. Although translation can proceed to some extent with non-formylated Met-tRNAmt, deficiency of MTFMT reduces translational efficiency, destabilizes OXPHOS complexes I/IV, and impairs mitochondrial proteostasis22,35,40. By contrast, archaea lack FMT and initiate with unmodified Met-tRNAi delivered by a/eIF2; an IF2 ortholog (aIF5B) acts later to promote subunit joining — an evolutionary bypass of Nt-formylation41,42.

Similarly, obligate endosymbionts with extreme genome reductions, such as mealybug-associated symbionts, have independently lost both FMT and PDF. These lineages compensate by importing host or co-symbiont factors or by relaxing their dependence on Nt-formylation altogether, illustrating a flexible reconfiguration of the translation initiation machinery43,44,45.

Cytosolic Nt-formylation in eukaryotes

In eukaryotes, FMTs are nuclear-encoded and primarily targeted to mitochondria (and plastids in plants), reflecting their bacterial ancestry. Nevertheless, under specific stresses (for example, nutrient limitation and cold) and in some cancer contexts (for example, colorectal cancer), FMT can accumulate in the cytosol, enabling fMet-initiated translation outside organelles14,15,18,46.

Consistent with possible extra-mitochondrial pools, the Human Protein Atlas annotates human mitochondrial FMT (MTFMT) as mainly nucleoplasmic with additional cytosolic localization across cell lines47. Subcellular fractionation in certain human cells detected fMet-bearing proteins and MTFMT in both mitochondrial and cytosolic fractions48. However, the cellular cues, which might drive any extra-mitochondrial relocalization of MTFMT, and the physiological role of such pools remain to be determined.

In S. cerevisiae, starvation or entry into stationary phase triggers Gcn2 (eIF2α kinase)-dependent phosphorylation that retains a fraction of ScFmt1 in the cytosol, increasing cytosolic Nt-formylation and enabling stress-responsive initiation14. In parallel, Gcn2-mediated phosphorylation of eIF2α dampens bulk translation49 while permitting selective fMet-initiated programs decoupled from global protein synthesis14 (Fig. 1f).

In mammals, GCN2 (EIF2AK4) is activated by uncharged tRNAs and ribosome collisions; GCN1 recruits GCN2 to collided disomes, and contacts with the ribosomal P-stalk relay the collision signal to the integrated stress response (ISR)49,50. When eIF2 is inhibited by eIF2α phosphorylation, alternative initiation routes engage: eIF2D and the MCTS1·DENR heterodimer promote re-initiation/initiator-tRNA delivery on upstream open reading frame (uORF)-programmed transcripts (including ATF4), and eIF5B can support eIF2-independent initiation, especially under stress or internal ribosome entry site/non-AUG contexts; by contrast, eIF2A contributes little to global or uORF-mediated initiation in human cells51,52,53. These conserved GCN2 signaling and eIF2-bypass routes may regulate cytosolic retention of MTFMT and fMet-initiated translation48. It also remains plausible that other regulatory pathways modulate fMet levels and MTFMT localization or activity.

Protein deformylation

PDF catalysis and specificity

PDF is a ribosome-associated mononuclear metallohydrolase that removes the Nt-formyl group co-translationally as nascent chains emerge the exit tunnel2 (Fig. 1b). Structural and dynamics studies place PDF on the large ribosomal subunit near uL22, consistent with in situ, on-ribosome catalysis54,55. The active site of PDF is built around three conserved motifs — GXGXAAXQ, EGCLS, and HEXXH — that chelate the catalytic metal and position the nucleophilic water; Fe2+ is the physiological cofactor (activity can be retained with Ni2+ in vitro)56. These motifs underpin high selectivity for the small formyl group while excluding bulkier acyl substituents. Substrate recognition relies largely by backbone interactions at the N terminus, which explains broad tolerance of PDFs for diverse +1/+2 residues across bacterial proteomes57.

On translating ribosomes, PDF competes kinetically with signal recognition particle (SRP): SRP binding to signal-sequence nascent chains shields the fMet and delays deformylation, introducing a timing checkpoint before membrane targeting58. When deformylation is delayed or blocked, formylated N termini accumulate and trigger stress and quality control responses in bacteria. Acute PDF inhibition rapidly induces proteostasis and membrane stress programs11.

PDF conservation, functional relevance, and therapeutic implications

PDF is a ubiquitous and essential enzyme in bacteria, where it co-translationally removes the Nt-formyl group from nascent polypeptide chains, a critical step for proper protein maturation and function2,11. Homologous enzymes are retained in many eukaryotic organelles of bacterial origin, namely, mitochondria and plastids, where translation initiates with fMet. By contrast, the eukaryotic cytosol and archaea do not utilize fMet and generally lack canonical PDF41,59,60,61.

Notably, canonical PDF genes are absent from the genomes of S. cerevisiae and Caenorhabditis elegans, indicating lineage-specific losses of deformylation capacity in some eukaryotes60. Eukaryotes with reduced or absent mitochondria also lack PDF: Giardia possesses mitosomes without organellar translation, and Monocercomonoides has secondarily lost mitochondria altogether62. Remarkably, giant marine viruses encode their own PDFs within specialized translational modules, co-opting the formylation–deformylation cycle to ensure proper processing of viral proteins63,64.

In mammals, mitochondrial PDF (mtPDF) facilitates efficient synthesis and assembly of OXPHOS complexes. Genetic or pharmacological inhibition of mtPDF reduces the expression of mtDNA-encoded subunits and impairs respiratory function, highlighting the organelle-restricted yet essential role of deformylation in mitochondrial bioenergetics65. Furthermore, mtPDF is upregulated in several tumor types, suggesting its relevance in cancer metabolism66.

The strict requirement for PDF in bacteria, combined with its absence from the eukaryotic cytosol, has made it an attractive antibacterial target67,68. Actinonin-class inhibitors that block the PDF active site lead to toxic accumulation of formylated proteins and collapse bacterial proteostasis11,69,70. Although early compounds suffered from poor bioavailability and off-target effects, next-generation PDF inhibitors with enhanced specificity, pharmacokinetics, and reduced toxicity are now in development. Selective inhibition of mtPDF may also hold as an anticancer strategy, particularly in tumors reliant on organellar translation and OXPHOS activity68,71.

The fMet-RQC pathway: a novel ribosome surveillance system

Canonical RQC pathways

Aberrant ribosome stalling, caused by defective mRNAs, rare codons, polybasic tracts, or drug-induced pauses72,73,74,75,76, generates incomplete nascent chains that contribute to cellular stress, aging, and neurodegeneration77,78,79,80,81,82. Canonical RQC pathways mitigate these stressors by detecting stalled ribosomes, splitting subunits, and routing aberrant chains for degradation74,79,80,81 (Fig. 2a).

Fig. 2: Canonical RQC and fMet-RQC pathways.
Fig. 2: Canonical RQC and fMet-RQC pathways.The alternative text for this image may have been generated using AI.
Full size image

a Canonical RQC in yeasts and mammals. Collisions from stalled ribosomes or damaged mRNAs (rare codons, polybasic tracts, premature stops, drug-induced pausing, ribosome defects) trigger subunit splitting by Rli1 (yeast) or ABCE1 (mammals), with Rqc2 (NEMF) stabilizing the 60S–peptidyl-tRNA complex. Resolution proceeds via two branches. RQC-L: Ltn1 (LTN1) ubiquitinates the nascent chain; release is catalyzed by Vms1 (ANKZF1), followed by proteasomal degradation. RQC-C: peptidyl-tRNA hydrolysis by Pth1 (PTRH1) couples to C-degron surveillance; in mammals, the CRL2 receptor KLHDC10 recognizes Ala-tail/C-degron substrates, with no established budding-yeast counterpart79,80,151. b The yeast fMet-RQC pathway. Cold stress retains ScFmt1 in the cytosol, increasing the production of fMet-tRNAi from Met-tRNAi. When approximately 41 residues of an fMet-initiated nascent chain emerge from the exit tunnel, its Nt-fMet is sensed by Nip1 (eIF3c), which recruits Arf1/Arf2 GTPases to induce ribosome stalling. Stalled ribosomes then recruit Hcr1 (eIF3j) and the recycling factor Rli1, which splits subunits. The resulting 60S–peptidyl-tRNA species bearing fMet chains are extracted by as-yet unknown factors and degraded by the proteasome (mechanism unresolved), whereas 40S–mRNA species partition into stress granules to conserve metabolic resources and preserve mRNAs and translation components. During prolonged cold, Arf1/Arf2 dissociates from stalled ribosomes, allowing continued elongation and accumulation of fMet-initiated proteins that are subsequently cleared by the fMet/N-degron pathway. Under these conditions, Nt-fMet-bound Nip1 can co-aggregate with ribosomal subunits; such aggregates are only partially resolved by the Hsp104 disaggregase18. RQC, ribosome quality control; tRNA, transfer RNA.

Ribosome stalling causes ribosome collisions, which activate surveillance pathways such as no-go decay, non-stop decay, and non-functional rRNA decay72,75,80,83 (Fig. 2a). Collided ribosomes are selectively ubiquitinated on small subunit proteins by the E3 ligase Hel2 (ZNF598), which is an essential step for the canonical RQC activation84,85. This Hel2 (ZNF598)-dependent ubiquitination initiates ribosome rescue through two distinct routes: (i) the Dom34 (PELO)–Hbs1 (HBS1L)–Rli1 (ABCE1) pathway, in which Dom34/PELO mimics peptidyl-tRNA-hydrolysis-lacking eRF1 and probes stalled ribosome, Hbs1/HBS1L provides GTPase-driven ribosome conformational transition, and Rli1/ABCE1 (an ATPase) powers subunit splitting and recycling86,87, and (ii) the RQT (RQC trigger) pathway, composed of the RNA helicase/ATPase Slh1 (ASCC3), the ubiquitin-binding adaptor Cue3 (ASCC2), and the zinc-finger scaffold Rqt4 (TRIP4), which recognizes ubiquitinated stalled ribosomes and drives their dissociation88,89. (Mammalian counterparts of yeast RQC components are indicated in parentheses.)

Following subunit splitting by Rli1 (ABCE1), assisted by ATP hydrolysis and the tRNA/nascent-chain-binding factor Rqc2 (NEMF), 60S–peptidyl-tRNA complexes are processed through two branches. In the RQC-L (large subunit) branch, Rqc2 catalyzes C-terminal alanine/threonine (CAT) tailing90, Ltn1 (LTN1/Listerin) ubiquitinates the stalled chain73, and the peptidyl-tRNA hydrolase Vms1 (ANKZF1) hydrolyzes the peptidyl-tRNA bond, enabling proteasomal degradation76,91. In the RQC-C branch, peptidyl-tRNA hydrolase Pth1 (PTRH1) and C-end rule E3s — particularly CRL2–KLHDC10 and PIRH2 — target nascent peptides for degradation through recognizing C-terminal degrons (including Ala-tails)92,93,94 (Fig. 2a).

Cytosolic Nt-formylation triggers a new RQC mechanism

In yeast, cytosolic translation normally initiates with unmodified Met. However, under stress conditions such as cold exposure or nutrient deprivation, and upon ectopic expression of E. coli FMT, Met-tRNAi can be enzymatically converted into formylated fMet-tRNAi, initiating translation with fMet14,18. Because yeast lacks a PDF60, fMet-initiated proteins can accumulate and become potentially toxic. Notably, overexpression of E. coli FMT in S. cerevisiae converts up to ~70% of Met-tRNAi to fMet-tRNAi31. Despite this high conversion rate, Nt-formylated proteins represent only ~7% of detected protein species, and only ~5% of these are degraded by the fMet/N-degron pathway14. This discrepancy points to an additional co-translational quality control mechanism that limits the toxic buildup of fMet-bearing nascent chains18. Both ectopic E. coli FMT expression and cold-induced cytosolic retention of ScFmt1 similarly produce fMet–tRNAi, triggering fMet-dependent translation on cytosolic ribosomes. This activates the fMet-RQC pathway, a specialized branch of RQC distinct from canonical mechanisms18.

As the Nt-formylated nascent chain exits the ribosomal tunnel (around codon 41), elongation stalls. This stall is initiated by the recognition of nascent Nt-fMet by Nip1 (eIF3c), a translation initiation factor that localizes from the 40S to the 60S subunit, where it directly binds Nt-fMet. This event triggers a surveillance cascade that is independent of ribosome collision or mRNA damage18 (Fig. 2b). Nip1 recruits Arf1/Arf2, small GTPases of the ARF/ARL family, together with the recycling factor Rli1, likely via Hcr1 (eIF3j, a subunit of eukaryotic translation initiation factor eIF3). The GTPase activity of Arf1/Arf2 promotes Rli1-mediated subunit dissociation and also recruits the disaggregase Hsp104, thus preventing co-aggregation of short fMet-bearing nascent chains with ribosomes18 (Fig. 2b).

Mechanistically, the fMet-RQC pathway is distinguishable from canonical RQCs: disrupting canonical components (for example, Hel2, Dom34, and Ltn1) does not increase fMet-protein levels, whereas deleting fMet-RQC-specific factors (for example, Nip1 and Arf1) does18. Nonetheless, the fMet-RQC pathway likely cooperates with core RQC machinery — including the Dom34–Hbs1–Rli1 axis, the RQT complex, and the Rqc1/Rqc2/Ltn1 module — to eliminate stalled, fMet-bearing chains. The details of this interplay remain to be elucidated.

Nip1 acts as Nt-fMet sensor on the 60S

Nip1, a core subunit of eIF3 that typically associates with the 40S ribosomal subunit during translation initiation, is repurposed during early elongation to recognize Nt-fMet on the 60S subunit18,95. Similar to other eIF3 subunits (for example, eIF3e and eIF3h) that monitor nascent-chain properties during elongation96,97, Nip1 senses emerging Nt-fMet and activates the fMet-RQC pathway independently of ribosome collisions or no-go decay/non-stop decay surveillance18.

By linking an initiation factor to co-translational quality control, Nip1 participates in a broader network of fMet-recognition systems. This includes PDF, which removes Nt-formyl groups from nascent chains54; FPRs (FPR1/FPR2), which mediate immune and stress signaling20,21; and fMet/N-recognins such as Psh1 and TRIM52, which target formylated proteins for proteasomal degradation14,48.

ARF GTPases coordinate ribosome disassembly and stress-granule formation

Small GTPases of the ARF/ARL family, including yeast Arf1/Arf2 and mammalian ARF1, ARF3–5, are best known as GTP-regulated molecular switches involved in COPI/II vesicle trafficking, Golgi organization, lipid remodeling, and cytoskeletal dynamics98,99,100.

Recent studies expand their functional repertoire to include roles in RQC and stress granule (SG) assembly18. In yeast, GTP-bound Arf1/Arf2 associates with Nip1-bound 60S ribosomes during early elongation. Upon GTP hydrolysis, Arf1/2 stimulates Rli1-mediated ribosomal subunit dissociation. They also recruit Hsp104 to dissolve ribosome–nascent chain aggregates, whereas Hcr1 (eIF3j) stabilizes stalled 80S complexes101 and supports Rli1 loading during the Arf1/2 GTPase cycle18. Although these processes are characterized in yeast, direct mammalian evidence is not yet available.

Yeasts form SGs under near-freezing conditions as a protective response102,103. Arf1 has been implicated in cold tolerance104, although its mechanistic role was previously unclear. Recent evidence positions Arf1/Arf2 in fMet-RQC, in which they govern ribosome stalling and SG formation in response to cytosolic Nt-formylation. In arf1Δ cells, cytosolic ScFmt1 becomes toxic, as its detrimental effects outweigh the adaptive benefits of mitochondrial ScFmt1 during cold stress. This impairs fMet-RQC-dependent SG assembly and promotes aggregation of fMet-bearing polypeptides18.

Under acute cold stress, cytosolic ScFmt1 promotes fMet–tRNAi synthesis and activates fMet-RQC, thereby triggering SG assembly and repressing fMet-protein translation18. This response conserves ribosome and mRNA integrity while preserving102,103 (Fig. 2b). However, during prolonged cold, Arf1 levels decline and its binding to Nip1 weakens. As a result, fMet-RQC activity diminishes, leading to accumulation of toxic, aggregation-prone fMet-nascent chains and ribosome co-aggregation18 (Fig. 2b) — a phenomenon reminiscent of CAT-tail-induced aggregation seen in aging and neurodegeneration77,78.

Loss of Arf1/Arf2 allows continued fMet-protein synthesis18, which disrupts Nt-processing events (for example, Met excision and Nt-acetylation) and accelerates proteostasis collapse1,2,11. Although SG formation is often associated with eIF2α phosphorylation105, the precise roles of Nip1 and Arf1/2 in SG dynamics remain to be elucidated.

The fMet/N-degron pathway

Parallel evolution of Nt-surveillance

Since the discovery of N-degrons in 1986, the chemical identity of a protein’s Nt-residue has been recognized as a key determinant of its half-life by directing proteins to distinct proteolytic pathways3,106. In eukaryotes, five major branches have emerged: the Arg/N-degron pathway (targeting unmodified Nt-Arg, Lys, His, Leu, Ile, Phe, Trp, Tyr, and Met-Φ (a hydrophobic residue)), Ac/N-degron pathway (acetylated Nt-residues), Pro/N-degron pathway (Nt-Pro), GASTC/N-degron pathway (Nt-Gly, Ala, Ser, Thr, and Cys), and the fMet/N-degron pathway (Nt-fMet). Together, these systems maintain proteostasis via the ubiquitin–proteasome and, in some contexts, autophagy–lysosome machineries3,14,107,108,109,110,111,112,113,114,115 (Fig. 3).

Fig. 3: A unified framework of N-degron pathways and substrate recognition mechanisms.
Fig. 3: A unified framework of N-degron pathways and substrate recognition mechanisms.The alternative text for this image may have been generated using AI.
Full size image

ae Overview of eukaryotic N-degron pathways, classified by whether the Nt-residue is unmodified or bears an initiating modification and by the corresponding recognins3,111,152. a Mammalian Arg/N-degron pathway. Tertiary destabilizing residues Nt-Asn/Gln are deamidated by NTAN1/NTAQ1 to Nt-Asp/Glu, and Nt-Cys is oxidized by the dioxygenase ADO to Cys-sulfinic/sulfonic acid. These secondary residues are then arginylated by ATE1 to generate the primary destabilizing residue Nt-Arg. Primary destabilizing Nt-residues (Arg, Lys, His, Leu, Ile, Phe, Trp, Tyr, and Met-Φ (Met followed by a bulky hydrophobic at position 2)) are recognized by UBR1/2/4/5 (Arg/N-recognins), leading to polyubiquitylation and proteasomal degradation; arginylated clients can also be routed to the p62/SQSTM1-mediated autophagy–lysosome pathway. b The Ac/N-degron pathway. Nt-Met, Ser, Thr, Val, Ala, Cys, and Gly are co-translationally or post-translationally acetylated by NatA–NatH complexes, generating Ac/N-degrons. These are recognized by Ac/N-recognins, including Doa10 and Not4 in yeast and MARCHF6 and DCAF10 in mammals, triggering polyubiquitination and proteasomal degradation. c The yeast Pro/N-degron pathway. Nt-Pro is recognized by the GID E3 complex through interchangeable substrate receptors, including Gid4 (canonical) and Gid10 (conditional), driving polyubiquitination and proteasomal degradation. d The mammalian GASTC/N-degron pathway. Nt-Gly, Ala, Ser, Thr, and Cys are directly recognized by ZYG11B, ZER1, GID11, or IAP family members. e The fMet/N-degron pathway. Nt-fMet is specifically recognized by fMet/N-recognins, including Psh1 in yeast and TRIM52 in mammals, directing substrates to polyubiquitylation-mediated proteasomal degradation. f MARCHF6 Ac/N-degron recognition model. Structural model of the MARCHF6 Ac/N-recognin domain complexed with an Nt-acetylated peptide (Ac–Ala–Ser–Val–Ala–Val–Asp). Arg554 and Asn579 contact the Nt-acetyl group and position P1 to P3 of the peptide150. g TRIM52 architecture. Predicted dimeric organization of mammalian TRIM52, highlighting the RING domain, B-box, acidic region, and a putative fMet recognition core spanning residues 144–149 (ref. 48). h TRIM52–fMet docking. Docking of an Nt-fMet peptide into the TRIM52 dimer interface shows Tyr148 and nearby positively charged residues (Lys28, Arg144, and Arg149) coordinating the Nt-fMet, suggesting a unique recognition mechanism distinct from Ac/N-recognin.

Bacterial fMet/N-degron pathway

Bacterial translation initiates with fMet installed by FMT. As the nascent chain emerges from the 50S tunnel, PDF removes the formyl group, licensing Met aminopeptidase (MetAP), which preferentially excises Met when position 2 residue is small; PDF and MetAP compete at overlapping ribosomal sites near L22 (refs. 9,116). Genetics affirms this hierarchy: fmtΔ cells grow slowly yet survive30, whereas loss or inhibition of PDF is lethal in FMT-positive cells because MetAP cannot act on formylated N termini11,117; accordingly, fmtΔ pdfΔ double mutants resemble fmtΔ cells, underscoring the essential role of deformylation in Nt-maturation118. Despite robust co-translational deformylation, many nascent chains escape PDF owing to low PDF:ribosome stoichiometry (~2–3 µM versus ~30 µM), sequence-dependent kinetics, and early compaction that transiently buries the first ~10 residues8,9. Empirically, small DNA-encoded peptides frequently retain Nt-fMet — evidence that PDF escape is routine rather than exceptional11,119.

The notion that Nt-fMet can create a specific N-degron arose by analogy to the Ac/N-degron: a chemically analogous, position-specific modification that creates an N-degron107. Early hints came from chloroplast D2, in which blocking deformylation destabilized the protein120; later, bacterial reporters bearing the D2 N terminus showed formylation-dependent, PDF-sensitive turnover9,14. A ribosome-proximal fMet/N-degron was suggested: a recognin–protease module captures escaped Nt-fMet and initiates processive degradation9. The AAA⁺ protease FtsH was implicated in the turnover of YfgM, a membrane protein predicted to retain Nt-fMet, with Lon and ClpP-containing proteases as additional candidates of fMet/N-recognins9,121.

Conceptually, a tight PDF cloud processes most nascent chains co-translationally, whereas a looser recognin–protease cloud potentially intercepts PDF escapees co-translationally or post-translationally; trigger factor (engaging after ~100 residues) and SRP further shape the kinetic window by modulating early folding and Nt-exposure9,58. Functionally, the fMet/N-degron acts as front-line quality control under translational stress (for example, mistranslation and fidelity-lowering antibiotics), removing aberrant Nt-fMet species when PDF misses its window9. Nonetheless, key questions remain: identifying the bacterial fMet/N-recognin(s), mapping their native substrates, and elucidating how they coordinate with PDF in vivo9,14.

The yeast fMet/N-degron pathway

Cytosolic translation in eukaryotes was long thought to start with unformylated Met, but N-terminomics revealed Nt-formylation on nuclear-encoded proteins in S. cerevisiae14,15,16,17. Low-level endogenous Nt-fMet also arises from incomplete mitochondrial import of the native ScFmt1 (refs. 14,18). Genetic and biochemical evidence established Nt-formylation as a degradation signal (fMet/N-degron)14. Particularly, co-expression of active EcPDF removes Nt-fMet and stabilizes Nt-formylatable reporters, whereas an E3-ligase screen identified the tripartite motif (TRIM)-family E3 ligase Psh1 as the fMet/N-recognin14.

Psh1 directly binds Nt-fMet and, with its E2 partner Ubc3, mediates polyubiquitylation and proteasomal turnover; loss of Psh1 stabilizes these Nt-formylated proteins and sensitizes cells to proteotoxic stress, cold, and sodium azide14. Verified Psh1 clients with fMet/N-degrons include Cse4 (centromeric H3 variant), Pgd1 (mediator subunit), and Rps22a (40S subunit)14. Because Nt-formylation and Nt-acetylation are mutually exclusive, the fMet/N-degron branch likely complements the Ac/N-degron pathway: when Ac-CoA is limiting and Nt-acetylation wanes, Gcn2-driven ScFmt1 retention may elevate Nt-formylation so that Psh1-mediated clearance preserves proteostasis across fluctuating nutrient and temperature conditions14 (Fig. 3e).

The mammalian fMet/N-degron pathway

TRIM proteins are a large family of RING-type E3 ligases with a conserved N-terminal tripartite architecture: a RING finger domain (for E2 engagement and ubiquitin transfer), one or two B-box domains (zinc-binding motifs for structural integrity and protein–protein interactions), and a coiled-coil domain (for oligomerization)122. Their C-terminal regions are more variable, incorporating domains such as PRY/SPRY, NHL, or PHD-BR that confer substrate specificity, particularly in immunity and cancer123.

In the yeast fMet/N-degron pathway, the E3 ligase Psh1 acts as fMet/N-recognin14. Structural similarity between yeast Psh1 and mammalian TRIMs124, along with evidence for a cytosolic pool of human MTFMT47, motivated the search for a mammalian fMet/N-degron pathway. This led to the identification of TRIM52 as a human fMet/N-recognin48.

TRIM52 functions as a dimer and recognizes Nt-fMet through an evolutionarily conserved acidic loop embedded within its bipartite (split) RING domain. Tyr148 is essential for substrate binding but dispensable for E3 catalysis; the Y148A mutation abolishes fMet recognition while preserving polyubiquitylation activity. Structural modeling and molecular-dynamics simulations suggest that the formyl group helps neutralize local electrostatic repulsion to stabilize the substrate–E3 interface48.

Endogenous Nt-formylated substrates of TRIM52 include TPD54 (a tumor-associated trafficking protein) and SPTAN1 (a spectrin-associated cytoskeletal protein), which are polyubiquitylated and subsequently degraded by the proteasome48. Loss of TRIM52 causes accumulation of Nt-formylated proteins and caspase-3-dependent apoptosis; both phenotypes are rescued by reintroducing wild-type TRIM52 or by enhancing deformylation, which reduces the pool of fMet-bearing substrates48. A cytosolic fraction of MTFMT catalyzes Nt-formylation of nuclear-encoded proteins, paralleling stress-induced relocalization of yeast Fmt1. Notably, human TRIM52 can functionally substitute for Psh1 in yeast, underscoring the evolutionary conservation of fMet/N-degron recognition48.

Beyond its fMet/N-degron activity, TRIM52 is implicated in antiviral defense, inflammation, genome maintenance, and cancer, acting through nuclear factor (NF)-κB, STAT3, and Wnt/β-catenin signaling125,126,127,128,129. It restricts Japanese encephalitis virus by promoting NS2A degradation and activates NF-κB to induce pro-inflammatory cytokines130,131. TRIM52 has also been proposed as a prognostic marker for sepsis132. Its levels are tightly controlled by proteolytic network; loss of regulation leads to accumulation of TOP2 (topoisomerase 2)–DNA adducts, cell-cycle arrest, and engagement of DNA-repair factors133. How TRIM52 integrates its role as the fMet/N-recognin with proteome surveillance, immune regulation, and genome integrity remains an open question48.

fMet derivatives and physiological implications

From bacterial signatures to endogenous signals

Formyl peptides arise from both exogenous and endogenous sources and have immune as well as broader physiological consequences (Fig. 4). N-formylated peptides were first linked to neutrophil chemotaxis when N-terminally blocked E. coli peptides were found to attract leukocytes; structure–activity studies subsequently identified formyl–Met–Leu–Phe (fMLF) as the prototypic ligand and led to the discovery of the neutrophil G protein-coupled receptor FPR1 (refs. 134,135). Once considered exclusively pathogen-associated molecular patterns, formyl peptides are now also recognized as damage-associated molecular patterns during sterile inflammation, and, in addition to bacterial and mitochondrial sources, endogenous stress-responsive ligands may arise through cytosolic fMet-protein synthesis, the fMet–RQC pathway, and the fMet/N-degron pathway12,19,21,136 (Fig. 4a).

Fig. 4: Sources, immune signaling, and physiological implications of cytosolic fMet translation and formyl peptides.
Fig. 4: Sources, immune signaling, and physiological implications of cytosolic fMet translation and formyl peptides.The alternative text for this image may have been generated using AI.
Full size image

a Multiple sources of formyl peptides. Mitochondrial stress or leakage releases fMet-containing peptides as DAMPs (damage-associated molecular patterns), whereas bacterial fMet peptides act as PAMPs (pathogen-associated molecular patterns). Independently, under cold stress or nutrient starvation in yeast and under certain cancer contexts, normally mitochondria-directed FMTs accumulate in the cytosol and produce fMet-tRNAi. Cytosolic initiation with fMet-tRNAi generates fMet-bearing nascent chains and proteins. These cytosolic fMet species are monitored by the fMet–ribosome quality control (RQC) pathway in yeast (a mammalian counterpart is yet to be defined) and by the fMet/N-degron pathway in both yeast and mammals, likely yielding short formylated peptides. b Formyl peptides from bacteria, mitochondria, or cytosolic translation act as potent immune cues by activating the G protein-coupled FPR1 and FPR2, triggering degranulation, reactive oxygen species (ROS) production, inflammatory signaling, chemotaxis, and phagocytosis. c Physiological and pathological implications of cytosolic fMet translation. Left panel: fMet impacts proteostasis by influencing protein synthesis, degradation, aggregation, and RQC and by competing with other essential Nt-modifications; it can also interfere with proper targeting to organelles or membranes. Middle-left panel: fMet regulation may intersect with mitochondrial oxidative phosphorylation (OXPHOS) and maintenance of the proton gradient. Middle-right panel: fMet pathways may modulate cold adaptation in poikilothermic organisms and in body extremities. Right panel: Nt-fMet suppresses cancer cell proliferation and stemness-like traits (for example, SOX2 and CD24), indicating broader pathophysiological relevance beyond immune signaling46. FMT, formyltransferase; FPR, formyl peptide receptor; IMM, inner mitochondrial membrane; OMM, outer mitochondrial membrane; tRNA, transfer RNA.

FPR1 binds short formyl peptides with high affinity, whereas FPR2 is more permissive, accommodating broader range of ligands, including longer peptides, and can drive pro-inflammatory or anti-inflammatory signaling depending on ligand context; together, these receptors mediate chemotaxis, degranulation, reactive oxygen species production, inflammatory signaling, and phagocytosis in response to formyl peptides (Fig. 4b).

Notably, intact fMLF is rare in native samples and rapid deformylation or membrane sequestration limits extracellular release54,58. Beyond immune signaling, cytosolic fMet-linked pathways may have broader physiological and pathological consequences through effects on proteostasis, mitochondrial function, stress adaptation, and cancer-related phenotypes (Fig. 4c).

Clinical associations and therapeutic angles

Circulating fMet rises sharply in acute inflammatory states, particularly septic shock, in which plasma levels track severity and mortality; levels exceed those in bacteremia without sepsis, implicating host-derived production. Elevated fMet and fMet-peptides is also reported in hypertension, severe COVID-19, systemic sclerosis, vasculitis, and rheumatoid arthritis, where it promotes neutrophil activation137,138,139,140,141,142,143,144. Population studies link high fMet to impaired mitochondrial translation, age-related disease, and increased all-cause mortality141,142,143,144. Neutralization strategies underscore pathogenicity: immobilized anti-formyl peptide antibodies restore neutrophil (polymorphonuclear leukocyte) function in sepsis models, and both peptide-specific and pan-specific anti-fMet antibodies are in development15,16,17,145. Although fMLF remains a canonical probe, its rarity in vivo highlights the need to target endogenous, stress-responsive ligands.

Detection of fMet derivatives

N-terminomics

Proteome-wide detection of fMet-bearing proteins has been challenging owing to their rare and transient nature — mainly caused by co-translational deformylation or proteolytic degradation8,9,14. Additional technical barriers include the chemically blocked α-amino group, which resists conventional Nt-labeling, and the small mass shift introduced by Nt-formylation (+27.9949 Da), which closely resembles that by dimethylation (+28.0313 Da). As a result, reliable detection of Nt-formylation has historically required ultrahigh-resolution mass spectrometry146.

Recent advances in N-terminomics have begun to overcome these obstacles. Platforms such as iNrich and its miniaturized variant, tipNrich, enable selective enrichment of Nt-peptides — including fMet-bearing species — from femtomole-scale inputs, without requiring large protein quantities147,148. These tools provide unprecedented sensitivity for detecting low-abundance Nt-formylated proteoforms implicated in stress responses and disease (Fig. 5a).

Fig. 5: Experimental strategies for detecting N-terminally formylated peptides and proteins.
Fig. 5: Experimental strategies for detecting N-terminally formylated peptides and proteins.The alternative text for this image may have been generated using AI.
Full size image

a Workflow for Nt-proteomics analysis to detect formylated (fMet) and acetylated (AcMet) peptides. Proteins are subjected to chemical amine labeling to block free α-amines. After proteolytic digestion, Nt-peptides are enriched and analyzed by liquid chromatography tandem mass spectrometry (LC-MS/MS), allowing the identification of Nt-modifications based on mass spectra13,148. b Generation of polyclonal anti-fMet antibodies. Synthetic peptides containing Nt-fMet (for example, fMet–Gly–Ser–Gly–Cys: fMGSGC) or fMet–Xaa–Cys (fMXC, in which Xaa indicates any of the 20 standard amino acids) conjugated to keyhole limpet hemocyanin (KLH) are used to immunize rabbits, producing pan-specific anti-fMet antibodies. These antibodies enable immunodetection of fMet-containing proteins in bacteria, yeast, and human cells via immunoblotting or enzyme-linked immunmosorbent assay (ELISA)15,16,17. c Summary of anti-fMet antibodies. Sequence-specific anti-fMet antibodies recognize defined epitopes, including fMLC (anti-fMLF), fMDIAIGTYQEKC (anti-fMD-D2), fMSSKQQWVSSAGSC (anti-fCse4), fMGSGC (anti-fMGSG), and fMXC mixtures (X = any amino acid; anti-fMX; Merck, AB356487). C denotes the C-terminal cysteine added to conjugate the antigen peptides to carrier proteins. An anti-free fMet antibody (ImmuneChem, ICP1898) was raised against D-fMet or L-fMet conjugates. Although many commercial pan-fMet reagents are marketed for ELISA, in our hands these antibodies showed minimal reactivity toward N-terminally formylated peptides and proteins. By contrast, anti-fMXC gave the most reliable detection of Nt-formylated targets by immunoblotting and ELISA across bacteria, yeast, and mammalian cells. These reagents enable targeted detection and characterization of Nt-formylated species across systems.

Anti-fMet antibodies

Monoclonal anti-fMLF antibodies have been first used to detect free formyl peptides in enzyme-linked immunosorbent assays149. However, these antibodies exhibit limited affinity for formylated N termini embedded within polypeptides. As such, their application is generally restricted to specific formylated peptides and not suitable for general analysis.

To overcome these limitations, polyclonal pan-specific anti-fMet antibodies have been developed using both a defined peptide (fMet–Gly–Ser–Gly–Cys) and a synthetic peptide library featuring diverse second-position residues (fMet–Xaa–Cys; Xaa = any amino acid) as immunogens14,15,16,17. These antibodies exhibit broad sequence tolerance, high sensitivity, and cross-species reactivity, as validated by enzyme-linked immunosorbent assays and immunoblotting using synthetic peptides and a range of biological samples. They provide a scalable and cost-effective method for detecting Nt-formylated proteins across cytosolic, mitochondrial, and bacterial compartments (Fig. 5b).

Furthermore, many commercial anti-fMet reagents have been raised against free fMet haptens (Fig. 5c), which may not closely mimic the chemical context of Nt-formylated residues within peptide bonds. Consequently, their reactivity toward bona fide Nt-formylated proteins can be weak or variable (Fig. 5c). To ensure specificity, anti-fMet signals should be rigorously validated by: (i) competition assays using fMet peptides (not free fMet), (ii) genetic or pharmacological perturbation of FMT and PDF enzymes, and (iii) N-terminomic mass spectrometry of the same protein. Antibodies lacking these validation criteria should not be used to infer Nt-formylation in proteins.

An integrated platform for Nt-formylation mapping

The integration of high-sensitivity N-terminomic profiling with pan-fMet antibody detection has established a versatile platform for mapping Nt-formylation across diverse species and stress conditions. These advances have enabled the discovery of stress-inducible cytosolic fMet-protein synthesis and its regulation by both the fMet-RQC and fMet/N-degron pathways14,18,48. Together, they open new avenues for exploring Nt-formylation as a regulatory mechanism in translation control, proteostasis maintenance, and its emerging roles in disease and therapy.

Conclusions and future directions

Nt-formylation — once viewed as a prokaryotic relic — is now recognized as a conserved, stress-inducible cytosolic modification in eukaryotes. Under metabolic or environmental stress, Gcn2–ScFmt1-driven fMet initiation reshapes proteostasis by coordinating nascent-chain maturation, activating a dedicated fMet-RQC pathway and engaging the fMet/N-degron system for selective clearance14,18.

This program competes with other Nt-modifications and can redirect organelle and membrane targeting1,2. Its downstream effects intersect with mitochondrial metabolism (for example, OXPHOS), may facilitate cold adaptation in poikilotherms and peripheral tissues, and correlate with reduced proliferation and stemness-like traits in some cancers14,18,46.

Upstream regulation likely integrates ISR signaling and stress-dependent trafficking or activation of FMT. Partitioning of FMT between mitochondria and cytosol, combined with ISR input, likely defines quantitative thresholds for cytosolic Nt-formylation. By tuning the pool of fMet–tRNAᵢ in a tissue-specific and stress-specific manner, this axis potentially connects nutrient-sensing networks (for example, mTOR and AMPK) to translation initiation chemistry and broader homeostatic programs.

Mechanistically, the fMet-RQC pathway is established in yeast, and several core components (eIF3C, eIF3J, ARF GTPases, and ABCE1) are conserved, motivating biochemical reconstitution and structural analysis in mammals18. Key priorities include: (i) elucidating delivery routes for fMet–tRNAᵢ (eIF2D–DENR/MCTS1 versus eIF5B), (ii) identifying nascent-chain and ribosomal tunnel features that enforce the ≈codon-41 stall, (iii) characterizing mammalian counterparts of eIF3C and eIF3J, and (iv) resolving ARF/ARL–ABCE1 mechanics during ribosome splitting. Determining whether a cytosolic deformylase exists in eukaryotes and how its ISR/mTOR-regulated kinetics compete with MetAPs and Nt-acetyltransferases will clarify proteome-wide partitioning throughout maturation, fMet-RQC, and fMet/N-degron fates, as well as crosstalk among Nt-modifications.

A comprehensive atlas is now essential. Proteome-scale mapping of Nt-formylated proteins and formyl-peptides across cell types, tissues, and stress will define the eukaryotic substrate landscape and constrain models of specificity. Because direct detection is limited by lability and scarcity, progress will rely on sensitive formyl-preserving peptidomics (for example, short-peptide enrichment and derivatization/ion-mobility strategies), orthogonal validation (fit-for-purpose anti-fMet reagents and genetic/chemical perturbation of FMT, PDF, and TRIM52), and rigorous attribution to separate cytosolic from mitochondrial/bacterial products. Extending these pipelines to in vivo system will evaluate circulating formyl-peptides/proteins as biomarkers of cellular stress and disease states.

Across kingdoms, E3 ligases decode initiation chemistry to couple synthesis with degradation. In mammals, TRIM52 recognizes Nt-fMet and eliminates cytotoxic fMet-bearing proteins through proteasomal elimination, thereby maintaining proteostasis48. Defining the grammar of the mammalian fMet/N-degron — including substrate motifs, the endogenous substrateome, regulatory inputs, and integration with innate immunity and stress signaling — will establish the organizing principles of this pathway and clarify the conservation of fMet/N-degron pathway in multicellular eukaryotes.

The Nt-formylation axis is tractable: engineer mtPDFs for cytosolic activity; tune the balance between Nt-formylation and deformylation to regulate protein stability; modulate TRIM52 to reinforce proteostasis; and neutralize formyl peptides to temper FPR-driven inflammation. Systematic mapping of Nt-formylation dynamics across tissues and disease states promises to refine core principles of translational control and stress adaptation, while advancing diagnostics and interventions in immunity, aging, neurodegeneration, ischemia–reperfusion injury, and oncology.