Subcellular proteomics reveals a blueprint for endosymbiont integration in trypanosomatid Angomonas deanei

Hammond, Michael; Chmelová, Ľubomíra; van Geelen-Kuenzel, Natascha A.; Maurya, Anay K.; Ferreira, Eden R.; Puente, Vanesa; Cadena, Lawrence Rudy; Záhonová, Kristína; Dowle, Adam; Mottram, Jeremy C.; Nowack, Eva C. M.; Lukeš, Julius; Yurchenko, Vyacheslav

doi:10.1038/s41467-026-70084-0

Download PDF

Article
Open access
Published: 03 March 2026

Subcellular proteomics reveals a blueprint for endosymbiont integration in trypanosomatid Angomonas deanei

Nature Communications volume 17, Article number: 2241 (2026) Cite this article

3306 Accesses
5 Altmetric
Metrics details

Subjects

Abstract

The acquisition of endosymbionts is a fundamental process that has driven the evolution of eukaryotes. The tree of life is filled with cases of internalised prokaryotes that have become integrated into their hosts, often forming mutually beneficial relationships. The trypanosomatid Angomonas deanei is one such case, harbouring a single β-proteobacterial endosymbiont. This symbiotic relationship is highly advanced, as evidenced by the identification of host-encoded proteins that are targeted to the bacterium and control its division. To deeper understand this integration, we performed an in-depth subcellular proteomic analysis to determine the compartmental localisation of both host and endosymbiont proteins. Our analysis resolved over 5,000 host proteins and over 400 endosymbiont proteins. We used this rich dataset to identify several novel host-encoded proteins targeted to the bacterium, and validated our predictions using genetic manipulations and microscopy. By mapping the localised enzymatic repertoire, we were able to shed light on metabolic interplay between the two organisms. We confirmed an energetic basis for the previously observed association between the host’s glycosomes and its endosymbiont, and discovered an interaction between the endosymbiont and the host’s acidocalcisomes. This subcellular proteomic dataset provides a comprehensive foundation for future research into the remarkable process of bacterial integration.

The evolutionary origin of host association in the Rickettsiales

Article Open access 07 July 2022

VESPA: an optimized protocol for accurate metabarcoding-based characterization of vertebrate eukaryotic endosymbiont and parasite assemblages

Article Open access 09 January 2024

Origin and function of beneficial bacterial symbioses in insects

Article 27 March 2025

Introduction

Mitochondria and plastids, the central energy-providing organelles of eukaryotic cells, originated from bacterial endosymbionts that were acquired over one billion years ago and gradually became extensively integrated into the host cell^1,2. This process involved (i) a massive reduction of endosymbiont genes either through gene loss or replacement of certain endosymbiont proteins by those of the host, often via Endosymbiotic Gene Transfer (EGT), (ii) the emergence of protein translocons allowing for their import into the endosymbiont, (iii) an intricate interlinking of host and endosymbiont metabolism, (iv) the establishment of nuclear control over organelle division and segregation, (v) the creation of contact sites between endosymbiont and host organelle membranes, and (vi) the evolution of anterograde and retrograde signalling systems. This complex integration generated a synergistic and homoeostatic system, in which the former host and endosymbiont can no longer be regarded as separate organisms, but rather parts of a novel entity with new cellular and biochemical properties.

In nature, countless more-recently acquired bacterial endosymbionts provide diverse physiological benefits to their hosts, often with notable ecological and economic impact, such as the spheroid bodies from cyanobacteria, which allow diatom Epithemia to fix nitrogen, or the sulphate-reducing symbionts of obligatory anaerobe Anaeramoeba spp., which interact with host hydrogenosomes^3,4,5,6. Traditionally thought to interact mostly by metabolite exchange, recent studies have revealed certain symbioses progressing this integration much further. Indeed, along with metabolic functions, the endosymbiont-derived chromatophores of Paulinella chromatophora and nitroplasts of Braarudosphaera bigelowii synchronise their cell division with that of their hosts akin to true organelles⁷.

Trypanosomatid flagellates (Euglenozoa: Kinetoplastea) are obligatory parasites of vertebrates, invertebrates, and plants^8,9. Their notable dixenous representatives (two hosts in their life cycle) are of medical (Leishmania and Trypanosoma) or agricultural (Phytomonas) importance^10,11. All these lineages, however, have evolved from common ancestors with monoxenous parasites (one host in their life cycle) of insects^12,13.

Trypanosomatid endosymbionts were first described in mosquito-infecting Trypanosoma culicis¹⁴ that was later renamed Strigomonas culicis¹⁵. Its bacterium, Candidatus Kinetoplastibacterium spp. (Betaproteobacteria: Burkholderiales: Alcaligenaceae) belongs to the same bacterial group as those found in trypanosomatids of the genera Kentomonas and Angomonas, which together form the subfamily Strigomonadinae¹⁶. As judged from phylogenetic inferences, endosymbiont acquisition by a common ancestor of Strigomonadinae was a single evolutionary event 40–120 million years ago, marking the start of a long co-evolution process^17,18. The subfamily Strigomonadinae constitutes one of only two recognised endosymbiotic acquisitions within Trypanosomatidae, the other being Novymonas esmeraldas and its multiple copies of endosymbiont Ca. Pandoraea novymonadis¹⁹.

The relationships between symbionts and their Strigomonadinae hosts appear to be well-integrated and mutualistic, as evident from their highly reduced genome sizes relative to free-living bacteria^20,21, coordinated cell cycles^22,23, association with host glycosomes^24,25, and established metabolic cooperation alleviating the hosts’ dependence on the environmental availability of essential nutrients, such as heme, nucleotides, and certain amino acids^26,27,28.

Originally isolated from a reduviid bug Zelus leucogrammus²⁹, A. deanei infects a wide range of mosquito and blowfly species in nature^30,31,32. At least one strain of Angomonas deanei can be experimentally deprived of its endosymbiont³³ enabling resolution of factors essential to support this relationship. However, another A. deanei strain (ATCC PRA-265, used in this study) has proven incapable of experimental deprivation through all attempted conditions thus far³⁴. The establishment of a “toolkit” for genetic manipulation of the A. deanei nuclear genome has facilitated investigations of certain molecular underpinnings mediating this endosymbiont relationship^35,36,37. An initial proteomic characterisation of isolated endosymbionts has identified seven Endosymbiont-Targeted host Proteins (ETPs) regarded as key candidates in gaining nuclear control over the endosymbiont²⁴. Two of these ETPs, namely the dynamin-like protein ETP9 and the mostly intrinsically disordered protein ETP2, were shown to play essential roles in the division of the bacterial endosymbiont^34,38. Similarly, ornithine cyclodeaminase (originally encoded by an endosymbiont gene that was later transferred to the nucleus via EGT) is targeted to A. deanei glycosomes, presumably facilitating proline production in these organelles²⁴. It is plausible to suggest that other ETPs and EGTs are yet to be discovered and will be key in exploring the extent and defining features of this endosymbiotic integration.

To gain a more comprehensive perspective on the state of host-endosymbiont interactions in terms of protein targeting, metabolism, and cell biology, we employed subcellular proteomics, resolving over 5000 proteins and assigning 2938 of them to specific cell compartments, subcellular structures, as well as the endosymbiont, further identifying seven new putative ETPs through predictive clustering in A. deanei. The enzymatic localisation implies the endosymbiont’s metabolic dependence on energy substrates provided by the glycosomes, reflecting their close association. We additionally use this dataset to identify a novel association between the endosymbiont and acidocalcisomes, likely mediating calcium signalling between these compartments.

This localised repository of both A. deanei and endosymbiont proteins enhances our knowledge of the relationship between this trypanosomatid and its internal bacterium. Our work also shows the informative power of the subcellular proteomics for hypothesis exploration across new model organisms, particularly those, facilitating endosymbiotic relationships.

Results

LOPIT-DC: marker assignment for 21 well-defined cell compartments

Angomonas deanei was lysed via nitrogen cavitation and underwent Localisation of Organelle Proteins by Isotope Tagging by Differential Ultracentrifugation (LOPIT-DC), with fractionated distribution of proteins verified by label-free detection gels and western blot analysis (Supplementary Fig. 1). In total, 5796 proteins passed quality thresholds and were present across all four biological replicates. They included 5323 proteins encoded by the host and 473 by the endosymbiont, constituting 51% and 65% of their predicted proteomes, respectively. Analysis of t-SNE (t-distributed Stochastic Neighbour Embedding) representation for proteins, highlighting genome of origin (host or bacterium) demonstrates a single distinct cluster of endosymbiont-encoded proteins populated with several previously identified ETPs²⁴, with just six endosymbiont-encoded contaminant proteins outside of this cluster (individual inspection of fractional profiles of these proteins shows fractional inconsistency across replicates) (Fig. 1A, left and middle). Sub-localisation of the endosymbiont-encoded proteins revealed a mostly homogenous distribution of its four predicted bacterial compartments (i.e., cytoplasm, periplasm, inner and outer membranes), suggesting the bacterium remained mostly intact upon lysis of A. deanei cells (Fig. 1A, right) and informed our designation of a single set of marker proteins to define the endosymbiont for supervised classification (10–20 proteins were used for all marker groups within this dataset) (Supplementary Data 1A).

Fig. 1: Cell compartments of A. deanei illuminated via marker proteins across subcellular proteome. — **Fig. 1: Cell compartments of *A. deanei* illuminated via marker proteins across subcellular proteome.**

To assess host cell compartment distribution beyond the selection of antibody markers employed (Supplementary Fig. 1C), we analysed the host-encoded dataset via various annotation pipelines. Proteins with predicted mitochondrial targeting peptides (mTPs) were enriched across one expansive region of the t-SNE plot. Conversely, proteins with predicted signal peptides were enriched within three primary clusters, inspection of which showed canonical proteins of the endoplasmic reticulum (ER), Golgi apparatus, and acidocalcisomes, with each of these clusters also enriched in transmembrane domain proteins (TMDs). Other inspected clusters enriched for TMDs included proteins corresponding to the glycosomes and mitochondrial membranes (Fig. 1B, C).

In total, we curated a list of 351 marker proteins (Supplementary Data 1A), additionally informed by marker lists used for subcellar proteomics of Trypanosoma brucei and T. congolense³⁹. This list corresponded to 21 cellular regions (Fig. 1C) that displayed distinct fractional abundance profiles (four examples are shown in Fig. 1D, all profiles are documented in Supplementary Fig. 2). Certain components within specific cell compartments or complexes displayed fractional distinction enabling their sub-designation, for example, allowing us to assign separate markers for the mitochondrial matrix, as well as the inner and outer mitochondrial membranes (Fig. 1C, Supplementary Data 1A). Similarly, the nucleus separated into a chromatin cluster, other soluble components, as well as membranous proteins, while subunits of the proteasome complex showed fractional distinction from the proteasomal regulatory subunits (Fig. 1C). In contrast, proteins of other organelles/structures, including the large and small subunits of cytosolic ribosomes, remained tightly associated, as such we employed single marker sets for each of these compartments/structures.

Validation of subcellular localisations predicted by LOPIT-DC

Supervised classification predicted 2898 (50%) of the 5796 identified proteins to distinct sub-cellular structures based on scores above support vector modelling thresholds (Fig. 2A), with the remaining proteins classified with lower confidence, but ultimately predicted as ‘unknown’ (Supplementary Data 1B–D). To verify localisation of the predicted clusters, we tagged 20 proteins that either served as reference markers or were newly assigned to these clusters with enhanced green fluorescent protein (eGFP), using an overexpression system described previously^24,35 (Fig. 2A, Supplementary Data 1E).

Fig. 2: Tagged cell lines of A. deanei validate predictive clustering across multiple compartments. — **Fig. 2: Tagged cell lines of *A. deanei* validate predictive clustering across multiple compartments.**

Fluorescent signal confined to distinct cellular regions was observed for 13 of the tested proteins. By contrast, two other proteins showed ambiguous patterns, while five other recombinant proteins displayed no signal. The above-mentioned 13 proteins served to verify prediction localisations to the cytosol, nucleus, mitochondrion, glycosomes, Golgi apparatus, and endosymbiont (Fig. 2). Cytosolic marker CAD2222212 (Fig. 2B¹) and candidate CAD2221863 (Fig. 2B²) both show a broadly distributed signal lacking specific enrichment to any other recognisable sub-compartment. Conversely, a nuclear candidate CAD2215914 co-localises entirely with the nuclear DNA (Fig. 2B³), while a second nuclear candidate (CAD2220566) is confined to a smaller nuclear region, likely corresponding to the nucleolus (Fig. 2B⁴). Three mitochondrial candidates (CAD2222276, CAD2213008, and CAD2219020) display a tubular signal near the periphery of the cell (Fig. 2B^5–7), which reflects the structure and position of this single reticulated organelle³⁵. Moreover, one of these proteins (CAD2219020) additionally shows enrichment around the kDNA (Fig. 2B⁷, Supplementary Fig. 3⁷), a signal reminiscent of the kinetoplast-proximal profile typical for the T. brucei proteins of the mitochondrial matrix tagged with GFP⁴⁰.

To verify organelles lacking DNA, we employed double tagged cell lines, using the trans-Golgi network marker protein Arf-like 1 (Arl1) C-terminally tagged with the V5 epitope²⁴, for which we observed a co-localisation of the Golgi marker CAD2212931, and the candidate Golgi-associated protein CAD2219791 (Fig. 2B^8,9, Supplementary Fig. 3^8,9). To visualise the glycosomes, we employed the peroxisomal targeting signal 1 (PTS1 -SKL) fused to the C-terminus of the red fluorescent protein mCherry^35,41,42. This signal overlapped with that of the PTS1-bearing glycosomal marker CAD2212694, as well as with the signals for both glycosomal candidates, CAD2217526 and CAD2213015, the latter also bearing a PTS1 (Fig. 2B^10–12, Supplementary Fig. 3^10–12). Two microtubule candidates (CAD2214043 and CAD2217941) showed a faint flagella pattern, however, due to the very low fluorescence signals, their localisations remain ambiguous (Supplementary Fig. 3; Supplementary Data 1E).

To verify the newly assigned group of the host-encoded endosymbiont-localised proteins, we N-terminally tagged representative CAD2214939 and observed the corresponding fluorescence signal exclusively at the endosymbiont (Fig. 2B¹³, Supplementary Fig. 3¹³). We term this protein ETP10 as the newest member of this group of host-encoded proteins identified in a similar manner²⁴. Overall, the cell lines described above validate our predictive clustering.

An expanded list of identified ETPs

Of the 430 proteins confidently predicted to be localised in or at the endosymbiont (Supplementary Data 1B, C), 11 are encoded by the nuclear genome of A. deanei (Fig. 3A). This includes four previously identified ETPs [ETP1 (CAD2220707), ETP2 (CAD2221027, for corrected gene model please refer to³⁸), ETP3 (CAD2213480), and ETP5 (CAD2216821)²⁴], ETP10 (CAD2214939) newly identified via tagging in this study, along with six novel putative ETPs (Fig. 3A^A-F). Of the latter category, only CAD2214941 (A) and CAD221941 (D) possess functional annotation as a ‘myosin heavy chain protein’ and ‘structural maintenance of chromosomes protein’ respectively (Supplementary Data 1F). Two candidates, CAD2215126 (C) and CAD2218418 (F) are predicted to possess TMDs, while one other, CAD2222252 (D) contains an mTP (Supplementary Data 1F).

**Fig. 3: New putative endosymbiont-targeted proteins (ETPs) identified in subcellular dataset.**

Phylogenetic distribution of previously identified ETPs within this LOPIT-DC dataset (except for Euglenozoa-wide ETP5) is confined to the Kinetoplastea clade, with ETP1 lacking orthologues in any other species, suggesting its recent emergence in Angomonas spp. (Fig. 3B, Supplementary Data 1G). Strigomonadinae-restricted ETP2 and ETP10 are further absent from the divergent genome of Kentomonas sorsogonicus⁴³. Notably, putative ETPs CAD2214941 (A) and CAD2214943 (B) show sequence similarity and are assigned to the same orthogroup in TriTrypDB, yet, surprisingly, do not show similar distribution patterns amongst trypanosomatids (Fig. 3B). An inspection of their genomic position (on chromosome 4) shows their adjacency to one another, as well as to ETP10 (Fig. 3C). Each of these three genes remains interspersed by one shorter gene, which lacked the necessary peptide coverage for placement into this dataset, likely influenced by their notably shorter length relative to their neighbours (Fig. 3C). All three of these proteins (ETP10, A, B) additionally possess multiple coiled-coil regions, while ETP A and B are further predicted to carry extensive regions of α-helical globular domains (Supplementary Data 1F).

Notably, the previously identified ETP9 (CAD2212698) is predicted neither to the endosymbiont cluster, nor to any other clusters within this LOPIT-DC clustering (Figs. 1A, 3A). Previous studies have characterised ETP9 as a dynamin-like protein, interacting transiently with the outer division-site of the endosymbiont during late-stage division, otherwise, uniquely showing a weak cytosolic signal for the remainder of the cell cycle³⁴. Lacking confident designation to a marker-based predictive cluster, we employed a ‘marker-less’ unsupervised classification, which distributed ETP9 to a small group of 40 mostly hypothetical proteins (Fig. 3A, Supplementary Fig. 4, Supplementary Data 1H). Along with ETP9, these proteins are unified via their prominent fractional intensity at fraction 5 (9,000 × g), suggesting an associated density slightly lower than that of the endosymbiont, which peaks in the preceding fraction 4 (4500 × g) (Fig. 3D). This fractional profile suggests an associated density greater than specific protein complexes (such as the cytosolic ribosomes, Fig. 1D) and other low-density organelles (such as the Golgi apparatus, Supplementary Fig. 2), which sediment primarily in the following fractions. Since the only other investigated protein within this cluster is ETP9’s paralogue - a dynamin-related protein (CAD2218610, Fig. 3A), we term this the “dynamin cluster”.

Endosymbiont association with host nucleus and endoplasmic reticulum characterised in ‘contact site’ cluster

The bacterial endosymbiont of A. deanei has previously been documented in spatial proximity to the glycosomes²⁴, nucleus²², and ER⁴⁴. This prompted us to investigate whether fractional protein evidence of these associations was present in the analysed dataset. Predictive clustering designates a group of 44 host-encoded proteins, which we term “contact site” (Fig. 4A, blue), exhibiting a fractional profile mirroring that of the endosymbiont, but with reduced peak intensity (Fig. 4B). This group is enriched for nuclear membrane proteins along with several canonical ER components (Supplementary Data 1I) but shows distinct separation from the ‘native’ fractional profiles of both the soluble nucleus and ER clusters, of notably lower densities (Fig. 4B), suggesting that these proteins specifically sedimented with the endosymbiont. We suggest that unlike ETPs assigned directly to the endosymbiont cluster, this group of host-encoded proteins are targeted to host organelles, which then directly or indirectly tether to the bacterium and remain attached post cell lysis.

**Fig. 4: Nucleus, endoplasmic reticulum, glycosomes, and acidocalcisomes all show organelle interaction with endosymbiont.**

Manual inspection of the contact site cluster by a combination of targeting signal prediction, functional annotation, DeepLOC compartment prediction, analysis of orthologues present in T. brucei combined with their established localisations allowed assignment of these proteins to their “organelles of origin”, with approximately half of this group including the nuclear membrane (22), followed by a notable contingent of the ER (17), with a minority (7) appearing to originate from other or unknown cellular regions (Fig. 4C, Supplementary Data 1I). This contact site appears to comprise all nuclear pore complex proteins and is distinct from the clusters of both soluble nuclear proteins and chromatin components, which are depleted for the TMD-containing proteins (Fig. 1B, Fig. 4B). A separate TMD-enriched cluster of the ER proteins was resolved within this contact site (Fig. 1B, Fig. 4B, C), suggesting that while the entire membranous proteome of the nucleus has seemingly sedimented with the endosymbiont, only a small portion of the ER has remained attached after cell lysis, likely containing proteins immediately adjacent to the endosymbiont. The presence of unidirectional calcium (CAD2220154) and UDP-galactose (CAD2218422) transporters within the ER-assigned component of the contact site specifically suggests the transfer of these solutes from the endosymbiont to the ER (Supplementary Data 1I).

Acidocalcisomes interact with the endosymbiont in a similar manner to the glycosomes

Glycosomes exhibit a distinct fractional profile, with prominent intensity in fraction 4, representing endosymbiont-associated organelles, while their presence in fractions 5 through 8 reflects the densities of the un-associated or ‘free’ glycosomes sedimenting closer to their presumed native density (Fig. 4D). Accordingly, we demonstrate by fluorescent microscopy a subset of glycosomes in close proximity to the endosymbiont (Gl₁), as well as those separated and distributed across the cell (Gl₂) (Fig. 4E).

Another multicopy organelle of interest are the acidocalcisomes, lysosome-related organelles that also show a similar proteomic profile of endosymbiont-associated and endosymbiont-free patterns (Fig. 4F), suggesting an association with the endosymbiont in a similar manner to the glycosomes. This fractional similarity to the glycosomes can be observed via t-SNE spatial resolutions, which depict both the acidocalcisome and glycosome clusters proximal to each other (Fig. 4A).

We confirm this fractional association with transmission electron microscopy images showing a subset of the acidocalcisomes in proximity to the endosymbiont (Al₁), as well as those distant from it (Al₂) (Fig. 4G). Acidocalcisomes typically serve as acidified reservoirs of calcium, polyphosphates, amino acids, and various heavy metals^45,46. In the analysed dataset, this cluster is accordingly enriched for transporters of lysine, polyamines, phosphates, calcium, magnesium, potassium, zinc, and other metals (Supplementary Data 1B), suggesting the potential transfer of these substrates from acidocalcisomes to the endosymbiont.

Enzyme localisation expands on putative metabolite transfer between host and endosymbiont

Metabolic interplay between the host and endosymbiont represents a key feature of endosymbiotic integration. In the case of A. deanei, its bacterium is known to provide various metabolites and cofactors, such as heme, purines, and essential amino acids to the host^27,33,47, while the glycosomes are presumed to supply proline to the endosymbiont²⁴. Using the dataset of the current study, we localised core metabolic enzymes encoded by the host (Fig. 5, circles) and endosymbiont (Fig. 5, squares) to highlight putative inter-compartmental exchange of metabolites.

**Fig. 5: Enzyme distribution shows metabolic interdependency between host and the endosymbiont.**

We proteomically validated the glycosomal localisation of EGT-derived ornithine cyclodeaminase²⁴, which converts ornithine to proline, a key amino acid for energy generation of various insect-stage trypanosomatids^48,49 and endosymbiont localisation of a bacterial proline tRNA ligase (Fig. 5A). Furthermore, we document two glycosomal enzymes necessary to initiate this metabolic synthesis from the precursor amino acid, arginine, and localise succeeding enzymatic steps, allowing us to suggest not only transfer of proline from the glycosome to the endosymbiont, but also the mitochondrion, where it can be subsequently processed to 2-oxoglutarate for tricarboxylic acid (TCA) cycle incorporation, as commonly employed in the endosymbiont-lacking trypanosomatids⁵⁰ (Fig. 5A). The presence of a glycosome-localised copy of glutamate dehydrogenase also suggests that 2-oxoglutarate passes to the endosymbiont, since the bacterium is unable to generate this metabolite endogenously (Fig. 5A). While A. deanei possesses a complete set of enzymes for the TCA cycle, the endosymbiont’s metabolism is reduced almost entirely to the minimal set of enzymes necessary to power oxidative phosphorylation through NADH, exclusively generated by the conversion of 2-oxoglutarate to succinyl-CoA (succinyl coenzyme A), with additional capacity to produce succinate, and only possessing respiratory complexes I and V (Supplementary Data 1C, J). Lacking the capacity to process this metabolite further, we presume bacterial succinate is then shunted to the mitochondrion for utilisation (Fig. 5A).

Angomonas deanei is known to depend on its endosymbiont for heme synthesis^51,52,53, and accordingly, the pathway of heme synthesis from glutamate shows endosymbiont-exclusive enzymes from step II to VII (Fig. 5B). Step VIII is performed by enzymes from both the bacterium and the host (localised in the cytosol), allowing metabolite transfer to occur either in the form of protoporphyrinogen IX to the cytosol (after step VII), or protoporphyrin IX directly to the mitochondrion (after step VIII), to which the two final host-encoded enzymes of this pathway are confined (Fig. 5B). Notably, the final three host-encoded steps of heme synthesis all represent ancestral bacterial acquisitions via horizontal gene transfer^47,52.

In a typical trypanosomatid, the first seven steps of glycolysis/gluconeogenesis are confined to the glycosomes, with the remaining three enzymes localised in the cytosol^54,55 (Fig. 5C). The endosymbiont genome lacks the first three glycolytic enzymes⁵⁶ rendering a functional dependency on the adjacent glycosomes to initiate glycolysis from glucose and perform the energy consuming steps I and IIIa, before fructose 1,6-bisphosphate is putatively transferred to the bacterium (Fig. 5C).

Similar to glycolysis, the first three enzymes (the oxidative phase) of the Pentose Phosphate Pathway (PPP) have been lost from the endosymbiont genome, leaving this bacterium unable to conventionally power the subsequent steps (the non-oxidative phase), which nonetheless are proteomically detected (Fig. 5D). As PPP of the host is primarily cytosolic, with just a single glycosomally-localised enzyme powering step IV in tandem with its cytosolic counterpart (Fig. 5D), the endosymbiont’s PPP likely remains dependent on bacterial glycolysis/gluconeogenesis to supply shared metabolites, namely glyceraldehyde 3-phosphate and fructose-6-phosphate (Fig. 5C). In turn, this illuminates the metabolic rationale for retaining the gluconeogenic enzyme fructose-1,6-bisphosphatase (step IIIb, generating fructose-6-phosphate) in the genome, despite conventional gluconeogenesis being non-functional beyond this step in the endosymbiont (Fig. 5C).

The non-oxidative phase of PPP produces ribose-5-phosphate for subsequent nucleotide synthesis (Fig. 5D). The host trypanosomatid is known to depend on its endosymbiont for nucleotide provision⁵³ and, perhaps unsurprisingly, the host-encoded ribose-5-phosphate isomerase (step IV) has minimal peptide recovery (Fig. 5D). This suggests a near-complete ‘ceding’ of this key pathway subcomponent from the host to the endosymbiont, though without the noticeable gene loss that is observed in various pathways of the endosymbiont (Fig. 5A, C, D).

Discussion

Subcellular proteomic studies have provided valuable insights into biology of organelles for various protists, demonstrating, among others, production of one-carbon units and formate in mitochondrion-related organelles⁵⁷ or revealing lipid droplets spatially positioned adjacent to endosymbiotic green algae⁵⁸. Here, we employed LOPIT-DC to expand the proteomic perspective on the singular endosymbiont-derived compartment present in A. deanei, beyond initial characterisations. Seven ETPs were first identified in the A. deanei endosymbiont, screened by mass spectrometry for enriched host-encoded components²⁴, five of which were also resolved in the current dataset. New insights from the LOPIT-DC analysis include seven new host-encoded proteins that display similar fractional distribution to proteins in the endosymbiont cluster, including four aforementioned ETPs (Fig. 1A). These proteins represent promising candidates for in-depth investigations to clarify the control exerted by A. deanei over its singular endosymbiont, as has been shown for previous ETPs^34,38, further validated by the fluorescent signal observed for the new representative, ETP10 (Fig. 2B¹³). The broad signal of ETP10 observed across the entire endosymbiont is highly reminiscent of ETP1, which sub-localises specifically to the bacterial envelope^24,35. ETP10, like most other ETPs, lacks functional annotation, but here we show its genomic adjacency to a putative ETPA, functionally annotated as ‘myosin motor protein’ (Fig. 3C). Consequently, we posit that this position on the chromosome 4, with a putative ETPB being in the neighbouring position, represents a tandem gene array that has undergone radical genome rearrangement in order to account for endosymbiont presence.

Two previously identified ETPs not localised in this study, namely ETP7 and ETP8²⁴, lack the necessary peptide coverage to be confidently resolved under the current analysis scheme that relies on peptide presence in every fraction across all quadruplicates. It, thus, remains likely that relaxing the proteomic thresholds for inclusion may reveal a greater selection of less-abundant ETP candidates, albeit with reduced localisation confidence.

The LOPIT-DC localisation of ETP9, which was extraneous to the endosymbiont (Fig. 3A), is not entirely unexpected given its temporary association with the bacterium during late-stage endosymbiont division across the cell cycle³⁴. Uniquely, ETP9 exhibits no fractional association with other marker proteins employed in this study. Marker-based predictions are inherently limited by existing information available for a given organism and/or its close relatives, and we view ETP9’s distinct, reproducible fractionation pattern, shared with a small cohort of other proteins, as indicative of a novel undescribed cellular component for A. deanei, rather than an artefact of cell lysis. Within this cluster, only ETP9’s paralogue of dynamin-related protein has been investigated to any degree and localised to an apical region of the cell adjacent to the mitochondrion and flagellar pocket³⁸. Fractional comparison to other marker-based profiles shows greatest similarity to the microtubule and flagellum/microtubule clusters, which also peak in fraction 5, albeit with notably reduced intensity relative to the dynamin cluster (Supplementary Fig. 4C). As such, we interpret this cluster as a specialised component of microtubule-associated proteins, supported by a selection of motor and microtubule-binding domain proteins amongst the limited available annotations for this cohort (Supplementary Data 1H). An inspection of genes encoding this cluster revealed a strikingly high number (16) of adjacent or near-adjacent (separated by one gene) genes across A. deanei chromosomes 2, 4, 5, 9, and 21 (Supplementary Fig. 4D). Of these adjacent genes, eight are further assigned to matching orthogroups via TriTrypDB, which together suggests a series of gene duplications at these loci. We further note that one gene of this dynamin cluster, located on chromosome 4, immediately neighbours putative ETPB, predicted here to the endosymbiont (Fig. 3C, Supplementary Fig. 4D^VI). While validating experimentally this collection of several dozen proteins was beyond the scope of the present study, we speculate that, similar to ETP9, proteins of this cluster are of functional relevance to the endosymbiont and, thus, constitute a promising avenue of investigation in regard to the cytoskeletal modifications known to be produced by the endosymbiont-bearing Strigomonadinae⁵⁹.

We note that all ETPs discovered in this organism so far have been of eukaryotic origin (Supplementary Data 1G). The capacity for EGT in A. deanei has been documented only for ornithine cyclodeaminase, which is targeted to the adjacent glycosomes instead of the endosymbiont itself, representing an intriguing variation on the previously posited hypothesis on the necessity of protein targeting back to the endosymbiont before functional gene transfer can occur⁶⁰. While A. deanei appears to have developed the necessary mechanisms for targeting proteins to the endosymbiont, mass gene transfer from the bacterium to the host nucleus is yet to be documented. However, further investigation of protein phylogeny and experimental tagging is needed before we can conclusively rule out the presence of any ETPs of bacterial origin in this organism.

The metabolic factors leading to glycosomal association of Ca. Kinetoplastibacterium crithidii with its host remain incompletely defined. Transfer of proline to the proline-auxotrophic bacterium has been predicted²⁴ and is supported here with the localisation of proline tRNA ligase within the endosymbiont (Fig. 5A). Other metabolic inferences have been made from genome analysis of this endosymbiont, noting the loss of genes encoding several enzymatic steps across core metabolic pathways for the TCA cycle, glycolysis, and the PPP^26,27,61. Here, we complement these predictions with evidence that such modified pathways are proteomically present within the bacterium (Fig. 5), as opposed to being functionally redundant and lacking expression. Our enzymatic localisations ultimately predict an endosymbiont that is dependent on energy-rich substrates, including 2-oxoglutarate and fructose-1,6-bisphosphate, which can both be conveniently generated and supplied by the glycosomes (Fig. 5A, C).

We compare these findings with the endosymbiont Ca. Pandorea novymonadis of the related trypanosomatid Novymonas esmeraldas, regarded as a more recent acquisition⁵³, which, importantly, does not appear to be associated with the glycosomes of this host¹⁹. This bacterium is equally auxotrophic for proline, having lost this pathway since its divergence from the free-living relatives, but not via EGT to its host that can presumably produce proline in the cytosol⁶¹. A partial gene loss is also noted across core metabolic pathways of this endosymbiont, including the oxidative phase of the PPP. The losses across glycolysis of hexokinase (step Ia), phosphofructokinase (IIIa), phosphoglycerate mutase (VIII), and a pyruvate kinase (X) suggest an even more baroque transfer of metabolites between the bacterium and presumably the glycosomes, with a similar net ATP generation for the endosymbiont as that of A. deanei⁶².

In contrast to A. deanei, the symbiont of N. esmeraldas retains a complete TCA cycle with complexes I, II, IV, and V for oxidative phosphorylation⁶². The respiratory chain of the A. deanei endosymbiont is restricted to just complexes I and V⁶³, rendering the bacterium critically dependent on NADH likely generated from processing exogenous 2-oxoglutarate to power its limited oxidative phosphorylation (Fig. 6). As such, we propose that glycosomal energy provision for 2-oxoglutarate likely served as a critical contributor in the eventual endosymbiont association amongst members of the Strigomonadinae. Previous metabolic studies have demonstrated the endosymbiont stimulating A. deanei respiration by unknown means, while also showing a critical dependency of the trypanosomatid on complex II for initiating this process (with complex I being dispensable)⁶³. In this context, the bacterial processing of 2-oxoglutarate to succinate and return of this metabolite to the host, as postulated here (Fig. 5), represents a plausible mechanism for such respiratory stimulation. Key questions to answer for this metabolic arrangement include the specific transporters used for 2-oxoglutarate and succinate translocation across multiple membranes, as well as the specific electron shuttle employed by Ca. Kinetoplastibacterium spp., which has lost the ability to synthesise ubiquinone, yet nonetheless respires independently of A. deanei⁶³.

Fig. 6: Endosymbiont-interacting compartments of A. deanei. — **Fig. 6: Endosymbiont-interacting compartments of *A. deanei*.**

Established associations between the endosymbiont and the nucleus, ER, as well as the glycosomes can all be proteomically reconstructed via fractional comparisons generated in this work (Supplementary Fig. 2). We note that previous endosymbiont extractions²⁴, in which cells were sonicated followed by gradient centrifugation, did not produce definitive protein evidence for these associations, though we noted among ‘contaminant’ host-encoded proteins of that study CAD2219332, which we predict here to the glycosomes, and CAD2219478, classified (albeit with lower confidence) to the contact site (Supplementary Data 1K). We interpret the harsher purification conditions of the previously reported extraction procedure, in contrast to the nitrogen cavitation employed here, as less conducive to preserving these sensitive interactions with the endosymbiont.

We additionally use the current dataset to demonstrate a new organelle association between the endosymbiont and a subset of acidocalcisomes, which can also be documented by electron microscopy (Fig. 4). Moreover, an analysis of previous literature shows numerous examples of A. deanei acidocalcisomes imaged adjacent to the endosymbiont^22,23,59,64, which likely escaped notice due to the dispersed nature of this multicopy organelle across the cell.

While glycosomal proximity to the bacterium has traditionally been presumed to represent a key feature of metabolic integration with the endosymbiont (Fig. 5)²⁴, the functional interplay between the acidocalcisomes and this bacterium are less intuitive. In T. cruzi, the acidocalcisomes supply calcium to the mitochondrion to stimulate and regulate energy production⁶⁵ and, additionally, merge with the contractile vacuole to mediate osmotic regulation⁶⁶. While localised to the ER in mammals, the calcium channel inositol 1,4,5-triphosphate receptor is localised to the acidocalcisomes of T. cruzi, T. brucei⁶⁷, and A. deanei (Supplementary Data 1B). In the abovementioned members of the genus Trypanosoma, the acidocalcisomes coordinate calcium release for mitochondrial signalling to mediate cell growth, differentiation and infectivity⁶⁷. The signalling systems between A. deanei and its endosymbiont remain mostly undefined but, based on the current dataset, we can postulate presence of a putative signalling channel supplying calcium to the endosymbiont by the acidocalcisomes. In turn, calcium is returned to the host through the ER via a unidirectional calcium importer localised to the contact site (Fig. 6). While non-specific porins are predicted to the outer endosymbiont membrane, specific calcium transporters in its inner membrane are yet to be identified in the bacterial genome (Supplementary Data 1C), though we also consider the possibility of specific ETPs being employed to mediate such a role.

A subcellular analysis of A. deanei ultimately demonstrates that its intimate relationship with the endosymbiont involves the participation of dozens of host-encoded proteins through a complex combination of organelle contact sites and specific ETPs involved in metabolic and signalling coordination. Our work demonstrates the informative power of LOPIT-DC to reveal complex molecular interactions across an individual cell. The dissected trypanosomatid represents a highly suitable experimental model to study the molecular underpinnings of well-integrated endosymbiosis. The generated dataset shall also serve as a repository to explore, which factors related to control, signalling and metabolism have precipitated such an indispensable association.

Methods

Cultivation and validation

Angomonas deanei Carvalho ATCC PRA-265 (first described as Crithidia deanei) isolated from Zelus leucogrammus (Hemiptera, Reduviidae)²⁹ was cultivated at 28 °C in Schneider’s Drosophila medium supplemented with 10 µg/ml hemin (both from Sigma-Aldrich/ Merck, Darmstadt, Germany), 50 units/ml penicillin, 50 mg/ml streptomycin (both from Biowest, Nuaillé, France) as described elsewhere⁶⁸. Note that presence of abovementioned antibiotics has no effect on the bacterial endosymbiont. Cells were diluted twice a week in fresh medium once they reached a density of 1 × 10⁸ cells/ml. Species identity was validated as described previously^69,70.

Sample preparation

Cell lysis was performed by nitrogen cavitation as described previously^39,71,72 using a pre-chilled cell disruption vessel 4639 (Parr Instrument Company, Moline, USA). A total of 4 × 10⁹A. deanei cells from the logarithmic growth phase were washed in 1 × PBS (Phosphate-Buffered Saline), resuspended in the homogenisation medium (0.25 M sucrose, 10 mM HEPES/KOH pH 7.4, 1 mM EDTA, Halt protease inhibitor cocktail (all from Thermo Fisher Scientific, Waltham, USA)), and disrupted at 1200 psi for 15 min on ice. Lysates were brought to 2 mM of magnesium acetate tetrahydrate and 500 units of Benzonase nuclease (Sigma-Aldrich/-Merck) and kept at room temperature (RT) for 20 min, to remove nucleic acids and reduce sample viscosity, followed by 15 min incubation on ice. Lysates underwent differential centrifugation as described previously⁷³ (Supplementary Fig. 1A), with fraction 4 modified from 5000 × g to 4500 × g to enable this fraction to be collected in conventional centrifuge Falcon tubes. The experiment was performed in four biological replicates. Replicate protein abundance was assessed via 4-20% Mini-PROTEAN TGX Stain-Free Protein Gels (Bio-Rad), where 10 μg protein for each fraction was loaded into separate wells, with protein band size referenced against a Precision Plus Protein Strep-tagged recombinant molecular ladder (Bio-Rad) (Supplementary Fig. 1B). Immunoblots were additionally performed on each replicate using the following primary antibodies as indicators for compartment fractional distribution: anti-histone H3 (1:1,000, Abcam, Cambridge, UK, ab18521), anti-EF1α (1:1,000, Merck, 05-235 clone CBP-KK1 [monoclonal]), anti-tubulin β (1:2,000, Sigma-Aldrich/Merck, T0198, clone D66 [monoclonal]), anti-BIP (1:1000, provided by Dr. Bangs⁷⁴), anti-MVK and anti-HMGCS (1:10,000 and 1:5,000 respectively, provided by Dr. González-Pacanowska⁷⁵), anti-OPB (1:20,000)⁷⁶ (Supplementary Fig. 1C). Secondary HRP-labelled anti-rabbit and anti-mouse IgG antibodies were from Promega (Madison, USA); HRP-labelled anti-sheep IgG antibody was from GeneTex (Irvine, USA), all at 1:5,000.

Digestion, TMT labelling, and high pH reverse phase fractionation

Differential centrifugation fractions were diluted at a 1:3 ratio with 50 mM triethylammonium bicarbonate (TEAB) in 5% sodium dodecyl sulphate and the equivalent of 50 µg of total protein was taken from each fraction for processing. Proteins were reduced with 5.7 mM tris(2-carboxyethyl)phosphine at 55 °C for 15 min, alkylated with 22.7 mM methyl methanethiosulfonate for 10 min at RT, and acidified with 27.5% phosphoric acid precipitated with 7 volumes of 100 mM TEAB 90% (v:v) methanol. Precipitated proteins were captured on S-trap C02-micro columns (ProtiFi, Fairport, USA), washed 5 × with 165 µl 100 mM TEAB 90% (v:v) methanol, and digested with 20 µl 0.1 µg/µl Trypsin/Lys-C mix (Promega) in aqueous 50 mM TEAB at 47 °C for 2 h. Peptides were recovered from the S-traps by centrifugation at 4000 × g for 60 s. Columns were further washed with 40 µl aqueous 0.2% (v:v) formic acid and 40 µl 50% (v:v) acetonitrile:water and eluates were combined. Peptides were dried in a vacuum concentrator and resuspended in 50 µl aqueous 50 mM TEAB for TMT labelling.

Peptides were labelled using TMT11-131C Label Reagent (Thermo Fisher Scientific) following the manufacturer’s protocol, with the exception that half of the TMT reagent (0.4 mg) was used per fraction. Samples were combined post labelling, dried in a vacuum concentrator, resuspended in 100 µl water, and loaded onto the 1260 Infinity II LC System (Agilent, Santa Clara, USA) equipped with a Waters XBridge 3.5 µm C18 column (2.1 × 150 mm) (Thermo Fisher Scientific). Separation used gradient elution of solvents A (0.1% ammonium hydroxide) and B (acetonitrile containing 0.1% of ammonium hydroxide). The flow rate was 200 µl/min; the column temperature was 40 °C. The linear multi-step gradient profile for the elution was: 5–35% B over 20 min, 35–80% B over 5 min, the gradient was followed by washing with 80% solvent B for 5 min before returning to initial conditions and re-equilibrating for 7 min prior to subsequent injections. Eluant was collected at 1 min intervals into the protein LoBind tubes (Eppendorf, Hamburg, Germany). Peptide elution was monitored by UV absorbance at 215 and 280 nm. Fractions were pooled across the UV elution profile to give 12 fractions for LC-MS/MS acquisition. Peptide fractions were dried in a vacuum concentrator before reconstituting in 20 µl aqueous 0.1% (v:v) trifluoroacetic acid.

Liquid chromatography-tandem mass spectrometry (LC-MS/MS)

Fractionated TMT-labelled peptides were loaded onto a Vanquish Neo nano UHPLC equipped with an Acclaim PepMap 100 Å C18 trap (5 µm, 1 × 5 mm) and an Easy-Spray PepMap Neo nano C18 analytical column (2 μm, 75 μm × 500 mm) (all Thermo Fisher Scientific). Separation used gradient elution of solvents A (0.1% formic acid) and B (80% acetonitrile containing 0.1% formic acid). The flow rate for the capillary column was 250 nl/min; the column temperature was 40 °C. The linear multi-step gradient profile was: 5–32% B over 70 min, 32–50% B over 15 min, 50–99% B over 2 min, and then proceeded to wash with 99% solvent B for 3 min. The column was returned to initial conditions and re-equilibrated before subsequent injections.

The nanoLC system was interfaced with an Orbitrap Exploris480 mass spectrometer equipped with an EasyNano ionisation source via FAIMS Pro Duo (all Thermo Fisher Scientific). Positive ESI-MS and MS2 were acquired using Xcalibur v. 4.7 (Thermo Fisher Scientific). Instrument source settings specified ion spray voltage at 1800 V and ion transfer tube temperature at 275 °C. The FAIMS CV was set to −45 V. The MS1 spectra were acquired with 120 K resolution, scan range of m/z 350–1200, AGC target, and other settings left at default. Data-dependent acquisition was performed in a top speed mode using 1 s cycle, selecting the most intense precursors with charge states 2–5. Dynamic exclusion was performed for 45 s post precursor selection with a tolerance of 10 ppm and a minimum threshold for fragmentation set at 5e³. The MS2 spectra were acquired with 45 K resolution; quadrupole isolation 0.7 m/z; HCD collision energy 35%; AGC target, first mass 110 m/z; max fill time 96 msec.

Database searching and TMT quantification

Peak picking, database searching, TMT reporter ion extraction and quantification were performed using Proteome Discoverer v. 3.1 (Thermo Fisher Scientific). CHIMERYS v. 2.7 (Thermo Fisher Scientific)⁷⁷ was used as the search engine set against the custom-built A. deanei and Ca. Kinetoplastibacterium crithidii proteomes along with common proteomic contaminants. Search criteria were specified as follows: charge, 1-6; mass error, 10 ppm; missed cleavage, max 2; dynamic modifications, oxidation (M); static modifications, TMT6/10/11-plex (peptide N-terminus and K). Searches were run with a strict false discovery rate of 0.01 and filtered to require a minimum of two unique peptides per accepted protein.

LOPIT subcellular predictions

Relative, normalised, TMT-derived protein abundances across differential centrifugation fractions among the four biological replicates were used as input values for LOPIT subcellular localisation prediction. Data was filtered to require protein quantification in all four biological replicates, and a minimum of two unique corresponding peptides. Data was imported and processed in R, primarily via pRoloc package as detailed in published Bioconductor workflows⁷⁸.

The 351 manually curated protein groups were used as a training set for support vector machine (SVM) model with ‘svmOptimisation’ and svmClassification’ functions using pRoloc. Marker proteins were chosen based on their shared fractional distribution patterns across four biological replicates. 100 rounds of five-fold cross-validation were performed to optimise the SVM parameters based on marker protein abundance profiles. The optimal parameters for the SVM classifier were then applied to all proteins in the dataset with a corresponding SVM score ranging from 0 to 1, with 1 being the score of marker proteins. The SVM classifier was then applied to non-marker proteins, with corresponding weights applied to each marker category (Supplementary Data 1D). Each protein was thus classified to one compartment, and any protein whose classification fell below the global median SVM score was reset to ‘unknown’ while the other half of the dataset was considered “predicted” to its corresponding compartment due to their higher SVM scores. Quantitative separation values were additionally calculated to determine spatial resolution between each predicted cluster (Supplementary Fig. 5).

Unsupervised clustering was also performed, using the K-means (KM) algorithm implemented in the MLearn function from the MLInterfaces package in Rstudio v. 1.78.0. KM generates k-random centroids and includes surrounding data points iteratively such that all data points are included in one of the k clusters and the size of each centroid is minimised. KM clusters were generated with 21 clusters (Supplementary Fig. 5) corresponding to the number of marker groups used in Supplementary Data 1B, C. Cluster 1 and its 40 constituent proteins was ultimately designated as the ‘dynamin cluster’ (Supplementary Fig. 4A).

All proteins of this dataset underwent annotation via GhostKOALA⁷⁹, with host-encoded proteins undergoing targeting signal predictions via SignalP 6.0⁸⁰, TargetP 2.0⁸¹, DeepLoc 2.1⁸², DeepTMHMM 1.0⁸³, and BLASTp searches against parasitic T. brucei and free-living Bodo saltans from the TriTrypDB release 68⁸⁴ with e-value cut-off 1e^-5 (Supplementary Data 1B). In certain cases, orthologue localisation was also determined using TrypTag⁸⁵ or T. brucei LOPIT data³⁹ (Supplementary Data 1F, H, I). Endosymbiont-encoded proteins underwent DeepLocPro 1.0 compartment predictions⁸⁶ and BLASTp searches against the predicted proteome of free-living Burkholderia thailandensis E264 (Supplementary Data 1C). Putative ETP proteins along with newly identified ETP10 additionally underwent analysis via InterPro v. 98.0⁸⁷ (Supplementary Data 1F).

Construction of plasmids

For the generation of plasmids pAdea423, pAdea425, pAdea429, pAdea436, pAdea444, pAdea445, pAdea446, pAdea447, pAdea448, pAdea450, pAdea451, pAdea461, pAdea462, pAdea463, pAdea464, and pAdea465, genes for CAD2219020, CAD2214939, CAD2218596, CAD2222276, CAD2213008, CAD2221863, CAD2220061, CAD2222212, CAD2219791, CAD2221326, CAD2212931, CAD2215914, CAD2220566, CAD2219447, CAD2217941, and CAD2214043 were amplified from A. deanei genomic DNA (gDNA) using the primer combinations 3140/3141, 3134/3135, 3207/3208, 3209/3210, 3211/3212, LC01/LC02, LC03/LC04, LC05/LC06, LC07/LC08, LC15/LC16, LC11/LC12, VP01/VP02, VP03/VP04, VP05/VP06, VP07/VP08, and VP09/VP10, respectively (Supplementary Data 2). For the C-terminal eGFP-tagging, these inserts were used to replace the lacZ cassette in the ‘tagging vector’ pAdea235 containing an egfp gene immediately downstream of the lacZ cassette (Supplementary Fig. 6) employing Golden Gate cloning^24,88. Similarly, for the generation of plasmids pAdea424, pAdea426, pAdea432, pAdea433, pAdea449, and pAdea452, genes for CAD2219020, CAD2214939, CAD2212694, CAD2217526, CAD2219791, and CAD2212931 were amplified from gDNA using primer pairs 3151/3152, 3145/3146, 3122/3123, 3126/3127, LC09/LC10, and LC13/LC14, respectively. These inserts were used to replace the lacZ cassette in the tagging vector pAdea043 with an egfp gene immediately upstream of the lacZ cassette by Golden Gate cloning resulting in expression cassettes for fusion constructs with N-terminal eGFP-tags. For the construction of plasmid pAdea428, the gene for CAD2218596 was amplified from gDNA using primer pair 3149/3150 and the backbone (pUMA 1467-δ-ama fr 5′-neo^r-gapdh ir-δ-ama fr 3′) from pAdea340 (Supplementary Fig. 6) using primer pair 3147/3148. Both fragments were assembled by Gibson assembly as described earlier²⁴. For the construction of plasmid pAdea427, the gene for CAD2217526 was amplified from gDNA using primer pair 3130/3131. pAdea235 was digested with BsaI to remove the lacZ cassette, which was replaced with the amplified insert using Gibson assembly, resulting in an expression cassette for a fusion construct with C-terminal eGFP-tag. For the construction of plasmids pAdea439 and pAdea443, the genes for CAD2219607 and CAD2213015 were amplified from gDNA using primer pairs 3128/3129 and 3124/3125, respectively. pAdea043 was digested with BsaI to remove the lacZ cassette, which was replaced with the amplified insert using Gibson assembly, resulting in expression cassettes for fusion constructs with N-terminal eGFP-tags. All plasmids were verified by sequencing at Microsynth (Balgach, Switzerland) and Eurofins Genomics (Ebersberg, Germany).

Generation of transgenic cell lines

DNA cassettes excised from plasmids were stably integrated into the A. deanei nuclear genome via homologous recombination as described earlier²⁴. In brief, 1 × 10⁷ cells were resuspended in 17–18 µl of the P3 primary cell solution (Lonza, Basel, Switzerland), mixed with 2-6 µg of the linearised plasmid (in 2–3 µl water) and pulsed using transfection programme FP 158 in a 4D Nucleofector (Lonza). Electroporated cells were recovered in growth medium for 6 h at 28 °C before the respective antibiotic(s) were added. Hygromycin B Gold (InvivoGen, San Diego, USA) and G418 (neomycin) (Sigma-Aldrich/Merck) were used at the final concentration of 500 µg/ml. Clonal cell lines were generated by limiting dilution. Genomic DNA of selected clones was isolated using an adapted DNAzol-based protocol (Thermo Fisher Scientific) and clones were verified by a touch-down PCR using a combination of primers that bind outside of the insertion cassette in the genome and/or Phusion PCR using a combination of primers where one primer binds outside the insertion cassette and the other inside (Supplementary Data 2). To verify the localisation of potential Golgi apparatus and glycosomal proteins, the generated cell lines expressing eGFP-tagged Golgi and glycosomal candidate proteins were additionally transfected with an Arl1-V5 and mCherry-SKL constructs, respectively (Supplementary Fig. 6) to assess co-localisation.

Epifluorescence microscopy, immunofluorescence assay, and transmission electron microscopy

To detect the autofluorescence of fluorescent fusion proteins in A. deanei, epifluorescence microscopy was performed as described previously²⁴. In brief, log-phase grown cells were fixed with 3-4% formaldehyde and incubated for 10 min at RT in the dark. Fixed cells were washed twice in 1 × PBS, spotted onto poly-L-lysine-coated glass slides, stained with Hoechst 33342 (30 µg/ml final concentration in PBS), and mounted with antifade reagent SlowFade or Prolong Diamond (both Thermo Fisher Scientific). Imaging was performed with an Axio Imager M.1 (Zeiss, Oberkochen, Germany) using an EC Plan-Neofluar 100×/1.30 Oil Ph3M27 objective (Zeiss). Images were analysed with Zen Blue v. 2.5 (Zeiss) and processed with ImageJ2 software⁸⁹. For mitochondrial staining, cells were centrifuged at 7000 × g for 5 min at RT and the cell pellet was resuspended in 1 × PBS supplemented with 10 mM glucose (Thermo Fisher Scientific). The MitoTracker DeepRed FM Dye (Thermo Fisher Scientific) in DMSO was added to the cells to a final concentration of 1 µM and incubated at 28 °C for 30 min. The cells were washed twice in 1 × PBS and proceeded with formaldehyde fixation as above.

The immunofluorescence assay was performed as described earlier³⁴. In brief, cells were fixed with formaldehyde, washed three times in 1 × PBS, and spotted onto poly-L-lysine-coated glass slides. Attached cells were permeabilised with 0.2% TritonX-100, washed with 1 × PBS, and blocked for 45 min at RT in 1% blocking solution (albumin bovine fraction V, pH 7.0 (SERVA, Heidelberg, Germany) in 1 × PBS). Next, cells were incubated with mouse anti-V5 primary antibody (ChromoTek/Proteintech, Planegg, Germany) at 1:100 for 1.5 h at RT, washed thrice in 1% blocking solution, and incubated with anti-mouse IgG secondary antibody conjugated to CruzFluor™ 594 (Santa Cruz Biotechnology, Dallas, USA) at 1:100 for 1 h at RT. Cells were washed, stained with Hoechst 33342, mounted with SlowFade Diamond, and imaged as above.

Angomonas deanei wildtype cells were prepared for transmission electron microscopy and imaged as described earlier³⁴.

Homology searches

Identified and candidate ETPs served as queries in BLASTp v. 2.11.0 + ⁹⁰ and tBLASTn searches (e-value cut-off 1e^-5) against a custom-built protein database covering the eukaryotic and prokaryotic diversity and trypanosomatid genomes in NCBI, respectively. Open reading frames of genomic hits were extracted and translated in Geneious Prime v. 2025.1.2⁹¹. All identified hits were searched by BLASTp (e-value cut-off 1e^-5) against the A. deanei genome-derived proteome to identify reciprocal best hits, i.e., orthologues. Additionally, A. deanei ETPs were aligned by MAFFT v. 7.458⁹² with their orthologues from TriTrypDB release 68, profile Hidden Markov Models were prepared, and HMMER v. 3.3 searches were performed (e-value cut-off 1e^-10) in the same protein database. Again, identified hits were searched by BLASTp (e-value cut-off 1e^-5) against A. deanei proteome.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The raw data generated in this study have been deposited to the ProteomeXchange Consortium via the MassIVE partner repository (MSV000098972, doi:10.25345/C55D8NT39) with the dataset identifier PXD067873. Source data are provided with this paper.

References

Roger, A. J., Muñoz-Gómez, S. A. & Kamikawa, R. The origin and diversification of mitochondria. Curr. Biol. 27, R1177–R1192 (2017).
Article CAS PubMed Google Scholar
Keeling, P. J. The endosymbiotic origin, diversification and fate of plastids. Philos. Trans. R. Soc. Lond. B Biol. Sci. 365, 729–748 (2010).
Article CAS PubMed PubMed Central Google Scholar
Husník, F. et al. Bacterial and archaeal symbioses with protists. Curr. Biol. 31, R862–R877 (2021).
Article PubMed Google Scholar
Kaltenpoth, M., Florez, L. V., Vigneron, A., Dirksen, P. & Engl, T. Origin and function of beneficial bacterial symbioses in insects. Nat. Rev. Microbiol. 23, 551–567 (2025).
Vorburger, C. Defensive symbionts and the evolution of parasitoid host specialization. Annu Rev. Entomol. 67, 329–346 (2022).
Article CAS PubMed Google Scholar
Jerlström-Hultqvist, J. et al. A unique symbiosome in an anaerobic single-celled eukaryote. Nat. Commun. 15, 9726 (2024).
Article ADS PubMed PubMed Central Google Scholar
Sørensen, M. E. S., Stiller, M. L., Kröninger, L. & Nowack, E. C. M. Protein import into bacterial endosymbionts and evolving organelles. FEBS J. https://doi.org/10.1111/febs.17356 (2025).
Kostygov, A. Y. et al. Euglenozoa: taxonomy, diversity and ecology, symbioses and viruses. Open Biol. 11, 200407 (2021).
Article PubMed PubMed Central Google Scholar
Maslov, D. A. et al. Recent advances in trypanosomatid research: genome organization, expression, metabolism, taxonomy and evolution. Parasitology 146, 1–27 (2019).
Article PubMed Google Scholar
Stuart, K. et al. Kinetoplastids: related protozoan pathogens, different diseases. J. Clin. Invest 118, 1301–1310 (2008).
Article CAS PubMed PubMed Central Google Scholar
Frolov, A. O., Kostygov, A. Y. & Yurchenko, V. Development of monoxenous trypanosomatids and phytomonads in insects. Trends Parasitol. 37, 538–551 (2021).
Article CAS PubMed Google Scholar
Kostygov, A. Y. et al. Phylogenetic framework to explore trait evolution in Trypanosomatidae. Trends Parasitol. 40, 96–99 (2024).
Article CAS PubMed Google Scholar
Lukeš, J., Skalický, T., Týč, J., Votýpka, J. & Yurchenko, V. Evolution of parasitism in kinetoplastid flagellates. Mol. Biochem Parasitol. 195, 115–122 (2014).
Article PubMed Google Scholar
Novy, F. G., MacNeal, W. J. & Torrey, H. N. The trypanosomes of mosquitoes and other insects. J. Infect. Dis. 4, 223–276 (1907).
Article Google Scholar
Teixeira, M. M. et al. Phylogenetic validation of the genera Angomonas and Strigomonas of trypanosomatids harboring bacterial endosymbionts with the description of new species of trypanosomatids and of proteobacterial symbionts. Protist 162, 503–524 (2011).
Article PubMed Google Scholar
Votýpka, J. et al. Kentomonas gen. n., a new genus of endosymbiont-containing trypanosomatids of Strigomonadinae subfam. n. Protist 165, 825–838 (2014).
Article PubMed Google Scholar
Du, Y., Maslov, D. A. & Chang, K. P. Monophyletic origin of β-division proteobacterial endosymbionts and their coevolution with insect trypanosomatid protozoa Blastocrithidia culicis and Crithidia spp. Proc. Natl. Acad. Sci. USA 91, 8437–8441 (1994).
Article ADS CAS PubMed PubMed Central Google Scholar
Skalický, T. et al. Endosymbiont capture, a repeated process of endosymbiont transfer with replacement in trypanosomatids Angomonas spp. Pathogens 10, 702 (2021).
Article PubMed PubMed Central Google Scholar
Kostygov, A. et al. Novel trypanosomatid - bacterium association: evolution of endosymbiosis in action. mBio 7, e01985–01915 (2016).
Article CAS PubMed PubMed Central Google Scholar
Alves, J. M. et al. Genome evolution and phylogenomic analysis of Candidatus Kinetoplastibacterium, the beta-proteobacterial endosymbionts of Strigomonas and Angomonas. Genome Biol. Evol. 5, 338–350 (2013).
Article PubMed PubMed Central Google Scholar
Silva, F. M. et al. The reduced genome of Candidatus Kinetoplastibacterium sorsogonicusi, the endosymbiont of Kentomonas sorsogonicus (Trypanosomatidae): loss of the haem-synthesis pathway. Parasitology 145, 1287–1293 (2018).
Article PubMed Google Scholar
Motta, M. C. et al. The bacterium endosymbiont of Crithidia deanei undergoes coordinated division with the host cell nucleus. PLoS One 5, e12415 (2010).
Article ADS PubMed PubMed Central Google Scholar
Catta-Preta, C. M. et al. Endosymbiosis in trypanosomatid protozoa: the bacterium division is controlled during the host cell cycle. Front. Microbiol. 6, 520 (2015).
Article PubMed PubMed Central Google Scholar
Morales, J. et al. Host-symbiont interactions in Angomonas deanei include the evolution of a host-derived dynamin ring around the endosymbiont division site. Curr. Biol. 33, 28–40 (2023).
Article CAS PubMed Google Scholar
Loyola-Machado, A. C. et al. The symbiotic bacterium fuels the energy metabolism of the host trypanosomatid Strigomonas culicis. Protist 168, 253–269 (2017).
Article CAS PubMed Google Scholar
Klein, C. C. et al. Biosynthesis of vitamins and cofactors in bacterium-harbouring trypanosomatids depends on the symbiotic association as revealed by genomic analyses. PLoS One 8, e79786 (2013).
Article ADS PubMed PubMed Central Google Scholar
Alves, J. M. et al. Endosymbiosis in trypanosomatids: the genomic cooperation between bacterium and host in the synthesis of essential amino acids is heavily influenced by multiple horizontal gene transfers. BMC Evol. Biol. 13, 190 (2013).
Article PubMed PubMed Central Google Scholar
Alves, J. M. P. in The handbook of microbial metabolism of amino acids (ed J. P. F. D’Mello) Ch. 27, 371–383 (CAB International, 2017).
Carvalho, E. M. Estudos sobre a posição sistemática, na biologia e a transmissão de tripanosomatideos encontrados em Zelus leucogrammus (Perty, 1834) (Hemiptera, Reduvidae). Re. v. Patol. Trop. 2, 223–274 (1973).
Google Scholar
Borghesan, T. C. et al. Genetic diversity and phylogenetic relationships of coevolving symbiont-harboring insect trypanosomatids, and their Neotropical dispersal by invader African blowflies (Calliphoridae). Front Microbiol 9, 131 (2018).
Article PubMed PubMed Central Google Scholar
d’Avila-Levy, C. M. et al. Influence of the endosymbiont of Blastocrithidia culicis and Crithidia deanei on the glycoconjugate expression and on Aedes aegypti interaction. FEMS Microbiol. Lett. 252, 279–286 (2005).
Article PubMed Google Scholar
van Geelen-Kuenzel, N. A., Yurchenko, V., Lukeš, J. & Nowack, E. C. M. Angomonas deanei. Trends Parasitol, https://doi.org/10.1016/j.pt.2025.1012.1008 (2026).
Mundim, M. H. & Roitman, I. Extra nutritional requirements of artificially aposymbiotic Crithidia deanei. J. Protozool. 24, 329–331 (1977).
Article CAS Google Scholar
Maurya, A. K. et al. A nucleus-encoded dynamin-like protein controls endosymbiont division in the trypanosomatid Angomonas deanei. Sci. Adv. 11, eadp8518 (2025).
Article ADS CAS PubMed Google Scholar
Morales, J. et al. Development of a toolbox to dissect host-endosymbiont interactions and protein trafficking in the trypanosomatid Angomonas deanei. BMC Evol. Biol. 16, 247 (2016).
Article PubMed PubMed Central Google Scholar
Gonçalves, C. S. et al. Importance of Angomonas deanei KAP4 for kDNA arrangement, cell division and maintenance of the host-bacterium relationship. Sci. Rep. 11, 9210 (2021).
Article ADS PubMed PubMed Central Google Scholar
Kröninger, L. et al. T7 RNA polymerase-based gene expression from a transcriptionally silent rDNA spacer in the endosymbiont-harboring trypanosomatid Angomonas deanei. PLoS One 20, e0322611 (2025).
Article PubMed PubMed Central Google Scholar
Maurya, A. K., Cadena, L. R., Ehret, G. & Nowack, E. C. M. Host-encoded ETP2 is involved in recruiting the dynamin-like protein ETP9 to the endosymbiont division site in trypanosomatid Angomonas deanei. mBio 16, 2247 (2025).
Moloney, N. M. et al. Mapping diversity in African trypanosomes using high resolution spatial proteomics. Nat. Commun. 14, 4401 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Pyrih, J. et al. Comprehensive sub-mitochondrial protein map of the parasitic protist Trypanosoma brucei defines critical features of organellar biology. Cell Rep. 42, 113083 (2023).
Article CAS PubMed Google Scholar
Blattner, J. et al. Glycosome assembly in trypanosomes: variations in the acceptable degeneracy of a COOH-terminal microbody targeting signal. J. Cell Biol. 119, 1129–1136 (1992).
Article CAS PubMed PubMed Central Google Scholar
Chmelová, L. et al. Intricate balance of dually-localized catalase modulates infectivity of Leptomonas seymouri (Kinetoplastea: Trypanosomatidae). Int J. Parasitol. 54, 391–400 (2024).
Article PubMed Google Scholar
Zavataro, A. L. E. et al. The genome of the endosymbiont-harboring trypanosomatid Kentomonas sorsogonicus. Protistology 18, 72–81 (2024).
Google Scholar
Catta-Preta, C. M. C., de Azevedo-Martins, A. C., de Souza, W. & Motta, M. C. M. Effect of the endoplasmic reticulum stressor tunicamycin in Angomonas deanei heat-shock protein expression and on the association with the endosymbiotic bacterium. Exp. Cell Res 417, 113162 (2022).
Article CAS PubMed Google Scholar
Huang, G. et al. Proteomic analysis of the acidocalcisome, an organelle conserved from bacteria to human cells. PLoS Pathog. 10, e1004555 (2014).
Article PubMed PubMed Central Google Scholar
Docampo, R. Advances in the cellular biology, biochemistry, and molecular biology of acidocalcisomes. Microbiol. Mol. Biol. Rev. 88, e0004223 (2024).
Article PubMed Google Scholar
Kořený, L., Lukeš, J. & Oborník, M. Evolution of the haem synthetic pathway in kinetoplastid flagellates: an essential pathway that is not essential after all? Int J. Parasitol. 40, 149–156 (2010).
Article PubMed Google Scholar
Hannaert, V., Bringaud, F., Opperdoes, F. R. & Michels, P. A. Evolution of energy metabolism and its compartmentation in Kinetoplastida. Kinetoplastid Biol. Dis. 2, 11 (2003).
Article PubMed PubMed Central Google Scholar
Opperdoes, F. R., Butenko, A., Flegontov, P., Yurchenko, V. & Lukeš, J. Comparative metabolism of free-living Bodo saltans and parasitic trypanosomatids. J. Eukaryot. Microbiol 63, 657–678 (2016).
Article CAS PubMed Google Scholar
Michels, P. A. M. et al. Carbohydrate metabolism in trypanosomatids: new insights revealing novel complexity, diversity and species-unique features. Exp. Parasitol. 224, 108102 (2021).
Article CAS PubMed Google Scholar
Salzman, T. A., Batlle, A. M., Angluster, J. & de Souza, W. Heme synthesis in Crithidia deanei: influence of the endosymbiote. Int J. Biochem 17, 1343–1347 (1985).
Article CAS PubMed Google Scholar
Alves, J. M. et al. Identification and phylogenetic analysis of heme synthesis genes in trypanosomatids and their bacterial endosymbionts. PLoS One 6, e23518 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Harmer, J., Yurchenko, V., Nenarokova, A., Lukeš, J. & Ginger, M. L. Farming, slaving and enslavement: histories of endosymbiosis during kinetoplastid evolution. Parasitology 145, 1311–1323 (2018).
Article PubMed Google Scholar
Szöör, B., Haanstra, J. R., Gualdrón-López, M. & Michels, P. A. Evolution, dynamics and specialized functions of glycosomes in metabolism and development of trypanosomatids. Curr. Opin. Microbiol. 22, 79–87 (2014).
Article PubMed Google Scholar
Morales, J. et al. Differential remodelling of peroxisome function underpins the environmental and metabolic adaptability of diplonemids and kinetoplastids. Proc. Biol. Sci. 283, 0520 (2016).
Andrade-Alviárez, D. et al. Delineating transitions during the evolution of specialised peroxisomes: glycosome formation in kinetoplastid and diplonemid protists. Front, Cell Dev. Biol. 10, 979269 (2022).
Article PubMed PubMed Central Google Scholar
Zítek, J. et al. Reduced mitochondria provide an essential function for the cytosolic methionine cycle. Curr. Biol. 32, 5057–5068 (2022).
Article PubMed PubMed Central Google Scholar
Chen, Y.-J., Mostafa, K. M., Hsu, C.-C. & Leu, J.-Y. Spatial proteomics reveals lipid droplet reorganization in symbiotic Paramecium bursaria cells. BioRxiv, https://doi.org/10.1101/2025.1105.1112.652804 (2025).
Freymuller, E. & Camargo, E. P. Ultrastructural differences between species of trypanosomatids with and without endosymbionts. J. Protozool. 28, 175–182 (1981).
Article CAS PubMed Google Scholar
Keeling, P. J., McCutcheon, J. P. & Doolittle, W. F. Symbiosis becoming permanent: survival of the luckiest. Proc. Natl. Acad. Sci. USA 112, 10101–10103 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Kostygov, A. Y. et al. Genome of Ca. Pandoraea novymonadis, an endosymbiotic bacterium of the trypanosomatid Novymonas esmeraldas. Front Microbiol 8, 1940 (2017).
Article PubMed PubMed Central Google Scholar
Zakharova, A. et al. A new model trypanosomatid Novymonas esmeraldas: genomic perception of its “Candidatus Pandoraea novymonadis” endosymbiont. mBio 12, e01606–e01621 (2021).
Article CAS PubMed PubMed Central Google Scholar
de Azevedo-Martins, A. C. et al. Biochemical and phylogenetic analyses of phosphatidylinositol production in Angomonas deanei, an endosymbiont-harboring trypanosomatid. Parasit. Vectors 8, 247 (2015).
Article PubMed PubMed Central Google Scholar
Chang, K. P. Ultrastructure of symbiotic bacteria in normal and antibiotic-treated Blastocrithidia culicis and Crithidia oncopelti. J. Protozool. 21, 699–707 (1974).
Article CAS PubMed Google Scholar
Docampo, R. & Huang, G. New insights into the role of acidocalcisomes in trypanosomatids. J. Eukaryot. Microbiol 69, e12899 (2022).
Article CAS PubMed PubMed Central Google Scholar
Docampo, R., Jimenez, V., King-Keller, S., Li, Z. H. & Moreno, S. N. The role of acidocalcisomes in the stress response of Trypanosoma cruzi. Adv. Parasitol. 75, 307–324 (2011).
Article PubMed PubMed Central Google Scholar
Ramakrishnan, S., Asady, B. & Docampo, R. Acidocalcisome-mitochondrion membrane contact sites in Trypanosoma brucei. Pathogens 7, 33 (2018).
Article PubMed PubMed Central Google Scholar
Yurchenko, V., Lukeš, J., Tesařová, M., Jirků, M. & Maslov, D. A. Morphological discordance of the new trypanosomatid species phylogenetically associated with the genus Crithidia. Protist 159, 99–114 (2008).
Article CAS PubMed Google Scholar
Votýpka, J. et al. Cosmopolitan distribution of a trypanosomatid Leptomonas pyrrhocoris. Protist 163, 616–631 (2012).
Article PubMed Google Scholar
Yurchenko, V. et al. Diversity of trypanosomatids in cockroaches and the description of Herpetomonas tarakana sp. n. J. Eukaryot. Microbiol. 63, 198–209 (2016).
Article PubMed Google Scholar
Hauser, R., Pypaert, M., Hausler, T., Horn, E. K. & Schneider, A. In vitro import of proteins into mitochondria of Trypanosoma brucei and Leishmania tarentolae. J. Cell Sci. 109, 517–523 (1996).
Article CAS PubMed Google Scholar
Barylyuk, K. et al. A Comprehensive subcellular atlas of the Toxoplasma proteome via hyperLOPIT provides spatial context for protein functions. Cell Host Microbe 28, 752–766 (2020).
Article CAS PubMed PubMed Central Google Scholar
Geladaki, A. et al. Combining LOPIT with differential ultracentrifugation for high-resolution spatial proteomics. Nat. Commun. 10, 331 (2019).
Article ADS PubMed PubMed Central Google Scholar
Schwartz, K. J., Peck, R. F. & Bangs, J. D. Intracellular trafficking and glycobiology of TbPDI2, a stage-specific protein disulfide isomerase in Trypanosoma brucei. Eukaryot. Cell 12, 132–141 (2013).
Article CAS PubMed Google Scholar
Carrero-Lérida, J., Pérez-Moreno, G., Castillo-Acosta, V. M., Ruiz-Pérez, L. M. & González-Pacanowska, D. Intracellular location of the early steps of the isoprenoid biosynthetic pathway in the trypanosomatids Leishmania major and Trypanosoma brucei. Int J. Parasitol. 39, 307–314 (2009).
Article PubMed Google Scholar
Munday, J. C., McLuskey, K., Brown, E., Coombs, G. H. & Mottram, J. C. Oligopeptidase B deficient mutants of Leishmania major. Mol. Biochem. Parasitol. 175, 49–57 (2011).
Article CAS PubMed Google Scholar
Frejno, M. et al. Unifying the analysis of bottom-up proteomics data with CHIMERYS. Nat. Methods 22, 1017–1027 (2025).
Article CAS PubMed PubMed Central Google Scholar
Crook, O. M., Breckels, L. M., Lilley, K. S., Kirk, P. D. W. & Gatto, L. A Bioconductor workflow for the Bayesian analysis of spatial proteomics. F1000Res 8, 446 (2019).
Article PubMed PubMed Central Google Scholar
Kanehisa, M., Sato, Y. & Morishima, K. BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences. J. Mol. Biol. 428, 726–731 (2016).
Article CAS PubMed Google Scholar
Teufel, F. et al. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat. Biotechnol. 40, 1023–1025 (2022).
Article CAS PubMed PubMed Central Google Scholar
Armenteros, J. J. A. et al. Detecting sequence signals in targeting peptides using deep learning. Life Sci. Alliance 2, e201900429 (2019).
Article Google Scholar
Ødum, M. T. et al. DeepLoc 2.1: multi-label membrane protein type prediction using protein language models. Nucleic Acids Res. 52, W215–W220 (2024).
Article PubMed PubMed Central Google Scholar
Hallgren, J. et al. DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. BioRxiv 04, 08–487609 (2022).
Google Scholar
Shanmugasundram, A. et al. TriTrypDB: an integrated functional genomics resource for kinetoplastida. PLoS Negl. Trop. Dis. 17, e0011058 (2023).
Article CAS PubMed PubMed Central Google Scholar
Billington, K. et al. Genome-wide subcellular protein map for the flagellate parasite Trypanosoma brucei. Nat. Microbiol. 8, 533–547 (2023).
Article CAS PubMed PubMed Central Google Scholar
Moreno, J., Nielsen, H., Winther, O. & Teufel, F. Predicting the subcellular location of prokaryotic proteins with DeepLocPro. Bioinformatics 40, btae677 (2024).
Article CAS PubMed PubMed Central Google Scholar
Blum, M. et al. InterPro: the protein sequence classification resource in 2025. Nucleic Acids Res 53, D444–D456 (2025).
Article CAS PubMed PubMed Central Google Scholar
Engler, C., Kandzia, R. & Marillonnet, S. A one pot, one step, precision cloning method with high throughput capability. PLoS One 3, e3647 (2008).
Article ADS PubMed PubMed Central Google Scholar
Rueden, C. T. et al. ImageJ2: imageJ for the next generation of scientific image data. BMC Bioinforma. 18, 529 (2017).
Article Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinforma. 10, 421 (2009).
Article Google Scholar
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Article PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank Ingrid Škodová-Sveráková (Comenius University, Bratislava) for helpful discussions. We acknowledge support of the Czech Science Foundation (25-15298S to V.Y and J.L.), European Union’s Operational Programme “Just Transition” (CZ.10.03.01/00/22_003/0000003 LERCO to V.Y.), German Research Foundation (SFB1535, project ID 458090666 to E.C.M.N.), Wellcome Trust (221944/A/20/Z to J.C.M.), and PhD fellowships of the Jürgen Manchot Graduate School (MOI IV to A.K.M. and MOI V to L.R.C.). Computational resources were provided by the e-INFRA CZ project 90254 supported by the Czech Ministry of Education, Youth and Sports.

Author information

Ľubomíra Chmelová
Present address: Masaryk University, Faculty of Science, Department of Experimental Biology, Brno, Czechia
Anay K. Maurya
Present address: LPHI, UMR5294, CNRS, University of Montpellier, Inserm, Montpellier, France

Authors and Affiliations

Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czechia
Michael Hammond, Vanesa Puente, Kristína Záhonová & Julius Lukeš
Faculty of Sciences, University of South Bohemia, České Budějovice, Czechia
Michael Hammond & Julius Lukeš
Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czechia
Ľubomíra Chmelová, Kristína Záhonová & Vyacheslav Yurchenko
Institute of Microbial Cell Biology, Heinrich Heine University, Düsseldorf, Germany
Natascha A. van Geelen-Kuenzel, Anay K. Maurya, Lawrence Rudy Cadena & Eva C. M. Nowack
York Biomedical Research Institute and Department of Biology, University of York, York, UK
Eden R. Ferreira & Jeremy C. Mottram
Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Vestec, Czechia
Kristína Záhonová
Division of Infectious Diseases, Department of Medicine, University of Alberta, Edmonton, AB, Canada
Kristína Záhonová
Bioscience Technology Facility, Department of Biology, University of York, York, UK
Adam Dowle

Authors

Michael Hammond
View author publications
Search author on:PubMed Google Scholar
Ľubomíra Chmelová
View author publications
Search author on:PubMed Google Scholar
Natascha A. van Geelen-Kuenzel
View author publications
Search author on:PubMed Google Scholar
Anay K. Maurya
View author publications
Search author on:PubMed Google Scholar
Eden R. Ferreira
View author publications
Search author on:PubMed Google Scholar
Vanesa Puente
View author publications
Search author on:PubMed Google Scholar
Lawrence Rudy Cadena
View author publications
Search author on:PubMed Google Scholar
Kristína Záhonová
View author publications
Search author on:PubMed Google Scholar
Adam Dowle
View author publications
Search author on:PubMed Google Scholar
Jeremy C. Mottram
View author publications
Search author on:PubMed Google Scholar
Eva C. M. Nowack
View author publications
Search author on:PubMed Google Scholar
Julius Lukeš
View author publications
Search author on:PubMed Google Scholar
Vyacheslav Yurchenko
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualisation (V.Y., J.L., and E.C.M.N.), methodology (J.C.M.), validation (M.H., L.Ch., and E.R.F.), formal analysis (M.H., K.Z., E.R.F., and A.D.), investigation (M.H., L.Ch., N.A.G.-K., A.K.M., E.R.F., V.P., L.R.C., K.Z., and A.D.), resources (V.Y., J.L., E.C.M.N., and J.C.M.), data curation (M.H., A.D., V.Y., J.L., J.C.M., and E.C.M.N.), writing - original draft (M.H., E.C.M.N., and V.Y.), writing - review & editing (all authors), visualisation (M.H., E.R.F., A.D., and K.Z.), supervision (V.Y., J.L., E.C.M.N., and J.C.M.), project administration (V.Y.), funding acquisition (V.Y., J.L., E.C.M.N., and J.C.M.).

Corresponding authors

Correspondence to Eva C. M. Nowack, Julius Lukeš or Vyacheslav Yurchenko.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Dataset 1 (download XLSX )

Supplementary Dataset 2 (download XLSX )

Reporting Summary (download PDF )

Transparent Peer Review file (download PDF )

Source data

Source Data (download ZIP )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hammond, M., Chmelová, Ľ., van Geelen-Kuenzel, N.A. et al. Subcellular proteomics reveals a blueprint for endosymbiont integration in trypanosomatid Angomonas deanei. Nat Commun 17, 2241 (2026). https://doi.org/10.1038/s41467-026-70084-0

Download citation

Received: 23 October 2025
Accepted: 16 February 2026
Published: 03 March 2026
Version of record: 05 March 2026
DOI: https://doi.org/10.1038/s41467-026-70084-0