Model of metabolism and gene expression predicts proteome allocation in Pseudomonas putida

Tibocha-Bonilla, Juan D.; Gandhi, Vishant; Lieng, Chloe; Moyne, Oriane; Santibáñez-Palominos, Rodrigo; Zengler, Karsten

doi:10.1038/s41540-025-00521-1

Download PDF

Article
Open access
Published: 24 May 2025

Model of metabolism and gene expression predicts proteome allocation in Pseudomonas putida

Juan D. Tibocha-Bonilla¹,
Vishant Gandhi²,
Chloe Lieng³,
Oriane Moyne³,
Rodrigo Santibáñez-Palominos³ &
…
Karsten Zengler^2,3,4,5

npj Systems Biology and Applications volume 11, Article number: 55 (2025) Cite this article

3833 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The genome-scale model of metabolism and gene expression (ME-model) for Pseudomonas putida KT2440, iPpu1676-ME, provides a comprehensive representation of biosynthetic costs and proteome allocation. Compared to a metabolic-only model, iPpu1676-ME significantly expands on gene expression, macromolecular assembly, and cofactor utilization, enabling accurate growth predictions without additional constraints. Multi-omics analysis using RNA sequencing and ribosomal profiling data revealed translational prioritization in P. putida, with core pathways, such as nicotinamide biosynthesis and queuosine metabolism, exhibiting higher translational efficiency, while secondary pathways displayed lower priority. Notably, the ME-model significantly outperformed the M-model in alignment with multi-omics data, thereby validating its predictive capacity. Thus, iPpu1676-ME offers valuable insights into P. putida’s proteome allocation and presents a powerful tool for understanding resource allocation in this industrially relevant microorganism.

Proteome allocations change linearly with the specific growth rate of Saccharomyces cerevisiae under glucose limitation

Article Open access 20 May 2022

Addressing genome scale design tradeoffs in Pseudomonas putida for bioconversion of an aromatic carbon source

Article Open access 14 January 2025

An updated genome-scale metabolic network reconstruction of Pseudomonas aeruginosa PA14 to characterize mucin-driven shifts in bacterial metabolism

Article Open access 08 October 2021

Introduction

Pseudomonas putida KT2440 is a versatile and metabolically robust biotechnological workhorse^1,2. It thrives in diverse environments^3,4, degrades a wide range of organic compounds^4,5, and as a result, is employed to produce a variety of bulk and fine chemicals⁶. Recent studies have focused on harnessing the potential of P. putida KT2440 and on understanding and manipulating its metabolic network⁷. Genome-scale metabolic models (M-models) have long been used in metabolic engineering to identify metabolic bottlenecks and potential improvements in metabolic pathways for bioproduction^8,9,10. While M-models provide valuable insights into metabolic capabilities, they do not account for macromolecular expression and the biosynthetic cost of enzymes and, as a consequence, require extensive constraining^11,12,13. Thus, predictions using M-models can lack robustness¹¹, making it challenging to predict engineering strategies to improve performance¹⁴.

Models of metabolism and gene expression (ME-models) mechanistically describe gene expression pathways and their intertwined role with metabolic pathways to achieve optimal resource allocation for growth¹². As a result, ME-models make predictions beyond the scope of traditional M-models, including unconstrained by-product secretion¹¹, overflow metabolism¹⁵, cofactor usage¹⁵, protein overproduction¹⁴, and proteomic responses to stress conditions¹⁶. However, the reconstruction of ME-models is time-intensive and requires extensive manual curation, which has led to a reduced number of reconstructed ME-models, only available for Bacillus subtilis¹⁴, Clostridium ljungdahlii¹⁵, Escherichia coli^12,17, and Thermotoga maritima¹¹.

Here, we reconstructed an ME-model for P. putida KT2440, iPpu1676-ME, offering an unprecedented level of detail in the cellular function and proteome allocation of this bacterium. We show the improved predictive capabilities of proteome limitation in P. putida KT2440. Furthermore, we interrogated the gene expression of P. putida KT2440 using transcriptomics (RNA-Seq) and translatomics (Ribo-Seq) data. We analyzed the translational prioritization of pathways, as well as pathways that were significantly less prioritized for translation. When contrasting the model predictions against these sequencing datasets, we found stronger agreement with the ME-model compared to the M-model. Thus iPpu1676-ME represents a valuable asset for accurate predictive modeling as a tool for bioprocess design and optimization and metabolic engineering^8,9,10 in this industrially important strain.

Results

iPpu1676-ME predicts proteome limitation and overflow metabolism in P. putida KT2440

The ME-model of Pseudomonas putida KT2440, iPpu1676-ME, was reconstructed based on a previous genome-scale metabolic model (M-model), iJN1462¹. The gene expression machinery was integrated into the ME-model following available ME-model reconstruction protocols^12,18. Protein complex stoichiometries, function, localization, translocation pathways, and transcriptional unit compositions were retrieved and mapped from the genome of P. putida KT2440 (AE015451.2¹⁹) as well as the strain-specific database in BioCyc²⁰. iPpu1676-ME consists of 7526 metabolites, 14,414 reactions, and 1676 genes. Compared to the original M-model template this represents an increase of 250% in metabolite, 392% in reaction, and 15% in gene coverage (Fig. 1a–c). The expression machinery (E-matrix) adds up to 5443 metabolites, including types of RNA (mRNA, tRNA, and rRNA), proteins with and without modifications, and complexes (Fig. 1a). In addition, the E-matrix contains 5040 reactions, including translation and modification, translocation, transcription, and tRNA charging (Fig. 1b). Finally, the added 214 genes in iPpu1676-ME (Fig. 1c) correspond to mapped gene expression machinery including tRNA ligases, ribosomal proteins, RNA polymerase subunits, transcription factors, and protein modification machinery (including translocation machinery) (Supplementary Fig. 1).

**Fig. 1: Properties and predictions of the ME-model of *P. putida* KT2440.**

As previously reported, the integration of the E-matrix provides the ME-model with a quantitative description of the biosynthetic cost of biochemical reactions¹². Reaction fluxes are limited by the underlying gene expression cost, resulting in reduced flux variability¹¹ and higher certainty in model predictions. The flux variability analysis of iPpu1676-ME showed reduced flux ranges (Fig. 1e). The majority of the M-model ranges are above 10⁰, while ME-model ranges are mostly below it, increasing the certainty of predictions in the latter. Moreover, the ME-model readily recapitulates the maximum growth rate of P. putida KT2440 in a glucose-containing minimal medium (Fig. 1d), which has been reported at 0.58 (σ = 0.02) h⁻¹ and at a glucose uptake rate of 8.15 (σ = 2.00) mmol/gDW/h from five different studies^{6,21,22,23,24}. As opposed to the M-model, whose metabolic reactions are unconstrained by biosynthetic costs, the ME-model reaches proteome limitation when in nutrient excess and is capable of generating biologically relevant simulations without any additional constraints (Fig. 1d).

We further assessed the accuracy of intracellular flux predictions by contrasting them to previously published reports of metabolic flux analysis (MFA) in P. putida KT2440^25,26. Notably, both M- (Supplementary File 1) and ME-models (Supplementary File 2) reproduce the overall activity of the core metabolism assessed in the experimental MFA studies. At a glucose uptake rate of 2.21 mmol/gDW/h²⁵ (simulation constraints and flux distributions are provided in Supplementary Data 1), glucose is converted to 6-phospho-d-gluconate, which is then assimilated and conveyed to glycerol-3-phosphate, bypassing the upper half of glycolysis. Interestingly, there is minimal but nonzero flux through part of the pentose phosphate pathway predicted by both M- and ME-models and observed in MFA²⁶. The second half of glycolysis is active and feeds into the tricarboxylic acid cycle, with all enzymatic steps being active^25,26. However, only the M-model incorrectly predicted isocitrate lyase and malate synthase (the glyoxylate shunt) to be active, which were observed to be inactive in MFA^25,26. Furthermore, we found that the ME-model correctly predicts the activity of pyruvate kinase^25,26, which is inactive in the M-model simulations. In the latter, phosphoenolpyruvate is converted to pyruvate through dGTP:pyruvate 2-O-phosphotransferase.

Another improvement in ME-models is the prediction of proteome limitation, which leads to the mechanistic prediction of overflow metabolites in ME-models¹⁵. In iPpu1676-ME, this leads to predicting 2-ketogluconate and acetate secretion in excess of glucose (Fig. 1e). While 2-ketogluconate is a known secreted metabolite by P. putida, acetate was predicted as a minor by-product, which has been shown in oxygen-limited conditions^22,27. Notably, iPpu1676-ME maintains the same 85% accuracy in gene essentiality prediction¹ of 54 metabolic gene knockout strains in M9 minimal medium²⁸.

Multi-omic data reveal translational prioritization in P. putida KT2440

Translational efficiency (TE) refers to the rate of protein synthesis per unit of mRNA transcript²⁹. The translation of a group of genes is said to be prioritized if their TE (see the “Methods” section) is high relative to other genes²⁹. Translational prioritization readily informs the resource allocation strategies of an organism^30,31,32. It can be used to infer specific objectives^30,33, such as maximizing growth or uptaking a substrate. Thus, we aimed to interrogate the proteome allocation of P. putida and its translational prioritization using RNA-Seq and Ribo-Seq. Then, we contrasted it with the predictions by iPpu1676-ME.

P. putida was grown in glucose-containing M9 minimal medium in three biological replicates, and paired RNA-Seq and Ribo-Seq were performed. The three samples yielded a wide range of gene activity in both datasets with four and six orders of magnitude differences, as observed in the raw read counts from RNA-Seq (Supplementary Fig. 2a) and Ribo-Seq (Supplementary Fig. 2b), respectively, reinforcing the disparity in resource allocation throughout the genome. In order to discard technical artifacts³⁴ as a confounding factor, we performed two-tailed t-tests (p < 0.05) and calculated Pearson correlation coefficients of the replicates. No significance in the means of the read count distributions was observed, and all replicates were very strongly correlated (PCC > 0.9) in both RNA-Seq and Ribo-Seq (Fig. 2a).

**Fig. 2: Multi-omic data of *P. putida* KT2440 growth in glucose.**

When contrasting across datasets, RNA-Seq and Ribo-Seq counts-per-million (CPM) show significant correlations, with a PCC of 0.78 (p = 0.0) in the three samples (Fig. 2b). Despite the significant correlation at the whole-dataset level, there are variations in the resource allocation at the pathway level in the transcriptome and the translatome, which shows differential translational prioritization across the genome of P. putida. Thus, we assessed whether metabolic pathways were observed with high or low translational prioritization. We measured the translational prioritization of a pathway using the average TE of its associated genes (see the “Methods” section) and calculated the significance of the prioritization through a one-tailed Mann–Whitney U (MWU) test. As part of the MWU test, we sorted and ranked the genes in the RNA-Seq and Ribo-Seq datasets and contrasted if their ranks differed significantly in either dataset. A high TE and a low right-tailed MWU test p-value for a significant increase in rank in the Ribo-Seq dataset (p < 0.05) support high translational prioritization.

A right-tailed Mann–Whitney U test revealed that there were ten metabolic pathways with significantly higher ranks in the Ribo-Seq dataset (p < 0.05), which can be evidenced by the calculated translational efficiencies (TE)^30,35,36 between 1.4 and 2.1 (Fig. 2c). Notably, the highest prioritization (highest TE) was calculated for nicotinamide biosynthesis (TE = 2.1, p = 5.38e−4), which produces the essential cofactor NAD⁺. Other core metabolic pathways, such as urea cycle (TE = 1.9, p = 0.02), pyrimidine biosynthesis (TE = 1.7, p = 5.04e−3), lipid A biosynthesis (TE = 1.7, p = 0.02), and queuosine biosynthesis (TE = 1.7, p = 0.006), which are directly associated with cell proliferation³⁷, showed significant prioritization. As expected, there was significant prioritization of gene expression, including translation (TE = 1.8, p = 0.03) and tRNA charging (TE = 1.6, p = 0.04).

On the other hand, 11 metabolic pathways had significantly lower ranks as calculated by a left-tailed Mann–Whitney U test (p < 0.05). The lowest prioritization was shown for pathways less necessary for growth in glucose, namely levulinate metabolism (TE = 0.2, p = 7.02e−4), phenylacetyl-CoA catabolon (TE = 0.3, p = 0.03), cellulose metabolism (TE = 0.5, p = 0.02), and starch and sucrose metabolism (TE = 1.0, p = 0.01). Membrane-associated pathways, such as inner membrane transport (TE = 1.0, p = 0.02), outer membrane transport (TE = 0.8, p = 8.5e−3), iron uptake (TE = 0.7, p = 0.03), peptidoglycan biosynthesis (TE = 0.8, p = 0.04), and murein recycling (TE = 0.5, p = 0.01), showed low translational prioritization. It is worth noting that a TE equal to 1.0 can still mean an overall lower prioritization due to the variation in ranks in both datasets, such is the case for starch and sucrose metabolism and inner membrane transport (Fig. 2d).

The ME-model recapitulates optimal proteome allocation

Our translational prioritization analysis highlighted significantly higher TEs for growth-required pathways, both metabolic and gene expression-related (Fig. 2c, d). Most of the transcriptome and translatome are allocated for these pathways (Fig. 3a), led by translation and followed by energy and biomass production. However, multi-omic data does not provide a quantitative understanding of the metabolic and gene expression rates. Predictive metabolic models are used to attain this understanding. Thus, here, we assessed the improved performance of the ME-model of P. putida KT2440 over the M-model when contrasting against the expression levels inferred from multi-omics (simulation constraints and flux distributions are provided in Supplementary Data 2).

**Fig. 3: Correlation between model predictions and multi-omics at the pathway level.**

We compared M- and ME-model predictions of cumulative pathway-level fluxes of P. putida KT2440 against pathway-level expression from RNA-Seq, Ribo-Seq, and the calculated TE from them. The ME-model significantly outperformed the M-model in all cases (Fig. 3b). As a reference, pathway-level RNA-Seq and Ribo-Seq are correlated with a PCC of 0.94 (Fig. 3c). In the M-model, several subsystems were predicted with low flux, causing relevant data points to be more sparse. The PCCs were 0.50 for RNA-Seq (p = 3.55e−18) and 0.49 for Ribo-Seq (p = 5.50e−17), which signifies an existent but weak correlation between both datasets. On the other hand, the ME-model showed stronger positive correlations with PCCs of 0.71 (p = 2.27e−44) and 0.76 (p = 1.40e−53) for RNA-Seq and Ribo-Seq, respectively. Notably, despite the stronger correlation between the ME-model predictions and RNA-Seq and Ribo-Seq, there was only a weak positive correlation for TE (PCC = 0.46). Therefore, the predicted expression fluxes by the ME-model are predictive of the observed transcriptome and translatome but not of the TE. Thus, the predictive capability of iPpu1676-ME to recapitulate proteome allocation of P. putida KT2440 showcases its potential to be further used to interrogate and optimize the resource allocation in this organism.

Discussion

Pseudomonas putida KT2440 has broad potential for biotechnological applications^1,2, and several studies have been focusing on optimizing culture conditions, growth, and metabolic capabilities for this bacterium. While bioinformatics tools and databases have informed potential pathways in P. putida^38,39, only predictive metabolic models can quantitatively estimate improvements in metabolism, growth, product rates, and yields^40,41,42. Similar to previous studies using ME-models, we here showed the prediction of by-product secretion¹¹, overflow metabolism¹⁵, and prediction of proteome allocation¹⁶. While previous ME-model reconstructions have widely proven prediction improvements, multi-omics datasets (RNA-Seq and Ribo-Seq) have not been integrated or contrasted with these simulations. Our translational prioritization analysis showed significantly high TE for queuosine biosynthesis alongside core biomass precursor biosynthetic pathways. Interestingly, this pathway has been reported as a cell division regulator in other bacteria³⁷. On the other hand, transport and other membrane-associated functions were found to have low translational prioritization.

The ME-model for P. putida KT2440, iPpu1676-ME, showed significant improvements in the predictive capabilities of transcriptome and translatome over the template M-model, iJN1462¹. iPpu1676-ME achieved an outstanding PCC of 0.71 against RNA-Seq and 0.76 against Ribo-Seq. As a reference, the multi-omics model and analytics (MOMA), a semi-supervised machine learning pipeline, achieved PCCs between 0.58 and 0.85 with RNA-Seq in E. coli across 16 strains⁴³. It is worth noting that the higher agreement of Ribo-Seq and the predicted proteome allocation by the ME-model underscores the precision of this sequencing technology in identifying the metabolic goal of an organism³⁰. In addition, it highlights the ability of a ME-model to predict optimal proteome allocation in a metabolically diverse organism such as P. putida KT2440^1,2.

Some limitations affect our analysis. Determination of the active proteome is challenging through sequencing technologies due to various technical limitations. For example, RNA degradation affects measurements by RNA-Seq⁴⁴, and RNA transcription trends do not always carry over to translation due to translational prioritization effects^29,30. On the other hand, direct measurement of the proteome through proteomics is hindered by the difficulty of whole-proteome determinations and the inherent noise in mass spectrometry data⁴⁵. On the other hand, Ribo-Seq provides an accurate representation of translation in vivo^29,30. While it cannot detect protein stability, modification, and folding⁴⁶, it has been shown to provide a genome-scale understanding of translational prioritization^29,30. Furthermore, we noticed variation between the replicates, which can be due to the inherent flexibility of the metabolism of P. putida^1,2. However, the correlation was strong enough (PCC > 0.9) to ensure there were no significant outliers in the replicates. Another limitation of this study is that metabolic models, and thus ME-models, are limited to modeling enzymes with either a metabolic or a gene expression function¹². However, there is a fraction of the proteome that can have an alternative function, which can be structural or still unknown. For example, this fraction has been estimated to be approximately 36% of the proteome in E. coli¹², and it might be the cause of some of the unexplained variances in our comparison between model predictions and Ribo-Seq.

Here, we provide a modeling framework alongside multi-omic datasets that were not available to date, yielding an important resource for further understanding the translational resource allocation in this industrially relevant microorganism. Overall, we envision that the ME-model and multi-omic resources brought forward in this work will serve as powerful tools for the metabolic engineering and optimization of P. putida KT2440.

Methods

Reconstruction and simulation of the ME-model

The ME-model of P. putida KT2440, iPpu1676-ME, was reconstructed using coralME¹⁸, with the available genome AE015451.2¹⁹, M-model (iJN1462¹), and BioCyc²⁰-derived annotation files as inputs. Input manual curation files of iPpu1676-ME are explained in Table 1. The code and scripts in this work were developed and run in Python 3.10, COBRApy⁴⁷ version 0.26.3, and coralME version 1.1.5. Simulations were performed using the Quad MINOS software courtesy of Prof. Michael A. Saunders at Stanford University⁴⁸. The Quad MINOS solver was compiled under Ubuntu 22.04 with gfortran version 5 and Python 3.10 (pip 22.3.1, wheel 0.38.4, numpy 1.21.6, scipy 1.7.3, cython 0.29.32, and cpython 0.0.6). Flux distributions were calculated with a binary search algorithm that looks for the maximum possible growth rate that is feasible.

Table 1 Input and manual curation files and their description

Full size table

Computations were performed on a 64-bit Ubuntu 22.04.3 LTS (Jammy Jellyfish); AMD Ryzen 9 7900X@4.70 GHz (12 cores, 24 threads); 4 × 32 GB 6000 MHz DDR5 RAM. It is worth noting that iPpu1676-ME is one of the largest ME-models available. As a reference, the E. coli ME-model contains 1678 genes¹². As such, optimizing iPpu1676-ME typically takes five minutes in the computer described here. This is several orders of magnitude longer than the associated M-model, iJN1462¹ (~72 ms). While this computation time still allows for most analyses performed in metabolic modeling, it can hinder its application in large-scale sampling of conditions, e.g., simulating a bioreactor.

Gene essentiality predictions

Gene essentiality was predicted in iPpu1676-ME by closing (setting upper and lower bounds to zero) the translation reaction of a gene and testing for ME-model feasibility. Feasibility is tested at a growth rate of 10⁻³ with the QuadMINOS solver in quad-precision and a tolerance of 10⁻¹⁶.

Flux variability analysis

Flux variability analysis (FVA) was performed using the built-in method model.fva() in coral ME. In order to generate comparable datasets between the M- and the ME-models, we cast the M-model into an ME-model instance (function from_cobra()). The FVA in both models was performed using the QuadMINOS solver in quad-precision and a tolerance of 10⁻¹⁶.

Culture and sequencing of P. putida KT2440

A preculture of Pseudomonas putida KT2440 was grown in 5 mL of liquid 1× M9 medium (Sigma Aldrich, M6030) with 30 mM glucose under oxic conditions at 37 °C for 2 days. The optical density at 600 nm (OD₆₀₀) was measured to assess carrying capacity using the Molecular Devices SpectraMax M3 Multi-Mode Microplate Reader (VWR, cat # 89429-536). The preculture was diluted in triplicate to an OD₆₀₀ of 0.1 in 25 mL of 1x M9 medium and incubated under oxic conditions at 37 °C in a shaking incubator, with optical density measured hourly. Samples were harvested by centrifugation and pellets were saved for multi-omics analysis as detailed below (RNA-, and Ribo-Seq).

Transcriptomic (RNA-Seq) sample preparation

RNA was extracted from P. putida replicates using the RNeasy mini kit (Qiagen), with rRNA removal performed using the QIAseq FastSelect-5S/16S/23S kit (Qiagen). RNA-Seq libraries were constructed using the KAPA RNA HyperPrep kit (Roche) and barcoded with TruSeq indexes (Illumina). Amplification was monitored in real-time using SYBR-Green and halted upon reaching the amplification plateau.

Translatomic (Ribo-Seq) sample preparation

The preparation of Ribo-Seq samples utilized a protocol adapted from previously described methods designed for axenic bacterial cultures^33,49. Briefly, the bacterial cultures were treated with chloramphenicol and pelleted. At this point, we modified this protocol to address the chloramphenicol resistance of Pseudomonas putida KT2440 by resuspending the pellet in RNAlater and flash-freezing in liquid nitrogen. Samples were thawed, pelleted, and RNAlater removed, before proceeding to mechanical bacterial lysis. Bacterial lysis was performed in the presence of a lysis solution containing additional chloramphenicol and Guanosine-5′-[(β,γ)-imido] triphosphate (GMPPNP) to inhibit protein elongation. Lysates were treated with MNase and DNase to digest nucleic acids that were not protected by ribosomes. Monosomes were recovered using RNeasy mini spin size-exclusion columns (Qiagen) and RNA Clean & Concentrator-5 kit (Zymo). rRNA was removed with the QIAseq FastSelect-5S/16S/23S kit (Qiagen), and MetaRibo-Seq libraries were constructed using the NEBNext Small RNA Library Prep set for Illumina. Amplification was followed in real-time with SYBR-Green and stopped upon plateau plateau, and PCR products were purified with the Select-a-size DNA Clear & Concentrator kit (Zymo).

Sequencing

Library quantity and average size were assessed with the 4200 TapeStation System (Agilent). Library concentrations were quantified using the Qubit dsDNA HS Assay kit and QuBit 2.0 Fluorometer (Invitrogen). Sequencing was performed by UCSD IGM on the Illumina NovaSeq S4, PE100 platform with a minimum sequencing depth of 50 million reads for transcriptomic samples and 100 million reads for translatomic samples.

Sequence alignment and post-processing of RNA-Seq and Ribo-Seq

Paired-end read sequencing files from RNA-Seq and Ribo-Seq (FASTQ format) were processed using Python 3.7. Reads were trimmed using trim_galore version 0.6.10 (Cutadapt version 2.6)⁵⁰. Reads were aligned to the genome of P. putida KT2440 using bowtie2 version 2.2.5⁵¹. For Ribo-Seq, single-end read sequence alignment was performed since read length is short enough that both directions are redundant. For RNA-Seq, reads aligning to ribosomal RNA were discarded. Raw read counts were estimated using Woltka version 0.1.5⁵², with the built-in function “classify”. Finally, cross-sample analyses in this work were performed using the counts per million (CPM) normalization of read counts, as shown in Eq. (1).

$$CPM=\frac{Raw\,read\,count}{\sum Raw\,read\,counts\,in\,sample}\,\ast {10}^{6}$$

(1)

Statistical analysis of translational prioritization from multi-omics

Translational prioritization was inferred from calculating translational efficiency (TE) (Eq. (2))^30,35,36, and significance was calculated using the Mann–Whitney U test. Higher translational prioritization was determined if TE ≥ 1.0 and p < 0.05 in a right-tailed Mann–Whitney U test, while lower was determined if TE ≤ 1.0 and p < 0.05 in a left-tailed Mann–Whitney U test. The test was performed using scipy.stats.mannwhitneyu package of scipy version 1.11.1⁵³.

$$TE=\frac{Ribo-Seq\,CPM}{RNA-Seq\,CPM}$$

(2)

Benchmarking analysis of model predictions with multi-omics

Benchmarking of the models (M- and ME-models) predictions was performed with a correlative analysis between the pathway activity inferred from RNA-Seq and Ribo-Seq and the predicted pathway fluxes in the M-model (iJN1462¹) and the ME-model (iPpu1676-ME). For RNA-Seq and Ribo-Seq, gene-level CPM data were grouped and summed by annotated subsystem in the ME-model. For M-model predictions, the reaction fluxes were grouped and summed by annotated subsystem in the M-model. For ME-model predictions, protein translation fluxes were grouped and summed by annotated subsystem in the M-model. We then log-transformed the fluxes as log₁₀(f + 1), where f is the combined flux of a subsystem. The data in all three samples was used in the regression to maintain maximal statistical power. Then, linear regression was performed with the statsmodels version 0.14.1, which returned the regressed linear model and 95% confidence interval. The Pearson correlation coefficient (PCC) and significance (p-value) were calculated using scipy.stats.pearsonr of scipy version 1.11.1.

Data availability

Data is provided within the manuscript and in supplementary information. Additionally, all data in this work, except for the raw sequencing files, are available in the GitHub repository at https://github.com/jdtibochab/pputidame. The code and data have been deposited in Zenodo (https://doi.org/10.5281/zenodo.14984894). Raw RNA-Seq and Ribo-Seq files have been deposited in the Sequence Read Archive (SRA) under BioProject PRJNA1238387.

Code availability

All code used in this work is available in the GitHub repository at https://github.com/jdtibochab/pputidame. The code and data have been deposited in Zenodo (https://doi.org/10.5281/zenodo.14984894).

References

Nogales, J. et al. High-quality genome-scale metabolic modelling of Pseudomonas putida highlights its broad metabolic capabilities. Environ. Microbiol. 22, 255–269 (2020).
Article CAS PubMed Google Scholar
Nikel, P. I. & de Lorenzo, V. Robustness of Pseudomonas putida KT2440 as a host for ethanol biosynthesis. N. Biotechnol. 31, 562–571 (2014).
Article CAS PubMed Google Scholar
Loeschcke, A. & Thies, S. Pseudomonas putida—a versatile host for the production of natural products. Appl. Microbiol. Biotechnol. 99, 6197–6214 (2015).
Article CAS PubMed PubMed Central Google Scholar
Franden, M. A. et al. Engineering Pseudomonas putida KT2440 for efficient ethylene glycol utilization. Metab. Eng. 48, 197–207 (2018).
Article CAS PubMed Google Scholar
Kohlstedt, M. et al. From lignin to nylon: cascaded chemical and biochemical conversion using metabolically engineered Pseudomonas putida. Metab. Eng. 47, 279–293 (2018).
Article CAS PubMed Google Scholar
van Duuren, J. B. et al. Reconciling in vivo and in silico key biological parameters of Pseudomonas putida KT2440 during growth on glucose under carbon-limited condition. BMC Biotechnol. 13, 93 (2013).
Article PubMed PubMed Central Google Scholar
Salvachúa, D. et al. Metabolic engineering of Pseudomonas putida for increased polyhydroxyalkanoate production from lignin. Microb. Biotechnol. 13, 290–298 (2020).
Article PubMed Google Scholar
Han, Y., Rangel, A. T., Pomraning, K. R., Kerkhoven, E. J. & Kim, J. Advances in genome-scale metabolic models of industrially important fungi. Curr. Opin. Biotechnol. 84, 103005 (2023).
Article CAS PubMed Google Scholar
Lv, X. et al. New synthetic biology tools for metabolic control. Curr. Opin. Biotechnol. 76, 102724 (2022).
Article CAS PubMed Google Scholar
Bi, X. et al. Construction of multiscale genome-scale metabolic models: frameworks and challenges. Biomolecules 12, 721 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lerman, J. A. et al. In silico method for modelling metabolism and gene product expression at genome scale. Nat. Commun. 3, 929 (2012).
Article PubMed Google Scholar
Lloyd, C. J. et al. COBRAme: a computational framework for genome-scale models of metabolism and gene expression. PLoS Comput. Biol. 14, e1006302 (2018).
Article PubMed PubMed Central Google Scholar
Panikov, N. S. Genome-scale reconstruction of microbial dynamic phenotype: successes and challenges. Microorganisms 9, 2352 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tibocha-Bonilla, J. D. et al. Predicting stress response and improved protein overproduction in Bacillus subtilis. Npj Syst. Biol. Appl. 8, 1–12 (2022).
Article Google Scholar
Liu, J. K. et al. Predicting proteome allocation, overflow metabolism, and metal requirements in a model acetogen. PLoS Comput. Biol. 15, e1006848 (2019).
Article PubMed PubMed Central Google Scholar
Zhao, J., Chen, K., Palsson, B. O. & Yang, L. StressME: unified computing framework of Escherichia coli metabolism, gene expression, and stress responses. PLoS Comput. Biol. 20, e1011865 (2024).
Article CAS PubMed PubMed Central Google Scholar
O’Brien, E. J., Lerman, J. A., Chang, R. L., Hyduke, D. R. & Palsson, B. O. Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction. Mol. Syst. Biol. 9, 693 (2013).
Article PubMed PubMed Central Google Scholar
Tibocha-Bonilla, J. D., Santibanez-Palominos, R., Weng, Y., Kumar, M. & Zengler, K. Proteome allocation of the microbiome reveals how diet and metabolic dysbiosis impact disease. Preprint at https://doi.org/10.1101/2024.11.21.624724 (2024).
Nelson, K. E. et al. Complete genome sequence and comparative analysis of the metabolically versatile Pseudomonas putida KT2440. Environ. Microbiol. 4, 799–808 (2002).
Article CAS PubMed Google Scholar
Karp, P. D. et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief. Bioinform. 20, 1085–1093 (2019).
Article CAS PubMed Google Scholar
del Castillo, T. et al. Convergent peripheral pathways catalyze initial glucose catabolism in Pseudomonas putida: genomic and flux analysis. J. Bacteriol. 189, 5142–5152 (2007).
Article PubMed PubMed Central Google Scholar
Blank, L. M., Ionidis, G., Ebert, B. E., Bühler, B. & Schmid, A. Metabolic response of Pseudomonas putida during redox biocatalysis in the presence of a second octanol phase. FEBS J. 275, 5173–5190 (2008).
Article CAS PubMed Google Scholar
Wang, Y. et al. Growth of engineered Pseudomonas putida KT2440 on glucose, xylose, and arabinose: hemicellulose hydrolysates and their major sugars as sustainable carbon sources. GCB Bioenergy 11, 249–259 (2019).
Article CAS Google Scholar
Nikel, P. I., Chavarría, M., Fuhrer, T., Sauer, U. & de Lorenzo, V. Pseudomonas putida KT2440 strain metabolizes glucose through a cycle formed by enzymes of the Entner–Doudoroff, Embden–Meyerhof–Parnas, and pentose phosphate pathways. J. Biol. Chem. 290, 25920 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sudarsan, S., Dethlefsen, S., Blank, L. M., Siemann-Herzberg, M. & Schmid, A. The functional structure of central carbon metabolism in Pseudomonas putida KT2440. Appl. Environ. Microbiol. 80, 5292–5303 (2014).
Article PubMed PubMed Central Google Scholar
Nikel, P. I., Chavarría, M., Fuhrer, T., Sauer, U. & de Lorenzo, V. Pseudomonas putida KT2440 strain metabolizes glucose through a cycle formed by enzymes of the Entner–Doudoroff, Embden–Meyerhof–Parnas, and pentose phosphate pathways*. J. Biol. Chem. 290, 25920–25932 (2015).
Article CAS PubMed PubMed Central Google Scholar
Weimer, A. et al. Systems biology of electrogenic Pseudomonas putida - multi-omics insights and metabolic engineering for enhanced 2-ketogluconate production. Microb. Cell Factories 23, 246 (2024).
Article CAS Google Scholar
Molina-Henares, M. A. et al. Identification of conditionally essential genes for growth of Pseudomonas putida KT2440 on minimal medium through the screening of a genome-wide mutant library. Environ. Microbiol. 12, 1468–1485 (2010).
Article CAS PubMed Google Scholar
Ingolia, N. T. Ribosome profiling: new views of translation, from single codons to genome scale. Nat. Rev. Genet. 15, 205–213 (2014).
Article CAS PubMed Google Scholar
Al-Bassam, M. M. et al. Optimization of carbon and energy utilization through differential translational efficiency. Nat. Commun. 9, 4474 (2018).
Article PubMed PubMed Central Google Scholar
Sen, P. & Orešič, M. Integrating omics data in genome-scale metabolic modeling: a methodological perspective for precision medicine. Metabolites 13, 855 (2023).
Article CAS PubMed PubMed Central Google Scholar
Passi, A. et al. Genome-scale metabolic modeling enables in-depth understanding of big data. Metabolites 12, 14 (2022).
Article CAS Google Scholar
Moyne, O. et al. Guild and niche determination enable targeted alteration of the microbiome. Preprint at https://doi.org/10.1101/2023.05.11.540389 (2023).
Parekh, S., Ziegenhain, C., Vieth, B., Enard, W. & Hellmann, I. The impact of amplification on differential expression analyses by RNA-seq. Sci. Rep. 6, 25533 (2016).
Article CAS PubMed PubMed Central Google Scholar
Re, A. Translational efficiency in gas-fermenting bacteria: adding a new layer of regulation to gene expression in acetogens. iScience 26, (2023).
Bagheri, A., Astafev, A., Al-Hashimy, T. & Jiang, P. Tracing translational footprint by Ribo-Seq: principle, workflow, and applications to understand the mechanism of human diseases. Cells 11, 2966 (2022).
Article CAS PubMed PubMed Central Google Scholar
Adeleye, S. A. & Yadavalli, S. S. Queuosine biosynthetic enzyme, QueE moonlights as a cell division regulator. PLoS Genet. 20, e1011287 (2024).
Article CAS PubMed PubMed Central Google Scholar
Ling, C. et al. Muconic acid production from glucose and xylose in Pseudomonas putida via evolution and metabolic engineering. Nat. Commun. 13, 4925 (2022).
Article CAS PubMed PubMed Central Google Scholar
Werner, A. Z. et al. Lignin conversion to β-ketoadipic acid by Pseudomonas putida via metabolic engineering and bioprocess development. Sci. Adv. 9, eadj0053 (2023).
Article CAS PubMed PubMed Central Google Scholar
Orth, J. D., Thiele, I., Palsson & B, Ø What is flux balance analysis?. Nat. Biotechnol. 28, 245–248 (2010).
Article CAS PubMed PubMed Central Google Scholar
Lázaro, J. et al. Combination of genome-scale models and bioreactor dynamics to optimize the production of commodity chemicals. Front. Mol. Biosci. 9, 855735 (2022).
Canto-Encalada, G. et al. Flux balance analysis of the ammonia-oxidizing bacterium Nitrosomonas europaea ATCC19718 unravels specific metabolic activities while degrading toxic compounds. PLoS Comput Biol. 18, e1009828 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kim, M., Rai, N., Zorraquino, V. & Tagkopoulos, I. Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli. Nat. Commun. 7, 13090 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gallego Romero, I., Pai, A. A., Tung, J. & Gilad, Y. RNA-seq: impact of RNA degradation on transcript quantification. BMC Biol. 12, 42 (2014).
Article PubMed PubMed Central Google Scholar
Xu, W., Tian, Y., Wang, S. & Cui, Y. Feature selection and classification of noisy proteomics mass spectrometry data based on one-bit perturbed compressed sensing. Bioinformatics 36, 4423–4431 (2020).
Article CAS PubMed Google Scholar
Prensner, J. R. et al. What can Ribo-Seq, immunopeptidomics, and proteomics tell us about the noncanonical proteome?. Mol. Cell. Proteom.22, 100631 (2023).
Article CAS Google Scholar
Ebrahim, A., Lerman, J. A., Palsson, B. O. & Hyduke, D. R. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7, 74 (2013).
Article PubMed PubMed Central Google Scholar
Ma, D. et al. Reliable and efficient solution of genome-scale models of metabolism and macromolecular expression. Sci. Rep. 7, 40863 (2017).
Article CAS PubMed PubMed Central Google Scholar
Latif, H. et al. A streamlined ribosome profiling protocol for the characterization of microorganisms. BioTechniques https://doi.org/10.2144/000114302 (2015).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
Article Google Scholar
Langmead, B., Wilks, C., Antonescu, V. & Charles, R. Scaling read aligners to hundreds of threads on general-purpose processors. Bioinformatics 35, 421–432 (2019).
Article CAS PubMed Google Scholar
Zhu, Q. et al. Phylogeny-aware analysis of metagenome community ecology based on matched reference genomes while bypassing taxonomy. mSystems 7, e00167–22 (2022).
Article PubMed PubMed Central Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research, and Genomic Science Program under Secure Biosystems Design Science Focus Area (SFA) contract number DE-AC36-08GO28308. We would like to thank Nidhi Pareddy for her contributions to the manual curation of the gene expression machinery in the ME-model. Furthermore, the authors thank Sherlyn Weng and Kankan Zhao for fruitful discussions.

Author information

Authors and Affiliations

Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0760, USA
Juan D. Tibocha-Bonilla
Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093-0412, USA
Vishant Gandhi & Karsten Zengler
Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0760, USA
Chloe Lieng, Oriane Moyne, Rodrigo Santibáñez-Palominos & Karsten Zengler
Center for Microbiome Innovation, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0403, USA
Karsten Zengler
Program in Materials Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0418, USA
Karsten Zengler

Authors

Juan D. Tibocha-Bonilla
View author publications
Search author on:PubMed Google Scholar
Vishant Gandhi
View author publications
Search author on:PubMed Google Scholar
Chloe Lieng
View author publications
Search author on:PubMed Google Scholar
Oriane Moyne
View author publications
Search author on:PubMed Google Scholar
Rodrigo Santibáñez-Palominos
View author publications
Search author on:PubMed Google Scholar
Karsten Zengler
View author publications
Search author on:PubMed Google Scholar

Contributions

J.T.B., R.S.P., and K.Z. conceived the study. J.T.B. and R.S.P. designed the modeling framework for reconstructing the ME-model of P. putida. J.T.B. reconstructed and curated the ME-model. C.L. and O.M. designed and optimized the Ribo-Seq library preparation for P. putida. V.G. performed the experiments and prepared the libraries. J.T.B. processed the raw sequencing reads and performed the subsequent computational analyses. All authors contributed to the writing of this manuscript. K.Z. supervised the study.

Corresponding author

Correspondence to Karsten Zengler.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary File 1

Supplementary File 2

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tibocha-Bonilla, J.D., Gandhi, V., Lieng, C. et al. Model of metabolism and gene expression predicts proteome allocation in Pseudomonas putida. npj Syst Biol Appl 11, 55 (2025). https://doi.org/10.1038/s41540-025-00521-1

Download citation

Received: 04 December 2024
Accepted: 20 April 2025
Published: 24 May 2025
Version of record: 24 May 2025
DOI: https://doi.org/10.1038/s41540-025-00521-1