Abstract
Signaling pathways are useful models for interpreting molecular data, but their coverage has long been constrained by classic biochemistry methods. The growing corpus of kinase-substrate interactions, coupled to phosphoproteomics improvements, pave the way to revisit classic signaling pathways. In this study, we explore context-specific signaling pathway inference from phosphoproteomics and kinase-substrate networks. Focusing on epidermal growth factor (EGF), we conduct a meta-analysis and generate three datasets representing the most comprehensive characterization of the EGF response to date. We infer kinase-kinase pathways and compare them to different ground truth sets. Literature-curated networks consistently yield the highest recovery of ground-truth interactions, with modest gains from network propagation methods. Up to 90% of interactions are absent from current ground truth sets, indicating many unexplored interactions supported by data and knowledge. Our results demonstrate the limitations of traditional views on signaling pathways and point to opportunities for generating better mechanistic hypotheses.
Data availability
All the data to reproduce the results presented in this study can be retrieved from Zenodo: [https://zenodo.org/records/18390833]. The mass spectrometry proteomics raw data generated in this study have been deposited to the ProteomeXchange Consortium via the PRIDE56 partner repository with the dataset identifier PXD056666.
Code availability
All the code to reproduce the results of this manuscript can be accessed at: [https://zenodo.org/records/18390833].
References
Koseska, A. & Bastiaens, P. I. Cell signaling as a cognitive process. EMBO J. 36, 568–582 (2017).
Lynch, T. J. et al. Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N. Engl. J. Med. 350, 2129–2139 (2004).
Davies, H. et al. Mutations of the BRAF gene in human cancer. Nature 417, 949–954 (2002).
Arakhamia, T. et al. Posttranslational modifications mediate the structural diversity of tauopathy strains. Cell 180, 633–644.e12 (2020).
Chuang, H.-C. et al. The kinase GLK controls autoimmunity and NF-κB signaling by activating the kinase PKC-θ in T cells. Nat. Immunol. 12, 1113–1118 (2011).
Garrido-Rodriguez, M., Zirngibl, K., Ivanova, O., Lobentanzer, S. & Saez-Rodriguez, J. Integrating knowledge and omics to decipher mechanisms via large-scale models of signaling networks. Mol. Syst. Biol. https://doi.org/10.15252/msb.202211036 (2022).
Tognetti, M. et al. Deciphering the signaling network of breast cancer improves drug sensitivity prediction. Cell Syst. 12, 401–418.e12 (2021).
Fröhlich, F., Gerosa, L., Muhlich, J. & Sorger, P. K. Mechanistic model of MAPK signaling reveals how allostery and rewiring contribute to drug resistance. Mol. Syst. Biol. https://doi.org/10.15252/msb.202210988 (2023).
Kustatscher, G. et al. An open invitation to the Understudied Proteins Initiative. Nat. Biotechnol. 40, 815–817 (2022).
Bortel, P. et al. Systematic optimization of automated phosphopeptide enrichment for high-sensitivity phosphoproteomics. Mol. Cell. Proteom. 23, 100754 (2024).
Skowronek, P. et al. Rapid and In-depth Coverage Of The (Phospho-)proteome With Deep Libraries And Optimal Window Design for dia-PASEF. Mol. Cell. Proteom. 21, 100279 (2022).
Mertins, P. et al. Reproducible workflow for multiplexed deep-scale proteome and phosphoproteome analysis of tumor tissues by liquid chromatography-mass spectrometry. Nat. Protoc. 13, 1632–1661 (2018).
Ochoa, D. et al. The functional landscape of the human phosphoproteome. Nat. Biotechnol. 38, 365–373 (2020).
Zhou, Z. et al. Phosformer: an explainable transformer model for protein kinase-specific phosphorylation predictions. Bioinformatics 39, btad046 (2023).
Johnson, J. L. et al. An atlas of substrate specificities for the human serine/threonine kinome. Nature 613, 759–766 (2023).
Yaron-Barir, T. M. et al. The intrinsic substrate specificity of the human tyrosine kinome. Nature 629, 1174–1181 (2024).
Needham, E. J., Parker, B. L., Burykin, T., James, D. E. & Humphrey, S. J. Illuminating the dark phosphoproteome. Sci. Signal. 12, eaau8645 (2019).
Invergo, B. M. et al. Prediction of signed protein kinase regulatory circuits. Cell Syst. 10, 384–396.e9 (2020).
Sriraja, L. O., Werhli, A. & Petsalaki, E. Phosphoproteomics data-driven signalling network inference: does it work? Comput. Struct. Biotechnol. J. 21, 432–443 (2023).
Türei, D. et al. Integrated intra- and intercellular signaling knowledge for multicellular omics analysis. Mol. Syst. Biol. https://doi.org/10.15252/msb.20209923 (2021).
Hijazi, M., Smith, R., Rajeeve, V., Bessant, C. & Cutillas, P. R. Reconstructing kinase network topologies from phosphoproteomics data reveals cancer-associated rewiring. Nat. Biotechnol. 38, 493–502 (2020).
Müller-Dott, S. et al. Comprehensive evaluation of phosphoproteomic-based kinase activity inference. Nat. Commun. 16, 4771 (2025).
Reddy, R. J. et al. Early signaling dynamics of the epidermal growth factor receptor. Proc. Natl. Acad. Sci. USA 113, 3114–3119 (2016).
Lancaster, N. M. et al. Fast and deep phosphoproteome analysis with the Orbitrap Astral mass spectrometer. Nat. Commun. 15, 7016 (2024).
Chen, H. et al. Omics analyses uncover host networks defining virus-permissive and -hostile cellular states. Mol. Cell Proteom. 24, 100966 (2025).
Tuechler, N. et al. Dynamic multi-omics and mechanistic modeling approach uncovers novel mechanisms of kidney fibrosis progression. Mol. Syst. Biol. 21, 1030–1065 (2025).
Babur, Ö et al. Causal interactions from proteomic profiles: molecular data meet pathway knowledge. Patterns 2, 100257 (2021).
Paull, E. O. et al. Discovering causal pathways linking genomic events to transcriptional states using Tied Diffusion Through Interacting Events (TieDIE). Bioinformatics 29, 2757–2764 (2013).
Giudice, G., Chen, H., Koutsandreas, T. & Petsalaki, E. phuEGO: a network-based method to reconstruct active signaling pathways from phosphoproteomics datasets. Mol. Cell. Proteom. 23, 100771 (2024).
Köksal, A. S. et al. Synthesizing signaling pathways from temporal phosphoproteomic data. Cell Rep. 24, 3607–3618 (2018).
Terfve, C. D. A., Wilkes, E. H., Casado, P., Cutillas, P. R. & Saez-Rodriguez, J. Large-scale models of signal propagation in human cells derived from discovery phosphoproteomic data. Nat. Commun. 6, 8033 (2015).
Lun, X.-K. et al. Analysis of the human kinome and phosphatome by mass cytometry reveals overexpression-induced effects on cancer-related signaling. Mol. Cell 74, 1086–1102.e5 (2019).
Hill, S. M. et al. Inferring causal molecular networks: empirical assessment through a community-based effort. Nat. Methods 13, 310–318 (2016).
Hosseini-Gerami, L. et al. Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis. BMC Bioinform. 24, 154 (2023).
Lo Surdo, P. et al. SIGNOR 3.0, the SIGnaling network open resource 3.0: 2022 update. Nucleic Acids Res. 51, D631–D637 (2023).
Milacic, M. et al. The Reactome Pathway Knowledgebase 2024. Nucleic Acids Res. 52, D672–D678 (2024).
Agrawal, A. et al. WikiPathways 2024: next generation pathway database. Nucleic Acids Res. 52, D679–D689 (2024).
Oliinyk, D. et al. µPhos: a scalable and sensitive platform for high-dimensional phosphoproteomics. Mol. Syst. Biol. 20, 972–995 (2024).
Oliinyk, D. et al. nanoPhos enables ultra-sensitive and cell-type resolved spatial phosphoproteomics. Preprint at bioRxiv https://doi.org/10.1101/2025.05.29.656770 (2025).
Savitski, M. M. et al. Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 346, 1255784 (2014).
Burtscher, M. L. et al. Network integration of thermal proteome profiling with multi-omics data decodes PARP inhibition. Mol. Syst. Biol. 20, 458–474 (2024).
Outhwaite, I. R., Singh, S., Berger, B. T., Knapp, S., Chodera, J. D. & Seeliger, M. A. Death by a thousand cuts through kinase inhibitor combinations that maximize selectivity and enable rational multitargeting. eLife 12, e86189 (2023).
Wiśniewski, J. R. & Gaugaz, F. Z. Fast and sensitive total protein and Peptide assays for proteomic analysis. Anal. Chem. 87, 4110–4116 (2015).
Leutert, M., Rodríguez-Mias, R. A., Fukuda, N. K. & Villén, J. R2-P2 rapid-robotic phosphoproteomics enables multidimensional cell signaling studies. Mol. Syst. Biol. 15, e9021 (2019).
Rappsilber, J., Mann, M. & Ishihama, Y. Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat. Protoc. 2, 1896–1906 (2007).
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
Kong, A. T., Leprevost, F. V., Avtonomov, D. M., Mellacheruvu, D. & Nesvizhskii, A. I. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat. Methods 14, 513–520 (2017).
Huber, W., von Heydebreck, A., Sültmann, H., Poustka, A. & Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 18, S96–S104 (2002).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 57, 289–300 (1995).
Metz, K. S. et al. Coral: clear and customizable visualization of human kinome data. Cell Syst. 7, 347–350.e1 (2018).
Hornbeck, P. V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–D520 (2015).
Badia-I-Mompel, P. et al. decoupleR: ensemble of computational methods to infer biological activities from omics data. Bioinform. Adv. 2, vbac016 (2022).
Steinegger, M. & Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 35, 1026–1028 (2017).
Rodriguez-Mier, P., Garrido-Rodriguez, M., Gabor, A. & Saez-Rodriguez, J. Unifying multi-sample network inference from prior knowledge and omics data with CORNETO. Nat. Mach. Intell. 7, 1168–1186 (2025).
Perez-Riverol, Y. et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 50, D543–D552 (2022).
Acknowledgements
MGR was supported through state funds approved by the State Parliament of Baden-Württemberg for the Innovation Campus Health + Life Science Alliance Heidelberg Mannheim. We thank Ana Mellado-Fuentes for her assistance with the HEK293T experiment and Attila Gabor for fruitful discussions. We thank the authors of studies re-analyzed in this manuscript for providing transparent access to their data. We acknowledge EMBL IT Services for support with high-performance computing.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
M.G.R.: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Project Administration, Software, Writing - Original Draft Preparation. C.P.: Conceptualization, Investigation, Methodology, Validation, Writing - Original Draft Preparation. M.L.B.: Investigation, Visualization. I.B.: Investigation, Visualization. P.R.M.: Methodology, Software, Formal analysis. S.M.D.: Software, Formal analysis. M.S.: Resources, Project administration, Writing - Original Draft Preparation, Supervision, Funding acquisition. J.S.R.: Resources, Project administration, Writing - Original Draft Preparation, Supervision, Funding acquisition.
Corresponding authors
Ethics declarations
Competing interests
J.S.R. reports funding from GSK, Pfizer, and Sanofi and fees/honoraria from Travere Therapeutics, Stadapharm, Astex, Pfizer, Grunenthal, Moderna, and Owkin. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Garrido-Rodriguez, M., Potel, C., Burtscher, M.L. et al. Benchmarking EGF signaling pathway inference using phosphoproteomics and kinase-substrate interactions. Nat Commun (2026). https://doi.org/10.1038/s41467-026-69332-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-69332-0