Abstract
Breast cancer is a heterogeneous disease with a high incidence, but its proteomes have not yet been thoroughly characterized. To construct a comprehensive dynamic network of breast cancer-related proteins, we integrated the whole-cell proteome (WCP), phospho-proteome, malonyl-proteome of breast cancer tumor tissues and adjacent healthy tissues. We identified 2,417 differentially expressed proteins (DEPs), 646 differentially phosphorylated proteins (DPPs), and 107 differentially malonylated proteins (DMPs). Functional enrichment analysis revealed that these differentially expressed proteins are involved in extracellular matrix (ECM) interactions and immune-related pathways. Protein‒protein interaction (PPI) analysis revealed posttranslational modification (PTM) crosstalk between proteins involved in phosphorylation and malonylation. The acetyltransferase EP300 and deacetylase HDAC1 are involved in the DPP network, whereas the phosphatase PKM is a hub protein in the DMP network. Kinase-substrate enrichment analysis (KSEA) revealed the activation of the kinases CSNK1D, ROCK1, ROCK2, and CDK2. Overall, this study provides a foundation for understanding the functions of phosphorylation and malonylation in breast cancer. It systematically reveals critical features of breast cancer, providing a resource for exploring PTM crosstalk within and across proteins involved in the disease.
Similar content being viewed by others
Introduction
Breast cancer is a major global health concern, representing a significant proportion of cancer cases in women1. It typically originates in breast tissue, specifically affecting the inner lining of milk ducts or the lobules2. Breast cancer can be classified into different subtypes on the basis of hormone receptor status and genetic characteristics, resulting in diverse pathophysiological and clinical profiles3. The etiology of breast cancer is multifactorial and involves both genetic and environmental factors4. Understanding the complexity of breast cancer is essential for developing effective prevention strategies, optimizing treatment approaches, and improving patient care.
Advances in high-throughput sequencing technologies have significantly enhanced our understanding of gene‒phenotype associations in tumor tissues. Comprehensive transcriptomic analyses of various tumor types have provided valuable insights into the mechanisms of tumor proliferation, invasion, and recurrence5,6,7. Despite this progress, it is important to note that the correlation between mRNA and protein abundance is only 0.48. This low correlation indicates that transcript levels alone are often insufficient to predict protein levels, complicating the understanding of the genotype‒phenotype relationship.
A comprehensive understanding of cancer development and progression requires in-depth proteomic research. Extensive proteomic data can ultimately guide the development of more effective therapeutic strategies, bridging the gap between transcriptomic data and clinical outcomes9. An Integrated analysis of transcriptome, proteomics and phosphor-proteomics clearly identified LIG1 as a triple-negative breast cancer chemotherapy-resistance and multicancer-type poor prognosis marker10. In a prospective study of deep-scale proteomics and post-translational modification (PTM) proteomics of breast cancer, researcher uncovered hypoacetylation leaded to increased activity of the glycolysis pathway in breast cancer tissue11.
PTMs are generally enzymatic modifications of proteins following protein biosynthesis that significantly impact cancer occurrence and progression by influencing key mechanisms such as signal transduction, protein stability, subcellular localization, protein interactions, and metabolic rates12,13. Although more than 300 types of PTMs are known14, only a few have been thoroughly investigated, including phosphorylation, acetylation, ubiquitination, and malonylation.
Phosphorylation, which occurs on serine, threonine, and tyrosine residues, is a common PTM involved in cell signaling, cell cycle control, and apoptosis15,16. It is one of the most extensively studied PTMs, with many proteins undergoing this modification. The phosphorylation of key proteins such as STAT317, RhoA18, and mTOR19 has been linked to cancer cell development, invasion, and drug resistance.
Malonylation is an emerging form of protein acylation that occurs on lysine residues20. Malonyl-CoA serves as a precursor for de novo fatty acid synthesis and a key inhibitor of fatty acid oxidation, as well as a precursor to malonylation21. Malonylation is closely related to energy metabolism, particularly fatty acid synthase and the TCA cycle22. In lipopolysaccharide (LPS)stimulated macrophages, malonyl-CoA increases the malonylation of GAPDH and stimulates GAPDH activity and mRNA binding capacity, which in turn promotes the production of proinflammatory cytokines23. In malonyl-CoA decarboxylase (MCD)-deficient diseases, elevated malonylation impairs mitochondrial function and fatty acid oxidation24. Malonylation of mTOR inhibits pathological angiogenesis, suggesting the potential for malonylation in tumor microangiogenesis25. NAD-dependent protein deacylase sirtuin-5 (SIRT5) demalonylates SDHA and TPI, contributing to drug resistance and recurrence in wild-type Kras colorectal cancer22. However, there is still a lack of research on the role of malonylation in breast cancer.
To further elucidate the mechanisms underlying breast cancer, we systematically integrated proteomics, succinylproteomics, and phosphoproteomics to identify critical features of proteins and posttranslational modifications. This comprehensive profiling could contribute to the development of novel biomarkers and targeted therapeutics.
Results
Overview of multilevel proteomics
To investigate the posttranslational modifications (PTMs) involved in breast cancer, we analyzed the changes in the whole-cell proteome (WCP), phospho-proteome, and malonyl-proteome of breast cancer tumor tissues and adjacent healthy tissues (Fig. 1a). All tumor samples and healthy tissues were distinguishable on the basis of the first two components (Fig. 1b, Supplementary Fig. S1a). Consistently, the correlation matrix of tumor and healthy tissues was significantly different, highlighting their distinct protein expression patterns (Fig. 1c, Supplementary Fig. S1b). To determine whether different PTMs target different proteins, we analyzed the overlap between proteins identified in the two PTM proteomes and WCP. As shown in Fig. 1d, approximately half of the proteins identified in the WCP did not contain PTM sites, whereas 461 proteins contained both PTMs.
Overview of multilevel proteomics. (a) Flowchart of three proteomes, created with BioGDP.com66; (b) PCA of WCP (n = 5); (c) PCC analysis of WCP (n = 5); (d) Venn diagrams of identified proteins in three proteomes, WCP: Whole-cell proteome; P: phospho-proteome; Ma: malonyl-proteome. T: breast cancer tumor tissues. N: adjacent healthy tissues.
Profiles of deps in the whole-cell proteome
In the WCP, we identified a total of 7,459 proteins, with 1,630 notably upregulated and 787 significantly downregulated in tumor tissues (Fig. 2a and Supplementary materials). The top 5 upregulated proteins were S100A8, S100A9, S100P, SPRR1B, and AZU1, whereas the top 5 significantly downregulated proteins were ZBTB2, OXTR, PLIN1, IGSF1, and DHRS2 (Fig. 2b). Subcellular localization revealed that the upregulated proteins were located primarily in the cytoplasm (27.83%), nucleus (26.47%), and extracellular space (14.86%), whereas the downregulated proteins were located mainly in the extracellular space (33.33%), cytoplasm (18.58%), and nucleus (17.68%) (Fig. 2c). The changes in the subcellular localization of proteins between breast cancer tissues and adjacent healthy tissues may indicate significant alterations in cellular functions.
Functional enrichment analysis of deps
Next, we performed functional enrichment analysis on the differentially expressed proteins (DEPs). KEGG enrichment analysis (Fig. 3a) revealed that the DEPs were enriched in immune regulation and cell adhesion-related pathways (glycosaminoglycan biosynthesis-keratan sulfate, ECM-receptor interaction). Similarly, GO enrichment analysis (Supplementary Fig. S2) revealed that the differentially expressed proteins were enriched in immune signal regulation (leukocyte proliferation, T-cell binding, and antigen binding) and extracellular matrix regulation processes (extracellular matrix, integrin binding, and extracellular matrix structural constituents).
Functional analysis of DEPs. (a) KEGG pathway analyses of the DEPs; (b) Statistical analysis chart of four groups of DEPs, which are divided according to fold change. Q1 (FC < 0.5), Q2(0.5 < FC < 0.667), Q3(1.5 < FC < 2.0), and Q4(FC > 2.0); (c) KEGG pathway analyses of the four groups; (d) Biological processes of the four groups, blue represents high enrichment, white represents low enrichment, “*” p value < 0.05, “* *” p value < 0.01, “* * *” p value < 0.001.
To further investigate the potential biological functions of these proteins, we divided the differentially expressed proteins into four groups on the basis of FC (Fig. 3b). Group Q1 (FC < 0.5) included 391 proteins, group Q2 (0.5 < FC < 0.667) included 396 proteins, group Q3 (1.5 < FC < 2.0) included 973 proteins, and group Q4 (FC > 2.0) included 657 proteins. Proteins in groups Q1 and Q2 were enriched mainly in the ECM‒receptor interaction, focal adhesion, and gap junction pathways, whereas proteins in groups Q3 and Q4 were enriched mainly in immune-related pathways such as the phagosome, lysosome, and infection pathways (Fig. 3c‒d). These findings suggest that DEPs with high fold changes may play important roles in remodeling of the tumor immune microenvironment.
Profiles of DPPs and motif analysis of phosphorylation sites
In the phospho-proteome, we identified a total of 4,870 proteins (569 upregulated and 787 downregulated) and 18,108 phosphorylation sites (787 upregulated and 1,154 downregulated) (Fig. 4a and Supplementary materials). The top 5 upregulated DPP included SNAP23, PDCD11, SNPH, TANC2, and TNKS1BP1, whereas the top 5 downregulated DPP included TRIM16, TUBA1B, SMC3, IRF2BPL, and ADD3 (Fig. 4b).
Characteristics of the differentially phosphorylated proteins (DPPs). (a) Statistical analysis of DPPs and phosphorylation sites; (b) Volcano plot of DPPs; (c) Motif analysis of phosphorylation from positions − 6 to + 6 (the phosphorylation sites were set to zero). Left: serine phosphorylation site. Right: Threonine phosphorylation site; (d) Subcellular localization prediction of DPPs, Left: Upregulated DPPs; Right: Downregulated DPPs.
The phosphorylation sites exhibited a certain degree of conservation, and we used Motif-X to analyze the frequency of sequence motifs around the modification site (Fig. 4c). For serine phosphorylation sites, proline (P) and glutamic acid (E) were notably enriched upstream and downstream of the modification site. Arginine (R) substantially increased in frequency from positions − 6 to -1, whereas aspartic acid (D) and serine (S) were considerably more frequent downstream. For threonine phosphorylation sites, proline (P) and serine (S) were enriched both upstream and downstream, whereas phenylalanine (F) and leucine (L) were notably infrequent. These findings suggest a preference for certain kinases in breast cancer cells. Then, we predicted the subcellular localization of the DPPs (Fig. 4d). Both upregulated and downregulated proteins showed similar localization patterns, primarily in the nucleus (up 51.49%, down 50.91%), followed by the cytoplasm (up 21.79%, down 21.3%), plasma membrane (up 10.37%, down 9.35%), and mitochondria (up 4.75%, down 5.45%). More than half of the DPPs are localized in the nucleus, indicating that phosphorylation may play a significant role in the nucleus. Protein phosphorylation represent a mechanism that is frequently employed by cells to regulate transcription factor activity26. Phosphorylation of CREB at Ser133 in response to cAMP stimulus is sufficient to induce target gene expression27. The nuclear factor-kappa B (NF-κB) family of transcription factors has recently emerged as a major regulator of the growth and elaboration of neural processes. NF-κB is activated by a variety of extracellular signals, and either promotes or inhibits growth depending on the phosphorylation status of the p65 NF-κB subunit28.
In our study, we found some transcription factors, such as CREB1, STAT3, FOXA1, were DPPs. For example, STAT3 phosphorylation at tyrosine 705 (pSTAT3 Y705) was downregulated. STAT3 dimerization, nuclear translocation and transcriptional activity required for pSTAT3 Y70529.
Profiles of DMPs and motif analysis of Kma sites
In the malonyl proteome, we identified 107 differentially malonylated proteins (DMPs), with 116 upregulated and 110 downregulated malonylation-lysine (Kma) sites (Fig. 5a and Supplementary materials). Top upregulated DMPs included EZR, DDTL and EIF3CL, whereas the top downregulated DMPs included COL1A1, CAVIN2, FBN1, ANXA2 and APOD (Fig. 5b). Notably, the EZR had multiple Kma sites among these DMPs.
Characteristics of the DMPs. (a) Statistical analysis of differentially malonylated proteins (DMPs) and malonylation sites; (b) Volcano plot of DMPs; (c) Motifs analysis of malonylation from positions − 10 to + 10 (the malonylation sites were set to zero); (d) Subcellular localization prediction of DMPs, up: Upregulated DMPs; Down: Downregulated DMPs.
Motif analysis revealed that alanine (A), glycine (G), and valine (V) were enriched around the Kma sites, whereas proline (P), arginine (R), tryptophan (W), and glutamine (Q) were less common near the Kma sites (Fig. 5c). These findings indicate distinct amino acid preferences around Kma sites, suggesting that different malonylation enzymes may be involved in regulating the malonylation of breast cancer. The partial biochemical preferences of enzymes for specific substrates may be determined by the residues surrounding the modification sites30. These findings indicate distinct amino acid preferences around Kma sites, which may suggest different malonylation enzymes may be involved in regulating the malonylation of breast cancer.
Moreover, we predicted the subcellular localization of DMPs (Fig. 5d) and found that most DMPs were mainly located in the cytoplasm (up 45.31%, down 53.49%), nucleus (up 21.88%, down 11.63%), mitochondria (up 7.81%, down 13.95%) and endoplasmic reticulum (up 6.25%, down 4.65%). Reports suggest that Kma primarily regulates cellular glycolipid metabolic activities29,31,32. However, the cytoplasm is the main site of metabolism, which aligns with our localization prediction. Our localization prediction found that DMP was main located in the cytoplasm, and enrichment analysis showed DMPs were enriched in galactose metabolism, glycolysis metabolism and glycerolipid metabolism (Fig. 6d).
Functional enrichment analysis of DPPs and DMPs
Similarly, we classified DPPs and DMPs into four groups on the basis of their fold changes (Fig. 6a-b). Clustering analysis (Fig. 6c) further revealed that the DPPs in the Q1 (FC < 0.5) and Q2 (0.5 < FC < 0.667) groups were enriched mainly in the glycolysis/ gluconeogenesis, thyroid hormone synthesis, and insulin secretion pathways related to carbohydrate and lipid metabolism; the DPPs in the Q3 (1.5 < FC < 2.0) group were enriched in inositol phosphate metabolism, PPAR signaling pathway, and viral carcinogenesis; and the proteins in the Q4 (FC > 2.0) group were enriched in the base excision repair, beta-alanine metabolism, and fat digestion and absorption pathways.
DMPs in the Q1 group were enriched mainly in the ECM‒receptor interaction, focal adhesion, and PI3K‒Akt signaling pathways; those in the Q2 group were enriched in metabolic pathways such as galactose metabolism, glycolysis and glycerolipid metabolism; those in the Q3 group were not enriched in any pathway; and those in the Q4 group were enriched in base excision repair, NF-κB pathway, and pyrimidine metabolism (Fig. 6d). Given that tumor cells are characterized by mismatch repair defects and abnormal NF-κB activation, the Q4 group of DMPs might represent potential targets for breast cancer treatment.
Network analysis of PTMs
We performed PPI network analysis on the DPPs and DMPs. As shown in Fig. 7a, the DPP network consists of 1006 nodes and 4683 edges, whose hub proteins include ACTB, CTNNB1, EP300, HDAC1, and CREBBP. We found that kinases such as MAPK3 and PIK3R1, as well as the acetyltransferase EP300 and deacetylase HDAC1, interact with many DPPs. The DMP network, composed of 74 nodes and 122 edges, which are hub proteins, were identified primarily as enzymes, including TPI1, LDHA, and TKT. Interestingly, one of the hub protein phosphatases, PKM, presented increased malonylation levels (Fig. 7b). These results suggest potential crosstalk between phosphorylation and malonylation modifications.
Moreover, we performed Kinase-Substrate Enrichment Analysis (KSEA) for analysis kinase activity, which scores each candidate kinase based on the relative hyperphosphorylation or dephosphorylation of most substrates. KSEA results indicated marked activation of the kinases CSNK1D, ROCK1, ROCK2, and CDK2 and inhibition of CSNK2A2, PDPK1, CLK1, CSNK2A1, and SRC (Supplementary Fig. S3, Supplementary Table S2). The regulatory enzymes involved in the Kma modification are not yet fully understood. Our proteomics data revealed no significant difference in the total protein expression of the acetyltransferases EP300 and CBP or the deacetylases SIRT2 and SIRT5 between tumor and adjacent normal tissues. Malonylation levels were also unchanged. However, we observed decreased phosphorylation of acetyltransferase EP300 at S2309 and increased phosphorylation of SIRT2 at S368. However, the specific mechanisms and effects of these changes require further investigation.
Discussion
Proteomics has provided insights into breast cancer. Anurag et al. performed proteogenomics to explore the molecular basis for triple-negative breast cancer and reported the activation of DNA repair, E2F targets, the G2-M checkpoint, interferon-gamma signaling, and immune checkpoint pathways10. In another study, researchers performed proteomics on formaldehyde-fixed, paraffin-embedded (FFPE) samples of malignant tumors and normal breast tissue samples. They reported that the main categories of proteins upregulated in tumor samples were RNA-binding proteins, heat shock proteins, and DNA repair proteins33. In our study, we identified a total of 7,459 proteins and found significant upregulation of S100 family proteins (S100A8, S100A9, and S100P). The S100 family has been shown to play significant roles in immune responses34,35. A previous study reported that S100A8/A9 may promote breast cancer cell growth by remodeling the tumor immune microenvironment36. High expression of S100A8/A9 predicts poor prognosis in breast cancer patients37. Our functional enrichment analysis also revealed that the DEPs were significantly enriched in terms of immune regulation and the extracellular matrix.
The enrichment of these differential pathways may be closely related to significant changes in the tumor immune microenvironment (TIME) of breast cancer, indicating extensive immune cell infiltration in tumor tissues. Different types of infiltration have distinct effects on tumor tissues. For example, extensive infiltration of Treg cells hinders the host antitumor immune response and promotes tumor growth and metastasis38, whereas increased infiltration of helper TH1 cells and cytotoxic T lymphocytes can enhance the effectiveness of immunotherapy39.
Posttranslational modifications (PTMs) are ubiquitous and play crucial roles in various cellular functions40. Phosphorylation is a common, reversible, and highly regulated PTM of proteins. Studies have shown that the proto-oncogene Src promotes the development of breast cancer by directly binding and phosphorylating lipin-141. AXL stimulates the phosphorylation of a network of focal adhesion (FA) proteins, promoting tumor invasion42. Malonylation is a newly discovered PTM that has been shown to be involved in energy metabolism, macrophage inflammatory signaling, and chondrocyte metabolic regulation43. In mouse models of cancer cachexia, protein lysine malonylation was systemic declined44. Lysine malonylation of TKT inhibited its activity, the activation of TKT could accelerate CRC tumorigenesis in mice model45.
We performed PTM proteomics to investigate the characteristics of phosphorylation and malonylation in breast cancer and revealed its potential significance. There have been no reports about malonylation in breast cancer.
In the phosphorylation proteome, we identified a total of 4,870 phosphorylated proteins, among which PDCD11, SNPH, and TNAC2 presented significantly increased phosphorylation levels. PDCD11 can promote the G2/M transition in colorectal cancer (CRC) cells, facilitating tumor progression46. However, the mechanisms by which phosphorylation regulates PDCD11 function remain unclear and require further investigation.
Syntaphilin (SNPH) functions as an anchor, immobilizing mitochondria47. Phosphorylation by PAK5 allows SNPH to mobilize damaged mitochondria for replacement with healthy mitochondria, promoting neuron survival and regeneration48. The increased phosphorylation of SNPH in tumor cells may be related to a higher mitochondrial turnover rate. While the mitochondrial network is mobile and not evenly distributed throughout the cell49, researchers have reported that breast cancer cells with more mitochondria at their leading edge exhibit faster migration velocities50. Thus, we propose that phosphorylating SNPH to unanchor mitochondria might promote their accumulation at the leading edge of tumor cells, potentially regulating tumor invasion. This assumption requires further investigation.
Our KSEA results revealed significant upregulation of casein kinase 1 delta (CSNK1D) activity in breast cancer tissues. CSNK1D plays a tumor-promoting role in various cancers. Inhibition of CSNK1D can delay the growth of breast tumors51. Its substrates include TACC1, TACC2, XPO1, MTA2, and SP1. The transforming acidic coiled coil (TACC) family comprises tumor-promoting factors that are involved in the microtubule-dependent coupling of the nucleus and the centrosome. TACC2 is a prognostic predictor, with increased expression correlating with poor prognosis in breast cancer patients52,53. Phosphorylation is suggested as a mechanism for TACC activation54,55. In our study, TACC2 has 15 phosphorylation sites, with significant increases in phosphorylation at S2321 and S2512. These findings suggest that TACC2 may be a potential therapeutic target, with S2321 and S2512 as active sites.
In the malonylation proteomics analysis, 107 DMPs were identified, with 116 upregulated Kma sites and 110 downregulated Kma sites. Among these, Ezrin (EZR) has multiple Kma sites, suggesting that it may play a significant role in cellular processes. EZR, a member of the Ezrin-Radixin-Moesin (ERM) family, interacts with cell surface adhesion molecules such as CD44, E-cadherin, and ICAM-1, regulating cell morphology and migration56. EZR has been identified as an oncogene57 and is associated with high levels of metastatic behavior in various types of cancer56. Currently, there are no reports of Kma in the EZR. Many acyltransferases typically promote cancer progression20, and the acyltransferases largely overlap20. Moreover, five HDAC inhibitors are currently used in clinical tumor treatments22. We hypothesize that the malonylation of EZR also promotes breast cancer proliferation and invasion.
KEGG enrichment analysis revealed that phosphorylation and malonylation modifications primarily impact tumor cell glycolipid metabolism, ECM interactions, and mismatch repair. These findings indicate that phosphorylation and malonylation may be associated with typical tumor cell features, including glycolipid metabolism, increased de novo fatty acid synthesis, and mismatch repair deficiencies.
In summary, our study provides a proteomic overview of breast cancer and clarifies the changes in phosphorylation and malonylation in breast cancer tissues and paracancerous tissues. In addition, our study identified several novel protein markers that may have potential value in clinical diagnosis. Our study offers new insights into the complexity of breast cancer and lays the foundation for future research on breast cancer.
Materials and methods
Sample collection and clinical information
Fresh tumor samples were collected from breast cancer patients at the First Affiliated Hospital of Bengbu Medical University. Breast cancer tumor tissues and adjacent nontumorous tissues from five patients were used for proteomic analysis. This study was approved by the Bengbu Medical College Ethics Committee (No. 2021KY165), all methods were carried out in accordance with relevant guidelines and regulations, and all patients provided written informed consent. Supplementary Table S1 provides the basic clinical information of all patients.
Proteomics and PTM proteomics
The samples were ground into a powder via liquid nitrogen and then lysed in four volumes of lysis buffer (1% Triton X-100, 1% protease inhibitors, 3 µM TSA, 50 mM NAM). The lysates were sonicated (227 W, 2 s on, 3 s off, for 10 min) and then centrifuged at 12,000 × g for 10 min at 4 °C. The supernatant was transferred to a new centrifuge tube, and the protein concentration was determined via a BCA assay, adjusting to a uniform concentration. TCA was added to a final concentration of 20%, the mixture was vortexed, and the mixture was allowed to precipitate at 4 °C for 2 h. The mixture was centrifuged at 4500 × g for 5 min, the supernatant was discarded, and the pellet was washed with prechilled acetone 2–3 times. After air-drying the pellet, TEAB was added to a final concentration of 200 mM, the pellet was resuspended, and trypsin was added at a 1: 50 ratio overnight digestion. Dithiothreitol (DTT) was added to the peptides at a final concentration of 5 mM, and the mixture was incubated at 56 °C for 30 min. IAA was then added to the mixture at a final concentration of 11 mM, and the mixture was incubated in the dark at room temperature for 15 min.
Phosphorylated peptide enrichment
Peptide mixtures were first incubated with IMAC microsphere suspensions with vibration in loading buffer (50% acetonitrile/0.5% acetic acid). To remove the nonspecifically adsorbed peptides, the IMAC microspheres were sequentially washed with 50% acetonitrile/ 0.5% acetic acid and 30% acetonitrile/ 0.1% trifluoroacetic acid. To elute the enriched phosphopeptides, elution buffer containing 10% NH4OH was added, and the enriched phosphopeptides were eluted via vibration. The supernatant containing phosphopeptides was collected and lyophilized for LC‒MS/MS analysis.
Malonylated peptide enrichment
The peptide was dissolved in an IP buffer solution (100 mM NaCl, 1 mM EDTA, 50 mM Tris-HCl, 0.5% NP-40, pH 8.0). Then, the supernatant was transferred to prewashed antibody resin (PTM904, Jingjie Biotechnology) and placed on a rotating shaker at 4 °C with gentle shaking and an overnight incubation. After incubation, the resin was washed four times with IP buffer solution and twice with deionized water. Finally, the resin-bound peptide segment was eluted three times with 0.1% trifluoroacetic acid as the eluent. The eluent was collected and drained by vacuum freezing, followed by desalting according to the C18 ZipTips instructions before vacuum freezing and draining again for LC‒MS analysis.
LC‒MS/MS analysis
The peptides were dissolved in mobile phase A via liquid chromatography and separated via the Easy-nLC1000 ultrahigh-performance liquid phase system. The mobile phase consisted of solvent A (0.1% formic acid, 2% acetonitrile/in water) and solvent B (0.1% formic acid in acetonitrile). Peptides were separated with following gradient: 0–18 min, 2-22%B;18–22 min, 22-35%B; 22–26 min, 35-90%B; 26–30 min, 90%B. The peptide segment was subsequently injected into the capillary ion source for ionization and introduced into the timsTOF Pro mass spectrometer for data acquisition. The ion source voltage was set to 1.75 kV, and time-of-flight (TOF) analysis was employed to detect and analyze both the parent ions of the peptide segment and its secondary fragments. The data acquisition mode utilized is known as data-independent parallel cumulative serial fragmentation (DIA-PASEF) mode. After setting the scanning range of primary mass spectrometry from 300 to 1500 m/z, samples were collected in PASEF mode 20 times for each primary mass spectrometry scan. The full MS scan was set as100-1700 (MS/MS scan range) and 22 PASEF (MS/MS mode) -MS/MS scans were acquired per cycle. The MS/MS scan range was set as 395–1395 and isolation window was set as 20 m/z.
Database search
The DIA data were retrieved via the Spectronaut (v17) search engine with default parameters. The database utilized was “Homo_sapiens_9606_SP_20220107.fasta,” comprising 20,389 protein sequences. Trypsin/P was specified as the cleavage enzyme, allowing up to 2 missed cleavages. Carbamidomethylation of cysteine was designated as a fixed modification. Variable modifications included methionine oxidation (proteome, phospho-proteome), N-terminal protein acetylation (proteome, malonyl-proteome, phospho-proteome), lysine malonylation (malonyl-proteome), and phosphorylation of serine, threonine, and tyrosine (phospho-proteome). An inverse library was also incorporated to calculate the false discovery rate (FDR) resulting from random matching. The FDR for protein, peptide, and peptide-spectrum match (PSM) identification was set at 1%. Search results were filtered at 1% FDR, and the peptide confidence level was set for at least one unique peptides per protein for protein identification. The corresponding spectral library was imported into Spectronaut (v.17.0) software to predicts the retention time by nonlinear correction and searched against with DIA data.
Bioinformatics methods
The fold change (FC) was used to calculate the difference between tumor tissue (T) and adjacent nontumorous tissues (N). Fisher’s exact test was used to analyze the significance of functional enrichment of differentially expressed proteins. For phospho-proteome and malonyl-proteome, FC was calculated as the ratio of the mean relative quantification values for each modification site between two groups. The relative quantification values of the modification site are divided by the relative quantification values of its corresponding protein to eliminate the influence of protein expression on the modification expression. Differentially expressed proteins (DEPs), differentially phosphorylated proteins (DPPs) and differentially malonylated proteins (DMPs) were identified by an absolute value of FC > 1.5 and a p value < 0.05, which divided identified protein into 4 groups according to fold change, Q1 (FC < 0.5), Q2 (0.5 < FC < 0.667), Q3 (1.5 < FC < 2.0), and Q4 (FC > 2.0). Gene Ontology (GO) annotation was performed via eggNOG-mapper (v2.1.6)58. Subcellular localization prediction of the proteins was performed via WoLF PSORT and PSORTb (v3.0). Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and protein‒protein interaction network analysis were conducted via Diamond software (v2.2.11.149)59,60,61,62,63, all differentially expressed protein database accession or sequence were searched against the STRING database for protein-protein interactions. Only interactions between the proteins belonging to the searched data set were selected, thereby excluding external candidates. STRING defines a metric called “confidence score” to define interaction confidence; we fetched all interactions that had a confidence score > 0.7 (high confidence).
Motif analysis utilized the MoMo analysis tool64, employing the motif-x algorithm, to examine the motif characteristics of modification sites. The analysis focuses on peptide sequences consisting of 10 amino acids both upstream and downstream of all identified modification sites, while for phosphorylation modifications, sequences of 6 amino acids upstream and downstream are utilized. The background for this analysis comprises peptide sequences that include 10 amino acids upstream and downstream of all potential modification sites within the species, with the same 6 amino acids upstream and downstream specification for phosphorylation modifications. A characteristic sequence form is deemed a motif of the modified peptides when the number of peptides exhibiting this specific sequence exceeds 20, and the p-value from the statistical test is less than 0.000001. Kinase-Substrate Enrichment Analysis (KSEA) was performed via the KSEA app65.
For further hierarchical clustering based on differentially expressed protein functional classification. We collated all the categories obtained after enrichment along with their p values, and then filtered for those categories which were at least enriched in one of the clusters with p value < 0.05. This filtered p value matrix was transformed by the function x = − log10 (p value). These p values were then clustered by one-way hierarchical clustering (Euclidean distance, average linkage clustering) in Genesis.
Data availability
The mass spectrometry proteomics data have been deposited to the iProX database (https://www.iprox.cn/page/DSV021.html;?url=1750831793988uOpk) with password tJ8g.
References
BRAY, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries [J]. Cancer J. Clin. 68 (6), 394–424 (2018).
LEE, S. SHANTI A. Effect of exogenous pH on cell growth of breast Cancer cells [J]. Int. J. Mol. Sci., 22(18):9910. (2021).
SCHMIDT, B. et al. Resveratrol, Curcumin and Piperine alter human glyoxalase 1 in MCF-7 breast Cancer cells [J]. Int. J. Mol. Sci., 21(15):5244. (2020).
LICHTENSTEIN, P. et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from sweden, denmark, and Finland [J]. N. Engl. J. Med. 343 (2), 78–85 (2000).
LIU, S. Q. et al. Single-cell and spatially resolved analysis uncovers cell heterogeneity of breast cancer [J]. J. Hematol. Oncol. 15 (1), 19 (2022).
PRADAT, Y. et al. Integrative Pan-Cancer genomic and transcriptomic analyses of refractory metastatic Cancer [J]. Cancer Discov. 13 (5), 1116–1143 (2023).
KEELAN, S. et al. Dynamic epi-transcriptomic landscape mapping with disease progression in Estrogen receptor-positive breast cancer [J]. Cancer communications (London, England), 43(5): 615–619. (2023).
LIU, Y. & BEYER, A. On the dependency of cellular protein levels on mRNA. Abundance [J] Cell. 165 (3), 535–550 (2016).
FERRAROTTO, R. et al. Proteogenomic Analysis of Salivary Adenoid Cystic Carcinomas Defines Molecular Subtypes and Identifies Therapeutic Targets [J]. Clin. cancer research: official J. Am. Association Cancer Res., 27(3): 852–864. (2021).
ANURAG, M. et al. Proteogenomic markers of chemotherapy resistance and response in Triple-Negative breast Cancer [J]. Cancer Discov. 12 (11), 2586–2605 (2022).
KRUG, K. et al. Proteogenomic landscape of breast Cancer tumorigenesis and targeted therapy [J]. Cell 183 (5), 1436–56e31 (2020).
CHEN, L., LIU, S. & TAO, Y. Regulating tumor suppressor genes: post-translational modifications [J]. Signal. Transduct. Target. Therapy. 5 (1), 90 (2020).
GEFFEN, Y. et al. Pan-cancer analysis of post-translational modifications reveals shared patterns of protein regulation [J]. Cell, 186(18): 3945–67e26. (2023).
YAO, S. U. N. L. & GUO, Z. Z, Et Al. Comprehensive Analysis of the Lysine Acetylome in Aeromonas hydrophila Reveals cross-talk between Lysine Acetylation and Succinylation in LuxS [J]81229–1239 (Emerging microbes & infections, 2019). 1.
BILBROUGH, T. & PIEMONTESE, E. Dissecting the role of protein phosphorylation: a chemical biology toolbox [J]. Chem. Soc. Rev. 51 (13), 5691–5730 (2022).
VIATOUR, P. et al. Phosphorylation of NF-kappaB and IkappaB proteins: implications in cancer and inflammation [J]. Trends Biochem. Sci. 30 (1), 43–52 (2005).
ZHENG, L. et al. Roquin-1 Regulates Macrophage Immune Response and Participates in Hepatic Ischemia-Reperfusion Injury [J]. Journal of immunology (Baltimore, Md: 2020, 204(5): 1322-33. (1950).
ZHANG, X. et al. Matrine Inhibits the Development and Progression of Ovarian cancer by Repressing cancer Associated Phosphorylation Signaling Pathways [J]10770 (Cell death & disease, 2019). 10.
CHEN, X. et al. Activation of mTOR Mediates hyperglycemia-induced Renal Glomerular Endothelial Hyperpermeability Via the RhoA/ROCK/pMLC Signaling Pathway [J]13105 (Diabetology & metabolic syndrome, 2021). 1.
FU, Y. et al. Oncometabolites drive tumorigenesis by enhancing protein acylation: from chromosomal remodelling to nonhistone modification [J]. J. Experimental Clin. cancer Research: CR. 41 (1), 144 (2022).
PENG, C. et al. The first identification of lysine malonylation substrates and its regulatory enzyme [J]. Mol. Cell. Proteomics: MCP. 10 (12), M111012658 (2011).
SHANG, S., LIU, J. & HUA, F. Protein acylation: mechanisms, biological functions and therapeutic targets [J]. Signal. Transduct. Target. Therapy. 7 (1), 396 (2022).
GALVáN-PEñA, S. et al. Malonylation of GAPDH is an inflammatory signal in macrophages [J]. Nat. Commun. 10 (1), 338 (2019).
COLAK, G. et al. DAI L,. Proteomic and Biochemical Studies of Lysine Malonylation Suggest Its Malonic Aciduria-associated Regulatory Role in Mitochondrial Function and Fatty Acid Oxidation [J]. Molecular & cellular proteomics: MCP, 14(11): 3056-71. (2015).
BRUNING, U. et al. Impairment of angiogenesis by fatty acid synthase Inhibition involves mTOR malonylation [J]. Cell Metabol., 28(6): 866 – 80.e15. (2018).
WHITMARSH A J, DAVIS, R. J. Regulation of transcription factor function by phosphorylation [J]. Cell. Mol. Life Sci. 57 (8–9), 1172–1183 (2000).
DüSTER, R. et al. Structural basis of Cdk7 activation by dual T-loop phosphorylation [J]. Nat. Commun. 15 (1), 6597 (2024).
GUTIERREZ, H. & DAVIES, A. M. Regulation of neural process growth, elaboration and structural plasticity by NF-κB [J]. Trends Neurosci. 34 (6), 316–325 (2011).
HONG J Y, CHUNG K S, SHIN, J. S. et al. The Anti-Proliferative activity of the hybrid TMS-TMF-4f compound against human cervical Cancer involves apoptosis mediated by STAT3 inactivation [J]. Cancers, 11(12): 1927. (2019).
VINOGRADOV A A, CHANG J S, ONAKA, H. et al. Accurate models of substrate preferences of Post-Translational modification enzymes from a combination of mRNA display and deep learning [J]. ACS Cent. Sci. 8 (6), 814–824 (2022).
NISHIDA Y, RARDIN M J, CARRICO, C. et al. SIRT5 regulates both cytosolic and mitochondrial protein malonylation with Glycolysis as a major target [J]. Mol. Cell. 59 (2), 321–332 (2015).
CARRICO, C. et al. The mitochondrial acylome emerges: proteomics, regulation by sirtuins, and metabolic and disease implications [J]. Cell Metabol. 27 (3), 497–512 (2018).
YEN T Y, WONG, R. et al. Over-Expression of RNA processing, heat shock, and DNA repair proteins in breast tumor compared to normal tissue [J]. Proteomics, 15-16: e2000044. (2020).
BRESNICK A R, WEBER D J & ZIMMER D B S100 proteins in cancer [J]. Nat. Rev. Cancer. 15 (2), 96–109 (2015).
CHEN, Y. et al. S100A8 and S100A9 in Cancer [J]. Biochim. Et Biophys. Acta Reviews cancer. 1878 (3), 188891 (2023).
CHEN, Y. et al. Critical Role of the MCAM-ETV4 axis Triggered by Extracellular S100A8/A9 in Breast cancer Aggressiveness [J]21627–640 (Neoplasia (New York, NY), 2019). 7.
ZHANG, X. et al. S100A8/A9 as a risk factor for breast cancer negatively regulated by DACH1 [J]. Biomark. Res. 11 (1), 106 (2023).
FATTORI, S. & ROUX, H. Therapeutic targeting of Tumor-Infiltrating regulatory T cells in breast Cancer [J]. Cancer Res. 82 (21), 3868–3879 (2022).
HUANG, D. et al. Targeting regulator of G protein signaling 1 in tumor-specific T cells enhances their trafficking to breast cancer [J]. Nat. Immunol. 22 (7), 865–879 (2021).
TAN, X. et al. SUMO1 promotes mesangial cell proliferation through inhibiting autophagy in a cell model of IgA nephropathy [J]. Front. Med. 9, 834164 (2022).
SONG, L. et al. Proto-oncogene Src links lipogenesis via lipin-1 to breast cancer malignancy [J]. Nat. Commun. 11 (1), 5842 (2020).
ABU-THURAIA A, GOYETTE M A, BOULAIS, J. et al. AXL confers cell migration and invasion by hijacking a PEAK1-regulated focal adhesion protein network [J]. Nat. Commun. 11 (1), 3586 (2020).
FADó, R., RODRíGUEZ-RODRíGUEZ, R. & CASALS, N. The return of malonyl-CoA to the brain: cognition and other stories [J]. Prog. Lipid Res. 81, 101071 (2021).
KOJIMA, Y. et al. Decreased liver B vitamin-related enzymes as a metabolic hallmark of cancer cachexia [J]. Nat. Commun. 14 (1), 6246 (2023).
WANG, H. L. et al. Sirtuin5 protects colorectal cancer from DNA damage by keeping nucleotide availability [J]. Nat. Commun. 13 (1), 6121 (2022).
DING, L. et al. Programmed cell death 11 modulates but not entirely relies on p53-HDM2 loop to facilitate G2/M transition in colorectal cancer cells [J]. Oncogenesis 12 (1), 57 (2023).
CHEN, Y. & SHENG, Z. H. Kinesin-1-syntaphilin coupling mediates activity-dependent regulation of axonal mitochondrial transport [J]. J. Cell Biol. 202 (2), 351–364 (2013).
HUANG, N. et al. Reprogramming an energetic AKT-PAK5 axis boosts axon energy supply and facilitates neuron survival and regeneration after injury and ischemia [J]. Curr. Biology: CB, 31(14): 3098 – 114.e7.(2021).
SCHULER, M. H. et al. Miro1-mediated mitochondrial positioning shapes intracellular energy gradients required for cell migration [J]. Mol. Biol. Cell. 28 (16), 2159–2169 (2017).
LIBRING, S. & BERESTESKY E D, REINHART-KING C, A. The Movement of Mitochondria in Breast Cancer: Internal Motility and Intercellular Transfer of Mitochondria [J] (Clinical & experimental metastasis, 2024).
ROSENBERG L H, LAFITTE, M. Therapeutic targeting of casein kinase 1δ in breast cancer [J]. Sci. Transl. Med. 7 (318), 318ra202 (2015).
CHENG, S. et al. Transforming acidic coiled-coil-containing protein 2 (TACC2) in human breast cancer, expression pattern and clinical/prognostic relevance [J]. Cancer Genomics Proteom. 7 (2), 67–73 (2010).
ONODERA, Y. et al. TACC2 (transforming acidic coiled-coil protein 2) in breast carcinoma as a potent prognostic predictor associated with cell proliferation [J]. Cancer Med. 5 (8), 1973–1982 (2016).
BARROS T P, KINOSHITA, K. et al. Aurora A activates D-TACC-Msps complexes exclusively at centrosomes to stabilize centrosomal microtubules [J]. J. Cell Biol. 170 (7), 1039–1046 (2005).
NELSON K N, MEYER A N, SIARI, A. et al. Oncogenic gene fusion FGFR3-TACC3 is regulated by tyrosine phosphorylation [J]. Mol. cancer Research: MCR. 14 (5), 458–469 (2016).
XU, J. & ZHANG, W. EZR promotes pancreatic cancer proliferation and metastasis by activating FAK/AKT signaling pathway [J]. Cancer Cell Int. 21 (1), 521 (2021).
SONG, Y. et al. Ezrin mediates invasion and metastasis in tumorigenesis: A review [J]. Front. Cell. Dev. Biology. 8, 588801 (2020).
HUERTA-CEPAS, J. & SZKLARCZYK, D. EggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses [J]. Nucleic Acids Res. 47 (D1), D309–d14 (2019).
KANEHISA, M., SATO, Y. & KAWASHIMA, M. KEGG mapping tools for Uncovering hidden features in biological data [J]. Protein Science: Publication Protein Soc. 31 (1), 47–53 (2022).
GABLE A L, S. Z. K. L. A. R. C. Z. Y. K. D. et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets [J]. Nucleic Acids Res. 47 (D1), D607–d13 (2019).
KANEHISA, M. et al. KEGG: biological systems database as a model of the real world [J]. Nucleic Acids Res. 53 (D1), D672–d7 (2025).
KANEHISA M. Toward Understanding the origin and evolution of cellular organisms [J]. Protein Science: Publication Protein Soc. 28 (11), 1947–1951 (2019).
KANEHISA, M. KEGG: Kyoto encyclopedia of genes and genomes [J]. Nucleic Acids Res. 28 (1), 27–30 (2000).
CHENG, A. et al. MoMo: discovery of statistically significant post-translational modification motifs [J]. Bioinf. (Oxford England), 35(16): 2774–2782. (2019).
WIREDJA D D, KOYUTüRK, M. The KSEA app: a web-based tool for kinase activity inference from quantitative phosphoproteomics [J]. Bioinf. (Oxford England). 33 (21), 3489–3491 (2017).
JIANG, S. et al. Generic diagramming platform (GDP): a comprehensive database of high-quality biomedical graphics [J]. Nucleic Acids Res. 53 (D1), D1670–d6 (2025).
Acknowledgements
The study was supported by 2023 Bengbu Medical University Science and Technology Project Natural Science key project (2023byzd055).
Author information
Authors and Affiliations
Contributions
Chenxu Guo and Jun Qian had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Mingliang Zhang and Xin Jin were involved in the study concepts and design. All authors (Chenxu Guo,Mingliang Zhang,Xin Jin,Chao Zhu,Rui Xu,Jiahe Sun and Jun Qian) involved in the acquisition, analysis and interpretation of data. Mingliang Zhang and Xin Jin supervised the analysis. Chenxu Guo and Jun Qian involved in the draft of the manuscript. All authors read, critically revised and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Guo, C., Zhang, M., Jin, X. et al. Integrated proteome, phospho-proteome and malonyl-proteome revealed a molecular alteration of breast cancer. Sci Rep 15, 34004 (2025). https://doi.org/10.1038/s41598-025-11573-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-11573-y