Introduction

Traditional cancer treatment strategies, such as chemotherapy and radiotherapy, have been the cornerstone of cancer care for decades. While effective, these approaches often come with significant short- and long-term side effects, including toxicity, fatigue, myalgias, cognitive dysfunction, and infertility1. Immunotherapy is often considered the “fifth pillar” of new-age therapies that aim to eliminate cancer cells via immune system activation. This can be achieved by targeting the tumor antigens or by passively enhancing the anti-tumor immune response by administering cultured lymphocytes2. Recent clinical studies have confirmed the efficacy and safety of immunotherapy in managing large cohorts of patients3,4,5. Several targeted immunotherapies (e.g., drug-target conjugates), therapeutic cancer vaccines (e.g. T-VEC), adoptive targeted therapies (e.g., CAR T-cells), and immunomodulators such as interleukins have been approved and marketed for use in different cancer types2,6. Interleukins are small glycoproteins that can have a pleiotropic effects on the immune system. They bind to receptors on the cell surface that can aid in immune cells’ differentiation, survival, and proliferation7.

Currently, two interleukin-based drugs are approved by the Food and Drug Administration (FDA) for treating human malignancies. This includes the interferon-alpha-based drug Roferon-A and IL-2 based Aldesleukin or Proleukin8. IL-2 is an early cytokine that was commercially approved to treat metastatic renal carcinoma and metastatic melanoma by the FDA. IL-2 is a 15.5 KDa alpha-helical cytokine that belongs to the gamma chain family of immune modulators. CD4 + T-cells chiefly produce IL-2, but this can also be made by CD8 + T-cells, dendritic cells, and natural killer cells (NK-cells)3. While IL-2 is known for its immune-stimulating activity at low concentrations, it also promotes the expansion of regulatory T-cells and immune tolerance at higher concentrations, reflecting its dual role in immune regulation. IL-2 works through the JAK-STAT pathway and helps maintain and differentiate CD4 + T-cells in various subsets, including Th1, Th2, and Th17. It can promote CD8 + T-cell and NK-cell cytotoxic activity and the generation of memory cells9. The excellent immune-stimulating activity of IL-2 has led to the development of several treatment regimens. IL-2 has been used in clinical settings as a monotherapy agent and combined with several other therapeutic regimes. The low dose of IL-2 combined with interferon-alpha shows more significant therapeutic benefits for patients with renal cell carcinoma10. IL-2, when combined with chemotherapeutic agents, including cisplatin and dacarbazine, has been extensively investigated in patients with metastatic melanoma11. The IL-2, combined with targeted therapies such as gefitinib, showed a positive response in non-small cell lung cancer (NSCLC) patients12. Combining IL-2 with a therapeutic cancer vaccine also indicates a synergistic effect in advanced melanoma13.

IL-2 therapy has a number of limitations, including a short half-life; high doses of IL-2 lead to toxicity, vascular leakage, and hypotension14,15,16. The clinicians and researchers employed several strategies for overcoming the challenges faced by IL-2 therapy. One such strategy widely employed by researchers in literature to improve the efficacy of IL-2 therapy is the generation of IL-2 inducing mutant peptides. The mutant version of IL-2 inducing peptides reportedly has low toxicity and can activate Natural-killer cells (NK-cells) without producing a high level of pro-inflammatory cytokines, thus preventing vascular leakage7,17. There are other mutants of IL-2 inducing peptides that can activate immune cells without pro-inflammatory activity and thus are free from toxicity-related problems17. These mutant versions of IL-2 inducing peptides are known as “Superkine’’ because of their enhanced antitumor property. These points highlight that the mutant version of natural IL-2 inducing peptides possesses a high therapeutic index. Thus, generating an improved version of IL-2 has become an important area of research. However, identifying and generating such mutants in clinical setups is time-consuming and cost-intensive. In-silico computation methods can help scientists and clinicians in this regard. Several past methods have also been developed that targeted and utilized the therapeutic potential of interleukin-based therapy for managing various human malignancies. CytoPred is one such method that is developed for the identification and classification of cytokines18. In addition, methods have been developed for inducing specific types of cytokines, including IL4pred19IL10Pred20IFNepitope21and IL6pred22 for IL-4, IL-10, IFN-gamma, and IL-6, respectively. Despite the huge therapeutic potential, no method is available in the literature for identifying and generating peptides that can induce the natural secretion of IL-2. In the present study, a computational method for predicting and designing peptides that can potentially induce IL-2 cytokine has been developed.

Results

In this study, three distinct datasets were used to develop the prediction algorithm – main dataset, alternate dataset 1 and alternate dataset 2. The distribution of peptides with respect to the MHC allele in our main dataset is available in Supplementary File 1 (S1.xlsx). All analyses and model building were conducted on these datasets. Based on this comprehensive analysis, different prediction models were built, trained, and externally validated to assess their performance in predicting IL-2 inducing and non-inducing peptides.

Dataset length distribution analysis

Both positive and negative datasets were subjected to length distribution analysis. The length distribution analysis of both the positive and negative datasets of the main dataset is provided in Fig. 1. The distribution analysis reveals that positive/IL-2 inducing peptides predominantly consist of peptide lengths ranging from 12 to 20, with a length of 15 as the most abundant. In negative/IL-2 non-inducing peptides, the peptides were mainly 15, 17, and 20-length amino acid sequences. The distribution analysis of the other two datasets has been reported in the Supplementary File 2: Figure SF1 and Figure SF2.

Fig. 1
figure 1

Histogram plot showing the frequency distribution of peptide length of the main dataset.

Average amino acid composition analysis

The positive and negative datasets were analysed for their average amino acid composition. In the main positive dataset, amino acid residues Ala, Phe, Leu, Pro, and Tyr are highly significant (adjusted p-value < 0.05), while in the main negative dataset, the residues Cys, Asp and Thr are significant. The average amino acid composition analysis for the main dataset is provided in Fig. 2, along with the adjusted p-value, and the other two datasets are provided in Supplementary File 2: Figure SF3 and Figure SF4. Asterisks (*) denote residues showing statistically significant differences after Benjamini-Hochberg correction for multiple comparisons (adjusted p < 0.05).

Fig. 2
figure 2

Bar plot of the average amino acid composition in the IL-2 inducers and non-inducers of the main dataset with adjusted p-values using Benjamini-Hochberg indicating significance (* shows the significant amino acid).

Positional preference analysis

The TSL reveals the positional preference of amino acids for positive and negative datasets. In the main positive dataset, IL-2 inducing peptides show a strong enrichment for hydrophobic residues like L, G, Y, and F, and positively charged residues like R and K at specific positions. The depleted amino acids, such as T, D, S, and E, suggest that IL-2-inducing peptides avoid hydrophilic and acidic residues at these positions, as shown in Fig. 3. The positional preference analysis for the other two datasets is provided in Supplementary File 2: Figure SF5 and Figure SF6.

Fig. 3
figure 3

Two-sample logo illustrating positional amino acid conservation in IL-2 inducers versus non-inducers for the main dataset. Significant residues (p < 0.05) were identified using a position-wise two-sample t-test.

Alignment-based approaches

Motif search

The MERCI software has been utilised to determine the motifs present exclusively in the IL-2 inducing peptides but not in IL-2 non-inducing peptides. Similarly, the motifs that are exclusive to IL-2 non-inducing peptides were computed. The different parameters and classification methods (NONE/BETS-RUSSELL/KOOLMAN) available in MERCI software have been utilised to differentiate the motifs into IL-2 inducing and non-inducing peptides. Among these, for the main dataset, the “NONE” classification scheme with fp 10 was selected as the best parameter because of its high accuracy despite having low coverage. The results of the top 10 motifs and the number of sequences from the independent datasets that occurred in the positive and negative classes of the main dataset are presented in Table 1. It is revealing that the motifs were hydrophobic in nature and also exclusive to IL-2 inducers. This result was also seen during the positional preference analysis. The results of motif coverage of the other two datasets were presented in Supplementary File 1 (S2.xlsx).

Table 1 Top 10 IL-2 inducer motifs of the main dataset and the number of sequences in which they occurred.

BLAST search

The blastp-short was applied to perform a similarity search against a training dataset (main dataset) of IL-2 inducing and IL-2 non-inducing peptides. The search used blastp-short with an e-value range from 10− 7 to 10− 1 (Table 2). Blastp-short achieved optimal performance at an e-value of 10− 3, correctly identifying 2100 IL-2 induced peptides and 402 IL-2 non-induced peptides, with 272 incorrect hits. E-values lower than 10− 3 did not provide adequate sequence coverage, while values higher than 10− 3 exhibited a higher error rate. The complete results for BLAST from e-values 10− 7 to 10− 1 for the other two datasets are detailed in the Supplementary File 1 (S3.xlsx).

Table 2 BLAST results for test datasets searched against the training datasets (main dataset).

AI-based classification methods

ML models

Various ML-based prediction models were trained, including DT, RF, KNN, MLP, ET, SVR, XGB and Lasso. First, the features of IL-2 and IL-2 non-inducing peptides using Pfeature were computed. Using only DPC as the feature, the ET method yielded the best results with an AUC of 0.81 and an MCC of 0.45 on the test data of the main dataset. Length was used as an additional feature to recognise the importance of peptide length in MHC antigen presentation and IL-2 induction. The ET-based model using these hybrid features (DPC and peptide length) achieved a higher AUC of 0.82 and an MCC of 0.48 on the test set. The details for the best classifiers against each feature set for the independent dataset of the main dataset have been provided in Table 3. The statistical details for different classifiers using various feature sets for all three datasets are in the Supplementary File 1 (S4, S5 and S6.xlsx).

Table 3 The test data performance for the best classifiers developed using various types of peptide features for the main dataset (only the best models are shown).

DL models

Using the TabNet classification model, a maximum AUC of 0.65 and an MCC of 0.20 was achieved on the test data of the main dataset with Compositional Enhance Transition and Distribution (CeTD) as a feature.

For the 1D CNN model, an AUC of 0.71 and an MCC of 0.32 were achieved on the test data of the main dataset using DPC-length features. The top 3 best performances for the main dataset, using the independent dataset, have been provided in Table 4, and the detailed results for the TabNet and CNN models applied to IL-2 sequences for the other two datasets are provided in Supplementary File 1 (S7.xlsx).

Table 4 The top 3 best feature performances of the DL in the main dataset.

LLM models

For this study, a pre-trained large language protBERT model was utilised. The model was fine-tuned for specific tasks by adjusting the number of epochs. Among the epochs tested, epoch 5 provided the best results, with a maximum AUC of 0.69 and an MCC of 0.21 for the main dataset.

Furthermore, since embeddings generated by the fine-tuned model can serve as valuable features, these features were extracted and applied in various ML algorithms. The ET method was the best-performing model, achieving an AUC of 0.71 and an MCC of 0.31. The performance of the Fine-tuned and best classifier model for the main dataset has been provided in Table 5, and the details results for this LLM model for each dataset are given in Supplementary File 1. (S8.xlsx)

Table 5 The performance of the fine-tuned and best classifier model for the main dataset.

Feature selection

Four feature selection methods – mrMR, SVC-L1, RFE, and SHAP were applied to the DPC-length feature on each dataset. The best model’s AUC for the main dataset decreased with all three methods when 10, 100, and 200 features were selected. When 200 features were selected, the model’s AUC remained at approximately 0.82 for each method. The Performance metrics results of our feature selection approach on the main dataset using SVC-L1, mRMR, RFE, and SHAP have been provided in Table 6. The selected features were reported in the Supplementary File 1 (S9.xlsx).

Table 6 Performance metrics results of our feature selection approach on the main dataset using SVC-L1, mRMR, RFE and SHAP.

Ensemble method

To enhance the predictive capability of our model, we employed an ensemble approach in this study. The highest AUC and MCC were achieved using the ensemble method compared to all other methods, including ML, DL, and LLM on the main dataset. Combining our top-performing model, DPC-length, with ET, along with motif scores using fp set to 10 and “NONE” as the classification method, an AUC of 0.84 and an MCC of 0.51 was obtained. We have tried other classification methods also, but this approach surpassed all other methods evaluated, highlighting its effectiveness in predicting biologically significant IL-2 inducers. The performance of the top 10 motifs on fp 10 developed using hybrid features (DPC-LEN and motif score) on all three datasets can be found in Table 7. The statistical details of various classifiers built using the hybrid feature set (DPC + Length) have been presented in Supplementary File 1 for all the datasets (S10, S11 and S12.xlsx). The comparative performance of ML, DL, LLM and Ensemble models is provided in Table 8, highlighting the best-performing approach across all three datasets.

Table 7 The performance of the top 10 motifs on fp 10 developed using hybrid features (DPC-LEN and motif score) on all three datasets.
Table 8 Comparative performance of ML, DL, LLM, and ensemble models across the three datasets.

Benchmarking

To properly evaluate the performance and significance of our IL2pred method, we conducted comprehensive benchmarking against three established cytokine prediction tools: IL6pred, IL4pred and IL10pred19,20,22. This comparative analysis is important for validating that our method achieves comparable performance to existing state-of-the-art cytokine predictors and demonstrating that IL-2 induction represents a distinct prediction task requiring specialised modelling. The comparative performance metrics are provided in Table 9. To further analyse the biological differences between cytokine-inducing peptides, we generated an average amino acid composition difference plot comparing IL-2 inducers with the benchmark data. The average amino acid composition difference plot for benchmark data has been provided in Fig. 4. Additionally, we performed pairwise statistical comparisons of amino acid compositions between IL-2, IL-4, IL-6, and IL-10 using the Mann-Whitney U test. These results provide further evidence of significant compositional differences, supporting the need for a dedicated IL-2 predictor. The detailed statistical results have been provided in Supplementary File 1. (S13.xlsx)

Fig. 4
figure 4

Average amino acid composition difference plot comparing IL-2, IL-4, IL-6, and IL10 inducer.

Table 9 Performance comparison of cytokine prediction methods on independent dataset.

Methods

Datasets

Main dataset

A total of 8596 experimentally validated IL-2 inducing and non-inducing peptides were extracted from the largest repository of the immune epitope database (IEDB), filtering for MHC binders from any host organism that was experimentally confirmed to either induce or not induce IL-2 production. Of these, 4475 peptides were MHC binders, which can trigger IL-2 secretion as measured by different immunological assays. These epitopes were termed IL-2 inducing peptides and grouped under a positive set. We also extract 4121 MHC binding peptides that do not trigger the IL-2 secretion and are termed non-inducers. The MHC-binding peptides that do not induce IL-2 are called non-inducers and are grouped under a negative set. Literature evidence suggests that peptides of length between 8 and 25 are most suitable for MHC antigen processing and presentation. Thus, all peptides of length below 8 and above 25 were removed. Additionally, all the redundant peptides were removed. The final main dataset consists of 3429 IL-2 inducing and 3145 non-inducing peptides. One of the major features of our main dataset is all peptides are experimentally validated.

Alternate dataset 1

Our main dataset has all MHC binders, which means models developed on the main dataset are only suitable to MHC binders. In case the user does not know whether a given peptide is an MHC binder or a non-binder, then the model developed on the main dataset cannot be used. In order to overcome this limitation, we changed our negative dataset of non-induces from MHC binders to non-binders. In the Alternate dataset, we extracted and selected 3429 non-MHC-binding peptides from IEDB and assigned them as non-inducers. Finally, our Alternate dataset 1 contains 3429 MHC binding IL-2 inducing peptides as positive peptides and 3429 MHC non-binding IL-2 non-inducing peptides as negative peptides. Models developed on the Alternate dataset 1 are suitable for predicting IL-2 inducing peptides in MHC non-binders.

Alternate dataset 2

Our models developed on our main dataset are suitable for predicting IL-2 inducing peptides in MHC binding peptides. Similarly, our models developed on Alternate dataset 1 are suitable to predict IL-2 inducing peptides in MHC non-binding peptides. In case the user has no idea whether their peptide is an MHC binder or a non-binder, then the above datasets are not suitable. In this study, we proposed alternate dataset 2 that contains 3429 MHC binding IL-2 inducing peptides, referred to as positive peptides and 3429 IL-2 non-induces. In Alternate Dataset 2, IL-2 non-inducers contain a mixture of MHC binders and non-binders, which do not induce IL-2.

Length distribution and composition analysis

To better understand the characteristics of the peptides in both the positive and negative datasets, a comprehensive analysis was performed on their length distribution and amino acid composition. This analysis was carried out using custom Python scripts, which were designed to generate bar plots for visual representation.

Positional preference analysis

In this analysis, a Two Sample Logo (TSL) was constructed to compare the proportional representation or relative abundance of amino acid residues between IL-2-inducing (positive) peptides and non-inducing (negative) peptides. The analysis was focused on identifying the preference for specific amino acids at particular positions within peptide sequences to distinguish between the two peptide groups. Notably, the first eight positions represent the N-terminal residues, while the final eight positions correspond to the C-terminal residues of the peptides.

Alignment-based approach

Motif search

Identification of the motifs within peptides is crucial in annotating their function. The MERCI software was used to identify specific motifs in both positive and negative datasets23. The software was implemented in two steps; firstly, the motifs for a positive dataset were extracted by providing the IL-2 inducing peptides as positive and non-inducing peptides as negative sets. In the second iteration, the motifs were retrieved for the negative dataset by inputting IL-2 non-inducing peptides as positive and IL-2-inducing peptides as negative sets. The motifs were calculated using the earlier approach in both positive and negative datasets. The different classification methods (NONE/BETTS-RUSSELL/KOOLMAN) were used to identify motifs in a mutually exclusive manner with the help of MERCI motif analyses. NONE applies no grouping; BETTS-RUSSELL categorizes amino acids into polar, hydrophobic, and small groups, while KOOLMAN focuses on aliphatic, aromatic, and other properties. The peptides containing unique motifs from both sets were screened to understand the overall coverage of the various motifs in the complete data.

BLAST search

This study utilised blastp-short to annotate IL-2 peptides based on their similarity to IL-2-inducing or non-inducing sequences24. Initially, a database was constructed using the “makeblastdb” command for IL-2 peptide sequences in the training dataset. The self-hits were excluded from the training dataset results and considered the top hit after removing them. The first hit was considered for the independent dataset to calculate results at various e-values.

AI-based classification method

Feature estimation

The different features of peptides present in both positive and negative datasets have been calculated. These calculated features are employed in developing ML-based prediction method development. For the feature generation, the Pfeature web server25 was used, and the embeddings were generated from the pre-trained protBERT model26. With the help of Pfeature, 10,000 peptide descriptors were calculated for both positive and negative datasets. Pfeature includes (I) Composition-based descriptors – Amino Acid Composition (AAC), Dipeptide Composition (DPC), Repetitive Residue Information (RRI), Physico-Chemical Properties (PCP), Distance Distribution of Residues (DDR); (II) Binary profiles – Amino acid based binary profile (AAB); and LLM include Embeddings of Large Language Models trained on ProtBERT developed by Rostlab.

Prediction models

ML models

Various ML-based algorithms have been implemented to classify IL-2 inducing and non-inducing peptides. The “scikit-learn” Python package was used for the classification. The classification algorithm includes - decision trees (DT), random forest (RF), multi-layer perceptron (MLP), eXtreme gradient boosting (XGBoost), support vector with the kernel as a radial basis (SVR), ExtraTreesClassifier (ET) and LASSO. Hyperparameter tuning was implemented using the grid search CV technique with “ROC” as the optimization metric. The DT algorithm was based on the non-parametric supervised algorithm; RF is an ensemble-based method that fits numerous decision trees to predict the outcome of the dependent variable; KNN is an instance-based learning algorithm, and XGBoost is a tree-boosting classification algorithm based on an iterative search approach for making the final prediction.

DL models

For the DL approach, two distinct models were applied: first, a 1D CNN model tailored for processing sequential data and extracting hierarchical features along the sequence, and second, TabNet, a modern neural network architecture designed explicitly for tabular data27. The 1D CNN Model is particularly effective for sequential data processing, capturing patterns and dependencies along the peptide sequences. On the other hand, TabNet excels in tabular data by dynamically selecting and reasoning from features using sequential attention, making it suitable for our IL-2 inducing and non-inducing peptides dataset. Each model was fine-tuned independently on the datasets by adjusting hyperparameters to optimize their performance for the classification task.

LLM models

LLM are excellent at text classification because they deeply understand language. Trained on large amounts of text, they can be designed for specific tasks like classifying sequences, as was done in this research. LLMs are flexible and are particularly good at extracting important text features, allowing them to make accurate classifications even with limited data. This study used LLMs to differentiate between IL-2 and non-IL-2 sequences. The protBERT model was chosen and fine-tuned to match the characteristics of the datasets. After this fine-tuning, the model predicted whether each sequence belonged to the IL-2 or non-IL-2 category.

Feature selection

As the literature suggests that not all descriptors are effective for developing machine learning models, three techniques were employed to select the best features: minimum Redundancy - Maximum Relevance (mRMR), SVC-L128 and Recursive feature elimination (RFE). Feature selection was performed on DPC with length. The top 10, 100, and 200 features were computed using both methods on the datasets. After the feature selection step, all the features mentioned above were tested independently.

Ensemble method

In this approach, the best model obtained was combined with motif information from MERCI for reliable and biologically significant prediction of IL2-inducers. Along with the prediction models, the query peptide was searched against the test data with different thresholds for false positive (fp) values; if any motif was found in the peptide, the probability score of test data was increased by 0.5 for IL-2 inducing peptides and in the case of IL-2 non-inducers, the score was decreased by 0.5. This gives the probability of a sequence belonging to a specific class instead of a binary outcome.

Cross-validation

For developing the ML, DL, and LLM models, the standard protocols were followed as in previous studies [31498794]. Each dataset was divided into an 80:20 split for training and external validation, respectively. The main dataset contained 5,259 peptides for training and 1,315 peptides for testing. The alternate dataset 1 included 5,486 peptides for training and 1,372 for testing, while the alternate dataset 2 consisted of 5,487 peptides for training and 1,372 for testing. The different classifiers were trained and evaluated using a five-fold cross-validation method, a widely accepted technique for optimizing model parameters and performance. In the five-fold cross-validation, the training dataset was divided into five equal parts; iteratively, four parts were used for training and one for testing to fine-tune the model parameters. This process was repeated five times, ensuring each subset was used for both training and testing. All the classifiers were implemented using an in-house python script. Performance evaluation was based on metrics such as specificity, sensitivity, accuracy, MCC, and AUC. The statistical performance-evaluation parameters of the ML models are explained below:

$$\text{Sensitivity} =\:\frac{TP}{\left(TP+FN\right)}$$
(1)
$$\text{Specificity} =\:\frac{TN}{\left(TN+FP\right)}$$
(2)
$$\text{Accuracy} = \:\frac{TP+TN}{\left(TP+TN+FP+FN\right)}$$
(3)
$$\text{MCC}=\:\frac{TP\times\:TN-FP\times\:FN}{\sqrt{\left(TP+FP\right)\left(TP+FN\right)\left(TN+FP\right)\left(TN+FN\right)}}$$
(4)

Architecture of Web-server

A web server named “IL2pred” (https://webs.iiitd.edu.in/raghava/il2pred/) was developed to predict the IL-2 inducing and non-inducing peptides. The web server’s front end was developed using HTML5, JAVA, CSS3, and PHP scripts. It is based on responsive templates that adjust the screen based on the device’s size. It is compatible with almost all modern mobile, tablet, and desktop devices. The overall workflow of the complete methodology is summarized below in Fig. 5.

Fig. 5
figure 5

Schematic representation of the workflow of the study.

Modules and functionalities of IL2pred web-server

This web server is compatible with all sorts of devices, viz. desktops, tablets, and phones, and hence provides an interactive and better experience to the users. Built with Python 3.12.2, the server utilizes essential scientific libraries including scikit-learn (v1.5.2), Pandas, and NumPy for efficient computational operations. The webserver has four main modules – (1) Prediction; (2) Design; (3) Protein Scan; (4) Motif Scan. The “Prediction module” allows users to identify the peptides with IL-2 inducing potential. Users of the web server can submit the peptides of interest in FASTA format as well as upload the FASTA format files from their local machine. The results of the user-defined query are displayed on the HTML page as IL2-inducer and IL-2-non-inducer, along with the prediction score and selected properties. The results of the user-defined query can also be downloaded from the web server in the CSV format. The “Design module” is of utmost importance for a biologist who wishes to design new, potent peptides with enhanced IL-2 inducing potential. The input for the design module is a single-line peptide sequence file, which can either be pasted in the box provided on the module page or locally uploaded onto the web server site. Once the user-defined query is submitted on the web server, the algorithm implemented on the “Design Module” generates all possible mutants of the given query peptide sequence and then ranks them based on their IL-2 inducing potential. The ranking algorithm is of greater importance as it ranks the newly generated single-letter mutant peptides for their ability to induce IL-2. Thus, it helps biologists narrow down their choice of selecting the best mutant peptide with enhanced IL-2 inducing ability. The prediction score and the other predicted metrics are displayed on the web server site, which can be downloaded as a CSV file for further use. The “Protein Scan” module lets users predict IL-2 inducing regions within a given protein sequence. Users can submit the protein sequence of their interest in a single-line format. The algorithm generates all possible fragments of the user-defined length and then predicts the IL-2 inducing potential of all generated fragments overlappingly. This will help in the identification of IL-2 rich and scarce regions in a given protein sequence and thus help biologists in selecting and prioritizing appropriate regions of protein for their study the last model is the Motif scan, which will allow users to find crucial motifs present in the query protein sequence to find out its role as an IL-2 inducer or IL-2 non-inducer. The results from this module can be downloaded in CSV format. The user-friendly web server, along with pip package and standalone maintained in GitHub, is available at (https://webs.iiitd.edu.in/raghava/il2pred ).

Discussion

The high-throughput studies have significantly enriched our knowledge regarding the tumor onset to progression. However, overall patient survival remains dismal, and finding novel therapeutics continues to be a major challenge. Evidence from pre-clinical and clinical studies advocates activating the immune system to treat human malignancies7. Recently, various peptide-based prediction models have been developed using machine learning methods such as random forest, support vector machine, and deep neural networks. Notably, several of these models focus on the identification of interleukin-inducing peptides, including IL-6, IL-10, and IL-4 inducing peptides. Pro-inflammatory cytokines are mediators of various biological processes controlled by the body. Uncontrolled levels of these cytokines can lead to many pathological conditions, such as cancer and autoimmunity. Cytokine therapy has emerged to increase anti-tumor immunity using lymphocyte activators such as IL-2. IL-2 is critical to the survival, growth, and activation of T-cells and NK cells; it showed clear anti-tumor effects. Unfortunately, it had severe side effects associated with it, such as vesicular leakage, edema, and hypotension. However, considering its importance in immunotherapy, novel mutants of IL-2, such as FSD13, were designed to handle the side effects to a large extent and showcased improved anti-tumor effects with limited toxicity29. As mentioned above, IL-2 is one such cytokine that can boost the antitumor immune response by maintaining and promoting the activity of CD4 + T-cells and cytotoxic CD8 + T-cells. Thus, identifying peptides or antigenic regions that can boost the production of IL-2 would become an important consideration while designing cancer treatment regimens and vaccine candidates.

The present study is a systematic attempt to make an in-silico model for predicting the IL-2 inducing capability of the peptide/antigenic region. The in-silico model is developed on the sequence features of IL-2 inducing and non-IL-2 inducing peptides extracted from the largest repository of experimentally validated immune epitopes database. Notably, all peptides in this dataset are known MHC binders, which may limit the model’s applicability to cases where MHC-binding status is already established. To overcome this constraint and enhance the model’s generalizability, we curated two additional datasets incorporating peptides with varying degrees of MHC-binding specificity. Literature evidence suggests that the length of the peptide can affect its binding to the MHC complex and, thus, immune activation30. The distribution of peptide size in the main dataset revealed distinct variations between IL-2 inducers and non-inducers. Both classes of peptides ranged from 8 to 25 amino acids, but a highly prominent peak appeared at 15 residues, particularly among the non-inducers. Notably, IL-2 inducers were slightly shorter in peptide length on average. This finding implies that the shorter peptides are more likely to be structurally preferred for IL-2 induction, possibly because they can bind more effectively to MHC or are better at activating T-cells. In contrast, more representative of non-inducers are the longer sequences, and may have flanking regions that may interfere with antigen presentation or lower general immunogenicity. Such differences in the frequency distribution of peptide lengths highlight the importance of sequence length as a cytokine induction determinant. The compositional analysis reveals that IL-2 inducing peptides are enriched in Ala, Phe, Leu, Pro and Tyr residues. Past studies also reveal that peptides enriched in Ala, Phe, and Leu residues have increased anti-tumor capabilities31 as they can induce apoptosis in cancer cells via modulation of autophagy. Thus, we can suggest that sequence composition can also be used to discriminate IL-2 inducing peptides from the non-IL-2 inducing peptides to some extent.

In addition to the above, the development of the IL-2 prediction models was also carried out using several other sequence features on multiple machine learning-based algorithms. It is observed that the DPC-based ET-model can discriminate between the IL-2 inducing and non-IL-2 inducing peptides with good accuracy. However, the hybrid model developed on DPC and length of peptides with motif score performed better than the DPC-length model alone in terms of balanced model performance measures. Both models were incorporated into the web server. We believe the developed tool will help the scientific community better understand IL-2 inducing/non-IL-2 inducing peptides. However, with the increased number of organism-specific datasets, the applicability of the developed model can further be improved. We anticipate that the developed tool may be of great use to the scientific community for identifying IL-2 inducing/non-IL-2 inducing peptides from the proteomes for improving the current cancer immunotherapeutics. Also, the developed tools can generate the mutant peptides that are predicted to have IL-2 inducing potential.

Conclusion

Past studies strongly reveal that direct administration of IL-2 may have several toxic effects. The capability of an antigenic region or mutant peptide to induce the IL-2 response is of great significance for activating the immune response towards malignant tissue. However, identifying such mutant IL-2 inducing peptides with conventional experimental techniques is time-consuming and costly. Therefore, a computational tool in the form of a web server is provided to the scientific community for predicting and identifying the IL-2 inducing/non-inducing peptides from the given proteomes. The web server is freely available to the scientific community at https://webs.iiitd.edu.in/raghava/il2pred/. We hypothesize that the developed tools will be highly utilized by the scientific community for identifying and prioritizing potential immunotherapeutic candidates.