Immunoinformatics based designing of a multi-epitope cancer vaccine targeting programmed cell death ligand 1

Mahafujul Alam, Syed Sahajada; Mir, Showkat Ahmad; Samanta, Arijit; Nayak, Binata; Ali, Safdar; Hoque, Mehboob

doi:10.1038/s41598-025-87063-y

Article
Open access
Published: 11 April 2025

Syed Sahajada Mahafujul Alam¹,
Showkat Ahmad Mir²,
Arijit Samanta¹,
Binata Nayak²,
Safdar Ali³ &
…
Mehboob Hoque¹,⁴ (ORCID: orcid.org/0000-0001-8450-7738)

Scientific Reports volume 15, Article number: 12420 (2025)

Abstract

Tumor cells express programmed cell death ligand 1 (PD-L1), which recognizes the immune checkpoint molecule programmed cell death 1 (PD-1) on T cells, suppressing the antitumor immune response. Inhibiting the PD-1:PD-L1 interaction has the potential to reactivate the immune response against tumors. Recent advancements in cancer therapy have demonstrated the remarkable promise of immunotherapy, which exploits immune checkpoint inhibition by small molecules or monoclonal antibodies. This strategy has shown impressive clinical success in treating a wide range of cancer subtypes, albeit with certain limitations. This study aims to design a novel multi-epitope vaccine against PD-L1 using an immunoinformatics approach. To enhance efficacy and minimize side effects, the vaccine was constructed using antigenic, non-allergenic, and non-toxic epitopes (5 CTL, 3 HTL, and 2 B-cell epitopes) predicted from the IgV domain of PD-L1. The vaccine design includes a large ribosomal subunit protein bL12 adjuvant, a 6xHis tag for purification, and appropriate linkers to connect the epitopes. The modelled 3D structure of the vaccine construct was docked with the TLR4 immune receptor, demonstrating strong antigenic properties and stable binding, as validated by molecular dynamics simulations. Immune simulation studies suggest that the vaccine construct could potentially elicit significant immune effectors such as B cells, T cells, and memory cells. Thus, the findings indicate that the vaccine may effectively suppress the PD-1:PD-L1 axis by targeting PD-L1, restoring the anticancer immune response. However, its efficacy needs to be validated in both in vitro and in vivo settings.

Introduction

Despite constant efforts and remarkable success in combating cancer, the incidence of cancer cases and related deaths continues to rise. One of the emerging approaches with immense potential for cancer management is cancer immunotherapy, which activates the host immune system, allowing it to counter cancer cells. The immune system’s precision, dynamic nature, and memory enable it to target cancer cells while sparing healthy ones. Immune memory helps it to recognize and eliminate the disease upon cancer recurrence. Immunotherapy consists of three main components: checkpoint inhibition, adoptive cell therapy, and vaccines. Checkpoint inhibition relies on the blockade of immune checkpoint molecules like cytotoxic T lymphocyte-associated antigen 4 (CTLA-4), the programmed cell death 1 (PD-1) receptor, and its ligand, programmed cell death ligand 1 (PD-L1). PD-1 and PD-L1 are transmembrane proteins with immunoglobulin (Ig)-like extracellular domains that enable interaction and signal transduction to intracellular regions. PD-1, also known as CD279, is a type I transmembrane receptor found on the surface of T cells, B cells, monocytes, natural killer cells, and dendritic cells. It has two naturally occurring ligands: PD-L1 (B7-H1, CD274) and PD-L2 (B7-DC, CD273)¹. PD-L1 has been reported to be overexpressed in a number of cancer types such as melanoma, lymphoma, glioblastoma, as well as carcinoma of ovary, colon, lung squamous cells, and breast². The PD-1 and PD-L1 interaction sends a negative signal to the cytotoxic T lymphocyte, inhibiting antitumor immunity. As a result, blocking the PD-1:PD-L1 interaction reactivates cytotoxic CD8⁺ T cells, reinstating antitumor immunity.

The development of immune checkpoint inhibitors has ushered in a new phase of immunotherapy for cancer. Monoclonal antibodies (mAbs) designed to target PD-1 and PD-L1 are critical in rescuing T lymphocytes from exhaustion and reinvigorating the immune response against cancerous cells. Several anti-PD-1 and anti-PD-L1 mAbs have exhibited remarkable clinical outcomes and anticancer activity in patients with various cancers. The US FDA has approved three anti-PD-1 antibodies, namely nivolumab, pembrolizumab, and cemiplimab, as well as three anti-PD-L1 antibodies, atezolizumab, durvalumab, and avelumab, for the treatment of various cancer types³. Moreover, several additional checkpoint inhibitors are currently being tested in clinical trials. While these mAbs have demonstrated notable clinical efficacy as monotherapy, challenges such as limited response rates, toxicity issues, resistance, steep costs, long half-life, and sophisticated therapeutic regimens remain important impediments⁴. Addressing these limitations will necessitate the development of more effective immune checkpoint inhibitors or novel combinational approaches. Another paradigm of cancer immunotherapy is the development of vaccines.

Cancer vaccines promote antigen-specific immune responses by presenting tumor antigens to the patient’s immune system. The challenges for vaccination against cancer include limited penetrability, waning of the immune response, and development of resistance. Multi-target vaccines designed to target immunogenicity-optimized epitopes may be able to tackle some of these problems⁵. Understanding immune evasion mechanisms, designing effective formulations, and combining immunotherapy approaches can pave the way for future cancer vaccine development. Multi-epitope vaccines effectively activate both humoral and cellular immune responses by targeting T and B cell epitopes simultaneously, offering advantages such as high specificity, superior safety, ease of production and storage, and long-lasting efficacy⁶. Additionally, the incorporation of adjuvants in multi-epitope vaccines is expected to elicit long-lasting immunological responses and achieve high immunogenicity⁷. In this study, we assessed the T-cell and B-cell epitopes from human PD-L1 to design a multi-epitope cancer vaccine. This vaccine could potentially elicit both humoral and cell-mediated immunity, generating polyclonal antibodies that target the PD-1:PD-L1 signaling axis and restoring cytotoxic T-cell functionality.

Materials and methods

A robust multi-peptide cancer vaccine against PD-L1 was designed using a computational approach combining multiple bioinformatics tools and techniques, as illustrated in Fig. 1.

Protein sequence retrieval and domain analysis

To predict potential epitopes in the protein and design a multi-epitope cancer vaccine, the PD-L1 protein sequence with accession number Q9NZQ7 was obtained from the Uniprot database (https://www.uniprot.org/) on September 14, 2023. Additionally, domain analysis was conducted using the Protter⁸ server (https://wlab.ethz.ch/protter/start/) with the same Uniprot accession number on February 12, 2024.

Cytotoxic T lymphocyte (CTL) epitopes prediction

A consensus list of high-binding and promiscuous cytotoxic T lymphocyte (CTL) epitopes was compiled using the following web tools: NetCTL version 1.2 (https://services.healthtech.dtu.dk/services/NetCTL-1.2/), PickPocket version 1.1 (https://services.healthtech.dtu.dk/services/PickPocket-1.1/), and NetMHCpan-4.1 (https://services.healthtech.dtu.dk/services/NetMHCpan-4.1/)⁹. All these web servers were accessed on September 14, 2023. NetCTL uses artificial neural networks to predict binding to MHC class I and proteasomal cleavage, and it employs a weight matrix to estimate TAP transport efficiency¹⁰,¹¹. PickPocket, on the other hand, relies on position-specific weight matrices for its predictions¹². NetMHCpan employs artificial neural networks for epitope prediction. All parameters were kept at their default settings. The consensus list was created by selecting the top 10 affinity-sorted epitopes from PickPocket and comparing them with the high-scoring epitopes predicted by NetCTL and NetMHCpan to find common epitopes, which improved prediction diversity and accuracy. This analysis utilized a default set of 12 representative HLA supertypes and nonameric peptide epitopes. The 12 HLA supertypes shared by the algorithms were HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*24:02, HLA-A*26:01, HLA-B*07:02, HLA-B*08:01, HLA-B*27:05, HLA-B*39:01, HLA-B*40:01, HLA-B*58:01, and HLA-B*15:01.
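The consensus step described above amounts to intersecting the peptide lists returned by the individual predictors. A minimal sketch in Python follows; the function name and the example lists are hypothetical, and the placeholder peptides (`AAAAAAAAA`, `CCCCCCCCC`) are not real predictions.

```python
def consensus_epitopes(*prediction_lists):
    """Return the peptides present in every prediction list, sorted alphabetically."""
    if not prediction_lists:
        return []
    common = set(prediction_lists[0])
    for peptides in prediction_lists[1:]:
        common &= set(peptides)  # keep only peptides seen by every tool so far
    return sorted(common)


# Hypothetical outputs of three predictors (illustrative only).
netctl_hits = ["YRQRARLLK", "KLQDAGVYR", "ISYGGADYK", "AAAAAAAAA"]
netmhcpan_hits = ["KLQDAGVYR", "ISYGGADYK", "YRQRARLLK"]
pickpocket_hits = ["ISYGGADYK", "YRQRARLLK", "CCCCCCCCC"]

shared = consensus_epitopes(netctl_hits, netmhcpan_hits, pickpocket_hits)
print(shared)
```

Only peptides reported by all three tools survive, which is why the consensus list is much shorter than any single prediction list.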

Helper T lymphocytes (HTL) epitopes prediction

The helper T lymphocyte (HTL) epitopes were predicted using the NetMHCIIpan 4.0 (https://services.healthtech.dtu.dk/services/NetMHCIIpan-4.0/)⁹ server, accessed on September 15, 2023, with a peptide length of 15 amino acids. Strong binding peptides were identified with a %Rank threshold of 2%, while weak binding peptides were filtered at a %Rank threshold of 10%. Thirteen common HLA Class II alleles, including HLA-DRB1-0101, HLA-DRB1-0301, HLA-DRB1-0401, HLA-DRB1-0701, HLA-DRB1-0801, HLA-DRB1-0901, HLA-DRB1-1001, HLA-DRB1-1101, HLA-DRB1-1201, HLA-DRB1-1301, HLA-DRB1-1401, HLA-DRB1-1501, and HLA-DRB1-1601, were analysed to assess binding affinities and identify potential HTL epitopes.
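As an illustration of the thresholds above, the following Python sketch enumerates overlapping 15-mers from a sequence and labels a %Rank value as a strong binder, weak binder, or non-binder. The function names are hypothetical, and treating the cutoffs as inclusive is an assumption; the server's own boundary handling may differ.

```python
def sliding_peptides(sequence, length=15):
    """Enumerate all overlapping peptides of the given length."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


def classify_binder(percent_rank, strong_cutoff=2.0, weak_cutoff=10.0):
    """Label a peptide by its %Rank (lower %Rank means stronger predicted binding).

    Assumption: cutoffs are treated as inclusive.
    """
    if percent_rank <= strong_cutoff:
        return "strong"
    if percent_rank <= weak_cutoff:
        return "weak"
    return "non-binder"


# A 20-residue toy sequence yields 20 - 15 + 1 = 6 candidate 15-mers.
toy_peptides = sliding_peptides("A" * 20)
labels = [classify_binder(r) for r in (0.8, 5.0, 42.0)]
```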

Linear B-cell epitopes prediction

The ABCpred (https://webs.iiitd.edu.in/raghava/abcpred/index.html)¹³ web server was accessed on September 15, 2023, for predicting the linear B-cell epitopes of PD-L1. Default parameters were used for B-cell epitope prediction, with a length of 16 amino acid residues selected for prediction. The ten highest-ranking predicted epitopes were subsequently selected for further analysis. The ABCpred has been trained on B-cell epitopes sourced from the Bcipep database. It utilizes a recurrent neural network for classifying epitopes and non-epitopes, enhancing accuracy in the prediction process¹³.

Epitope screening

Because a large number of epitopes were predicted, the best candidates were selected based on their antigenicity, toxicity, and allergenicity. To predict the antigenicity of the epitopes, the VaxiJen v2.0 (https://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) server was employed¹⁴. This server is a versatile tool that can be used to predict the antigenicity of proteins from a wide range of sources, including bacteria, viruses, fungi, parasites, and tumors. Its prediction accuracy typically falls between 70% and 89%, making it a reliable choice for such analyses¹⁵. In this study, the tumor model was selected as the target, and an antigenicity threshold of 0.5 was set to identify the best epitopes for the vaccine. In addition to antigenicity, the toxicity and allergenicity of the epitopes were evaluated to ensure the safety of the vaccine. The ToxinPred (https://webs.iiitd.edu.in/raghava/toxinpred/multi_submit.php) server was used to predict the toxicity of the selected epitopes, employing the Swiss-Prot SVM-based method¹⁶. The AllerTOP v.2.0 (https://www.ddg-pharmfac.net/AllerTOP/) server was used to assess the allergenicity of the epitopes. Not all HTL epitopes trigger the production of cytokines, and even when they do, the cytokines released may vary among them. To further evaluate the selected epitopes, the IL4pred (https://webs.iiitd.edu.in/raghava/il4pred/predict.php) and IFNepitope (https://webs.iiitd.edu.in/raghava/ifnepitope/predict.php) servers were used to predict their ability to induce the cytokines IL-4 and IFN-γ, respectively. For predicting IL-4 inducing HTL epitopes, a threshold of 0.2 was selected and the SVM-based model of the IL4pred server was used¹⁷,¹⁸. Similarly, for predicting IFN-γ inducing HTL epitopes, an SVM-based approach was employed, using the IFN-γ versus other cytokines model of the IFNepitope server. All these web servers were accessed on September 17, 2023.
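The screening logic described above can be expressed as a simple filter. The sketch below is illustrative only: the `EpitopeCall` record and the scores are hypothetical stand-ins for values copied from the VaxiJen, ToxinPred, and AllerTOP outputs, and the inclusive comparison against the 0.5 threshold is an assumption.

```python
from dataclasses import dataclass


@dataclass
class EpitopeCall:
    peptide: str
    antigenicity: float  # antigenicity score as reported by the predictor
    toxic: bool          # toxicity call
    allergenic: bool     # allergenicity call


def passes_screen(call, antigenicity_threshold=0.5):
    """Keep an epitope only if it is antigenic, non-toxic, and non-allergenic."""
    return (
        call.antigenicity >= antigenicity_threshold
        and not call.toxic
        and not call.allergenic
    )


# Hypothetical calls; the scores are made up for illustration.
calls = [
    EpitopeCall("PEPTIDEAA", 0.91, toxic=False, allergenic=False),
    EpitopeCall("PEPTIDEBB", 0.31, toxic=False, allergenic=False),
    EpitopeCall("PEPTIDECC", 0.75, toxic=True, allergenic=False),
    EpitopeCall("PEPTIDEDD", 0.66, toxic=False, allergenic=True),
]
selected = [c.peptide for c in calls if passes_screen(c)]
```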

Worldwide human population coverage analysis

The population coverage of the selected epitopes was assessed using the IEDB population coverage analysis tool (http://tools.iedb.org/population/)¹⁹, accessed on November 14, 2023. Coverage was assessed for the global human population.

Multi-epitope vaccine construction

To enhance the immunogenicity of the selected epitopes, specific linkers such as EAAAK, GGGS, GPGPG, HEYGAEALERAG, AAY, and KK were used to connect different components in a rational manner. Additionally, to boost immune responses, adjuvant molecules were used. Four different adjuvants were tried, leading to the creation of four distinct constructs. These constructs were further analyzed for their physicochemical properties, antigenicity, allergenicity, and secondary structures. Additionally, the Pan DR epitope (PADRE – AKFVAAWTLKAAA) adjuvant was fused to serve as a stimulus for HTL. The process involved the sequential addition of CTL epitopes, followed by HTLs and B-cell epitopes. Finally, a 6xHis tag was added to the C-terminal portion for subsequent purification of the vaccine protein.
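A minimal Python sketch of how such a construct can be assembled as a string is given below. The order of the linkers is an assumption made for illustration (EAAAK after the adjuvant, AAY between CTL epitopes, GPGPG between HTL epitopes, KK between B-cell epitopes); the text does not state which linker joins which components, and the GGGS and HEYGAEALERAG linkers are left out here. The adjuvant string is a placeholder, not a real sequence. The epitope sequences and the PADRE sequence are those reported in this article.

```python
PADRE = "AKFVAAWTLKAAA"  # Pan DR epitope sequence given in the text
HIS_TAG = "HHHHHH"       # 6xHis tag at the C-terminus

CTL_EPITOPES = ["YRQRARLLK", "KLQDAGVYR", "ISYGGADYK", "KRITVKVNA", "ITVKVNAPY"]
HTL_EPITOPES = ["DLYVVEYGSNMTIEC", "YGGADYKRITVKVNA", "GGADYKRITVKVNAP"]
BCELL_EPITOPES = ["HGEEDLKVQHSSYRQR", "ALQITDVKLQDAGVYR"]


def build_construct(adjuvant, ctl, htl, bcell):
    """Join adjuvant, PADRE, and epitopes with linkers (linker placement is assumed)."""
    parts = [
        adjuvant, "EAAAK", PADRE, "AAY",
        "AAY".join(ctl), "GPGPG",
        "GPGPG".join(htl), "KK",
        "KK".join(bcell), HIS_TAG,
    ]
    return "".join(parts)


# Placeholder adjuvant: "M" followed by unknown residues (X), not a real protein.
placeholder_adjuvant = "M" + "X" * 10
construct = build_construct(
    placeholder_adjuvant, CTL_EPITOPES, HTL_EPITOPES, BCELL_EPITOPES
)
print(len(construct), construct)
```

Swapping in a different adjuvant sequence while keeping the epitope block fixed is how the four candidate constructs mentioned above could be generated for comparison.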

Physicochemical properties and solubility analysis of the vaccine

After designing the chimeric sequences, their physicochemical properties were evaluated using the ProtParam (https://web.expasy.org/protparam/) webserver²⁰. The solubility of the chimeras upon expression in bacteria was assessed using the SoluProt 1.0 server (https://loschmidt.chemi.muni.cz/soluprot/)²¹. SoluProt is based on a gradient-boosting machine algorithm trained on data from the TargetTrack database²¹. The solubility was also predicted using the SOLpro (https://scratch.proteomics.ics.uci.edu/) server²². According to estimates using tenfold cross validation, SOLpro, an SVM-based method for predicting protein solubility from sequences, achieves a global accuracy exceeding 74%²².

Evaluation of the antigenicity and allergenicity of the vaccine

In the development of vaccines, evaluating the antigenicity of the final vaccine construct is a crucial step. To predict the antigenic behaviour, two online servers were used, the VaxiJen v2.0 and ANTIGENpro (https://scratch.proteomics.ics.uci.edu/)²³. Moreover, to ensure the safety of the vaccine, AllerTOP version 2.0 was employed to assess its potential allergenicity.

Prediction of the secondary structure

The percentage of secondary structure elements in the vaccine construct was determined using the Prabi (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_gor4.html) server²⁴.

Structure prediction and validation of the multi‑epitope vaccine

The 3D structure of the designed multi-epitope vaccine was predicted using the trRosetta server (https://yanglab.nankai.edu.cn/trRosetta)²⁵. trRosetta is a web-based server that predicts protein structures using deep learning and Rosetta. A neural network predicts inter-residue geometries, which are used as restraints for energy minimization-based structure prediction with Rosetta²⁵. The 3D model of the multi-epitope vaccine protein underwent a two-step refinement process. Initially, the ModRefiner server (https://seq2fun.dcmb.med.umich.edu//ModRefiner/)²⁶ was used. Subsequently, refinement was carried out using the GalaxyRefine server (https://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE)²⁷.

The protein structure was validated using two different servers: ProSA-web (https://prosa.services.came.sbg.ac.at/prosa.php) and SAVES v6.0 (https://saves.mbi.ucla.edu/)²⁸. ProSA-web calculates the overall quality Z-score of a protein structure. If the Z-score is outside the typical range for native proteins, it suggests that there could be errors in the structure²⁹. On the other hand, SAVES v6.0 uses the PROCHECK tool to evaluate the stereochemical quality of the protein structure by checking the geometry of individual residues and the overall structural geometry. This helps identify any anomalies or irregularities in the protein structure²⁸.

Disulfide bond engineering in the multi-epitope vaccine

Disulfide engineering is a method for introducing disulfide bonds into protein structures. Such bonds stabilize the folded conformation of the protein by lowering the conformational entropy of the denatured state, thus increasing its free energy³⁰. Residue pairs in the vaccine construct that could be mutated to cysteine to form disulfide bonds were identified using the Disulfide by Design 2.13 server (http://cptweb.cpt.wayne.edu/DbD2/)³¹.

Discontinuous B cell epitope prediction

The Ellipro server (http://tools.iedb.org/ellipro/) was utilized to predict discontinuous B-cell epitopes in the designed vaccine construct. Ellipro uses a residue clustering algorithm and Thornton’s method to identify the epitopes, to which PI (protrusion index) values are assigned¹⁵,³².

Interaction study of T cell epitopes with MHC molecules

Interactions of the selected CTL and HTL epitopes with MHC-I and MHC-II molecules, respectively, were evaluated by molecular docking analysis. Molecular docking is a computational method that determines the best orientation of a ligand in a complex with a receptor. For this purpose, the 3D structures of the selected MHC-I and MHC-II epitopes were modelled using the PEP-FOLD 4.0 online server (https://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD4/)³³. The best models were selected for docking with their corresponding alleles: HLA-A*03:01 (PDB ID: 2XPG) for the MHC-I epitope and HLA-DRB1-1501 (PDB ID: 1BX2) for all MHC-II epitopes. Molecular docking was performed using the HPEPDOCK 2.0 (http://huanglab.phys.hust.edu.cn/hpepdock/) online server³⁴, and the docking results of all epitopes with their corresponding MHC alleles were visualized using PyMOL 3.1.

Molecular docking of the designed vaccine with TLR4 receptor

The activation of a robust immune response depends on the interaction between an antigenic molecule and a specific immune receptor. The engagement of Toll-like receptors (TLRs) by vaccine epitopes is a critical step in initiating the immune response, with TLR4 being one of the key receptors implicated in recognizing pathogens and vaccine components. To identify the binding pockets or cavities in the TLR4 receptor, the CASTp (http://sts.bioe.uic.edu/castp/) server was employed³⁵. CASTp identifies and measures surface-accessible binding pockets as well as inaccessible interior cavities of protein molecules³⁵. The interaction between the vaccine construct and TLR4 was evaluated by molecular docking. The docking analysis was conducted using the ClusPro 2.0 server, where the refined model of the vaccine construct was submitted as the ligand and TLR4 (PDB ID: 3FXI) was submitted as the receptor. The PDBsum (https://www.ebi.ac.uk/thornton-srv/databases/pdbsum/) server was used to visualise the bonds formed between the residues of the vaccine construct and TLR4 in the docked complex³⁶.

Molecular dynamics simulation of the vaccine‑receptor complex

In this study, we conducted molecular dynamics (MD) simulations to investigate the structural stability, dynamics, and specific binding interactions between the designed multi-epitope vaccine and TLR4 using Gromacs 22.4v³⁷. The complex is composed of chains A and C of TLR4 and chain V of the vaccine subunit, and its topology was generated using the AMBER99SB-ILDN force field. The complex was then placed in an SPC water box and solvated with 195,640 solvent molecules. The system was neutralised by adding 377 Na⁺ and 370 Cl⁻ ions, giving a salt concentration of 0.15 M to mimic physiological conditions. Long-range electrostatic interactions were handled using the PME algorithm, and bonds involving hydrogen atoms were constrained using the LINCS algorithm. The vaccine-TLR4 complex, comprising chains A and C of the receptor and chain V of the vaccine construct, was equilibrated for 2 ns. The production MD simulation was run for 175 ns with a time step of 0.2 fs³⁹, during which the root-mean-square deviation (RMSD) was monitored to assess structural stability. The root-mean-square fluctuation (RMSF) was used to determine the fluctuations of each amino acid in the complex⁴⁰. The RMSD distribution of the TLR4 and vaccine construct conformations was examined by cluster analysis using the gromos method. Cutoff values from 0.4 to 0.2 were tested, and more clusters were obtained at the 0.2 cutoff. In addition, the hydrogen bonds formed between TLR4 and the vaccine construct were computed using the hbond module, following previously established protocols⁴⁰.

Moreover, the structural compactness and dynamics of TLR4 complexed with the designed multi-epitope vaccine were further examined by monitoring the radius of gyration (RoG) throughout the MD simulations. The RoG is a critical parameter that quantifies the molecule’s compactness, providing insights into structural integrity and conformational changes over time. Such analysis has been employed in various studies to determine the behaviour of a complex in aqueous medium⁴¹.
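For readers unfamiliar with the two metrics, the following Python sketch shows how RMSD and the radius of gyration are defined for a set of atomic coordinates. It is not the GROMACS implementation: the RMSD function assumes the two coordinate sets are already superimposed (no least-squares fitting is performed), and the coordinates in the example are made up.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-aligned coordinate sets."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    squared = sum(
        (a - b) ** 2
        for point_a, point_b in zip(coords_a, coords_b)
        for a, b in zip(point_a, point_b)
    )
    return math.sqrt(squared / len(coords_a))


def radius_of_gyration(coords, masses=None):
    """Mass-weighted RMS distance of atoms from their centre of mass."""
    if masses is None:
        masses = [1.0] * len(coords)  # equal weights if no masses are given
    total_mass = sum(masses)
    centre = [
        sum(m * p[i] for m, p in zip(masses, coords)) / total_mass
        for i in range(3)
    ]
    squared = sum(
        m * sum((p[i] - centre[i]) ** 2 for i in range(3))
        for m, p in zip(masses, coords)
    )
    return math.sqrt(squared / total_mass)


# Toy example: two atoms shifted by one unit along x.
reference = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
shifted = [(1.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
deviation = rmsd(reference, shifted)
compactness = radius_of_gyration([(-1.0, 0.0, 0.0), (1.0, 0.0, 0.0)])
```

A stable RMSD and a roughly constant radius of gyration over the trajectory are the usual indications that a complex remains folded and bound.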

Principal component and free energy landscape analysis

Principal component analysis was carried out with tools available within GROMACS, beginning with diagonalisation of the covariance matrix using the gmx covar module, followed by analysis of the resulting components with gmx anaeig⁴². The gmx sham module was then used to identify meta-stable conformations, and two-dimensional free energy landscape images were generated using the gmx xpm2ps module. The Gibbs free energy landscape is a fundamental indicator of the thermodynamic characteristics of the simulated complex⁴².

$$\Delta G(PC1, PC2) = -K_{B}T \ln P(PC1, PC2) \qquad (1)$$

In Eq. (1), K_B represents the Boltzmann constant, T is the absolute temperature, P(PC1, PC2) is the probability distribution of the system along the first two principal components, and PC1 and PC2 denote the first two principal reaction coordinates. The changes in enthalpy (ΔH), free energy (ΔG), and entropy (ΔS) were computed using Eq. (2).

$$\Delta G = \Delta H - T\Delta S \qquad (2)$$

In Eq. (2), the symbols ΔH, T, ΔS, and ΔG represent the change in enthalpy, the temperature (in Kelvin), the change in system entropy, and the change in Gibbs free energy, respectively.
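Eq. (1) can be illustrated with a short Python sketch that bins the PC1 and PC2 projections into a two-dimensional histogram and converts the bin probabilities into free energies. This is a simplified stand-in for what gmx sham does, not a reproduction of it. The temperature of 300 K, the number of bins, the use of the molar Boltzmann constant in kJ/(mol·K), and the shift of the minimum to zero are all assumptions made for the example.

```python
import math
from collections import Counter

K_B = 0.0083144626  # Boltzmann constant in molar units, kJ/(mol*K)


def free_energy_landscape(pc1, pc2, bins=20, temperature=300.0):
    """Return {(i, j): delta_G} for every populated bin, with the minimum set to 0."""
    lo1, hi1 = min(pc1), max(pc1)
    lo2, hi2 = min(pc2), max(pc2)

    def bin_index(value, lo, hi):
        if hi == lo:
            return 0
        # Clamp the maximum value into the last bin.
        return min(int((value - lo) / (hi - lo) * bins), bins - 1)

    counts = Counter(
        (bin_index(x, lo1, hi1), bin_index(y, lo2, hi2)) for x, y in zip(pc1, pc2)
    )
    total = sum(counts.values())
    # Eq. (1): delta_G = -K_B * T * ln P, with P estimated from bin populations.
    energies = {
        cell: -K_B * temperature * math.log(n / total) for cell, n in counts.items()
    }
    g_min = min(energies.values())
    return {cell: g - g_min for cell, g in energies.items()}


# Toy projections: three frames in one basin, one frame in another.
landscape = free_energy_landscape([0, 0, 0, 1], [0, 0, 0, 1], bins=2)
```

The most populated bin sits at zero free energy, and a bin visited three times less often lies K_B·T·ln 3 higher, which is how basins and barriers appear in the landscape plot.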

Molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) calculation

We employed the MM-PBSA method to calculate the average binding free energy of the vaccine construct with TLR4 using Gromacs 5.2v software. Because TLR4 is composed of two chains (A and C), two groups were first defined: (1) the vaccine construct and (2) TLR4 chains A and C. The bootstrap method was then employed to calculate the average binding free energy using the “MmPbSaStat.py” code available on Kumari GitHub 2014. The files required for the free energy calculations were obtained from the MD simulations (175 ns). The ΔG_binding was calculated according to equation (i).

ΔG_binding = ΔG_complex − (ΔG_protein + ΔG_ligand)    (i)

In this particular approach, the computation of entropy via normal mode analysis is frequently omitted because its inclusion does not improve agreement with experimental data.
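The per-frame application of equation (i) and the bootstrap averaging can be sketched in Python as follows. This is not the MmPbSaStat.py script; the function names, the number of resamples, the fixed random seed, and the energy values are illustrative assumptions.

```python
import random
import statistics


def binding_energies(complex_e, protein_e, ligand_e):
    """Apply equation (i) frame by frame."""
    return [c - (p + l) for c, p, l in zip(complex_e, protein_e, ligand_e)]


def bootstrap_mean(values, n_resamples=1000, seed=0):
    """Bootstrap estimate of the mean and its standard error."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    means = [
        statistics.fmean(rng.choices(values, k=len(values)))
        for _ in range(n_resamples)
    ]
    return statistics.fmean(means), statistics.stdev(means)


# Made-up per-frame energies (kJ/mol) for complex, receptor, and vaccine.
frame_dg = binding_energies(
    [-100.0, -110.0, -105.0],
    [-40.0, -45.0, -42.0],
    [-30.0, -35.0, -31.0],
)
mean_dg, stderr_dg = bootstrap_mean(frame_dg)
```

Resampling frames with replacement gives an error estimate for the average binding energy without assuming a particular distribution of the per-frame values.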

Immune simulation for vaccine efficacy

To understand the immunogenicity and immune response of the multi-epitope vaccine, in silico immune simulations were performed using the C-ImmSim server (https://kraken.iac.rm.cnr.it/C-IMMSIM/index.php), accessed on November 13, 2023⁴³. To simulate the immune response generated by the designed chimeric antigen against PD-L1, this study adopted an immunization schedule similar to that followed for the schistosomiasis vaccines rSh28GST and rSm14, which are currently under clinical trials⁴⁴,⁴⁵. The immunization schedule involves three doses administered at 4-week intervals. Injections containing 1,000 vaccine proteins each were given at time-steps 1, 84, and 168, spaced four weeks apart, with a total of 1,050 simulation steps. Each time-step is equivalent to 8 h in real life, and time-step 1 corresponds to the injection at time = 0. The default simulation parameters were used for all other simulation aspects.
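The relation between simulation time-steps and real time follows directly from the 8 h step length stated above. A small Python sketch of the arithmetic (the helper names are hypothetical):

```python
HOURS_PER_STEP = 8  # each simulation time-step corresponds to 8 h


def steps_for_weeks(weeks):
    """Number of time-steps that span the given number of weeks."""
    return weeks * 7 * 24 // HOURS_PER_STEP


def steps_to_days(steps):
    """Convert a number of time-steps into days."""
    return steps * HOURS_PER_STEP / 24


interval_steps = steps_for_weeks(4)       # steps between consecutive doses
injection_steps = [1, 84, 168]            # dosing time-steps used in the text
simulated_days = steps_to_days(1050)      # total length of the simulation
```

A 4-week interval therefore corresponds to 84 time-steps, and 1,050 steps cover 350 days of simulated time.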

Codon optimization and in silico cloning of the final vaccine construct

The Java Codon Adaptation Tool (JCat) (http://www.jcat.de/Start.jsp) was utilized to perform back translation and codon optimization for the final vaccine construct. The protein sequence of the vaccine was provided as input to JCat, and the host organism chosen for expressing the vaccine construct was E. coli (K12 strain). Within this server, two parameters were calculated: the codon adaptation index (CAI) and the GC content. These parameters play a crucial role in assessing protein expression levels. Following the addition of BamHI and HindIII sites to its 5’ and 3’ ends, the nucleic acid sequence was restriction digested and cloned into the pET-28a (+) vector using Snapgene software. The pET28a plasmid was chosen for cloning our vaccine construct due to its strong T7 promoter, common restriction sites, controlled expression with a lac operator, self-encoding lac repressor, and medium copy number, allowing high-level expression without overloading cells⁴⁶.
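Two of the steps above are easy to illustrate in Python: computing the GC content of a coding sequence and flanking it with the BamHI (GGATCC) and HindIII (AAGCTT) recognition sites. The sketch does not compute the CAI, which requires host codon usage weights, and it ignores reading-frame and internal-site checks; the short DNA strings are made up.

```python
BAMHI_SITE = "GGATCC"    # BamHI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence


def gc_content(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def add_cloning_sites(orf, five_prime=BAMHI_SITE, three_prime=HINDIII_SITE):
    """Flank a coding sequence with restriction sites at its 5' and 3' ends."""
    return five_prime + orf.upper() + three_prime


toy_gc = gc_content("ATGC")
toy_insert = add_cloning_sites("atgaaa")
```

In practice the optimized sequence would also be checked for internal BamHI or HindIII sites before cloning, since those would be cut during digestion.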

Results

Retrieval of PD-L1 protein sequence and preliminary analysis

The PD-L1 protein sequence (Q9NZQ7) was retrieved from the Uniprot database. This protein, which is 290 amino acids long, belongs to the B7 family of type I transmembrane protein receptors⁴⁷. The protein consists of two extracellular domains (IgV and IgC), a transmembrane domain, and a cytoplasmic domain⁴⁷. The IgV domain spans amino acids 19 to 127, while the IgC domain spans amino acids 133 to 225; the two are joined by a short stalk region covering amino acid residues 128–132. Another short stalk region connects the IgC domain to the transmembrane domain⁴⁸. The IgV domain of PD-L1 serves as the sole interaction domain for PD-1 binding⁴⁹. Therefore, in this study, we selected the segment of the protein spanning residues 19 to 225, which encompasses the complete IgV domain as well as a brief stretch of amino acids at the C-terminus of the IgV domain continuing into a few residues of the IgC domain. The domains of PD-L1 were visualized using Protter, an open-source visualisation tool, as depicted in Fig. 2.

Prediction of CTLs in PD-L1

The CTL epitopes are crucial for inducing cellular immunity mediated by CD8⁺ CTLs restricted by MHC class I molecules. Therefore, these epitopes are promising candidates for designing subunit vaccines targeting a range of diseases⁵⁰. The CTLs targeting PD-L1 can identify and eliminate cancerous lymphoma cells and normal immune cells that express PD-L1, potentially boosting the effector stage of the immune response⁵¹.

Here, the potential CTL epitopes of PD-L1 were predicted by using NetCTL-1.2, NetMHCpan-4.1, and PickPocket 1.1, employing the default 12 representative HLA supertypes. These supertypes are globally distributed, making them representative across populations⁵². Subsequently, the three prediction algorithms were employed to predict and compile a consensus list of top high binders. The consensus list was selected to enhance prediction accuracy by considering results from different algorithms. These predictions generated 73 CTL epitopes from NetCTL-1.2, 31 from NetMHCpan-4.1, and 120 from PickPocket-1.1 (Supplementary Table S1). The common epitopes from all three lists were considered for further analysis in designing the multi-epitope vaccine. We identified 23 common epitopes, from which 5 CTLs were selected for vaccine construction after using strict screening criteria.

Prediction of HTLs in PD-L1

The CD4⁺ HTLs play a crucial role in both humoral and cell-mediated immune responses⁵³. Consequently, epitopes specific to HTL receptors are deemed crucial components of prophylactic and therapeutic vaccines. The HTLs play a pivotal role in initiating and sustaining long-term antitumor CTL responses⁵⁴. In a recent study, Hirata-Nozaki et al. (2019)⁵⁵ reported that HTLs specific to PD-L1 produce effector cytokines and augment cytotoxic activity against tumor cells expressing PD-L1. Notably, when PD-L1-specific HTLs were transferred into immunodeficient mice, there was a significant inhibition in the growth of PD-L1-positive human lung carcinoma⁵⁵.

In this study, the potential HTL epitopes of PD-L1 were predicted by using NetMHCIIpan 4.0 as described in the Methods section. A total of 73 epitopes with high binding affinity were generated and are detailed in Supplementary Table S2. Out of these epitopes, finally three HTLs were selected for vaccine construction following certain screening criteria.

Prediction of linear B-cell epitopes in PD-L1

B-cell epitopes are regions on antigen surfaces that B-cell receptors (BCR) recognize, initiating immune responses. This process is fundamental to the adaptive immune system and is responsible for immunological memory and targeted responses to antigens in vertebrates⁵⁶. Mapping B-cell epitopes is crucial for diagnostics and effective vaccine design⁵⁷. Recently, Guo and colleagues showed that a PD-L1 B-cell epitope peptide vaccine produced robust immune responses and demonstrated significant antitumor immunity across multiple syngeneic mouse models⁴.

Top ten B-cell epitopes with the highest scores are shown in Supplementary Table S3. The identified B cell epitopes were ranked based on their scores derived from a trained recurrent neural network. A higher score indicates a greater likelihood of the peptide being an epitope¹³. Considering all the selection criteria, finally two B cell epitopes were selected for vaccine construction.

Screening the predicted epitopes

The predicted epitopes were evaluated for antigenicity using the VaxiJen v2.0 server, for toxicity with ToxinPred v2.0, and for allergenicity with AllerTOP v2.0. Additionally, for HTL epitopes, the evaluation included their ability to induce IL-4 and IFN-γ (Supplementary Table S4). IFN-γ and IL-4 play crucial roles in regulating the development and differentiation of immune cells, as well as the overall immune response of an organism. The cytokine IFN-γ promotes T helper type 1 (Th1) responses, while IL-4 stimulates T helper type 2 (Th2) responses⁵⁸. IFN-γ enhances the antigen presentation ability of antigen-presenting cells and promotes the differentiation of CD4⁺ Th1 cells. On the other hand, IL-4 stimulates the proliferation of activated B cells⁵⁹,⁶⁰. Thus, the HTLs that stimulate IFN-γ and IL-4, which enhance the immune response, are considered suitable vaccine candidates.

Only epitopes that were antigenic, non-toxic, and non-allergenic, and that lay within or overlapped the IgV domain, were chosen for vaccine construction. Overlapping sequences were included because they contain amino acids within the IgV domain, which makes up the hotspot region for PD-1 binding, making it a potential target for PD-1:PD-L1 axis inhibition. Following all the screening criteria, the following ten epitopes were identified: five CTL epitopes (YRQRARLLK, KLQDAGVYR, ISYGGADYK, KRITVKVNA, ITVKVNAPY), three HTL epitopes (DLYVVEYGSNMTIEC, YGGADYKRITVKVNA, GGADYKRITVKVNAP), and two B-cell epitopes (HGEEDLKVQHSSYRQR, ALQITDVKLQDAGVYR). The attributes of the selected epitopes are presented in Table 1.

Table 1 Features of the predicted epitopes of PD-L1.

Worldwide human population coverage analysis

The IEDB population coverage calculation tool was used to estimate the anticipated global response to a specific set of MHC-restricted epitopes. The finally selected CTL and HTL epitopes were subjected to population coverage analysis to assess their likelihood of binding to MHC molecules across the global population. The calculated world population coverage for MHC class I was found to be 25.83%, while for MHC class II, it was 33.41%. The cumulative population coverage for both MHC class I and II molecules was 50.61%. Fig. 3 displays the region-wise data. It is to be noted that certain country-specific data were not available.
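The combined figure can be reproduced with a short calculation if class I and class II coverage are treated as independent, so that an individual is covered when at least one of the two classes presents an epitope. Whether the IEDB tool computes the combined value exactly this way is an assumption; the Python sketch below only shows that the reported numbers are consistent with it.

```python
def combined_coverage(class_i, class_ii):
    """Fraction covered by class I or class II, assuming independence."""
    return 1.0 - (1.0 - class_i) * (1.0 - class_ii)


# Coverage values reported in the text, expressed as fractions.
combined = combined_coverage(0.2583, 0.3341)
combined_percent = round(combined * 100, 2)
print(combined_percent)
```

Under this assumption the result rounds to 50.61%, matching the cumulative coverage reported above.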