Abstract
Several computational methods have been developed to identify neutralizing antibodies (NAbs) covering four dengue virus serotypes (DENV-1 to DENV-4); however, limitations of the dataset and the resulting performance remain. Here, we developed a new computational framework to predict potent and stable NAbs against DENV-1 to DENV-4 using only antibody (CDR-H3) and epitope sequences as input. Specifically, our proposed computational framework employed sequence-based ML and molecular dynamic simulation (MD) methods to achieve more accurate identification. First, we built a novel dataset (nā=ā1108) by compiling the interactions of CDR-H3 and epitope sequences with the half maximum inhibitory concentration (IC50) values, which represent neutralizing activities. Second, we achieved an accurately predictive ML model that showed high AUC values of 0.879 and 0.885 by tenfold cross-validation and independent tests, respectively. Finally, our computational framework could be applied to filter approximately 2.5 million unseen antibodies into two final candidates that showed strong and stable binding to all four serotypes. In addition, the most potent and stable candidate (1B3B9_V21) was evaluated for its development potential as a therapeutic agent by molecular docking and MD simulations. This study provides an antibody computational approach to facilitate the high-throughput identification of NAbs and accelerate the development of therapeutic antibodies.
Similar content being viewed by others
Introduction
Dengue virus (DENV), which has four antigenically distinct serotypes (DENV-1 to DENV-4), has worldwide spread and causes 400 million infections/year, with 100 million cases of mild to severe dengue disease (dengue fever, dengue haemorrhagic fever, or dengue shock syndrome) and 20,000 deaths1. To date, there is neither a fully effective, licenced dengue vaccine nor an approved therapeutic agent due to the phenomenon of antibody-dependent enhancement (ADE), which is a pathological immune response to subsequent infection with a different serotype, leading to severe disease2. ADE occurs when non-neutralizing or sub-neutralizing levels of antibodies from a previous dengue infection or vaccination bind to a different serotype of dengue virus during a subsequent infection. Recent updates show that various approved dengue vaccines demonstrate differing efficacy rates across each serotype, typically ranging from approximately 60% and 80% in clinical trials involving both immune and non-immune participants to dengue3. To mitigate the risk of ADE, dengue vaccine or therapeutic candidates must induce strong and balanced immune responses against all four dengue serotypes.
DENV particles are composed of seven nonstructural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, NS5) and three structural proteins (E, C, prM/M). Dengue envelope (E) protein is the principal target for potent broadly neutralizing antibodies (NAbs), containing neutralizing epitopes on EDII (envelope domain II with fusion loop), EDI-EDII (interdomain), EDIII (envelope domain III), and EDE (envelope dimer epitope)4,5,6,7. The overall structure of the E protein is conserved across all four DENV serotypes, exhibiting 60ā70% amino acid sequence identity8. Developing cross-reactive neutralizing therapeutic antibodies or vaccines against all serotypes requires navigating the structural variability of the E protein and identifying conserved epitopes that induce protective immunity without enhancing infection. The fusion loop, a conserved region situated within EDII, plays a vital role in the viral fusion process with host cells and may potentially trigger ADE8. Variations in the sequence and structure of the EDII among serotypes impact antibody binding, neutralization efficacy, and ADE, thereby impacting the design of both therapeutics and vaccines. Addressing these challenges emphasizes the critical importance of gaining a thorough understanding of the structure of DENV and the interactions between antibodies and antigens. This understanding is crucial for refining strategies aimed at developing vaccines and therapeutics.
Over the past three decades, there have been many attempts to develop NAbs against E proteins of all four serotypes, known as cross-neutralizing antibodies, with half maximum inhibitory concentration (IC50) values (or neutralizing activities) less than 10 µg/mL to reduce the effects of ADE as promising therapeutic agents for dengue infection9,10. Recent studies on therapeutic antibodies have established a threshold of IC50āā¤ā10 µg/mL to distinguish between neutralizing (IC50āā¤ā10 µg/mL) and nonneutralizing (IC50ā>ā10 µg/mL) activities11,12. One of our antibody candidates, 1B3B9, targets the fusion loop and effectively neutralizes all four dengue serotypes. However, it exhibits varying levels of neutralizing activity (ranging from 0.125 to 3 ug/ml) against different serotypes, which could potentially lead to ADE6,13. To tackle this challenge, we aim to mutate this antibody and apply a computational framework to enhance its effectiveness.
The design of antibodies for the treatment of various diseases, such as DENV infectious diseases, requires two structural parts: fragment antigen-binding (Fab) and constant fragment crystallizable (Fc) regions14. The Fab region contains the hypervariable regions or complementary determining regions (CDRs) CDR-H1, CDR-H2, and CDR-H3 on each heavy chain and CDR-L1, CDR-L2, and CDR-L3 on each light chain. CDR-H3 is the most hypervariable region responsible for binding and neutralizing activities that should be considered for antibody-specified epitope prediction15. Antibody design and prediction (screening) is the most important step to limit laborious, time-consuming, and expensive laboratory work in the process of therapeutic antibody discovery and development. Recently, the integration of experimental (in vitro and in vivo) and computational (in silico) methods has assisted and enriched the effective generation of therapeutic antibodies due to the advancement of experimental studies, sequence data, structural data, and computational approaches, especially machine learning (ML) and molecular dynamics (MD) simulation techniques15.
ML and MD can be used to detect and predict important patterns of epitopes on antigens, paratopes on antibodies, or paratopeāepitope interactions and enable us to develop novel therapeutic antibodies quickly to combat emerging and reemerging diseases15,16. Most of the computational algorithms for antibody screening that showed high accuracy involved the use of complex methods and structural data17,18. Simple, sequence-based ML methods that can be used to screen for antibodies with desired properties have been reported for only some diseases, such as cancers19,20, HIV infection21,22, and SARS-CoV-2 infection23. Previous prediction approaches for dengue NAbs have used only ML-based24,25 or MD-based26 methods, which have different advantages in high-throughput prediction for antibody screening or highly efficient prediction for antibody developability, respectively24,27. None of the previous approaches used the dengue antibody dataset containing full-length CDR-H3 sequences and epitope sequences together with the in vitro IC50 values. Additionally, there is no complete in silico screening approach for cross-neutralizing antibodies against the four DENV serotypes24,28.
In this study, we developed an computational framework, incorporating a sequence-based ML method and a simple MD method, to accurately identify potent and stable neutralizing antibodies against all four dengue serotypes. The development process in our framework consists of four major steps: (1) dataset preparation, (2) feature extraction and ML analysis, (3) ML screening, and (4) MD screening. The major contributions from this process can be summarized as follows. First, we comprehensively generated a novel sequence-based dataset with IC50 activities from available experimental results over the last three decades. Second, we developed a computational framework, which is the first integrative approach (ML and MD methods) to identify potent and stable antibodies against DENV-1 to DENV-4. We compared and demonstrated the performance of three different encoding schemes in conjunction with ten ML algorithms to select the best performing model and capture the crucial information on NAbs. We then applied this computational framework to screen several million unseen antibodies into two final NAb candidates. Finally, we identified the most potent and stable 1B3B9_V21 antibody as a promising therapeutic agent against dengue infection by molecular docking and MD analysis.
Results
Computational framework overview and dataset analysis
We designed a novel in silico prediction framework integrating sequence-based ML and simple MD methods to screen high-throughput antibody variants for potent and stable NAb candidates against four serotypes of DENV. The computational framework overview consists of four steps: dataset preparation, feature extraction and ML analysis, ML screening, and MD screening (Fig.Ā 1). In the dataset preparation, we collected CDR-H3 sequences, epitope sequences, and experimental IC50 values from well-defined PubMed and Google patent databases during 1992ā2022 (nā=ā100 publications; Supplemental Table 1). Missing and redundant data were not included in our dataset. All sequences were paired to form a total of 1108 antibodyāantigen interactions and further labelled according to IC50 values as the neutralizing class (positive dataset; IC50āā¤ā10 μg/mL; nā=ā554) and the nonneutralizing class (negative dataset; IC50ā>ā10 μg/mL; nā=ā554) (Table 1).
Overview of the computational framework. We generated a dataset of CDR-H3 epitope IC50 values from the PubMed and Google patent databases (nā=ā100 publications). The computational framework includes 4 steps: (1) dataset preparation (nā=ā1,108 interactions), (2) feature extraction (nā=ā3 methods: sequence-based, atom-based, fingerprint-based feature descriptors) and ML analysis (nā=ā10 models: XGB, SVM, RF, PLS, MLP, LR, LN, ET, DT, and 1NN), (3) ML screening (XGB model), which provided 28 potential NAb candidates, and (4) MD screening, which provided 2 stable NAb candidates (1B3B9_V2 and 1B3B9_V21).
Overall, we found that the antibodyāantigen interactions in the neutralizing (N) and nonneutralizing (NN) classes had binding residues on EDII (Nā=ā32.13%, NNā=ā30.51%), EDIII (Nā=ā25.27%, NNā=ā37.36%), EDE (Nā=ā21.84%, NNā=ā15.52%), interdomain (Nā=ā19.13%, NNā=ā14.62%), and EDI (Nā=ā1.62%, NNā=ā1.99%) (Fig.Ā 2A). We randomly divided our balanced dataset into a training dataset (80%, nā=ā443) and an independent dataset (20%, nā=ā111) (Table 1). To visualize the diversity of our training data, we separately analysed the sequence-extracted input of CDR-H3 and epitope sequences in the t-distributed stochastic neighbour embedding (t-SNE) plot and labelled them with epitope domains. We found that CDR-H3 sequences are diverse, as illustrated in Fig.Ā 2B. In contrast, the epitope sequences in all domains are well clustered in several areas (Fig.Ā 2C). We also evaluated the IC50 labels of all input interactions and found that the IC50 cut-off value representing a high neutralizing antibody level (IC50āā¤ā10 μg/ml) can be used to explicitly divide our dataset into neutralizing and nonneutralizing classes (Fig.Ā 2D).
Visualization of the input dataset. We visualized the input dataset before feature extraction and ML analysis as follows: (A) Numbers of neutralizing (nā=ā554) and nonneutralizing (nā=ā554) interactions based on target epitope domains are shown in a bar plot. EDI is envelope domain I. EDII is envelope domain II. EDIII is envelope domain III. EDI-EDII is interdomain. EDE is the envelope dimer epitope domain. (B) Diversity of CDR-H3 antibody sequences based on target domains including EDI, EDII, EDIII, interdomain, and EDE by t-SNE (nā=ā306; perplexityā=ā30, learning rateā=ā100), (C) Diversity of epitope sequences based on target domains including EDI, EDII, EDIII, interdomain, and EDE by t-SNE (nā=ā609; perplexityā=ā30, learning rateā=ā100), and (D) Distribution of IC50 values of each antibodyāantigen interaction by scatter plot (nā=ā1,108; cut-off value for neutralizing classāā¤ā10 μg/ml).
Analysis and selection of the best ML-performing method
In this section, we describe a comparative experiment on a variant ML model developed by using three encoding methods and ten ML models. In the ML analysis, we performed tenfold cross-validation and independent tests on our dataset to demonstrate the prediction performance of all ML models (Supplemental Tables 6ā8). To compare the ML performance for NAb prediction, we demonstrated the top five performing models with the highest AUC scores of each experiment in terms of five different matrices by tenfold cross-validation and independent tests (Table 2). We found that the AUCs of the top five ML models using the fingerprint-based encoding method were higher than those of other encoding methods (Table 2). Finally, we selected the best predictive approach of the XGB algorithm and fingerprint-based encoding method for the first-round screening of potential antibody candidates. This ML model yielded an ACC of 0.802, Sn of 0.788, Sp of 0.817, MCC of 0.604, and AUC of 0.885, as indicated by an independent test (Table 2). To confirm the interpretability of the XGB model, we illustrated the performance of this model and the other top five ML models of each method in ROC curves with AUC values of 0.554ā0.885 by the independent test (Fig.Ā 3AāC). The XGB model has an AUC of 0.885, which represents the strong ability of the XGB model to classify neutralizing or nonneutralizing antibodies (Fig.Ā 3C). Furthermore, we determined the top 30 fingerprint features connecting with the corresponding amino acids, which are often found in the neutralizing class, via the F score plot to demonstrate the accuracy of those features (Fig.Ā 4). The important features correlated with some amino acids (not all or nearly all amino acids) were considered unique features that might play a major role in the neutralization mechanism of dengue antibodies, including alcohol (amino acids S, T, and Y) and aromaticity (amino acids F, Y, and W) features (Fig.Ā 4). However, there are some features (labelled as could not be determined; ND) that have not been reported to be related to amino acids and binding properties in antibody interactions.
Performances of the top five ML models in the three encoding methods. We illustrated the performances of the top five ML models in three encoding methods by ROC curves with the AUC values. (A) ML analysis using sequence-based features. (B) ML analysis using atom-based features. (C) ML analysis using fingerprint-based features. We found that the XGB model and fingerprint encoding method are the most suitable approaches for classifying neutralizing and nonneutralizing antibody classes.
Top 30 important features with corresponding amino acids. We analysed our dataset and found the top 30 important features and corresponding amino acids of the neutralizing class from XGB model analysis. (All AA; all amino acids, ND; could not be determined, A; alanine, V; valine, L; leucine, I; isoleucine, M; methionine, T; threonine, G; glycine, Y; tyrosine, S; serine, C; cysteine, F; phenylalanine, W; tryptophan, N; asparagine, Q; glutamine, K; lysine, R; arginine, H; histidine, P; proline).
Generation and screening of new antibody variants
We first generated new antibody variants (unseen antibodies) by applying single, double, and triple mutations to all amino acids of the CDR-H3 sequence (TTLSGYSADWPEDY) of the 1B3B9 neutralizing human monoclonal antibody, resulting in 2,529,794 CDR-H3 variants. These CDR-H3 variants were paired with the most cross-reactive epitopes of dengue virus (FL epitope residues; CCDRWCFCCK) as antibodyāantigen sequences for feature extraction by the fingerprint-based method into the numerical input for the first ML screening of cross-neutralizing antibodies (potential candidates). To precisely predict the potential antibodies, we used the selected XGB model with a confidence score of 0.9900 for screening all antibody variants, and this ML screener filtered approximately 2.5 million variants to obtain 28 potential candidates. All 28 ML-screened candidates contain triple-point mutations, and the major residues are S100, Y102, S103, A104, and D105 (Supplemental Table 10). The minor mutation residues of these candidates are W106, E108, D109, and Y110, whereas there is no mutation at T97, T98, L99, G101, or P107, as shown in Supplemental Table 10.
To empower the screening framework and increase antibody developability, we screened these 28 ML-screened candidates with an MD simulation tool to test the stability of each antibody molecule. We designed a simple MD by performing homology modelling of these candidates, introducing the mutated CDR-H3 sequences into the 1B3B9 antibody template, and sent it to the SWISS-MODEL server. We determined the quality of each modelled structure with Ramachandran favoured and QMEAN scores before performing MD simulations. All constructed 3D structures showed QMEAN scores less than 1, reflecting the native-like structure (Supplemental Table 10). Then, we used MD to screen a stable conformation of each 3D structure and showed MD-screened candidates in a scatter plot with the average RMSD (a standard measure of structural distance between coordinates or structure changes; Y axis) and average end-to-end distance (a distance between the first and the last carbon atom in a protein or structure length; X axis) values, which are the representative parameters for molecular stability in Fig.Ā 5A. From this MD plot, 1B3B9 (green circle), 1B3B9_V2 (red star), and 1B3B9_V21 (yellow star) had average values (X, Y) of (39.22ā±ā4.33, 2.94ā±ā0.32), (38.08ā±ā4.88, 2.89ā±ā0.46), and (38.77ā±ā3.44, 2.65ā±ā0.36), respectively (Fig.Ā 5A,B). The RMSD plot of all structures is provided in Supplemental Fig. 1. We accordingly concluded that 1B3B9_V2 and 1B3B9_V21 have lower RMSD and end-to-end distance values than the 1B3B9 template and other candidates, which implies that the two candidates have more stable structural conformations and higher developability as synthetic antibodies. In this finding, we proposed the best 1B3B9_V21 antibody, which has the lowest average RMSD, representing a more stable configuration, for further characterization as a potent and stable candidate targeting the four envelope proteins of DENV-1 to DENV-4.
ML and MD screening for antibody candidates. We generated approximately 2.5 million CDR-H3 variants from human neutralizing 1B3B9 antibody (template) by random mutation with single, double, and triple points for further screening by ML and MD methods. (A) Distribution of ML-screened NAb candidates by scatter plot of the average end-to-end distance (Ć ) and average RMSD (Ć ) calculated by Amber 16. The 1B3B9 (template) antibody is indicated by a green circle. The final screened 1B3B9_V2 and 1B3B9_V21 NAbs are shown as red and yellow stars. (B) ML, structure modelling quality, and MD analysis of the antibody template (1B3B9) and outstanding candidates (1B3B9_V2 and 1B3B9_V21). Red letters represent mutated amino acid residues of outstanding antibody candidates compared with the antibody template.
In silico characterization of the best 1B3B9_V21 NAb candidate
We investigated the ability of the best predicted antibody candidate (1B3B9_V21) compared to the real-world antibody template (1B3B9) in terms of binding interactions and binding energies (affinities and stabilities) using molecular docking and MD simulations, respectively. We used complexed four envelope proteins of DENV-1 to DENV-4 (PDB: 4CCT, 5A1Z, 3J6T, 4CBF) with 1B3B9 and 1B3B9_V21 in both docking and MD experiments. In a molecular docking study at a resolution of 4.5Ā Ć , 1B3B9_V21 straddled FL epitopes from the top of EDII with interactions involving both the heavy chain and light chains (Fig.Ā 6A). The binding motifs of 1B3B9_V21 were located at the fusion loop (W101, N103, G104, C105, G106, L107 and F108 residues) on the E protein of DENV-2 (Fig.Ā 6A) with 447 atom-to-atom contacts (Fig.Ā 6B). Structural analysis showed that 1B3B9 and 1B3B9_V21 bound to different regions of FL epitopes within R99āF108 (Supplemental Tables 11 and 12). 1B3B9_V21 revealed higher numbers of atom-to-atom contacts with DENV-2 E, DENV-3 E, and DENV-4 E proteins than 1B3B9, except DENV-1 E protein (Supplemental Tables 11 and 12). We therefore sought to determine whether 1B3B9_V21 could bind to DENV-1 to DENV-4 with strong affinity. Next, we tested the binding affinities and stabilities of 1B3B9 and 1B3B9_V21 with DENV-1 to DENV-4 by using MD simulations. We found that 1B3B9_V21 in complex with DENV-1, DENV-2 and DENV-3 showed lower ĪGbind energies with lower average RMSD values (except for the DENV-3 system) than 1B3B9, whereas the ĪGbind of 1B3B9_V21 in complex with DENV-4 was similar to that of 1B3B9 (Fig.Ā 6C and Supplemental Fig. 2). These results were in good agreement with H-bond and contact analyses showing that 1B3B9_V21 in complex with DENV-1, DENV-2 and DENV-3 exhibited higher H-bond formations and number of contact atoms compared to 1B3B9_V21 in complex with DENV-4 (Supplemental Figs. 3 and 4). The RMSD plots of all antibodyāantigen complexes are available in Supplemental Fig. 2. All data from in silico characterization indicated that 1B3B9_V21, which is the best NAb candidate from our computational framework, exhibits cross-neutralizing and stable binding with DENV-1 to DENV-4. Moreover, 1B3B9_V21 also provided binding interactions for these cross-reactive FL epitopes to other flaviviruses such as Zika virus and Japanese encephalitis virus (Supplemental Fig. 5). Binding site was also occupied between FL and 1B3B9_V21 heavy chain especially CDR-H3 mutated amino acid sequence (Supplemental Table 13). We suggest that our computational framework has potential for use in the efficient design and screening of potent and stable dengue therapeutic antibodies.
Molecular docking and MD analysis of the most potent and stable neutralizing antibody candidate (1B3B9_V21) against DENV-1 to DENV-4. We obtained the best NAb candidate, 1B3B9_V21, with potent and stable neutralizing activities against all four serotypes. We performed homology modelling, molecular docking, and MD simulations. (A) Structure of 1B3B9_V21 Fab bound to the fusion loop epitope on the EDII domain of DENV-2 (PDB ID: 5A1Z). (B) Residues participating in interactions between DENV-2 E and 1B3B9_V21 Fab are shown in the table with the number of atom-to-atom contacts analysed by the contact command of AMBER16. The distance cut-off is 4.5Ā Ć . (C) Binding energies of 1B3B9_V21 with DENV-1 to DENV-4 (PDB ID: 4CCT, 5A1Z, 3J6T, 4CBF) were determined in terms of binding affinities (ĪGbind; kcal/mol) and binding stabilities (RMSD; Ć ). Red letters represent mutated amino acid residues of the new antibody candidate compared with the antibody template.
Discussion
Antibody treatment is a promising strategy to combat severe dengue. An ideal antibody therapeutic should sufficiently neutralize all four DENV serotypes to reduce ADE effects, which is the major problem for dengue treatment9,10. A simple, rapid, and efficient antibody design and screening method, especially an in silico method, is the first crucial step to accelerate antibody discovery and the development of dengue therapeutic antibodies.
In this study, we propose a well-characterized dataset and in silico approach to predict potent and stable neutralizing antibodies against DENV-1 to DENV-4 to decrease ADE effects by screening for antibodies with strong and stable binding, thus providing an outstanding 1B3B9_V21 antibody candidate for further antibody engineering and optimization. We newly generated a CDR-H3-epitope-IC50 dataset from well-defined and real experimental results of 100 publications from 1992 to 2022. IC50 values obtained from different laboratory methods are unlikely to significantly impact algorithms, as each assay can accurately reflect the actual activities being measured28. We found that our dataset has a balance of neutralizing (positive) and nonneutralizing (negative) categories with highly diverse properties of CDR-H3 sequences and lower variation among epitope sequences, similar to the results of a previous study23. Our computational framework is designed to combine sequence-based ML screening and simple MD screening to identify the most potent and stable candidates in the most practical workflow. The combination of the XGB model and fingerprint-based method achieved excellent ML performance (AUCā=ā0.885 by independent test) in the prediction of cross-neutralizing antibodies against four DENV serotypes. We chose the AUC metric to evaluate the model's performance and selection for classifying neutralizing and non-neutralizing antibodies because of its comprehensive evaluation across all potential classification thresholds and various operational scenarios, as demonstrated in previous study23. Interestingly, the ML screener was used to screen approximately 2.5 million unseen antibodies to obtain 28 NAb candidates. The MD screener was used to screen 28 potential antibodies to find 2 stable NAb candidates, which may indicate the developability of these screened candidates as synthetic antibodies. Compared to existing approaches17,18,23, our framework uses a sequence-based ML model (no need for antibody and antigen structural data) and a simple MD protocol (requiring only an antibody structure with 15Ā ns MD production) that is easily applied in real-world experiments. In addition, our ML-based framework might be usable for antibody screening against other flaviviruses according to the cross-reactive epitopes that we used in this study, which are conserved among DENV, Zika, Japanese encephalitis, yellow fever, West Nile, and tick-borne encephalitis viruses29,30. Our approach has five key features. (1) It has two successive steps: general ML and simple MD methods. (2) It reduces the limitations of needing the full antibody sequence (instead requiring only the CDR-H3 sequence) and full antigen sequence. (3) It decreases time-consuming tasks by running all processes within 2Ā weeks. (4) It enables cost-effective screening to reduce laboratory work. (5) It is applicable to screening other flaviviruses because of the use of cross-reactive epitopes as shown in Supplemental Table 13.
Regarding the details of ML analysis, we considered the fingerprint-based method to be suitable for the prediction of antibodyāantigen interactions because it can extract information from biophysical and biochemical properties without relying on the sequence or 3D structure. This approach facilitates the identification of unknown therapeutic candidates on a large scale for subsequent experimental validation31. We identified that the important features of 28 NAb candidates belong to the substructure count fingerprint (FP4C), which lists the top 30 fingerprints alongside their respective descriptions in Supplemental Table 9. Three of the top 30 fingerprints belong to the general class of alcohols (FP4C14: secondary alcohol, FP4C13: primary alcohol, and FP4C12: alcohol), which are found in the side chains of certain amino acids such as serine and threonine. These hydroxyl groups can act as both hydrogen bond donors and acceptors, influencing hydrogen bonding networks, binding affinity, and the neutralizing activity of antibodies32. Two of the 30 most important features were FP4C274 (aromatic ring) and FP4C184 (heteroaromatic ring), which are present in the side chains of certain amino acids like phenylalanine, tryptophan, and histidine. Aromaticity plays a significant role in cooperative interactions involving hydrophobicity, charge, and hydrogen bonding properties, making it well-suited for creating binding sites for epitopes, irrespective of the presence of polar and nonpolar surface residues33. For the FP4C307 (chiral center) fingerprint, it is recognized as a pivotal feature that significantly improves the predictive performance of machine learning models in drug discovery tasks, particularly in predicting KRAS G12C inhibitors and other biological activities34. Regarding FP4C84 (carboxylic acid) and FP4C88 (carboxylic acid derivative), which are found in the side chains of amino acids like aspartic acid and glutamic acid, they participate in hydrogen bonding, ionic interactions, and salt bridges critical for protein structure and proteināprotein interactions35. FP4C302 (rotatable bond), with an average F-score of 40, represents single bonds that allow free rotation, contributing to molecular flexibility. This structural characteristic is crucial for determining overall molecular flexibility. Interestingly, all compounds in the dataset possessed rotatable bonds, particularly abundant in active compounds, suggesting their importance for biological activity36. FP4C295 (CāOāNāS bond) refers to the presence of a carbonāoxygenāsulfur bond in molecular structures. This feature can influence proteināprotein binding by participating in hydrogen bonding interactions with amino acid residues on the protein surface, enhancing binding affinity and specificity37. Additionally, other significant FP4Cs include miscellaneous descriptors such as FP4C1-3 and FP4C300. FP4C1 (primary carbon), FP4C2 (secondary carbon), and FP4C3 (tertiary carbon) carbons indicate that carbon atoms with two or three neighbouring carbons in drug molecules may enhance metabolic stability by limiting access to metabolic pathways38. Based on feature importance (Supplemental Table 9) and amino acid analysis (Fig.Ā 4), we suggest that the essential characteristics of our identified neutralizing antibodies encompass alcohols (present in amino acids S, T, and Y) and aromaticity (linked to amino acids F, Y, and W). These features are essential for the binding mechanism and maintaining the correct secondary structure at the antibodyāantigen binding sites39.
Furthermore, we observed that the mutated residues of outstanding candidates are mostly located in the central area of the CDR-H3 region (S100, Y102, S103, A104, D105, W106, E108, D109, and Y110), which are crucial sites for antibody engineering and improvement, as supported by previous studies40,41. After characterization by in silico methods, 1B3B9_V21 revealed binding motifs at the most cross-reactive fusion loop region on the EDII protein of DENV-1 to DENV-4 (W101, G102, G104, G106, and L107 residues). These binding sites have been reported in many studies of cross-neutralizing antibodies against all four DENV6,29,42. In molecular dynamics, 1B3B9_V21 showed high binding affinities (less than -22.83ā±ā4.89Ā kcal/mol) and stabilities (less than 11.83ā±ā1.960Ā Ć ) with DENV-1 to DENV-4. Most of the binding energies of 1B3B9_V21 are lower than those of the real 1B3B9 antibody. Therefore, we suggest 1B3B9_V21 as a potent and stable NAb candidate for development as a therapeutic agent. To accomplish the application of the NAb candidates, in vitro tests of the neutralizing and ADE activities of these predicted antibodies are still required.
In conclusion, we have presented the first ML-based framework that employed a sequence-based ML method and a simple MD method for the accurate and rapid identification of NAbs against DENV-1 to DENV-4. We also provided an updated dengue antibody dataset including unique information on CDR-H3 sequences, epitope sequences, and IC50 values. We used three different feature-encoding methods and ten ML algorithms to compare and exhibit the best performing model. Our ML model can be used for large-scale identification of NAbs. The MD method can be used to select stable NAbs for more accurate identification. Our outstanding 1B3B9_V21 candidate showed high potential for development as a therapeutic antibody, and this NAb candidate is warranted for further in vitro analysis. Our proposed computational framework might support novel opportunities to discover, design, and engineer therapeutic antibodies against four dengue viruses and other flaviviruses.
Methods
Computational framework design
We built a dataset, model, and computational framework for screening cross-neutralizing antibodies against DENV. The computational framework contains four steps (Fig.Ā 1): (1) dataset preparation of CDR-H3-epitope-IC50 pairing data, (2) feature extraction and ML analysis for the best performing model, (3) generation of CDR-H3 antibody variants and ML screening for potential antibody candidates, and (4) MD screening for stable antibody candidates.
Dataset preparation
The dataset of anti-DENV CDR-H3 sequences was prepared from published data that satisfied three criteria: (1) including complete CDR-H3 amino acid sequences specific to DENV and published in PubMed or Google patent databases, (2) including epitopes on the E protein in EDII (fusion loop), EDI-EDII (interdomain), EDIII, or EDE, and (3) including neutralizing activities against DENV in µg/ml, as characterized by a focus reduction neutralization test (FRNT) or plaque reduction neutralization test (PRNT) or enzyme-linked immunosorbent assay (ELISA). The CDR-H3 sequences were annotated using the IMGT numbering scheme through the Antibody Region-Specific Alignment (AbRSA) web service. The neutralization (NT) activities derived from FRNT and PRNT represent actual experimental NT values, while antibodies that do not bind in the ELISA assay were categorized as non-neutralizing because they fail to bind to the antigen. All data (Supplemental Table 1) were carefully preprocessed to handle missing data and remove redundant data. The compilation of interactions between CDR-H3 and epitope sequences followed the principles outlined by Magar et al.23. Our dataset was derived from experimental results that included the sequences of CDR-H3 and epitopes, along with corresponding in vitro IC50 values against any of the four serotypes of dengue virus. In this dataset, we collected CDR-H3 sequences along with their epitopes and neutralizing activities (IC50; µg/ml) and further divided them into a positive (neutralizing) dataset (IC50āā¤ā10 µg/ml) and a negative (nonneutralizing) dataset (IC50ā>ā10 µg/ml). Each positive and negative dataset was randomly divided into a training dataset (80%) and an independent dataset (20%) (Table 1).
Feature extraction and machine learning analysis
To evaluate the contribution of variant ML methods at the different levels of each antibodyāantigen interaction, we performed a comparative analysis on different feature extraction schemes and ML methods. In the case of the feature extraction schemes, we applied three types of feature descriptors addressing multiple aspects, including 12 sequence-based, 7 atom-based, and 12 fingerprint-based feature descriptors (Supplemental Tables 2ā4) following the previous studies43,44,45. The CDR-H3 and epitope sequences were concatenated and encoded into a single embedding of an antibodyāantigen complex using our in-house Python-based code. Data normalization was performed to transform the input features into the same scale using MināMax Scaler. Afterwards, the scaled training set was employed to train ten different ML algorithms: extreme gradient boosting (XGB), support vector machine (SVM), random forest (RF), partial least squares regression (PLS), multilayer perceptron (MLP), logistic regression (LR), linear support vector classification (LN), extra tree (ET), decision tree (DT), and k-nearest neighbours (KNN) classifiers. The algorithm selection was guided by considerations of accuracy, computational resources, and interpretability to derive the optimal predictive model for the dataset. Simpler models such as PLS, LR, LN, and DT provide easier interpretability, whereas more complex models like XGB, SVM, RF, MLP, ET, and KNN offer greater effectiveness in capturing intricate patterns within high-dimensional data spaces and deliver superior predictive capabilities24. All models are designed specifically for binary classification. Hyperparameter searching was performed as described in Supplemental Table 5. Herein, we evaluated and compared the performance of the developed ML methods based on tenfold cross-validation (CV) and independent tests. We demonstrated ML performances based on five metrics: accuracy (ACC), sensitivity (Sn), specificity (Sp), Matthewās correlation coefficient (MCC), and area under the ROC curve (AUC), as previously described43,46. The ML algorithm with the greatest AUC for predicting NAbs was selected to screen a panel of unseen antibodies in the ML screening step. Model comparison, analysis and evaluation were performed using Python script 3.8 (all codes; available at GitHub).
In silico antibody library
An in silico antibody library of 2,529,794 variants was generated by making single, double, and triple mutations on all amino acids in the CDR-H3 region of the 1B3B9 neutralizing human monoclonal antibody (antibody template)6 using Python script 3.8. All antibody variants were screened and ranked by the best ML algorithm and simple MD simulation in further steps to discover new potent and stable NAb candidates.
ML and MD screening methods
In the first ML screening step, all antibody variants were filtered by the best performing ML algorithm with a confidence score cut-off of 0.990 to screen for potent neutralizing antibody candidates against four serotypes of DENV. Then, we constructed the 3D structure of the 1B3B9 Fab antibody template and ML-screened antibody variants using the SWISS-MODEL server47 (PDB 3t2n.1). Homology assessments and structural comparisons were performed using the SWISS-ExPASy server in terms of Ramachandran plots and QMEAN scores48 to validate the model quality of all 3D structures before screening by MD simulation.
In the second MD screening step, to determine the Fab structural stability as previously described (18), we performed MD simulations of all Fab antibody variants in a solvated environment using Amber 16. Each system was simulated under the periodic boundary condition with isothermal-isobaric (NPT) ensemble. The temperature and pressure were controlled by using Langevin thermostat49 with collision frequency of 2.0 and Berendsen barostat50. Topologies of each Fab were generated according to the ff14SB forcefield51. The Fab molecule was cantered in a cubic simulation box, extending 1Ā nm from the molecule surface. Then, the system box was solved by TIP3B model water atoms52, and a net positive charge was neutralized with chloride ions. Energy minimization was carried out using 1500 steps of steepest descent (SD) followed by 1500 steps of conjugated gradient (CG) methods with constrained solvent molecules. Then, the whole system was fully minimized using the same procedure. The minimized system was subjected to two rounds of equilibration at 310Ā K and 1Ā atm. First, the molecular system was equilibrated in the NVT ensemble for 100 picoseconds and a 2-femtosecond time step. Second, the equilibration was applied in a round of NPT simulation for 100 picoseconds to ensures that the simulated system is at physiological temperature and pressure. Then, the system was carried out in NPT and no constraints for 500 picoseconds. Finally, a 15Ā ns unrestrained NPT simulation at 310Ā K and 1Ā atm was carried out under identical simulation parameters. We collected the MD results and demonstrated the stabilities of all variants in the scatter plot of average end-to-end distance values at the X axis and averaged the root-mean-square displacement (RMSD) values at the Y axis to select the outstanding NAb candidates.
Characterization by molecular docking and MD simulation
The antibody template (1B3B9) and the best screened NAb candidate were further characterized for their binding sites, binding affinities, and binding stabilities with DENV-1 to DENV-4 (utilizing PDB IDs: 4CCT, 5A1Z, 3J6T, 4CBF, which correspond to the strains used in previous in vitro studies)6 using molecular docking and MD simulation. First, antibodyāantigen docking was performed by using the ZDOCK server53 using the default parameters and the target residues. The best posture of each docked complex was selected according to the lowest-energy ZRANK score. Then, MD simulations of each antibodyāantigen complex were executed at 310Ā K and 1Ā atm for 100Ā ns using Amber 16. The MD procedure was the same as in the MD screening method above. The MD trajectories were saved every 10Ā ps.
The binding affinity of the antibodyāantigen complex was calculated as binding free energy (ĪGbind; kcal/mol) based on the MM/PB(GB)SA approach from the last 20Ā ns of the MD production using the CPPTRAJ54 and MMPBSA.py55 modules of AMBER16. The binding stability of each complex was evaluated in terms of RMSD according to the number of antigenāantibody hydrogen bonds and the number of atomic contacts using the CPPTRAJ modules of AMBER16. The H-bond interactions were calculated using two criteria: (1) distance between the hydrogen donor (HD) and hydrogen acceptor (HA) of ā¤ā3.5Ā Ć and (2) HDāHĀ·Ā·Ā·HA angle of ā„ā150°. The number of atom contacts was counted as the number of atoms within 4.5Ā Ć of each complex.
Data availability
All data generated or analyzed during this study are included in this published article and its supplementary information files. The dataset, feature descriptions, ML performances, RMSD plots of MD simulations, data of ML-screened antibodies, and numbers of atom-to-atom contacts are provided in the supplementary information. All codes in this study are available online at https://github.com/PiyatidaNatsrita/In-silico-dengue-virus-antibody-prediction-2022.git.
References
Bhatt, S. et al. The global distribution and burden of dengue. Nature. 496, 504ā507. https://doi.org/10.1038/nature12060 (2013).
Khetarpal, N. & Khanna, I. Dengue fever: Causes, complications, and vaccine strategies. J. Immunol. Res. 2016, 6803098. https://doi.org/10.1155/2016/6803098 (2016).
Thomas, S. J. Is new dengue vaccine efficacy data a relief or cause for concern?. NPJ Vaccines. 8, 55. https://doi.org/10.1038/s41541-023-00658-2 (2023).
Beltramello, M. et al. The human immune response to dengue virus is dominated by highly cross-reactive antibodies endowed with neutralizing and enhancing activity. Cell Host Microbe. 8, 271ā283. https://doi.org/10.1016/j.chom.2010.08.007 (2010).
Dejnirattisai, W. et al. A new class of highly potent, broadly neutralizing antibodies isolated from viremic patients infected with dengue virus. Nat. Immunol. 16, 170ā177. https://doi.org/10.1038/ni.3058 (2015).
Pitaksajjakul, P. et al. Antibody germline characterization of cross-neutralizing human IgGs against 4 serotypes of dengue virus. Biochem. Biophys. Res. Commun. 446, 475ā480. https://doi.org/10.1016/j.bbrc.2014.02.131 (2014).
Xu, M. et al. Protective capacity of the human anamnestic antibody response during acute dengue virus infection. J. Virol. 90, 11122ā11131. https://doi.org/10.1128/JVI.01096-16 (2016).
Stiasny, K., Kiermayr, S., Holzmann, H. & Heinz, F. X. Cryptic properties of a cluster of dominant flavivirus cross-reactive antigenic sites. J. Virol. 80, 9557ā9568. https://doi.org/10.1128/JVI.00080-06 (2006).
Dussupt, V., Modjarrad, K. & Krebs, S. J. Landscape of monoclonal antibodies targeting zika and dengue: Therapeutic solutions and critical insights for vaccine development. Front. Immunol. 11, 621043. https://doi.org/10.3389/fimmu.2020.621043 (2021).
Fibriansah, G. & Lok, S. M. The development of therapeutic antibodies against dengue virus. Antiviral Res. 128, 7ā19. https://doi.org/10.1016/j.antiviral.2016.01.002 (2016).
Dejnirattisai, W. et al. SARS-CoV-2 Omicron-B.1.1.529 leads to widespread escape from neutralizing antibody responses. Cell. 185, 467ā484. https://doi.org/10.1016/j.cell.2021.12.046 (2022).
VanBlargan, L. A. et al. An infectious SARS-CoV-2 B.1.1.529 Omicron virus escapes neutralization by therapeutic monoclonal antibodies. Nat. Med. 28, 490ā495. https://doi.org/10.1038/s41591-021-01678-y (2022).
Injampa, S. et al. Generation and characterization of cross neutralizing human monoclonal antibody against 4 serotypes of dengue virus without enhancing activity. PeerJ. 5, e4021. https://doi.org/10.7717/peerj.4021 (2017).
Chan, K. R., Ong, E. Z. & Ooi, E. E. Therapeutic antibodies as a treatment option for dengue fever. Expert Rev. Anti Infect. Ther. 11, 1147ā1157. https://doi.org/10.1586/14787210.2013.839941 (2013).
Sormanni, P., Aprile, F. A. & Vendruscolo, M. Third generation antibody discovery methods: In silico rational design. Chem. Soc. Rev. 47, 9137ā9157. https://doi.org/10.1039/c8cs00523k (2018).
Akbar, R. et al. In silico proof of principle of machine learning-based antibody design at unconstrained scale. MAbs. 14, 2031482. https://doi.org/10.1080/19420862.2022.2031482 (2022).
Ruffolo, J. A., Sulam, J. & Gray, J. J. Antibody structure prediction using interpretable deep learning. Patterns (NY). 3, 100406. https://doi.org/10.1016/j.patter.2021.100406 (2022).
Shan, S. et al. Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization. Proc. Natl. Acad. Sci. 119, e2122954119. https://doi.org/10.1073/pnas.2122954119 (2022).
Liu, G. et al. Antibody complementarity determining region design using high-capacity machine learning. Bioinformatics. 36, 2126ā2133. https://doi.org/10.1093/bioinformatics/btz895 (2020).
Li, X., Van Deventer, J. A. & Hassoun, S. ASAP-SML: An antibody sequence analysis pipeline using statistical testing and machine learning. PLOS Comput. Biol. 16, e1007779. https://doi.org/10.1371/journal.pcbi.1007779 (2020).
Rawi, R. et al. Accurate prediction for antibody resistance of clinical HIV-1 isolates. Sci. Rep. 9, 1ā12. https://doi.org/10.1038/s41598-019-50635-w (2019).
Yu, W. H. et al. Predicting the broadly neutralizing antibody susceptibility of the HIV reservoir. JCI Insight. 4, e130153. https://doi.org/10.1172/jci.insight.130153 (2019).
Magar, R., Yadav, P. & Barati Farimani, A. Potential neutralizing antibodies discovered for novel corona virus using machine learning. Sci. Rep. 11, 5261. https://doi.org/10.1038/s41598-021-84637-4 (2021).
Horst, A. et al. Machine learning detects anti-DENV signatures in antibody repertoire sequences. Front. Artif. Intell. 4, 715462. https://doi.org/10.3389/frai.2021.715462 (2021).
Natali, E. et al. The dengue-specific immune response and antibody identification with machine learning. NPJ Vaccines. 9, 16. https://doi.org/10.1038/s41541-023-00788-7 (2024).
Wong, Y. H. et al. Molecular basis for dengue virus broad cross-neutralization by humanized monoclonal antibody 513. Sci. Rep. 8, 8449. https://doi.org/10.1038/s41598-018-26800-y (2018).
Rathore, A. S., Sarker, A. & Gupta, R. D. Designing antibody against highly conserved region of dengue envelope protein by in silico screening of scFv mutant library. PLoS One. 14, e0209576. https://doi.org/10.1371/journal.pone.0209576 (2019).
Chaudhury, S. et al. Dengue virus antibody database: Systematically linking serotype-specificity with epitope mapping in dengue virus. PLoS Negl. Trop. Dis. 11, e0005395. https://doi.org/10.1371/journal.pntd.0005395 (2017).
Deng, Y. Q. et al. A broadly flavivirus cross-neutralizing monoclonal antibody that recognizes a novel epitope within the fusion loop of e protein. PLoS One. 6, e16059. https://doi.org/10.1371/journal.pone.0016059 (2011).
França, R.K.A. de O, Silva, J.M., Rodrigues, L.S., Sokolowskei, D., Brigido, M.M., Maranhão, A.Q. New anti-flavivirus fusion loop human antibodies with zika virus-neutralizing potential. Int. J. Mol. Sci. 23, 7805. https://doi.org/10.3390/ijms23147805 (2022).
Schaduangrat, N. et al. StackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy. Sci. Rep. 12, 16435. https://doi.org/10.1038/s41598-022-20143-5 (2022).
Wang, D., Ge, Y., Zhong, B. & Liu, D. Specific epitopes form extensive hydrogen-bonding networks to ensure efficient antibody binding of SARS-CoV-2: Implications for advanced antibody design. Comput. Struct. Biotechnol. J. 19, 1661ā1671. https://doi.org/10.1016/j.csbj.2021.03.021 (2021).
HofstƤdter, K., Stuart, F., Jiang, L., Vrijbloed, J. W. & Robinson, J. A. On the importance of being aromatic at an antibody-protein antigen interface: Mutagenesis of the extracellular interferon γ receptor and recognition by the neutralizing antibody A6. J. Mol. Biol. 285, 805ā815. https://doi.org/10.1006/jmbi.1998.2343 (1999).
Srisongkram, T., Khamtang, P. & Weerapreeyakul, N. Prediction of KRASG12C inhibitors using conjoint fingerprint and machine learning-based QSAR models. J. Mol. Graph Model. 122, 108466. https://doi.org/10.1016/j.jmgm.2023.108466 (2023).
Chan, A. W., Laskowski, R. A. & Selwood, D. L. Chemical fragments that hydrogen bond to Asp, Glu, Arg, and His side chains in protein binding sites. J. Med. Chem. 53, 3086ā3094. https://doi.org/10.1021/jm901696w (2010).
Wicker, J. G. & Cooper, R. I. Beyond rotatable bond counts: Capturing 3D conformational flexibility in a single descriptor. J. Chem. Inf. Model. 56, 2347ā2352. https://doi.org/10.1021/acs.jcim.6b00565 (2016).
Jasper, J. B., Jasper, J. B., Humbeck, L., Brinkjost, T. & Koch, O. A novel interaction fingerprint derived from per atom score contributions: Exhaustive evaluation of interaction fingerprint performance in docking based virtual screening. J. Cheminform. 10, 15. https://doi.org/10.1186/s13321-018-0264-0 (2018).
Uetrecht, J.P., & Trager, W. Drug Metabolism: Chemical and Enzymatic Aspects (1st ed.). CRC Press. https://doi.org/10.1201/b14488 (2007).
Goulet, D. R. & Atkins, W. M. Considerations for the design of antibody-based therapeutics. J. Pharm. Sci. 109, 74ā103. https://doi.org/10.1016/j.xphs.2019.05.031 (2020).
Parameswaran, P. et al. Convergent antibody signatures in human dengue. Cell Host Microbe. 13, 691ā700. https://doi.org/10.1016/j.chom.2013.05.008 (2013).
Bürckert, J. P. et al. Functionally convergent B cell receptor sequences in transgenic rats expressing a human B cell repertoire in response to tetanus toxoid and measles antigens. Front. Immunol. 8, 1834. https://doi.org/10.3389/fimmu.2017.01834 (2017).
Pipattanaboon, C. et al. Cross-reactivity of human monoclonal antibodies generated with peripheral blood lymphocytes from dengue patients with Japanese encephalitis virus. Biologics. 7, 175ā187. https://doi.org/10.2147/BTT.S47438 (2013).
Charoenkwan, P., Nantasenamat, C., Hasan, M. M. & Shoombuatong, W. iTTCA-Hybrid: Improved and robust identification of tumor T cell antigens by utilizing hybrid feature representation. Anal. Biochem. 599, 113747. https://doi.org/10.1016/j.ab.2020.113747 (2020).
Duvenaud, D.K., et al. Convolutional networks on graphs for learning molecular fingerprints. Adv. Neural Inform. Process. Syst. https://doi.org/10.48550/arXiv.1509.09292 (2015).
Kim, S., Bolton, E. E. & Bryant, S. H. PubChem3D: Conformer ensemble accuracy. J. Cheminform. 5, 1. https://doi.org/10.1186/1758-2946-5-1 (2013).
Charoenkwan, P., Nantasenamat, C., Hasan, M. M. & Shoombuatong, W. Meta-iPVP: A sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation. J. Comput. Aided Mol. Des. 34, 1105ā1116. https://doi.org/10.1007/s10822-020-00323-z (2020).
Waterhouse, A. et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296-303. https://doi.org/10.1093/nar/gky427 (2018).
Benkert, P., Biasini, M. & Schwede, T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics. 27, 343ā350. https://doi.org/10.1093/bioinformatics/btq662 (2011).
Uberuaga, B. P., Anghel, M. & Voter, A. F. Synchronization of trajectories in canonical molecular-dynamics simulations: Observation, explanation, and exploitation. J. Chem. Phys. 120, 6363ā6374. https://doi.org/10.1063/1.1667473 (2004).
Berendsen, H. J. C., Postma, J. P. M., van Gunsteren, W. F., DiNola, A. & Haak, J. R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81, 3684ā3690. https://doi.org/10.1063/1.448118 (1984).
Klaewkla, M., Charoenwongpaiboon, T. & Mahalapbutr, P. Molecular basis of the new COVID-19 target neuropilin-1 in complex with SARS-CoV-2 S1 C-end rule peptide and small-molecule antagonists. J. Mol. Liq. 335, 116537. https://doi.org/10.1016/j.molliq.2021.116537 (2021).
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926ā935. https://doi.org/10.1063/1.445869 (1983).
Chen, R., Li, L. & Weng, Z. ZDOCK: An initial-stage protein-docking algorithm. Proteins. 52, 80ā87. https://doi.org/10.1002/prot.10389 (2003).
Roe, D. R. & Cheatham, T. E. PTRAJ and CPPTRAJ: Software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9, 3084ā3095. https://doi.org/10.1021/ct400341p (2013).
Miller, B.R., McGee, T.D.J., Swails, J.M., Homeyer, N., Gohlke, H., Roitberg, A.E. MMPBSA.py: An Efficient Program for End-State Free Energy Calculations. J Chem Theory Comput. 8, 3314ā21. https://doi.org/10.1021/ct300418h (2012)
Acknowledgements
This research project was financially supported by the Young Researcher Development Project of Khon Kaen University.
Author information
Authors and Affiliations
Contributions
PN prepared the dataset, performed all in silico experiments, and wrote the manuscript. CP designed the study, supervised dataset preparation and data analysis, and wrote and edited the manuscript. PC and WS supervised the ML analysis and edited the manuscript. PM and TR supervised MD analysis and edited the manuscript. SC and KF assisted with result interpretation and edited the manuscript. All authors reviewed and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Natsrita, P., Charoenkwan, P., Shoombuatong, W. et al. Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes. Sci Rep 14, 17165 (2024). https://doi.org/10.1038/s41598-024-67487-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-024-67487-8








