Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes

Natsrita, Piyatida; Charoenkwan, Phasit; Shoombuatong, Watshara; Mahalapbutr, Panupong; Faksri, Kiatichai; Chareonsudjai, Sorujsiri; Rungrotmongkol, Thanyada; Pipattanaboon, Chonlatip

doi:10.1038/s41598-024-67487-8

Download PDF

Article
Open access
Published: 26 July 2024

Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes

Piyatida Natsrita¹,
Phasit Charoenkwan³,
Watshara Shoombuatong⁴,
Panupong Mahalapbutr⁵,
Kiatichai Faksri^1,2,
Sorujsiri Chareonsudjai¹,
Thanyada Rungrotmongkol⁶ &
…
Chonlatip Pipattanaboon¹

Scientific Reports volume 14, Article number: 17165 (2024) Cite this article

5032 Accesses
6 Citations
Metrics details

Subjects

Abstract

Several computational methods have been developed to identify neutralizing antibodies (NAbs) covering four dengue virus serotypes (DENV-1 to DENV-4); however, limitations of the dataset and the resulting performance remain. Here, we developed a new computational framework to predict potent and stable NAbs against DENV-1 to DENV-4 using only antibody (CDR-H3) and epitope sequences as input. Specifically, our proposed computational framework employed sequence-based ML and molecular dynamic simulation (MD) methods to achieve more accurate identification. First, we built a novel dataset (n = 1108) by compiling the interactions of CDR-H3 and epitope sequences with the half maximum inhibitory concentration (IC50) values, which represent neutralizing activities. Second, we achieved an accurately predictive ML model that showed high AUC values of 0.879 and 0.885 by tenfold cross-validation and independent tests, respectively. Finally, our computational framework could be applied to filter approximately 2.5 million unseen antibodies into two final candidates that showed strong and stable binding to all four serotypes. In addition, the most potent and stable candidate (1B3B9_V21) was evaluated for its development potential as a therapeutic agent by molecular docking and MD simulations. This study provides an antibody computational approach to facilitate the high-throughput identification of NAbs and accelerate the development of therapeutic antibodies.

Accurate identification of broadly neutralizing antibodies against dengue virus based on deep stacking strategy with multi-perspective features

Article Open access 10 December 2025

The dengue-specific immune response and antibody identification with machine learning

Article Open access 20 January 2024

Multivalent administration of dengue E dimers on liposomes elicits type-specific neutralizing responses without immune interference

Article Open access 09 June 2025

Introduction

Dengue virus (DENV), which has four antigenically distinct serotypes (DENV-1 to DENV-4), has worldwide spread and causes 400 million infections/year, with 100 million cases of mild to severe dengue disease (dengue fever, dengue haemorrhagic fever, or dengue shock syndrome) and 20,000 deaths¹. To date, there is neither a fully effective, licenced dengue vaccine nor an approved therapeutic agent due to the phenomenon of antibody-dependent enhancement (ADE), which is a pathological immune response to subsequent infection with a different serotype, leading to severe disease². ADE occurs when non-neutralizing or sub-neutralizing levels of antibodies from a previous dengue infection or vaccination bind to a different serotype of dengue virus during a subsequent infection. Recent updates show that various approved dengue vaccines demonstrate differing efficacy rates across each serotype, typically ranging from approximately 60% and 80% in clinical trials involving both immune and non-immune participants to dengue³. To mitigate the risk of ADE, dengue vaccine or therapeutic candidates must induce strong and balanced immune responses against all four dengue serotypes.

DENV particles are composed of seven nonstructural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, NS5) and three structural proteins (E, C, prM/M). Dengue envelope (E) protein is the principal target for potent broadly neutralizing antibodies (NAbs), containing neutralizing epitopes on EDII (envelope domain II with fusion loop), EDI-EDII (interdomain), EDIII (envelope domain III), and EDE (envelope dimer epitope)^4,5,6,7. The overall structure of the E protein is conserved across all four DENV serotypes, exhibiting 60–70% amino acid sequence identity⁸. Developing cross-reactive neutralizing therapeutic antibodies or vaccines against all serotypes requires navigating the structural variability of the E protein and identifying conserved epitopes that induce protective immunity without enhancing infection. The fusion loop, a conserved region situated within EDII, plays a vital role in the viral fusion process with host cells and may potentially trigger ADE⁸. Variations in the sequence and structure of the EDII among serotypes impact antibody binding, neutralization efficacy, and ADE, thereby impacting the design of both therapeutics and vaccines. Addressing these challenges emphasizes the critical importance of gaining a thorough understanding of the structure of DENV and the interactions between antibodies and antigens. This understanding is crucial for refining strategies aimed at developing vaccines and therapeutics.

Over the past three decades, there have been many attempts to develop NAbs against E proteins of all four serotypes, known as cross-neutralizing antibodies, with half maximum inhibitory concentration (IC50) values (or neutralizing activities) less than 10 µg/mL to reduce the effects of ADE as promising therapeutic agents for dengue infection^9,10. Recent studies on therapeutic antibodies have established a threshold of IC50 ≤ 10 µg/mL to distinguish between neutralizing (IC50 ≤ 10 µg/mL) and nonneutralizing (IC50 > 10 µg/mL) activities^11,12. One of our antibody candidates, 1B3B9, targets the fusion loop and effectively neutralizes all four dengue serotypes. However, it exhibits varying levels of neutralizing activity (ranging from 0.125 to 3 ug/ml) against different serotypes, which could potentially lead to ADE^6,13. To tackle this challenge, we aim to mutate this antibody and apply a computational framework to enhance its effectiveness.

The design of antibodies for the treatment of various diseases, such as DENV infectious diseases, requires two structural parts: fragment antigen-binding (Fab) and constant fragment crystallizable (Fc) regions¹⁴. The Fab region contains the hypervariable regions or complementary determining regions (CDRs) CDR-H1, CDR-H2, and CDR-H3 on each heavy chain and CDR-L1, CDR-L2, and CDR-L3 on each light chain. CDR-H3 is the most hypervariable region responsible for binding and neutralizing activities that should be considered for antibody-specified epitope prediction¹⁵. Antibody design and prediction (screening) is the most important step to limit laborious, time-consuming, and expensive laboratory work in the process of therapeutic antibody discovery and development. Recently, the integration of experimental (in vitro and in vivo) and computational (in silico) methods has assisted and enriched the effective generation of therapeutic antibodies due to the advancement of experimental studies, sequence data, structural data, and computational approaches, especially machine learning (ML) and molecular dynamics (MD) simulation techniques¹⁵.

ML and MD can be used to detect and predict important patterns of epitopes on antigens, paratopes on antibodies, or paratope–epitope interactions and enable us to develop novel therapeutic antibodies quickly to combat emerging and reemerging diseases^15,16. Most of the computational algorithms for antibody screening that showed high accuracy involved the use of complex methods and structural data^17,18. Simple, sequence-based ML methods that can be used to screen for antibodies with desired properties have been reported for only some diseases, such as cancers^19,20, HIV infection^21,22, and SARS-CoV-2 infection²³. Previous prediction approaches for dengue NAbs have used only ML-based^24,25 or MD-based²⁶ methods, which have different advantages in high-throughput prediction for antibody screening or highly efficient prediction for antibody developability, respectively^24,27. None of the previous approaches used the dengue antibody dataset containing full-length CDR-H3 sequences and epitope sequences together with the in vitro IC50 values. Additionally, there is no complete in silico screening approach for cross-neutralizing antibodies against the four DENV serotypes^24,28.

In this study, we developed an computational framework, incorporating a sequence-based ML method and a simple MD method, to accurately identify potent and stable neutralizing antibodies against all four dengue serotypes. The development process in our framework consists of four major steps: (1) dataset preparation, (2) feature extraction and ML analysis, (3) ML screening, and (4) MD screening. The major contributions from this process can be summarized as follows. First, we comprehensively generated a novel sequence-based dataset with IC50 activities from available experimental results over the last three decades. Second, we developed a computational framework, which is the first integrative approach (ML and MD methods) to identify potent and stable antibodies against DENV-1 to DENV-4. We compared and demonstrated the performance of three different encoding schemes in conjunction with ten ML algorithms to select the best performing model and capture the crucial information on NAbs. We then applied this computational framework to screen several million unseen antibodies into two final NAb candidates. Finally, we identified the most potent and stable 1B3B9_V21 antibody as a promising therapeutic agent against dengue infection by molecular docking and MD analysis.

Results

Computational framework overview and dataset analysis

We designed a novel in silico prediction framework integrating sequence-based ML and simple MD methods to screen high-throughput antibody variants for potent and stable NAb candidates against four serotypes of DENV. The computational framework overview consists of four steps: dataset preparation, feature extraction and ML analysis, ML screening, and MD screening (Fig. 1). In the dataset preparation, we collected CDR-H3 sequences, epitope sequences, and experimental IC50 values from well-defined PubMed and Google patent databases during 1992–2022 (n = 100 publications; Supplemental Table 1). Missing and redundant data were not included in our dataset. All sequences were paired to form a total of 1108 antibody–antigen interactions and further labelled according to IC50 values as the neutralizing class (positive dataset; IC50 ≤ 10 μg/mL; n = 554) and the nonneutralizing class (negative dataset; IC50 > 10 μg/mL; n = 554) (Table 1).

Table 1 Details of the benchmark dataset used for ML analysis.

Full size table

Overall, we found that the antibody–antigen interactions in the neutralizing (N) and nonneutralizing (NN) classes had binding residues on EDII (N = 32.13%, NN = 30.51%), EDIII (N = 25.27%, NN = 37.36%), EDE (N = 21.84%, NN = 15.52%), interdomain (N = 19.13%, NN = 14.62%), and EDI (N = 1.62%, NN = 1.99%) (Fig. 2A). We randomly divided our balanced dataset into a training dataset (80%, n = 443) and an independent dataset (20%, n = 111) (Table 1). To visualize the diversity of our training data, we separately analysed the sequence-extracted input of CDR-H3 and epitope sequences in the t-distributed stochastic neighbour embedding (t-SNE) plot and labelled them with epitope domains. We found that CDR-H3 sequences are diverse, as illustrated in Fig. 2B. In contrast, the epitope sequences in all domains are well clustered in several areas (Fig. 2C). We also evaluated the IC50 labels of all input interactions and found that the IC50 cut-off value representing a high neutralizing antibody level (IC50 ≤ 10 μg/ml) can be used to explicitly divide our dataset into neutralizing and nonneutralizing classes (Fig. 2D).

Analysis and selection of the best ML-performing method

In this section, we describe a comparative experiment on a variant ML model developed by using three encoding methods and ten ML models. In the ML analysis, we performed tenfold cross-validation and independent tests on our dataset to demonstrate the prediction performance of all ML models (Supplemental Tables 6–8). To compare the ML performance for NAb prediction, we demonstrated the top five performing models with the highest AUC scores of each experiment in terms of five different matrices by tenfold cross-validation and independent tests (Table 2). We found that the AUCs of the top five ML models using the fingerprint-based encoding method were higher than those of other encoding methods (Table 2). Finally, we selected the best predictive approach of the XGB algorithm and fingerprint-based encoding method for the first-round screening of potential antibody candidates. This ML model yielded an ACC of 0.802, Sn of 0.788, Sp of 0.817, MCC of 0.604, and AUC of 0.885, as indicated by an independent test (Table 2). To confirm the interpretability of the XGB model, we illustrated the performance of this model and the other top five ML models of each method in ROC curves with AUC values of 0.554–0.885 by the independent test (Fig. 3A–C). The XGB model has an AUC of 0.885, which represents the strong ability of the XGB model to classify neutralizing or nonneutralizing antibodies (Fig. 3C). Furthermore, we determined the top 30 fingerprint features connecting with the corresponding amino acids, which are often found in the neutralizing class, via the F score plot to demonstrate the accuracy of those features (Fig. 4). The important features correlated with some amino acids (not all or nearly all amino acids) were considered unique features that might play a major role in the neutralization mechanism of dengue antibodies, including alcohol (amino acids S, T, and Y) and aromaticity (amino acids F, Y, and W) features (Fig. 4). However, there are some features (labelled as could not be determined; ND) that have not been reported to be related to amino acids and binding properties in antibody interactions.

Table 2 Top five ML algorithms based on three different feature encoding methods.

Full size table

Generation and screening of new antibody variants

We first generated new antibody variants (unseen antibodies) by applying single, double, and triple mutations to all amino acids of the CDR-H3 sequence (TTLSGYSADWPEDY) of the 1B3B9 neutralizing human monoclonal antibody, resulting in 2,529,794 CDR-H3 variants. These CDR-H3 variants were paired with the most cross-reactive epitopes of dengue virus (FL epitope residues; CCDRWCFCCK) as antibody–antigen sequences for feature extraction by the fingerprint-based method into the numerical input for the first ML screening of cross-neutralizing antibodies (potential candidates). To precisely predict the potential antibodies, we used the selected XGB model with a confidence score of 0.9900 for screening all antibody variants, and this ML screener filtered approximately 2.5 million variants to obtain 28 potential candidates. All 28 ML-screened candidates contain triple-point mutations, and the major residues are S100, Y102, S103, A104, and D105 (Supplemental Table 10). The minor mutation residues of these candidates are W106, E108, D109, and Y110, whereas there is no mutation at T97, T98, L99, G101, or P107, as shown in Supplemental Table 10.

To empower the screening framework and increase antibody developability, we screened these 28 ML-screened candidates with an MD simulation tool to test the stability of each antibody molecule. We designed a simple MD by performing homology modelling of these candidates, introducing the mutated CDR-H3 sequences into the 1B3B9 antibody template, and sent it to the SWISS-MODEL server. We determined the quality of each modelled structure with Ramachandran favoured and QMEAN scores before performing MD simulations. All constructed 3D structures showed QMEAN scores less than 1, reflecting the native-like structure (Supplemental Table 10). Then, we used MD to screen a stable conformation of each 3D structure and showed MD-screened candidates in a scatter plot with the average RMSD (a standard measure of structural distance between coordinates or structure changes; Y axis) and average end-to-end distance (a distance between the first and the last carbon atom in a protein or structure length; X axis) values, which are the representative parameters for molecular stability in Fig. 5A. From this MD plot, 1B3B9 (green circle), 1B3B9_V2 (red star), and 1B3B9_V21 (yellow star) had average values (X, Y) of (39.22 ± 4.33, 2.94 ± 0.32), (38.08 ± 4.88, 2.89 ± 0.46), and (38.77 ± 3.44, 2.65 ± 0.36), respectively (Fig. 5A,B). The RMSD plot of all structures is provided in Supplemental Fig. 1. We accordingly concluded that 1B3B9_V2 and 1B3B9_V21 have lower RMSD and end-to-end distance values than the 1B3B9 template and other candidates, which implies that the two candidates have more stable structural conformations and higher developability as synthetic antibodies. In this finding, we proposed the best 1B3B9_V21 antibody, which has the lowest average RMSD, representing a more stable configuration, for further characterization as a potent and stable candidate targeting the four envelope proteins of DENV-1 to DENV-4.

In silico characterization of the best 1B3B9_V21 NAb candidate

We investigated the ability of the best predicted antibody candidate (1B3B9_V21) compared to the real-world antibody template (1B3B9) in terms of binding interactions and binding energies (affinities and stabilities) using molecular docking and MD simulations, respectively. We used complexed four envelope proteins of DENV-1 to DENV-4 (PDB: 4CCT, 5A1Z, 3J6T, 4CBF) with 1B3B9 and 1B3B9_V21 in both docking and MD experiments. In a molecular docking study at a resolution of 4.5 Å, 1B3B9_V21 straddled FL epitopes from the top of EDII with interactions involving both the heavy chain and light chains (Fig. 6A). The binding motifs of 1B3B9_V21 were located at the fusion loop (W101, N103, G104, C105, G106, L107 and F108 residues) on the E protein of DENV-2 (Fig. 6A) with 447 atom-to-atom contacts (Fig. 6B). Structural analysis showed that 1B3B9 and 1B3B9_V21 bound to different regions of FL epitopes within R99—F108 (Supplemental Tables 11 and 12). 1B3B9_V21 revealed higher numbers of atom-to-atom contacts with DENV-2 E, DENV-3 E, and DENV-4 E proteins than 1B3B9, except DENV-1 E protein (Supplemental Tables 11 and 12). We therefore sought to determine whether 1B3B9_V21 could bind to DENV-1 to DENV-4 with strong affinity. Next, we tested the binding affinities and stabilities of 1B3B9 and 1B3B9_V21 with DENV-1 to DENV-4 by using MD simulations. We found that 1B3B9_V21 in complex with DENV-1, DENV-2 and DENV-3 showed lower ΔG_bind energies with lower average RMSD values (except for the DENV-3 system) than 1B3B9, whereas the ΔG_bind of 1B3B9_V21 in complex with DENV-4 was similar to that of 1B3B9 (Fig. 6C and Supplemental Fig. 2). These results were in good agreement with H-bond and contact analyses showing that 1B3B9_V21 in complex with DENV-1, DENV-2 and DENV-3 exhibited higher H-bond formations and number of contact atoms compared to 1B3B9_V21 in complex with DENV-4 (Supplemental Figs. 3 and 4). The RMSD plots of all antibody–antigen complexes are available in Supplemental Fig. 2. All data from in silico characterization indicated that 1B3B9_V21, which is the best NAb candidate from our computational framework, exhibits cross-neutralizing and stable binding with DENV-1 to DENV-4. Moreover, 1B3B9_V21 also provided binding interactions for these cross-reactive FL epitopes to other flaviviruses such as Zika virus and Japanese encephalitis virus (Supplemental Fig. 5). Binding site was also occupied between FL and 1B3B9_V21 heavy chain especially CDR-H3 mutated amino acid sequence (Supplemental Table 13). We suggest that our computational framework has potential for use in the efficient design and screening of potent and stable dengue therapeutic antibodies.

Discussion

Antibody treatment is a promising strategy to combat severe dengue. An ideal antibody therapeutic should sufficiently neutralize all four DENV serotypes to reduce ADE effects, which is the major problem for dengue treatment^9,10. A simple, rapid, and efficient antibody design and screening method, especially an in silico method, is the first crucial step to accelerate antibody discovery and the development of dengue therapeutic antibodies.

In this study, we propose a well-characterized dataset and in silico approach to predict potent and stable neutralizing antibodies against DENV-1 to DENV-4 to decrease ADE effects by screening for antibodies with strong and stable binding, thus providing an outstanding 1B3B9_V21 antibody candidate for further antibody engineering and optimization. We newly generated a CDR-H3-epitope-IC50 dataset from well-defined and real experimental results of 100 publications from 1992 to 2022. IC50 values obtained from different laboratory methods are unlikely to significantly impact algorithms, as each assay can accurately reflect the actual activities being measured²⁸. We found that our dataset has a balance of neutralizing (positive) and nonneutralizing (negative) categories with highly diverse properties of CDR-H3 sequences and lower variation among epitope sequences, similar to the results of a previous study²³. Our computational framework is designed to combine sequence-based ML screening and simple MD screening to identify the most potent and stable candidates in the most practical workflow. The combination of the XGB model and fingerprint-based method achieved excellent ML performance (AUC = 0.885 by independent test) in the prediction of cross-neutralizing antibodies against four DENV serotypes. We chose the AUC metric to evaluate the model's performance and selection for classifying neutralizing and non-neutralizing antibodies because of its comprehensive evaluation across all potential classification thresholds and various operational scenarios, as demonstrated in previous study²³. Interestingly, the ML screener was used to screen approximately 2.5 million unseen antibodies to obtain 28 NAb candidates. The MD screener was used to screen 28 potential antibodies to find 2 stable NAb candidates, which may indicate the developability of these screened candidates as synthetic antibodies. Compared to existing approaches^17,18,23, our framework uses a sequence-based ML model (no need for antibody and antigen structural data) and a simple MD protocol (requiring only an antibody structure with 15 ns MD production) that is easily applied in real-world experiments. In addition, our ML-based framework might be usable for antibody screening against other flaviviruses according to the cross-reactive epitopes that we used in this study, which are conserved among DENV, Zika, Japanese encephalitis, yellow fever, West Nile, and tick-borne encephalitis viruses^29,30. Our approach has five key features. (1) It has two successive steps: general ML and simple MD methods. (2) It reduces the limitations of needing the full antibody sequence (instead requiring only the CDR-H3 sequence) and full antigen sequence. (3) It decreases time-consuming tasks by running all processes within 2 weeks. (4) It enables cost-effective screening to reduce laboratory work. (5) It is applicable to screening other flaviviruses because of the use of cross-reactive epitopes as shown in Supplemental Table 13.

Regarding the details of ML analysis, we considered the fingerprint-based method to be suitable for the prediction of antibody–antigen interactions because it can extract information from biophysical and biochemical properties without relying on the sequence or 3D structure. This approach facilitates the identification of unknown therapeutic candidates on a large scale for subsequent experimental validation³¹. We identified that the important features of 28 NAb candidates belong to the substructure count fingerprint (FP4C), which lists the top 30 fingerprints alongside their respective descriptions in Supplemental Table 9. Three of the top 30 fingerprints belong to the general class of alcohols (FP4C14: secondary alcohol, FP4C13: primary alcohol, and FP4C12: alcohol), which are found in the side chains of certain amino acids such as serine and threonine. These hydroxyl groups can act as both hydrogen bond donors and acceptors, influencing hydrogen bonding networks, binding affinity, and the neutralizing activity of antibodies³². Two of the 30 most important features were FP4C274 (aromatic ring) and FP4C184 (heteroaromatic ring), which are present in the side chains of certain amino acids like phenylalanine, tryptophan, and histidine. Aromaticity plays a significant role in cooperative interactions involving hydrophobicity, charge, and hydrogen bonding properties, making it well-suited for creating binding sites for epitopes, irrespective of the presence of polar and nonpolar surface residues³³. For the FP4C307 (chiral center) fingerprint, it is recognized as a pivotal feature that significantly improves the predictive performance of machine learning models in drug discovery tasks, particularly in predicting KRAS G12C inhibitors and other biological activities³⁴. Regarding FP4C84 (carboxylic acid) and FP4C88 (carboxylic acid derivative), which are found in the side chains of amino acids like aspartic acid and glutamic acid, they participate in hydrogen bonding, ionic interactions, and salt bridges critical for protein structure and protein–protein interactions³⁵. FP4C302 (rotatable bond), with an average F-score of 40, represents single bonds that allow free rotation, contributing to molecular flexibility. This structural characteristic is crucial for determining overall molecular flexibility. Interestingly, all compounds in the dataset possessed rotatable bonds, particularly abundant in active compounds, suggesting their importance for biological activity³⁶. FP4C295 (C–O–N–S bond) refers to the presence of a carbon–oxygen–sulfur bond in molecular structures. This feature can influence protein–protein binding by participating in hydrogen bonding interactions with amino acid residues on the protein surface, enhancing binding affinity and specificity³⁷. Additionally, other significant FP4Cs include miscellaneous descriptors such as FP4C1-3 and FP4C300. FP4C1 (primary carbon), FP4C2 (secondary carbon), and FP4C3 (tertiary carbon) carbons indicate that carbon atoms with two or three neighbouring carbons in drug molecules may enhance metabolic stability by limiting access to metabolic pathways³⁸. Based on feature importance (Supplemental Table 9) and amino acid analysis (Fig. 4), we suggest that the essential characteristics of our identified neutralizing antibodies encompass alcohols (present in amino acids S, T, and Y) and aromaticity (linked to amino acids F, Y, and W). These features are essential for the binding mechanism and maintaining the correct secondary structure at the antibody–antigen binding sites³⁹.

Furthermore, we observed that the mutated residues of outstanding candidates are mostly located in the central area of the CDR-H3 region (S100, Y102, S103, A104, D105, W106, E108, D109, and Y110), which are crucial sites for antibody engineering and improvement, as supported by previous studies^40,41. After characterization by in silico methods, 1B3B9_V21 revealed binding motifs at the most cross-reactive fusion loop region on the EDII protein of DENV-1 to DENV-4 (W101, G102, G104, G106, and L107 residues). These binding sites have been reported in many studies of cross-neutralizing antibodies against all four DENV^6,29,42. In molecular dynamics, 1B3B9_V21 showed high binding affinities (less than -22.83 ± 4.89 kcal/mol) and stabilities (less than 11.83 ± 1.960 Å) with DENV-1 to DENV-4. Most of the binding energies of 1B3B9_V21 are lower than those of the real 1B3B9 antibody. Therefore, we suggest 1B3B9_V21 as a potent and stable NAb candidate for development as a therapeutic agent. To accomplish the application of the NAb candidates, in vitro tests of the neutralizing and ADE activities of these predicted antibodies are still required.

In conclusion, we have presented the first ML-based framework that employed a sequence-based ML method and a simple MD method for the accurate and rapid identification of NAbs against DENV-1 to DENV-4. We also provided an updated dengue antibody dataset including unique information on CDR-H3 sequences, epitope sequences, and IC50 values. We used three different feature-encoding methods and ten ML algorithms to compare and exhibit the best performing model. Our ML model can be used for large-scale identification of NAbs. The MD method can be used to select stable NAbs for more accurate identification. Our outstanding 1B3B9_V21 candidate showed high potential for development as a therapeutic antibody, and this NAb candidate is warranted for further in vitro analysis. Our proposed computational framework might support novel opportunities to discover, design, and engineer therapeutic antibodies against four dengue viruses and other flaviviruses.

Methods

Computational framework design

We built a dataset, model, and computational framework for screening cross-neutralizing antibodies against DENV. The computational framework contains four steps (Fig. 1): (1) dataset preparation of CDR-H3-epitope-IC50 pairing data, (2) feature extraction and ML analysis for the best performing model, (3) generation of CDR-H3 antibody variants and ML screening for potential antibody candidates, and (4) MD screening for stable antibody candidates.

Dataset preparation

The dataset of anti-DENV CDR-H3 sequences was prepared from published data that satisfied three criteria: (1) including complete CDR-H3 amino acid sequences specific to DENV and published in PubMed or Google patent databases, (2) including epitopes on the E protein in EDII (fusion loop), EDI-EDII (interdomain), EDIII, or EDE, and (3) including neutralizing activities against DENV in µg/ml, as characterized by a focus reduction neutralization test (FRNT) or plaque reduction neutralization test (PRNT) or enzyme-linked immunosorbent assay (ELISA). The CDR-H3 sequences were annotated using the IMGT numbering scheme through the Antibody Region-Specific Alignment (AbRSA) web service. The neutralization (NT) activities derived from FRNT and PRNT represent actual experimental NT values, while antibodies that do not bind in the ELISA assay were categorized as non-neutralizing because they fail to bind to the antigen. All data (Supplemental Table 1) were carefully preprocessed to handle missing data and remove redundant data. The compilation of interactions between CDR-H3 and epitope sequences followed the principles outlined by Magar et al.²³. Our dataset was derived from experimental results that included the sequences of CDR-H3 and epitopes, along with corresponding in vitro IC50 values against any of the four serotypes of dengue virus. In this dataset, we collected CDR-H3 sequences along with their epitopes and neutralizing activities (IC50; µg/ml) and further divided them into a positive (neutralizing) dataset (IC50 ≤ 10 µg/ml) and a negative (nonneutralizing) dataset (IC50 > 10 µg/ml). Each positive and negative dataset was randomly divided into a training dataset (80%) and an independent dataset (20%) (Table 1).

Feature extraction and machine learning analysis

To evaluate the contribution of variant ML methods at the different levels of each antibody–antigen interaction, we performed a comparative analysis on different feature extraction schemes and ML methods. In the case of the feature extraction schemes, we applied three types of feature descriptors addressing multiple aspects, including 12 sequence-based, 7 atom-based, and 12 fingerprint-based feature descriptors (Supplemental Tables 2–4) following the previous studies^43,44,45. The CDR-H3 and epitope sequences were concatenated and encoded into a single embedding of an antibody–antigen complex using our in-house Python-based code. Data normalization was performed to transform the input features into the same scale using Min–Max Scaler. Afterwards, the scaled training set was employed to train ten different ML algorithms: extreme gradient boosting (XGB), support vector machine (SVM), random forest (RF), partial least squares regression (PLS), multilayer perceptron (MLP), logistic regression (LR), linear support vector classification (LN), extra tree (ET), decision tree (DT), and k-nearest neighbours (KNN) classifiers. The algorithm selection was guided by considerations of accuracy, computational resources, and interpretability to derive the optimal predictive model for the dataset. Simpler models such as PLS, LR, LN, and DT provide easier interpretability, whereas more complex models like XGB, SVM, RF, MLP, ET, and KNN offer greater effectiveness in capturing intricate patterns within high-dimensional data spaces and deliver superior predictive capabilities²⁴. All models are designed specifically for binary classification. Hyperparameter searching was performed as described in Supplemental Table 5. Herein, we evaluated and compared the performance of the developed ML methods based on tenfold cross-validation (CV) and independent tests. We demonstrated ML performances based on five metrics: accuracy (ACC), sensitivity (Sn), specificity (Sp), Matthew’s correlation coefficient (MCC), and area under the ROC curve (AUC), as previously described^43,46. The ML algorithm with the greatest AUC for predicting NAbs was selected to screen a panel of unseen antibodies in the ML screening step. Model comparison, analysis and evaluation were performed using Python script 3.8 (all codes; available at GitHub).

In silico antibody library

An in silico antibody library of 2,529,794 variants was generated by making single, double, and triple mutations on all amino acids in the CDR-H3 region of the 1B3B9 neutralizing human monoclonal antibody (antibody template)⁶ using Python script 3.8. All antibody variants were screened and ranked by the best ML algorithm and simple MD simulation in further steps to discover new potent and stable NAb candidates.

ML and MD screening methods

In the first ML screening step, all antibody variants were filtered by the best performing ML algorithm with a confidence score cut-off of 0.990 to screen for potent neutralizing antibody candidates against four serotypes of DENV. Then, we constructed the 3D structure of the 1B3B9 Fab antibody template and ML-screened antibody variants using the SWISS-MODEL server⁴⁷ (PDB 3t2n.1). Homology assessments and structural comparisons were performed using the SWISS-ExPASy server in terms of Ramachandran plots and QMEAN scores⁴⁸ to validate the model quality of all 3D structures before screening by MD simulation.

In the second MD screening step, to determine the Fab structural stability as previously described (18), we performed MD simulations of all Fab antibody variants in a solvated environment using Amber 16. Each system was simulated under the periodic boundary condition with isothermal-isobaric (NPT) ensemble. The temperature and pressure were controlled by using Langevin thermostat⁴⁹ with collision frequency of 2.0 and Berendsen barostat⁵⁰. Topologies of each Fab were generated according to the ff14SB forcefield⁵¹. The Fab molecule was cantered in a cubic simulation box, extending 1 nm from the molecule surface. Then, the system box was solved by TIP3B model water atoms⁵², and a net positive charge was neutralized with chloride ions. Energy minimization was carried out using 1500 steps of steepest descent (SD) followed by 1500 steps of conjugated gradient (CG) methods with constrained solvent molecules. Then, the whole system was fully minimized using the same procedure. The minimized system was subjected to two rounds of equilibration at 310 K and 1 atm. First, the molecular system was equilibrated in the NVT ensemble for 100 picoseconds and a 2-femtosecond time step. Second, the equilibration was applied in a round of NPT simulation for 100 picoseconds to ensures that the simulated system is at physiological temperature and pressure. Then, the system was carried out in NPT and no constraints for 500 picoseconds. Finally, a 15 ns unrestrained NPT simulation at 310 K and 1 atm was carried out under identical simulation parameters. We collected the MD results and demonstrated the stabilities of all variants in the scatter plot of average end-to-end distance values at the X axis and averaged the root-mean-square displacement (RMSD) values at the Y axis to select the outstanding NAb candidates.

Characterization by molecular docking and MD simulation

The antibody template (1B3B9) and the best screened NAb candidate were further characterized for their binding sites, binding affinities, and binding stabilities with DENV-1 to DENV-4 (utilizing PDB IDs: 4CCT, 5A1Z, 3J6T, 4CBF, which correspond to the strains used in previous in vitro studies)⁶ using molecular docking and MD simulation. First, antibody–antigen docking was performed by using the ZDOCK server⁵³ using the default parameters and the target residues. The best posture of each docked complex was selected according to the lowest-energy ZRANK score. Then, MD simulations of each antibody–antigen complex were executed at 310 K and 1 atm for 100 ns using Amber 16. The MD procedure was the same as in the MD screening method above. The MD trajectories were saved every 10 ps.

The binding affinity of the antibody–antigen complex was calculated as binding free energy (ΔG_bind; kcal/mol) based on the MM/PB(GB)SA approach from the last 20 ns of the MD production using the CPPTRAJ⁵⁴ and MMPBSA.py⁵⁵ modules of AMBER16. The binding stability of each complex was evaluated in terms of RMSD according to the number of antigen–antibody hydrogen bonds and the number of atomic contacts using the CPPTRAJ modules of AMBER16. The H-bond interactions were calculated using two criteria: (1) distance between the hydrogen donor (HD) and hydrogen acceptor (HA) of ≤ 3.5 Å and (2) HD–H···HA angle of ≥ 150°. The number of atom contacts was counted as the number of atoms within 4.5 Å of each complex.

Data availability

All data generated or analyzed during this study are included in this published article and its supplementary information files. The dataset, feature descriptions, ML performances, RMSD plots of MD simulations, data of ML-screened antibodies, and numbers of atom-to-atom contacts are provided in the supplementary information. All codes in this study are available online at https://github.com/PiyatidaNatsrita/In-silico-dengue-virus-antibody-prediction-2022.git.

References

Bhatt, S. et al. The global distribution and burden of dengue. Nature. 496, 504–507. https://doi.org/10.1038/nature12060 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Khetarpal, N. & Khanna, I. Dengue fever: Causes, complications, and vaccine strategies. J. Immunol. Res. 2016, 6803098. https://doi.org/10.1155/2016/6803098 (2016).
Article CAS PubMed PubMed Central Google Scholar
Thomas, S. J. Is new dengue vaccine efficacy data a relief or cause for concern?. NPJ Vaccines. 8, 55. https://doi.org/10.1038/s41541-023-00658-2 (2023).
Article PubMed PubMed Central Google Scholar
Beltramello, M. et al. The human immune response to dengue virus is dominated by highly cross-reactive antibodies endowed with neutralizing and enhancing activity. Cell Host Microbe. 8, 271–283. https://doi.org/10.1016/j.chom.2010.08.007 (2010).
Article CAS PubMed Google Scholar
Dejnirattisai, W. et al. A new class of highly potent, broadly neutralizing antibodies isolated from viremic patients infected with dengue virus. Nat. Immunol. 16, 170–177. https://doi.org/10.1038/ni.3058 (2015).
Article CAS PubMed Google Scholar
Pitaksajjakul, P. et al. Antibody germline characterization of cross-neutralizing human IgGs against 4 serotypes of dengue virus. Biochem. Biophys. Res. Commun. 446, 475–480. https://doi.org/10.1016/j.bbrc.2014.02.131 (2014).
Article CAS PubMed PubMed Central Google Scholar
Xu, M. et al. Protective capacity of the human anamnestic antibody response during acute dengue virus infection. J. Virol. 90, 11122–11131. https://doi.org/10.1128/JVI.01096-16 (2016).
Article CAS PubMed PubMed Central Google Scholar
Stiasny, K., Kiermayr, S., Holzmann, H. & Heinz, F. X. Cryptic properties of a cluster of dominant flavivirus cross-reactive antigenic sites. J. Virol. 80, 9557–9568. https://doi.org/10.1128/JVI.00080-06 (2006).
Article CAS PubMed PubMed Central Google Scholar
Dussupt, V., Modjarrad, K. & Krebs, S. J. Landscape of monoclonal antibodies targeting zika and dengue: Therapeutic solutions and critical insights for vaccine development. Front. Immunol. 11, 621043. https://doi.org/10.3389/fimmu.2020.621043 (2021).
Article CAS PubMed PubMed Central Google Scholar
Fibriansah, G. & Lok, S. M. The development of therapeutic antibodies against dengue virus. Antiviral Res. 128, 7–19. https://doi.org/10.1016/j.antiviral.2016.01.002 (2016).
Article CAS PubMed Google Scholar
Dejnirattisai, W. et al. SARS-CoV-2 Omicron-B.1.1.529 leads to widespread escape from neutralizing antibody responses. Cell. 185, 467–484. https://doi.org/10.1016/j.cell.2021.12.046 (2022).
Article CAS PubMed PubMed Central Google Scholar
VanBlargan, L. A. et al. An infectious SARS-CoV-2 B.1.1.529 Omicron virus escapes neutralization by therapeutic monoclonal antibodies. Nat. Med. 28, 490–495. https://doi.org/10.1038/s41591-021-01678-y (2022).
Article CAS PubMed PubMed Central Google Scholar
Injampa, S. et al. Generation and characterization of cross neutralizing human monoclonal antibody against 4 serotypes of dengue virus without enhancing activity. PeerJ. 5, e4021. https://doi.org/10.7717/peerj.4021 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chan, K. R., Ong, E. Z. & Ooi, E. E. Therapeutic antibodies as a treatment option for dengue fever. Expert Rev. Anti Infect. Ther. 11, 1147–1157. https://doi.org/10.1586/14787210.2013.839941 (2013).
Article CAS PubMed Google Scholar
Sormanni, P., Aprile, F. A. & Vendruscolo, M. Third generation antibody discovery methods: In silico rational design. Chem. Soc. Rev. 47, 9137–9157. https://doi.org/10.1039/c8cs00523k (2018).
Article CAS PubMed Google Scholar
Akbar, R. et al. In silico proof of principle of machine learning-based antibody design at unconstrained scale. MAbs. 14, 2031482. https://doi.org/10.1080/19420862.2022.2031482 (2022).
Article PubMed PubMed Central Google Scholar
Ruffolo, J. A., Sulam, J. & Gray, J. J. Antibody structure prediction using interpretable deep learning. Patterns (NY). 3, 100406. https://doi.org/10.1016/j.patter.2021.100406 (2022).
Article CAS Google Scholar
Shan, S. et al. Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization. Proc. Natl. Acad. Sci. 119, e2122954119. https://doi.org/10.1073/pnas.2122954119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Liu, G. et al. Antibody complementarity determining region design using high-capacity machine learning. Bioinformatics. 36, 2126–2133. https://doi.org/10.1093/bioinformatics/btz895 (2020).
Article CAS PubMed Google Scholar
Li, X., Van Deventer, J. A. & Hassoun, S. ASAP-SML: An antibody sequence analysis pipeline using statistical testing and machine learning. PLOS Comput. Biol. 16, e1007779. https://doi.org/10.1371/journal.pcbi.1007779 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rawi, R. et al. Accurate prediction for antibody resistance of clinical HIV-1 isolates. Sci. Rep. 9, 1–12. https://doi.org/10.1038/s41598-019-50635-w (2019).
Article CAS Google Scholar
Yu, W. H. et al. Predicting the broadly neutralizing antibody susceptibility of the HIV reservoir. JCI Insight. 4, e130153. https://doi.org/10.1172/jci.insight.130153 (2019).
Article PubMed PubMed Central Google Scholar
Magar, R., Yadav, P. & Barati Farimani, A. Potential neutralizing antibodies discovered for novel corona virus using machine learning. Sci. Rep. 11, 5261. https://doi.org/10.1038/s41598-021-84637-4 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Horst, A. et al. Machine learning detects anti-DENV signatures in antibody repertoire sequences. Front. Artif. Intell. 4, 715462. https://doi.org/10.3389/frai.2021.715462 (2021).
Article PubMed PubMed Central Google Scholar
Natali, E. et al. The dengue-specific immune response and antibody identification with machine learning. NPJ Vaccines. 9, 16. https://doi.org/10.1038/s41541-023-00788-7 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wong, Y. H. et al. Molecular basis for dengue virus broad cross-neutralization by humanized monoclonal antibody 513. Sci. Rep. 8, 8449. https://doi.org/10.1038/s41598-018-26800-y (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Rathore, A. S., Sarker, A. & Gupta, R. D. Designing antibody against highly conserved region of dengue envelope protein by in silico screening of scFv mutant library. PLoS One. 14, e0209576. https://doi.org/10.1371/journal.pone.0209576 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chaudhury, S. et al. Dengue virus antibody database: Systematically linking serotype-specificity with epitope mapping in dengue virus. PLoS Negl. Trop. Dis. 11, e0005395. https://doi.org/10.1371/journal.pntd.0005395 (2017).
Article PubMed PubMed Central Google Scholar
Deng, Y. Q. et al. A broadly flavivirus cross-neutralizing monoclonal antibody that recognizes a novel epitope within the fusion loop of e protein. PLoS One. 6, e16059. https://doi.org/10.1371/journal.pone.0016059 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
França, R.K.A. de O, Silva, J.M., Rodrigues, L.S., Sokolowskei, D., Brigido, M.M., Maranhão, A.Q. New anti-flavivirus fusion loop human antibodies with zika virus-neutralizing potential. Int. J. Mol. Sci. 23, 7805. https://doi.org/10.3390/ijms23147805 (2022).
Schaduangrat, N. et al. StackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy. Sci. Rep. 12, 16435. https://doi.org/10.1038/s41598-022-20143-5 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, D., Ge, Y., Zhong, B. & Liu, D. Specific epitopes form extensive hydrogen-bonding networks to ensure efficient antibody binding of SARS-CoV-2: Implications for advanced antibody design. Comput. Struct. Biotechnol. J. 19, 1661–1671. https://doi.org/10.1016/j.csbj.2021.03.021 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hofstädter, K., Stuart, F., Jiang, L., Vrijbloed, J. W. & Robinson, J. A. On the importance of being aromatic at an antibody-protein antigen interface: Mutagenesis of the extracellular interferon γ receptor and recognition by the neutralizing antibody A6. J. Mol. Biol. 285, 805–815. https://doi.org/10.1006/jmbi.1998.2343 (1999).
Article PubMed Google Scholar
Srisongkram, T., Khamtang, P. & Weerapreeyakul, N. Prediction of KRASG12C inhibitors using conjoint fingerprint and machine learning-based QSAR models. J. Mol. Graph Model. 122, 108466. https://doi.org/10.1016/j.jmgm.2023.108466 (2023).
Article CAS PubMed Google Scholar
Chan, A. W., Laskowski, R. A. & Selwood, D. L. Chemical fragments that hydrogen bond to Asp, Glu, Arg, and His side chains in protein binding sites. J. Med. Chem. 53, 3086–3094. https://doi.org/10.1021/jm901696w (2010).
Article CAS PubMed Google Scholar
Wicker, J. G. & Cooper, R. I. Beyond rotatable bond counts: Capturing 3D conformational flexibility in a single descriptor. J. Chem. Inf. Model. 56, 2347–2352. https://doi.org/10.1021/acs.jcim.6b00565 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jasper, J. B., Jasper, J. B., Humbeck, L., Brinkjost, T. & Koch, O. A novel interaction fingerprint derived from per atom score contributions: Exhaustive evaluation of interaction fingerprint performance in docking based virtual screening. J. Cheminform. 10, 15. https://doi.org/10.1186/s13321-018-0264-0 (2018).
Article CAS PubMed PubMed Central Google Scholar
Uetrecht, J.P., & Trager, W. Drug Metabolism: Chemical and Enzymatic Aspects (1st ed.). CRC Press. https://doi.org/10.1201/b14488 (2007).
Goulet, D. R. & Atkins, W. M. Considerations for the design of antibody-based therapeutics. J. Pharm. Sci. 109, 74–103. https://doi.org/10.1016/j.xphs.2019.05.031 (2020).
Article CAS PubMed Google Scholar
Parameswaran, P. et al. Convergent antibody signatures in human dengue. Cell Host Microbe. 13, 691–700. https://doi.org/10.1016/j.chom.2013.05.008 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bürckert, J. P. et al. Functionally convergent B cell receptor sequences in transgenic rats expressing a human B cell repertoire in response to tetanus toxoid and measles antigens. Front. Immunol. 8, 1834. https://doi.org/10.3389/fimmu.2017.01834 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pipattanaboon, C. et al. Cross-reactivity of human monoclonal antibodies generated with peripheral blood lymphocytes from dengue patients with Japanese encephalitis virus. Biologics. 7, 175–187. https://doi.org/10.2147/BTT.S47438 (2013).
Article CAS PubMed PubMed Central Google Scholar
Charoenkwan, P., Nantasenamat, C., Hasan, M. M. & Shoombuatong, W. iTTCA-Hybrid: Improved and robust identification of tumor T cell antigens by utilizing hybrid feature representation. Anal. Biochem. 599, 113747. https://doi.org/10.1016/j.ab.2020.113747 (2020).
Article CAS PubMed Google Scholar
Duvenaud, D.K., et al. Convolutional networks on graphs for learning molecular fingerprints. Adv. Neural Inform. Process. Syst. https://doi.org/10.48550/arXiv.1509.09292 (2015).
Kim, S., Bolton, E. E. & Bryant, S. H. PubChem3D: Conformer ensemble accuracy. J. Cheminform. 5, 1. https://doi.org/10.1186/1758-2946-5-1 (2013).
Article CAS PubMed PubMed Central Google Scholar
Charoenkwan, P., Nantasenamat, C., Hasan, M. M. & Shoombuatong, W. Meta-iPVP: A sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation. J. Comput. Aided Mol. Des. 34, 1105–1116. https://doi.org/10.1007/s10822-020-00323-z (2020).
Article ADS CAS PubMed Google Scholar
Waterhouse, A. et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296-303. https://doi.org/10.1093/nar/gky427 (2018).
Article CAS PubMed PubMed Central Google Scholar
Benkert, P., Biasini, M. & Schwede, T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics. 27, 343–350. https://doi.org/10.1093/bioinformatics/btq662 (2011).
Article CAS PubMed Google Scholar
Uberuaga, B. P., Anghel, M. & Voter, A. F. Synchronization of trajectories in canonical molecular-dynamics simulations: Observation, explanation, and exploitation. J. Chem. Phys. 120, 6363–6374. https://doi.org/10.1063/1.1667473 (2004).
Article ADS CAS PubMed Google Scholar
Berendsen, H. J. C., Postma, J. P. M., van Gunsteren, W. F., DiNola, A. & Haak, J. R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81, 3684–3690. https://doi.org/10.1063/1.448118 (1984).
Article ADS CAS Google Scholar
Klaewkla, M., Charoenwongpaiboon, T. & Mahalapbutr, P. Molecular basis of the new COVID-19 target neuropilin-1 in complex with SARS-CoV-2 S1 C-end rule peptide and small-molecule antagonists. J. Mol. Liq. 335, 116537. https://doi.org/10.1016/j.molliq.2021.116537 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935. https://doi.org/10.1063/1.445869 (1983).
Article ADS CAS Google Scholar
Chen, R., Li, L. & Weng, Z. ZDOCK: An initial-stage protein-docking algorithm. Proteins. 52, 80–87. https://doi.org/10.1002/prot.10389 (2003).
Article CAS PubMed Google Scholar
Roe, D. R. & Cheatham, T. E. PTRAJ and CPPTRAJ: Software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9, 3084–3095. https://doi.org/10.1021/ct400341p (2013).
Article CAS PubMed Google Scholar
Miller, B.R., McGee, T.D.J., Swails, J.M., Homeyer, N., Gohlke, H., Roitberg, A.E. MMPBSA.py: An Efficient Program for End-State Free Energy Calculations. J Chem Theory Comput. 8, 3314–21. https://doi.org/10.1021/ct300418h (2012)

Download references

Acknowledgements

This research project was financially supported by the Young Researcher Development Project of Khon Kaen University.

Author information

Authors and Affiliations

Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
Piyatida Natsrita, Kiatichai Faksri, Sorujsiri Chareonsudjai & Chonlatip Pipattanaboon
Research and Diagnostic Center for Emerging Infectious Diseases, Khon Kaen University, Khon Kaen, 40002, Thailand
Kiatichai Faksri
Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, 50200, Thailand
Phasit Charoenkwan
Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand
Watshara Shoombuatong
Department of Biochemistry, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
Panupong Mahalapbutr
Center of Excellent in Biocatalyst and Sustainable Biotechnology, Department of Biochemistry, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
Thanyada Rungrotmongkol

Authors

Piyatida Natsrita
View author publications
Search author on:PubMed Google Scholar
Phasit Charoenkwan
View author publications
Search author on:PubMed Google Scholar
Watshara Shoombuatong
View author publications
Search author on:PubMed Google Scholar
Panupong Mahalapbutr
View author publications
Search author on:PubMed Google Scholar
Kiatichai Faksri
View author publications
Search author on:PubMed Google Scholar
Sorujsiri Chareonsudjai
View author publications
Search author on:PubMed Google Scholar
Thanyada Rungrotmongkol
View author publications
Search author on:PubMed Google Scholar
Chonlatip Pipattanaboon
View author publications
Search author on:PubMed Google Scholar

Contributions

PN prepared the dataset, performed all in silico experiments, and wrote the manuscript. CP designed the study, supervised dataset preparation and data analysis, and wrote and edited the manuscript. PC and WS supervised the ML analysis and edited the manuscript. PM and TR supervised MD analysis and edited the manuscript. SC and KF assisted with result interpretation and edited the manuscript. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Chonlatip Pipattanaboon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Natsrita, P., Charoenkwan, P., Shoombuatong, W. et al. Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes. Sci Rep 14, 17165 (2024). https://doi.org/10.1038/s41598-024-67487-8

Download citation

Received: 15 December 2023
Accepted: 11 July 2024
Published: 26 July 2024
Version of record: 26 July 2024
DOI: https://doi.org/10.1038/s41598-024-67487-8