Figure 2

Visualization of the input dataset before feature extraction and ML analysis. (A) Bar plot of the numbers of neutralizing (n = 554) and nonneutralizing (n = 554) interactions by target epitope domain. EDI, envelope domain I; EDII, envelope domain II; EDIII, envelope domain III; EDI-EDII, interdomain; EDE, envelope dimer epitope domain. (B) t-SNE of CDR-H3 antibody sequence diversity by target domain (EDI, EDII, EDIII, interdomain, and EDE; n = 306; perplexity = 30, learning rate = 100). (C) t-SNE of epitope sequence diversity by target domain (EDI, EDII, EDIII, interdomain, and EDE; n = 609; perplexity = 30, learning rate = 100). (D) Scatter plot of the distribution of IC50 values for each antibody-antigen interaction (n = 1,108; cut-off value for the neutralizing class ≤ 10 μg/ml).
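
The class assignment implied by panel D can be sketched as follows. This is a minimal illustration, assuming each interaction carries a single IC50 value in μg/ml and that values at or below the 10 μg/ml cut-off are labelled neutralizing. The helper name `label_interaction` and the sample values are hypothetical and are not taken from the dataset.

```python
# Cut-off from the caption: IC50 <= 10 ug/ml is treated as neutralizing.
IC50_CUTOFF_UG_PER_ML = 10.0


def label_interaction(ic50_ug_per_ml: float) -> str:
    """Return the class label for one antibody-antigen interaction.

    Hypothetical helper for illustration only.
    """
    if ic50_ug_per_ml < 0:
        raise ValueError("IC50 must be non-negative")
    if ic50_ug_per_ml <= IC50_CUTOFF_UG_PER_ML:
        return "neutralizing"
    return "nonneutralizing"


# Illustrative IC50 values only, not taken from the dataset.
example_ic50 = [0.05, 10.0, 10.1, 250.0]
labels = [label_interaction(v) for v in example_ic50]
```

Note that a value exactly at the cut-off (10 μg/ml) falls in the neutralizing class under this reading of the "≤" in the caption.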