Fig. 4: Clustering of top 200 candidate antigens based on proximity measured from tree-based model.

First two dimensions of UMAP are shown. Top 200 candidate antigens from the final tree-filtered model were grouped based on k-means clustering. Points represent top 200 candidate antigens in three groups, 48 known antigens (light cyan), and four reference antigens (yellow; protein names noted by text).