Fig. 4: Visualization of sequence groups (clades).

t-SNE clustering of protein embedding values (a, b) and attention weight matrices (c, d). Panels (a, c) show all sequences, colored by WHO label or Nextstrain clade label; panels (b, d) highlight Omicron subclades, colored by Nextstrain clade label.