Extended Data Fig. 1: Distribution of Enrich2 scores and protein specific classification of tolerance.

(a) Workflow of the experimental virology pipeline to rescue virus from plasmid libraries and the subsequent sequencing steps at different timepoints to measure variant frequency change in the population. The technologies used for sequencing and the analysis pipelines to detect different variants are detailed. (b–d) Histograms visualizing the distribution of variant scores were shown for (b) 8 AA insertion, (c) 1 AA deletion, and (d) AA substitution; inset histograms show the distribution for variants with a score higher than 0. Area plot showing the proportion of variants within Enrich2 score bins for 8 AA insertion (e), 1 AA deletion (f), and AA change (g) across different viral proteins. The width of the column represents the total number of variants for each protein. The two-sided chi-squared statistics for (e) df = 20, χ² = 140.43, (f) df = 20, χ² = 84.685, and (g) df = 20, chi-squared = 1793.8.