Figure 3

Sequence variations of the second hypervariable region of G protein [G-HVR2] and motif clusters of the duplications in G-HVR2. (A) Phylogenetic trees based on RSV A G and RSV B G HVR2 sequences. Reference sequences KX858754.1 and KX858755.1 for RSV A and B, respectively are used to root the trees. Geographic metadata of the samples are depicted on the outer-rim of the trees. Nodes with green dots indicate G proteins with extra amino acid extensions. Nodes with red dots indicate G proteins with truncations due to pre-mature stop codons in the ORF. (B) Pairwise sequence alignment of the duplicated regions in the G HRV2 of RSV A and RSV B. The duplication regions (AA 261–283 and 285–307 for RSV A; AA 240–259 and 260–279 for RSV B are indicated as duplication #1 and duplication #2. Motifs used for the analysis are highlighted with a grey shade. Clustering was performed based upon the sequences in the duplicated regions. Blank spaces indicate identical amino acids with the chosen reference for RSV A and RSV B, respectively. Percentages of each representative cluster for RSV A and RSV B are indicated.