Extended Data Fig. 2: Phylogenetic and sequence analysis of AAV2 genomes.
From: Adeno-associated virus 2 infection in children with non-A–E hepatitis

a) Maximum likelihood phylogeny of AAV2 from hepatitis cases CVR1-9. The nine AAV2 genome sequences generated from the plasma samples via target enrichment (highlighted in green) were aligned with a range of the closest AAV GenBank sequences39. AAV2 reference sequences are denoted by accession number, country and year of sampling b), Phylogeny of HAdV41 genome from case 5. The HAdV41 genome sequence from the faecal sample of patient 5 (red) was combined with complete genomes of HAdV41 from GenBank. Bootstrap values >70 are indicated. HAdV41 reference sequences are denoted by accession number, country and year of sampling; c), Key mutations and hierarchical clustering of AAV2 genomes. Mutations in published AAV2 sequences are highlighted in (blue) and case sequences (green); d) Mutations over-represented in hepatitis cases versus controls. Mutations in VP1-3, Rep78 and 52 and AAP are highlighted by % representation in case sequences (green) and published sequences (blue).