Figure 2

Overview of the NAT gene dataset compiled for this study. The top panel shows the main taxonomic groups represented in the dataset for prokaryotic (left) and eukaryotic (right) species. The bottom panel shows the distribution of annotated NAT sequences in prokaryotes (left) and eukaryotes (right). Archaea (Halobacteria) and protists are indicated in blue font. The compiled NAT gene dataset also included 467 sequences annotated previously [32,33].