Figure 2

Circular tree for all BamHI-800 sequences using Maximum Likelihood and the Tamura 3-model of sequence evolution73. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+ G, parameter = 2.7852)). The tree with the highest log likelihood (-22,314.97) is shown. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The percentage of trees in which the associated taxa clustered together is shown next to the main branches (values lower than 75% have been omitted). This analysis involved 230 nucleotide sequences and 746 positions in the final dataset (all positions with less than 95% site coverage were eliminated, and ambiguous bases were allowed at any position (partial deletion option)). Species names abbreviations and colour codes at branch leafs as in Fig. 1 and Supplementary Table S1.