Figure 6

Genes in the Si-responsive gene clusters are poorly conserved. The phylogenetic distribution of the proteins encoded in the (a) TpSIL2, (b) CinY1 and (c) SiMat7 clusters was analysed on a local BLAST server using tBlastN searches. Light green rectangles indicate the presence, in each diatom class, of sequences with a significant hit against the corresponding T. pseudonana protein (tBlastN E-value <1e–10). Greyscale rectangles indicate the presence of sequences with a significant hit in non-diatom microalgae; the greyscale heat map represents the number of species within each taxonomic class with a significant hit against the corresponding T. pseudonana protein. Asterisks indicate significant tBlastN hits (E-value <1e–10) against non-algal sequences in the NCBI nr database. Coloured circles indicate the presence of selected known domains. Numbers indicate JGI Thaps3 gene IDs. C, Coscinodiscophyceae; M, Mediophyceae; F, Fragilariophyceae; B, Bacillariophyceae; ND, non-diatom microalgae; NA, non-algal species.