Fig. 3: Negative correlations between the occurrence of dnd genes/gene clusters and prophages are predominant in bacteria.
From: The origin and impeded dissemination of the DNA phosphorothioation system in prokaryotes

a Correlation analysis between the occurrence of dnd genes/gene clusters and prophages among different genera. The black solid line in each sub-plot shows the best fit, and the gray shading shows the 95% confidence interval of the linear regression. The dashed lines depict the 1:1 linear relationship. The number of analyzed genera (n), R² values, and P values of the linear regressions are shown in each sub-plot. Each circle represents a single genus and is colored according to its taxonomy at the phylum level or, for Proteobacteria, the class level. b, c Correlation analysis between the occurrence of dnd genes/gene clusters and prophages in thoroughly sequenced bacterial genera (with ≥100 genomes). The correlation type is identified based on Kendall’s τ coefficient and significance values. In b, n1 and n2 below each pie chart refer to the number of genera and of included genomes, respectively. The heatmap in c shows Kendall’s τ coefficients; crosses mark genera for which the statistic could not be computed, owing to a small sample size (number of genomes <30) or the complete absence of dnd genes/gene clusters or prophages in the corresponding genus. The taxonomy of the genera and of the phyla/classes is shown below and above the heatmap, respectively. Source data are provided as a Source Data file.
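
The per-genus classification described for panels b and c can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes that each genome contributes one dnd count and one prophage count, that significance is judged at a threshold of 0.05 (`ALPHA`, not stated in the caption), and that the P value comes from the tie-corrected normal approximation for Kendall's τ-b. The function names `kendall_tau_b` and `classify_genus` are hypothetical. Only the cut-off of 30 genomes and the "complete absence" rule are taken from the caption.

```python
import math
from collections import Counter
from itertools import combinations

MIN_GENOMES = 30  # from the caption: genera with fewer genomes get a cross
ALPHA = 0.05      # ASSUMED significance threshold; not stated in the caption


def kendall_tau_b(x, y):
    """Return (tau_b, two-sided P) using the tie-corrected normal approximation."""
    n = len(x)
    # S = concordant minus discordant pairs; pairs tied in x or y add 0.
    s = 0
    for (xi, yi), (xj, yj) in combinations(zip(x, y), 2):
        prod = (xi - xj) * (yi - yj)
        s += (prod > 0) - (prod < 0)
    tx = list(Counter(x).values())  # sizes of tie groups in x
    ty = list(Counter(y).values())  # sizes of tie groups in y
    n0 = n * (n - 1) / 2
    n1 = sum(t * (t - 1) for t in tx) / 2
    n2 = sum(u * (u - 1) for u in ty) / 2
    tau = s / math.sqrt((n0 - n1) * (n0 - n2))
    # Variance of S under the null hypothesis, corrected for ties.
    var = (n * (n - 1) * (2 * n + 5)
           - sum(t * (t - 1) * (2 * t + 5) for t in tx)
           - sum(u * (u - 1) * (2 * u + 5) for u in ty)) / 18
    var += (2 * n1) * (2 * n2) / (2 * n * (n - 1))
    var += (sum(t * (t - 1) * (t - 2) for t in tx)
            * sum(u * (u - 1) * (u - 2) for u in ty)) / (9 * n * (n - 1) * (n - 2))
    z = s / math.sqrt(var)
    return tau, math.erfc(abs(z) / math.sqrt(2))


def classify_genus(dnd_counts, prophage_counts):
    """Return (label, tau); tau is None when the test cannot be run."""
    n = len(dnd_counts)
    # Caption rule: too few genomes, or dnd systems / prophages entirely absent.
    if n < MIN_GENOMES or not any(dnd_counts) or not any(prophage_counts):
        return "not testable", None
    # Extra guard (assumption): tau is undefined if either variable is constant.
    if len(set(dnd_counts)) < 2 or len(set(prophage_counts)) < 2:
        return "not testable", None
    tau, p = kendall_tau_b(dnd_counts, prophage_counts)
    if p >= ALPHA:
        return "no significant correlation", tau
    return ("positive" if tau > 0 else "negative"), tau


# Toy data (invented for illustration): 40 genomes in one genus.
dnd = [1] * 10 + [0] * 30
prophages = [0] * 10 + [2] * 30
print(classify_genus(dnd, prophages))
```

In this sketch the "not testable" label plays the role of the crosses in the heatmap, and the remaining three labels correspond to the kind of categories a pie chart such as the one in b would summarize.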