Fig. 2: PCN is associated with host phylogeny and plasmid groups.

a PCN distribution for all analysed plasmids (n = 6327). Inset plots represent the same plasmids separated according to the classification of their hosts. The dotted line represents the anti-mode (5.75 copies) of the distribution. b Distribution of PCN within each host genus. The numbers on the right represent the number of plasmids for each genus. c Proportion of HCPs (purple) and LCPs (green) (y-axis) by host genus (x-axis). The numbers within each bar denote the number of plasmids belonging to each category. Asterisks denote that the group of plasmids where they are placed (HCPs or LCPs) is significantly overrepresented compared to the complete plasmid dataset (white dashed line). The x-axis represents host genera, abbreviated to three (or four) letters. d Boxplots representing the PCN per PTU from Gram-negative (Pseudomonadota, purple) and -positive (Bacillota, green) bacteria (Supplementary Dataset 2). Only the most abundant PTUs are indicated for each group. Numbers on the right of each boxplot show the number of plasmids belonging to each plasmid group.