Table 2 Significant orthogroups identified by Boruta in clinical categories comparison. The table displays the relevant orthogroups identified by Boruta to distinguish PBC-associated compared to LI-associated genomes, and LS-associated compared to BAC-associated genomes. For each orthogroup, the gene name, whether available, the gene product, the mean length and standard deviation of the predicted protein, and the prevalence of the orthogroup in the genomes of the clinical categories are reported. The p-values of the Chi-squared test used to evaluate the distribution of orthogroups according to the clinical categories are displayed.
Orthogroup (PBC vs. LI) | Gene name | Gene product | Protein mean length (sd) | Prevalence/PBC genomes | Prevalence/LI genomes | p-value |
|---|---|---|---|---|---|---|
group_13 | – | DUF1353 domain-containing protein | 149.5 (0.9) | 9/19 | 40/51 | 0.026 |
group_1948 | cas8a1 | Type I CRISPR-associated protein Cas8a1/Csx8 | 414.8 (96.5) | 15/19 | 20/51 | 0.007 |
group_1955 | ftsY | Signal recognition particle-docking protein FtsY | 77.1 (5.8) | 2/19 | 26/51 | 0.005 |
group_2124 | – | T2SP-E domain-containing protein | 126 | 5/19 | 19/51 | 0.566 |
group_2260 | – | Sodium:solute symporter | 490 | 8/19 | 7/51 | 0.025 |
group_2788 | – | Hypotethical protein | 93 | 4/19 | 0/51 | 0.005 |
Orthogroup (LS vs. BAC) | Gene name | Gene product | Mean length (sd) | Prevalence/LS genomes | Prevalence/BAC genomes | p-value |
|---|---|---|---|---|---|---|
group_1749 | – | POP1 domain-containing protein | 407.9 (122.1) | 11/11 | 4/8 | 0.038 |