Table. 1 Description of the gene sequences clustering approach.

From: The microbiome of cryospheric ecosystems

Annotation

Category

Number of clusters

Uniprot match (%)

KEGG

Cryosphere

47

61.70

Shared

1663

54.18

Non-cryosphere

2325

55.14

Ambiguous

Cryosphere

113

40.71

Shared

1056

52.65

Non-cryosphere

3105

54.17

Unassigned

Cryosphere

170

17.65

Shared

1524

5.18

Non-cryosphere

2122

46.94

  1. Table summarising the 12’125 largest gene sequence clusters present in at least two samples. The annotation refers to the assignment of the genes to one KEGG Orthologous group (KO), multiple KOs or unassigned (Ambiguous) and only unassigned (Unassigned). Distribution of assigned (KEGG), ambiguous and unassigned functional gene clusters highlighting the bias against cryospheric gene clusters. Shared refers to the representatives of both categories of samples contained gene sequences in the cluster. The number of clusters is shown, along with the proportion of clusters having a consensus sequence matching the UniProt database.