Table 1 Published microbiome datasets analyzed.

From: Colorectal cancer-associated bacteria are broadly distributed in global microbiomes and drivers of precancerous change

Dataset role

NCBI BioProject ID

References

CRC cases

Adenoma cases*

Healthy controls

Median sequencing depth

Training data

PRJEB10878

Yu et al.11

75

0

53

56.2 M reads/sample

Training data

PRJEB6070

Zeller et al.9

91

42

66

58.0 M reads/sample

Training data

PRJEB7774

Feng et al.10

46

47

63

52.7 M reads/sample

Test data (validation)

PRJDB4176

Yachida et al.12

258

0

251

46.3 M reads/sample

  1. *Adenoma metagenomes were not used for training the CAG-based model.