Figure 1

Integrative analysis and visualization of the 30 samples using three datasets. In the 30 samples (13 advanced colorectal cancers [ACRCs], 10 high risk adenomas [HRAs], and 7 normal controls [NCs]), there were 529 species identified by metagenome analysis and 763 genes identified by RNA sequencing. The survey results identified 93 associated variables. Each dataset was merged and normalized to a value between 0 and 1. Principal component analysis (PCA) was applied to reduce the dimension of features from each dataset, and 10 principal components (PCs) were retrieved. (A) Each two-dimensional PCA plot represents two PCs, reduced from 1385 features. (B) The scree plot represents 10 PCs. (C) The biplot represents the direction of each feature. (D) In the ternary plot, 1385 features corresponding to ACRCs, HRAs, and NCs are presented as points and are distinguished by three colors.