Fig. 1: An overview of the workflow.

In the tables for each gene, S1, S2, ..., Si represent sample IDs, and PC1, PC2, ..., PCk represent the 1, 2, ..., k-th principal component of each Gene-PCA, respectively. The number of PC is 8 or 4 or 1, which depends on the length of the input, that is, the number of SNPs.