Fig. 6: Analysis results in the 10X Genomics scRNA-seq data.

Results are shown for the comparison of CD4+ T cells versus CD8+ T cells. a p-values from iDEA for GSE analysis display the expected enrichment of small p-values (for true signals) and a long flat tail towards large p-values. b Quantile-quantile plots of −log10(p-values) from GSE methods, including iDEA (orange), fGSEA (green), CAMERA (navy blue), PAGE (sky blue) and GSEA (yellow), are shown under the permuted null. The p-values from all methods in the permuted data are not distinguishable from the null expectation. Here λgc is the genomic control factor. c Number of enriched gene sets identified by iDEA (orange), fGSEA (green), CAMERA (navy blue), PAGE (sky blue) and GSEA (yellow), plotted against different empirical false discovery rates (FDR). iDEA is as powerful as the other methods for GSE analysis. d Number of DE genes identified by iDEA (orange) and zingeR (blue), plotted against different empirical FDR values. iDEA is more powerful than zingeR for DE analysis. e Heatmap shows the normalized expression level (log10 transformation with a pseudo-count of 0.1) for 30 selected DE genes (rows) identified by iDEA for cells in the two cell types (columns). Genes are sorted by hierarchical clustering; cells are ordered by cell type (CD4: blue; CD8: red). These DE genes clearly distinguish the two compared cell types. f Bubble plot shows −log10(p-values) for GSE analysis from iDEA (y-axis) for different gene sets. Gene sets are colored by source project: HPCA (red), FANTOM (yellow), BLUEPRINT (blue), NOVERSHTERN (green), ENCODE (orange), IRIS (deep blue). The size of each dot represents the number of genes contained in the gene set. Names of ten gene sets that are closely related to CD4+ and CD8+ immune processes are highlighted in the panel.
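
The caption does not define the genomic control factor λgc reported in panel b. As a minimal sketch, assuming the common definition (the median of the 1-degree-of-freedom chi-square statistics implied by the p-values, divided by the median of that chi-square distribution under the null, about 0.455), it could be computed as follows. The function name `genomic_control_factor` is hypothetical and is not taken from iDEA or any other package named here.

```python
from statistics import NormalDist, median


def genomic_control_factor(p_values):
    """Estimate lambda_gc from p-values in (0, 1].

    Assumed definition (not confirmed by the caption): median observed
    chi-square(1 df) statistic over its expected median under the null.
    """
    nd = NormalDist()
    # A two-sided p-value p corresponds to a chi-square(1 df) statistic
    # equal to the squared standard normal quantile at 1 - p/2.
    chi2_stats = [nd.inv_cdf(1.0 - p / 2.0) ** 2 for p in p_values]
    # Median of chi-square(1 df): squared normal quantile at 0.75 (~0.455).
    expected_median = nd.inv_cdf(0.75) ** 2
    return median(chi2_stats) / expected_median


# Evenly spaced p-values mimic a well-calibrated null distribution.
null_like = [(i + 0.5) / 1001 for i in range(1001)]
lam = genomic_control_factor(null_like)
```

Under this definition, a value close to 1 indicates p-values consistent with the null expectation, while values above 1 indicate inflation.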