Fig. 2

Consensus cumulative distribution function and cluster consensus score to determine at what number of clusters.
(a) The lines by colors indicating the cumulative distribution functions (CDF) of the consensus matrix for each number of clusters. The CDF reaches an approximate maximum, consensus and cluster confidence is at a maximum at this K. (b) The changes in area under the CDF curves comparing K and K − 1. For K = 2, there is no K − 1, so the total area under the curve rather than the relative increase is plotted. The relative increases in consensus are used to determine K at which there is appreciable increase. (c) The mean consensus score for different numbers of clusters (K ranges from 2 to 7). Cluster is indicated by color following the same color scheme as the cluster matrices and tracking plots. The bars are grouped by K which is marked on the horizontal axis. High values indicate a cluster has high stability and low values indicate a cluster has low stability. For K = 4, the mean consensus score was 0.97 for cluster 1, 0.99 for cluster 2, 0.96 for cluster 3, 0.97 for cluster 4.