Fig. 6: Strategy for selective removal of variation due to batches.

a–f The batch registration strategy is illustrated with synthetic data that models a single cluster as a two-dimensional Gaussian distribution. Eighteen Gaussians of 300 cells each represent 18 subjects, each with a different cluster centroid. Each subject is represented by a different color in a, and the three batches are designated in b. Six samples were designated as old and 12 as young (c). The cluster movement vectors necessary to register the centroids of the blue and green batches onto the red batch are shown in d, and the batches and young/old samples after registration are shown in e, f. Colors represent different subjects (a), batches (b, d, e), or age (c, f).
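
The synthetic setup in this caption can be approximated with a short NumPy sketch. It is illustrative only and is not the authors' implementation. The caption does not state the split of subjects across batches, the batch offsets, the Gaussian widths, or exactly how a batch centroid is defined. The code therefore assumes six subjects per batch, hypothetical offset and spread values, and a batch centroid equal to the mean of its subject centroids. The names `register_batches`, `batch_offset`, and `batch_of_subject` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

N_SUBJECTS, N_CELLS, N_BATCHES = 18, 300, 3

# Assumption: subjects are split evenly, six per batch, with batch ids 0..2.
# Batch 0 plays the role of the reference ("red") batch.
batch_of_subject = np.repeat(np.arange(N_BATCHES), N_SUBJECTS // N_BATCHES)

# Hypothetical batch offsets and subject-level scatter (illustrative values).
batch_offset = np.array([[0.0, 0.0], [6.0, 2.0], [-4.0, 5.0]])
subject_centroid = batch_offset[batch_of_subject] + rng.normal(0.0, 1.0, (N_SUBJECTS, 2))

# One two-dimensional Gaussian of 300 cells per subject: shape (18, 300, 2).
cells = subject_centroid[:, None, :] + rng.normal(0.0, 0.5, (N_SUBJECTS, N_CELLS, 2))


def register_batches(cells, batch_of_subject, reference=0):
    """Shift every batch so its centroid coincides with the reference batch.

    Assumes batch ids are the integers 0..n_batches-1. Returns the shifted
    cells and one movement vector per batch (zero for the reference batch).
    """
    n_batches = int(batch_of_subject.max()) + 1
    subj_centroids = cells.mean(axis=1)  # (n_subjects, 2)
    batch_centroids = np.stack(
        [subj_centroids[batch_of_subject == b].mean(axis=0) for b in range(n_batches)]
    )
    # Movement vector for each batch: reference centroid minus batch centroid.
    vectors = batch_centroids[reference] - batch_centroids
    # Apply the same rigid shift to every cell of every subject in a batch.
    registered = cells + vectors[batch_of_subject][:, None, :]
    return registered, vectors


registered, vectors = register_batches(cells, batch_of_subject, reference=0)
```

Because each batch receives a single translation, the relative positions of subjects within a batch are unchanged; only the between-batch offset is removed. Under these assumptions, this is the sense in which the removal of variation is selective.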