Supplementary Figure 9: Sequence-set balancing (SSB).

The figure illustrates the SSB process. (a) The sequences in each set are binned separately by region length and GC content. Only a subset of the bins is shown: region lengths of 500-700 bp and GC content of 45-47%. (b) Bins whose sequence counts differ between the two sets are highlighted in red. (c) Sequences are randomly removed from any bin that contains more sequences than its corresponding bin in the other set.
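
The three panels can be sketched in code as follows. This is a minimal illustration and not the authors' implementation. The function names, the bin widths (`len_width`, `gc_width`), and the fixed random seed are all assumptions made for the example, and the sketch assumes that removal continues until both bins of a pair hold the same number of sequences, so a bin with no counterpart in the other set is emptied.

```python
import random
from collections import defaultdict


def gc_percent(seq):
    """GC content of a sequence as a percentage (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def bin_sequences(seqs, len_width, gc_width):
    """Panel (a): group sequences by (length bin, GC-content bin)."""
    bins = defaultdict(list)
    for seq in seqs:
        key = (len(seq) // len_width, int(gc_percent(seq) // gc_width))
        bins[key].append(seq)
    return bins


def balance_sets(set_a, set_b, len_width=100, gc_width=1.0, seed=0):
    """Panels (b) and (c): equalise the counts in corresponding bins.

    Bin widths and the seed are hypothetical defaults chosen for
    illustration; they are not taken from the figure.
    """
    rng = random.Random(seed)
    bins_a = bin_sequences(set_a, len_width, gc_width)
    bins_b = bin_sequences(set_b, len_width, gc_width)

    balanced_a, balanced_b = [], []
    for key in sorted(set(bins_a) | set(bins_b)):
        a = bins_a.get(key, [])
        b = bins_b.get(key, [])
        # Keep as many sequences as the smaller bin holds; the surplus
        # in the larger bin is discarded by random sampling.
        keep = min(len(a), len(b))
        balanced_a.extend(rng.sample(a, keep))
        balanced_b.extend(rng.sample(b, keep))
    return balanced_a, balanced_b
```

After balancing, the two returned lists have identical counts in every (length, GC content) bin, so any downstream comparison between the sets is not confounded by those two properties.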