Supplementary Figure 9: Sequence-set balancing (SSB). | Nature Methods


From: Predicting the human epigenome from DNA motifs


The figure illustrates the SSB process. (a) The sequences in each set are binned separately by region length and GC-content. Only a subset of the bins is shown in the figure: region lengths of 500-700 bp and GC-content of 45-47%. (b) Bins whose sequence counts differ between the sets are highlighted in red. (c) Sequences are randomly removed from any bin that contains more sequences than its corresponding bin in the other set.
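
The three panels can be read as a short procedure. The following Python sketch is one possible rendering of it, not the authors' implementation: the function names, the bin widths (`length_bin=100` bp, `gc_bin=1.0` percentage point), and the fixed random seed are all assumptions made for illustration, since the caption does not state them. The sketch also assumes that a bin absent from one set counts as empty, so the matching bin in the other set is emptied as well.

```python
import random
from collections import defaultdict


def gc_percent(seq):
    """Return the GC-content of a sequence as a percentage."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def bin_sequences(seqs, length_bin=100, gc_bin=1.0):
    """Panel (a): group sequences by (length bin, GC-content bin).

    The bin widths are hypothetical defaults, not values from the paper.
    """
    bins = defaultdict(list)
    for seq in seqs:
        key = (len(seq) // length_bin, int(gc_percent(seq) // gc_bin))
        bins[key].append(seq)
    return bins


def balance_sets(set_a, set_b, length_bin=100, gc_bin=1.0, seed=0):
    """Panels (b) and (c): equalise per-bin counts by random removal."""
    rng = random.Random(seed)  # seeded only to make the sketch reproducible
    bins_a = bin_sequences(set_a, length_bin, gc_bin)
    bins_b = bin_sequences(set_b, length_bin, gc_bin)

    balanced_a, balanced_b = [], []
    for key in sorted(set(bins_a) | set(bins_b)):
        in_a = bins_a.get(key, [])
        in_b = bins_b.get(key, [])
        # Keep as many sequences as the smaller bin holds; the surplus in
        # the larger bin is dropped at random.
        keep = min(len(in_a), len(in_b))
        balanced_a.extend(rng.sample(in_a, keep))
        balanced_b.extend(rng.sample(in_b, keep))
    return balanced_a, balanced_b
```

After `balance_sets` returns, every (length, GC-content) bin holds the same number of sequences in both outputs, so the two sets share the same joint length and GC-content distribution at the chosen bin resolution.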
