Figure 6

The workflow for motif cassette discovery and validation. (a) 100 experimental (down-regulated) and control (up-regulated) sequences are used for de novo motif discovery via MEME. (b) The sequences of motifs are reduced to strings of tokens and used by the modified general sequence pattern (GSP) algorithm to find cassettes of composite patterns. (c) The MAST program is used to find the motifs in non-discovery sets of experimental, control and comparative sequences. (d) These motif results are mined for the cassette patterns discovered above. (e) The enrichment of the cassettes in comparison to the control set is calculated, as is a p-value. Note: the values shown are only for illustration purposes.