Fig. 5: The principle of automatic repeat classification and masking.
From: Repetitive DNA sequence detection and its role in the human genome

Sub-graph (a): A simple example of the distribution characteristics of repeats in the reference genome, where the black blocks represent chromosomes. Sub-graph (b): Principle of repeat detection, where the final sequence composed of colored bases represents the consensus repeat sequence. Sub-graph (c): Principle of automatic repeat classification, where black and dark green cylinders represent nucleic acid and protein libraries, respectively. Sub-graph (d): Principle of automatic repeat masking. Light green cylinders in Sub-graphs (c) and (d) represent the user-defined repeat library, and black blocks in Sub-graph (d) indicate the sequencing reads from various samples of the same or similar species.