Table 3 Comparison between a complex genome (angiosperm) and a more simple genome (mammal), and the subsequent problems faced by RAD tag sequencing in complex genomes, adapted from Kejnovsky et al. (2009)

From: Next-generation hybridization and introgression

 

Complex angiosperm genome

Simple mammal genome

Difficulty for RAD tag sequencing

Occurrence of multi-gene families

Large multi-gene families common (37.4% of genes occur in families with >5 members)

Fewer, smaller multi-gene families (in humans 1.4% of genes occur in families with >5 members)

Paralog determination for studies without a reference genome. Paralog discrimination for pairwise SNP comparisons

Genome size variation

Very large: 2000-fold

Relatively small: 5-fold

Larger genomes require more markers for high coverage—making studies more expensive

Retrotransposable elements

Diverse range, varying in copy number with frequent insertions. Retroelements contribute up to 80% of the genome

Low diversity of retroelement families, with a low frequency of retorelement insertion

Restriction digest sites in different retrotransposons may be hard to analyse, and many reads will be uninformative

Polyploidy

Frequent occurrence; all species have successive rounds of ancient polyploidization, with many having more recent events too

Absent in mammals, only one controversial report of a polyploid rodent

Discrimination of nearly identical paralogs difficult

Recombination and translocation

Frequent recombination and chromosome translocations

Lower rates of recombination combined with small number of large translocations

Pairwise comparison between recombined genes difficult

  1. Abbreviation: RAD, restriction site-associated DNA.