Fig. 1: Genome annotation and content of strain specific haplotypes. | Nature Genetics

Fig. 1: Genome annotation and content of strain specific haplotypes.

From: Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci

Fig. 1: Genome annotation and content of strain specific haplotypes.

a, Summary of the strain-specific gene sets showing the number of genes broken down by GENCODE biotype. b, Heterozygous SNP (hSNP) density for a 50 Mb interval on chromosome 11 in 200 kb windows for 17 inbred mouse strains based on sequencing read alignments to the C57BL/6J (GRCm38) reference genome (top). Labels indicate genes overlapping the densest regions. SNPs visualized in CAST/EiJ and WSB/EiJ for 71.006–71.170 Mb on GRCm38 (bottom), including Derl2 and Mis12 (upper panel) and Nlrp1b (lower panel). Grey indicates the strain base agrees with the reference, other colors indicate SNP differences and height corresponds to sequencing depth. c, Total amount of sequence and protein-coding genes in regions enriched for hSNPs (relative to the GRCm38 reference genome) per strain. d, Top PantherDB categories of coding genes in regions enriched for hSNPs based on protein class (left). Intersection of genes in the defence and immunity category for the wild-derived and classical inbred strains (right). e, Box plot of sequence divergence (%)for LTRs, LINEs, and SINEs within and outside of hSNP regions. Sequence divergence is relative to a consensus sequence for the transposable element type (n = number of repeats in GRCm38, *** indicated P < 0.001 using Welch’s two-sample t-test. Box plots show 25th and 75th percentiles, and the median value.

Back to article page