Fig. 2: Genome annotation of NCMD assembly. | Scientific Data

Fig. 2: Genome annotation of NCMD assembly.

From: A chromosome-level genome assembly of the Korean crossbred pig Nanchukmacdon (Sus scrofa)

Fig. 2

(a) Workflow for annotating protein-coding genes. Grey-colored boxes represent programs, and green- and brown-colored boxes respectively indicate their input and output for ab initio and homology-based prediction. The boxes named “Protein sequences” and “RNA-seq reads” represent the protein sequences obtained from the UniProtKB/Swiss-Prot database and RNA-seq data from 24 different tissues of Nanchukmacdon, respectively. “Gene annotation” means the collection of reference gene annotations of related six species (cow, goat, human, pig, mouse and sheep) for GeMoMa program or the gene annotation of pig for LiftOver and Liftoff program. (b) Gene annotation statistics for the assemblies of diverse pig breeds. The annotation statistics of 13 pig assemblies except the NCMD assembly were obtained from the Ensembl database (Release 109). (c) Statistics for the annotated non-coding genes of the NCMD assembly. (d) Sequence divergence of repetitive elements in the NCMD assembly.

Back to article page