Figure 1

A protein sequence that was incorrectly inferred from an erroneous gene prediction (M514_09783) in the draft gene set of the Trichuris suis genome (adult female; PRJNA208416). (A) Amino acid domain architecture (based on InterProScan, ref. 46) of the inferred “protein kinase/CAP” fusion protein, containing domains typical of protein kinases (“SH2 domain”; blue; IPR000980 and “protein kinase domain”; orange-brown; IPR000719) and a CAP domain (light brown; IPR014044). (B) Original gene model for M514_09783 on scaffold71 (accession number: KL367486) of the genome assembly of T. suis (top); exons are displayed as thick blue boxes, introns as blue lines and the 5′ untranslated region as a thin blue box. Eight transcripts (de novo-assembled from publicly available RNA-Seq data) are mapped to scaffold71 in this region (below), refuting the original gene prediction and providing evidence for two independent gene models. The longest transcript isoforms represent the curated gene models and are indicated by asterisks.