Extended Data Fig. 2: MANE collaboration UTR definition. | Nature

From: A joint NCBI and EMBL-EBI transcript set for clinical genomics and research

Graphic display of the 5′ terminal UTR exon of the gene PTPRC (HGNC:9666) in NCBI GDV to illustrate how we defined the 5′ end of the transcript. Annotation tracks (top to bottom) show transcripts in RefSeq Annotation Release 109_20210514, transcripts in Ensembl Release 104 and the MANE Select (v0.95) track. The longest 5′ UTR among the RefSeq and Ensembl/GENCODE annotation sets is flagged at the first base with a blue vertical box. The “FANTOM Total CTSS Counts” track displays histograms representing CAGE tag counts at each base position. The strongest CAGE peak (the most abundant start site, that is, the base position with the absolute maximum CAGE tag count) is highlighted with a yellow vertical box. The “RefSeq Processed CAGE” track at the bottom displays the start site (highlighted with a green vertical box) selected by the UTR algorithm. The UTR algorithm is described in the Methods and in Supplementary Method 3: UTR algorithm. Similar logic was used to compute polyA clusters and determine the 3′ ends of transcripts.
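
The two reference positions marked in the figure (the blue and yellow boxes) can be illustrated with a minimal Python sketch. This is not the UTR algorithm from Supplementary Method 3, whose selection rules are not given in this legend. The function names, the tie-breaking rule and the toy counts are hypothetical, and the sketch assumes that the most upstream annotated transcript start can stand in for the first base of the longest 5′ UTR.

```python
from typing import Dict, Iterable


def strongest_cage_peak(ctss_counts: Dict[int, int]) -> int:
    """Return the base position with the maximum CAGE tag count.

    Assumption: ties are broken by taking the smallest coordinate,
    which is an arbitrary choice made only for this illustration.
    """
    if not ctss_counts:
        raise ValueError("no CAGE counts supplied")
    # Sort key: highest count first, then lowest position.
    return min(ctss_counts, key=lambda pos: (-ctss_counts[pos], pos))


def most_upstream_start(starts: Iterable[int], strand: str) -> int:
    """Return the most upstream annotated transcript start.

    Assumption: this approximates the first base of the longest 5' UTR
    across annotation sets; it ignores introns within the UTR.
    """
    positions = list(starts)
    if not positions:
        raise ValueError("no transcript starts supplied")
    if strand == "+":
        return min(positions)  # upstream = smaller coordinate
    if strand == "-":
        return max(positions)  # upstream = larger coordinate
    raise ValueError("strand must be '+' or '-'")


# Toy, made-up data (not taken from the PTPRC locus).
toy_counts = {100: 3, 101: 12, 102: 7, 105: 12}
toy_starts = [98, 101, 104]

peak = strongest_cage_peak(toy_counts)
upstream = most_upstream_start(toy_starts, "+")
```

With the toy data, `peak` corresponds to the yellow box and `upstream` to the blue box. The start site actually chosen by the UTR algorithm (green box) may differ from both.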
