Extended Data Fig. 3: Frequency of TSS signatures in RefSeq, Ensembl, and MANE transcripts. | Nature

From: A joint NCBI and EMBL-EBI transcript set for clinical genomics and research

A) Frequency (y-axis) of A, C, G, and T nucleotides at each position relative to the transcription start site (TSS; x-axis). MANE transcripts show an enrichment of C at −1 and of a purine (A or G) at +1. B) Count of transcripts (y-axis) with a best initiator (Inr) motif placed at each position relative to the TSS (x-axis). The peak of Inr motifs at −3 corresponds to the core CA motif located at −1 to +1. C) Count of transcripts (y-axis) with a TATA-box placed at each position relative to the TSS (x-axis). The peak of TATA-box motifs at −31 corresponds to the core TATAAA motif located at −28 to −23, upstream of the TSS. Details of the methods are available in Supplementary Methods 1.
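The relationship between a motif's plotted start position and the location of its core can be checked with a little coordinate arithmetic. The sketch below is illustrative only and is not taken from Supplementary Methods 1. It assumes TSS-relative coordinates with no position 0 (the base before +1 is −1). The core offsets within each motif (2 for Inr, 3 for the TATA-box) are inferred from the numbers in the caption, and the names `shift` and `core_span` are hypothetical.

```python
def shift(pos: int, offset: int) -> int:
    """Move `offset` bases downstream from a TSS-relative position.

    Assumes a coordinate system with no position 0: ..., -2, -1, +1, +2, ...
    """
    if pos == 0:
        raise ValueError("position 0 does not exist in this convention")
    # Map onto a continuous index in which +1 becomes 0 and -1 stays -1.
    idx = pos - 1 if pos > 0 else pos
    idx += offset
    # Map back, skipping 0 again.
    return idx + 1 if idx >= 0 else idx


def core_span(motif_start: int, core_offset: int, core_len: int) -> tuple[int, int]:
    """Return (first, last) TSS-relative positions of a motif's core.

    core_offset: number of motif bases preceding the core (assumed value).
    core_len: length of the core in bases.
    """
    first = shift(motif_start, core_offset)
    last = shift(first, core_len - 1)
    return first, last


# Inr motif starting at -3, with an assumed 2 bases before the 2-base CA core.
inr_core = core_span(-3, 2, 2)

# TATA-box motif starting at -31, with an assumed 3 bases before the 6-base TATAAA core.
tata_core = core_span(-31, 3, 6)

print(inr_core, tata_core)
```

Under these assumptions the Inr core spans −1 to +1 and the TATA-box core spans −28 to −23, matching the positions stated in the caption.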