Fig. 5

ORFeome reconstruction identifies unannotated ORF variants and novel ORFs. a Histograms show the distributions of ORFscores for annotated ORFs (red) and for all possible ORFs in our FLT (blue). The distributions demonstrate that an ORFscore cutoff of 5 discriminates translated from untranslated ORFs. b Number of ORFs per gene in our ORFeome compared with that in the RefSeq/Ensembl annotation. c Schematic of ORF types, including annotated ORFs, ORF variants, and novel ORFs. d Pie charts show the proportions of the different ORF types in our ORFeome. e–g Alluvial diagrams show the impact of RNA isoform diversity on the diversity of ORF variants, including e the impact of the number of alternative events, f the impact of single alternative events, and g the impact of pairs of alternative events. ORF types: Canonical, annotated ORF; N-term, including N-terminal truncation, N-terminal extension, and N-terminal divergence; Internal, including internal insertion, internal deletion, and internal divergence; C-term, C-terminal divergence; In frame, including in frame and partial in frame; Diff frame, different frame. Source data for panel b are provided in a Source Data file.