Fig. 2: Ribo-seq identifies thousands of novel CHO cell ORFs.
From: Detection of host cell microprotein impurities in antibody drug products

In this study, we utilised the ORF-RATER algorithm to identify ORFs initiating at near-cognate (i.e., NUG) start codons from the Ribo-seq data. (a) A total of 10,201 ORFs were identified, including 4491 that were not previously annotated in the Chinese hamster genome. These ORFs included N-terminal extensions of annotated protein-coding genes. (b) For instance, we identified a CUG-initiated N-terminal extension in a transcript of Aurka. The RNA-seq and CHX coverage of the transcript (full coverage [coloured grey] and P-site offset CHX coverage [coloured by reading frame relative to the annotated TIS]) are shown along with the HARR-ND coverage (P-site offset), illustrating the initiation signal at the CUG start codon upstream of the NCBI-annotated AUG start codon. Source data are provided as a Source Data file.