Figure 2: Length distribution of identified features for four different selection criteria based on read count and read length. | Scientific Reports

Figure 2: Length distribution of identified features for four different selection criteria based on read count and read length.

From: Identification of small RNAs in extracellular vesicles from the commensal yeast Malassezia sympodialis

Figure 2

(i) 100.15.30 (orange line: minimum read count for each feature is 100 nt, minimum read length is 15 nt, maximum read length is 30 nt), (ii) 1,000.15.30 (green line), (iii) 200.15.30 (blue line), and (iv) 500.15.30 (purple line). Each distribution shows a primary peak at 16 nt, and a secondary peak at 21 to 22 nt. The secondary peak is only visible with more stringent filtering (i.e. higher count cut off) and is not visible in the 100.15.30 dataset. Reads shorter than 15 nt were removed from the analysis. Insert. Reads map to coding or non-coding regions of the M. sympodialis genome according to the annotation from Zhu Y et al. (manuscript submitted). The mapped reads for the 500.15.30 annotation were summed over all samples and separated into coding (C, orange line) and non-coding (NC, blue line) groups and replotted. This graph shows that the secondary peak at 21 to 22 nt is strongly associated with the non-coding reads.

Back to article page