Extended Data Fig. 1: Data summary of publicly available RNA-Seq data.

(a) The number of publicly available RNA-Seq samples increases rapidly over years by fitting a second order polynominal model. (b) Distribution of sequence platforms of all 8,536 RNA-Seq samples. (c) Percentage of RNA-seq with single or paired reads. (d) Distribution of numbers of clean reads across all samples. (e) Distribution of read lengths. (f) Distribution of sexes. (g) Distribution of ages (Year-old). (h) Distribution of uniquely mapping rates. (i) Distribution of major tissues and breeds/ancestries in the 7,180 high quality RNA-Seq datasets (clean read > 500,000 & mapping rate > 60%).