Table 2 The sample size of different tissues’ training and test datasets.

From: RNAm5Cfinder: A Web-server for Predicting RNA 5-methylcytosine (m5C) Sites Based on Random Forest

Tissues

Training set

Test set

pos

neg

pos

neg

Comprehensive

19,798

593,941

6636

1,924,243

ESCa-specific

3440

103,201

828

299,610

Heart-specific

12,703

381,091

100

30,433

Kidney-specific

12,700

381,001

122

37,088

Liver-specific

11,937

358,111

125

37,844

Muscle-specific

11,826

354,781

118

36,519

Small-Intestine-specific

11,372

341,161

107

32,170

Brain-specific

19,141

424,231

472

155,409

  1. The ratio of the positives and the negatives of the training set and test set were set to 1:30 and 1:all respectively. As for the test sets of tissue-specific predictors, samples which were used to train predictors for the other tissues were discarded.
  2. aESC, embryonic stem cell.