Table 2 Genomic Datasets Specifications.
Dataset | Samples | # Original F | # Cleaned F | # Labels | Proportion of label 1 | Proportion of label 2 | Proportion of label 3 | Proportion of label 4 |
|---|---|---|---|---|---|---|---|---|
GDS1615 | 127 | 22,282 | 13,649 | 3 | 33% | 20.5% | 46.5% | – |
GDS3268 | 202 | 44,290 | 29,916 | 2 | 36.1% | 63.9% | – | – |
GDS968 | 171 | 12,625 | 9,117 | 4 | 26.3% | 26.3% | 22.8% | 24.6% |
GDS531 | 173 | 12,625 | 9,392 | 2 | 20.8% | 79.2% | – | – |
GDS2545 | 171 | 12,625 | 9,391 | 4 | 10.6% | 36.8% | 38% | 14.6% |
GDS1962 | 180 | 54,675 | 29,185 | 4 | 12.8% | 14.4% | 45% | 27.8% |
GDS3929 | 183 | 24,526 | 19,334 | 2 | 69.9% | 30.1% | – | – |
GDS2546 | 167 | 12,620 | 9,583 | 4 | 10.2% | 35.3% | 39.5% | 15% |
GDS2547 | 164 | 12,646 | 9,370 | 4 | 10.4% | 35.4% | 39% | 15.2% |