Table 1 Training and testing cohorts used in the development and validation of IdentifiHR
Cohort | Cases (n) | Samples (n) | Training samples (n) | Testing samples (n) | HRD (n) | HRP (n) | Mean age (min - max) | FIGO stage (mode) | Mean tumour purity (min - max) | RNA sequencing platform | Mean read length | Median library size (min - max) | Source (PMID) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
TCGA | 288 | 288 | 288 | 0 | 152 | 136 | 59.92 (34– 87) | Stage IIIC | 0.86 (0.52–1.00)a | Illumina HiSeq 2000 | 75 bp | 60,237,474 (17,409,619–143,721,571) | |
TCGA | 73 | 73 | 0 | 73 | 42 | 31 | 59.45 (30– 87) | Stage IIIC | 0.86 (0.60–1.00)a | Illumina HiSeq 2000 | 75 bp | 63,520,620 (26,171,923–134,252,345) | |
AOCS | 81 | 99 | 0 | 99 | 52 | 47 | 58.84 (39– 78) | Stage IIIC | 0.62 (0.30–0.92)b | Illumina HiSeq 2000 | 100 bp | 89,817,975 (35,616,656–144,374,239) | |
MSKCC | 37 | 106 | 0 | 37 (pseudobulked at patient level) | 23 | 14 | 61.60 (38– 81) | Stage IVB | NA (only HGSC cells examined) | Illumina HiSeq 2500 or on a NovaSeq 6000 | 28 bp/91-bp, 100 bp or 150 bp | 56,146,402 (1,094,415–325,475,953) |