Table 1 Training and testing cohorts used in the development and validation of IdentifiHR

From: IdentifiHR predicts homologous recombination deficiency in high-grade serous ovarian carcinoma using gene expression

Cohort

Cases (n)

Samples (n)

Training samples (n)

Testing samples (n)

HRD (n)

HRP (n)

Mean age (min - max)

FIGO stage (mode)

Mean tumour purity (min - max)

RNA sequencing platform

Mean read length

Median library size (min - max)

Source (PMID)

TCGA

288

288

288

0

152

136

59.92 (34– 87)

Stage IIIC

0.86 (0.52–1.00)a

Illumina HiSeq 2000

75 bp

60,237,474 (17,409,619–143,721,571)

21720365

TCGA

73

73

0

73

42

31

59.45 (30– 87)

Stage IIIC

0.86 (0.60–1.00)a

Illumina HiSeq 2000

75 bp

63,520,620 (26,171,923–134,252,345)

21720365

AOCS

81

99

0

99

52

47

58.84 (39– 78)

Stage IIIC

0.62 (0.30–0.92)b

Illumina HiSeq 2000

100 bp

89,817,975 (35,616,656–144,374,239)

36456881

MSKCC

37

106

0

37 (pseudobulked at patient level)

23

14

61.60 (38– 81)

Stage IVB

NA (only HGSC cells examined)

Illumina HiSeq 2500 or on a NovaSeq 6000

28 bp/91-bp, 100 bp or 150 bp

56,146,402 (1,094,415–325,475,953)

36517593

  1. aPurity taken from a consensus purity estimate published by Aran et al. (PMID: 26848121)
  2. bPurity estimated from FACETS (v0.6.1) by Garsed et al. (PMID: 36456881)
  3. AOCS Australian Ovarian Cancer Study, FIGO International Federation of Gynecology and Obstetrics, HRD Homologous Recombination Deficient, HRP Homologous Recombination Proficient, MSKCC Memorial Sloan Kettering Cancer Center, TCGA The Cancer Genome Atlas.