Table 1 Study population characteristics

From: Epigenetic fingerprints link early-onset colon and rectal cancer to pesticide exposure

Study

N total

Females, N (%)

Age, years (mean (s.d.))

Cancer type

Discovery dataset:

    

TCGA-COAD

   

Colon

Early-onset

31

18 (58)

43 (5)

 

Late-onset

100

43 (43)

78 (5)

 

Replication datasets:

    

TCGA-READ

   

Rectal

Early-onset

14

6 (43)

44 (6)

 

Late-onset

30

21 (70)

76 (6)

 

GSE39958

   

Rectal

Early-onset

12

3 (25)

45 (7)

 

Late-onset

9

2 (22)

74 (5)

 

GSE42752

   

Colon

Early-onset

4

3 (75)

46 (3)

 

Late-onset

7

5 (71)

76 (4)

 

GSE77954

   

Colorectal

Early-onset

3

3 (100)

48 (1)

 

Late-onset

10

3 (30)

77 (6)

 

GSE101764

   

Colorectal

Early-onset

13

3 (23)

38 (8)

 

Late-onset

37

13 (35)

76 (5)

 

GSE131013a

   

Colon

Early-onset

2

0 (0)

46 (4)

 

Late-onset

58

13 (22)

76 (5)

 

E-MTAB-7036

   

Colon

Early-onset

18

14 (78)

41 (7)

 

Late-onset

102

57 (56)

79 (5)

 

E-MTAB-3027

   

Colorectal

Early-onset

3

0 (0)

75 (5)

 

Late-onset

10

4 (10)

41 (7)

 

GSE199057

   

Colon

Early-onset

14

7 (50)

44 (6)

 

Late-onset

9

3 (33)

74 (2)

 
  1. Early-onset cases are defined as participants younger than 50 years, whereas late-onset cases comprise participants diagnosed at age 70 years and older. Dataset GSE77954 encompasses primary (n = 7) and metastatic samples (n = 6). The early-onset category in GSE131013 and E-MTAB-3027 comprises solely male participants, whereas GSE77954 includes only female participants. Consequently, adjustment for sex was not feasible in these datasets. DNA methylation data was generated using the Illumina Infinium HumanMethylation450 BeadChip, except for GSE199057, which used the EPIC BeadChip array.