Table 3 Association between human GEO deposits’ geographic origin (North America vs. other regions) and their corresponding disease area (related to a major disease area or unclassified) from 2001 to 2017

From: Trends in the characteristics of human functional genomic data on the gene expression omnibus, 2001–2017

 

| Year | North America: related to major disease area | North America: unclassified | Other regions: related to major disease area | Other regions: unclassified | OR | 95% CI | P value |
|---|---|---|---|---|---|---|---|
| 2001 | 0 | 0 | 0 | 0 | - | - | - |
| 2002 | 19 | 16 | 2 | 1 | 0.59 | 0.05–7.17 | >0.99 |
| 2003 | 44 | 109 | 13 | 20 | 0.62 | 0.28–1.36 | 0.297 |
| 2004 | 87 | 105 | 29 | 47 | 1.34 | 0.78–2.31 | 0.339 |
| 2005 | 168 | 157 | 86 | 57 | 0.71 | 0.48–1.06 | 0.107 |
| 2006 | 203 | 173 | 144 | 92 | 0.75 | 0.54–1.04 | 0.094 |
| 2007 | 312 | 210 | 199 | 139 | 1.04 | 0.79–1.37 | 0.831 |
| 2008 | 414 | 227 | 321 | 207 | 1.18 | 0.93–1.49 | 0.201 |
| 2009 | 522 | 338 | 447 | 292 | 1.01 | 0.83–1.23 | 0.959 |
| 2010 | 601 | 425 | 599 | 401 | 0.95 | 0.79–1.13 | 0.557 |
| 2011 | 814 | 552 | 768 | 514 | 0.99 | 0.84–1.15 | 0.874 |
| 2012 | 844 | 588 | 1026 | 630 | 0.88 | 0.76–1.02 | 0.09 |
| 2013 | 954 | 716 | 1103 | 741 | 0.90 | 0.78–1.02 | 0.107 |
| 2014 | 1001 | 757 | 1303 | 941 | 0.95 | 0.84–1.08 | 0.479 |
| 2015 | 1206 | 839 | 1419 | 1087 | 1.10 | 0.98–1.24 | 0.117 |
| 2016 | 1394 | 2192 | 1481 | 1080 | 0.46 | 0.42–0.51 | 6.84E−49 |
| 2017 | 1248 | 3709 | 1586 | 1353 | 0.29 | 0.26–0.32 | 3.46E−145 |
| Total | 9831 (25.2%) | 11,113 (28.4%) | 10,526 (26.9%) | 7602 (19.5%) | 0.64 | 0.61–0.67 | 5.02E−107 |

Note: The odds ratio (OR) compares the odds of North American origin for GEO deposits related to a major disease area with the odds for unclassified deposits; an OR below 1 means that disease-related deposits were less likely than unclassified deposits to originate from North America. The six major disease areas were cancer, cardiovascular diseases, diabetes, immunologic diseases, infectious diseases, and neurologic diseases; all deposits not fitting any of the six were categorized as unclassified. CI, confidence interval.
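As an illustration of how the OR and 95% CI columns relate to the four count columns, the sketch below computes an odds ratio from one row's 2×2 counts. The source does not state how the intervals were obtained; the log-based (Woolf) interval used here is an assumption, although it does reproduce the published values for the 2003 row. The function name `odds_ratio_ci` and its parameter names are hypothetical, and P values are not computed because the test used is not given in the table.

```python
import math

def odds_ratio_ci(na_disease, na_unclassified, other_disease, other_unclassified, z=1.96):
    """Return (OR, lower, upper) for a 2x2 table, or None if any cell is zero.

    Assumption: the interval is the Woolf (log) interval; the table itself
    does not say which method was used.
    """
    cells = (na_disease, na_unclassified, other_disease, other_unclassified)
    if min(cells) == 0:
        return None  # OR undefined, shown as "-" in the table (e.g. 2001)
    # Cross-product ratio of the 2x2 table
    odds_ratio = (na_disease * other_unclassified) / (na_unclassified * other_disease)
    # Standard error of log(OR): square root of the summed reciprocal counts
    se = math.sqrt(sum(1.0 / c for c in cells))
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Counts from the 2003 row of Table 3
result = odds_ratio_ci(44, 109, 13, 20)
print("{:.2f} ({:.2f}-{:.2f})".format(*result))  # prints 0.62 (0.28-1.36)
```

The zero-cell guard mirrors the 2001 row, where no deposits were recorded and the table reports a dash instead of an estimate.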