Table 15 Biomarker datasets

From: Machine learning for Parkinson’s disease: a comprehensive review of datasets, algorithms, and challenges

Article

Repository/ Source

#PD

#HC

#PDF

#PDM

#HCF

#HCM

Information

1

GEO databases264,265

Protein sequences from NCBI & UniProt (FASTA file format),

removed duplicates and incomplete sequences,

Dataset: 640 PD-related and 1010 non-PD sequences

6

GEO databases264

35

28

Training sets: GSE20163, GSE20164, GSE42966,

validation: GSE26927

145

266

14

14

16 F and 12 M

146

UC San Diego dataset267,268

15

16

8

7

9

7

HC: average age (63.5 ± 9.6)

PD: average age (63.2 ± 8.2)

147

Clinical

20

21

10

10

10

11

HC: average age (67.5 ± 6.4),

PD: average age (67.6 ± 7.0),

University of British Columbia

148

269,270

24

24

HC: average age (69.33 ± 9.78)

PD: average age (69.75 ± 8.91)

149

Clinical

104

11

PD: average age (59.43 ± 12.15)

HC: average age (57.26 ± 9.15)

144

271,272,273,274,275,276

Gene expression data from GEO: GSE18838, GSE57475, GSE72267, GSE99039, and GSE6613,

406 PD samples,

336 HC samples

150

Clinical

39

40

17

22

12

28

HC: average age (59.00 ± 4.54)

PD: average age (61.31 ± 6.01)

151

277

25

25

9

16

9

16

PD: average age (69.98 ± 8.73)

HC: average age (69.32 ± 9.58)

Cognitive Rhythms Lab (UNM),

collecting data in 2015

278,279

20

20

11

9

12

8

PD: average age (69.80 ± 7.60)

HC: average age (67.80 ± 6.35)

Information was gathered at the University of Turku in Finland

152

280

24

24

153

Clinical

19

154

PPMI220

490

197

HC: average age 61.3

PD: average age 62

Clinical

59

31

21

38

17

20

From 2015 to 2018

155

Clinical

187

125

76

111

67

58

156

Clinical

31

13

16

15

6

7

157

SEED-IV281

SEED-IV dataset with 15 subjects,

Evaluated using film clip stimuli,

Emotions: happy, neutral, fearful and sad

AMIGOS282

AMIGOS dataset:

33 subjects, auditory and visual stimuli

Two trial types: short and long videos

283

20

20

11

9

10

10

Multimodal stimuli: images, audio, video

Mean age: 58.7

158

UC San Diego dataset267,284

16

10

8

8

9

1

PD: average age 58.7

HC: average age 63.5 ± 9.6

159

Clinical

29

20

9

Female average age: 62

Male average age: 63

160

285

23

26

7

16

13

13

10 patients with ICD

(8 M and 2 F)

161

GEO databases264

20

20

10

10

12

8

GEO datasets: GSE8397, GSE20292, GSE20163, GSE20164, and GSE49036,

Average age (68.2 ± 7.2)

Average age (66.0 ± 12.8)

162

Clinical

65

65

9

56

9

56

Parkinson’s patients were selected from Juntendo University Hospital, Tokyo, Japan,

The first cohort included de novo PD patients

(HC: average age (62.2 ± 11.8)

PD: average age (61.7 ± 11.4)

The second cohort included male PD patients with and without medication

(HC: average age (66.8 ± 9.08)

PD: average age of (64.2 ± 10.6)

163

UC San Diego dataset267,286,287

15

16

8

7

9

7

Collected from the University of San Diego,

EEG was recorded in the resting state

270

27

27

17

10

17

10

Collected from the University of New Mexico (UNM)

164

PPMI220

294

154

99

195

58

96

PD: average rage (61 ± 9.7)

HC: average rage (60.3 ± 11)

PDEP288

263

115

112

151

64

51

PD: average rage (64.3 ± 8.6)

HC: average rage (63.6 ± 9.5)

165

National Health Insurance Service-Health Screening (NHIS-HEALS) database289

1102

1102

505

597

492

610

Adults aged 40 and older,

data includes lab and anthropometric measures, sex, lifestyle, socioeconomic status

166

PPMI220

423

The dataset consisted of de novo PD patients

167

Loyola University Chicago (LUC), Clinical

29

165

11

18

64

101

ECG data from individuals aged 26–89 years,

MLH dataset: collected 2015–2020,

LUC dataset: collected 2014–2020

University of Tennessee-Methodist Le Bonheur Healthcare (MLH), Clinical

131

1058

54

77

496

562

168

PPMI220

697

Patients assessed before dyskinesia onset

  1. In this table, we detailed all the datasets in papers and compared participant demographics (number, gender, and health status: #PD => Parkinson’s Disease, #HC => Number of Healthy Control Participants, #PDF => Number of Parkinson’s disease Female Participants, #PDM => Number of Parkinson’s disease Male Participants, #HCF => Number of Healthy Control Female Participants, #HCM => Number of Healthy Control Male Participants).