Table 1. Factor distribution differences between the training and test datasets.

From: Prediction of Intracranial Aneurysm Risk using Machine Learning

| Factors | Training set (n = 299,088) | Test set (n = 128,181) | p-value |
|---|---|---|---|
| Age | 46.05 ± 13.92 | 46.04 ± 13.94 | 0.93 |
| Sex (Female) | 152,746 (51.1) | 65,677 (51.2) | 0.32 |
| BMI | 23.55 ± 3.28 | 23.55 ± 3.29 | 0.87 |
| Waist circumference | 79.41 ± 9.29 | 79.41 ± 9.31 | 0.98 |
| Hypertension | 39,341 (13.2) | 16,248 (13.0) | 0.23 |
| Systolic BP (mmHg) | 121.17 ± 14.66 | 121.18 ± 14.68 | 0.92 |
| Diastolic BP (mmHg) | 75.54 ± 9.91 | 75.56 ± 9.93 | 0.58 |
| DM | 13,686 (4.6) | 5,754 (4.5) | 0.21 |
| Glucose (mg/dl) | 95.1 ± 16.19 | 75.56 ± 9.93 | 0.61 |
| Total cholesterol (mg/dl) | 193.64 ± 35.83 | 193.67 ± 35.88 | 0.81 |
| HDL cholesterol (mg/dl) | 55.81 ± 13.79 | 55.85 ± 13.81 | 0.33 |
| LDL cholesterol (mg/dl) | 113.21 ± 32.79 | 113.24 ± 32.85 | 0.79 |
| Triglyceride (mg/dl) | 122.4 ± 72.9 | 122.28 ± 72.61 | 0.62 |
| Hemoglobin (g/dl) | 13.87 ± 1.61 | 13.86 ± 1.61 | 0.52 |
| Creatinine (mg/dl) | 0.89 ± 0.22 | 0.89 ± 0.22 | 0.67 |
| AST (IU/L) | 23.75 ± 8.74 | 23.74 ± 8.72 | 0.75 |
| ALT (IU/L) | 22.96 ± 14.09 | 22.97 ± 14.11 | 0.73 |
| GGT (IU/L) | 31.36 ± 28.9 | 31.31 ± 28.85 | 0.59 |
| Smoking: Never | 187,333 (62.6) | 80,371 (62.7) | 0.68 |
| Smoking: Ex | 37,987 (12.7) | 16,248 (12.7) | 0.83 |
| Smoking: Current | 73,768 (24.7) | 31,562 (24.6) | 0.78 |
| Familial history of stroke | 16,590 (5.5) | 7,098 (5.5) | 0.91 |
| Familial history of heart disease | 10,320 (3.5) | 4,351 (3.4) | 0.36 |
| Familial history of hypertension | 34,574 (11.6) | 14,982 (11.7) | 0.23 |
| Familial history of diabetes | 27,424 (9.2) | 11,805 (9.2) | 0.68 |

Continuous variables are presented as mean ± standard deviation. Categorical variables are presented as numbers (percentages). BMI = body mass index; BP = blood pressure; DM = diabetes mellitus; AST = aspartate aminotransferase; ALT = alanine transaminase; GGT = gamma-glutamyl transferase.
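
The entries in Table 1 can be sanity-checked from the reported summary statistics alone. The following is a minimal sketch, not the authors' analysis: the table does not state which statistical tests produced the p-values, so the large-sample z-tests below are an assumption, and the function names are hypothetical. Because the inputs are the rounded values printed in the table, the resulting p-values should not be expected to match the published ones exactly.

```python
import math

# Set sizes as reported in the column headers of Table 1.
N_TRAIN = 299_088
N_TEST = 128_181


def percent(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


def means_z_pvalue(mean1, sd1, n1, mean2, sd2, n2):
    """Two-sided p-value for a difference in means.

    Assumption: a normal approximation with unequal variances, which is
    reasonable only because both groups are very large.
    """
    se = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    z = (mean1 - mean2) / se
    return math.erfc(abs(z) / math.sqrt(2))


def proportions_z_pvalue(x1, n1, x2, n2):
    """Two-sided p-value for a difference in proportions (pooled z-test).

    Assumption: the source may instead have used a chi-square test.
    """
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (x1 / n1 - x2 / n2) / se
    return math.erfc(abs(z) / math.sqrt(2))


if __name__ == "__main__":
    # Percentages recomputed from the counts for Sex (Female).
    print(percent(152_746, N_TRAIN), percent(65_677, N_TEST))

    # The three smoking categories should add up to each set size.
    print(187_333 + 37_987 + 73_768 == N_TRAIN)
    print(80_371 + 16_248 + 31_562 == N_TEST)

    # Approximate p-values for Age (continuous) and Sex (categorical).
    print(means_z_pvalue(46.05, 13.92, N_TRAIN, 46.04, 13.94, N_TEST))
    print(proportions_z_pvalue(152_746, N_TRAIN, 65_677, N_TEST))
```

The same `percent` check can be applied to any categorical row to confirm that a printed percentage agrees with its count and the set size.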