Table 1 Factor distribution differences between training and test datasets.
From: Prediction of Intracranial Aneurysm Risk using Machine Learning
Factors | Training set (n = 299,088) | Test set (n = 128,181) | p-value |
---|---|---|---|
Age | 46.05 ± 13.92 | 46.04 ± 13.94 | 0.93 |
Sex (Female) | 152,746 (51.1) | 65,677 (51.2) | 0.32 |
BMI | 23.55 ± 3.28 | 23.55 ± 3.29 | 0.87 |
Waist circumference | 79.41 ± 9.29 | 79.41 ± 9.31 | 0.98 |
Hypertension | 39,341 (13.2) | 16,248 (13.0) | 0.23 |
Systolic BP (mmHg) | 121.17 ± 14.66 | 121.18 ± 14.68 | 0.92 |
Diastolic BP (mmHg) | 75.54 ± 9.91 | 75.56 ± 9.93 | 0.58 |
DM | 13,686 (4.6) | 5,754 (4.5) | 0.21 |
Glucose (mg/dl) | 95.1 ± 16.19 | 75.56 ± 9.93 | 0.61 |
Total cholesterol (mg/dl) | 193.64 ± 35.83 | 193.67 ± 35.88 | 0.81 |
LDL cholesterol (mg/dl) | 55.81 ± 13.79 | 55.85 ± 13.81 | 0.33 |
HDL cholesterol (mg/dl) | 113.21 ± 32.79 | 113.24 ± 32.85 | 0.79 |
Triglyceride (mg/dl) | 122.4 ± 72.9 | 122.28 ± 72.61 | 0.62 |
Hemoglobin (g/dl) | 13.87 ± 1.61 | 13.86 ± 1.61 | 0.52 |
Creatinine (mg/dl) | 0.89 ± 0.22 | 0.89 ± 0.22 | 0.67 |
AST (IU/L) | 23.75 ± 8.74 | 23.74 ± 8.72 | 0.75 |
ALT (IU/L) | 22.96 ± 14.09 | 22.97 ± 14.11 | 0.73 |
GGT (IU/L) | 31.36 ± 28.9 | 31.31 ± 28.85 | 0.59 |
Smoking | |||
   Never | 187,333 (62.6) | 80,371 (62.7) | 0.68 |
   Ex | 37,987 (12.7) | 16,248 (12.7) | 0.83 |
   Current | 73,768 (24.7) | 31,562 (24.6) | 0.78 |
Familial history of stroke | 16,590 (5.5) | 7,098 (5.5) | 0.91 |
Familial history of heart disease | 10,320 (3.5) | 4,351 (3.4) | 0.36 |
Familial history of hypertension | 34,574 (1.6) | 14,982 (1.7) | 0.23 |
Familial history of diabetes | 27,424 (9.2) | 11,805 (9.2) | 0.68 |