Fig. 2: Features ranking and neural networks performance.
From: Accelerated functional brain aging in pre-clinical familial Alzheimer’s disease

a Scatter plots of SVM model weights (y-axis) and ensemble tree feature importance (x-axis). Model weights are absolute value, and normalized such that 1 indicates highest importance. Numbers next to data points indicate their rank (i.e., 1 = highest average rank between both SVM and ensemble models; orange dots correspond to the top 10 features, blue dots represent lower-ranked features). b Root mean square error of different neural network models with inputs sorted according to rank for the training set (left), and the validation set (middle). Values were averaged over 3 iterations of the models. Neural networks trained with randomly-ranked inputs served as our null models (right). The x-axis indicates the number of inputs into the model (number of graph metrics) while the y-axis indicates the network architecture. For example, 5 means 1 hidden layer with 5 units, 5 2 means 2 hidden layers, the first one with 5 units and the second with 2 units. Darker (blue) colors indicate higher accuracy, while lighter (yellow) colors indicate lower accuracy. The red square identifying the model that provides the better generalizability in the validation set (lowest rmse) contains 2 hidden layers of 5 and 2 units, and uses the 10 highest-ranked graph metrics as input. The same neural network trained on randomly-ranked inputs (null model, gray square) provides lower accuracy. c Brain age model performance across datasets. Correlations between chronological age (x-axis) and age predicted by the neural network (y-axis) are represented for the training (n = 773), validation (n = 46) and test (n = 521) sets. Statistical values (c) were obtained from Pearson’s correlations (two-sided test, with no adjustment). Source data are provided as a Source data file. SVM: support vector machine, rmse: root mean square error, mae: mean absolute error.