Extended Data Fig. 1: Performance of Gradient Boosting Regression (GBR) models, related to Fig. 1. | Nature Aging

Extended Data Fig. 1: Performance of Gradient Boosting Regression (GBR) models, related to Fig. 1.

From: Aging is associated with a systemic length-associated transcriptome imbalance

Extended Data Fig. 1: Performance of Gradient Boosting Regression (GBR) models, related to Fig. 1.The alternative text for this image may have been generated using AI.

(a) We defined prediction accuracy (ρGBR) as the Spearman correlation between observed and predicted relative fold-changes. (b) Significance of the prediction accuracy using t-distribution for two-sided significance test for Spearman correlation66. (c) Density map of scatter plot of observed and predicted relative fold-changes obtained from the cerebella of 9-months old mice. There is clear correlation between the predicted and observed relative fold-changes, which we quantify through the GBR prediction accuracy (ρGBR). (d) Comparison of prediction accuracy following Monte Carlo cross-validation, the default cross-validation scheme of this manuscript, against predication accuracy following four-fold cross-validation for individual combinations of tissues and ages (black dots). In Monte Carlo cross-validation a given gene will on average be used nine times for developing the gradient boosting regression, and on average once for quantifying its performance. In four-fold cross-validation each gene will be considered exactly three times for developing the gradient bossing regression, and exactly once for quantifying its performance.

Back to article page