Fig. 5: Metrics on the AgingMice dataset.

A Valid and test MCC scores for all methods benchmarked. Higher is better. The conditions compared are mice with high-fat diet and chow diet. B Batch mixing metrics: normalized Batch Entropy (nBE), Adjusted Rand Index (ARI), Adjusted Mutual Information (AMI). Smaller is better. MCC is compared to C nBE and D ARI. Error bars represent standard deviations around the means. All error bars are derived from the results of 5-fold cross-validation (n = 5). The BERNN models are underlined. Source data are provided as a Source Data file.