Table 1 Variants of ComBat and their recommended use cases.

From: Challenges and best practices when using ComBAT to harmonize diffusion MRI data

Method name

Short description

When to use

Longitudinal

ComBat26

Extends ComBat to longitudinal data using linear mixed-effects models to preserve within-subject changes over time.

Longitudinal MRI studies with repeated measures per subject (e.g., disease progression, development), where standard ComBat would remove true within-subject changes.

ComBat-GAM29

Incorporates generalized additive models to capture non-linear covariate effects (e.g., age trajectories).

Large multi-site datasets with wide covariate ranges (e.g., lifespan studies) where covariate effects are non-linear and need preservation.

AutoComBat27

Automates ComBat by inferring batch labels from data (metadata, quality metrics) when they are unknown or incomplete.

When site/scanner labels are unavailable, inconsistent, or when harmonizing unseen batches without predefined batch IDs.

CovBat14

Extends ComBat to harmonize not just mean and variance but also covariance between features.

Multivariate MRI features (e.g., radiomics, connectivity) where site differences in feature correlations could harm downstream ML models.

GMM-ComBat16

Uses Gaussian mixture models to split heterogeneous/multi-modal feature distributions before applying ComBat.

Sites with multi-modal feature distributions (e.g., mixed protocols, subpopulations) where a single Gaussian assumption is violated.

Cluster-ComBat30

(Federated ComBat)

Groups sites into clusters and harmonizes in a distributed way, allowing new sites to be integrated without retraining.

Federated/multi-center setups with privacy constraints or frequent addition of new sites, where central pooling is not possible.