Table 1 Methods from statistical process control (SPC) and their application to monitoring ML algorithms.

From: Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare

Method(s)

What the method(s) detect and assumptions

Example uses

CUSUM, EWMA

Detects a shift in the mean of a single variable, given shift size. Assumes the pre-shift mean and variance are known. Extensions can monitor changes in the variance.

• Monitoring changes in individual input variables

  

• Monitoring changes in real-valued performance metrics (e.g. monitoring the prediction error)

MCUSUM, MEWMA, Hotelling’s T2

Monitor changes in the relationship between multiple variables

• Monitoring changes in the relationship between input variables

Generalized likelihood ratio test (GLRT), Online change point detection

Detects if a change occurred in a data distribution and when. Can be applied if characteristics of the pre- and/or post-shift distributions are unknown. GLRT methods typically make parametric assumptions. Parametric and nonparametric variants exist for online change point detection methods.

• Detecting distributional shifts for individual or multiple input variables

  

• Detecting shifts in the conditional distribution of outcome Y given input variables

  

• Determining whether parametric model recalibration/revision is needed

Generalized fluctuation monitoring

Monitor changes to the residuals or gradient

• Detect when the average gradient of the training loss for a differentiable ML algorithm (e.g. neural network) differs from zero