Fig. 4: Predicting neutralization capacity as a function of binding activity.

a SARS-CoV-2-positive donors (N = 467) were stratified by their neutralization activity against Wuhan-Hu-1 into high neutralizers (NT50 > 250, N = 332; blue) and no/low neutralizers (NT50 < 250, N = 135; gray). b, c Comparison of the predictive ability of six classification models across 100 cross-validation sets (each split 80% for training and 20% for validation). b Comparison of models by area under the curve (AUC). Each dot corresponds to one cross-validation set. c Bayesian information criterion (BIC) of the five models based on logistic regression. The models are as follows. Univariable logistic regression (ULR): ULR-RBD, mean of MFI-FOE RBD; ULR-S1, mean of MFI-FOE S1. Multivariable logistic regression (MLR): MLR-S1, RBD, mean of S1 reactivity and mean of RBD reactivity; MLR-PCA2 and MLR-PCA4, MLR on the first 2 and first 4 principal components, respectively, of a PCA based on all 12 SARS-CoV-2 antibody reactivities measured by ABCORA 2.0. Random forest (RF): includes all antibody reactivities measured by ABCORA 2.0. Boxplots show the median (middle line), upper and lower quartiles (box limits), 1.5x interquartile range (whiskers) and outliers (points). d ROC curve estimated by ULR-S1 on the full data set (N = 467). e Measured NT50 value versus probability of NT50 > 250 as predicted by ULR-S1 in five randomly chosen validation sets (each symbol corresponds to a validation set). Purple symbols indicate samples with a predicted probability above 0.70 of neutralizing at NT50 > 250, which are therefore classified as high neutralizers. Gray symbols indicate samples predicted to have NT50 < 250, which are therefore classified as no/low neutralizers. f Neutralization prediction based on a modified ULR-S1 model that uses the diagnostic readout SOC instead of MFI-FOE values as input. Measured NT50 values are plotted against the sum of S1 SOC values (IgG, IgA, IgM).
Dashed lines mark NT50 = 250 (horizontal) and a sum of S1 SOC values of 9.7 (vertical). The sum of S1 SOC values of 9.7 corresponds to the thresholds depicted for ULR-S1 in (d, e). The gray shaded area corresponds to true positives (individuals with NT50 > 250 predicted as high neutralizers).
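
The quadrant reading of panel f can be illustrated with a minimal sketch. It assumes each sample is a pair of measured NT50 and summed S1 SOC, and that values exactly at a cutoff count as no/low, which the caption does not specify. The function names and the example values are hypothetical and are not data from the study.

```python
from collections import Counter

NT50_CUTOFF = 250.0      # from the caption: high neutralizers have NT50 > 250
S1_SOC_SUM_CUTOFF = 9.7  # from the caption: vertical dashed line in panel f


def quadrant(nt50, s1_soc_sum):
    """Return 'TP', 'FP', 'FN' or 'TN' for one sample (hypothetical helper)."""
    # Assumption: a value exactly on a cutoff is treated as no/low.
    measured_high = nt50 > NT50_CUTOFF
    predicted_high = s1_soc_sum > S1_SOC_SUM_CUTOFF
    if predicted_high:
        return "TP" if measured_high else "FP"
    return "FN" if measured_high else "TN"


def summarize(samples):
    """Count quadrants and derive sensitivity and specificity."""
    counts = Counter(quadrant(nt50, soc) for nt50, soc in samples)
    tp, fp, fn, tn = (counts[k] for k in ("TP", "FP", "FN", "TN"))
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return counts, sensitivity, specificity


# Made-up illustrative values, NOT data from the study: (NT50, sum of S1 SOC).
example = [(1200.0, 25.3), (400.0, 8.1), (90.0, 3.2), (150.0, 12.4), (800.0, 14.0)]
counts, sens, spec = summarize(example)
print(dict(counts), round(sens, 3), round(spec, 3))
```

In this sketch the "TP" count corresponds to the gray shaded area of panel f, i.e. samples above both dashed lines.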