Extended Data Fig. 2: Improved antimicrobial resistance prediction based on MALDI-TOF mass spectra combining all species compared to species information alone.

AUROC values of logistic regression classifiers trained on data combining all samples with labels available for each antimicrobial prediction task in DRIAMS-A. The blue bars depict predictive performance using spectral data as features. The red bars show the predictive performance when using species label information only. The fractions of resistant/intermediate samples in the training data are indicated in brackets after the antibiotic name. Reported metrics and error bars are the mean and standard deviation of 10 repetitions with different random train–test-splits. The asterisks indicate a statistically significant difference between the reported metrics between all species and species information alone of a two-sided Welch’s t-test (not assuming equal population variance) and a significance level of <0.05.