Figure 4

Model performance. (a) Distribution of the accuracy, AUPRC, AUROC, and F1 score of the model. EfficientNetB4 models were trained during six-fold cross-validation per group. The 10th, 50th (median), and 90th quantiles, as well as minimum and maximum, are shown. A paired t-test with Bonferroni correction for multiple comparisons. *P < 0.05, **P < 0.01, ***P < 0.001 compared with setting C. (b) Precision–recall and receiver operating characteristics curves of settings A, B, and C. The mean of the six-fold cross-validation is shown. AUPRC: the area under the precision-recall curve; AUROC: the area under the receiver operating characteristics.