Figure 2

Model performance. (A), Solid lines and shades represent receiver operating characteristics curves and its 95% confidence intervals. An asterisk (*) indicates significant difference (P < 0.005) in comparison with logistic regression. (B), Solid lines and shades represent precision-recall curves and its 95% confidence intervals. Only the confidence intervals of the baseline model (logistic regression, “LogReg”) are represented with polka dot pattern in both plots. (C), Detailed performance analysis for the best model (LightGBM) in different discrimination thresholds. Solid lines and shades represent mean values and 95% confidence intervals in each variable. Abbreviations: AUC, area under the curve; CI, confidence interval; LogReg, Logistic regression; SVM, Support vector machine; XGBoost, Extreme gradient boosting; LightGBM, light gradient boosting machine; MLP, Multilayer perceptron.