Fig. 4: Random forest classification across populations.

Receiver operating characteristic (ROC) curves and classification performance metrics for one-vs-all random forest classifiers for a Kinshasa vs Masi-manimba vs Kahemba unaffected low prevalence zone (LPZ), and b Kinshasa vs Masi-manimba vs Kahemba unaffected high prevalence zone (HPZ), binary classifier for c unaffected individuals from HPZ vs unaffected individuals from LPZ, and those with konzo from HPZ vs konzo from LPZ, and d konzo vs unaffected individuals from LPZ and HPZ. All ROC curves and performance metrics are averaged over 10 repetitions of 10-fold cross-validation.