Fig. 4

Confusion matrices referring to the validation dataset are reported for all the models and target variables considered. In particular, sub-figures (a) to (c) show the results of a classical RF model for classification. In (d)–(f), however, a balanced RF algorithm has been employed. Finally, in sub-figures (g)–(i), the augmented dataset and the RF+SMOTE algorithm have been used.