Fig. 2: Flowchart of the applied machine learning procedure.

The steps marked with double lines are performed on the all-patients all-features dataset only, and applied to the others. Circular arrow: iterations for undersampling (5 seeds).

The steps marked with double lines are performed on the all-patients all-features dataset only, and applied to the others. Circular arrow: iterations for undersampling (5 seeds).