Fig. 3: Results of the linear discriminant analysis (LDA) with random-split samples.

A Mean values of training and testing accuracy (error bars indicate standard deviations), computed on random samples with 75%, 50%, and 25% of participants of the original sample assigned to the training subset and the remaining to the testing subset (50 iterations performed using the same method). The general performance of the classification function remains high and stable across different training-testing partitions. B Conceptual representation of a replication with 50% of participants randomly assigned to the training subset and the other 50% to the testing subset (training accuracy: 100%; testing accuracy: 97%). The outcome of this single replication shows that in the testing subset only one participant from Cluster 2 is misclassified by the model.