Fig. 6: Model selection based on sperm motility landscape configurations.

a Signal-to noise ratio (S2NR) as an heuristic of information gain, along the process of merging. The information generally increases along the merging process. However, there were some drops, that corresponded with significant changes in the hierarchical organisation of the landscape. b Leave-one out method for cross-validation, used to assess model performance (loo R package, "loo" and "loo_compare" functions, see details in Vehtari et al., 201779). The ELPD corresponds to the expected log pointwise predictive density, as a measure of predictive capability of the model. Models of different landscape configurations (merging scales) were compared, 11 clusters being the best partition for model performance (ELPD = 0). c Motility landscape with 11 merged clusters that encompass the variability of the whole landscape. This represented both a good compromise between information and interpretability, and a good model performance.