Extended Data Fig. 8: Cross-Validation of EvoAI model training.
From: EvoAI enables extreme compression and reconstruction of the protein sequence space

(a) Schematic diagram of the data set in 10-fold cross-validation. (b) CV-test spearman correlation coefficient of different layers during cross-validation. Curves are shown as the mean of 10 groups. The shadow shows the 95% confidence interval of Spearman correlation values among training process. (c) Influence of layer number of MLP on 10-fold cross-validation of model training. 2-layer MLP shows the best performance with higher spearman correlation coefficient compared to 1-layer MLP and smaller variance compared to 3-layer MLP. The centre line represented the median value, while the box contained a quarter to three quarters of the dataset. The minima and maxima were also shown by the whiskers.