Fig. 6: Characteristics and Performances on Stage Prediction Task.
From: A foundational model for in vitro fertilization trained on 18 million time-lapse images

Asterisks represent statistical significance with p < 0.05, where statistical significance is determined by performing a one-way ANOVA test followed by a Tukey HSD test. a Representative image sample for each of the 12 stages of development was collected from 12 different embryos. b Performances of FEMI and competitor models across multiclass classification metrics. Performances are aggregated across 4 replicates (four-fold cross validation) (n = 4). Top 1 Accuracy, p = 0.54; Top 2 Accuracy, p = 1.5e-54; Quadratic Weighted Kappa (QWK), p = 1.8e-58; Spearman Rank Correlation, p = 3.7e-56. Error bars show mean values +/- SEM. c Confusion matrix showing performance of FEMI across development stages. Source data are provided as a Source Data file.