Extended Data Fig. 5: Evaluation of DeepSPT and two AnDi challenge models on AnDi challenge task 3.

Confusion matrix for all individual time point predictions within the 20000 2D (a, c, e) and 3D (b, d) test set trajectories simulated using the 2021 AnDi challenge task 3 open-source framework totalling 4 million predictions. See Muñoz-gil et al.28 for further test set specification. Diagonal entries are correct predictions and off-diagonal indicates confused classes. Each entry reports the percentage of predictions normalized to the actual number of true labels in the given class. a, Confusion matrix for DeepSPT on 2D trajectories. b, Confusion matrix for DeepSPT on 3D trajectories. c, Confusion matrix for Method E on 2D trajectories. d, Confusion matrix for Method E on 3D trajectories. e, Confusion matrix for Method J on 2D trajectories.