Fig. 8: Experiment 2 model recoverability.
From: Reinforcement learning increasingly relates to memory specificity from childhood to adulthood

For each model within each stage of model comparison, 100 simulated experiments were conducted in which choice data were simulated from 73 agents, with parameters sampled from uniform distributions with ranges determined by the empirical fits. Data from each simulated experiment were then fit with each model within the comparison set. The top panels show confusion matrices, where the values within each tile represent the proportion of experiments for which each fitted model had the highest exceedance probability (top panels). The bottom panels show inversion matrices, where the values within each tile represent the proportion of experiments for which the fitted model had the highest exceedance probability that were generated by each of the models. Black lines outline the model that best fit the empirical data within each comparison stage. A Models with different numbers of choice weights were highly distinguishable from one another. B Models in which exemplar and category values were initialized with either one or two free parameters were moderately distinguishable from one another. C Models with a single learning rate, separate learning rates for experienced and inferred counterfactual outcomes, and no counterfactual learning were highly distinguishable from one another.