Fig. 3: Differences in the parameters estimated by the reinforcement learning drift diffusion model (RLDDM).

A In the placebo group, observation relative to privacy decreased the prosocial learning rate for negative PE (i.e., the difference between αnegPE in the other condition and αnegPE in the self condition). Testosterone administration reversed this observation effect. The results suggest that a lower learning rate for negative PE is better suited to good performance in the task. B In the placebo group, observation relative to privacy increased the consistency of prosocial choices. Testosterone administration reversed this audience effect. C In the placebo group, observation relative to privacy increased the DDM threshold for prosocial choices. Testosterone administration reversed this audience effect. D The inverse temperature parameter tau, which captures choice consistency, significantly predicted prosociality. Dots represent the data of individual participants, lines represent mean values per group, and boxes represent 95% confidence intervals.
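
To make the roles of the three parameter types in this caption concrete, the following is a minimal sketch of how they could enter a generic RLDDM. It assumes a common formulation that the caption itself does not confirm: a Rescorla-Wagner update with separate learning rates for positive and negative prediction errors (PE), a drift rate equal to tau times the value difference, and an unbiased diffusion process with unit noise. The function and argument names (`update_value`, `p_choose_first`, `alpha_pos`, `alpha_neg`, `threshold`) are hypothetical and are not taken from the study's code.

```python
import math


def update_value(q, reward, alpha_pos, alpha_neg):
    """One value update with a learning rate that depends on the PE sign.

    Assumption: a Rescorla-Wagner rule; alpha_neg plays the role of the
    learning rate for negative PE described in panel A.
    """
    pe = reward - q  # prediction error
    alpha = alpha_pos if pe >= 0 else alpha_neg
    return q + alpha * pe


def p_choose_first(q_first, q_second, tau, threshold):
    """Probability that the first option is chosen.

    Assumption: drift = tau * (q_first - q_second), starting point midway
    between the boundaries, unit diffusion noise. Under these assumptions
    the choice probability is logistic in threshold * drift.
    """
    drift = tau * (q_first - q_second)
    return 1.0 / (1.0 + math.exp(-threshold * drift))


# Illustrative values only, not estimates from the study.
q = 0.5
q_after_loss = update_value(q, reward=0.0, alpha_pos=0.3, alpha_neg=0.1)
p_low_tau = p_choose_first(0.7, 0.4, tau=1.0, threshold=1.5)
p_high_tau = p_choose_first(0.7, 0.4, tau=3.0, threshold=1.5)
```

Under these assumptions, a smaller `alpha_neg` means that a single unrewarded trial lowers the learned value less, while a larger `tau` or `threshold` pushes the choice probability further from 0.5 for the same value difference, which corresponds to more consistent choices.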