Extended Data Fig. 5: A more detailed analysis of multi-agent ablations from Fig. 3c, d.
From: Grandmaster level in StarCraft II using multi-agent reinforcement learning

Training based on prioritized fictitious self-play (PFSP) outperforms fictitious self-play (FSP) on every measure considered: it yields a stronger population as measured by relative population performance, a less exploitable solution, and better final agent performance against the corresponding league.
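
The difference between the two opponent-sampling schemes compared here can be illustrated with a minimal sketch. It assumes that FSP draws opponents uniformly from the league, and that PFSP weights each opponent by (1 - x)^p, where x is the learner's estimated win rate against that opponent, so that harder opponents are sampled more often. The function names, the default exponent p = 2, and the uniform fallback when every weight is zero are illustrative assumptions, not the authors' implementation.

```python
import random


def fsp_weights(win_rates):
    """Assumed FSP behaviour: every opponent in the league is equally likely."""
    n = len(win_rates)
    return [1.0 / n] * n


def pfsp_weights(win_rates, p=2.0):
    """Assumed PFSP behaviour: weight each opponent by (1 - win_rate) ** p.

    win_rates[i] is the learner's estimated probability of beating opponent i.
    The exponent p (hypothetical default) controls how sharply sampling
    concentrates on the opponents the learner loses to most often.
    """
    raw = [(1.0 - w) ** p for w in win_rates]
    total = sum(raw)
    if total == 0.0:
        # Learner beats everyone with certainty: fall back to uniform sampling
        # (an assumption made here to avoid dividing by zero).
        return fsp_weights(win_rates)
    return [r / total for r in raw]


def sample_opponent(weights, rng=random):
    """Draw one opponent index according to the given probabilities."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]
```

For example, with estimated win rates of 0.9, 0.5 and 0.1 against three league members, `fsp_weights` assigns each opponent the same probability, while `pfsp_weights` puts most of the mass on the third opponent, the one the learner currently loses to most often.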