Fig. 4
From: Testing the limits of large language models in debating humans

Post-game and in-game productivity results. (A) Kernel Density Estimations (KDEs) of the number of conversations per player grouped by the type of game they played in. (B) KDEs of the number of messages each player sent within each game, grouped by their type. (C) KDEs of the reward point distributions gained by all players at the end of each game, grouped by their type. (D) KDEs of the on-topic keyword frequency of each player, agent and human, across all game types. (E) KDEs of the on-topic keyword frequency of each human in AH game versus humans in HH games.