Fig. 11
From: Arch-Eval benchmark for assessing Chinese architectural domain knowledge in large language models

T-Test results for AO and COT.