Fig. 9
From: Arch-Eval benchmark for assessing chinese architectural domain knowledge in large language models

Accuracy test results of outputs from different LLMs.
From: Arch-Eval benchmark for assessing chinese architectural domain knowledge in large language models
Accuracy test results of outputs from different LLMs.