Table 1 Large language models participating in the test.

From: Arch-Eval benchmark for assessing chinese architectural domain knowledge in large language models

Creator

LLM

Parameter

Deployment Methods

OpenAI

GPT-3.5-turbo

175B

Online

OpenAI

GPT-4-turbo

undisclosed

Online

Baichuan

Baichuan-13B-Chat

13B

Local

Baichuan

Baichuan2-13B-Chat

13B

Local

Baichuan

Baichuan2-7B-Chat

7B

Local

Tsinghua & Zhipu AI

Chatglm2-6B

6B

Local

Tsinghua & Zhipu AI

Chatglm3-6B

6B

Local

Tsinghua & Zhipu AI

Chatglm3-6B-32k

6B

Local

Shanghai AI Laboratory

Internlm-Chat-7B

7B

Local

Meta

Llama2-Chinese-13B

13B

Local

Alibaba Cloud

Qwen1.5-14B-Chat

14B

Local

Alibaba Cloud

Qwen1.5-7B-Chat

7B

Local

Alibaba Cloud

Qwen-14B-Chat

14B

Local

Alibaba Cloud

Qwen-7B-Chat

7B

Local