Table 3 Scores on each unit assessment and the final exam for all LLMs assessed

From: Results and implications for generative AI in a large introductory biomedical and health informatics course

Student ID

Unit 1

Unit 2

Unit 3

Unit 4

Unit 5

Unit 6

Unit 7

Unit 8

Unit 9

Unit 10

MCQ Average

Final Exam

MCQ+Final Combined

ChatGPT Plus

100

100

80

80

90

100

80

100

70

80

88

76

164

Claude 3 Opus

100

80

70

100

80

100

70

100

40

70

81

91

172

CoPilot Bing-Precise

100

90

80

100

90

100

100

100

70

50

88

85

173

Gemini Pro

100

90

70

90

90

100

90

80

60

80

85

91

176

Llama 3.1 405B

100

100

70

100

100

90

70

100

60

60

85

88

173

Mistral-Large

100

90

80

90

90

80

80

80

60

80

83

82

165