Table 2 Model analysis results scoring.

From: Evaluating simulated teaching audio for teacher trainees using RAG and local LLMs

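| Subject | Content accuracy | Depth and detail | Practicality and innovation of suggestions | Logicality and organization | Comprehensive assessment and critical thinking | Language expression and use of professional terminology | Expert |
|---|---|---|---|---|---|---|---|
| Middle school geography | 4 | 3 | 4 | 5 | 3 | 4 | Expert 1 |
| | 4 | 4 | 3 | 5 | 4 | 4 | Expert 2 |
| | 4 | 3 | 3 | 5 | 3 | 4 | Expert 3 |
| Middle school art | 4 | 4 | 3 | 5 | 4 | 5 | Expert 1 |
| | 4 | 3 | 3 | 4 | 4 | 5 | Expert 2 |
| | 3 | 4 | 3 | 4 | 4 | 4 | Expert 3 |
| Middle school math | 5 | 4 | 3 | 4 | 4 | 5 | Expert 1 |
| | 4 | 3 | 3 | 5 | 4 | 5 | Expert 2 |
| | 4 | 3 | 3 | 4 | 4 | 4 | Expert 3 |
| Middle school PE | 5 | 4 | 3 | 5 | 4 | 4 | Expert 1 |
| | 4 | 3 | 3 | 4 | 4 | 4 | Expert 2 |
| | 4 | 4 | 3 | 4 | 4 | 4 | Expert 3 |
| High school biology | 4 | 4 | 3 | 5 | 4 | 4 | Expert 1 |
| | 4 | 3 | 3 | 4 | 3 | 5 | Expert 2 |
| | 4 | 3 | 3 | 4 | 3 | 4 | Expert 3 |
| High school music | 5 | 4 | 4 | 5 | 4 | 5 | Expert 1 |
| | 4 | 3 | 3 | 4 | 3 | 4 | Expert 2 |
| | 3 | 4 | 4 | 4 | 3 | 4 | Expert 3 |
| Elementary school math | 5 | 4 | 4 | 5 | 4 | 5 | Expert 1 |
| | 4 | 3 | 3 | 4 | 3 | 4 | Expert 2 |
| | 4 | 4 | 3 | 4 | 3 | 4 | Expert 3 |
| Elementary school mental health | 4 | 4 | 3 | 5 | 4 | 4 | Expert 1 |
| | 5 | 4 | 4 | 5 | 4 | 5 | Expert 2 |
| | 4 | 4 | 3 | 4 | 4 | 4 | Expert 3 |
| Elementary school English | 5 | 4 | 4 | 5 | 4 | 4 | Expert 1 |
| | 5 | 4 | 4 | 5 | 4 | 5 | Expert 2 |
| | 4 | 4 | 4 | 4 | 4 | 4 | Expert 3 |
| Early childhood education | 4 | 4 | 3 | 5 | 4 | 5 | Expert 1 |
| | 5 | 5 | 4 | 5 | 5 | 5 | Expert 2 |
| | 4 | 4 | 3 | 4 | 4 | 4 | Expert 3 |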
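
One simple way to read the three expert ratings for a subject together is to average them per criterion. The sketch below does this for the Middle school geography scores copied from Table 2. The averaging step and the names `CRITERIA`, `geography_scores`, and `mean_per_criterion` are illustrative assumptions, not an aggregation method stated in the source.

```python
from statistics import mean

# Criterion names in the column order used by Table 2.
CRITERIA = [
    "Content accuracy",
    "Depth and detail",
    "Practicality and innovation of suggestions",
    "Logicality and organization",
    "Comprehensive assessment and critical thinking",
    "Language expression and use of professional terminology",
]

# Scores copied from the Middle school geography rows of Table 2.
geography_scores = {
    "Expert 1": [4, 3, 4, 5, 3, 4],
    "Expert 2": [4, 4, 3, 5, 4, 4],
    "Expert 3": [4, 3, 3, 5, 3, 4],
}


def mean_per_criterion(scores_by_expert):
    """Average the experts' scores for each criterion (hypothetical helper)."""
    # zip(*rows) turns per-expert rows into per-criterion columns.
    columns = zip(*scores_by_expert.values())
    return {name: mean(col) for name, col in zip(CRITERIA, columns)}


geography_means = mean_per_criterion(geography_scores)
for name, value in geography_means.items():
    print(f"{name}: {value:.2f}")
```

The same helper can be applied to any other subject by passing that subject's three rows in the same column order.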