Table 3 Comparison of the effectiveness of three prompt methods with ChatGPT for USMLE Step 1 samples, clinical non-calculation questions from GPT-4, and calculation questions from GPT-4 (Chi-square statistic was used to calculate P value).

	ChatGPT direct prompt	ChatGPT CoT prompt	ChatGPT Modified CoT prompt	P value
USMLE step 1 sample	54/95 (61.7%)	59/95 (62.8%)	58/95 (57.4%)	0.734
GPT-4 clinical questions	270/500 (54.0%)	274/500 (54.8%)	257/500 (51.4%)	0.530
GPT-4 calculation questions	397/500 (79.4%)	398/500 (79.6%)	386/500 (77.2%)	0.589

Quick links

Search