Fig. 2
From: The pitfalls of multiple-choice questions in generative AI and medical education

Performance of LLMs with Masking. A graphical presentation of the performance of the studied models, when the prompt is masked in 25% increments, in free-response and multiple-choice formats.