ChatGPT as a tool for reviewing multiple-choice questions in the health sector

Iembo, Tatiane; Cristóvão, Helena Landim Gonçalves; Gonçalves, Patrícia Carla Zanelatto; Montor, Wagner Ricardo; da Silva Fucuta, Patrícia; Neto, Toufic Anbar; André, Júlio César; de Arruda Martins, Milton

doi:10.1038/s41598-026-51988-9

Download PDF

Article
Open access
Published: 13 May 2026

ChatGPT as a tool for reviewing multiple-choice questions in the health sector

Tatiane Iembo¹,
Helena Landim Gonçalves Cristóvão²,
Patrícia Carla Zanelatto Gonçalves³,
Wagner Ricardo Montor⁴,
Patrícia da Silva Fucuta¹,
Toufic Anbar Neto¹,
Júlio César André² &
…
Milton de Arruda Martins⁵

Scientific Reports (2026) Cite this article

269 Accesses
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Artificial Intelligence (AI), particularly ChatGPT-4, offers promising applications in medical education, including multiple-choice question (MCQ) development. This study aimed to evaluate and compare the quality of 36 MCQs created by medical faculty with their versions reviewed by ChatGPT-4. A cross-sectional, quantitative approach was used. Ten external health education specialists and four study authors (internal evaluators) assessed the questions based on 38 criteria. While external evaluators found no statistically significant difference in criteria met between versions (p = 0.325), the study authors, who underwent standardization meetings, identified a statistically significant increase in the number of criteria met by ChatGPT-4-reviewed MCQs (p < 0.001). Descriptive statistics, Wilcoxon Signed-Rank Test, and Non-Metric Multidimensional Scaling were employed. The results showed that ChatGPT-4 demonstrated proficiency in modifying questions to reflect greater structural clarity and adherence to basic item-writing principles, resulting in questions with increased clarity and objectivity. However, it struggled to incorporate clinical reasoning and higher-order thinking when these were lacking, particularly given the non-optimized prompt used. Despite these limitations, AI’s revisions were aligned with faculty quality standards, demonstrating its potential to complement faculty efforts, emphasizing the critical role of calibrated human expertise and effective prompt engineering, rather than replacement.

Evaluation of three artificial intelligence chatbots for generating clinical hematology multiple choice questions for medical students

Article Open access 20 January 2026

ChatGPT's performance before and after teaching in mass casualty incident triage

Article Open access 21 November 2023

Healthcare professionals and the public sentiment analysis of ChatGPT in clinical practice

Article Open access 07 January 2025

Acknowledgements

The authors would like to thank the professor Bruno Caramelli, PhD., from Unit of Interdisciplinary Medicine in Cardiology (InCor-FMUSP), for reviewing the English version of the article and for the suggestions provided.

Funding

This work was conducted without external funding. No grants, contracts, or other forms of financial support were received from government agencies, private foundations, or commercial entities for this research.

Author information

Authors and Affiliations

Faculty of Medicine in São José Do Rio Preto (FACERES), São José Do Rio Preto, 15090-305, Brazil
Tatiane Iembo, Patrícia da Silva Fucuta & Toufic Anbar Neto
Center for Studies and Development of Health Education - Faculty of Medicine of São José Do Rio Preto (CEDES / FAMERP), São José Do Rio Preto, 15090-000, Brazil
Helena Landim Gonçalves Cristóvão & Júlio César André
Mackenzie Evangelical College of Paraná (MACKENZIE), Curitiba, 80730-000, Brazil
Patrícia Carla Zanelatto Gonçalves
Faculty of Medical Sciences of Santa Casa de São Paulo (FCMSCSP), São Paulo, 01224-001, Brazil
Wagner Ricardo Montor
Center for the Development of Medical Education (CEDEM), Faculty of Medicine of the University of São Paulo (FMUSP), São Paulo, 01246-903, Brazil
Milton de Arruda Martins

Authors

Tatiane Iembo
View author publications
Search author on:PubMed Google Scholar
Helena Landim Gonçalves Cristóvão
View author publications
Search author on:PubMed Google Scholar
Patrícia Carla Zanelatto Gonçalves
View author publications
Search author on:PubMed Google Scholar
Wagner Ricardo Montor
View author publications
Search author on:PubMed Google Scholar
Patrícia da Silva Fucuta
View author publications
Search author on:PubMed Google Scholar
Toufic Anbar Neto
View author publications
Search author on:PubMed Google Scholar
Júlio César André
View author publications
Search author on:PubMed Google Scholar
Milton de Arruda Martins
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Tatiane Iembo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1. (download DOCX )

Supplementary Information 2. (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Iembo, T., Cristóvão, H.L.G., Gonçalves, P.C.Z. et al. ChatGPT as a tool for reviewing multiple-choice questions in the health sector. Sci Rep (2026). https://doi.org/10.1038/s41598-026-51988-9

Download citation

Received: 07 August 2024
Accepted: 30 April 2026
Published: 13 May 2026
DOI: https://doi.org/10.1038/s41598-026-51988-9