Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
ChatGPT as a tool for reviewing multiple-choice questions in the health sector
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 13 May 2026

ChatGPT as a tool for reviewing multiple-choice questions in the health sector

  • Tatiane Iembo1,
  • Helena Landim Gonçalves Cristóvão2,
  • Patrícia Carla Zanelatto Gonçalves3,
  • Wagner Ricardo Montor4,
  • Patrícia da Silva Fucuta1,
  • Toufic Anbar Neto1,
  • Júlio César André2 &
  • …
  • Milton de Arruda Martins5 

Scientific Reports (2026) Cite this article

  • 269 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Mathematics and computing
  • Medical research

Abstract

Artificial Intelligence (AI), particularly ChatGPT-4, offers promising applications in medical education, including multiple-choice question (MCQ) development. This study aimed to evaluate and compare the quality of 36 MCQs created by medical faculty with their versions reviewed by ChatGPT-4. A cross-sectional, quantitative approach was used. Ten external health education specialists and four study authors (internal evaluators) assessed the questions based on 38 criteria. While external evaluators found no statistically significant difference in criteria met between versions (p = 0.325), the study authors, who underwent standardization meetings, identified a statistically significant increase in the number of criteria met by ChatGPT-4-reviewed MCQs (p < 0.001). Descriptive statistics, Wilcoxon Signed-Rank Test, and Non-Metric Multidimensional Scaling were employed. The results showed that ChatGPT-4 demonstrated proficiency in modifying questions to reflect greater structural clarity and adherence to basic item-writing principles, resulting in questions with increased clarity and objectivity. However, it struggled to incorporate clinical reasoning and higher-order thinking when these were lacking, particularly given the non-optimized prompt used. Despite these limitations, AI’s revisions were aligned with faculty quality standards, demonstrating its potential to complement faculty efforts, emphasizing the critical role of calibrated human expertise and effective prompt engineering, rather than replacement.

Similar content being viewed by others

Evaluation of three artificial intelligence chatbots for generating clinical hematology multiple choice questions for medical students

Article Open access 20 January 2026

ChatGPT's performance before and after teaching in mass casualty incident triage

Article Open access 21 November 2023

Healthcare professionals and the public sentiment analysis of ChatGPT in clinical practice

Article Open access 07 January 2025

Acknowledgements

The authors would like to thank the professor Bruno Caramelli, PhD., from Unit of Interdisciplinary Medicine in Cardiology (InCor-FMUSP), for reviewing the English version of the article and for the suggestions provided.

Funding

This work was conducted without external funding. No grants, contracts, or other forms of financial support were received from government agencies, private foundations, or commercial entities for this research.

Author information

Authors and Affiliations

  1. Faculty of Medicine in São José Do Rio Preto (FACERES), São José Do Rio Preto, 15090-305, Brazil

    Tatiane Iembo, Patrícia da Silva Fucuta & Toufic Anbar Neto

  2. Center for Studies and Development of Health Education - Faculty of Medicine of São José Do Rio Preto (CEDES / FAMERP), São José Do Rio Preto, 15090-000, Brazil

    Helena Landim Gonçalves Cristóvão & Júlio César André

  3. Mackenzie Evangelical College of Paraná (MACKENZIE), Curitiba, 80730-000, Brazil

    Patrícia Carla Zanelatto Gonçalves

  4. Faculty of Medical Sciences of Santa Casa de São Paulo (FCMSCSP), São Paulo, 01224-001, Brazil

    Wagner Ricardo Montor

  5. Center for the Development of Medical Education (CEDEM), Faculty of Medicine of the University of São Paulo (FMUSP), São Paulo, 01246-903, Brazil

    Milton de Arruda Martins

Authors
  1. Tatiane Iembo
    View author publications

    Search author on:PubMed Google Scholar

  2. Helena Landim Gonçalves Cristóvão
    View author publications

    Search author on:PubMed Google Scholar

  3. Patrícia Carla Zanelatto Gonçalves
    View author publications

    Search author on:PubMed Google Scholar

  4. Wagner Ricardo Montor
    View author publications

    Search author on:PubMed Google Scholar

  5. Patrícia da Silva Fucuta
    View author publications

    Search author on:PubMed Google Scholar

  6. Toufic Anbar Neto
    View author publications

    Search author on:PubMed Google Scholar

  7. Júlio César André
    View author publications

    Search author on:PubMed Google Scholar

  8. Milton de Arruda Martins
    View author publications

    Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Tatiane Iembo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1. (download DOCX )

Supplementary Information 2. (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Iembo, T., Cristóvão, H.L.G., Gonçalves, P.C.Z. et al. ChatGPT as a tool for reviewing multiple-choice questions in the health sector. Sci Rep (2026). https://doi.org/10.1038/s41598-026-51988-9

Download citation

  • Received: 07 August 2024

  • Accepted: 30 April 2026

  • Published: 13 May 2026

  • DOI: https://doi.org/10.1038/s41598-026-51988-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Exam question
  • Educational technology
  • Educational assessment
  • Medical education
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics