Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

npj Digital Medicine
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. npj digital medicine
  3. articles
  4. article
Vision-Enabled AI scribes reduce omissions in clinical conversations: evidence from simulated medication histories
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 26 February 2026

Vision-Enabled AI scribes reduce omissions in clinical conversations: evidence from simulated medication histories

  • Bradley D. Menz  ORCID: orcid.org/0000-0002-0855-50811,2,
  • Nicholas L. Scarfo2,
  • Natansh D. Modi3,
  • Erik Cornelisse1,2,
  • Lee X. Li1,
  • Jin Quan Eugene Tan1,2,
  • Jimit Gandhi3,4,
  • Dorsa Maher1,
  • Dib Kousa2,
  • Kezia Daniel4,
  • Vidya Menon2,
  • Stephen Bacchi5,
  • Ross A. McKinnon1,
  • Michael D. Wiese3,
  • Andrew Rowland1,
  • Michael J. Sorich1 &
  • …
  • Ashley M. Hopkins1 

npj Digital Medicine , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Health care
  • Mathematics and computing
  • Medical research

Abstract

Most ambient AI medical scribes process audio only, omitting clinically important visual details. We developed a vision-enabled AI scribe using Google’s Gemini model and Ray-Ban Meta smart glasses to document medication histories—a task requiring both audio and visual input. Ten clinical pharmacists video-recorded 110 simulated medication history interviews. Following iterative prompt engineering on 10 training recordings, the scribe was evaluated on 100 test recordings (2160 data points) across patient details and medication-specific fields. The vision-enabled scribe achieved 98% overall accuracy (2114/2,160 data points), ranging from 96% for patient details to 99% for dosing directions and indication. Video input significantly outperformed audio-only processing (98% vs 81%, P < 0.001), primarily through reduced omissions (10 vs 358 errors). Vision-enabled AI scribes substantially improved documentation accuracy for tasks requiring visual input, demonstrating potential to markedly reduce omission errors in clinical documentation.

Data availability

All data, including the patient and medication details used in the simulated cases and the corresponding AI outputs, are available at https://zenodo.org/records/17032178.

Code availability

The Python code used to create the vision enabled AI scribe is available at [Vision-Enabled-AI-Scribe] ([https://github.com/MenzBD/Vision-Enabled-AI-Scribe/tree/main]). The R scripts used to generate the manuscript and supplementary tables and figures are available at [Vision-Enabled-AI-Scribe] ([https://github.com/MenzBD/Vision-Enabled-AI-Scribe/tree/main]). The data file is restricted to public access, and is available to the reviewers or editor(s) while under peer review.

References

  1. Tierney, A. A. et al. Ambient artificial intelligence scribes to alleviate the burden of clinical documentation. NEJM Catalyst 5, CAT.23.0404 (2024).

  2. Sorich, M. J., Mangoni, A. A., Bacchi, S., Menz, B. D. & Hopkins, A. M. The triage and diagnostic accuracy of frontier large language models: updated comparison to physician performance. J. Med. Internet Res. 26, e67409 (2024).

    Google Scholar 

  3. Menz, B. D. et al. Generative AI chatbots for reliable cancer information: evaluating web-search, multilingual, and reference capabilities of emerging large language models. Eur. J. Cancer 218, 115274 (2025).

    Google Scholar 

  4. Shahnam, A. et al. Application of generative artificial intelligence for physician and patient oncology letters—AI-OncLetters. JCO Clin. Cancer Inform. e2400323 (2025).

  5. Zaretsky, J. et al. Generative artificial intelligence to transform inpatient discharge summaries to patient-friendly language and format. JAMA Netw. Open 7, e240357 (2024).

    Google Scholar 

  6. Fahrner, L. J., Chen, E., Topol, E. & Rajpurkar, P. The generative era of medical AI. Cell 188, 3648–3660 (2025).

    Google Scholar 

  7. Tierney, A. A. et al. Ambient artificial intelligence scribes: learnings after 1 Year and over 2.5 Million Uses. NEJM Catalyst 6, CAT.25.0040 (2025).

  8. Altschuler, S., Huntington, I., Antoniak, M. & Klein, L. F. Clinician as editor: notes in the era of AI scribes. Lancet 404, 2154–2155 (2024).

    Google Scholar 

  9. van Buchem, M. M. et al. The digital scribe in clinical practice: a scoping review and research agenda. npj Digital Med. 4, 57 (2021).

    Google Scholar 

  10. Shah, S. J. et al. Physician perspectives on ambient AI scribes. JAMA Netw. Open 8, e251904 (2025).

    Google Scholar 

  11. Coiera, E., Kocaballi, B., Halamka, J. & Laranjo, L. The digital scribe. NPJ Digit Med. 1, 58 (2018).

    Google Scholar 

  12. Duggan, M. J. et al. Clinician experiences with ambient scribe technology to assist with documentation burden and efficiency. JAMA Netw. Open 8, e2460637 (2025).

    Google Scholar 

  13. Evans, K. et al. Impact of using an AI scribe on clinical documentation and clinician-patient interactions in allied health private practice: perspectives of clinicians and patients. Musculoskelet. Sci. Pract. 78, 103333 (2025).

    Google Scholar 

  14. Shah, S. J. et al. Ambient artificial intelligence scribes: physician burnout and perspectives on usability and documentation burden. J. Am. Med. Inform. Assoc. 32, 375–380 (2025).

    Google Scholar 

  15. Leung, T. I., Coristine, A. J. & Benis, A. AI scribes in health care: balancing transformative potential with responsible integration. JMIR Med. Inform. 13, e80898 (2025).

    Google Scholar 

  16. Albrecht, M. et al. Enhancing clinical documentation with ambient artificial intelligence: a quality improvement survey assessing clinician perspectives on work burden, burnout, and job satisfaction. JAMIA Open 8, ooaf013 (2025).

    Google Scholar 

  17. Biro, J. et al. Accuracy and safety of AI-enabled scribe technology: instrument validation study. J. Med. Internet Res. 27, e64993 (2025).

    Google Scholar 

  18. NHS: Guidance on the use of AI-enabled ambient scribing products in health and care settings. https://www.england.nhs.uk/long-read/guidance-on-the-use-of-ai-enabled-ambient-scribing-products-in-health-and-care-settings/?utm_source=chatgpt.com. Accessed: 01/08/2025. (2025).

  19. Banerji, C. R. S. et al. Clinicians must participate in the development of multimodal AI. eClin. Med. 84, (2025).

  20. Acosta, J. N., Falcone, G. J., Rajpurkar, P. & Topol, E. J. Multimodal biomedical AI. Nat. Med. 28, 1773–1784 (2022).

    Google Scholar 

  21. Rao, V. M. et al. Multimodal generative AI for medical image interpretation. Nature 639, 888–896 (2025).

    Google Scholar 

  22. Lee, J.-O., Zhou, H.-Y., Berzin, T. M., Sodickson, D. K. & Rajpurkar, P. Multimodal generative AI for interpreting 3D medical images and videos. npj Dig. Med. 8, 273 (2025).

    Google Scholar 

  23. Moryousef, J., Nadesan, P., Uy, M., Matti, D. & Guo, Y. Assessing the efficacy and clinical utility of artificial intelligence scribes in urology. Urology 196, 12–17 (2025).

    Google Scholar 

  24. DeepScribe: DeepScribe Outperforms GPT-4 by 59% on AI Medical Scribing: A Benchmark Study (2025). https://www.deepscribe.ai/resources/deepscribe-outperforms-gpt-4-by-59-percent-on-ai-medical-scribing#:~:text=challenging%20input%20characteristics%20such%20as,lack%20of%20medical%20data%20training. Accessed: 01/08/2025.

  25. Stokel-Walker, C. The “ambient scribe” tools listening to and summarising your doctor-patient consultations. BMJ 389, r663 (2025).

    Google Scholar 

  26. Kernberg, A., Gold, J. A. & Mohan, V. Using ChatGPT-4 to create structured medical notes from audio recordings of physician-patient encounters: comparative study. J. Med Internet Res 26, e54419 (2024).

    Google Scholar 

  27. Shemtob, L., Majeed, A. & Beaney, T. Regulation of AI scribes in clinical practice. BMJ 389, r1248 (2025).

    Google Scholar 

  28. Khanna, A. et al. Enhancing accuracy of operative reports with automated artificial intelligence analysis of surgical video. J. Am. Coll. Surg. 240, (2025).

  29. Dhote, M. G. et al. Hybrid vision-language models for real-time surgical report generation and documentation. J. Neonatal Surg. 14, 1–12 (2025).

    Google Scholar 

  30. Sasseville, M. et al. The impact of AI scribes on streamlining clinical documentation: a systematic review. in Healthcare, Vol. 13 (2025).

  31. Draper, T. C. et al. The impact of acoustic and informational noise on AI-generated clinical summaries. medRxiv, 2025.2003. 2024.25324398 (2025).

  32. Generative AI on Vertex AI: Gemini pro https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-pro (2025).

  33. Duffourc, M. N. & Gerke, S. Health care AI and patient privacy—dinerstein v Google. JAMA 331, 909–910 (2024).

    Google Scholar 

  34. Hewson, T. et al. The recording of mental health consultations by patients: clinical, ethical and legal considerations. BJPsych Bull. 46, 133–137 (2022).

    Google Scholar 

  35. Lear, R., Ellis, S., Ollivierre-Harris, T., Long, S. & Mayer, E. K. Video recording patients for direct care purposes: systematic review and narrative synthesis of international empirical studies and UK professional guidance. J. Med. Internet Res. 25, e46478 (2023).

    Google Scholar 

  36. Menz, B. D. et al. Gender representation of health care professionals in large language model-generated stories. JAMA Netw. Open 7, e2434997 (2024).

    Google Scholar 

  37. Menz, B. D. et al. Current safeguards, risk mitigation, and transparency measures of large language models against the generation of health disinformation: repeated cross sectional analysis. BMJ 384, e078538 (2024).

    Google Scholar 

  38. Murdoch, B. Privacy and artificial intelligence: challenges for protecting health information in a new era. BMC Med. Ethics 22, 122 (2021).

    Google Scholar 

  39. Rozenblit, L. et al. Toward responsible AI governance: balancing multi-stakeholder perspectives on AI in healthcare. Int. J. Med. Inform. 203, 106015 (2025).

    Google Scholar 

  40. Silcox, C. et al. The potential for artificial intelligence to transform healthcare: perspectives from international health leaders. npj Digital Med. 7, 88 (2024).

    Google Scholar 

  41. Bouderhem, R. Shaping the future of AI in healthcare through ethics and governance. Human. Soc. Sci. Commun. 11, 416 (2024).

    Google Scholar 

  42. World Health Organisation (WHO): Health Data Overview for the United States of America. https://data.who.int/countries/840 (2025).

  43. United States Pharmacopeia (USP): Chapter 17 - prescription container labeling. https://www.usp.org/sites/default/files/usp/webform/c17.pdf.

  44. American Society of Hospital Pharmacists (ASHP): Medication history form. https://www.ashp.org/-/media/assets/pharmacy-technician/docs/EdAndTrainMedHxForm.pdf.

  45. Australian Commission on Safety and Quality in Health Care: How to take a BPMH - resources and tools. https://www.ashp.org/-/media/assets/pharmacy-technician/docs/EdAndTrainMedHxForm.pdf.

  46. Zaghir, J. et al. Prompt engineering paradigms for medical applications: scoping review. J. Med. Internet Res. 26, e60501 (2024).

    Google Scholar 

  47. Meskó, B. Prompt engineering as an important emerging skill for medical professionals: tutorial. J. Med. Internet Res. 25, e50638 (2023).

    Google Scholar 

  48. El Hajj, M. S., Asiri, R., Husband, A. & Todd, A. Medication errors in community pharmacies: a systematic review of the international literature. PLoS ONE 20, e0322392 (2025).

    Google Scholar 

  49. Uhlenhopp, D. J. et al. Hospital-wide medication reconciliation program: error identification, cost-effectiveness, and detecting high-risk individuals on admission. Integr. Pharm. Res. Pract. 9, 195–203 (2020).

    Google Scholar 

Download references

Acknowledgements

The PhD scholarship of B.D.M. is supported by the National Health and Medical Research Council, Australia (APP2030913). A.M.H. holds an Emerging Leader Investigator Fellow, National Health and Medical Research Council, Australia (APP2008119). M.J.S. is supported by a Beat Cancer Research Fellowship from the Cancer Council South Australia. S.B. is supported by a Fulbright Scholarship. The funding organisations had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. Disclosures: The authors used ChatGPT, Claude, and Grammarly AI to assist in formatting and editing the manuscript. Figures 1 and 3 were generated using Biorender.

Author information

Authors and Affiliations

  1. College of Medicine and Public Health, Flinders Health and Medical Research Institute, Flinders University, Adelaide, South Australia

    Bradley D. Menz, Erik Cornelisse, Lee X. Li, Jin Quan Eugene Tan, Dorsa Maher, Ross A. McKinnon, Andrew Rowland, Michael J. Sorich & Ashley M. Hopkins

  2. SA Pharmacy, Southern Adelaide Local Health Network, Adelaide, South Australia

    Bradley D. Menz, Nicholas L. Scarfo, Erik Cornelisse, Jin Quan Eugene Tan, Dib Kousa & Vidya Menon

  3. Clinical and Health Sciences, Adelaide, University of South Australia, Adelaide, South Australia

    Natansh D. Modi, Jimit Gandhi & Michael D. Wiese

  4. SA Pharmacy, Central Adelaide Local Health Network, Adelaide, South Australia

    Jimit Gandhi & Kezia Daniel

  5. Department of Neurology, Northern Adelaide Local Health Network, Adelaide, South Australia

    Stephen Bacchi

Authors
  1. Bradley D. Menz
    View author publications

    Search author on:PubMed Google Scholar

  2. Nicholas L. Scarfo
    View author publications

    Search author on:PubMed Google Scholar

  3. Natansh D. Modi
    View author publications

    Search author on:PubMed Google Scholar

  4. Erik Cornelisse
    View author publications

    Search author on:PubMed Google Scholar

  5. Lee X. Li
    View author publications

    Search author on:PubMed Google Scholar

  6. Jin Quan Eugene Tan
    View author publications

    Search author on:PubMed Google Scholar

  7. Jimit Gandhi
    View author publications

    Search author on:PubMed Google Scholar

  8. Dorsa Maher
    View author publications

    Search author on:PubMed Google Scholar

  9. Dib Kousa
    View author publications

    Search author on:PubMed Google Scholar

  10. Kezia Daniel
    View author publications

    Search author on:PubMed Google Scholar

  11. Vidya Menon
    View author publications

    Search author on:PubMed Google Scholar

  12. Stephen Bacchi
    View author publications

    Search author on:PubMed Google Scholar

  13. Ross A. McKinnon
    View author publications

    Search author on:PubMed Google Scholar

  14. Michael D. Wiese
    View author publications

    Search author on:PubMed Google Scholar

  15. Andrew Rowland
    View author publications

    Search author on:PubMed Google Scholar

  16. Michael J. Sorich
    View author publications

    Search author on:PubMed Google Scholar

  17. Ashley M. Hopkins
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Concept and study design—B.M., A.M.H., M.J.S. Creation of AI scribe—B.M. Data collection—B.M., N.L.S., N.D.M., E.C., L.X.L., J.Q.E.T., J.G., D.M., D.K., K.D., V.M. Statistical analysis—B.M., A.M.H., M.J.S. Generation of figures—B.M. Interpretation of results—B.M., N.L.S., N.D.M., E.C., L.X.L., J.Q.E.T., J.G., D.M., D.K., K.D., V.M., S.B., R.A.M., M.D.W., A.R., M.J.S., A.M.H. Manuscript writing—B.M., A.M.H. Manuscript editing and review—B.M., N.L.S., N.D.M., E.C., L.X.L., J.Q.E.T., J.G., D.M., D.K., K.D., V.M., S.B., R.A.M., M.D.W., A.R., M.J.S., A.M.H. Supervision—A.M.H., A.R., M.J.S. Funding acquisition—B.M., A.M.H. All authors approved submission of the final manuscript.

Corresponding author

Correspondence to Bradley D. Menz.

Ethics declarations

Competing interests

A.M.H. is a recipient of investigator-initiated funding for research outside the scope of the current study from Boehringer Ingelheim. A.R. and M.J.S. are recipients of investigator-initiated funding for research outside the scope of the current study from AstraZeneca, Boehringer Ingelheim, Pfizer and Takeda. A.R. is a recipient of speaker fees from Boehringer Ingelheim and Genentech. The author team have no other potential conflicts of interest with respect to this research and/or publication to declare.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Menz, B.D., Scarfo, N.L., Modi, N.D. et al. Vision-Enabled AI scribes reduce omissions in clinical conversations: evidence from simulated medication histories. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02494-9

Download citation

  • Received: 25 September 2025

  • Accepted: 15 February 2026

  • Published: 26 February 2026

  • DOI: https://doi.org/10.1038/s41746-026-02494-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Associated content

Collection

Multimodal AI for Digital Medicine

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Content types
  • Journal Information
  • About the Editors
  • Contact
  • Editorial policies
  • Calls for Papers
  • Journal Metrics
  • About the Partner
  • Open Access
  • Early Career Researcher Editorial Fellowship
  • Editorial Team Vacancies
  • News and Views Student Editor
  • Communication Fellowship

Publish with us

  • For Authors and Referees
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

npj Digital Medicine (npj Digit. Med.)

ISSN 2398-6352 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics