Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

View all journals
Search
Log in

nature
search

Search

Advanced search

Filter By:

Journal Check one or more journals to show results from those journals only.

Nature Medicine (3)

Choose more journals

Article type Check one or more article types to show results from those article types only.

Comments & Opinion (1)
News & Views (1)
Research (1)

Subject Check one or more subjects to show results from those subjects only.

Computational biology and bioinformatics
Health care
Medical research

Date Choose a date option to show results from those dates only.

Today
Last 7 days
Last 30 days
Last 12 months
Last 2 years
Last 5 years

Custom date range

Clear all filters

Sort by:

Relevance

Date published (new to old)

Date published (old to new)

Showing 1–3 of 3 results

Advanced filters: Author: Suhana Bedi Clear advanced filters

Holistic evaluation of large language models for medical tasks with MedHELM

MedHELM, an extensible evaluation framework including a new taxonomy for classifying medical tasks and a benchmark of many datasets across these categories, enables the evaluation of large language models on real-world clinical tasks.

Suhana Bedi
Hejie Cui
Nigam H. Shah
Research20 Jan 2026
Nature Medicine

P: 1-9
How to interpret ‘zero-shot’ results from generative EHR models

Generative models trained on electronic health records are viewed as ‘zero-shot predictors’ for clinical outcomes — but this interpretation is misleading.

Suhana Bedi
Jason Alan Fries
Nigam H. Shah
Comments & Opinion07 Jan 2026
Nature Medicine

P: 1-3
Evaluating the clinical benefits of LLMs

Although large language models (LLMs) show promise in controlled settings, a study now exposes their limitations in real-world clinical applications and points the way towards robust evaluation and benchmarking before clinical use.

Suhana Bedi
Sneha S. Jain
Nigam H. Shah
News & Views26 Jul 2024
Nature Medicine

Volume: 30, P: 2409-2410

Search

Search articles by subject, keyword or author

Show results from

Advanced search

Quick links

Explore articles by subject
Find a job
Guide to authors
Editorial policies

Nature.com

nature.com sitemap

About Nature Portfolio

About us
Press releases
Press office
Contact us

Discover content

Journals A-Z
Articles by subject
protocols.io
Nature Index

Publishing policies

Nature portfolio policies
Open access

Author & Researcher services

Reprints & permissions
Research data
Language editing
Scientific editing
Nature Masterclasses
Research Solutions

Libraries & institutions

Librarian service & tools
Librarian portal
Open research
Recommend to library

Advertising & partnerships

Advertising
Partnerships & Services
Media kits
Branded content

Professional development

Nature Awards
Nature Careers
Nature Conferences

Regional websites

Nature Africa
Nature China
Nature India
Nature Japan
Nature Middle East

Privacy Policy
Use of cookies
Legal notice
Accessibility statement
Terms & Conditions
Your US state privacy rights

Search

Filter By:

Holistic evaluation of large language models for medical tasks with MedHELM

How to interpret ‘zero-shot’ results from generative EHR models

Evaluating the clinical benefits of LLMs

Search

Quick links