Large language model-based agentic systems can process input information, plan and decide, recall and reflect, interact and collaborate, leverage various tools and act. This opens up a wealth of opportunities within medicine and healthcare, ranging from clinical workflow automation to multi-agent-aided diagnosis.
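As a purely illustrative sketch of the capabilities listed above (it is not taken from the article), the loop below wires together planning, tool use, reflection and memory in a toy agent. All names here (`ToyAgent`, `bmi_tool`, the plausibility range of 10 to 80) are hypothetical, and a fixed rule stands in for the language model so the example runs without any external service.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


def bmi_tool(weight_kg: float, height_m: float) -> float:
    """Hypothetical tool: body-mass index, rounded to one decimal place."""
    return round(weight_kg / height_m ** 2, 1)


@dataclass
class ToyAgent:
    # Tools the agent may call, keyed by name (assumed registry layout).
    tools: Dict[str, Callable[..., float]]
    # Append-only log standing in for the agent's memory.
    memory: List[str] = field(default_factory=list)

    def plan(self, task: dict) -> List[Tuple[str, dict]]:
        # Stand-in for an LLM planner: a fixed rule instead of a model call.
        if "weight_kg" in task and "height_m" in task:
            args = {"weight_kg": task["weight_kg"], "height_m": task["height_m"]}
            return [("bmi", args)]
        return []

    def reflect(self, tool_name: str, result: float) -> str:
        # Minimal self-check: flag values outside an assumed plausible range.
        if tool_name == "bmi" and not 10 <= result <= 80:
            return "implausible"
        return "ok"

    def run(self, task: dict) -> Dict[str, float]:
        results: Dict[str, float] = {}
        for name, args in self.plan(task):
            value = self.tools[name](**args)   # act: call the tool
            verdict = self.reflect(name, value)  # reflect on the outcome
            self.memory.append(f"{name}({args}) -> {value} [{verdict}]")
            if verdict == "ok":
                results[name] = value
        return results


agent = ToyAgent(tools={"bmi": bmi_tool})
report = agent.run({"weight_kg": 70.0, "height_m": 1.75})
print(report)
print(agent.memory)
```

In a real system the `plan` and `reflect` steps would be model calls and the tools would be clinical calculators, retrieval services or other agents; the control flow shown here is only one simple way to arrange them.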

Acknowledgements
This work is supported by the Research Grants Council of Hong Kong SAR (ECS24211020, GRF14203821, GRF14216222 and GRF14201824) and the Innovation and Technology Fund (ITF) of Hong Kong SAR (ITS/252/23). K.L. is supported by a UK National Institute for Health and Care Research Academic Clinical Fellowship. The authors thank L. Li (King’s College London) for helpful discussions.
Ethics declarations
Competing interests
The authors declare no competing interests.
About this article
Cite this article
Qiu, J., Lam, K., Li, G. et al. LLM-based agentic systems in medicine and healthcare. Nat Mach Intell 6, 1418–1420 (2024). https://doi.org/10.1038/s42256-024-00944-1
DOI: https://doi.org/10.1038/s42256-024-00944-1