In this Comment, we provide guidelines for applying reinforcement learning to decisions about patient treatment. We hope these guidelines will accelerate the rate at which observational cohorts can inform healthcare practice in a safe, risk-conscious manner.
Ethics declarations
Competing interests
A.A.F. has received funding from Fresenius-KABI in the past.
Cite this article
Gottesman, O., Johansson, F., Komorowski, M. et al. Guidelines for reinforcement learning in healthcare. Nat Med 25, 16–18 (2019). https://doi.org/10.1038/s41591-018-0310-5
DOI: https://doi.org/10.1038/s41591-018-0310-5