Guidelines for reinforcement learning in healthcare

Gottesman, Omer; Johansson, Fredrik; Komorowski, Matthieu; Faisal, Aldo; Sontag, David; Doshi-Velez, Finale; Celi, Leo Anthony

doi:10.1038/s41591-018-0310-5

Comment
Published: 07 January 2019

Nature Medicine volume 25, pages 16–18 (2019)

In this Comment, we provide guidelines for reinforcement learning for decisions about patient treatment that we hope will accelerate the rate at which observational cohorts can inform healthcare practice in a safe, risk-conscious manner.

**Fig. 1: Sequential decision-making tasks.**

**Fig. 2: Effective sample size in off-policy evaluation.**

Author information

These authors contributed equally: Omer Gottesman, Fredrik Johansson.

Authors and Affiliations

Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
Omer Gottesman & Finale Doshi-Velez
Institute for Medical Engineering and Science, MIT, Cambridge, MA, USA
Fredrik Johansson & David Sontag
Laboratory for Computational Physiology, Harvard-MIT Health Sciences & Technology, MIT, Cambridge, MA, USA
Matthieu Komorowski & Leo Anthony Celi
Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London, UK
Matthieu Komorowski
Department of Bioengineering, Imperial College London, London, UK
Aldo Faisal
Department of Computing, Imperial College London, London, UK
Aldo Faisal
Data Science Institute, London, UK
Aldo Faisal
MRC London Institute of Clinical Sciences, London, UK
Aldo Faisal
Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Leo Anthony Celi
MIT Critical Data, Cambridge, MA, USA
Leo Anthony Celi

Corresponding author

Correspondence to Leo Anthony Celi.

Ethics declarations

Competing interests

A.A.F. has received funding from Fresenius-KABI in the past.

About this article

Cite this article

Gottesman, O., Johansson, F., Komorowski, M. et al. Guidelines for reinforcement learning in healthcare. Nat Med 25, 16–18 (2019). https://doi.org/10.1038/s41591-018-0310-5

Issue date: January 2019
