- NEWS AND VIEWS
LLMs behaving badly: mistrained AI models quickly go off the rails
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout
Nature 649, 560-561 (2026)
doi: https://doi.org/10.1038/d41586-025-04090-5
References
Betley, J. et al. Nature 649, 584–589 (2026).
Turner, E., Soligo, A. Taylor, M., Rajamanoharan, S. & Nanda, N. Preprint at arXiv https://doi.org/10.48550/arXiv.2506.11613 (2025).
Chua, J., Betley, J., Taylor, M. & Evans, O. Preprint at arXiv https://doi.org/10.48550/arXiv.2506.13206 (2025).
Taylor, M., Chua, J., Betley, J., Treutlein, J. & Evans, O. Preprint at arXiv https://doi.org/10.48550/arXiv.2508.17511 (2025).
Wang, M. et al. Preprint at arXiv https://doi.org/10.48550/arXiv.2506.19823 (2025).
Liu, Y. et al. Preprint at arXiv https://doi.org/10.48550/arXiv.2305.13860 (2023).
Zou, A. et al. Preprint at arXiv https://doi.org/10.48550/arXiv.2307.15043 (2023).
Skinner, B. F. Beyond Freedom and Dignity (Hackett, 1971).
Competing Interests
The author declares no competing interests.
Read the paper: Training large language models on narrow tasks can lead to broad misalignment
Mathematicians put AI model AlphaProof to the test
AI discovers learning algorithm that outperforms those designed by humans