A new approach sheds light on the biological features learned by protein language models, promising greater interpretability for unsupervised sequence learning.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout

References
Simon, E. & Zou, J. Nat. Methods https://doi.org/10.1038/s41592-025-02836-7 (2025).
Lin, Z. et al. Science 379, 1123–1130 (2023).
Ruffolo, J. A. & Madani, A. Nat. Biotechnol. 42, 200–202 (2024).
Vig, J. et al. In International Conference on Learning Representations https://openreview.net/forum?id=YWtLZvLmud7 (2021).
Rao, R. et al. In International Conference on Learning Representations https://openreview.net/forum?id=fylclEqgvgd (2021).
Meier, J. et al. Adv. Neural Inf. Process. Syst. 34, 29287–29303 (2021).
Zhang, Z. et al. Proc. Natl Acad. Sci. USA 121, e2406285121 (2024).
Cunningham, H. et al. In International Conference on Learning Representations https://openreview.net/forum?id=F76bwRSLeK (2024).
Bhatnagar, A. et al. Preprint at bioRxiv https://doi.org/10.1101/2025.04.15.649055 (2025).
Acknowledgements
We thank E. Simon for insightful discussions.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
All authors are current or former employees, contractors or executives of Profluent Bio Inc. and may hold shares in Profluent Bio Inc.
Rights and permissions
About this article
Cite this article
Ruffolo, J.A. What does a language model know about proteins?. Nat Methods (2025). https://doi.org/10.1038/s41592-025-02837-6
Published:
DOI: https://doi.org/10.1038/s41592-025-02837-6