What does a language model know about proteins?

Ruffolo, Jeffrey A.

doi:10.1038/s41592-025-02837-6

News & Views
Published: 29 September 2025

Machine learning

Jeffrey A. Ruffolo¹ (ORCID: orcid.org/0000-0002-3385-9191)

Nature Methods volume 22, pages 2017–2019 (2025)

A new approach sheds light on the biological features learned by protein language models, promising greater interpretability for unsupervised sequence learning.

**Fig. 1: Identifying biological features from protein language models.**

References

Simon, E. & Zou, J. Nat. Methods https://doi.org/10.1038/s41592-025-02836-7 (2025).
Lin, Z. et al. Science 379, 1123–1130 (2023).
Ruffolo, J. A. & Madani, A. Nat. Biotechnol. 42, 200–202 (2024).
Vig, J. et al. In International Conference on Learning Representations https://openreview.net/forum?id=YWtLZvLmud7 (2021).
Rao, R. et al. In International Conference on Learning Representations https://openreview.net/forum?id=fylclEqgvgd (2021).
Meier, J. et al. Adv. Neural Inf. Process. Syst. 34, 29287–29303 (2021).
Zhang, Z. et al. Proc. Natl Acad. Sci. USA 121, e2406285121 (2024).
Cunningham, H. et al. In International Conference on Learning Representations https://openreview.net/forum?id=F76bwRSLeK (2024).
Bhatnagar, A. et al. Preprint at bioRxiv https://doi.org/10.1101/2025.04.15.649055 (2025).

Acknowledgements

We thank E. Simon for insightful discussions.

Author information

Authors and Affiliations

Profluent Bio, Emeryville, CA, USA
Jeffrey A. Ruffolo

Corresponding author

Correspondence to Jeffrey A. Ruffolo.

Ethics declarations

Competing interests

All authors are current or former employees, contractors or executives of Profluent Bio Inc. and may hold shares in Profluent Bio Inc.

About this article

Cite this article

Ruffolo, J. A. What does a language model know about proteins? Nat. Methods 22, 2017–2019 (2025). https://doi.org/10.1038/s41592-025-02837-6

Published: 29 September 2025
Version of record: 29 September 2025
Issue date: October 2025
DOI: https://doi.org/10.1038/s41592-025-02837-6
