
The model is trained on a large corpus of peer-reviewed publications, crystallographic information files, a strategic subset of the open dataset RedPajama, and the Materials Science Community Discourse (MatSci comm.). Credit: Ahlawat, D. et al. Nature Machine Intelligence (2026).
Large language models (LLMs) are increasingly explored as tools to accelerate scientific discovery, but their effectiveness in specialised research domains remains uncertain. A new study introduces LLaMat, a family of language models designed specifically for materials science, showing that domain-adapted AI systems can outperform general-purpose models on key scientific tasks1.
Researchers at the Indian Institute of Technology Delhi and their collaborators developed LLaMat by continuing the pretraining of base language models on a curated corpus of roughly 30 billion tokens, drawn from about four million materials-science publications, crystallographic information files (CIFs), and community discussions. The models were then instruction- and task-tuned on more than 175,000 materials-science question–answer pairs along with multiple benchmark datasets.
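The two-stage recipe described above, continued pretraining on a domain corpus followed by instruction tuning, can be sketched with standard open-source tooling. The snippet below is an illustrative sketch, not the authors' code: the base checkpoint, the file `matsci_corpus.jsonl`, and all hyperparameters are assumptions chosen for clarity.

```python
# Minimal sketch of continued pretraining on a domain corpus (stage 1 of the
# recipe described in the article). Model name, data path, and hyperparameters
# are illustrative assumptions, not the authors' actual configuration.
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)
from datasets import load_dataset

base = "meta-llama/Llama-2-7b-hf"  # assumed LLaMA-family base model
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base)

# Assumed local corpus of domain text (papers, CIFs, forum discussions),
# one JSON record per line with a "text" field.
corpus = load_dataset("json", data_files="matsci_corpus.jsonl", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=2048)

corpus = corpus.map(tokenize, batched=True, remove_columns=corpus.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="llamat-cpt",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=64,
        num_train_epochs=1,
        learning_rate=2e-5,
        bf16=True,
    ),
    train_dataset=corpus,
    # Causal language modelling: predict the next token, no masking.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()

# Stage 2 (instruction tuning) would follow the same pattern, with each
# question-answer pair formatted into a prompt/response string before tokenization.
```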
When evaluated across 42 tasks — including natural-language understanding, structured information extraction, and crystal structure generation — the models outperformed several widely used commercial LLMs (Claude, GPT and Gemini) while retaining strong general capabilities.
The researchers also uncovered an unexpected phenomenon they call “adaptation rigidity.” Models that had undergone extensive general-purpose pretraining were less able to absorb specialised domain knowledge through continued training. In other words, the most heavily trained models may become comparatively resistant to later specialisation.
The findings suggest that moderately sized, domain-specific LLMs may serve as more effective and computationally efficient AI copilots for scientific research than very large, general-purpose systems.