Large language models (LLMs) are rapidly being implemented in a wide range of disciplines, with the promise of unlocking new possibilities for scientific exploration. However, while the development of LLMs brings opportunities to science, it also comes with pressing challenges. This Focus discusses the current state of the art, highlights key obstacles, and examines some of the potential pitfalls and biases of implementing and using LLMs across different domains, including healthcare, urban planning, chemistry, linguistics, humanities, and computer science. In addition, the Focus explores emerging technologies – such as neuromorphic engineering – that show promise in enhancing the energy efficiency of LLM deployment on hardware platforms.
This issue of Nature Computational Science features a Focus that highlights both the promises and perils of large language models, their emerging applications across diverse scientific domains, and the opportunities to overcome the challenges that lie ahead.
While leading tech companies race to build ever-larger models, researchers in Brazil, India and Africa are using clever tricks to remix big labs’ LLMs to bring AI to billions of users.
This Perspective highlights the potential integrations of large language models (LLMs) in chemical research and provides guidance on the effective use of LLMs as research partners, noting the ethical and performance-based challenges that must be addressed moving forward.
Large language models remain largely unexplored in the design of cities. In this Perspective, the authors discuss the potential opportunities brought by these models in assisting urban planning.
Large language models are increasingly important in social science research. The authors provide guidance on how best to validate and use these models as rigorous tools to further scientific inference.
This Perspective argues that generative AI aligns with generative linguistics, showing that neural language models (NLMs) are formal generative models. Furthermore, generative linguistics offers a framework for evaluating and improving NLMs.
Many humanists are skeptical of language models and concerned about their effects on universities. However, researchers with a background in the humanities are also actively engaging with artificial intelligence — seeking not only to adopt language models as tools, but to steer them toward a more flexible, contextual representation of written culture.
The use of generative artificial intelligence (AI) in healthcare is advancing, but understanding its potential challenges for fairness and health equity is still in its early stages. This Comment investigates how to define fairness and measure it, and highlights research that can help address challenges in the field.
The adoption of generative artificial intelligence (AI) code assistants in scientific software development is promising, but user studies across an array of programming contexts suggest that programmers are at risk of over-reliance on these tools, leading them to accept undetected errors in generated code. Scientific software may be particularly vulnerable to such errors because most research code is untested and scientists are undertrained in software development skills. This Comment outlines the factors that place scientific code at risk and suggests directions for research groups, educators, publishers and funders to counter these liabilities.
Large language models (LLMs) are already transforming the study of individual cognition, but their application to studying collective cognition has been underexplored. We lay out how LLMs may be able to address the complexity that has hindered the study of collectives and raise possible risks that warrant new methods.
Strong barriers remain between neuromorphic engineering and machine learning, especially with regard to recent large language models (LLMs) and transformers. This Comment makes the case that neuromorphic engineering may hold the keys to more efficient inference with transformer-like models.
This study presents SciToolAgent, a large language model-based agent that orchestrates scientific tools via a knowledge graph, enabling automated and effective execution of scientific research workflows.
A comprehensive benchmark, called MaCBench, is developed to evaluate how vision language models handle different aspects of real-world chemistry and materials science tasks.
A physics-based training pipeline is developed to help tackle the challenges of data scarcity. The framework aligns large language models to a physically consistent initial state, which is then fine-tuned to learn polymer properties.
Language models show promise for encoding quantum correlations and learning complex quantum states. This Perspective discusses the advantages of employing language models in quantum simulation, explores recent model developments, and offers insights into opportunities for realizing scalable and accurate quantum simulation.
Generative artificial intelligence (GAI) is driving a surge in e-waste due to intensive computational infrastructure needs. This study emphasizes the necessity for proactive implementation of circular economy practices throughout GAI value chains.
Leveraging in-memory computing with emerging gain-cell devices, the authors accelerate attention—a core mechanism in large language models. They train a 1.5-billion-parameter model, achieving up to a 70,000-fold reduction in energy consumption and a 100-fold speed-up compared with GPUs.
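The operation being accelerated here is the standard scaled dot-product attention at the heart of transformers. As a rough, illustrative sketch only (plain NumPy, not the authors' gain-cell in-memory implementation), the computation looks like this:

# Minimal scaled dot-product attention; shapes and names are illustrative.
import numpy as np

def attention(Q, K, V):
    # Q, K, V: (sequence_length, head_dim) arrays
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)  # pairwise query-key similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V  # weighted sum of value vectors

Q = np.random.randn(4, 8)
K = np.random.randn(4, 8)
V = np.random.randn(4, 8)
out = attention(Q, K, V)  # shape (4, 8)

The matrix products in this routine dominate inference cost at scale, which is why mapping them onto analog in-memory hardware can yield such large energy and latency gains.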
This study shows a viable pathway to the efficient deployment of state-of-the-art large language models using mixture of experts on 3D analog in-memory computing hardware.
Researchers replicated 156 psychological experiments using three large language models (LLMs) instead of human participants. LLMs achieved 73–81% replication rates but showed amplified effect sizes and challenges with socially sensitive topics.
Researchers show that large language models exhibit social identity biases similar to humans, showing favoritism toward ingroups and hostility toward outgroups. These biases persist across models, training data and real-world human–LLM conversations.
Using registry data from Denmark, Lehmann et al. create individual-level trajectories of events related to health, education, occupation, income and address, and also apply transformer models to build rich embeddings of life-events and to predict outcomes ranging from time of death to personality.
The reasoning capabilities of OpenAI’s generative pre-trained transformer family were tested using semantic illusions and cognitive reflection tests that are typically used in human studies. While early models were prone to human-like cognitive errors, ChatGPT decisively outperformed humans, avoiding the cognitive traps embedded in the tasks.
A neural network-based language model of supra-word meaning, that is, the combined meaning of words in a sentence, is proposed. Analysis of functional magnetic resonance imaging and magnetoencephalography data helps identify the regions of the brain responsible for understanding this meaning.
Many AI companies implement safety systems to protect users from offensive or inaccurate content. Though well intentioned, these filters can exacerbate existing inequalities, and data shows that they have disproportionately removed LGBTQ+ content.
Artificial intelligence (AI) drives innovation across society, economies and science. We argue for the importance of building AI technology according to open-source principles to foster accessibility, collaboration, responsibility and interoperability.
Training foundation models often requires a costly budget and excessive computational resources. In this study, a low-cost instruction learning framework is proposed that could enable the rapid adoption of visual-language pathology applications.
Larger LLMs’ self-attention more accurately predicts readers’ regressive saccades and fMRI responses in language regions, whereas instruction tuning adds no benefit.
PandemicLLM adapts the large language model to predict disease trends by converting diverse disease-relevant data into text. It responds to new variants in real time, offering robust, interpretable forecasts for effective public health responses.
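As a hedged sketch of what "converting diverse disease-relevant data into text" could look like in practice (the field names and prompt template below are hypothetical and are not PandemicLLM's actual format):

# Hypothetical serialization of structured surveillance data into a text
# prompt for an LLM-based forecaster; all field names are illustrative only.
def to_prompt(record):
    return (
        f"Region: {record['region']}. Week: {record['week']}. "
        f"Reported cases: {record['cases']}. "
        f"Dominant variant: {record['variant']}. "
        f"Vaccination coverage: {record['vaccination_rate']:.0%}. "
        "Question: will hospitalizations rise, fall or stay stable next week?"
    )

example = {
    "region": "Region A",
    "week": "2023-W12",
    "cases": 1840,
    "variant": "XBB.1.5",
    "vaccination_rate": 0.71,
}
print(to_prompt(example))

Serializing heterogeneous inputs into natural language in this way is what lets a general-purpose language model ingest new signals, such as a newly emerging variant, without retraining a bespoke model for each data type.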
A multimodal computational framework is proposed to integrate single-cell RNA sequencing data with phenotypic information to map complex genotype–phenotype relationships. This approach helps to refine cellular heterogeneity analysis, identify cross-tissue biomarkers and reveal polyfunctional characteristics of genes with cellular resolution.
This study introduces the Protein Importance Calculator (PIC), a deep learning model designed to predict human essential proteins (HEPs) crucial for survival and development. Unlike conventional methods, PIC offers a comprehensive assessment of HEPs across three levels: humans, cell lines and mice.
The parallels between natural language and antibody sequences could serve as a stepping stone to using deep language models for analyzing antibody sequences. This Perspective discusses how issues in antibody language model rule mining could be addressed by linguistically formalizing the antibody language.
Signal peptides (SPs) are vital for directing proteins to and across cellular membranes. In this work, the authors introduce USPNet, a deep learning method based on a protein language model for SP prediction that shows both high sensitivity and efficiency, thereby contributing to the identification of novel SPs.
In this study, a supervised protein language model is proposed to predict protein structure from a single sequence. It achieves state-of-the-art accuracy on orphan proteins and is competitive with other methods on human-designed proteins.