Artificial intelligence is transforming scientific discovery through (semi-)autonomous agents capable of reasoning, planning, and interacting with digital and physical environments. This Comment explores the foundations and frontiers of agentic science, outlining its emerging directions, current limitations, and the pathways for responsible integration into scientific practice.
We argue that AI-driven wellness apps powered by large language models can foster extreme emotional attachments and dependencies akin to human relationships, posing risks such as ambiguous loss and dysfunctional dependence that challenge current regulatory frameworks and call for safeguards and informed interventions within these platforms.
There is a growing awareness of the substantial environmental costs of large language models (LLMs), but discussing the sustainability of LLMs only in terms of CO2 emissions is not enough. This Comment emphasizes the need to take into account the social and ecological costs and benefits of LLMs as well.
Sharp distinctions often drawn between machine and biological intelligences have not tracked advances in the fields of developmental biology and hybrid robotics. We call for conceptual clarity driven by the science of diverse intelligences in unconventional spaces and at unfamiliar scales and embodiments that blur conventional categories.
The EU Artificial Intelligence Act bans certain “subliminal techniques beyond a person’s consciousness”, but uses undefined legal terms. Interdisciplinary efforts are needed to ensure effective implementation of the legal text.
Large language model-based agentic systems can process input information, plan and decide, recall and reflect, interact and collaborate, leverage various tools and act. This opens up a wealth of opportunities within medicine and healthcare, ranging from clinical workflow automation to multi-agent-aided diagnosis.
In this Comment, we propose a cumulative set of three essential criteria for the ethical use of LLMs in academic writing, and present a statement that researchers can quote when submitting LLM-assisted manuscripts to attest to their adherence to these criteria.
A new class of AI models, called foundation models, has entered healthcare. Foundation models violate several basic principles of the standard machine learning paradigm for assessing reliability, making it necessary to rethink what guarantees are required to establish warranted trust in them.
Most research efforts in machine learning focus on performance and are detached from explaining the behaviour of the model. We call for a return to the basics of machine learning methods, with more focus on developing a fundamental understanding grounded in statistical theory.
Speech technology offers many applications to enhance employee productivity and efficiency. Yet new dangers arise for marginalized groups, potentially jeopardizing organizational efforts to promote workplace diversity. Our analysis delves into three critical risks of speech technology and offers guidance for mitigating these risks responsibly.
The area under the receiver operating characteristic curve (AUROC) of the test set is used throughout machine learning (ML) for assessing a model’s performance. However, when concordance is not the only ambition, this gives only a partial insight into performance, masking distribution shifts of model outputs and model instability.
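One way to see why concordance alone can mislead: AUROC depends only on the ranking of model outputs, so a monotonic shift in the output distribution leaves it unchanged. The sketch below is a hypothetical illustration (not taken from the Comment) using Python and scikit-learn, in which two models produce identically ranked but differently distributed scores and therefore share the same test-set AUROC.

```python
# Minimal sketch: identical AUROC despite a clear shift in model outputs.
# All data and models here are synthetic, illustrative assumptions.
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=1000)                                 # binary ground truth
scores_a = np.clip(y * 0.6 + rng.normal(0.2, 0.15, 1000), 0, 1)   # model A outputs

# Model B applies a strictly increasing transform to A's scores:
# the ranking (concordance) is preserved, but the output distribution shifts.
scores_b = 0.5 * scores_a + 0.5

print(roc_auc_score(y, scores_a), roc_auc_score(y, scores_b))     # identical AUROC
print(scores_a.mean(), scores_b.mean())                           # shifted output distributions
```

Because the metric is rank-based, it cannot flag this kind of distribution shift, which is exactly the partial view of performance the authors caution against.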
Although federated learning is often seen as a promising solution to allow AI innovation while addressing privacy concerns, we argue that this technology does not fix all underlying data ethics concerns. Benefiting from federated learning in digital health requires acknowledgement of its limitations.
Can non-state multinational tech companies counteract the potential democratic deficit in the emerging global governance of AI? We argue that although they may strengthen core values of democracy such as accountability and transparency, they currently lack the right kind of authority to democratize global AI governance.
The rise of artificial intelligence (AI) has come with an ever-increasing demand for energy, which threatens to outweigh its promised positive effects. To steer AI onto a more sustainable path, quantifying and comparing its energy consumption is key.
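As a rough illustration of the kind of quantification involved, the sketch below estimates the energy and carbon footprint of a training run from average power draw, runtime, data-centre overhead and grid carbon intensity. All figures are placeholder assumptions for illustration, not measurements from any study.

```python
# Hedged back-of-the-envelope sketch: energy and emissions of a training run.
# Every input below is an illustrative assumption, not a measured value.
NUM_GPUS = 8             # assumed number of accelerators
AVG_POWER_W = 300        # assumed average draw per accelerator, in watts
RUNTIME_H = 72           # assumed training time, in hours
PUE = 1.2                # assumed data-centre power usage effectiveness
CARBON_KG_PER_KWH = 0.4  # assumed grid carbon intensity, kg CO2e per kWh

energy_kwh = NUM_GPUS * AVG_POWER_W * RUNTIME_H / 1000 * PUE
emissions_kg = energy_kwh * CARBON_KG_PER_KWH
print(f"Estimated energy: {energy_kwh:.0f} kWh, emissions: {emissions_kg:.0f} kg CO2e")
```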
Medical artificial intelligence needs governance to ensure safety and effectiveness, not just centrally (for example, by the US Food and Drug Administration) but also locally to account for differences in care, patients and system performance. Practical collaborative governance will enable health systems to carry out these challenging governance tasks, supported by central regulators.
To protect the integrity of knowledge production, the training procedures of foundation models such as GPT-4 need to be made accessible to regulators and researchers. Foundation models must become open and public, and those are not the same thing.
There are repeated calls in the AI community to prioritize data work — collecting, curating, analysing and otherwise considering the quality of data. But this is not practised as much as advocates would like, often because of a lack of institutional and cultural incentives. One way to encourage data work would be to reframe it as more technically rigorous, and thereby integrate it into more-valued lines of research such as model innovation.
We show that large language models (LLMs), such as ChatGPT, can guide the robot design process at both the conceptual and technical levels, and we propose new human–AI co-design strategies and discuss their societal implications.
Metaverse-enabled healthcare is no longer hypothetical. Developers must now contend with ethical, legal and social hazards if they are to overcome the systematic inefficiencies and inequities that exist for patients who seek care in the real world.
Generative AI programs can produce high-quality written and visual content that may be used for good or ill. We argue that a credit–blame asymmetry arises for assigning responsibility for these outputs and discuss urgent ethical and policy implications focused on large-scale language models.