Reviews & Analysis

Boosting power for time-to-event GWAS analysis affected by case ascertainment

We propose a computationally efficient genome-wide association study (GWAS) method, WtCoxG, for time-to-event (TTE) traits in the presence of case ascertainment— a form of oversampling bias. WtCoxG addresses case ascertainment bias by applying a weighted Cox proportional hazard model, and outperforms existing approaches when incorporating information on external allele frequencies.

Research Briefing02 Oct 2025
Vision language models excel at perception but struggles with scientific reasoning

A benchmark — MaCBench — is developed for evaluating the scientific knowledge of vision language models (VLMs). Evaluation of leading VLMs reveals that they excel at basic scientific tasks such as equipment identification, but struggle with spatial reasoning and multistep analysis — a limitation for autonomous scientific discovery.

Research Briefing10 Sept 2025
A digital twin that interprets and refines chemical mechanisms

An integrated platform, Digital Twin for Chemical Science (DTCS), is developed to connect first-principles theory with spectroscopic measurements through a bidirectional feedback loop. By predicting and refining chemical reaction mechanisms before, during and after experiments, DTCS enables the interpretation of spectra and supports real-time decision-making in chemical characterization.

Research Briefing01 Sept 2025
Unbalanced gene-level batch effects in single-cell data

We developed group technical effects (GTE) as a quantitative metric for evaluating gene-level batch effects in single-cell data. It identifies highly batch-sensitive genes — the primary contributors to batch effects — that vary across datasets, and whose removal effectively mitigates the batch effects.

Research Briefing01 Jul 2025
A committor-based method to uniformly sample rare reactive events

Enhanced sampling methods aim to simulate rare physical and chemical reactive processes involving transitions between long-lived states. Existing methods often disproportionally sample either metastable or transition states. A machine-learning approach combines the strengths of these two cases to characterize entire rare events with the same thoroughness in a single calculation.

Research Briefing06 Jun 2025
How molecular changes impact brain states and whole-brain activity: a multiscale approach

Predicting how molecular changes affect brain activity is a challenge in neuroscience. We introduced a multiscale modeling approach to simulate these microscopic changes and how they impact macroscale brain activity. This approach predicted how the anesthetic action on synaptic receptors can lead to the transitions in macroscale brain activity observed empirically.

Research Briefing02 Jun 2025
Xeric plants inspire adaptable liquid crystal elastomers for water collection

Inspired by the morphologies of xeric plant leaves, we have developed biomimetic liquid crystal elastomer bilayers that can bend, spiral and twist. These adaptive shape morphing structures can twist to improve water collection efficiency and wind resistance, suggesting their potential application in adaptive water collection and directional transportation.

Research Briefing21 Apr 2025
Integrating statistical physics and machine learning for combinatorial optimization

We introduce free-energy machine (FEM), an efficient and general method for solving combinatorial optimization problems. FEM combines free-energy minimization from statistical physics with gradient-based optimization techniques in machine learning and utilizes parallel computation, outperforming state-of-the-art algorithms and showcasing the synergy of merging statistical physics with machine learning.

Research Briefing26 Mar 2025
Diversity-aware population modeling

We propose a diversity-aware population modeling framework using Bayesian multilevel regression and post-stratification to quantify sociodemographic disparities in cognitive development. Our approach improved subgroup estimates, guiding targeted public health strategies and addressing biases in traditional models to support more equitable decision-making.

Research Briefing20 Mar 2025
MultiSTAAR delivers multi-trait rare variant analysis of biobank-scale sequencing data

Identifying pleiotropic associations for rare variants in multi-ethnic biobank-scale whole-genome sequencing data poses considerable challenges. This study introduced MultiSTAAR as a scalable and robust multi-trait rare variant analysis framework designed for both coding and noncoding regions by integrating multiple variant functional annotations and leveraging multivariate modeling across diverse phenotypes.

Research Briefing07 Feb 2025
Shedding light on spatial signal transduction in cells using computational simulations

We present Spatial Modeling Algorithms for Reactions and Transport (SMART), a software package that simulates spatiotemporally detailed biochemical reaction networks within realistic cellular and subcellular geometries. This paper highlights the use of SMART in several biological test cases including cellular mechanotransduction, calcium signaling in neurons and cardiomyocytes, and adenosine triphosphate synthesis.

Research Briefing04 Feb 2025
Energy-efficient multimodal zero-shot learning using in-memory reservoir computing

To achieve an advanced neuromorphic computing system with brain-like energy efficiency and generalization capabilities, we propose a hardware–software co-design of in-memory reservoir computing. This co-design integrates a liquid state machine-based encoder with artificial neural network projections on a hybrid analog–digital system, demonstrating zero-shot learning for multimodal event data.

Research Briefing13 Jan 2025
A new tool for shape and structure optimization of soft materials

We present Morpho, an extensible programmable environment that uses finite elements for shape optimization in soft matter. Given an energy functional that incorporates physical boundaries and effects such as elasticity and electromagnetism, together with additional constraints to be satisfied, Morpho predicts the optimized shape and structure adopted by the material.

Research Briefing09 Jan 2025
A simulated C. elegans with biophysically detailed neurons and muscle dynamics

We created an open-source model that simulates Caenorhabditis elegans in a closed-loop system, by integrating simulations of its brain, its physical body, and its environment. BAAIWorm replicated C. elegans locomotive behaviors, and synthetic perturbations of synaptic connections impacted neural control of movement and affected the embodied motor behavior.

Research Briefing16 Dec 2024
Deep learning training dynamics analysis for single-cell data

Inspired by recent approaches for natural language processing and computer vision, we developed Annotatability, a framework that analyzes deep neural network training dynamics to interpret pre-annotated single-cell and spatial omics data. Annotatability identified erroneous annotations and ambiguous cell states, inferred trajectories from binary labels, and revealed underlying biological signals.

Research Briefing04 Dec 2024
Joint inference of discrete and continuous factors captures variability across and within cell types

We developed mixture model inference with discrete-coupled autoencoders (MMIDAS), an unsupervised variational framework that jointly learns discrete clusters and continuous cluster-specific variability. When applied to unimodal or multimodal single-cell omic data, MMIDAS learned single-cell representations with robust cell type definitions and interpretable, continuous within-cell type variability.

Research Briefing23 Sept 2024
AI-recognized mitochondrial phenotype enables identification of drug targets

Revealing a drug’s mechanism of action (MOA) is costly and time-consuming. In this study, we used deep learning to extract temporal mitochondrial phenotypic features after exposure to drugs with known MOAs using re-identification algorithms. The trained model could then predict the MOAs of unidentified substances, facilitating phenotypic screening-based drug discovery and repurposing.

Research Briefing22 Aug 2024
A machine learning tool to efficiently calculate electron–phonon coupling

A machine learning framework that uses atomic orbital-based Hamiltonian matrices and gradients predicted by an equivariant graph neural network is established to calculate electron–phonon coupling (EPC). This approach accelerates the calculations by several orders of magnitude, enabling EPC-related properties to be predicted for complex systems using highly accurate functionals.

Research Briefing08 Aug 2024
A multi-task learning strategy to pretrain models for medical image analysis

Pretraining powerful deep learning models requires large, comprehensive training datasets, which are often unavailable for medical imaging. In response, the universal biomedical pretrained (UMedPT) foundational model was developed based on multiple small and medium-sized datasets. This model reduced the amount of data required to learn new target tasks by at least 50%.

Research Briefing19 Jul 2024
Boosting graph neural networks with virtual nodes to predict phonon properties

A graph neural network using virtual nodes is proposed to predict the properties of complex materials with variable dimensions or dimensions that depend on the input. The method is used to accurately and quickly predict phonon dispersion relations in complex solids and alloys.

Research Briefing16 Jul 2024