Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Machine learning framework reveals a concordant cell-state landscape across single-cell datasets

Comprehensively resolving the cell state landscape requires integrating single-cell omics data from diverse studies. We developed CONCORD, a contrastive learning framework that leverages principled mini-batch sampling to learn denoised, batch-integrated and high-resolution representations of cells, capturing intricate structures such as differentiation trajectories and cell-cycle loops across numerous biological contexts.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: The CONCORD framework.

References

  1. Flores-Bautista, E. & Thomson, M. Unraveling cell differentiation mechanisms through topological exploration of single-cell developmental trajectories. Preprint at bioRxiv https://doi.org/10.1101/2023.07.28.551057 (2023). This preprint demonstrates that complex topological structures in gene expression space reflect meaningful biological processes.

  2. Luecken, M. D. et al. Benchmarking atlas-level data integration in single-cell genomics. Nat. Methods 19, 41–50 (2022). This paper systematically evaluates single-cell data integration methods using standardized metrics.

    Article  PubMed  Google Scholar 

  3. Chen, T., Kornblith, S., Norouzi, M. & Hinton, G. A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning 1597–1607 (2020). This paper introduces the SimCLR framework that underpins many modern contrastive learning methods.

  4. Robinson, J., Chuang, C.-Y., Sra, S. & Jegelka, S. Contrastive learning with hard negative samples. In International Conference on Learning Representations (2021). This paper shows that modifying mini-batch composition with hard-negative sampling improves contrastive learning performance.

  5. Large, C. R. et al. Lineage-resolved analysis of embryonic gene expression evolution in C. elegans and C. briggsae. Science 388, eadu8249 (2025). This paper presents a lineage-resolved cross-species atlas of embryogenesis, providing a ground-truth resource for benchmarking data integration methods.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Zhu, Q. et al. Revealing a coherent cell-state landscape across single-cell datasets with CONCORD. Nat. Biotechnol. https://doi.org/10.1038/s41587-025-02950-z (2026).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Machine learning framework reveals a concordant cell-state landscape across single-cell datasets. Nat Biotechnol (2026). https://doi.org/10.1038/s41587-025-02978-1

Download citation

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1038/s41587-025-02978-1

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing