Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Deep learning training dynamics analysis for single-cell data

Inspired by recent approaches for natural language processing and computer vision, we developed Annotatability, a framework that analyzes deep neural network training dynamics to interpret pre-annotated single-cell and spatial omics data. Annotatability identified erroneous annotations and ambiguous cell states, inferred trajectories from binary labels, and revealed underlying biological signals.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Interpreting single-cell data using Annotatability.

References

  1. Kim, T. et al. scReClassify: post hoc cell type classification of single-cell rNA-seq data. BMC Genom. 20, 913 (2019). The paper discusses challenges in cell type identification from scRNA-seq data, focusing on the potential for mislabeling due to reliance on human expertise and computational methods.

    Article  Google Scholar 

  2. Burkhardt, D. B. et al. Quantifying the effect of experimental perturbations at single-cell resolution. Nat. Biotechnol. 39, 619–629 (2021). A method developed to quantify the effects of perturbations at the single-cell level.

    Article  Google Scholar 

  3. Swayamdipta, S. et al. Dataset cartography: mapping and diagnosing datasets with training dynamics. In Proc. 2020 Conference on Empirical Methods in Natural Language Processing (eds Webber, B. et al.) 9275–9293 (Association for Computational Linguistics, 2020). A paper that presents ‘data maps’ in the context of natural language processing and used training dynamics to assess dataset quality, which revealed ambiguous, easy-to-learn and hard-to-learn examples.

  4. Gayoso, A. et al. A Python library for probabilistic analysis of single-cell omics data. Nat. Biotechnol. 40, 163–166 (2022). Annotatability used the 10x Genomics PBMC single-cell dataset, integrated and pre-processed by this library (scvi-tools), to reveal both errors and ambiguities in the original cell type annotations.

    Article  Google Scholar 

  5. Tritschler, S. et al. A transcriptional cross species map of pancreatic islet cells. Mol. Metab. 66, 101595 (2022). Annotatability used this single-cell dataset to quantify the damage to individual mSTZ β cells, infer disease-related cell states and dissect treatment responses for individual cells.

    Article  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Karin, J. et al. Interpreting single-cell and spatial omics data using deep neural network training dynamics. Nat. Comput. Sci. https://doi.org/10.1038/s43588-024-00721-5 (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Deep learning training dynamics analysis for single-cell data. Nat Comput Sci 4, 886–887 (2024). https://doi.org/10.1038/s43588-024-00728-y

Download citation

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s43588-024-00728-y

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing