Single-cell sequencing reveals cellular heterogeneity, but its analysis is complicated by technical noise and batch effects. Here, the authors present CellFM, an 800-million-parameter foundation model trained on 100 million human cells using the MindSpore framework. CellFM outperforms existing models on downstream tasks.
- Yuansong Zeng
- Jiancong Xie
- Yuedong Yang