Table 1 Datasets used in this paper

From: Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets

Dataset

Data type

Details

References

Mass41parameter

Mass cytometry

41 parameter dataset (14 lineage parameters used for embedding) of 1 million datapoints concatenated from 5 samples of human bone marrow cells

46

Flow18parameter

Flow cytometry

18 parameter dataset (11 lineage parameters used for embedding) of 1 million datapoints concatenated from 2 samples of human PBMC

25,47

Flow20M

Flow cytometry

18 parameter dataset (15 lineage parameters used for embedding) of 20 million datapoints concatenated from 27 samples of human PBMC

25,47

10X Genomics

scRNA-seq

Single cell gene expression data of E18 mouse brain pre-processed into 20 PCA projections used for t-SNE embedding

https://support.10xgenomics.com/single-cell-gene-expression/datasets and ref. 26

van Unen et al.

Mass cytometry

32 parameter dataset (26 lineage parameters used for embedding) of 5.22 million datapoints concatenated from 104 samples of PBMC and gut biopsy cells

7