Abstract
Non-human primates are attractive laboratory animal models that accurately reflect both developmental and pathological features of humans. Here we present a compendium of cell types across multiple organs in cynomolgus monkeys (Macaca fascicularis) using both single-cell chromatin accessibility and RNA sequencing data. The integrated cell map enables in-depth dissection and comparison of molecular dynamics, cell-type compositions and cellular heterogeneity across multiple tissues and organs. Using single-cell transcriptomic data, we infer pseudotime cell trajectories and cell-cell communications to uncover key molecular signatures underlying their cellular processes. Furthermore, we identify various cell-specific cis-regulatory elements and construct organ-specific gene regulatory networks at the single-cell level. Finally, we perform comparative analyses of single-cell landscapes among mouse, monkey and human. We show that cynomolgus monkey has strikingly higher degree of similarities in terms of immune-associated gene expression patterns and cellular communications to human than mouse. Taken together, our study provides a valuable resource for non-human primate cell biology.
Similar content being viewed by others

Introduction
Non-human primates (NHP) are phylogenetically close to humans and show various human-like characteristics, including genetics, organ development, physiological function, pathological response and biochemical metabolism. Hence NHP are extremely valuable as experimental animal models for medical research and drug development1. Macaca fascicularis (cynomolgus monkeys) are such excellent NHP models for biomedical research2. Since cells are the fundamental unit of all life, direct comparison of cell identities and cell-type compositions in organs with a similar function between organisms would help to transfer knowledge in primates to medical research. In this regard, it is of vital importance to understand the cellular composition and heterogeneity of organs in the primate model like cynomolgus monkeys.
Rapid advances in single-cell multi-omics technologies have enabled molecular quantification of thousands of cells at once, leading to meticulous insight into organ compositions and molecular mechanisms driving cellular heterogeneity3. Previous studies4,5,6,7,8 have mapped the single-cell landscapes across multiple organs in humans and mice, expanding our knowledge about the cellular heterogeneity underlying normal development and aging. Three-dimensional multicellular culture systems combined with single-cell transcriptome sequencing technology enable to chart the cellular and molecular dynamic changes during organ growth and development9,10,11. In addition, extensive efforts10,11,12,13 have been achieved to investigate how cells are perturbed in various disease conditions, including cancer and neurological disorders.
Mice have long been used as a representative model organism for mammalian development and physiology. Recently, extensive comparative analyses based on single-cell transcriptomics data have shown that both cell types and associated gene regulatory networks are conserved between human and mouse4,14,15,16,17,18,19,20,21,22,23, which provides a new perspective for disease mechanism interpretation and intervention. However, it has been widely recognized that there are significant differences between mice and humans in terms of development and physiology24. For example, single-cell transcriptomic analysis revealed remarkable similarities and differences in lineage marker-associated gene expression and regulatory networks during spermatogenesis19 and embryogenesis21 between human, cynomolgus, and mouse. In a recent study, comparison of cell subsets of colon neurons in the human and mouse enteric nervous system (ENS) revealed differences on the basis of transcriptional programs and neuro-immune interactions22. The mapped differences may reflect divergent adaptations to feeding behavior by ENS between human and mouse. Besides, although the architecture and regulatory role of the immune system is conserved between mouse and human, it is notoriously difficult to translate the immunological principles from laboratory mice to humans2, partially due to their basic immunological differences25 and/or the immunological immaturity of the laboratory mouse26. In short, these observed differences may limit the immediate translational value of findings from the mouse model to biomedical research.
Primate experiments are more valuable as they can better simulate human diseases and promote scientific research owing to high genetic similarity between primates and humans27. Although the potential importance and values of NHP models in basic research are indispensable, an organism-wide single-cell atlas is still pending for primates.
Here, we present a compendium of single-cell regulomic and transcriptomic data in cynomolgus monkeys that comprises 40 distinct cell types from 16 representative organs and tissues, greatly extending our current view28,29,30 of single-cell landscapes in this model species. This cell atlas—which we denote ‘Monkey Atlas’—represents a new resource for cynomolgus monkey cell biology.
Results
Mapping a cynomolgus monkey multi-organ cell atlas by multi-omic analysis
To generate a reference cell map of monkey, we performed both single-cell RNA sequencing (scRNA-seq; 10x Genomics; n = 174,233 cells) and scATAC-seq (10x Genomics; n = 66,566 cells) in 16 tissues and organs from one male or/and one female cynomolgus monkeys (Fig. 1a, Supplementary Fig. 1 and Supplementary Table 1). We integrated all of the scRNA-seq data using canonical correlation analysis (CCA)31 to correct for potential batch effects. Unsupervised clustering based on t-distribution stochastic neighbor embedding (t-SNE) resolved major cell types, including epithelial, ciliated epithelial, mesenchymal, immune, endothelial, spermatid, and secretory cell populations. These cells could be annotated as 40 transcriptionally distinct clusters with cluster-specific markers (Fig. 1b, c and Supplementary Table 2). Due to technical and financial constraints, not every organ was analyzed in each monkey or by both data modalities. Nevertheless, the overall gene expression patterns and cell compositions for the same organ or functional related organs (e.g., stomach, liver, spleen, and colon from the digestive system) are generally consistent (Supplementary Figs. 2, 3), highlighting the reproducibility of our data. In particular, the analysis of multiple organs from the same monkey enables us to obtain data that are controlled for uncertain effects (such as age, sex, diet, environment, and so on).
a Schematic illustration of the workflow. Single-cell transcriptome (scRNA-seq) analysis was conducted in 16 organs and chromatin accessibility (scATAC-seq) analysis in seven organs from adult cynomolgus monkeys. b Cell type identification based on scRNA-seq data. In total 17 major cell types and 40 cell subtypes were identified. T-SNE visualizations of cells were colored either by the major cell types (left) or by the cell subtypes (right). c Heatmap showing the scaled expression levels of cell type-specific marker genes (left). Expression patterns of 18 representative marker genes are shown on the right. d T-SNE illustration of cell type annotation based on scATAC-seq data analysis. Ten major cell types were identified from 66,566 cells. Cells were colored by major cell types (the top panel) or by organs (bottom left) in the t-SNE plots. The bottom right t-SNE plot shows the clustering of cells based on scRNA-seq data for the seven organs with matched RNA-ATAC samples. e Sankey diagram showing the consistence of cell type annotations between scRNA-seq and scATAC-seq data. Only the seven organs with matched RNA-ATAC data were used for the analysis. The color code refers to (d).
The scATAC-seq experiment was performed in relatively few organs (n = 7), including liver, colon, uterus, spleen, lung, heart, and kidney. To define cell types based on scATAC-seq data in these organs, we created a count matrix of fragments across the genome. We demonstrated the overall high quality of scATAC-seq data based on the enrichment analysis of accessible DNA sequences relative to the transcriptional start site (TSS) and the size distribution of unique fragments (Supplementary Fig. 4). After batch effect correction with Harmony32 as implemented in ArchR33 (Supplementary Fig. 5a), t-SNE clustering analysis of the integrated scATAC-seq data revealed ten major cell types (Fig. 1d), which were annotated based on chromatin accessibility at the promoter regions of well-characterized marker genes (Supplementary Fig. 6).
To confirm the consistency of scATAC-seq and scRNA-seq data for cell type annotation, we first performed cross-modality integration analysis in organ-matched samples between RNA and ATAC (n = 7 organs) using a mutual nearest neighbors (MNNs) approach (Supplementary Fig. 5b–e; see “Methods”). We then assigned the cell type of each clusters using labels annotated from scRNA-seq data and identified twelve major cell types in the RNA-ATAC integration (Supplementary Fig. 5f). All major cell types were successfully recovered except the plasma cell (Supplementary Fig. 5b, f). In this manner, cell types of ATAC clusters can be predicted from RNA clusters on the basis of cross-modality integration (Supplementary Fig. 5g). We found that cell types identified by scRNA-seq and scATAC-seq data are highly consistent (Fig. 1e), with ten ATAC clusters one-to-one linked to RNA clusters (Supplementary Fig. 5h). Collectively, the above results highlight the quality of the dataset.
Heterogeneity and dynamics of main cell components across organs
Epithelial cells account for the largest part in the integrated transcriptomic cell map. To dissect epithelial heterogeneity, we extracted epithelial cells and performed unsupervised sub-clustering analysis (Fig. 2a). The analysis identified 14 epithelial cell clusters (E01-E14), including basal cells, secretory cells, ciliated cells and non-ciliated cells, according to the unique pattern of marker gene expression (Fig. 2a, b). As expected, secretory cells were mostly found in epidermis or parenchymatous organs such as spleen, kidney, colon and uterus (Fig. 2c). It is worth noting that there are a large proportion of ciliated epithelial cells in various tissues (Fig. 2c and Supplementary Fig. 7a, b). To explore the developmental and functional dynamics of ciliated epithelial cell subtypes, we performed pseudotime trajectory analysis using both Monocle34 and RNA velocity35 in organs with relatively more ciliated epithelial cells, including bladder, breast, kidney and uterus (Fig. 2d). We observed similar developmental trajectories of ciliated epithelial cells in the investigated organs, differentiation from a progenitor-like cell state to a mature state (Fig. 2e and Supplementary Fig. 7c), suggesting that ciliated epithelial cells may share a common differentiation trajectory in different organs. Accordingly, highly expressed genes along the pseudotime were enriched in gene ontology (GO) terms related to metabolic processes, cellular response to stimulus and defense response in a sequential manner (Fig. 2f, g). The analysis is in agreement with the knowledge that ciliated epithelial cells play an important role in cleaning pathogenic microorganisms and signal transduction36.
a Distribution of 14 epithelial cell subtypes on the UMAP. b Dot plot shows representative marker genes across the epithelial subtypes. The dot size is proportional to the fraction of cells which express specific genes. The color scale corresponds to the relative expression of specific genes. c Chord diagram showing the mapping of cells between major cell types and organs. The width of the arrow represents the proportion of cells in a given cell type. d UMAP showing the distribution of ciliated cell subtypes in bladder, breast, kidney and uterus. e Monocle2 detects semi-supervised pseudo-temporal trajectories of ciliated cell subtypes in bladder, breast, bladder and uterus. The trajectory is colored by cell subtypes (top) or pseudotime (bottom). f Heatmap showing the enrichment of pathways in epithelial subtypes of different organs based on GSEA analyses. Box plots show the distribution of estimated pesudotime of epithelial cells by Monocle2. The boxes indicate the 25% quantile, median (horizontal line), 75% quantile and Tukey-style whiskers (beyond the box). g Box plots showing the enrichment scores of metabolic-related pathways (n = 3), stimulus-related pathways (n = 4), and defense-related pathways (n = 3) in different epithelial subtypes of different organs. Each point indicates a specific pathway. The boxes indicate the 25% quantile, median (horizontal line), 75% quantile.
Stromal cells are an important component of body tissues37. In the stromal compartment, we identified 11 clusters (S01-S11) belonging to four major cell types, including endothelial cells, fibroblasts, FibSmo cells and smooth muscle cells (Fig. 3a–c and Supplementary Fig. 8a). Although these cell clusters were generally identified in all organs, the heterogeneity of stromal cells was observed in different organs (Fig. 3d, e and Supplementary Fig. 8b). Most mesenchymal cells were generated from kidney; almost all fibroblasts in testis were from S05 (DCNhighAPODhigh fibroblasts); there were a large number of TAGLNhighMUSTN1low smooth muscle cells in the aorta organ but few other smooth muscle cells were enriched in this organ (Fig. 3d). Considering that stromal cells have a certain differentiation potential38, we applied RNA velocity analysis to explore developing states of stromal cells (Fig. 3f). The results suggest that fibroblasts might have the capacity of differentiation to smooth muscle cells and endothelial cells (Fig. 3g, h and Supplementary Fig. 8c–e). GO enrichment analysis revealed that fibroblasts highly expressed genes involved in collagen metabolic or collagen catabolic processes (Fig. 3i). In fact, the cluster S06 may be response for the collagen metabolic function since cells in this cluster highly expressed collagen-associated genes (Supplementary Fig. 8f) including type I collagen genes COL1A1 and COL1A2 (Fig. 3b). Accordingly, gene signature scores of the collagen metabolic pathway were significantly higher (Mann–Whitney test, p-value < 0.05) in S06 than other cell clusters (Supplementary Fig. 8g). Furthermore, to explore the difference of the metabolic capacity of the fibroblasts cluster S06 in different organs, we analyzed the expression pattern and signature scores of genes from the collagen metabolic pathway in the 16 different organs (Supplementary Fig. 8f, h). We found that organs such as adipose, aorta, uterus, bladder, and colon showed relatively higher expression of collagen metabolic genes as well as gene signature scores than other organs, suggesting that the metabolic capacity of fibroblasts (S06) varies in different tissues. Therefore, we speculate that the high potential metabolic capacity of fibroblasts (S06) may help to synthesize and to secrete collagen.
a Distribution of eleven stromal cell subtypes on the UMAP. b Dot plots showing representative marker genes across the stromal subtypes. Dot size is proportional to the fraction of cells expressing specific genes. Color intensity corresponds to the relative expression of specific genes. c Feature plot showing the expression of selected marker genes. d Bar plot showing the percentage of cell subtypes in each organ. e Distribution of stromal cells from different organs on the UMAP. f Unsupervised pseudotime trajectory of the subtypes (S01-S11) of stromal cells by RNA velocity analysis. Trajectory is colored by cell subtypes. The arrow indicates the direction of cell pseudo-temporal differentiation. g UMAPs showing the pseudotime differentiation trajectories of fibroblasts, smooth muscle cells and endothelial cells respectively. h Heatmap showing the scaled expression levels of cell-type-specific marker genes along the pseudotime differentiation trajectory. Examples of marker expression are shown in the right UMAPs. i Barplot showing the enrichment of functional pathways in fibroblasts (cluster S06). j, k UMAP plot and Box plot showing the distribution of gene signature scores estimated by UCell based on annotated genes (n = 24) from the collagen metabolic pathway. The boxes indicate the 25% quantile, median (horizontal line), 75% quantile and Tukey-style whiskers (beyond the box).
Immune cells are essential for maintaining body homeostasis39. We identified 72,284 immune-related cells from the investigated organs, including B cells, T cells, and myeloid cells. These cells were further grouped into 13 major clusters (I01-I13) based on known or novel gene signatures (Fig. 4a, b and Supplementary Fig. 9a, b). Although the annotated immune cell clusters can be found in all organs (Fig. 4c, d), the relative proportion of immune cells varied greatly in different organs. For example, we noticed that the proportion of NKT_cell_CD3Dhigh_GZMKhigh_GZMBhigh cells largely varied between muscle and other organs (Fig. 4e). We therefore analyzed the differentially expressed genes between muscle and other organs in the NKT_cell_CD3Dhigh_GZMKhigh_GZMBhigh cells. We observed that mitochondria-related genes (ATP6, COX3 and ND1) were top differentially expressed genes in common among all the pairwise comparisons (Fig. 4f and Supplementary Fig. 9c). This suggests that the NKT_cell_CD3Dhigh_GZMKhigh_GZMBhigh cells may have an energy metabolism function40.
a Distribution of 13 major immune cell types on the UMAP. b Dot plots shows representative top marker genes across the immune subtypes. c Distribution of immune cells from different organs on the UMAP. d Chord diagram mapping cells between cell types and organs. The width of the arrow represents the proportion of cell types. e Bar chart showing the proportion of cells from different organs for each cell subtype. f Scatter plots showing differentially expressed genes (DEGs) between muscle and other organs. Each dot represents a DEG, and its size is proportional to the fold change.
Dynamics of cell–cell interactome
To decipher the dynamics of intercellular communications in different organs, we employed CellPhoneDB41 to identify potential ligand-receptor pairs among the major cell types. We observed that there are strong intercellular interactions among stromal cells, epithelial cells, and myeloid cells (Fig. 5a, b). Generally, the intensity and pattern of intercellular interactions were organ-specific (Fig. 5c). For example, the organs of tongue and uterus showed stronger cellular interactions, while intercellular interactions in testicular were weaker than other organs (Supplementary Figs. 10, 11). To chart the rewiring of molecular interactions regulating cell–cell interactions, we mapped ligand-receptor pairs in specified cell subpopulations in different organs (Fig. 5d). In brief, the “CD99-PILRA” ligand-receptor pair was specific in the interactions between stromal cells and myeloid cells, particularly in adipose, aorta, and colon. As an inhibitory receptor of immunoglobulin-like type 2 receptor (PILR), PILRA has been shown to bind to the CD99 ligand for immune regulation42. The “CCL4L2-VSIR” pair occurred exclusively in the interaction of myeloid cells and T cells. In contrast, the “LGALS9-CD44” pair contributed to most immune cell-related interactions. Accordingly, CD44 plays a role in innate immunity and subsequent adaptive responses, and has extensive inflammatory and proliferative effects on a variety of cell types43,44. Taken together, these results reveal the potential molecular mechanisms underlying cell–cell communication in various monkey organs.
a Chordal diagram of the integrated cell–cell interaction network among the major cell types. b Heatmap showing the interaction intensity of cellular interactome from (a). Block sizes and colors are proportional to the interaction frequency. c Sankey diagram demonstrating the cell–cell interactions of different cell types in 16 organs. The thickness of lines represents the strength of cell–cell interactions. d Ligand–receptor interactions between selected cell subtypes in different organs. Each row represents a ligand-receptor pair, and each column defines a pair of cell–cell interaction. P-values were calculated by CellPhoneDB without multiple comparisons.
Single-cell chromatin landscape of major organs in cynomolgus monkey
To deconstruct the gene regulation principles of complex tissues in cynomolgus monkey, we examined the single-cell chromatin accessibility landscape of major organs including colon, kidney, lung, uterus, heart, liver, and spleen by scATAC-seq. In total, we generated open chromatin profiles from 66,566 individual cells after quality control. We identified 22 distinct cell clusters in the integrated cell map according to cluster-specific cis-elements and visualized the single-cell profiles with uniform manifold approximation and projection (UMAP) (Fig. 6a and Supplementary Fig. 12a). For example, clusters 1-4 demonstrated accessibility at cis-elements neighboring B cell genes, such as CD22, MS4A1, and TNFRSF13C, while the cluster 22 exhibited accessibility at cis-elements neighboring T cell genes, such as CD3D and IL7R (Supplementary Fig. 6). We detected 397,773 cis-elements across all cell clusters, ranging from 3,046 to 75,001 peaks in each cluster (Fig. 6b). As expected, most of cis-regulatory elements (CREs or ATAC peaks) were derived from promoters, intronic or distal intergenic regulatory regions. We observed that most cell clusters were organ-specific (Fig. 6c) and cluster-specific CREs exhibited organ-specific accessibility accordingly (Fig. 6d), suggesting that different cell types and organs have distinctive chromatin landscapes.
a scATAC-seq data analysis revealing 66,566 cells and 22 cell subtypes. Shown is the t-SNE of cells colored by cell subtypes. b Bar plot showing the number of reproducible peaks identified from each cluster. The peaks are classified into four categories: distal, exonic, intronic and promoter. c Bar plot dhowing the percentage of cell subtypes in each organ. d Heatmap of 80,270 marker peaks across 22 subclusters identified by bias-matched differential testing (FDR < 0.01 and log2FC > 3). e Heatmap illustrating the chromatin accessibility and gene expression of 52,229 significantly (Pearson correlation r > 0.45 and adjusted p-value < 0.1, provided by chromVAR) linked peak-gene pairs. f Profile of POU2F2 and TCF21 chromatin accessibility, gene expression (inferred from scRNA-seq) and TF motif activity. g Visualization of the EGFL7 locus with the maximum number of peak-gene pairs shown by genome browser track (chr15: 1,772,849-1,832,850).
Comparison analysis of scATAC-seq and scRNA-seq data highlighted concordant patterns of chromatin accessibility and gene expression across clusters, as exemplified by marker genes (POU2F2 and TCF21) in specific cell types (Fig. 6f). We also computed the transcription factor (TF) deviation score (namely motif activity) using chromVAR45, which measured the accessibility of TF binding “footprint” at genome-wide in each single cell. Indeed, the motif activity of POU2F2, a B-cell-specific TF involving in cell immune response by regulating B cell proliferation and differentiation46,47, was increased in the B cell cluster (Fig. 6f and Supplementary Fig. 12b). Similarly, the motif activity for TCF21, an essential regulator of fibroblasts in development48, was increased in the fibroblast cell cluster (Fig. 6f and Supplementary Fig. 12b). Furthermore, we applied Cicero49 to identify co-accessible cis-elements at genome-wide. As exemplified at the gene locus of EGFL7, we observed increased enhancer-promoter connections in the endothelial cell cluster at its promoter (Fig. 6g). The result is consistent with the role of EGFL7 as an endothelium-specific secreted factor mostly produced by blood vessel endothelial cells during development50,51,52.
To explore the gene regulatory programs in a specific organ, we integrated the gene accessibility and gene expression data in colon using ArchR33. Based on the integration, cell type annotations from scRNA-seq were transferred to scATAC-seq using the latent semantic analysis (LSI) (Fig. 7a–c). Therefore, nine cell types were predicted with a high accuracy in scATAC-seq data (Fig. 7c and Supplementary Fig. 13a, b). Two rare cell types, monocyte and cycling B cells, were found in RNA clusters but not in ATAC clusters. This discrepancy might be due to the integration algorithm not sensitive enough to rare cells or cells with close states. We observed that most of ATAC peaks (CREs) demonstrated differential accessibility among cell clusters (Supplementary Fig. 13c), confirming distinctive chromatin landscapes in different cell types. The identities of cell clusters were determined according to the gene-activity score of known cell-specific markers (Fig. 7d). For example, the TF HNF4A is a crucial regulator for enterocyte cell identity53,54. Motif analysis revealed that different TF binding motifs showed different degree of enrichment among cell clusters, with the motif enrichment of HNF4A and HNF4G showing the largest variance (Supplementary Fig. 13d, e).
a UMAP plot showing the cell types identified by scRNA-seq data in colon. b UMAP plot showing the joint clustering of scRNA-seq (blue) and scATAC-seq (red) data in colon. Cells in the right UMAP are colored based on cell types annotated by scRNA-seq data. c UMAP plot showing the colon scATAC-seq cell types, which are inferred by scRNA-seq data. Note that cycling B cell and monocyte are not captured in scATAC-seq assay. d The heatmap showing the enrichment of the TF motif in cell-type-specific peaks. e Dot plot showing the identification of positive TF regulators through correlation of chromVAR TF deviation scores and gene expression in cell groups (Pearson correlation r > 0.5, adjusted p-value < 0.01 and deviation difference in the top 25th percentile). f TF footprint for the HNF4A and HNF4G motif with the subtracting the Tn5 bias normalization method. g Profile of the HNF4A and HNF4G gene chromatin accessibility, gene expression (inferred from scRNA-seq), and TF motif activity. h The dynamic of chromatin accessibility (left) and gene expression (right) of the JCHAIN gene along the B cell pseudo-time trajectory. i Genome track visualization of the JCHAIN locus (chr5: 63,526,764–63,676,765). Inferred peak-to-gene links for distal regulatory elements are shown below. j Heatmap showing the positive TF regulators for which gene expression is positively correlated with TF deviation (inferred by chromVAR) across the B cell trajectory.
CREs and TFs make up the regulatory logic determining the cell state transition55,56. To study CREs for cell type-specific transcriptional programs, we identified a total of 44,156 CRE-gene pairs (peak-to-gene links) in the ATAC-RNA integration cell clusters (Supplementary Fig. 13f). Similar to the observation of cell-specific CREs (Supplementary Fig. 13c), 36% of the identified CRE-gene pairs were also specific to a particular cell type (Supplementary Fig. 14a). Interestingly, we found that CRE-target genes significantly (hypergeometric test, p-value < 0.001) overlapped with the differentially expressed genes (DEGs) in each cell type (Supplementary Fig. 14b), supporting the notion that CREs regulate the transcription of target genes. In general, CREs regulated multiple genes, and the number of CREs per target gene was positively corrected with the CRE-gene association especially for local associations (Supplementary Fig. 14c–e). These observations suggest that organ development is subjected to complex regulation via multiple CREs, in line with recent studies57,58.
Next, we correlated the motif activity with the expression level of corresponding TFs to systematically deduce either positive or negative TFs controlling colon development based on whether TF expression was positively or negatively correlated with their motif enrichment. Top 30 representative activators were highlighted in Fig. 7e, including cell identities such as HNF4A, HNF4G and CDX2 for enterocyte cells (Fig. 7f), POU2F2 and BCL11A for B cells, and pioneer factors FOXA2, FOXA3 and KLF459 that play crucial roles in cell fate specification. Generally, the expression of these factors was closely correlated with the accessibility of open chromatin regions and their binding motif enrichment (Fig. 7g and Supplementary Fig. 15).
Furthermore, we sought to demonstrate the power of scATAC-seq data for reconstructing cellular developmental trajectories in colon—the analysis would allow to identify key regulators for organ development at a cell-type level. We focused on the three B cell-related clusters for trajectory analysis since the B-cell developmental trajectory is a well-defined differential programme in other model species60,61,62,63 that could be used to compare regulatory mechanisms found in monkey. We set the B cell cluster highly expressing BCL11A as the start of the trajectory because BCL11A has been implicated in early B lymphopoiesis60,64,65,66 (Fig. 7h). We analyzed the dynamics of gene scores and expression, CRE accessibility as well as TF motif enrichment across the differentiation trajectory (Supplementary Fig. 13g). For example, we observed coordinated sequential activation of JCHAIN based on both the gene score and gene expression across the trajectory (Fig. 7h). Consistently, the CRE accessibility in the promoter and distal enhancers of JCHAIN gradually increased from the early to the intermediate and then to the effector states (Fig. 7i). To identify positive TF regulators driving B-cell differentiation, we integrated motif accessibility with similarly dynamic gene scores or gene expression patterns across the trajectory. We found that BCL11A, IRF9, PAX5, EBF1, POU2F2 and FOS, factors that are critical for B cell lineage specification47,60,67, showed sequential activities (Fig. 7j).
Overall, our scATAC-seq data provide a rich resource for unbiased discovery of cell types and regulatory DNA elements in cynomolgus monkey.
Cell-type-specific- and organ-specific transcriptional gene regulatory networks
To infer cell-type and organ-specific transcriptional regulatory networks in a systemic manner, we applied SCENIC4 to identify TF regulons based on co-expression and motif enrichment. We identified several TF regulon modules that were active in either cell-type (n = 8; Supplementary Fig. 16a, b) or organ-specific manners (n = 6; Fig. 8a, b). Subsequently, we analyzed representative TF regulons across different cell types (n = 7; Supplementary Figs. 16c, 18) or different organs (n = 16; Fig. 8c and Supplementary Fig. 17). The identified TF regulons were highly cell-type or organ-specific based on regulon activity scores (Fig. 8d and Supplementary Fig. 16d). Finally, the representative TF regulons and their associated target genes were organized into cell-type-specific or organ-specific gene regulatory networks (Fig. 8e and Supplementary Fig. 16e).
a Identification of regulon modules using SCENIC. Heatmap (left) shows the similarity of different regulons (n = 87) based on the AUCell score. Eight regulon modules were identified based on regulon similarity. UMAPs (right) illustrate the average AUCell score distribution for different regulon modules (in different colors). b Wordcloud plots showing enrichment of organs in different regulon modules. c Representative transcription factors (TFs) and corresponding TF binding motifs in different regulon modules. d Heatmap showing TFs enriched in different organs. Color depth represents the level of regulon-specific scores. e Integrated gene-regulatory networks of the regulons. Regulon-associated TFs are highlighted in blue rectangles and target genes in circles. Target genes (in circles) are colored according to their highly expressed organs. f Heatmap showing gene-activity scores of marker genes in the indicated organs. g Violin plots showing motif activity (measured by TF chromVAR deviations) of TF regulons highlighted in (f). Box plots indicate the median (horizontal line), second to third quartiles (box).
In the cell-type-specific gene regulatory networks, we observed that CREM specifically controlled proliferation-related target genes such as DAZL and HEY2 in spermatid cells68. Immune-related TFs such as IRF2, FLI1 and IK2F3 were shown to regulate immune cell identity genes such as S100A4 and CD4869,70,71. FEV, a known TF that regulates the development of hematopoietic stem cells72, extensively link to target genes actively expressed in immune cells and epithelial cells (Supplementary Fig. 16e).
In the organ-specific gene regulatory networks, we found that target genes of ZNF770 and CTCF were specifically expressed in tongue. The spermatid cell-specific TF regulon CREM regulated genes actively expressed in testis, consistent with its role in spermatid development68. ETS-factors (ELK3, ERG, and FLI1) together with pre-/immature-B TFs (POU2F2) positively regulated genes showed elevated activity in heart and muscle73,74 (Fig. 8e).
To further confirm the unbiased inference of organ-specific TF regulons based on scRNA-seq data (Fig. 8d), we validated the organ-specific TF regulons using organ-matched scATAC-seq data. Chromatin accessibility were measured at cis-elements containing a specific TF binding motif using chromVAR45 and accessibility changes were analyzed in binding sites for the above-identified organ-specific TFs (denoted as TF deviation scores). In general, TF deviation scores showed similar organ-specific patterns to regulon activity scores (Fig. 8f). For example, the HOXD8 regulon was kidney-specific and the TF deviation scores of HOXD8 showed relatively high levels in kidney (Fig. 8g). Interestingly, the CRE accessibility and the expression level of the gene HOXD8 itself exclusively increased in kidney (Supplementary Fig. 19a, b). These observations indicate that HOXD8 may be important for regulating the kidney function. Supporting this line of notion, the defection of HOXD8 is reminiscent of polycystic kidney disease in mouse75. We also observed that both the regulon activity and the TF deviation scores of ONECUT1 were liver-specific (Fig. 8g). ONECUT1 has been shown to play an important role in liver development and was tightly linked to hepatic TF networks that include FOXA376,77,78. These analyses emphasize the unbiased prediction of organ-specific gene regulatory networks at the single-cell level.
Comparison of cell landscapes among human, mouse and cynomolgus monkey
The cynomolgus monkey cell landscape offers the opportunity to compare the cellular components and transcriptomic dynamics across species with similar organ compositions. Here we integrated scRNA-seq data from the non-human primate cynomolgus monkey (by our study), human4 and mouse6 with matched organs/tissues using orthologous genes for cross-species analysis (Fig. 9a and Supplementary Fig. 20a, b). The integrated cell map consists of 338,932 cells (Fig. 9b), which were grouped into 15 major cell types (Fig. 9c). While the proportions of non-immune cell types were generally stable in the three species, the compositions of certain types of immune cells largely varied among the species (Fig. 9d). In particular, monocytes were rare cells but exclusively detected in monkey. We could rule out the potential bias due to cross-species integration analysis by checking the cell-type annotation in the original cell maps (without integration) of each species (Supplementary Fig. 21 and Supplementary Fig. 22). Instead, the discrepancy might be related to a bias using different platforms to capture rare cell types or selecting sample parts for sequencing. Nevertheless, the expression patterns of representative marker genes and transcriptomic similarity of cell types were overall consistent across species (Fig. 9e, f and Supplementary Fig. 20c, d). Indeed, the gene expression patterns of the major cell types were conserved in all three species, including immune, stromal and epithelial cells (Fig. 9g). This observation is consistent with previous single-cell comparative analyses4,16,17,18. As expected, monkey and human showed significantly higher (Mann–Whitney test, p-value < 0.001) cell-type similarity in terms of orthologous gene expression than other comparisons (Fig. 9h). Intriguingly, immune-related cell types (such as macrophages, B cells and plasma cells) showed higher similarities of gene expression between human and monkey than non-immune cells (such as ciliated and epithelial cells). In contrast, stromal cells (including fibroblasts and FibSmo cells) demonstrated the highest similarities in human-mouse and monkey-mouse comparisons (Fig. 9h and Supplementary Fig. 20g). These findings indicate that monkeys share overall highly similar transcriptional programs in the immune system with human, and thus may provide an ideal benchmark for investigating the immune response to diseases such as cancer79 and the current coronavirus disease 2019 (COVID-19)80,81,82,83,84,85,86.
a Integration of scRNA-seq data for 16 organs from cynomolgus monkey, eleven organs from human, and nine organs from mouse. b UMAP showing the distribution of cells from cynomolgus monkey, human and mouse. c UMAP illustrating the distribution of annotated 15 major cell types. d Bar plot showing the percentage of cells from the three species (left) and the number of cells in each cell type (right). e Dot plot showing representative marker genes of different cell types. Dot size is proportional to the fraction of cells expressing specific genes. Color intensity corresponds to the relative expression of specific genes. f Feature Plot showing the expression of selected marker genes. g Correlation of orthologous gene expression between human, mouse and monkey pseudo-cell types (n = 212) based on the AUROC scores. The AUROC scores were calculated by MetaNeighbor to measure the similarity of different cell types. The clustered heatmap was plotted using the ‘pheatmap’ function in R, where the complete linkage method was used for hierarchical clustering. h Box plots showing the Spearman correlation of average gene expression between two different species using top 20 marker genes in a specific cell type. Each dot represents a major cell type (n = 13). Statistical significance of difference between comparisons was calculated by the two-sided Mann–Whitney U test. The boxes indicate the 25% quantile, median (horizontal line), 75% quantile. i The proportion of hepatocytes in liver (left) and ciliated cells in stomach (right). j Scatter plots showing a pairwise comparison of gene expression across species in a specific organ. Left two scatter plots are for comparisons of hepatocyte cells in liver and right for comparisons of ciliated cells in stomach. Differentially expressed genes (DEGs) are highlighted and representative DEGs are labeled. The size of dots is proportional to the fold change for a specific gene.
Next, we investigated the transcriptomic dynamics of the same cell types between monkey and other species, with a specific focus on varying cell types in compositions across species. Particularly, hepatocytes and ciliated cells showed the lowest similarity between human and monkey (Fig. 9h and Supplementary Fig. 20c, d) and were mainly enriched in the monkey organs of liver and stomach, respectively (Supplementary Fig. 20e, f). However, compared to human and mouse, both cell types were underrepresented in monkey (Fig. 9i). Therefore, hepatocytes and ciliated cells were two representative cell types that largely different between monkey and human or mouse. We performed pairwise comparison of gene expression in liver for hepatocytes and in stomach for ciliated cells (Fig. 9j). In the differential analysis of gastric ciliary cells between human and monkey, we found that LYZ was highly expressed in monkey87. As LYZ has a dual role of immune defense and digestive function88, this observation may highlight that monkey has enhanced function of immune response and digestive system. As a gastric lipase, LIPF is expressed in human chief cells and promotes lipid metabolism89 and it may help for the fat digestion processes occurring in human. ORM1, as an acute-phase protein, was highly specifically expressed in human hepatocytes and had a certain promoting effect on liver regeneration90, which might provide a potentially therapeutic target in hepatopathy.
Finally, we explored conserved and divergent patterns of cell–cell communications among the three species. We performed intercellular ligand-receptor interaction analyses between each pair of cell types in each organ using CellPhoneDB41. The number of significant cell–cell interactions were counted in each species and then scaled to the range between 0 and 1 for inter-species comparison (Fig. 10a). Interestingly, the frequency of intercellular interactions was positively correlated between different species (Fig. 10b), suggesting that these species generally share common intercellular signaling pathways mediating cell–cell communications. The overall intercellular interaction pattern was more similar, albeit slightly, between human and monkey than that between human and mouse (Spearman correlations 0.30 versus 0.27; Fig. 10b). Indeed, the interaction intensities of immune-related cell–cell interactome and the corresponding top ligand-receptor pairs were more consistent between human and monkey in the investigated organs such as kidney and spleen (Fig. 10c, d). In contrast, intercellular interactions among non-immune cells preferred to be more consistent between monkey and mouse (Supplementary Fig. 23).
a Box plot indicating the relative frequency of cell–cell interactions among human, monkey and mouse. Each point denotes a specific intercellular interaction in a specific organ. The number of significant cell–cell interactions are scaled to a range from 0 and 1 for inter-species comparison. b Scatter plots show the Spearman’s rank correlation of cell–cell interaction frequencies between human and monkey (left) or between human and mouse (right). P-values are provided (two-sided Spearman’s correlation test). The fitted line and standard errors with 95% confidence intervals are shown. c The heatmaps showing the strength of interactions among the common major cell types in kidney and spleen in the three species. The size and color of the blocks are proportional to the frequency of interactions. P-values were calculated by CellPhoneDB without multiple comparisons. d Dot plot of interactions between major cell subtypes in kidneys and spleens of different species. Each row represents a ligand-receptor pair, and each column defines a cell–cell interaction pair in a species.
Discussion
Non-human primates (NHP) are similar to humans in terms of anatomy, physiology and biochemical metabolism. Cynomolgus monkeys, a well-established laboratory animal model, have outstanding contributions to the scientific field91. Although several single-cell transcriptomic atlases have recently been established in cynomolgus monkeys based on a few organs (including ovary, lung, heart and artery)28,29,30, an organism-wide single-cell map is still lacking in this model species. Here, we charted a reference cell map of cynomolgus monkeys (named ‘Monkey Atlas’) using both scATAC-seq and scRNA-seq data across multiple organs, allowing deeper insights into the molecular dynamics and cellular heterogeneity of the cynomolgus monkeys organism.
As a proof of concept, we have performed various analyses based on the Monkey Atlas to show its wide uses, including the discovery of new putative cell types, the identification of key regulators in organ specification, and the ability to compare cell types across organs and species. For instance, our data demonstrated that ciliated cells present in various organs of cynomolgus monkeys and the different ciliated cell subpopulations showed various functions related to metabolic process, signal transduction, and cellular response to stimulus (Fig. 2). This observation somehow expands our notion that ciliated cells are generally found in respiratory system92 with vital role in cleaning pathogenic microorganisms and signal transduction36. We also predicted key regulatory factors controlling gene expression patterns of different cell types. Consistent with previous studies93,94,95, SPIB, POU2F2, SPI1, CEBPD and IRF4 were key regulators in myeloid cells and the motifs of these TFs were overrepresented in Monocytes_IL1Bhigh cells than other cells in the SCENIC analysis. In addition to the discovery of known immune-related TFs regulating S100A4 and CD48, we also found FEV, a known TF that regulates the function of hematopoietic stem cells72, has shown extensive regulation of other TFs in immune cells and epithelial cells in our study.
Recently, comprehensive reference cell maps across organs have been established in human4,5 and mouse6,96. Although the Monkey Atlas did not provide exhaustive characterization of all organs in cynomolgus monkeys, it does offer a rich dataset of the most populously studied organs in biology. In this regard, we performed cross-species integration analysis of cell maps to explore the molecular and cellular differences among the three species (human, cynomolgus monkeys and mouse) with comprehensive single-cell data. We noticed that cynomolgus monkeys and human both have abundant immune cells and epithelial cells and a comparative composition of cell types in matched organs. This indicates that cynomolgus monkeys are ideal models to study complex diseases. The differences in cell compositions and gene expression of human, mouse and monkey cells may provide scientists with a guide basis for selecting experimental animals.
Although we have provided relatively rich single-cell multi-omics data in cynomolgus monkey, there are still several limitations of the current study. First, only one male and one female monkeys were included in the analysis. It is therefore challenging to explore genes with sex-biased expression and other sex-related differences in our analysis. Second, the majority of the samples were taken from a single individual monkey. We cannot assess the impact of substantial cellular heterogeneity within the same organ or tissue. Nevertheless, there is a good match of expression patterns and cell compositions between male and female monkeys in the organs of liver, heart and colon. Third, there are still many functionally important organs or tissues (such as ovary, pancreas, and cerebellum) that have not yet been included in our study due to limited resources. In addition, some scRNA-seq samples do not have matched scATAC-seq data, which may restrict unbiased exploration of DNA regulatory elements in specific organs.
During the revision of our manuscript, we acknowledge that a very recent study by Han et al. provided a large-scale cell transcriptomic atlas in cynomolgus monkeys97. Both this study and ours bear striking similarities in terms of investigated organs/tissues and annotated cell components. Regardless of the differences of analytical focuses between the two studies, there are at least three aspects to demonstrate that the two studies are complementary to each other. First, although Han et al. included more tissue samples (n = 45), there are some organs / tissues (including breast and muscle) not covered but included in our data. Second, Han et al. mainly adopted single-nucleus RNA sequencing (snRNA-seq) to profile frozen tissues. However, nuclei have lower amounts of mRNA and less complexity of cell types compared to cells98. In contrast, our scRNA-seq data generated from fresh tissues provide more information including both cytoplasmic and nuclear transcripts, allowing to annotate potential new but rare cell types (such as neutrophils and spermatid cells) which were not mentioned by Han et al.98. Lastly, Han et al. provides only one scATAC-seq sample in kidney, while the eight samples in seven different organs provided in our study is a well complement to the cynomolgus monkey database.
In conclusion, our Monkey Atlas together with the cell transcriptomic atlas by Han et al.98 provide valuable information about the most populous and important cell populations in cynomolgus monkey, which are stepping stones for preclinical studies in future.
Methods
Organ tissue collection
Two healthy four-year-old cynomolgus monkeys were raised from the monkey breeding base of Changchun Biotechnology Development Co., Ltd., Guangxi, China. The managing protocols of the monkeys were carried out in accordance with the standard procedures referring to Guide for Care and Use of Laboratory Animal (2010) and the principles on Animal Welfare Management (Public Law 99-198). The sample collection of cynomolgus monkey and research conduction in this study were approved by the Research Ethics Committee of the Changchun Biotechnology Development Co., Ltd. (Approval Number: 21001). Tissues were collected from 16 different organs, including trachea, spleen, stomach, kidney, uterus, tongue, testis, muscle, lung, liver, heart, colon, breast, bladder, adipose and aorta. Specifically, tissues were firstly cut into 1-2 mm3 pieces in RPMI-1640 medium (Gibico) with 12% fetal bovine serum (FBS, Gibico), then enzymatically digested with gentleMACS (Miltenyi) according to manufacturer’s instruction. Cells were passed through a 70 μm cell trainer (Miltenyi) and centrifuged at 300 g for 5 min at 4 °C. The pelleted cells were re-suspended in red blood cell lysisbuffer (Beyotime) and incubated 1 min to lyse red blood cells. After wash twice with 1XPBS (Gibico), the cell pellets were re-suspended in sorting buffer (PBS with supplemented with 1% FBS). Single cells were captured by the 10 × Genomics Chromium Single Cell 3’ Solution. scRNA-seq library was conducted by Shanghai Xuran Biotechnology and scATAC-seq library was conducted by LC-Biotechnology (Hangzhou, China) and prepared following the manufacturer’s protocol (10 × Genomics). The libraries were subjected to high-throughput sequencing on the Novaseq6000 platform, and 150-bp paired-end reads were generated.
scRNA-seq and data processing
The reference genome sequence of Macaca fascicularis in FASTA format and gene annotation in GTF format were downloaded from the ENSEMBL database. Raw scRNA-seq data were aligned to the M.fascicularis reference genome (macFas6), and subjected to barcode assignment and unique molecular identifier (UMI) counting using the CellRanger v3.1.0 pipeline (10x Genomics). Filtered count matrices from the CellRanger pipeline were converted to sparse matrices using Seurat package (v4.0.0)99. Potential doublets were detected and filtered using DoubletFinder100 based on the expression proximity of each cell to artificial doublets. Cells which expressed either more than 4000 genes or less than 200 genes, as well as the ones who has more than 20% of mitochondrial gene expression in UMI counts were removed from the analysis. Filtered data were then log normalized and scaled to avoid cell-to-cell variation caused by UMI counts and the percent mitochondrial reads. Specifically, the top 3000 most variably expressed genes were determined using the “vst” method in the “FindVariableFeatures” function and scaled using “ScaleData” with regression on the proportion of mitochondrial UMIs (mt.percent).
We used the Robust Principal Component Analysis (RPCA) method in Seurat for integration of scRNA-seq data from different organs. The “RunPCA” function was used to compute the top 20 principal components (PCs) using variably expressed genes. We used UMAP (Uniform Manifold Approximation and Projection) for visualization of cell clusters. Clustering was performed for integrated expression values based on shared-nearest-neighbor (SNN) graph clustering (Louvain community detection-based method) using “FindClusters” with a resolution of 0.8. We used the “FindAllMarkers” function with default parameters to identify markers for each cluster. Marker genes for each cluster were provided in Supplementary Table 2.
Pathway analysis
Gene-set enrichment analysis on differentially expressed genes (DEGs) in this study was performed by the clusterProfiler package101 in R. Gene-set variation analysis (GSVA) was conducted using the GSVA102 package. Expression differences between different cell groups were calculated by the ‘FindMarkers’ function in the Seurat package. UCell103 was used to calculate the gene signature scores of the collagen metabolic pathway, which includes 24 genes (CST3, CTSK, CTSS, FAP, MMP14, MMP16, MMP9, VSIR, ADAMTS3, COL1A2, CREB3L1, F2, F2R, HIF1A, IL6, LARP6, P3H1, RGCC, SERPINF2, SERPINH1, SMPD3, TNS2, TRAM2, and VIM) annotated in the genome of M.fascicularis.
Trajectory inference using Monocle
Monocle2 (version 2.99.3)34 was used to infer the epithelial cells state transition. The UMI count matrix of epithelial cells, gene and cell annotation information derived from Seurat analysis were used to create a CellDataSet object. Variable genes identified among epithelial cell subsets were used to sort cells in the pseudotime analysis. We used the DDRTree method and orderCells function for dimensional reduction and cell ordering. The ciliated_cell_SCGB1D2high (E09 for the bladder organ) or ciliated_cell_PTGR1high clusters (E12 for other organs) were defined as the root state of the inferred trajectory.
RNA velocity analysis
The generated bam files by CellRanger were sorted by SAMTools. The sorted bam files were then used to run the ‘run10x’ command from Velocyto to generate a loom file. RNA velocity analysis was independently performed in epithelial cells and stromal cells.
Cell–cell interaction analysis
Cell–cell interactions among different cell types were estimated by CellPhoneDB (v2.1.1)41 with default parameters (10% of cells expressing the ligand/receptor). In order to run CellPhoneDB analysis in cynomolgus monkeys, the M.fascicularis genes were converted to human genes based on homologous gene mapping. Interactions with p-value < 0.05 were considered to be significant. We considered only ligand-receptor interactions based on the annotation from the CellPhoneDB database, and discarded receptor-receptor and other interactions without a clear receptor.
Create a cisTarget database for Macaca fascicularis
We followed the instruction by SCENIC (https://github.com/aertslab/create_cisTarget_databases) to construct cisTarget database. Since there are no well annotated TF motifs in M.fascicularis, we instead used the annotated human motifs from the CIS-BP website (http://cisbp.ccbr.utoronto.ca/) to create cisTarget databases for M.fascicularis.
Gene-regulatory network
To identify cell-type and organ-specific gene regulatory networks, we performed Single-cell Regulatory Network Inference and Clustering (v0.10.0; a Python implementation of SCENIC)3 in our M.fascicularis dataset. Firstly, the original expression data were normalized by dividing the gene count for each cell by the total number of cells in that cell and multiplying by 10,000, followed by a log1p transformation. Next, normalized counts were used to generate the co-expression module with GRNboost2 algorithm implemented in the arboreto package (v.0.1.3). Finally, we used pySCENIC with its default parameters to infer co-expression modules using the above-created RcisTarget database. An AUCell value matrix was generated to represent the activity of regulators in each cell. The final cell-type and organ-specific gene regulatory networks (GRNs) consisted of 86 and 87 regulons for our M.fascicularis dataset as shown in Supplementary Fig. 16a and Fig. 8a. GRNs were visualized by the igraph package in R.
Cross-species analysis of multiple-organ scRNA-seq data
For cross-species comparison analysis of scRNA-seq data, we only included the one-by-one orthologous genes (n = 12,971) for subsequent analysis. Specifically, cell count matrices of the orthologous genes were extracted from the integrated scRNA-seq data from cynomolgus monkey, human and mouse, respectively. Only cells in matched organs/tissues were considered in the analysis. We then performed cross-species scRNA-seq data integration using the Seurat’s reciprocal PCA (RPCA) integration strategy. Downstream cell-cluster-based cell-type annotation and marker gene analyses were carried out in a similar way as described above.
To compare the cell-type similarity among the three species, we computed the Spearman’s rank correlation of average expression values of the top 20 marker genes between different species in a specific cell type (Fig. 9h). We also validated the correlation analysis by using different sets of top marker genes (Supplementary Fig. 20g).
To compare the pattern of cell–cell communications among the three species, potential intercellular ligand-receptor interactions between each pair of cell types in each organ of each species were predicted by CellPhoneDB. The number of significant cell–cell interactions (p-value < 0.05) were counted in each species for inter-species comparison.
scATAC-seq data pre-processing
The scATAC-seq sequencing data were pre-processed by cellranger-atac (v1.2.0). The running parameters were used by default except for “--force-cells”. The “--force-cell” was set as 10000 for liver, lung and colon, 8000 for spleen, and default for the rest organs. Subsequent scATAC-seq data analysis was performed by ArchR (v1.0.1)33. Specifically, the M.fascicularis genome was constructed and annotated by createGenomeAnnotation and createGeneAnnotation function respectively. Then arrow file was created by createArrowFiles function with default parameters. We used the addDoubletScores function to infer potential doublets, and the filterDoublets function was used to remove the potential doublets with the “filterRatio = 1.0” parameter. ArchR project was created by ArchRProject function with default parameters. For dimensionality reduction, we used the addIterativeLSI function in ArchR with the following parameters: “iterations = 4, clusterParams = list (resolution = c(0.2, 0.4, 0.6), sampleCells = 10,000, n.start = 10, maxClusters = 6), varFeatures = 20,000, dimsToUse = 1:50, scaleDims = FALSE”. Next, the Harmony method was utilized to remove the batch effect by the addHarmony function32. AddClusters function was used to cluster cells by its default parameters. For single-cell embedding, we selected the reducedDims object with harmony and used addTSNE function with the parameter “perplexity = 30” for visualization.
Marker genes identification and cluster annotation
To identify the marker gene, gene scores were calculated when the ArchR project was created and stored in the arrow file. Then getMarkerFeatures function was used to identify the cluster-specific “expressed” genes with default parameters. To visualize the marker genes in the embedding, we used addImputeWeights function to run the MAGIC104 to smooth gene scores across the nearby cells. For track plot, we used the plotBrowserTrack function with default parameters except for “tileSize = 100”.
Peak calling and TF binding motif analysis
Before peak calling, we used the addGroupCoverages function with default parameters to make pseudo-bulk replicates. Then the addReproduciblePeakSet function was used with its default parameters except for “genomeSize = 2.7e09” to call accessible chromatin peaks using MACS2(v2.2.7.1)105. For cell type-specific peak analysis, the getMarkerFeatures function was firstly applied to identify marker peaks. Then the getMarkers function with the parameter “cutOff = FDR < = 0.01 & Log2FC > = 1” was conducted to get the differential peaks. Motif annotation was added to the ArchR project by the addMotifAnnotations function. The TF motif enrichment in differential peaks was computed by the peakAnnoEnrichment function with the parameter “cutOff = FDR < = 0.1 & Log2FC > = 0.5”. For motif footprint analysis, we first used the getPositions function to locate relevant motifs; then the getFootprints function was used to compute interested motif footprints with its default parameters. Lastly, footprint patterns were illustrated by the plotFootprints function with the parameter of “normMethod = Subtract, smoothWindow = 10”.
Integrative analysis of scRNA-seq and scATAC-seq data
In order to align and integrate scATAC-seq data from different organs, we extracted and annotated the scRNA-seq data with matched scATAC-seq data in the same organ. We first used the FindTransferAnchors function from the Seurat package and aligned the data with the addGeneIntegrationMatrix function in ArchR with “unconstrained integration” mode. Most of the predicted scores were > 0.5 in the result, indicting a relatively high prediction accuracy. To improve the accuracy of the prediction and to better integrate the two datasets, a “constrained integration” mode was applied to integrate the scATAC-seq and scRNA-seq data. Briefly, we annotated the scATAC-seq data with cell types based on the gene scores of scATAC-seq. Then, a restricted list was created to make sure that gene expression similarity was calculated only in the same cell type for both scATAC-seq and scRNA-seq data. For peak-to-gene linkage analysis, we used the addPeak2GeneLinks function to compute peak accessibility and gene expression with the parameters of “corCutOff = 0.45, resolution = 1”.
Statistical and reproducibility
If not specified, all statistical analyses and data visualization were done in R (version 4.0.0). We state that no statistical method was used to predetermine sample size. No data were excluded from the analyses and the experiments were not randomized. The Investigators were not blinded to allocation during experiments and outcome assessment.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The data that support the findings of this study have been deposited into CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) with accession numbers “CNP0002427” for scRNA-seq data and “CNP0002441” for scATAC-seq data. Gene counts and metadata are available at “Zenodo [https://doi.org/10.5281/zenodo.5881495]”. The Gene Expression Omnibus (GEO) accession number for scRNA-seq is “GSE196792”. The GEO accession number for scATAC-seq is “GSE196791”. We also provided an interactive website [https://biobigdata.nju.edu.cn/MonkeyAtlas/] for exploration of marker gene expression based on scRNA-seq data. The public dataset used in this study for cross-species comparisons between humans, mice, and monkeys can be accessed as below: the human count matrix is available at “human count matrix [https://figshare.com/articles/HCL_DGE_Data/7235471]”; the mouse count matrix is available at “mouse count matrix [https://figshare.com/articles/MCA_DGE_Data/5435866]”. All other relevant data supporting the key findings of this study are available within the article and its Supplementary Information files or from the corresponding author upon reasonable request.
References
Bhatt, D. L. & Mehta, C. Adaptive designs for clinical trials. N. Engl. J. Med. 375, 65–74 (2016).
Phillips, K. A. et al. Why primate models matter. Am. J. Primatol. 76, 801–827 (2014).
Chappell, L., Russell, A. J. C. & Voet, T. Single-cell (multi)omics technologies. Annu. Rev. Genomics Hum. Genet 19, 15–41 (2018).
Han, X. et al. Construction of a human cell landscape at single-cell level. Nature 581, 303–309 (2020).
He, S. et al. Single-cell transcriptome profiling of an adult human cell atlas of 15 major organs. Genome Biol. 21, 294 (2020).
Han, X. et al. Mapping the mouse cell atlas by microwell-Seq. Cell 172, 1091–1107 e17 (2018).
Schaum, N. et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature 562, 367 (2018).
Tabula Muris, C. A single-cell transcriptomic atlas characterizes ageing tissues in the mouse. Nature 583, 590–595 (2020).
Yu, Q. et al. Charting human development using a multi-endodermal organ atlas and organoid models. Cell 184, 3281–3298.e22 (2021).
Saviano, A., Henderson, N. C. & Baumert, T. F. Single-cell genomics and spatial transcriptomics: Discovery of novel cell states and cellular interactions in liver physiology and disease biology. J. Hepatol. 73, 1219–1230 (2020).
Williams, J. W. et al. Single cell RNA sequencing in atherosclerosis research. Circ. Res. 126, 1112–1126 (2020).
Chen, H., Ye, F. & Guo, G. Revolutionizing immunology with single-cell RNA sequencing. Cell Mol. Immunol. 16, 242–249 (2019).
Colonna, M. & Brioschi, S. Neuroinflammation and neurodegeneration in human brain at single-cell resolution. Nat. Rev. Immunol. 20, 81–82 (2020).
Xing, Q. R. et al. Diversification of reprogramming trajectories revealed by parallel single-cell transcriptome and chromatin accessibility sequencing. Sci. Adv. 6, eaba1190 (2020).
Wang, J. et al. Tracing cell-type evolution by cross-species comparison of cell atlases. Cell Rep. 34, 108803 (2021).
La Manno, G. et al. Molecular diversity of midbrain development in mouse, human, and stem cells. Cell 167, 566 (2016).
Hodge, R. D. et al. Conserved cell types with divergent features in human versus mouse cortex. Nature 573, 61 (2019).
Baron, M. et al. A single-cell transcriptomic map of the human and mouse pancreas reveals inter- and intra-cell population structure. Cell Syst. 3, 346 (2016).
Lau, X., Munusamy, P., Ng, M. J. & Sangrithi, M. Single-Cell RNA sequencing of the cynomolgus macaque testis reveals conserved transcriptional profiles during mammalian spermatogenesis. Dev. Cell 54, 548–566.e7 (2020).
Shami, A. N. et al. Single-cell RNA sequencing of human, macaque, and mouse testes uncovers conserved and divergent features of mammalian spermatogenesis. Dev. Cell 54, 529–547.e12 (2020).
Messmer, T. et al. Transcriptional heterogeneity in naive and primed human pluripotent stem cells at single-cell resolution. Cell Rep. 26, 815–824.e4 (2019).
Drokhlyansky, E. et al. The human and mouse enteric nervous system at single-cell resolution. Cell 182, 1606–1622.e23 (2020).
Hermann, B. P. et al. The mammalian spermatogenesis single-cell transcriptome, from spermatogonial stem cells to spermatids. Cell Rep. 25, 1650–1667.e8 (2018).
Nakamura, T. et al. Data Descriptor: Single-cell transcriptome of early embryos and cultured embryonic stem cells of cynomolgus monkeys. Sci. Data 4 (2017).
Mestas, J. & Hughes, C. C. Of mice and not men: differences between mouse and human immunology. J. Immunol. 172, 2731–2738 (2004).
Sachs, D. H. Tolerance: of mice and men. J. Clin. Invest. 111, 1819–1821 (2003).
VandeBerg, J. L. & Williams-Blangero, S. Advantages and limitations of nonhuman primates as animal models in genetic research on complex diseases. J. Med. Primatol. 26, 113–119 (1997).
Ma, S. et al. Single-cell transcriptomic atlas of primate cardiopulmonary aging. Cell Res. 31, 415–432 (2021).
Wang, S. et al. Single-cell transcriptomic atlas of primate ovarian aging. Cell 180, 585 (2020).
Zhang, W. Q. et al. A single-cell transcriptomic landscape of primate arterial aging. Nat. Commun. 11, 2202 (2020).
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289–1296 (2019).
Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021).
Qiu, X. et al. Reversed graph embedding resolves complex single-cell trajectories. Nat. Methods 14, 979–982 (2017).
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Spassky, N. & Meunier, A. The development and functions of multiciliated epithelia. Nat. Rev. Mol. Cell Biol. 18, 423–436 (2017).
Buckley, C. D., Barone, F., Nayar, S., Benezech, C. & Caamano, J. Stromal cells in chronic inflammation and tertiary lymphoid organ formation. Annu. Rev. Immunol. 33, 715–745 (2015).
Soliman, H. et al. Multipotent stromal cells: one name, multiple identities. Cell Stem Cell 28, 1690–1707 (2021).
Natoli, G. & Ostuni, R. Adaptation and memory in immune responses. Nat. Immunol. 20, 783–792 (2019).
Spinelli, J. B. & Haigis, M. C. The multifaceted contributions of mitochondria to cellular metabolism. Nat. Cell Biol. 20, 745–754 (2018).
Efremova, M., Vento-Tormo, M., Teichmann, S. A. & Vento-Tormo, R. CellPhoneDB: inferring cell-cell communication from combined expression of multi-subunit ligand-receptor complexes. Nat. Protoc. 15, 1484–1506 (2020).
Ma, P. et al. Immune cell landscape of patients with diabetic macular edema by single-cell RNA analysis. Front Pharm. 12, 754933 (2021).
Orian-Rousseau, V. & Sleeman, J. CD44 is a multidomain signaling platform that integrates extracellular matrix cues with growth factor and cytokine signals. Adv. Cancer Res. 123, 231–254 (2014).
Wu, C. et al. Galectin-9-CD44 interaction enhances stability and function of adaptive regulatory T cells. Immunity 41, 270–282 (2014).
Schep, A. N., Wu, B. J., Buenrostro, J. D. & Greenleaf, W. J. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods 14, 975 (2017).
Clerc, R. G., Corcoran, L. M., Lebowitz, J. H., Baltimore, D. & Sharp, P. A. The B-cell-specific Oct-2 protein contains pou box-type and homeo box-type domains. Genes Dev. 2, 1570–1581 (1988).
Hodson, D. J. et al. Regulation of normal B-cell differentiation and malignant B-cell survival by OCT2. Proc. Natl Acad. Sci. USA 113, E2039–E2046 (2016).
Acharya, A. et al. The bHLH transcription factor Tcf21 is required for lineage-specific EMT of cardiac fibroblast progenitors. Development 139, 2139–2149 (2012).
Pliner, H. A. et al. Cicero predicts cis-regulatory DNA interactions from single-cell chromatin accessibility data. Mol. Cell 71, 858–871.e8 (2018).
Parker, L. H. et al. The endothelial-cell-derived secreted factor Egfl7 regulates vascular tube formation. Nature 428, 754–758 (2004).
Soncin, F. et al. VE-statin, an endothelial repressor of smooth muscle cell migration. EMBO J. 22, 5700–5711 (2003).
Fitch, M. J., Campagnolo, L., Kuhnert, F. & Stuhlmann, H. Egfl7, a novel epidermal growth factor-domain gene expressed in endothelial cells. Dev. Dyn. 230, 316–324 (2004).
Chen, L. et al. A reinforcing HNF4-SMAD4 feed-forward module stabilizes enterocyte identity. Nat. Genet. 51, 777–785 (2019).
Chen, L. et al. The nuclear receptor HNF4 drives a brush border gene program conserved across murine intestine, kidney, and embryonic yolk sac. Nat. Commun. 12, 2886 (2021).
Wittkopp, P. J. & Kalay, G. Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence. Nat. Rev. Genet. 13, 59–69 (2011).
Yanez-Cuna, J. O., Kvon, E. Z. & Stark, A. Deciphering the transcriptional cis-regulatory code. Trends Genet 29, 11–22 (2013).
Khouri-Farah, N., Guo, Q., Morgan, K., Shin, J. & Li, J. Y. H. Integrated single-cell transcriptomic and epigenetic study of cell state transition and lineage commitment in embryonic mouse cerebellum. Sci. Adv. 8, eabl9156 (2022).
Sarropoulos, I. et al. Developmental and evolutionary dynamics of cis-regulatory elements in mouse cerebellar cells. Science 373, eabg4696. (2021).
Balsalobre, A. & Drouin, J. Pioneer factors as master regulators of the epigenome and cell fate. Nat. Rev. Mol. Cell. Biol. 23, 449–464 (2022).
Nutt, S. L. & Kee, B. L. The transcriptional regulation of B cell lineage commitment. Immunity 26, 715–725 (2007).
You, M. et al. Single-cell epigenomic landscape of peripheral immune cells reveals establishment of trained immunity in individuals convalescing from COVID-19. Nat. Cell Biol. 23, 620–630 (2021).
Satpathy, A. T. et al. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat. Biotechnol. 37, 925–936 (2019).
Morgan, D. & Tergaonkar, V. Unraveling B cell trajectories at single cell resolution. Trends Immunol. 43, 210–229 (2022).
Yu, Y. et al. Bcl11a is essential for lymphoid development and negatively regulates p53. J. Exp. Med. 209, 2467–2483 (2012).
Satterwhite, E. et al. The BCL11 gene family: involvement of BCL11A in lymphoid malignancies. Blood 98, 3413–3420 (2001).
Liu, P. et al. Bcl11a is essential for normal lymphoid development. Nat. Immunol. 4, 525–532 (2003).
Yaseen, N. R., Park, J., Kerppola, T., Curran, T. & Sharma, S. A central role for Fos in human B- and T-cell NFAT (nuclear factor of activated T cells): an acidic region is required for in vitro assembly. Mol. Cell Biol. 14, 6886–6895 (1994).
Weinbauer, G. F., Behr, R., Bergmann, M. & Nieschlag, E. Testicular cAMP responsive element modulator (CREM) protein is expressed in round spermatids but is absent or reduced in men with round spermatid maturation arrest. Mol. Hum. Reprod. 4, 9–15 (1998).
Honda, K. & Taniguchi, T. IRFs: master regulators of signalling by Toll-like receptors and cytosolic pattern-recognition receptors. Nat. Rev. Immunol. 6, 644–658 (2006).
Gallant, S. & Gilkeson, G. ETS transcription factors and regulation of immunity. Arch. Immunol. Ther. Exp. (Warsz.) 54, 149–163 (2006).
Nowling, T. K. & Gilkeson, G. S. Regulation of Fli1 gene expression and lupus. Autoimmun. Rev. 5, 377–382 (2006).
Wang, L. et al. Fev regulates hematopoietic stem cell development via ERK signaling. Blood 122, 367–375 (2013).
Rogers, C. D., Phillips, J. L. & Bronner, M. E. Elk3 is essential for the progression from progenitor to definitive neural crest cell. Dev. Biol. 374, 255–263 (2013).
Pond, A. L. & Nerbonne, J. M. ERG proteins and functional cardiac I(Kr) channels in rat, mouse, and human heart. Trends Cardiovasc Med. 11, 286–294 (2001).
Di-Poi, N., Zakany, J. & Duboule, D. Distinct roles and regulations for HoxD genes in metanephric kidney development. PLoS Genet. 3, e232 (2007).
Zhang, P. et al. DNA methylation alters transcriptional rates of differentially expressed genes and contributes to pathophysiology in mice fed a high fat diet. Mol. Metab. 6, 327–339 (2017).
Vanderpool, C. et al. Genetic interactions between hepatocyte nuclear factor-6 and Notch signaling regulate mouse intrahepatic bile duct development in vivo. Hepatology 55, 233–243 (2012).
Tomaru, Y. et al. Identification of an inter-transcription factor regulatory network in human hepatoma cells by Matrix RNAi. Nucleic Acids Res. 37, 1049–1060 (2009).
Dewi, F. N. & Cline, J. M. Nonhuman primate model in mammary gland biology and neoplasia research. Lab Anim. Res. 37, 3 (2021).
Gebre, M. S. et al. Optimization of non-coding regions for a non-modified mRNA COVID-19 vaccine. Nature 601, 410–414 (2022).
Routhu, N. K. et al. SARS-CoV-2 RBD trimer protein adjuvanted with Alum-3M-052 protects from SARS-CoV-2 infection and immune pathology in the lung. Nat. Commun. 12, 3587 (2021).
Marlin, R. et al. Targeting SARS-CoV-2 receptor-binding domain to cells expressing CD40 improves protection to infection in convalescent macaques. Nat. Commun. 12, 5215 (2021).
Liu, H. et al. Development of recombinant COVID-19 vaccine based on CHO-produced, prefusion spike trimer and alum/CpG adjuvants. Vaccine 39, 7001–7011 (2021).
Alleva, D. G. et al. Development of an IgG-Fc fusion COVID-19 subunit vaccine, AKS-452. Vaccine 39, 6601–6613 (2021).
Urano, E. et al. COVID-19 cynomolgus macaque model reflecting human COVID-19 pathological conditions. Proc. Natl Acad. Sci. USA 118, e2104847118 (2021).
Li, H. et al. Enhanced protective immunity against SARS-CoV-2 elicited by a VSV vector expressing a chimeric spike protein. Signal Transduct. Target Ther. 6, 389 (2021).
Janiak, M. C., Burrell, A. S., Orkin, J. D. & Disotell, T. R. Duplication and parallel evolution of the pancreatic ribonuclease gene (RNASE1) in folivorous non-colobine primates, the howler monkeys (Alouatta spp.). Sci. Rep. 9, 20366 (2019).
Badiyan, S. N. et al. Clinical outcomes of patients with recurrent lung cancer reirradiated with proton therapy on the Proton Collaborative Group and University of Florida Proton Therapy Institute Prospective Registry Studies. Pr. Radiat. Oncol. 9, 280–288 (2019).
Busslinger, G. A. et al. Human gastrointestinal epithelia of the esophagus, stomach, and duodenum resolved at single-cell resolution. Cell Rep. 34, 108819 (2021).
Qin, X. Y. et al. Transcriptome analysis uncovers a growth-promoting activity of Orosomucoid-1 on hepatocytes. EBioMedicine 24, 257–266 (2017).
Capitanio, J. P. & Emborg, M. E. Contributions of non-human primates to neuroscience research. Lancet 371, 1126–1135 (2008).
Sachs, N. et al. Long-term expanding human airway organoids for disease modeling. EMBO J 38, e100300 (2019).
Care, M. A. et al. SPIB and BATF provide alternate determinants of IRF4 occupancy in diffuse large B-cell lymphoma linked to disease heterogeneity. Nucleic Acids Res. 42, 7591–7610 (2014).
Harris, A. et al. Onecut factors and Pou2f2 regulate the distribution of V2 interneurons in the mouse developing spinal cord. Front Cell Neurosci. 13, 184 (2019).
Gery, S., Tanosaki, S., Hofmann, W. K., Koppel, A. & Koeffler, H. P. C/EBPdelta expression in a BCR-ABL-positive cell line induces growth arrest and myeloid differentiation. Oncogene 24, 1589–1597 (2005).
Kalucka, J. et al. Single-cell transcriptome atlas of murine endothelial cells. Cell 180, 764 (2020).
Han, L. et al. Cell transcriptomic atlas of the non-human primate Macaca fascicularis. Nature 604, 723–731 (2022).
Slyper, M. et al. A single-cell and single-nucleus RNA-Seq toolbox for fresh and frozen human tumors. Nat. Med. 26, 792–802 (2020).
Satija, R., Farrell, J. A., Gennert, D., Schier, A. F. & Regev, A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 33, 495–502 (2015).
McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 8, 329–337.e4 (2019).
Yu, G. C., Wang, L. G., Han, Y. Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics-a J. Integr. Biol. 16, 284–287 (2012).
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-Seq data. Bmc Bioinform. 14, 7 (2013).
Andreatta, M. & Carmona, S. J. UCell: robust and scalable single-cell gene signature scoring. Comput. Struct. Biotechnol. J. 19, 3796–3798 (2021).
van Dijk, D. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729.e27 (2018).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Acknowledgements
The authors acknowledge the High Performance Computing Center of Nanjing University for providing high performance computing (HPC) resources. This work is supported by the National Natural Science Foundation of China (Nos. 32070656, 81872877, 82173819, and 81872876), the Nanjing University Deng Feng Scholars Program, the Priority Academic Program Development (PAPD) of Jiangsu Higher Education Institutions and Postgraduate Research & Practice Innovation Program of Jiangsu Province. F.Y., J.Q. and T.Z. appreciate the 2022 Postgraduate Research & Practice Innovation Program of Jiangsu Province (SJCX22_0029, KYCX22_0188 and KYCX22_0118).
Author information
Authors and Affiliations
Contributions
D.C., Y.S., and Y.X. conceived and designed the study. J.Q., Y.W., Y.D., X.Q., and Qiangmin Xie. performed animal experiments. F.Y., T.Z., and W.F. performed data analyses with supported from M.C. D.C. wrote the manuscript with input from J.Q., F.Y., X.Z., and Qiang Xu. All authors reviewed and approved the submitted version.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous Reviewer(s) to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Qu, J., Yang, F., Zhu, T. et al. A reference single-cell regulomic and transcriptomic map of cynomolgus monkeys. Nat Commun 13, 4069 (2022). https://doi.org/10.1038/s41467-022-31770-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-31770-x
This article is cited by
-
In vivo self-renewal and expansion of quiescent stem cells from a non-human primate
Nature Communications (2025)
-
The human and non-human primate developmental GTEx projects
Nature (2025)
-
A molecular cell atlas of mouse lemur, an emerging model primate
Nature (2025)
-
Single-cell transcriptome unveils unique transcriptomic signatures of human organ-specific endothelial cells
Basic Research in Cardiology (2024)
-
An organism-wide atlas of hormonal signaling based on the mouse lemur single-cell transcriptome
Nature Communications (2024)