Introduction

GATA2 is a hematopoietic transcription factor that is indispensable for hematopoietic stem cell (HSC) generation and maintenance [1], and alterations in its expression can lead to severe hematologic abnormalities. Over a decade ago, heterozygous GATA2 mutations leading to a loss of DNA binding ability were identified as drivers of familial myelodysplastic syndrome/acute myeloid leukemia (MDS/AML) in four families [2]. Germline GATA2 mutations have since been implicated in several autosomal dominant disorders with a propensity for familial MDS/AML [3,4,5,6,7,8,9,10,11,12,13,14], leading to the categorization of these disorders as “GATA2 deficiency” [15].

GATA2 deficiency is a pediatric/adolescent pre-leukemic disorder that is highly susceptible to transformation, with 75% of carriers developing a myeloid neoplasm by 20 years of age [16]. Studies of large European MDS cohorts revealed that 15% of MDS cases with excess blasts and 7% of all primary MDS cases contained germline GATA2 mutations [17], and that morbidity significantly increased in patients receiving HSC transplant post-adolescence [18], highlighting the need for early intervention. While several GATA2 variants are associated with GATA2 deficiency [16, 19,20,21], the most common mutations associated with leukemia development are missense, hotspot mutations in the second zinc finger (ZF2) of GATA2 [18, 22, 23]. Molecular contributors to leukemic progression include cytogenetic abnormalities such as monosomy 7 or trisomy 8 followed by the acquisition of somatic mutations, including ASXL1, STAG2, and SETBP1 [17, 18, 23,24,25,26,27,28]. However, the precise mechanisms underlying transformation are not fully understood.

The function of GATA2 as a potent regulator of HSC development and maintenance has been explored in several model systems [1, 29,30,31,32,33,34]. The first mammalian model of GATA2 deficiency disrupted the GATA2 intronic +9.5 enhancer site, which is altered in rare cases, and showed that baseline levels of GATA2 are necessary for HSC generation during development [35, 36]. Recently, two mouse models of missense ZF2 mutations were reported: L359V, which is an activating mutation commonly associated with myelomonocytic transformation of chronic myelogenous leukemia (CML) [37, 38], and R398W, a mutation commonly associated with MDS/AML formation in GATA2 deficiency [39]. Of note, Gata2R398W/+ mice phenocopy the dendritic cell, monocyte, B and NK lymphoid (DCML) deficiency seen in some patients, and the mutant protein was shown to exert a dominant-negative effect on wild-type (WT) GATA2 by enhancing or silencing GATA2 target genes [39]. iPSC lines from GATA2 deficiency patients have been developed to model disease progression but show subtle hematological differences [40]. More recently, two heterozygous GATA2 mutant iPSC lines were developed with CRISPR/Cas9: R398W and R396Q [41], but the consequences of the mutations have yet to be reported. Here, we report the phenotype of a new GATA2 deficiency mouse model which mimics a common germline GATA2 missense mutation in ZF2 associated with MDS/AML progression, R396Q [22]. Using functional studies and single-cell RNA sequencing (scRNA-Seq), we show that Gata2R396Q/+ mice display aberrant hematopoiesis throughout development due to abnormalities in lineage fate decisions and indications of accelerated aging within the HSC compartment, resulting in a loss of HSC stemness. However, the mice do not develop marked immunodeficiency or MDS/AML, indicating that additional drivers are necessary for disease.

Methods

Mice

Gata2R396Q/+ mice were created using CRISPR-Cas9 technology and direct zygote injection by the St. Jude Transgenic Core Facility. Briefly, a mixture of the 40 ng/ul 3X NLS SpCas9 protein (St. Jude Protein Production Core), 20 ng/μl chemically modified sgRNA (CAGE1313.Gata2.g2 – 5’ AAGGGAUCCAGACCCGGAAU-3’; Synthego), and 10 ng/μl donor ssODN (CAGE1313.g2.mod.sense.ssODN-5’TAACAGGCCACTGACCATGAAGAAGGAAGGGATCCAGACCTGGAATCGCAAGATGTCCAGCAAATCCAAGAAGAGCAAGAAAGGGGCTG 3’; IDT AltR modification) was injected into fertilized zygotes as previously described [42]. Animals were genotyped by targeted deep sequencing using CAGE1313.Gata2.F – 5 ’ctacacgacgctcttccgatctGGGGGCTTTGGTGAGTCACATTGGGT 3’ and CAGE1313.Gata2.R – 5’ cagacgtgtgctcttccgatctTAGGCGTGGGCGTGGGTAGGATGTG 3’ primers at the Center for Advanced Genome Engineering (St. Jude) and resulting sequences analyzed by CRIS.py as previously described [43]. Animals positive for the desired R396Q modification were backcrossed to C57BL/6 J mice and then bred to heterozygosity in a pathogen-free facility. B6.SJL (CD45.1+) mice were purchased from The Jackson Laboratory. For timed pregnancies, 1–3 female CD45.2+ mice were placed with a Gata2R396Q/+ male overnight. Females were assessed for a vaginal plug the following morning and prospective embryos were considered embryonic day 0.5 (E0.5). For all in vivo and ex vivo experiments, male and female mice were used and ranged from birth through one year. Animal studies were not blinded to the researcher. Animals were selected randomly from their litter for analysis. All animal experiments were approved by the St. Jude Children’s Research Hospital Institutional Animal Care and Use Committee and comply with all relevant ethical regulations regarding animal research.

Tissue collection and flow cytometry

Mice were bled from the retro-orbital plexus using heparinized capillary tubes and analyzed for complete blood counts (CBCs) using a Forcyte hematology analyzer. For flow cytometry, peripheral blood (PB) was lysed (eBioscience) then resuspended in FACS buffer (PBS/0.5% BSA/2 mM EDTA) and analyzed on a BD LSR Fortessa. All antibodies were purchased from BioLegend and catalog numbers are provided below. For transplant recipients, PB was stained with CD45.1-APC (110714), CD45.2-BV786 (109839), [B220 (103236), CD11b (101228), Gr-1 (108428)]-PerCP-Cy5.5, and [B220 (103222), CD4 (100528), CD8 (100722)]-PECy7. To isolate bone marrow (BM) hematopoietic cells in adult mice, tibias, femurs, and pelvic bones were removed and crushed in ice cold PBS, followed by passage through 70μm filters. Cells were then lysed, washed, and resuspended in FACS buffer. To isolate hematopoietic cells from fetal liver (FL), livers were gently pipetted in FACS buffer, lysed, passed through a 70μm filter, washed, and resuspended in FACS buffer. For flow cytometry of BM and FL hematopoietic progenitors, cells were stained with (1) Lineage (B220 (103244)), CD4 (100548), CD8 (100744), Gr-1 (108440), Ter119 (116239)-BV605, Sca1-PerCP-Cy5.5 (108124), c-Kit-APC780 (135108), CD135-APC (135310), CD48-A700 (103426), CD150-PE-Cy7 (115914) to identify LSK (Lineage-Sca1+c-Kit+), LT-HSC (LSKCD135-CD48-CD150+), ST-HSC (LSKCD135-CD48-CD150-), MPP2 (LSKCD135-CD48+CD150+), MPP3 (LSKCD135-CD48+CD150-), and MPP4 (LSKCD135+CD48+CD150-) cells or (2) Lineage (B220 (103244)), CD4 (100548), CD8 (100744), Gr-1 (108440), Ter119 (116239)-BV605, Sca1-PerCP-Cy5.5 (108124), c-Kit-APC780 (135108), CD32/16-A700 (156620), CD34-A647 (343508), CD127-PE-Cy7 (135014) to identify LK (Lineage-Sca1-c-Kit+), CMP (LKCD34+CD32/16mid), GMP (LKCD34+CD32/16+), MEP (LKCD34-CD32/16-), and CLP (Lineage-c-KitmidSca1midCD127+). For mature lineages, cells were stained with CD11b-FITC (101206), Gr-1-PerCP-Cy5.5 (108428), B220-BV786 (103246), Ter119-APC (116212), and CD71-BV786 (334102) to identify myeloid (CD11b+Gr-1+), B (B220+), or erythroid (R1, CD71-Ter119-; R2, CD71+Ter119-; R3, CD71+Ter119mid; R4, CD71+Ter119+; R5, CD71mid/lowTer119+). For analysis of transplanted bone marrow, CD45.1-BV786 (103149)/APC (110714)/FITC (110706) and CD45.2-PE (109808) were added. All flow data were analyzed using FlowJo version 10 (Treestar).

Single-cell RNA sequencing (scRNA-seq)

LSK cells were sorted on a BD Aria cell sorter, and mRNA expression libraries were constructed according to the Chromium 10X single cell 3′ reagent v3.1 protocol. Libraries were sequenced using Illumina NovaSeq paired-end sequencing per the 10X protocol to a minimum depth of 50,000 reads per cell. Raw reads from four WT and three Gata2R396Q/+ libraries consisting of 2-3 mice (all mice were 8–10 week old females) were mapped to GRCm38 and gene expression matrices were obtained using count command in CellRanger (v.7.0.0). The filtered_feature_bc_matrix was used in Seurat (V.4.0.3) with R(V. 4.3.1) for downstream analysis. Low-quality cells were regressed out if the detected genes per cell were ≤ 200 or ≥ 6000, and the percentage of mitochondrial reads was greater than or equal to 5%. Harmony was used to integrate different replicates and remove the batch effects [44]. Default Seurat pipelines were followed to generate UMAP and clustering the cells into different groups. Differential genes between Gata2WT and Gata2R396Q/+ were obtained using FindMarkers function in Seurat for each cluster. Gene set enrichment analyses were performed using GSEApy (V.1.1.0) using python (V.3.11.6).

Statistical analysis

Unless otherwise specified, data are presented as mean ± standard deviation with sample sizes and replicates indicated in figure legends. No statistical sample size calculation was performed to predetermine sample sizes. Variances were similar for the groups being compared. No samples or animals were excluded from analysis. Significance was determined via Mann-Whitney or Wilcoxon Signed Rank Test. We calculated the hscScore of each cell in our scRNA data using the published hscScore method [45]. hscScore is visualized using VlnPlot in Seurat with integration of the ggpubr package in R. A customized script is provided: (https://github.com/gulkhan007/VlnPlot_pvalue_example).

Additional methods are provided in an online supplement.

Results

Gata2 R396Q/+ mice display significant alterations in bone marrow hematopoiesis during adulthood

To model one of the most recurrent GATA2 ZF2 missense mutations associated with MDS/AML development in GATA2 deficiency, we utilized CRISPR/Cas9 genome editing and homology directed repair to induce a G to A point mutation at c.1087 in heterozygosity on the C57B6/J background, resulting in the Gata2R396Q/+ mutant mouse (Fig. 1A, Supplemental Fig. 1A). Digital droplet PCR confirmed that Gata2R396Q is expressed at 50% in Gata2R396Q/+ mouse whole bone marrow cells (Fig. 1B). To assess the effects of this mutation on normal hematopoiesis, we first aged a cohort of mice and tracked changes in peripheral blood (PB) output over time through complete blood counts (CBC). Young (10–12 weeks) Gata2R396Q/+ mice displayed a modest but significant reduction in white blood cell (WBC), neutrophil, and monocyte counts compared to WT littermates, with no differences in red blood cell (RBC) or platelet counts (Fig. 1C, top). The reduction in monocytes observed in young mice is intriguing, as many GATA2 deficiency patients experience severe monocytopenia [21]. Of note, the previously published Gata2R398W/+ mouse model began showing reduced WBC counts at 6 months of age and severe monocytopenia at 12 months, while 3-month-old mutants had relatively normal CBCs [39], and Gata2+/- mice do not display significant differences in PB output [30]. While older mice (53–55 weeks) in our cohort continued to exhibit significantly reduced WBC, neutrophil, and lymphocyte counts, monocytopenia was not present by 12 months (Fig. 1C, bottom), suggesting that the R398W and R396Q mutations can elicit disparate phenotypes.

Fig. 1: Gata2R396Q/+ mice display significant alterations in bone marrow hematopoiesis during adulthood.
figure 1

A Schematic representing CRISPR/Cas9 editing of Gata2 to produce Gata2R396Q/+ heterozygous mice. B Frequency of R396Q positive droplets in WT and Gata2R396Q/+ whole bone marrow (WBM) via digital droplet PCR (n = 4). C Complete blood count (CBC) data from young (10-12 weeks) and old (53-55 weeks) mice (n = 18). Absolute number of WBM cells D, mature cells E, maturing red blood cells (RBC) F, myeloid progenitors G, and common lymphoid progenitors (CLP) H in 8–10 week old mice (n = 7–11). I Proportion of CMPs, GMPs, or MEPs among myeloid progenitors in the BM of 8–10 week old mice (n = 7–11). J Absolute number of BM hematopoietic stem and progenitors (HSPCs) in 8–10 week old mice (n = 7–11). K Proportion of LT-HSC, ST-HSC, and MPP2 cells among BM LSK cells in 8–10 week old mice (n = 7–11). Representative flow plot L and quantification M of gMFI CD150 expression among BM HSPCs in 8–10 week old mice. ns, not significant; *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001 WBC white blood cells, LY lymphocytes, NE neutrophils, MO monocytes, RBC red blood cells, PLT platelets, WBM whole bone marrow, LK lineage-c-Kit+; CMP common myeloid progenitor, GMP granulocyte-monocyte progenitor, MEP megakaryocyte-erythroid progenitor, CLP common lymphoid progenitor, LSK lineage-Sca-1+c-Kit+, LT-HSC long-term hematopoietic stem cell, ST-HSC short-term HSC, MPP2 multipotent progenitor 2, gMFI geometric mean fluorescence intensity.

We next assessed the bone marrow (BM) hematopoietic compartment of 8–10 week old WT and Gata2R396Q/+ littermates (Supplemental Fig. 1B–E). Gata2R396Q/+ mice showed no significant differences in total BM cellularity, mature B or myeloid cells, or RBC maturation (Fig. 1D–F), although there was a trend for increased B cells in the BM (Fig. 1E). However, mutant mice had significantly reduced numbers of committed myeloid progenitors, a trend for a reduction in common lymphoid progenitors (CLP), and a bias toward megakaryocytic/erythroid progenitors (MEP) at the expense of common myeloid progenitors (CMP) (Fig. 1G–I). More strikingly, the Gata2R396Q/+ hematopoietic stem and progenitor cell (HSPC) compartment exhibited a significant reduction in the short-term HSC (ST-HSC) population and a significant increase in multipotent progenitor 2 (MPP2) cells, with a trend for increased long-term HSC (LT-HSC) and reduced total Lineage-Sca1+c-Kit+ (LSK) cells (Fig. 1J). Gating on LSK cells revealed a bias in HSPC production toward LT-HSC and MPP2 cells at the expense of ST-HSCs (Fig. 1K), which appeared to be driven by an increase in expression of the SLAM family marker CD150 (Fig. 1L–M).

To assess the functional output of specific HSPCs, we performed single cell colony-forming unit (CFU) assays in methylcellulose with cytokines to induce differentiation (Supplemental Fig. 2A). Surprisingly, Gata2R396Q/+ LT-HSCs and other HSPCs showed little to no difference in CFU potential, while MPP2s displayed a slight bias for GEMM and G/M/GM colonies at the expense of Erythroid/Meg colonies (Supplemental Fig. 2B, C). As MPP2s are known to have an intrinsic bias toward myeloid and erythroid/megakaryocyte differentiation [46], this result suggests that the immunophenotypic MPP2 population in Gata2R396Q/+ BM—which has high expression of CD150—may contain a heterogeneous mixture of cells with disparate lineage potential in this in vitro assay. Together, these data show that the R396Q mutation induces significant changes in blood production during homeostasis, especially at the level of stem and progenitor cells.

Gata2 R396Q/+ bone marrow progenitors are functionally impaired

We next assessed the functional potential of the Gata2R396Q/+ hematopoietic compartment by competitive transplantation assays with whole bone marrow (WBM) from WT and Gata2R396Q/+ (CD45.2+) into lethally irradiated recipients (CD45.1+) (Fig. 2A). Gata2R396Q/+ recipients showed a significant reduction in PB chimerism by 4 weeks post-transplant in two independent experiments (Fig. 2B, Supplemental Fig. 2D). While Gata2R396Q/+ cells were outcompeted by competitor BM in all mature lineages as expected (Fig. 2C, left), they were still able to generate all mature lineages at the expected ratios (Fig. 2C, right), suggesting a defect in repopulation a but not lineage potential after primary transplant. Assessment of the bone marrow at 20 weeks post-transplant revealed a reduction in chimerism for all BM compartments (Supplemental Fig. 2E–H). However, many of the intrinsic properties of Gata2R396Q/+ seen during homeostasis were recapitulated post-transplant, including a preference for MPP2 production at the expense of ST-HSC production (Fig. 2D). This was again driven by an increase in CD150 expression in MPP2s (Fig. 2E, F), indicating the presence of LT-HSCs in Gata2R396Q/+ BM with a propensity for differentiation into CD150hi progenitors.

Fig. 2: The hematopoietic compartment of Gata2R396Q/+ bone marrow displays functional defects.
figure 2

A Schematic of primary competitive transplant of WT or Gata2R396Q/+ WBM. B Peripheral blood chimerism in recipients of WT or Gata2R396Q/+ WBM over 20 weeks of transplant. Two independent transplants shown (n = 9-10 recipients/condition in each transplant). C Peripheral blood output of recipients, showing CD45.2+ chimerism within myeloid and lymphoid lineages (left) or lineage output from CD45.2+ cells (right). Representative transplant is shown (n = 9-10 recipients). D Percent output of hematopoietic stem and progenitor cells (HSPCs) from CD45.2+ cells in recipient bone marrow 20 weeks post-transplant. Representative transplant is shown (n = 9-10 recipients). Representative flow plot E and quantification F of gMFI CD150 expression among BM HSPCs in recipients of WT or Gata2R396Q/+ bone marrow 20 weeks post-transplant. Representative transplant is shown (n = 9-10 recipients). G Schematic for secondary transplantation of WT or Gata2R396Q/+ LSK cells. H Peripheral blood chimerism in secondary transplant (n = 10 recipients per condition). I Peripheral blood output of recipients, showing CD45.2+ chimerism within myeloid and lymphoid lineages (left) or lineage output from CD45.2+ cells (right) (n = 10 recipients per condition). J Percent output of HSPCs from CD45.2+ cells in secondary recipient BM 20 weeks post-transplant (n = 8-10 recipients). K Quantification of gMFI CD150 expression among BM HSPCs in secondary recipients 20 weeks post-transplant (n = 8–10 recipients). L Percent output of myeloid progenitors from CD45.2+ cells in secondary recipient BM 20 weeks post-transplant (n = 8–10 recipients). ns not significant, *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. WBM whole bone marrow, LK lineage-c-Kit+, LSK lineage-Sca-1+c-Kit+, LT-HSC long-term hematopoietic stem cell, ST-HSC short-term HSC, MPP2 multipotent progenitor 2, gMFI geometric mean fluorescence intensity, CMP common myeloid progenitor, GMP granulocyte-macrophage progenitors.

We next isolated LSKs from primary recipients and performed secondary transplants to determine bona fide HSC potential (Fig. 2G). While there was a significant reduction in repopulating potential in the peripheral blood and bone marrow of Gata2R396Q/+ secondary recipients compared to WT, long-term potential was still present by 20 weeks post-transplant (Fig. 2H, Supplemental Fig. 3A–D). However, a slight but significant bias toward myeloid output at the expense of lymphoid differentiation was seen in the PB of Gata2R396Q/+ secondary recipients (Fig. 2I). Consistent with primary recipients, Gata2R396Q/+ recipients showed a bias toward MPP2 production at the expense of ST-HSCs, as well as a trend for increased CD150 expression among LT-HSCs and MPP2s (Fig. 2J–K). While not significant for all populations, secondary recipients also displayed an increase in the production of myeloid progenitors and a trend for a decrease in B cell output in the BM (Fig. 2L, Supplemental Fig. 3E). Coupled with the increase in CD150 expression among LT-HSC/MPP2 populations, this could be an indication of an accelerated aging phenotype within the Gata2R396Q/+ HSC compartment, as aged HSCs are known to have increased CD150 expression and a myeloid bias [47].

Together, these data show that the stem cell compartment of Gata2R396Q/+ BM is functionally impaired and contains intrinsic biases for progenitor output in vivo.

The transcriptional signature of Gata2 R396Q/+ hematopoietic progenitors indicate a myeloid bias and loss of HSC stemness

We next performed 10X scRNA-seq on sorted LSK cells from 8–10 week old WT and mutant bone marrow, revealing 22 distinct clusters (Fig. 3A). To annotate the clusters, we combined our dataset with recently published murine BM hematopoietic progenitor scRNA-seq data from the Göttgens group [48], and validated our results with known lineage markers (Fig. 3B, C, Supplemental Figs. 46) [48,49,50,51,52,53]. UMAP projections of the datasets revealed a myeloid progenitor bias in Gata2R396Q/+ LSKs at the expense of erythroid/meg progenitors (Fig. 3B, D). Gata2R396Q/+ LSKs also showed a decrease in the proportion of the earliest hematopoietic progenitors (HSC 1, HSC 2, Intermediate Progenitor (Int Prog) 1) and an increase in more committed progenitors (Int Prog 2-4, Lymphoid (Lymp) Prog) (Fig. 3D, Supplemental Fig. 6A). We also attempted to define immunophenotypic HSPC subsets (LT-HSC, MPP2, etc.) in our scRNA-seq data using publicly available datasets on Haemosphere (Supplemental Fig. 6B) [54]. Similar to our flow cytometry data (Figs. 1, 2), Gata2R396Q/+ cells putatively annotated as LT-HSC, ST-HSC, and MPP2 expressed higher levels of Slamf1 (CD150) (Supplemental Fig. 6C, D).

Fig. 3: The transcriptional signature of Gata2R396Q/+ hematopoietic progenitors reveals myeloid bias and a loss of HSC stemness.
figure 3

A UMAP projections of scRNA-seq from sorted WT or Gata2R396Q/+ BM LSK cells from 8–10 week old mice (n = 3-4 samples per condition). B UMAP projections of WT or Gata2R396Q/+ LSKs (blue) overlayed onto previously published scRNA-seq data from Kucinski et al. (gray), with indications of specific lineages. C Cluster annotations for LSK cells from WT and Gata2R396Q/+. D Proportions of specific cell types among WT or Gata2R396Q/+ LSKs (n = 3-4). E FeaturePlot displaying hscScore values among specific clusters in WT or Gata2R396Q/+ LSK cells. F Quantification and comparison of hscScores between specific clusters of WT or Gata2R396Q/+ LSKs. ns not significant, *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. Prog progenitor, Mono monocyte, DC dendritic cell, pDC plasmacytoid dendritic cell, Ery/Meg/Bas erythroid, megakaryocyte, basophil, Int intermediate, Lymp lymphoid, HSC hematopoietic stem cell.

Given the reduced function of Gata2R396Q/+ progenitors in vivo, we hypothesized that the transcriptional signature of HSC and Int Prog clusters would exhibit reduced stemness. To test this, we applied hscScore—a computational tool that scores cells in scRNA-seq data based on their similarity to the transcriptomes of validated HSCs—to our dataset [45]. hscScore validated our annotations, as the clusters with the highest hscScore were the most primitive (HSC 1, HSC 2, Int Prog 1, and Lymp Prog) (Fig. 3E). Meanwhile, hscScore was significantly lower in Gata2R396Q/+ LSKs for all cells, Int Prog 1, Int Prog 2, and Lymp Prog clusters (Fig. 3F). Interestingly, while hscScore was significantly lower for the HSC 2 cluster in Gata2R396Q/+ LSKs, the HSC 1 cluster showed no significant differences in hscScore between WT and mutant LSKs (Fig. 3F). This coupled with the reduction in the proportion of the most primitive clusters in Gata2R396Q/+ LSKs suggests that the reduced repopulating potential seen in our competitive transplants may be due to multiple factors: a reduction in the number of bona fide HSCs and a loss of stem cell potential, as more differentiated HSCs (represented by HSC 2) display lower HSC score.

Gata2 R396Q/+ hematopoietic progenitors exhibit hallmarks of aging, stem cell exhaustion, and upregulation of Gata2

We next performed GSEA analysis on the HSC, Int Prog, and Lymp Prog clusters in our scRNA-seq dataset to determine significantly up- and down-regulated biological processes in Gata2R396Q/+ cells. Gata2R396Q/+ HSC 1 and HSC 2 clusters showed an upregulation of GO terms involved in mitochondrial activity, and downregulation of terms involved in HSC signature, response to cytokines, and differentiation (Fig. 4A). Similar terms were found for Int Prog 1/Lymp Prog clusters as well as Int Prog 2/3/4 clusters (Fig. 4B, C). Combined with the myeloid bias and reduced hscScore seen in our scRNA-seq data, these terms suggest a phenotype of aging and stem cell exhaustion within the Gata2R396Q/+ HSC compartment. For instance, increased mitochondrial activity has been shown to be associated with HSC aging [55], and aged HSCs lose self-renewal capacity and are myeloid-biased [47, 56, 57]. A blunted response to cytokines may also be indicative of a stem cell compartment that is on the pathway to senescence. Further GSEA analysis revealed a significant enrichment of BM HSC aging pathway in Gata2R396Q/+ HSC, Int Prog, and Lymp Prog clusters (Fig. 4D). Indeed, all Gata2R396Q/+ HSPCs displayed significantly increased oxidative stress (Supplemental Fig. 7A), which may be indicative of premature exhaustion of the stem cell compartment [58]. Also, acute treatment of Gata2R396Q/+ mice with lipopolysaccharide (LPS)—which drives quiescent HSCs into cycling—resulted in a significant reduction in LT-HSC recovery compared to WT controls, suggesting that the mutant HSC compartment contains fewer quiescent HSCs (Fig. 4E, F).

Fig. 4: Gata2R396Q/+ hematopoietic progenitors exhibit an accelerated aging phenotype.
figure 4

UMAP projections and GSEA analysis of HSC 1/HSC 2 A, Int Prog 1/ Lymp Prog B, and Int Prog 2/3/4 C clusters, showing terms negatively and positively enriched in Gata2R396Q/+ populations. D GSEA plots showing positive enrichment for Marrow HSC Aging in Gata2R396Q/+ cells from the indicated clusters. E Schematic for LPS in vivo treatment. F Frequency of LT-HSC cells normalized to PBS controls 5 days post in vivo LPS treatment of WT and Gata2R396Q/+ 8–10 week old mice (n = 4-5 per condition). Terms have an FDR < 0.05 and Gene % > 15%. ns not significant, *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. HSC hematopoietic stem cell, Lymp lymphoid, Prog progenitor, Int intermediate, LT-HSC long-term HSC, PBS phosphate buffered saline, LPS lipopolysaccharide.

To further identify transcriptional differences between WT and mutant LSKs, we looked for genes specifically upregulated in Gata2R396Q/+ clusters. As expected, the myeloid progenitor marker Ctsg was upregulated in progenitor cells surrounding the Myeloid Prog cluster in Gata2R396Q/+ cells (Fig. 5A), validating an enhanced myeloid bias. Interestingly, Ly6c1—a cell-surface marker for granulocyte and monocyte/macrophage populations—was exclusively expressed in Gata2R396Q/+ progenitor cells surrounding the Myeloid Prog cluster (Fig. 5B). Another highly upregulated gene was Tgfbr3 (Fig. 5C), also known as betaglycan, an important co-receptor for TGFβ receptors [59]. TGFβ plays several roles in HSPC function and has been shown to inhibit the lymphoid/megakaryocyte/erythroid differentiation pathways while stimulating the monocyte/DC pathway [59], which is reflected in our scRNA-seq dataset. Interestingly, Gata2 expression was significantly increased in several Gata2R396Q/+ clusters, including HSC 1, Int Prog 1, and Lymp Prog (Fig. 5D, E, Supplemental Fig. 7B). Increased Gata2 expression within hematopoietic progenitors was also seen in the Gata2R398W/+ mouse [39]. To determine if this was due to enhanced WT or mutant Gata2 expression, we sorted HSPCs and performed droplet digital PCR (ddPCR) for mutant allele expression. Gata2R396Q/+ mice displayed varying mutant expression ratios depending on the cell type, with a lower mutant ratio in ST-HSCs and a significantly higher ratio in MPPs (Supplemental Fig. 7C). Given that our scRNA-Seq data showed increased Gata2 expression in all putatively annotated HSPC populations (Supplemental Fig. 7D), this suggests that the degree of WT Gata2 expression may be more critical in some populations (ST-HSC), while mutant expression may be permissible or even advantageous in others (MPP2).

Fig. 5: Gata2R396Q/+ hematopoietic progenitors upregulate Gata2.
figure 5

FeaturePlots displaying the expression of Ctsg A ly6c1 B and Tgfbr3 C in WT and Gata2R396Q/+ LSK cells. D FeaturePlot displaying the expression of Gata2 in WT and Gata2R396Q/+ LSK cells. E Quantification and comparison of Gata2 expression for the indicated clusters in WT and Gata2R396Q/+ cells. F FeaturePlots displaying the expression of Gata2, Dntt, Ctsg, and Procr among Lymp Prog cells in WT and Gata2R396Q/+ samples. Dotted circle indicates a unique population found among the Gata2R396Q/+ Lymp Prog cluster. ns not significant, *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. HSC hematopoietic stem cell, Lymp lymphoid, Prog progenitor, Int intermediate.

Interestingly, the Lymp Prog cluster appeared to upregulate Gata2 exclusively in Gata2R396Q/+ cells (Fig. 5D). We therefore subclustered on the Lymp Prog cluster and identified two unique clusters, defined by Gata2 expression (Fig. 5F, left). These cells expressed lymphoid (Dntt) and myeloid (Ctsg) markers and appeared to arise from a small population of HSCs based on Procr expression (Fig. 5F), indicating the presence of a unique Gata2-expressing progenitor population confined to the Gata2R396Q/+ LSK compartment that may contribute to the enhanced myeloid potential seen in our scRNA-seq dataset.

Together, these data provide evidence for an HSPC compartment exhibiting stem cell exhaustion and aging-related signatures in Gata2R396Q/+ BM that may contribute to the phenotypic and functional defects seen in vivo.

Gata2 R396Q/+ affects fetal liver hematopoietic development and function

As GATA2 deficiency is primarily a pediatric/adolescent disease, we investigated whether the effects of Gata2R396Q/+ on hematopoiesis could be detected prenatally. To do this, we first performed flow cytometry and CFU assays on embryonic day 14.5 (E14.5) fetal liver (FL) cells (Fig. 6A), as at E14.5 the definitive hematopoietic compartment is largely confined to the FL [60]. While there were no significant differences in FL cellularity, Gata2R396Q/+ cells showed a significant increase in all HSPC subtypes except MPP3/4 cells (Fig. 6B). Similarly to BM HSPCs, E14.5 FL HSPCs showed a bias for LT-HSC and MPP2 differentiation, with significantly increased CD150 expression in MPP2s (Fig. 6C–E), indicating that this intrinsic property of Gata2R396Q/+ HSPCs begins during embryonic hematopoiesis. While there was an increase in immunophenotypic HSPCs, CFU assays revealed a functional deficiency in Gata2R396Q/+ E14.5 FL progenitors in terms of GEMM and BFU-E production (Fig. 6F). Similar to the BM compartment, these data suggest the expansion of a functionally compromised pool of HSPCs.

Fig. 6: Gata2R396Q/+ impairs fetal liver hematopoietic development and function.
figure 6

A Schematic of experiments utilizing WT and Gata2R396Q/+ E14.5 fetal liver (FL). B Absolute numbers of total FL, LSK, and hematopoietic stem and progenitor (HSPC) cells from WT and Gata2R396Q/+ littermates at E14.5 (n = 10-11 embryos). C Percentage of LT-HSC or MPP2 among LSK cells in E14.5 FL (n = 17–20 embryos). Representative flow plot D and normalized quantification E of gMFI CD150 among WT and Gata2R396Q/+FL HSPCs at E14.5 (n = 17–20 embryos). F Number of CFU subtypes from E14.5 FL in WT or Gata2R396Q/+ embryos (n = 10–12 embryos). G Schematic of experiments utilizing WT and Gata2R396Q/+ E12.5 FL. H Absolute numbers of total FL, LSK, and LT-HSC in E12.5 FL (n = 11–18 embryos). I Percentage of LT-HSC among LSK cells in E12.5 FL (n = 11–18 embryos). J Normalized quantification of gMFI CD150 among WT and Gata2R396Q/+ FL HSPCs at E12.5 (n = 11–18 embryos). K Number of CFU subtypes from E12.5 FL in WT or Gata2R396Q/+ embryos (n = 11–18 embryos). L Schematic for transplantation of E12.5 FL cells. M Peripheral blood CD45.2+ chimerism in recipients of E12.5 FL (n = 9–12 recipients). Peripheral blood output of recipients, showing CD45.2+ chimerism within myeloid and lymphoid lineages N or lineage output from CD45.2+ cells O (n = 9–12 recipients). ns not significant, *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. E14.5, embryonic day 14.5; FL fetal liver, CFU colony-forming unit, LSK lineage-Sca-1+c-Kit+, LT-HSC long-term hematopoietic stem cell, ST-HSC short-term HSC, MPP2 multipotent progenitor 2, gMFI geometric mean fluorescence intensity, GEMM granulocyte/erythrocyte/monocyte/megakaryocyte, BFU-E burst-forming unit-erythroid, G/M/GM granulocyte/monocyte.

We next looked at E12.5 FL: the murine timepoint at which definitive HSCs have migrated into the FL niche (Fig. 6G) [60]. Similar to E14.5 FL, there was an expansion of LSK and LT-HSC populations in Gata2R396Q/+ embryos (Fig. 6H, I). However, the geometric mean fluorescence intensity (gMFI) of CD150 within LT-HSC and MPP2 cells was not significantly enhanced, suggesting that this phenomenon does not occur until later in FL hematopoiesis (Fig. 6J). E12.5 FL cells showed a significant reduction in GEMM and BFU-E CFUs (Fig. 6K), and transplantation of E12.5 FL cells into irradiated recipients resulted in a significant reduction in repopulating potential (Fig. 6L–N). Similar to primary BM transplants, Gata2R396Q/+ E12.5 FL cells displayed reduced competitive potential but a retained ability to produce all major lineages at expected ratios, suggesting that the pool of E12.5 FL HSCs is reduced but not fully dysfunctional.

Gata2 R396Q/+ abrogates definitive HSC production

Given the reduced function of Gata2R396Q/+ E12.5 FL cells and the known role of GATA2 in hematopoietic development [1], we interrogated the generation of definitive HSCs in the aorta gonad mesonephros (AGM) of WT and mutant mice. First, we performed transplants of E11 AGM into lethally irradiated recipients with support BM cells (Fig. 7A). While 6/23 recipients of WT AGM showed PB chimerism > 1%, 0/14 recipients of Gata2R396Q/+ AGM had sufficient levels of donor chimerism (Fig. 7B), suggesting either a functional deficiency or lack of definitive HSCs in the AGM. We then performed imaging of E10 AGMs to determine the number of intra-aortic hematopoietic clusters in WT and Gata2R396Q/+ embryos. Gata2R396Q/+ embryos contained significantly fewer intra-aortic hematopoietic clusters than WT littermates (Fig. 7C, D), suggesting that the R396Q mutation results in a loss or stalling of definitive HSC production. Finally, given the reduced production of intra-aortic hematopoietic clusters and reduced functional potential of FL and BM HSCs in Gata2R396Q/+, we hypothesized that the absolute number of HSCs was reduced into adulthood. To test this, we performed a limiting dilution transplant from adult BM (Fig. 7E). Indeed, the number of predicted HSCs was significantly reduced from ~1700 HSCs in WT BM to ~570 HSCs in Gata2R396Q/+ BM (Fig. 7F). These data show that the Gata2R396Q/+ mutation results in a reduction in definitive HSC production at the earliest stage of hematopoietic development, resulting in downstream effects on the hematopoietic system.

Fig. 7: Gata2R396Q/+ negatively affects definitive HSC production.
figure 7

A Schematic for transplantation of E11 AGM from WT or Gata2R396Q/+ into irradiated recipients. B Peripheral blood CD45.2+ chimerism in recipients of AGM from E11-E11.5 (42-48 somite pairs (sp)) WT or Gata2R396Q/+ embryos. Adequate reconstitution defined as ≥ 1% CD45.2+ (n = 14–23 recipients). C Representative imaging of intra-aortic hematopoietic clusters (IAHC) in E10.5 AGM (35–37 sp) of WT or Gata2R396Q/+ embryos (blue, c-KIT; purple, CD31; red, IAHC). D Quantification of IAHCs in “C” using automatic, semi-automatic, or manual methods (n = 4 embryos per condition). E Schematic of limiting dilution transplant of whole BM from WT or Gata2R396Q/+ mice, 8–10 weeks old. F Limiting dilution analysis plot (n = 6–10 recipients per dilution). Analysis performed using Extreme Limiting Dilution Analysis (ELDA) [65]. *p < 0.05. E11 embryonic day 11, IAHC intra-aortic hematopoietic cluster.

Discussion

Here, we report the generation of a mouse model of the recurrent GATA2 deficiency mutation, Gata2R396Q/+. Through phenotypic, functional, and transcriptional analysis, we show that mice with the R396Q mutation generate fewer definitive HSCs that have reduced function at embryonic and adult timepoints, possibly due to an accelerated aging phenotype resulting in myeloid-bias and stem cell exhaustion (Fig. 8). These observations make the Gata2R396Q/+ mouse an intriguing model for leukemic development in GATA2 deficiency. For instance, aberrant hematopoiesis begins in utero in our model, which will allow for modeling of leukemic progression during developmental stages more characteristic of human disease, such as early postnatal and adolescence. The decrease in definitive HSC production and lifelong numbers of HSCs as shown by our limiting dilution assay suggests that the clonal complexity of the hematopoietic system may be reduced. While this will need to be further validated experimentally through clonal tracing studies, reduced clonal complexity may provide an impetus for accelerated progression after the appropriate secondary events are applied to Gata2R396Q/+ cells. Likewise, an aged hematopoietic compartment would be more susceptible to transformation. Finally, the enrichment of mitochondrial activity terms in Gata2R396Q/+ progenitors is similar to observations seen in transgenic Erg/Gata2+/- and Erg/Gata2L395V/+ mouse models [61], which exhibited accelerated AML. Our future studies will focus on modeling leukemic development by adding putative drivers of malignancy in this disease, such as loss of orthologous chromosome 7 genes or mutations in Asxl1, Setbp1, and Stag2 [23], followed by identifying therapeutic strategies to ameliorate disease progression.

Fig. 8: Gata2R396Q/+ disrupts normal hematopoiesis.
figure 8

During HSC generation at around embryonic day 10 (E10), Gata2R396Q/+ mice produce fewer definitive HSCs in the aorta gonad mesonephros (AGM). HSCs show reduced functional potential in the fetal liver at E12.5-E14.5, as well as intrinsic biases toward particular HSPC populations. During adulthood, the continued effects of dysregulated HSC generation manifest as reduced HSC function, biases toward LT-HSC and MPP2 production at the expense of ST-HSC production (driven by enhanced CD150 expression), myeloid bias, and suggestions of stem cell exhaustion/aging.

Apart from modeling transformation, many questions remain about the role of the R396Q mutation in preparing the hematopoietic system for transformation. Our scRNA-seq data suggests that Gata2 is upregulated in several BM HSPC populations of Gata2R396Q/+ mice. Therefore, the hematologic abnormalities seen in our model are not due to a lack of Gata2 expression as in heterozygous KO models. It will be important to determine if there are any changes in DNA binding ability or targets of the R396Q variant in vivo that can contribute to a pre-leukemic state. Indeed, it is known that upregulation of GATA2 in humans is found in hematologic malignancies [62]. Isolation of specific populations of Gata2HI cells from Gata2R396Q/+ HSPCs, such as the Lymp Prog population, will also be useful in answering questions about the cell-of-origin in this disease. Finally, we have only assessed the hematopoietic compartment in this study, but Gata2 is also expressed in the BM niche—especially endothelial cells [63]. Given the role of inflammation in MDS/AML progression [64], future studies will interrogate the niche components of Gata2R396Q/+ mice for potential contributions to a pre-leukemic state.