Introduction

Population-scale genetic screening have provided invaluable insights into genomic regions linked to human disease. Genome-wide association studies (GWAS) have highlighted thousands of genetic variants associated with a wide range of disorders and with disease severity or susceptibility. Interestingly, the vast majority of GWAS variants are located in the non-coding fraction of the genome, and are highly enriched in open chromatin regions corresponding to putative enhancers1,2. Enhancers are multifaceted distal regulatory elements enriched in transcription factor binding sites (TFBSs) which play essential roles in the spatio-temporal control of gene expression3,4,5,6. Mammalian genes are usually under the influence of multiple enhancers that can play overlapping as well as specific roles, to establish complex patterns of gene expression. Multiple enhancers, for instance, control the morphogen Sonic Hedgehog SHH expression, and SHH enhancers seem to individually contribute to SHH expression in specific tissues thereby displaying non overlapping functions6,7. Similarly, various non-redundant enhancers drive GATA2 transcription factor expression in distinct hematopoietic cell types, thereby conferring precise spatio-temporal control of GATA2 during hematopoiesis8. On the other hand, clusters of distal enhancers forming the so-called globin locus control region (LCR), act in combination to maximize globin genes expression in terminally differentiating erythroid cells5,9,10,11,12. Both the alpha and beta globin gene loci, and their associated distal enhancers, have been extensively characterized through (epi-)genetic studies and sophisticated genome engineering approaches (e.g. enhancer KO, genetic and epigenetic engineering), providing a clear in vivo functional role of the individual enhancers and their complex interactions in the context of the 3D genome landscape11,12,13,14,15,16,17. Although a growing number of high-throughput functional studies are currently emerging18,19,20,21, such a deep characterization is lacking for most gene loci, leaving the in vivo roles of the vast majority of individual enhancers unclear.

In humans, the fetal and adult hemoglobins (HbF and HbA, respectively) are expressed in a temporally ordered fashion during erythroid development22. The fetal-to-adult switch in hemoglobin production is tightly linked to the developmental silencing of fetal globin genes (HBG1/2) and activation of adult type β-globin genes (HBB) which occurs in adult erythroid cells by two mechanisms: (i) direct binding of multiple transcriptional repressors such as BCL11A23, ZBTB7A24, NFIA/NFIX25,26 at the HBG1/2 promoters and (ii) distal LCR enhancers looping to the HBB gene promoter, mediated by the GATA1/LDB1 transcription factor complex, directly and strongly activating β-globin expression14,16,27,28. In some individuals, this developmental switch is only partially effective, leading to persistence of HbF in adulthood, a benign condition known as hereditary persistence of fetal hemoglobin (HPFH)29,30. This developmental switch has therefore attracted a tremendous amount of attention since reactivation of HbF in HPFH is known to ameliorate the symptoms of β-thalassemia (BT) and sickle cell disease (SCD), and to mitigate disease complications31. GWAS hits linked to HbF levels have repeatedly been mapped to three genomic loci, the HBG promoter itself, the BCL11A intronic erythroid enhancer, and the chromosome 6q23 HBS1L-MYB intergenic region32,33,34,35,36,37,38,39,40. We and others have shown that the SNPs occurring at the HBS1L-MYB intergenic interval, which are highly associated with elevated HbF levels and clinically important human erythroid parameters (e.g., red blood cell count [RBC], mean cell volume [MCV], and mean cell hemoglobin [MCH] content), fall within distal enhancers that control MYB expression19,41,42. The HBS1L-MYB interval harbors multiple MYB distal enhancers that are bound by the GATA1/LDB1 chromatin looping complex in erythroid cells. LDB1 mediates long-range enhancer interactions with the MYB gene and stimulates its expression in erythroid progenitors5,41,42. We have shown that some of the 6q23 intergenic variants decrease enhancer activity through the attenuation of the GATA1 and LDB1 TF binding and enhancer-promoter looping, resulting in reduced MYB expression levels42. The highest scoring SNPs pointed at a specific enhancer located 84kb upstream of MYB (MYB − 84kb enh), which is conserved in the mouse (Myb − 81kb enh), suggesting that this enhancer could play a specific role in HPFH and could directly be involved in elevated HbF levels. However, the several HBS1L-MYB 6q23 intergenic variants which are clustered in a ~ 25kb region, are in linkage disequilibrium (LD), and cover an area overlapping with at least 6 putative enhancers19,33,42,43. Despite recent efforts to refine this regulatory region, fine-map the causative variants and narrow-down the region(s) involved39, this leaves the open questions of the role played by the individual MYB distal enhancers in MYB expression regulation and whether a single enhancer is causally involved in the HPFH phenotype and alleviates BT severity. To this aim, we generated a distal enhancer knock-out mouse model by deleting the Myb − 81kb enhancer (i.e. the equivalent of the human MYB − 84kb enhancer), and analyzed how it impinges on Myb expression in vivo, erythroid parameters, and globin switching. We further crossed the Myb − 81kb enh KO animals with a mouse model of β-thalassemia (Hbbth1/th1)44 and surveyed the impact of − 81kb enhancer deletion on the thalassemic phenotype.

Results

A unique transcription factor content demarcates the − 81kb Myb distal enhancer

The mouse Myb gene is controlled by a cluster of 5 distal enhancers located from − 36kb to − 109kb upstream of its transcriptional start site (TSS)41,45 (Fig. 1A). All enhancers display the typical chromatin signature of active enhancers and are enriched in H3K4me1 and H3K27ac. Examination of TFBS within the locus highlighted strong enrichment of the GATA1/LDB1 erythroid TF complex at all enhancers, as previously reported in mouse erythroid cells27,41,45,46. The human MYB locus displays a similar organization, with a cluster of distal regulatory elements located between − 63 to − 107kb upstream of the TSS42. These elements display strong chromatin accessibility (shown by ATAC-Seq signals)47,48, an active enhancer mark (H3K27ac) and GATA1/LDB1 binding (Fig. 1B).

Fig. 1
figure 1

Epigenetic landscape of the MYB locus in mouse and humans. The mouse (a) and human (b) MYB loci are shown with ChIP-Seq tracks of select TFs from published studies in mouse (KLF181, LDB127, GATA1, H3K27ac, H3K4me1, H3K4me312) or human cells (KLF121, GATA1 and H3K27ac19, LDB142). Human ATAC-Seq data are from47. Capture-C82 (mouse) or 4C-Seq83,84,85 data showing long-range interactions throughout the locus from the mouse Myb − 81kb enh, and human MYB − 84kb enh and MYB promoter viewpoints (VP) were performed in MEL cells (mouse) or UT7 cells (human). The BglII cut sites are shown underneath the 4C-Seq tracks (human locus). Position of the distal enhancers is indicated above the ChIP-Seq tracks (distance in kb relative to the transcription start site). The top scoring SNPs associated with HbF levels and clinical erythroid parameters are also shown (human track).

The mouse − 81kb enh sequence is orthologous to the human − 84kb enh, which harbors the genetic variants most highly associated with HPFH, fetal globin levels and variations in clinical erythroid parameters42. Remarkably, both the mouse − 81kb enh and human − 84kb enh stand out as being uniquely bound by KLF1, in addition to the GATA1/LDB1 complex, rendering them unique among the locus (Fig. 1A, B). Chromatin looping experiments confirm that both enhancers interact with the Myb promoter and first intron regions (Fig. 1A,B), supporting a role in Myb regulation. Since KLF1 is a major regulator of erythropoiesis, this suggests that the − 81kb/− 84kb enh may play a specific role within the locus.

The mouse − 81kb enh/human − 84kb enh are erythroid-specific enhancers

Since MYB is expressed in most hematopoietic progenitors, including hematopoietic stem cells (HSCs), myeloid and lymphoid progenitors49,50,51,52,53, we sought to determine whether the − 81/− 84kb enh are broadly active throughout hematopoiesis or whether they display lineage specificity. We analyzed published chromatin accessibility datasets (ATAC-Seq) from multiple primary human and mouse hematopoietic cell types (Fig. 2A, B)54,55, in addition to datasets specifically focused on erythropoiesis, including terminal differentiation47 (Fig. 3). Strikingly, chromatin opening at the − 84kb enh is completely absent from HSCs, non-erythroid myeloid cell types (e.g. GMPs, monocytes) and lymphoid lineages (NK, B cells, CD4 and CD8 T cells), and is restricted to erythroid committed cells (MEPs and erythroid progenitors), with low levels in CMPs (Fig. 2A). This absence of chromatin opening is likely not due to inactivity of the MYB locus outside erythroid-committed cells as the MYB gene is transcribed in most of these cell types. Besides, other putative regulatory regions such as the − 92kb site appear to be broadly active in hematopoiesis, with clear signs of chromatin opening in HSCs, immature progenitors (MPP, LMPP), and both myeloid (CMP, GMP) and lymphoid committed progenitors, indicating that the locus is active in these cell types. Similar observation was done using mouse hematopoietic cells, with − 81kb enh chromatin opening being restricted to erythro-myeloid cells, MEP, erythroid and megakaryocytic progenitors (Fig. 2B). Further analysis of hundreds of human (n = 171) or mouse (n = 279) cells and tissue types from the ENCODE consortium using the SCREEN database56 confirmed that the mouse − 81kb/human − 84kb enhancer display chromatin accessibility uniquely in erythroid cells as compared to other hematopoietic cell types and non-hematopoietic tissues (Supplementary Figures S1 and S2). We next looked specifically within the erythroid lineage using datasets derived from ex vivo differentiation of human CD34 + hematopoietic stem/progenitor cells (HSPCs)47. Chromatin accessibility at MYB − 84kb enh becomes apparent at the early erythroid progenitor stage (Bust forming unit erythroid, BFU-E) and gradually increases up to the pro-erythroblast/early basophilic erythroblast precursor stages, and becomes progressively lost as cells fulfill terminal differentiation (late basophilic, polychromatic and orthochromatic erythroblast stages) (Fig. 3). Taken altogether these observations support the notion that the conserved mouse − 81kb/human − 84kb enhancer displays lineage-specific activity, with erythroid (and to a lower extent megakaryocytic) specificity.

Fig. 2
figure 2

The MYB chromatin landscape in human and mouse hematopoietic cells. (a) ATAC-Seq signals showing open chromatin regions in the human MYB locus from sorted human hematopoietic populations54 are depicted, showing erythroid specificity of the MYB − 84kb enhancer. (b) ATAC-Seq signals showing open chromatin regions at the mouse Myb locus from various mouse hematopoietic populations55 are depicted, showing Myb − 81kb enhancer chromatin opening being restricted to erythro-myeloid cells, MEP, erythroid and megakaryocytic progenitors.

Fig. 3
figure 3

MYB chromatin landscape during human erythropoiesis. Dynamic accessibility (ATAC-Seq) and differentiation stage-specificity of the human MYB − 84kb enhancer throughout terminal erythropoiesis (using CD34+-derived ex vivo differentiation)47.

Generation of a Myb − 81kb enh KO mouse model

The above-mentioned observations prompted us to investigate the in vivo role of the − 81kb enh and test whether its inactivation recapitulates the changes in erythroid parameters observed in humans harboring genetic variants at the − 84kb enh region. We defined the − 81kb enh as the 347bp-sized region occupied by the GATA1/LDB1 complex (Fig. 4A). The corresponding genomic sequence harbors the typical GATA1::TAL1 motif i.e. CTGN(6–8)WGATAR recognized by the GATA1/LDB1 complex, and the CACCC box motifs recognized by KLF1 (Fig. 4B). The − 81kb enh sequence was flanked by LoxP sites by homologous recombination in embryonic stem (ES) cells (Fig. 4C), which were used for blastocyst injections to generate Floxed − 81kb enh knock-in mice (see methods). These mice were crossed with a mouse line ubiquitously expressing the Cre recombinase to generate Myb − 81kb enh homozygous KO mice, referred to as − 81kb enhKO. The − 81kb enhKO animals were viable and fertile, and were obtained in a Mendelian ratio (Fig. 4C), indicating that removal of the − 81kb enh is not lethal.

Fig. 4
figure 4

Generation of the mouse Myb − 81kb enhancer knock-out. (a) The mouse Myb locus is shown with a zoom view of the − 81kb enhancer, and associated LDB1/GATA1 binding site, GATA and KLF1 DNA motifs, and location of the deleted region. (b) Position weight matrixes of the LDB1/GATA1 complex (TAL1::GATA1_MA0140.1) and KLF1 (KLF1_MA0493.1) from JASPAR, and their position within the deleted enhancer sequence. (c) strategy used to knock-out the − 81kb enh sequence in mouse embryonic stem cells, to generate the mouse Myb − 81kb enh KO. The Mendelian ratios of heterozygous and homozygous animals are shown.

Erythroid phenotype of the − 81kb enhKO mice

We first investigated erythroid cells development in the fetal liver (FL), representing the main hematopoietic organ at mid gestation. FACS analysis of embryonic day 13.5 (E13.5) FLs revealed decreased levels of c-Kit + progenitors in KO animals (Fig. 5A). Fine examination of differentiating erythroid populations based on the CD71/Ter119 surface markers revealed subtle but significant decrease in the most immature cells (R1, CD71-Ter119-), slightly increased R2 population (CD71+Ter119-) and a decrease in the most mature R4 cells (CD71-Ter119+) indicating that deletion of the − 81kb enh has a slight impact on erythroid maturation in FLs with a decreased or delayed output of mature red blood cell production (Fig. 5A, B). We next investigated the impact of − 81kb enh removal on adult hematopoietic populations. As expected from the chromatin accessibility patterns of the − 81kb enh, the bone marrow immature hematopoietic compartment defined as lineage-negative (Lin-), Sca1-positive (Sca+), Kit-positive (Kit+) cells (LSK) containing the hematopoietic stem/progenitor cells displayed no significant alterations (Fig. 5C and Supplementary Figure S3). Within the LSK cell population, both long-term, short-term HSCs (LT-HSCs and ST-HSCs, respectively), and multipotent progenitors (MPPs) were found in equal amounts in wild-type (WT) and KO animals, indicating that the stem and early progenitor cells were unaffected. Common lymphoid (CLPs), myeloid (CMPs), and granulocyte–macrophage (GMPs) progenitors were found in equal amounts in WT and KO animals. However, we detected a slight but significant decrease in Lin-/Kit+ (LK) cells, matching the observed decrease in Kit+ cells in FL, although total bone marrow cellularity was not significantly decreased (Fig. 5E, see Supplementary Figure S4 for bone marrow cell counts). We also observed a decrease in megakaryocyte-erythroid bi-potent progenitors (MEPs), consistent with the opening and putative activity of the − 81kb enh at this stage (Fig. 5C, D, and Fig. 2). Surprisingly, we detected an increase in the percentage of Ter119+ population representing differentiating erythroid populations (Fig. 5E). We next focused specifically on the Ter119+ erythroid compartment and analyzed terminal erythropoiesis as the ratio of immature/mature erythroblasts. Erythroid populations gradually acquire differentiation marker Ter119 and lose CD71 expression as they progress throughout terminal differentiation. The percentage of CD71highTer119med pro-erythroblasts (ProE), Ter119+CD71+FSChigh (EryA), Ter119+CD71+FSClow cells (late basophilic and poly-chromatic erythroblasts, EryB cells), and Ter-119+CD71FSClow cells (orthochromatic erythroblasts and reticulocytes, EryC cells) were within the range of control mice (Fig. 5F, G). Accordingly, the erythroid maturation index was similar to controls (Fig. 5H). Taken altogether these data indicate that deletion of the − 81kb enh does not globally impair hematopoiesis, but affects the percentage of MEPs and of Ter119+ erythroid cells without altering the ratio of immature/mature erythroblasts in the adult bone marrow.

Fig. 5
figure 5

Analysis of fetal liver and bone marrow hematopoiesis of Myb − 81kb enhKO mice. (a) Analysis of c-Kit + populations and FACS analysis of CD71/Ter119 populations in E13.5 fetal livers from WT and KO mice. (b) Representative FACS plots of WT and KO animals. (c) FACS analysis of various hematopoietic compartments from WT and KO animals. LK: Lin-Kit+, LSK: Lin-Sca1+Kit+, LT-HSC: long-term HSCs, ST-HSC: short-term HSCs, MPP: multipotent progenitors, CLP: common lymphoid progenitors, CMP: common myeloid progenitors, GMP: granulocyte/macrophage progenitors, MEP: megakaryocyte/erythroid progenitors. (d) representative FACS plots of WT and KO GMP, CMP and MEP populations based on CD34/CD16 surface markers. (e) Bone marrow cellularity and frequency of Ter119+ erythroid cells. (f-g) Erythroblast differentiation in bone marrow evaluated by CD71 and Ter119 staining, with the (h) bone marrow erythroid maturation assessed by CD71 and forward scatter area on Ter119 + erythroblasts. All data are expressed as the mean ± s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001.

Erythroid parameters of Myb − 81kb enh KO animals and recovery from hemolytic anemia

Since genetic variants at the human − 84kb enh are associated with variations in clinical erythroid parameters57, we tested whether it is also the case in the mouse. We therefore monitored the erythroid parameters of − 81kb enhKO animals from circulating blood sampling in adult (7–10 weeks old) animals. As expected, white blood cell (WBC) counts were unaffected (Supplementary Figure S5). Concerning erythroid cells, whereas red blood cell (RBC) counts, hematocrit (HCT), hemoglobin (Hb) concentration, red cells distribution width (RDW), mean corpuscular Hb (MCH), and MCH concentration (MCHC) were within similar ranges as in control animals, mean corpuscular volume (MCV) values in KO animals were significantly increased (Fig. 6A). Increased MCV could be the result of decreased MYB expression (see below) and accelerated erythroid differentiation, as suggested in previous studies42,43,58,59, which is compatible with our flow cytometry data showing modified immature/mature erythroblasts ratios (Fig. 5B, C). Interestingly, this mimics the human situation where HBS1L-MYB intergenic variants associate with increased MCV42,43,60,61,62,63,64, indicating that the − 81kb enhKO model recapitulates features observed in humans.

Fig. 6
figure 6

Circulating blood parameters and recovery from hemolytic anemia. (a) Erythroid parameters of WT and KO animals measured from circulating blood. RBC: red blood cells, HCT: hematocrit, Hb: hemoglobin, MCV: mean corpuscular volume, MCH: mean corpuscular hematocrit. (b) Mice were challenged with phenylhydrazine injections and monitored for recovery over a period of 17 days. Reb blood cell counts and MCV values are shown. All data are expressed as the mean ± s.e.m. *P < 0.05.

Since MYB is involved in sustaining erythroid progenitors proliferation, we tested whether − 81kb enh deletion alters progenitors capacity to support strong erythroid progenitors expansion following hemolytic challenge. Wild type and − 81kb enhKO animals were injected with phenylhydrazine (PHZ) to induce hemolytic anemia and were monitored during the recovery phase. In control mice, a ~ 50% reduction in RBC concentration was observed within the first 2 days following the PHZ challenge, followed by a gradual increase starting at day 3 to reach back normal values by day 10 to 14 (Fig. 6B). In contrast, KO animals displayed a prolonged period of anemia, with a delayed recovery phase starting only at day 6 post challenge (Fig. 6B). However, KO animals managed to fully recover by day 10–14 as in controls. Interestingly, consistent with the blood parameters analysis (Fig. 6A), MCV values in KO animals were continuously higher than in controls, both at day 0 and during the whole recovery phase. Taken altogether, these data suggest that the lack of the − 81kb distal Myb enh impairs progenitor expansion following an acute anemic stress, but does not prevent full recovery. This is suggestive of a reduced proliferative capacity upon Myb − 81kb enh deletion.

Decreased Myb expression in − 81kb enhKO fetal liver cells associates with impaired globin switching and decreased adult globins expression

Since complete Myb inactivation leads to an early block of definitive erythroid development at the fetal liver (FL) stage65, we monitored Myb expression levels in − 81kb enhKO sorted FL erythroid populations (Fig. 7A). As shown in Fig. 7B, Myb levels were decreased in immature erythroid progenitors, indicating that the − 81kb enh is required to reach maximal expression levels in early erythroid differentiation. In physiological conditions, Myb expression dramatically decreases as cells terminally differentiate41,53,66,67. We observed a significantly stronger Myb silencing in KO animals, supporting a role of the − 81kb enh in the fine-tuning of Myb regulation in differentiating erythroid cells.

Fig. 7
figure 7

Adult and fetal globin genes expression in Myb − 81kb enhKO animals. (a) E13.5 fetal liver cells were sorted into four populations based on the CD71 and Ter119 surface markers. Sorted cells were used to perform qRT-PCR analyses of (b) Myb, (c) embryonic globins Hbb-bh1, Hbb-y, (D) adult β-globin Hbb-b1, Hbb-b2, and (E) adult α-globin Hba-a1 genes expression. All data are expressed as the mean ± s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001.

Since MYB levels strongly correlate with fetal globin reactivation in humans42,43,68, we next analyzed the embryonic beta-like globin genes (Hbb-bh1, Hbb-y) expression in KO animals. We detected significant upregulation of the hemoglobin Z embryonic beta-like chain Hbb-bh1 in − 81kb enhKO definitive erythroid cells, and a similar trend towards upregulation (although not statistically significant) of the hemoglobin Y embryonic beta-like Hbb-y (Fig. 7C). This result is in accordance with the increased HBF levels in individuals bearing HBS1L-MYB intergenic variants, and indicates that the − 81kb enhKO model recapitulates the partial globin switch reversal (or impaired fetal globin repression) seen in humans. Very interestingly, when measuring adult globin genes expression, we observed a consistent and significant decrease in β-globin (Hbb-b1, Hbb-b2), and α-globin (Hba-a1) genes (Fig. 7D, E). This is particularly interesting since excessive amounts of unpaired α-globin proteins in β-thalassemia are a cause of ineffective erythropoiesis, through the formation of toxic aggregates in erythroid cells69. Accordingly, reduction of α-globin content and increased fetal β-like globin genes in β-thalassemia are effective in reducing globin chain imbalance and ameliorate disease course. Our result therefore prompted us to investigate whether deletion of the − 81kb enh could, at least partially, rescue a thalassemic phenotype in the mouse.

Myb − 81kb enhKO ameliorates erythroid parameters in vivo in a mouse model of β-thalassemia

To study the role of the Myb − 81kb enh in hemoglobinopathies, we used a preclinical model of β-thalassemia intermedia, the Hbbth1/th1 mouse44. The Hbbth1/th1 β-thalassemic mice recapitulate several features of the human disease including chronic anemia and splenomegaly. The − 81kb enh KO mice and Hbbth1/th1 mice were bred to homozygosity and checked for disease-specific erythroid parameters in order to determine if inactivation of the Myb − 81kb enh ameliorates disease course (Fig. 8A). We first noticed that the splenomegaly associated with β-thalassemia in Hbbth1/th1 mice was significantly reduced in Hbbth1/th1::Myb− 81kb enhKO/KO animals (referred to as double KO) (Fig. 8B), indicating that − 81kb enh KO limits extramedullary erythropoiesis in the Hbbth1/th1 thalassemic context. Interestingly, we could detect in young adult double KO animals (7–10 weeks old) a slight but significant increase in RBC counts (7,512 ± 0,1446 N = 39 in Hbbth1/th1 animals versus 8,016 ± 0,1068 N = 60 in double KO animals) (Fig. 8C). Hematocrit levels were also weakly but significantly raised in double KO animals, together with total hemoglobin levels, and mean corpuscular hemoglobin (Fig. 8C). Surprisingly, none of these parameters differed significantly from control WT animals in Myb − 81kb enh KO mice (with the notable exception of the mean corpuscular volume, see also Fig. 6), indicating that inactivation of the − 81kb enh is phenotypically silent in steady state conditions, but provides a phenotypic advantage in stressed conditions such as β-thalassemia. However, it should be noted that, despite persistence of embryonic globin genes expression in adult Myb − 81kb enh KO mice (Supplementary Figure S6), none of the erythroid parameters could be rescued to the levels seen in control animals, indicating that Myb − 81enh KO on its own is not sufficient to fully rescue disease phenotype in this model of β-thalassemia.

Fig. 8
figure 8

Phenotypic impact of Myb− 81kb enhKO in Hbbth1/th1 thalassemic mice. (a) experimental set-up, (b) spleen weight of control, Hbbth1/th1, and Hbbth1/th1::Myb− 81kb enhKO mice showing decreased splenomegaly. (c) Circulating erythroid blood parameters of 7–10 weeks old adult animals from the different genotypes. All data are expressed as the mean ± s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001.

Discussion

GWAS studies have been invaluable to identify key genomic loci responsible for disease susceptibility and heritability. However, these studies usually highlight rather large genomic areas and fail to precisely delineate the regulatory regions involved. Genomic variations at the HBS1L-MYB locus are such an example. Multiple SNPs in linkage disequilibrium are located in this intergenic region, and are associated with HPFH, a feature ameliorating disease symptom in some of the most common human genetic disorders worldwide, namely sickle cell disease and β-thalassemia. Because common and rare associated variants cover a large region encompassing multiple putative enhancers, functional assessment of the different variants and precise determination of the regulatory elements involved remained enigmatic.

Published data from us and others have shown that the Myb − 81kb enh, which is the orthologue of the human MYB − 84kb enh, represents a peculiar enhancer within the locus. First, recent genetic data investigating variants linked to HPFH have robustly identified the human MYB − 84kb enh as key regulatory site linked to MYB regulation and fetal globin reactivation39,40. Second, epigenomic data highlighted the mouse − 81kb enh/human − 84kb enh as being specifically bound by both the GATA1/LDB1 complex and KLF1, a master regulator of terminal erythroid differentiation and globin genes regulation41,70,71. Third, saturating mutagenesis coupled to functional analyses highlighted clear MYB regulatory potential at the − 84kb enh mediated, at least in part, by GATA1 in human cells19. Fourth, interestingly, a previous report72 and a recent study45 reported that the mouse Myb − 81kb enh harbors the transcriptional initiation site of the Myrlin long non-coding RNA, shown to be involved in the cis-regulation of Myb expression by recruiting or stabilizing the MLL/COMPASS histone methyltransferase complex and stimulating RNA polymerase pause release to promote Myb transcription45. Collectively, these observations point at the − 81kb/− 84kb enh as being a key candidate for the observed erythroid phenotypic variations linked to the top scoring HBS1L-MYB variants. We therefore unequivocally tested its function by deleting the − 81kb enh sequence from the mouse genome and assessing its functional impact in vivo on erythropoiesis, globin genes expression and thalassemia.

We show here that the − 81/84kb enh displays erythroid specificity as judged by its epigenetic accessibility profile throughout human and mouse hematopoiesis. Interestingly, its deletion is well tolerated in vivo and Myb − 81kb enh KO animals were viable and fertile contrarily to Myb KO mice. This shows that this enhancer is not dominant within the Myb locus in vivo, and suggests that functional redundancy likely exists between the various Myb distal enhancers, buffering the effects of − 81kb enh deletion. Alternatively, this may indicate that the − 81kb enh is a weak enhancer and has little overall impact on Myb expression. Interestingly, in a recent study, in-depth dissection the α-globin enhancer cluster has shown that enhancers with seemingly weak enhancer potential (such as the R4 α-globin enhancer) may actually play key roles as ‘facilitators’ by potentiating the activity of the strong typical enhancers (i.e. R2 enhancer) within the locus, although deletion of the R4 enhancer leads to very little impact on α-globin expression11,12. Interestingly, a facilitator-like element was also described in the β-globin enhancer cluster (i.e. HS1), suggesting that such atypical enhancers with weak apparent transcriptional activation capacity may be widespread in mammalian genomes12. Accordingly, we found only moderately decreased Myb transcript levels in differentiating erythroid cells in KO animals. Whether the − 81kb enh belongs to this new class of regulatory elements and potentially act as a facilitator element rather than a typical enhancer remains to be explored. Interestingly, this weak transcriptional effect was also observed in individuals bearing the rs66650371 variant, a 3bp deletion altering LDB1/GATA1 complex binding at the − 84kb enh, which in combination with other SNPs in LD show only moderate decrease in MYB expression levels42. This observation contrasts with previous reports suggesting that Myb − 81/− 84kb enh is critical for erythroid cell fitness and Myb expression19,45. A major difference accounting for these discrepancies is the use of cell lines which may be overtly addicted to MYB levels, as reported for transformed leukemic cell lines73, or the use of ex vivo cultured primary cells that may expand under stressed conditions in the absence of proper tissular supporting microenvironment. By contrast Myb − 81kb enh KO mice do not show signs of anemia or overt alteration of the erythroid compartment suggesting that the microenvironment may compensate for a slight decrease in erythroid progenitors’ fitness in vivo.

It remains unclear why and how variations at this enhancer associate with fetal globin reactivation, but it is interesting to note that as a transcription factor, MYB was shown to target both the globin gene loci themselves, and some of the most potent fetal globin regulators such as BCL11A, KLF1, SOX642,67. A very interesting aspect of our findings is the discovery that in addition to embryonic/fetal globin genes reactivation, Myb − 81kb enh KO associates with reduction of adult α-globin expression. This is particularly relevant in the case of β-thalassemia as it was shown that excess unpaired α-globin proteins form cytoplasmic aggregates that trap the HSP70 chaperone, which becomes limiting in thalassemic cells. This, in turn, results in the failure to protect the master regulator GATA1 against caspase-mediated cleavage, leading to decreased GATA1 levels and ineffective erythropoiesis69,74. Thus, one of the beneficial effects of Myb − 81kb enh KO may be to partially rescue the α/β globin chains imbalance, thereby ameliorating terminal erythropoiesis. When Myb − 81kb enh KO animals were crossed with Hbbth1/th1 thalassemic mice, a marked reduction of splenomegaly was observed, which was accompanied by a significant amelioration of the erythroid parameters, red blood cells counts and hematocrit in young adults, albeit at levels that did not reach wild-type control levels. How the slight reduction of Myb expression in − 81kb enh KO mice leads to these moderate phenotypic improvements in vivo remains to be uncovered. Importantly, it is currently not known if these improvements only originate from MYB’s impact on fetal(embryonic)/adult globin ratios, or if additional deregulated MYB direct target genes collectively provide additive beneficial effects besides the observed changes in globin levels. MYB is known to be a potent regulator of cell cycle53,66,75,76 and under reduced MYB levels, erythroid progenitors display cell cycle kinetic perturbations19,68,77, which impact terminal erythropoiesis. Altering cell cycle at these terminal differentiation stages may likely lead to increased fetal hemoglobin as a result of failure to fully switch to adult hemoglobin expression, as proposed in earlier models of HbF reactivation78. Accordingly, altered cell cycle progression with accelerated erythroid differentiation were reported in HPFH individuals with decreased MYB expression42,58,68. On the other hand, our recent CUT&RUN experiments showed strong enrichment of cell cycle genes and key cell-cycle regulators as direct MYB targets in murine erythroid progenitor cell lines67. These studies, together with results reported from hypomorphic Myb alleles in the mouse79,80, provide evidence for the tight links existing between fine-tuned MYB levels, cell cycle regulation, coordination of globin genes expression, and erythroid differentiation. In our mouse model, reduction of MYB levels likely results in accelerated differentiation due to reduced number of proliferation cycles. This would be consistent with higher MCV values as the erythrocytes are younger red cells, explaining why we consistently observed circulating red blood cells with increased MCV indices in Myb − 81kb enh KO animals (Figs. 6 and 8). Importantly, this reflects what is happening in patients with MYB-associated HPFH42,43,60,61,62,63,64.

Whereas the precise perturbations of gene regulatory networks downstream of MYB, together with the genome-wide molecular mechanisms at play remain to be determined, and are outside the scope of this study, our data document for the first time the in vivo role of a key Myb enhancer in the context of erythroid development, steady state and stressed erythropoiesis, and in disease. It provides a unique model to further dissect the in vivo roles of this conserved GWAS-highlighted Myb enhancer and to gain deeper mechanistic insights into the role of MYB in the control of terminal erythroid differentiation and β-hemoglobinopathies.

Methods

Animals

All experiments were conducted by authorized personnel in accordance with the guidelines of the European Community and approved by the French Ministry of research under approbation numbers APAFIS 18,144 v2. The authors complied with the ARRIVE guidelines. Myb − 81kb enhKO and Hbbth1/th1 mice were housed and maintained under pathogen-free conditions in the Montpellier RAM-PCEA / RAM-ZEFI animal facilities. The Hbbth1/th1 mouse model originated from a natural occurring deletion of the gene encoding β-globin44, and were a kind gift from E. Payen, CEA Fontenay-aux-Roses, France. Mice genotypes were determined by PCR. All mice were on a C57BL/6 background, and both sexes were used without differences between males and females being observed.

Generation of the Myb − 81kb enhKO mouse model

Homologous recombination of mouse embryonic stem cells (ESCs) was performed using the targeting construct shown in Fig. 4C. Proper targeting was confirmed by PCR and southern blotting. Two independent targeted ESC lines were injected into blastocysts, and one gave germline transmission. F1 mice were bred with mice ubiquitously expressing the Flp recombinase to remove the neomycin resistance marker. These animals were further crossed with mice ubiquitously expressing the Cre recombinase to induce deletion of the Myb − 81kb enh sequence. Removal of the − 81kb enh sequence was confirmed by PCR.

Blood parameters analysis

Blood samples (50 μL) were collected from facial veins to EDTA-coated tubes. Complete blood counts were measured on scil Vet abc Plus + (Horiba) according to the manufacturer’s instructions.

Induction of hemolytic anemia and blood parameters analysis

Mice received sterile phenylhydrazin (PHZ) solution intraperitoneally on day − 1 and day 0 with an 80 mg/kg body weight dose. Blood samples (50 μL) were collected from day 0 to day 17, from facial veins to heparin-coated tubes to monitor the recovery of erythropoiesis post PHZ injection.

Flow cytometry

Flow cytometry analyses were performed on bone marrow cells (from tibias and femurs) or spleen cells from adult mice and fetal liver cells from E13.5 embryos. Antibodies used were listed in Table 1. Cells were analyzed with a BD FACS-Quanto II or sorted with a BD FACS Aria.

Table 1 Flow cytometry antibodies used.

RNA preparation and real-time polymerase chain reaction analysis

Total RNA from sorted cells were extracted using HP RNA isolation kit (Roche) according to manufacturer’s instructions. RNA were primed with oligo(dT) and reverse transcribed with SuperScript II (Invitrogen). Real-time polymerase chain reaction was performed using LightCycler® 480 SYBR Green I Master mix (Roche). The sequences of the primers used for PCR amplification are listed in Table 2.

Table 2 qRT-PCR primers used.

Statistical analyses

Statistical analysis was performed using GraphPad Prism 5.0.0 software. Unpaired Student’s t-test was used to determine the statistical significance of the difference between two groups. Results with p-values ≤ 0.05 were considered as significant (* = P < 0.05, ** = P < 0.01, *** = P < 0.001, **** = P < 0.0001, ns = not significant).