Introduction

Tet family methylcytosine dioxygenases recognize and oxidize 5-methyl-cytosine (5mC) to 5-hydroxymethylcytosine (5hmC)1,2,3. This oxidation event modifies DNA to enable dynamic alterations in chromatin signatures and DNA accessibility from transcriptionally inaccessible (5mC) to accessible (5hmC) states4,5,6,7,8. 5hmC is thought to serve both as an intermediate in the DNA demethylation pathway7 and as a stable epigenetic mark that can accumulate in differentiated cells, particularly neurons9,10,11,12,13. 5hmC can be deposited throughout the genome and several recent studies have shown 5hmC deposition in enhancers4,5,7 and gene bodies11,14 during organ and tissue development. Indeed, during organogenesis, Tet-mediated 5hmC and DNA demethylation occurs across enhancers, many of which have been shown to regulate key developmental genes4,5,6,7,15,16. There is also evidence that Tet-mediated demethylation of enhancers promotes the expression of components of conserved developmental signaling pathways4,17. As tissues differentiate and mature, 5hmC accumulates on gene bodies to promote constitutive expression of genes encoding proteins that facilitate terminal differentiation and/or the functions of terminally differentiated cells14,18,19,20. Beyond enhancers and gene bodies, Tets have also been shown to regulate demethylation events in other genic regions whose demethylation is necessary for normal differentiation21,22. 5hmC deposition varies depending on tissue type, differentiation status, environmental perturbation, age, and disease progression4,5,7,14,18,19,23,24. Functionally, Tets are required for the development of many distinct tissues and cell types including B cells6,16, hematopoietic stem cells25, the heart5 and the retina26. Finally, there are three Tet genes from zebrafish to humans and the proteins they encode all function in DNA hydroxymethylation. However, Tet proteins possess some structural differences from one another, individual Tets interact with a variety of additional proteins to achieve their functions, and additional protein-specific cellular functions have been identified that include roles in modulating RNA hydroxymethylation and in regulating histone modifications and the assembly of chromatin regulatory complexes27,28.

The retina is an ideal tissue to elucidate the functions of Tet proteins. During development, retinal neurons arise from a population of multipotent retinal progenitor cells (RPCs) that first undergo specification to one of seven principal retinal cell classes: rods, cones, retinal ganglion cells (RGCs), amacrine cells, horizontal cells (HC), bipolar cells (BC) and Muller glia. Once specified to a particular cell class, RPCs then undergo further rounds of specification into distinct subtypes within these cell classes. Once specified, RPCs subsequently complete differentiation, morphogenesis and maturation programs to generate the precise retinal architecture and circuitry that underlies vision29,30,31. The molecular underpinnings of RPC specification and differentiation events have been the focus of decades of elegant studies, with recent experiments leveraging single-cell data to further understand the gene regulatory networks underlying retinal cell type diversification32,33,34,35,36.

While much focus has been placed on the gene regulatory networks facilitating retinal development, epigenetic processes have also been shown to contribute to retinal cell type specification and differentiation events; however, their precise roles are less well understood37,38,39. Previous work from our lab identified a role for tet2 and tet3 during retinal development, with tet2−/−;tet3−/− retinae displaying defects in the formation of differentiated retinal neurons26. While tet2−/−;tet3−/− retinal phenotypes are suggestive of critical roles during retinal development, the mechanisms through which they influence retinal neuron formation remain unknown. With this in mind, we utilized single cell RNA-sequencing (scRNA-seq) to better understand tet2 and tet3 function during retinal development. Our analyses revealed that Tet proteins play a critical role in regulating the fates of retinal cell types and subpopulations arising from RPCs as well as cell type-specific terminal differentiation events during retinal development.

Methods

Animals

All procedures were approved by the University of Pittsburgh Animal Care and Use Committee (IACUC) and were performed by following AALAC guidelines and regulations and in accordance with ARRIVE guidelines. tet2au59 and tet3au60 were used in all experiments, which were generated in our lab previously26. tet2;tet3 mutation is embryonic lethal by ~ 9dpf. Euthanasia was performed using 0.3 g/L tricaine/MS222 immersion for approximately 10 min.

Sample collection and scRNA-seq

Detailed protocols for dissection, dissociation, count, and viability analysis are available in Heilman et al. 202540, and briefly summarized here. 48 and 72hpf samples were dissociated with trypsin, while 36 and 120hpf samples were dissociated with papain (using a protocol modified from34). Between 12–17 eyes were collected for each sample. scRNAseq datasets were generated from tet2−/−tet3−/− and sibling control zebrafish (sibCTL), which included all other genotype combinations. 18 total samples were collected from nine independent experiments; samples isolated were as follows from sibCTL and tet2−/−tet3−/−: 36hpf (N = 2), 48 hpf (N = 2), 72hpf (N = 2), and 120hpf (N = 3).

Count, viability, loading

A 5 µg/mL acridine orange (AO), 100 ug/mL propidium iodide (PI) solution in PBS was used for cell count and viability assays. The AO/PI setting was utilized on the Denovix CellDrop automated cell counter. 10 uL of each single-cell suspension and 10 uL of AO/PI solution was mixed and loaded into the counter. Cell count and viability measurements were collected twice per sample, and the lower of these was used to determine loading volume for 10× Genomics chips. Samples were only utilized if they showed a viability > 80%. Samples were loaded into 10× Genomics v3.1 Dual Index Chip System. The 10 × Genomics protocol was followed as recommended by the manufacturer.

QC and sequencing

cDNA and cDNA libraries were assessed for quality and quantity by the UPMC Genome Center using qubit and qPCR, respectively. A SP-100 (800 million reads) sequencing run was performed prior to high-depth sequencing to better inform proportional loading. Using informed proportional loading, see40, 18 libraries were sequenced using NovaSeq S4-200, generating ~ 10 billion reads.

scRNA-seq analysis

Data processing and quality control

The cellranger count pipeline (version 6.1.2) was used for alignment, using the zebrafish reference genome (Danio rerio.GRCz11.106) and default arguments. To remove empty droplets and low-quality cells, we applied emptyDrops, followed by graph-based clustering41. Cells from clusters with abnormally low log-detected genes were manually annotated as “low quality”, while remaining cells were labeled “good quality”. Then, sample-specific thresholds for detected genes were calculated to determine the more tailored retention of cells in each dataset. For each sample, thresholds for the number of detected genes were determined using methods outlined in42. Cells that show numbers of detected genes below the sample-specific calculated thresholds, levels of mitochondrial gene expression > 3 median absolute deviations (MADs) from the median, and library sizes > 4 MADs from the median were removed. Additionally, genes that are not expressed in > 1% of cells in at least one sample were removed. The R package decontX was used to correct for ambient RNA contamination43, and hybrid (scds, R package) was used to remove doublets44.

Batch correction

The top 2000 highly variable genes were identified using the modelGeneCV2 (scran, R package)45 and samples were integrated using multiBatchNorm (batchelor, R package)46. We then used fastMNN (batchelor, R package) to produce a batch-corrected matrix of reduced dimensionality, which was used for downstream visualization, clustering, and nearest neighbor discovery.

Clustering and cell type annotation

Clusters in all analyses were performed using the louvain clustering method within clusterCells (scran, R package). For initial cell type annotation, sibCTL and tet2−/−;tet3−/− cells were assigned together. To identify cell types, AUCell (R package) in conjunction with cell type-specific gene set lists were used to iteratively identify cell type identities. Gene set lists used for cell type determination include 131 eye cell type-specific gene sets reported by Raj et al.,35 plus 2 additional cell type-specific gene sets based on literature searches of on and off bipolar (BC-ON and BC-OFF) gene expression, and displaced amacrine cell gene expression (dAC a subset of AC). From these 134 gene sets we aimed to identify 17 eye-associated cell types35. Genes associated with the BC-ON gene set include rgs11, nyx, isl1, trpm1a, gnb3a, si:ch211-160f23.7, pvalb8, abhd3, prkcaa, and gnao1b. Genes associated with the BC-OFF gene set include si:ch211-232m10.6, zgc: 112332, fezf2, slc1a9, six3b, and neto1. Genes associated with the dAC gene set include sox2 and isl1. AUCell was used to score cells for each gene set and then assigned identity calls for each cell type based on gene set calls. Cells whose identities corresponded to two cell types were resolved manually, and those without any identities or with more than two identities were assigned using knn analyses with a k value of 5. Final cell types include cone/rod/PRP for photoreceptor, horizontal/amacrine/retinal ganglion cell for neuroblast, and on/off/early bipolar cells (Fig. S1). Early bipolar cells (BC-early) were distinguished from BC-ON and BC-OFF by running AUCell scoring for progenitor gene sets only on bipolar cells. Bipolar cells scoring above 0.08 for the progenitor signature were called BC-early. Including BC-early, cells within our dataset were assigned to one of 18 cell types.

Computational separation of non-retinal cells, including non-retinal progenitor cells

The following cell types were removed from downstream analysis of retinal cells: mesoderm, epidermis, pigment, cornea, lens, immune cell, erythrocyte, and non-RPC progenitor cells. Additional analysis was performed to identify and then separate retinal progenitor cells (RPCs) from non-retinal progenitor cells, which give rise to other non retinal cell types. To identify RPCs within the broader population of progenitor cells in the eye, progenitor cells from all timepoints and both sibCTL and tet2−/−tet3−/− genotypes were integrated, and dimensionality reduction and clustering were performed. Cluster-specific differentially expressed genes were calculated to determine the lineage biases of each progenitor cell cluster using findMarkers (scran, R package, pval.type = ”all”, test.type = “wilcox”)  (S1 Table). Literature searches were performed to identify lineages related to cluster specific DEGs as follows: lens progenitor genes, pigment progenitor genes, and anterior segment/cornea progenitor genes (S2 Table). Expression of genes related to retinal function were used to establish clusters clearly composed of RPCs. These genes include the early progenitor genes npm1a, notch1a, vsx2, and notch1b, proliferative genes her4.1, her4.2, her4.4, specification genes atoh7, foxn4, crx, vsx1, otx5, insm1a, isl1, prdm1a, tfap2a, tfap2b, and the Muller glia-specific gene gfap. Indeterminate clusters were counted as RPCs, considering the possibility that the tet2−/−tet3−/− genotype could manifest as abnormal gene expression that could make RPCs more difficult to identify. Non-retinal progenitor clusters 1 (lens progenitor), 16 (corneal progenitor), and 18 (pigment progenitor) were removed from subsequent analyses.

Differential abundance analysis and imbalance scoring

Differential abundance (DA) analysis was performed using the edgeR package as described in (http://bioconductor.org/books/3.16/OSCA.multisample/differential-abundance.html)47,48. We performed DA on retinal cell types at each time point (36, 48, 72, and 120 hpf) and on cell type-specific clusters at 120hpf. Briefly, we calculated cell type (or cluster) abundance for each sample. Cell types (or clusters) with low abundance were retained. A negative binomial dispersion was then estimated for each cell type (or cluster) with the estimateDisp function with trend = none. Then, the quasi-likelihood dispersion was computed using glmQLFit with abundance.trend = false. Finally, we used glmQLFTest to perform empirical Bayes quasi-likelihood F-tests to test for significant differences in cell type abundance between sibCTL to tet2−/−;tet3−/− samples, with p-values derived from the F-statistic. Imbalance score calculations were based on49. First, we used the top 8 corrected principal components to find each cell’s 10 nearest neighbors (RANN, R package). Then, an initial score was derived from the ratio of sibCTL to tet2−/−;tet3−/− cells in each cell’s neighborhood. The probability of observing at least that number of sibCTL cells in the neighborhood (i.e., p-value) is calculated using a multinomial distribution with priority given by the global ratio of sibCTL to tet2−/−;tet3−/− (R Core Team 2022). These p-values are then converted into normal distribution values using a quantile function with a mean of 0 and standard deviation of 1 (R Core Team 2022). More negative values indicate enrichment for tet2−/−;tet3−/− and more positive values indicate enrichment for sibCTL. Finally, to smooth scores, a generalized additive model (GAM) was fit to predict the scores given the corrected principal component coordinates50. Predicted scores from this GAM were the final smoothed imbalance scores used for visualization.

Differential gene expression, GO term analysis between sibCTL and tet2 −/− tet3 −/− cells

Differential gene expression between sibCTL and tet2−/−tet3−/− cells of each cell type at 120hpf, and between clusters of each cell type, were computed using the findMarkers function with pval.type = ”all” and test.type = “wilcox” (scran, R package). Scaled expression heatmaps were generated using DoHeatmap (Seurat, R package) on scaled data. Top DEGs (p < 0.05, FDR < 0.05) upregulated in sibCTL or tet2−/−tet3−/− cells were input into ShinyGO51. Fold enrichment scores were calculated by ShinyGO.

Confocal microscopy

sibCTL and tet2−/−tet3−/− zebrafish were raised to 120hpf and were euthanized. Euthanized embryos were fixed in 4% paraformaldehyde, cryopreserved in 25% and 35% sucrose, embedded in Tissue Freezing Media (Scigen 23730625), and cryosectioned at 14 uM. Sections were placed in blocking solution (5% Normal Goat Serum, in PBS) for 2–8 h and exposed to anti-PKCɑ (sc-17769) at 1:250 at 4 °C overnight. Samples were then washed 3× for 10 min each with PBS at room temperature and incubated with goat anti-mouse AF-647(Invitrogen, A-21235) secondary antibody for 2 h. Samples were then washed 3× for 10 min each with PBS and stained with DAPI (D-9542) at 1:250 at RT for 10 min. Samples were then washed 3× for 10 min with PBS and mounted with DAPI Vectashield (Vector Laboratories, H-1200). Images were taken using an Olympus Fluoview FV1200 laser scanning microscope with a 40× objective, NA = 1.3 (Olympus Corporation). Cell counts were performed manually using ImageJ’s Cell Counter tool. PKCɑ+/DAPI+ cells represent PKCɑ+ cells, while PKCɑ-/DAPI+ cells represent the remaining retinal cells. Counts of PKCɑ+/DAPI+ cells relative to all DAPI+ cells in retinal regions of interest were performed on 4 sibCTL eyes and 5 tet2−/−tet3−/− eyes, from separate larvae. Cell proportions for each metric tested were statistically assessed using the Wilcoxon Rank-sum test.

Results

A single cell atlas of retinal development in tet2 −/−;tet3 −/− mutants

To identify molecular differences between tet2−/−;tet3−/− and control (hereafter referred to as sibCTL) retinae, we first sought to build single-cell atlases of the developing eye. Eyes were collected at 36, 48, 72, and 120 h post fertilization (hpf), timepoints that span early to late retinogenesis. Cells were then dissociated and utilized for 10× Genomics single cell RNA sequencing (Fig. 1A)40. 168,078 total cells were profiled: 89,797 from sibCTL eyes and 78,281 from tet2−/−;tet3−/− eyes. tet2−/−;tet3−/− eyes are microphthalmic and possess retinal defects but all other ocular tissues and structures are present in the mutant eye26. Here, scRNA-Seq datasets confirmed the presence of all expected retinal and non-retinal cell populations in both sibCTL and tet2−/−;tet3−/− eyes (Fig. S1). Non-retinal cells were computationally removed from the dataset, including non-retinal progenitor cells (Figs. S1, S2, S1 Table), leaving 71,178 sibCTL and 80,684 tet2−/−;tet3−/− retinal cells. Retinal cell types identified in this study include retinal progenitor cells (RPCs), retinal ganglion cells (RGCs), amacrine cells (ACs), horizontal cells (HCs), cones, rods, photoreceptor precursors (PRPs), Muller glia (glia), and bipolar cells (BCs). BC subpopulations were further subdivided to ON (BC-ON) and OFF (BC-OFF) subtypes, and a group of developmentally immature BCs that possessed high expression of progenitor-related genes and low expression of differentiation genes, which we termed early Bipolar cells (BC-early). Further information about cell type annotation can be found in Figs. S1, S2, S2 Table and “Methods”.

Fig. 1
figure 1

Single cell atlases of sibCTL and tet2−/−;tet3−/− retinae (A) Workflow for scRNAseq analysis of sibCTL and tet2−/−;tet3−/− retinae. Samples were collected at 36,48,72, and 120 h post fertilization (hpf) covering differentiation time windows of all retinal cell types; time window illustrations were based on several references:29,30,31,76,102,103,104,105,106. At each time point whole eyes were removed and digested into single-cell suspensions. Cells were assessed for viability, counted using the Denovix CellDrop Acridine Orange/Propidium Iodide (AO/PI) assay and loaded into a 10× Genomics v3.1 System. 18 libraries were sequenced concurrently. (B) UMAP projections of retinal datasets separated by developmental time point and genotype. Cell types are individually colored. sibCTL datasets generated 15,20,27, and 28 clusters at 36,48,72, and 120hpf, respectively. tet2−/−;tet3−/− datasets generated 17,13,22, and 24 clusters,at 36,48,72, and 120hpf, respectively. (CE) UMAP projections from pooled sibCTL and tet2−/−;tet3−/− retinal cells separated by (C) cell type, (D) age, and (E) imbalance score. (F) Expression dotplot of enriched cell type-specific genes for each retinal cell type calculated from the pooled scRNA-Seq dataset. Expression of enriched genes within each cell type and the percentage of cells expressing each gene within each cell type are plotted. Abbreviations: retinal progenitor cell (RPCs), retinal ganglion cell (RGC), early bipolar cell (BC-early), ON bipolar cell(BC-ON), OFF bipolar cell (BC-OFF), photoreceptor precursor (PRP).

The cellular composition of tet2 −/−;tet3 −/− retinae is disrupted

Studies in a variety of systems have demonstrated that loss of Tet function and/or 5hmC deposition results in alterations to specification and differentiation events in developing organs and tissues5,8,25,26. Indeed, our previous work showed that in the retina tet2−/−;tet3−/− mutants possessed defects that included reduced expression of cell type-specific terminal differentiation markers, the absence of an optic nerve and impaired morphogenesis of photoreceptor outer-segments. Given these data, we hypothesized that the application of scRNA-Seq would reveal alterations in cellular composition and differentiation states within tet2−/−;tet3−/− retinae. To begin to test this hypothesis, dimensionality reduction was performed on sibCTL and tet2−/−;tet3−/− cells to visualize retinal cell populations at each time point (Fig. 1B). Our sibCTL data revealed expected retinal cell populations and patterns of differentiation over time, consistent with previous studies of zebrafish retinal development (Fig. 1B)35,36. Additionally, sibCTL datasets revealed increasing cluster numbers and detectable cell types as differentiation proceeds, consistent with the observation that scRNA-seq population diversity increases as tissues mature35,36. Time point-separated sibCTL retinal datasets possess 14, 22, 27, and 28 unique clusters and tet2−/−;tet3−/− retinal datasets possess 16, 15, 20, and 23 unique clusters at 36, 48, 72, and 120hpf, respectively (Fig. 1B). Integrating all time points and genotypes together identifies all retinal cell types (Fig. 1C), with cell type composition becoming progressively more complex over time (Fig. 1D). However, imbalance scores revealed UMAP regions with unequal contributions of sibCTL or tet2−/−;tet3−/− cells when datasets are integrated (Fig. 1E). Regions of imbalance between sibCTL and tet2−/−;tet3−/− cells include subpopulations of RPCs and early specified neurons as well as differentiated neurons at 120hpf, suggesting that differences between sibCTL and tet2−/−;tet3−/− retinae emerge as differentiation proceeds. Stage-specific differences between sibCTL and tet2−/−;tet3−/− cells across the retinal dataset show disproportionate compositions of sibCTL cells in both early BCs and ACs at 48 hpf, and in specified BCs and photoreceptors at 72 hpf (Fig. S3), suggesting delayed specification and differentiation of many retinal cell populations in tet2−/−;tet3−/− retinae. Top cell type-specific genes measured from integrated sibCTL and tet2−/−;tet3−/− datasets across all timepoints were consistent with known cell type-specific marker genes (Figs. 1F, S3). These genes include gnat2 (cones), saga/sagb (rods), prdm1a (PRP), gnao1b (BC-ON), fezf2 (BC-OFF), marcksl1a (BC_early), rem1 (horizontal), slc32a1 (amacrine), elavl3 (RGC), rlbp1a (Muller glia), and hmgb2b (RPCs). Figure S4 represents proportions of raw counts of each retinal cell type in sibCTL and tet2−/−;tet3−/− samples, compiled from all scRNA-Seq datasets at each timepoint. These data indicate that retinal cell type identities are maintained in tet2−/−;tet3−/− retinae, and suggest that overall cellular proportions and differentiation times may be altered.

To quantitatively determine how the loss of tet2 and tet3 impacts cell type composition in the retina during development, we next performed differential abundance analysis (Fig. 2A; S4 Table). tet2−/−;tet3−/− retinae are microphthalmic and contain fewer cells compared to sibCTL; therefore, we were particularly curious to assess the differential abundances of retinal cell types between sibCTL and tet2−/−;tet3−/− retinae, while keeping in mind that tet2−/−;tet3−/− retinae contain fewer cells. At 48hpf, sibCTL retinae possessed a significantly higher proportion of BC-early cells than tet2−/−;tet3−/− retinae (p = 1.6 × 10–3). However, this appears to be due to a developmental delay in tet2−/−;tet3−/− mutants, as BC-early recover by 72 hpf, with tet2−/−;tet3−/− mutants showing higher abundance of BC-early than sibCTLs at this time (p = 3.23 × 10–2). Interestingly, at 120hpf, tet2−/−;tet3−/− retinae possess significantly more BC-ON (p = 1.32 × 10–2), suggesting both an overall delay in development as well as an alteration in the abundance of differentiated BC subtypes.

Fig. 2
figure 2

tet2−/−;tet3−/− retinae show alterations in cell type compositions during development. (A) Differentially abundant cell types at 36,48,72, 120hpf (p < 0.05, FDR < 0.1). (B) Immunofluorescent staining of PKCα+ BCs at 120hpf. Nuclei stained with DAPI (white). PKCα (magenta). Scale bar = 50 µm. (C) Quantification of proportion of PKCα+ BCs relative to all DAPI+ cells across the retina (p = 1.59 × 10–2). (D) Quantification of proportion of PKCɑ+ BCs relative to DAPI+ cells localized to the inner nuclear layer (p = 1.59 × 10–2). (E) Proportion of nuclei localized to the inner nuclear layer in sibCTL and tet2−/−;tet3−/− retinae (p = 3.43 × 10–1). (F) Proportion of nuclei localized to the ganglion cell layer in sibCTL and tet2−/−;tet3−/− retinae (p = 3.43 × 10–1). (G) Proportion of nuclei localized to the outer nuclear layer in sibCTL and tet2−/−;tet3−/− retinae (p = 4.13 × 10–1). For D-H, sibCTL N = 4; tet2−/−;tet3−/− N = 5.

Differentiated bipolar cells can be split into two distinct subtypes, ON (BC-ON) and OFF (BC-OFF), which are activated either in light or dark conditions, respectively52,53,54. PKCɑ is a marker for subsets of BC-ON in zebrafish53,54,55. Immunohistochemistry for PKCɑ in sibCTL retinae at 120hpf shows the normal pattern of PKCɑ+ BC-ON cells; the cell bodies line the INL and they send processes into the IPL, which segregate into distinct IPL sublaminae (Fig. 2B). To confirm that the expansion of BC-ON occurs in vivo, we next quantified PKCɑ+ BC-ON cells, at 120hpf in tet2−/−;tet3−/− mutants (Fig. 2B). Consistent with the scRNA-Seq predictions, tet2−/−;tet3−/− retinae possessed a higher proportion of PKCɑ+ BC-ON than sibCTLs (Fig. 2C, p = 1.59 × 10–2). This BC-ON expansion was also evident when only the inner nuclear layer (INL) was assessed (Fig. 2D, p = 1.59 × 10–2). Importantly, this is not simply a result of tet2−/−;tet3−/− mutants having a proportionally larger inner nuclear layer (INL) as we detected no significant differences in proportions of the INL (Fig. 2E), nor did we detect differences in the ganglion cell layer (GCL; Fig. 2F) or outer nuclear layer (ONL; Fig. 2G) in sibCTL and tet2−/−;tet3−/− retinae. These data highlight that BC-ON are expanded in tet2−/−;tet3−/− retinae. Interestingly, BC processes in the IPL were not well organized into distinct sublamina like in the sibCTL retina (Fig. 2B), highlighting additional defects in retinal morphogenesis in tet2−/−;tet3−/− mutants.

Other retinal cell types also showed differential abundance when comparing sibCTL and tet2−/−;tet3−/− retinae. Cone photoreceptor populations were reduced in tet2−/−;tet3−/− retinae at 72 and 120hpf (72hpf p = 5.07 × 10–4 and p = 1.61 × 10–2, respectively). We also observed several transient changes in the abundance of other cell populations at 72hpf including a reduction in Muller glia (p = 2.32 × 10–2), rods (p = 4.09 × 10–2), and BC-OFF (p = 1.75 × 10–2) in tet2−/−;tet3−/− retinae, and an elevation of HCs (p = 1.39 × 10–2) in tet2−/−;tet3−/− retinae. Taken together, these data indicate that BC and cone formation are significantly disrupted in tet2−/−;tet3−/− retinae, and that tet2−/−;tet3−/− retinae possess additional, but subtle and transient, defects in the formation of other differentiated retinal cell types.

Cone differentiation is impaired in tet2 −/−;tet3 −/− retinae

Cone defects were first reported in tet2−/−;tet3−/− zebrafish retinae26, with mutants lacking outer segments and showing reduced expression of several markers of cone differentiation (opsins, guca1c, guca1g, gnat2, grk1b). Similar cone defects were also recently identified in mouse Chx10-cre Tet1;Tet2;Tet3 conditional knockouts56. In alignment with these previous reports, our scRNA-Seq data show reduced cone cell numbers in tet2−/−;tet3−/− mutants (Fig. 2A,B). We next sought to determine if the tet2−/−;tet3−/− cones that do form exhibit signs of impaired differentiation. We identified differentially expressed genes between sibCTL and tet2−/−;tet3−/− cones at 120hpf (S5 Table). As expected, sibCTL cones highly express several genes related to cone function including pde6c, grk7a, rp1l1b, rgs9a, grk1b, mef2cb, gnat2, and six757,58,59,60,61,62,63, but these are downregulated in tet2−/−;tet3−/− cones (Fig. 3A, S5 Table). Conversely, tet2−/−;tet3−/− cones show high differential expression of several genes associated with immature cones, including neurod1, prdm1a, and otx2b (Fig. 3B, S5 Table). neurod1 is a transcription factor enriched in photoreceptor precursors that is subsequently downregulated as differentiated photoreceptors mature64. prdm1a and otx2b are transcription factors expressed in immature photoreceptors that play key roles in specifying photoreceptor fate65,66. Interestingly, tet2−/−;tet3−/− cones also show high differential expression of many ribosome-related genes compared to sibCTL cones (Fig. 3C). Accordingly, KEGG, GO, and REAC Terms associated with enriched genes in sibCTL cones correlate with biological processes involved in mature photoreceptor function; these include phototransduction and metabolic processes like glycolysis and oxidative phosphorylation67 (Fig. 3D, S6 Table). Additionally, lactate dehydrogenase genes, ldha and ldhba, were downregulated in tet2−/−;tet3−/− cones, suggesting that anaerobic respiration, an important metabolic process in mature photoreceptor function, may be less active in tet2−/−;tet3−/− cones67 (S5 Table). KEGG, GO, and REAC Terms associated with genes enriched in tet2−/−;tet3−/− cones relate to ribosome function, development, nonsense-mediated decay and other basic developmental processes (Fig. 3E, S6 Table). These data support impaired cone differentiation in tet2−/−;tet3−/− mutants.

Fig. 3
figure 3

Cone differentiation is impaired in tet2−/−;tet3−/− retinae. (AC) Expression heatmaps of (A) mature cone genes, (B) immature cone genes, and (C) ribosome genes. Columns correspond to individual cones at 120hpf, with sibCTL and tet2−/−;tet3−/− separated. (D,E) Results of ShinyGO analysis on differentially expressed genes within cones showing the Fold Enrichment score for representative GO terms in (D) sibCTL-upregulated and (E) tet2−/−;tet3−/−-upregulated categories at 120hpf (p < 0.05, FDR < 0.05). KEGG analysis after107,108,109.

Bipolar cell differentiation and subpopulation diversity is disrupted in tet2 −/−;tet3 −/− retinae

Our previous characterization of tet2−/−;tet3−/− mutants utilized a variety of methods to assess retinal defects, but did not assess BC or HC formation26. Beyond the main ON and OFF classes, BCs can further segregate into multiple subtypes, which have been characterized in several organisms: 15 subtypes have been identified in P17 mice68, 22 subtypes in embryonic chickens69, and in adult zebrafish, 18 subtypes have been identified anatomically, based on their connectivity, and at least 23 molecularly, from scRNA-Seq33,70.

Our scRNA-Seq data suggested that BC identities were disrupted in tet2−/−;tet3−/− mutants, with potential differences in subtype composition in the tet2−/−;tet3−/− retina. To further explore whether BC populations were altered in the tet2−/−;tet3−/− retina, we analyzed all BCs at 120hpf, a timepoint at which most BCs are differentiated31. To identify BC subpopulations, we integrated sibCTL and tet2−/−;tet3−/− BCs and performed cluster-specific differential gene expression analysis. Figure 4A shows the occupancy of BC-ON, BC-OFF, and BC-early subgroups across the combined BC dataset. The combined 120hpf BC dataset contains 18 unique clusters (Fig. 4B). BC cluster-specific DEGs are listed in S7 Table while Fig. S5A demonstrates the specificity of cluster-specific marker genes. Some BC subpopulations showed specific expression of known adult BC subtype markers, while other larval BC subpopulations were not as easily correlated with those found in adult (Fig. S5A)33. One notable cluster identified in our dataset was cluster 15, which likely represents a molecularly and morphologically unique rod BC population recently identified in adult zebrafish33. This population shows high cluster-specific expression of rdh10a and uts1, relatively high expression of trpm1a, and low expression of grm6b and prkcaa (Figs. S5A,B).

Fig. 4
figure 4

BC formation is disrupted in tet2−/−;tet3−/− retinae (AC) UMAP projections of pooled 120hpf sibCTL and tet2−/−;tet3−/− BCs indicating (A) BC type: BC-ON, BC-OFF or BC_early; (B) BC subpopulation and (C) imbalance score. (D) BC subpopulation distributions in sibCTL and tet2−/−;tet3−/− retinae at 120hpf. Cluster 13, which shows a disproportionately high composition of tet2−/−;tet3−/− cells, is annotated. (E) Differential abundance analysis on 120hpf BC subpopulations. (F) UMAP projection of 120hpf BCs colored by subpopulations that show disproportionately more tet2−/−;tet3−/− cells (gold), sibCTL cells (purple), or no difference (gray). (G) Median AUCell scores for neuronal GO Terms across 120hpf BC subpopulations. GO Terms assayed include cell projection (GO:0042995) synapse (GO:0045202) transmembrane transporter (GO:0015318) calcium ion binding (GO:005509) and gated channel activity (GO:0022836). Statistics comparing AUCell scores across 120hpf BC clusters are available in Table S8. NS = not significant. (H) UMAP projection of 120hpf BCs colored by AUCell scoring for the synapse GO term (GO:0045202).

To assess potential shifts in BC identities in the tet2−/−;tet3−/− retina, we calculated imbalance scores for sibCTL and tet2−/−;tet3−/− BCs (Fig. 4C). Imbalance scores revealed that several BC subpopulations were shifted in tet2−/−;tet3−/− retinae. For example, clusters 7,13, and 18 showed regions of disproportionate contribution of tet2−/−;tet3−/− cells. Cluster-specific differential abundance analysis revealed disproportionate contributions of tet2−/−;tet3−/− cells in clusters 7,13,17, and 18 (Fig. 4D,E). Cluster 7 cells identified as BC-early, while clusters 13,17, and 18 identified as BC-ON (Fig. 4E). Interestingly, several BC-ON subpopulations showed preferential composition of sibCTL cells including clusters 4,8,10,11,14, and 15 (Fig. 4E). Additionally, clusters 3,5,9, and 16 BC-OFF subpopulations showed preferential composition of sibCTL cells. The two clusters primarily composed of BC-early cells (120-BC clusters 7 and 13) showed preferential contributions of tet2−/−;tet3−/− cells, suggesting a bias of tet2−/−;tet3−/− BCs to more nascent differentiation states (Fig. 4E). There were no BC-OFF clusters with high differential abundances of tet2−/−;tet3−/− cells. Figure 4F illustrates how subpopulations composed disproportionately of sibCTL or tet2−/−;tet3−/− cells occupy the 120hpf BC dataset.

Due to the high occupancy of BC-early in tet2−/−;tet3−/−-expanded BC populations, we hypothesized that the tet2−/−;tet3−/− mutation may have impaired all BC differentiation. To test this hypothesis, we leveraged AUCell scoring, which uses Area Under the Curve to determine if specific gene sets related to neuronal function are enriched in BC subpopulations71. Across all gene sets tested, all tet2−/−;tet3−/−-expanded clusters showed the lowest AUCell scores, with the exception of cluster 17, indicating reduced neuronal activity in nearly all tet2−/−;tet3−/−-expanded subpopulations (Figs. 4G,H, S5C,D, S8 Table). Additionally, cluster 17 showed high expression of prkcaa, the mRNA encoding PKCɑ, a BC-ON marker that accumulates during differentiation31,55. These data suggest that loss of tet2 and tet3 impairs the differentiation of many BCs and leads to preferential expansion of some differentiated BC subtypes, suggesting that 5hmC may play a role in both differentiation and subtype fate specification.

Horizontal cell differentiation is impaired in tet2 −/−;tet3 −/− retinae

HC subtypes have been characterized in several organisms, reviewed in72. In zebrafish, four subtypes of HCs have been identified morphologically via their unique connectivity patterns to different photoreceptor populations73,74,75. To determine how HC populations are altered in tet2−/−;tet3−/− retinae, we analyzed all HCs at 120hpf, a time point at which HCs are mostly differentiated76 (Fig. 5A–C). To identify HC subpopulations, we integrated sibCTL and tet2−/−;tet3−/− HCs and performed cluster-specific differential gene expression analysis. The combined 120hpf HC dataset contains 7 unique clusters (Fig. 5A). HC cluster-specific DEGs are listed in S9 Table, while Fig. S6A demonstrates the specificity of cluster-specific marker genes. While further characterization of HCs into subtypes was difficult because most subtype-specific molecular features have only been studied in adult HCs77, expression zones of isl1 and lhx1a reveal that specific regions of the 120hpf HC UMAP are occupied by either the axon-less HC subtypes (isl1 + populations, clusters 1,3,6,7) or axon-bearing HC subtypes (lhx1a + populations, clusters 2,4)78,79 (Fig. 5B). Cluster 5 shows high cluster-specific expression of onecut1, onecut2, and onecutl, all known to be involved in HC genesis (S9 Table)80,81,82, suggesting that cluster 5 is composed of the most nascent HCs in the 120hpf retina (S9 Table).

Fig. 5
figure 5

Horizontal Cell formation is disrupted in tet2−/−;tet3−/− retinae. (AC) UMAP projection of pooled 120hpf sibCTL and tet2−/−;tet3−/− horizontal cells indicating (A) horizontal cell subtype, (B) isl1 expression and lhx1a expression and (C) imbalance score. (D) Horizontal cell subpopulation distributions in sibCTL and tet2−/−;tet3−/− retinae at 120hpf. Clusters 2,5, and 6, which all show a disproportionately high composition of tet2−/−;tet3−/− cells, are annotated. (E) Differential abundance analysis on 120hpf horizontal cell subpopulations. (F) UMAP projection of 120hpf horizontal cells colored by subpopulations that show disproportionately more tet2−/−;tet3−/− cells (gold), sibCTL cells (purple), or no difference (gray). (G) Median AUCell scores for neuronal GO Terms across 120hpf horizontal cell subpopulations. GO Terms plotted include neuron projection (GO:0043005), plasma membrane bounded cell projection (GO:0120025), cell projection (GO:0042995), adenyl ribonucleotide binding (GO:0032559), ATP binding (GO:0005524), cell communication (GO:0007154), protein kinase activity (GO:0004672), action potential (GO:0001508), neuron to neuron synapse (GO:0098984), axon (GO:0030424), calcium ion binding (GO:0005509), gated channel activity (GO:0022836) and synapse (GO:0045202). Statistics comparing AUCell scores across 120 hpf horizontal cell clusters are available in Table S10. Key represents minimum and maximum values of median AUCell scores in each cluster for each GO Term tested. (H) UMAP projection of 120hpf horizontal cells colored by AUCell scoring for cell communication (GO:0007154).

We next sought to determine the extent to which HC subpopulations are dynamically shifted between sibCTL and tet2−/−;tet3−/− retinae. Imbalance scores revealed several regions of the 120hpf HC dataset that showed disproportionate contributions of sibCTL and tet2−/−;tet3−/− cells. Disproportionate contributions of sibCTL or tet2−/−;tet3−/− cells localized to regions of the dataset occupied by both axon-bearing and axon-less HC subtypes, suggesting that no individual morphological subpopulation of HCs is disproportionately overrepresented in tet2−/−;tet3−/− retinae (Fig. 5C). Differential abundance analysis revealed disproportionate contributions of sibCTL cells to clusters 1 and 4 and disproportionate contributions of tet2−/−;tet3−/− cells to clusters 2,5, and 6, with no particular association between morphological subtype and disproportionate sibCTL or tet2−/−;tet3−/− cell contributions (Fig. 5D–F). This result supports the conclusion that Tet loss-of-function does not lead to disproportionate expansion of a given morphological subtype of HCs.

We next tested the hypothesis that HC subpopulations with elevated contributions of tet2−/−;tet3−/− cells were associated with impaired differentiation states, while HC subpopulations with elevated contributions of sibCTL cells were associated with mature differentiation states. To test this hypothesis, we used AUCell scoring with GO_Term-informed gene sets on 120hpf HCs to determine the extent that cells in each HC cluster exhibit molecular characteristics of differentiated neurons (S6 Table, S7 Table). Across all gene sets, cluster 5 scored lower than other HC clusters in this analysis, further supporting the conclusion that cluster 5 was the least-differentiated of the 120hpf HC clusters (Figs. 5G,H, S6B, S10 Table). Across all gene sets, clusters enriched for sibCTL cells exhibited higher AUCell scores compared to all other clusters. Likewise, clusters enriched for tet2−/−;tet3−/− cells always exhibited lower AUCell scores compared to sibCTL-enriched clusters (Figs. 5G, S6B, S10 Table). Taken together, these data suggest that the least differentiated HC subpopulations in our dataset are disproportionately composed of tet2−/−;tet3−/− cells (Figs. 5F,H, Fig S6B).

Discussion

With an interest in how Tet activity regulates retinal development, we performed scRNA-Seq on control and tet2−/−;tet3−/− retinal cell populations at multiple developmental time points and identified a number of transcriptional differences in the tet2;tet3-deficient retina. These included defects in (1) the maturity of several differentiated cell types, (2) the relative abundances of differentiated cell types, and (3) the fate biases and diversity of differentiated cell type subpopulations, particularly in BCs. tet2−/−;tet3−/− cones exhibited transcriptional signatures of impaired differentiation, consistent with our previous report26 and a more recent study in mice56. Additionally, tet2−/−;tet3−/− BCs and HCs showed expanded proportions of immature subpopulations and reduced cellular diversity when compared to sibCTL retinae. It is likely that reduced subpopulation complexity in tet2−/−;tet3−/− BCs occurs as a result of the elevated proportions of immature BCs in the tet2−/−;tet3−/− retinae. tet2−/−;tet3−/− retinae possessed several deficiencies in relative abundances of cell types and subpopulation biases. There were proportionally fewer cones and an expansion of the BC-ON lineage in tet2−/−;tet3−/− retinae relative to other cell populations. Expansion of the BC-ON lineage includes many differentiation-impaired BC-ON subpopulations. These data suggest both differentiation impairment and altered BC subtype lineages in tet2−/−;tet3−/− retinae. Taken together, our results reveal that Tet proteins play a role in generating normal proportions of retinal cell populations during retinogenesis, while also ensuring proper differentiation of these populations.

Given Tet proteins’ roles in epigenetic regulation of gene expression, there are several potential molecular mechanisms underlying these retinal phenotypes. As noted above, RPCs undergo sequential rounds of specification to generate the diverse neuronal cell types and subtypes found in the mature retina83,84. The precise orchestration of retinal development from this RPC pool involves continuous modulation of gene regulatory networks and epigenetic changes that are thought to contribute to this process38,85,86. Tet-mediated conversion of 5mC to 5hmC facilitates passive DNA demethylation in dividing cells87,88, due to the inability of the maintenance DNA methyltransferase, Dnmt1, to recognize hemi-hydroxymethylated DNA and thereby reestablish the repressive 5mC pattern post-DNA replication89. Alternatively, active DNA demethylation occurs when 5hmC is converted to 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC), which are subsequently removed through a thymine DNA glycosylase (TDG)-dependent base excision repair (BER) mechanisms2,90.

Whether 5hmC is directly deposited on the elements of differentially expressed genes or on upstream factors that regulate these genes is an open question. Specific sites impacted by tet2- and tet3-mediated DNA hydroxymethylation could involve regulatory regions associated with genes that function in cell type specification, subtype specification, and/or the terminal differentiation sub-networks operating in RPCs. Alternatively, impacted sites could be associated with upstream genes and signaling pathways with differentiation-inhibitory influences, thereby revealing 5hmC as an indirect regulator of specification and differentiation genes. There is extensive evidence supporting both possibilities from studies in a variety of tissues and organ systems. Indeed, our previous work showed that tet2 and tet3 regulate Wnt and Notch pathway activity, which cell non-autonomously modulate the differentiation of retinal neurons during retinal development26. Similarly, in Tet1;2;3 deficient mouse embryonic stem cells, Wnt signaling is upregulated and neural fates are compromised91. Likewise, it is well known that 5hmC can accumulate in genes required for tissue identity and maturation11,92, suggesting that 5hmC deposition could impact specification and differentiation genes directly in RPCs. These are not mutually exclusive possibilities as well, as it is likely that Tets exert stage-dependent and cell-type dependent functions, so could act outside of the retina to modulate cell signaling pathways and within RPCs or specified retinal neurons to regulate gene expression of key differentiation and morphogenesis genes. Our scRNA-Seq data do not differentiate between these possibilities as we focus solely on retinal cells and do not assess 5mC or 5hmC statuses of any genes. Here, targeted methylation and hydroxymethylation profiling will be needed, and in both retinal and non-retinal cell types during key stages of retinal development.

Recent insights have established relationships between gene regions with deposited 5hmC and gene expression levels. Genic regions with the strongest relationships between 5hmC deposition and changes in gene expression during cell and organ differentiation include enhancers and gene bodies4,11. Enhancer hydroxymethylation has been shown to mark tissue-specific genes associated with differentiation93,94. Gene body hydroxymethylation has been shown to mark tissue-specific genes associated both with differentiation, functions of mature cells, and the maintenance of cellular function11,92. Identifying whether Tet loss preferentially impacts enhancer or gene body hydroxymethylation would provide greater context into patterns of gene regions that are dynamically hydroxymethylated during retinogenesis.

Beyond site-specific studies, there are broad associations between cellular maturation and 5hmC accumulation. 5hmC accumulation in neurons has been shown to positively correlate with their differentiation state and to be required for terminal differentiation11,95. Indeed, in the mouse retina, 5hmC accumulates in retinal neurons as they mature, and this accumulation correlates with increased expression from 5hmC-enriched loci encoding various proteins related to neuronal development and maturation96. Mechanistically, 5hmC accumulation in post-mitotic retinal neurons could facilitate gene expression via active demethylation and/or site-specific recruitment of other factors to 5hmC-enriched regions of the genome to activate gene expression9,97,98. Alternatively, accumulation of Tet proteins themselves at specific genomic loci could recruit other factors necessary for gene expression in differentiating neurons. In the mouse retina, Tet3 steadily accumulates in retinal neurons over time and the Tet3 protein interacts with transcriptional regulators and histone modifying enzymes, whose expression levels also correlate with increased expression of retinal genes96.

In our previous study, we demonstrated that tet2−/−;tet3−/− retinae possessed severe defects in retinal neuron differentiation, with the most prominent phenotypes being in RGCs, most of which lacked an axon, and hence, tet2−/−;tet3−/− mutants overall lacked an optic nerve. tet2−/−;tet3−/− mutants also possessed defects in photoreceptors, most of which lacked defined outer segments26. This latter observation is consistent with recent studies in the mouse retina56. scRNA-Seq data presented here provide additional insight into tet2 and tet3 function during retinal development. In this study, we identified the seven principal retinal cell classes in tet2−/−;tet3−/− retinae, but gene expression in cones, BCs and HCs was disrupted, resulting in defects in neuronal subtype formation and differentiation status. We were surprised that we did not identify gene expression changes in rods and RGCs, but suspect that this was due to low sampling. Indeed, rods comprised less than 2% of the 72hpf dataset and ~ 0.5% of the 120hpf dataset, while RGCs comprised less than 1% of the 36 and 48hpf datasets. Thus, while morphological defects are obvious in both cell types in tet2;tet3 mutants, the enrichment of these cell types in our dataset was likely too low to identify the molecular underpinnings of these defects. Future studies will require higher enrichment of these cell types to assess the molecular underpinnings of their differentiation defects at the single-cell level.

DNA methylation and demethylation have been well-studied in photoreceptors, while roles in other retinal cell types like BCs and HCs remain largely unknown. In photoreceptors, DNA demethylation is thought to be important for the expression of photoreceptor genes during the early stages of differentiation99,100, while at later stages, and particularly in rods, most loci remain hypomethylated, perhaps due to inaccessibility of the DNA methyltransferases to the highly condensed chromatin architecture found in the rod nucleus101. Our data demonstrate that cones in tet2−/−;tet3−/− retinae were impaired in differentiation, with genetic markers of maturity expressed at significantly lower levels. It will be of interest in future studies to determine if these loci remain methylated in tet2,tet3-deficient cones and therefore downregulated relative to wild-type cones. Likewise, it will be of interest to examine loci specific to differentiating BCs and HCs to determine their methylation and hydroxymethylation statuses in tet2−/−;tet3−/− mutants relative to wild-type animals. Indeed, integrating our scRNA-Seq data with genome-wide 5mC and 5hmC profiling in sibCTL and tet2−/−;tet3−/− retinae could reveal specific genic regions affected in tet2−/−;tet3−/− mutants and thus, potential regulatory mechanisms leading to the specification and differentiation defects observed in tet2−/−;tet3−/− mutants.