Introduction

Corneal diseases like Fuchs Endothelial Corneal Dystrophy (FECD) represent one of the leading causes of vision loss and blindness worldwide1. FECD is a bilateral, genetically heterogeneous degenerative disease2 of corneal endothelial cells (CEnCs) occurring in approximately 4% of the U.S. population over 40 years of age, with higher incidence in women3,4,5,6,7. Among the prominent characteristics of FECD are the formation of corneal guttae and extracellular matrix (ECM) deposition8,9,10. Guttae and subsequent thickening or remodeling of Descemet’s membrane (DM) cause light scatter and account for glare and visual complaints in FECD patients. In FECD, CEnCs exhibit stress-induced antioxidant imbalance11, oxidation of DNA bases12, heightened reactive oxygen species (ROS) production13, mitochondrial damage14, and sustained ER stress15. However, the mechanism of guttae formation or ECM deposition is not clearly understood. We have previously established that physiological stressor, ultraviolet-A (UVA) light, causes DNA damage and leads to FECD phenotype in both in vitro and in vivo disease models16,17. Recently we demonstrated that the endo-degenerative phenotype in FECD is indeed driven by estrogen metabolite-dependent corneal endothelial (CE) cell loss that is exacerbated in the absence of NAD(P)H quinone oxidoreductase 1 (NQO1); thus, shedding light on the mechanism accounting for the higher incidence of FECD in females18.

Most cases of FECD are caused by expansion of the intronic CTG trinucleotide repeat within the TCF4 (Transcription factor 4) gene, making it the largest contributing locus for FECD19,20,21,22. TCF4 expansions of greater than 40 CTG repeats confer significant risk for developing FECD23. FECD can also be caused by a CTG expansion within the 3′-untranslated region (3′-UTR) of the DMPK (Myotonic dystrophy protein kinase) gene, implicating mutant expanded CUG RNA as the root cause for repeat-associated FECD24,25,26. FECD cases with (FECD-R) or without (FECD-NR) repeat expansions are indistinguishable by clinical observation and require genetic analysis like TCF4 CTG18.1 polymorphism genotyping for classification. To date, several non-mutually exclusive mechanisms have been proposed that drive and/or exacerbate the onset of disease. These mechanisms include dysregulation of TCF4 expression27; RNA-mediated toxicity28,29; repeat-associated non-AUG dependent (RAN) translation30; and somatic instability of CTG repeat length31,32. However, the relative contribution of these proposed mechanisms in disease pathogenesis is currently unknown.

Currently, corneal transplantation is the only treatment for FECD, accounting for the vast majority of the 32,000 endothelial keratoplasties performed in the U.S. annually33. There is a significant unmet need to investigate and understand mechanisms involved in the pathogenesis of FECD to help develop targeted pharmacotherapeutic interventions. RNA sequencing (RNAseq) has become an indispensable tool for transcriptome-wide analysis of differential gene expression to infer biological meaning in further studies and to understand the disease pathogenesis34. This technique is also suitable for analyzing many aspects of RNA biology and has introduced the use of RNA-based biomolecules as useful diagnostic, prognostic, and therapeutic means in various diseases35. This implies that RNAseq analysis of FECD related samples could be valuable in identifying causative genes as well as pathways.

Although there had been four previous studies employing RNAseq technique, all these studies have been conducted using corneal endothelium from patients with FECD and normal controls36,37,38,39. Here, instead of using RNA from the corneal endothelium of FECD patients for RNAseq analysis, we utilized the FECD and normal cell lines that were generated by collecting DM and endothelium from corneas of cadaveric donors and FECD patients undergoing endothelial keratoplasty, followed by simian virus-40 T-antigen cell immortalization of primary CEnCs. To identify differentially expressed genes (DEGs) and pathways that may be potentially related to the pathogenesis of FECD, we performed RNAseq using RNA from 3 normal cell lines, 3 FECD cell lines derived from donors with TCF4 repeat expansions (FECD-R) and 4 FECD cell lines derived from donors without TCF4 repeat expansions (FECD-NR). Transcriptomic data from the three groups, normal, FECD-R and FECD-NR, revealed various differentially expressed genes that are (i) shared between FECD-R and FECD-NR or (ii) uniquely altered in FECD-R but not FECD-NR. Pathways related to hepatic fibrosis and epithelial–mesenchymal transition (EMT) were found to be shared by FECD-R and FECD-NR, thus supporting extracellular matrix (ECM) depositions (guttae) seen in FECD. These results will help understand the underlying mechanism of pathogenesis in FECD as well as other trinucleotide repeat diseases, thus providing potential targets for therapeutic design.

Results

Identification of differentially expressed genes and pathways in FECD compared to normal corneal endothelial cell lines

To elucidate the differences in gene expression profiles of CEnCs between FECD and normal controls, we compared the CE cell lines derived from FECD (n = 7) subjects (irrespective of their TCF4 CTG repeat expansion status) undergoing endothelial keratoplasty with normal cadaveric donors (n = 3) by RNA sequencing (RNAseq) (Fig. 1A). Genetic analysis from blood samples and CEnCs identified 3 FECD samples with a single allelic CTG repeat expansion in the TCF4 gene (FECD-R), 4 FECD samples without the CTG repeat expansion (FECD-NR), and 3 normal control samples without the CTG repeat expansion (Fig. 1B). Since FECD is a genetically heterogenous disease, the FECD-R and FECD-NR cell lines were tested for mutation in specific genetic variants in other FECD-associated genes namely LOXHD1, SLC4A11, AGBL1, KANK4 and ZEB1 (Supplementary Fig. 1A-B). The selection of alleles tested was based on that allele being predominant in the familial cases of FECD and/or having been reported most commonly in FECD cases in literature (Supplementary Fig. 1A). All the 7 FECD cell lines were found to be wild type for all the alleles tested, except for ZEB1 where one FECD-NR cell line (SVF2-61F) was heterozygous for rs35753967 (Supplementary Fig. 1B). Multi-dimensional scaling (MDS) plot demonstrated clustering of FECD and normal CEnCs (Supplementary Fig. 2). To classify changes as significant, we used the cutoff of < 0.05 for False Discovery Rate (FDR). Volcano plot allows a global overview of individual gene expression changes and is useful in visualizing pattern of changes and identifying ‘outlier’ genes by combining highly significant changes in gene expression with higher fold changes. Evaluation of volcano plot and heatmap [log2 RPKM (reads per kilobase of transcript per million reads mapped), FDR = 0.05) for RNAseq analyses identified a total of 214 statistically significant DEGs between FECD and normal cells with 193 downregulated (log2 fold change (log2FC) < = 1, FDR < 0.05) and 21 upregulated (log2FC > 1, FDR < 0.05) genes in FECD cell lines (Fig. 2A-B, Supplementary Tables 1–3). While the MDS plot (Supplementary Fig. 2) illustrates the qualitative relative similarity between FECD and normal cells for the top 300 most varied genes in our cohort, the heatmap (Fig. 2B) is generated using only the 214 differentially expressed genes (DEGs) between the two cohorts. This explains the differences between the clustering of samples in the MDS plot and the heatmap. Therefore, although the one healthy control cell line (SVN2-67F) clustered oddly in the MDS plot, the evaluation of heatmap (Fig. 2B) confirmed that its gene expression pattern is more similar to the other normal than the FECD samples. The top 20 differentially overexpressed genes identified in FECD cells include transcriptional regulator genes like ZNF711 (Fig. 2C). To elucidate the potential impact of changes in the expression of individual genes between FECD and normal cells on physiological processes, we applied Ingenuity Pathway Analysis (IPA) to our RNAseq data set. Intriguingly, the top canonical pathway was hepatic fibrosis pathway (Fig. 2D). Involvement of hepatic fibrosis genes in FECD is consistent with the accumulation of extracellular matrix (ECM) proteins with thickening of DM and guttae formation9. Genes of the fibrosis pathway that showed statistically significant decrease in expression include EGFR, COL12A1, COL9A3, BAMBI, COL4A3, COL8A2, CD14, COL21A1, COL26A1 and FGFR2 (Fig. 2E). The pathway analysis also showed the role of Epithelial–mesenchymal transition (EMT) related genes in FECD (Fig. 2D-E), including FGF5, TCF4, EGFR, WNT7B, FGF10 and FGFR2, some of which have already been explored and studied in the disease pathogenesis15,40,41. Subsequently, we performed an IPA upstream regulator analysis and identified β-estradiol as the top upstream regulator in our RNAseq data set, which included 50 target molecules out of the 214 DEGs (Fig. 2F). Additional upstream regulators included SMARCA4, estrogen receptor 2, and ARNT2 as predicted inhibitors and mir155-5p as a predicted activator (Fig. 2F). PANTHER analysis of 214 DEGs identified in FECD cells demonstrated a variety of molecular functions with the largest representation involving a binding function, and statistically significant enrichment of extracellular matrix (ECM) structural constituents conferring tensile strength, heparin binding, glycosaminoglycan binding, cation channel activity and calcium ion binding (Supplementary Fig. 3A-B).

Fig. 1
Fig. 1
Full size image

Experimental scheme and sample cohorts. (A) Descemet’s membrane and endothelium were collected from donor corneas (n = 3) and patients with FECD undergoing endothelial keratoplasty (n = 7). Primary CEnCs were immortalized using the simian virus-40 T-antigen cell immortalization kit and the number of repeats were estimated in CTG18.1 in the TCF4 gene of FECD cells. Cell lines from normal (SVN1-67F, SVN2-67F and SVN3-65M), FECD with repeat expansions (FECD-R; SVF1-73F, SVF5-54F, and SVF6-61M) and FECD without repeat expansions (FECD-NR; SVF2-61F, SVF3-76M, SVF4-61F, and SVF7-74F) were used for RNA sequencing followed by data analysis. (B) Table representing age, sex, passage number, and TCF4 trinucleotide CTG repeats status of normal and FECD cell lines used in this study.

Fig. 2
Fig. 2
Full size image

RNAseq analysis of differential gene expression and pathways in FECD versus normal control cell lines. (A) Volcano plot for FECD (FECD-R and FECD-NR) versus normal control (FDR < 0.05) showing 193 downregulated (log2FC < = 1) and 21 upregulated (log2FC > 1) DEGs. (B) Heatmap (log2 RPKM, FDR = 0.05) comparing the differential expression level of total 214 genes in FECD and control cohorts, verifies that the gene expression pattern of FECD cases is distinct from that of normal controls. Samples are indicated at the bottom. Red stripes represent high expression levels, while blue stripes represent low expression levels. (C) Top 20 downregulated and upregulated genes in FECD versus control cohorts. (D) Top 20 ingenuity canonical pathways with -log(p-value) for each pathway shown in the second column. The two pathways involved in FECD pathogenesis are boxed in red. (E) Differentially expressed genes in the Hepatic Fibrosis and Epithelial Mesenchymal Transition (EMT) pathways in FECD vs. normal cohorts. (F) Top 18 upstream regulators that were identified as significantly inhibited (z-score <-0.2) or activated (z-score > = 0.2) in FECD as compared to normal. The activation z-scores and p-values for each upstream regulator are shown in the last two columns.

Differentially expressed transcriptional regulator genes in FECD cells

The primary regulators of gene expression are transcription factors that play a crucial role in maintaining cellular sustainability and finely regulating complex internal metabolic networks42. Two-thirds of the FECD cases have been found to be associated with the CTG repeat expansions within transcription factor 4 (TCF4)20,43, a basic helix-loop-helix transcription factor and an important regulator of neurodevelopment44,45 and epithelial-mesenchymal transition46. To advance our understanding of the role of additional transcription factors in FECD, we inspected differentially expressed transcriptional regulators in our dataset of 214 DEGs in FECD. Ten out of the 214 DEGs were identified as transcription regulators (Fig. 3A-B). While ZNF711 was the only transcription regulator that was identified as upregulated in FECD, the remaining 9 genes including TCF4, MAFB, TFAP2B, POU6F2, SOX21, ARX, PPARGC1A, ZNF423, and LMX1B were all found to be downregulated in FECD as compared to normal cells (Fig. 3A-B). Interestingly, small Maf (MAF BZIP transcription factor) genes like MAFB have been previously shown to heterodimerize with activated Nrf247,48,49 and bind to antioxidant response elements (ARE) in the DNA to facilitate the transcription of a myriad of genes encoding antioxidants and protect against oxidative stress, an underlying cause of FECD50,51. Additionally, transcription factors TFAP2B and POU6F2 are shown to play a role in corneal development and maintenance52,53,54. We therefore validated the downregulation of MAFB, TFAP2B and POU6F2 proteins in cell lysates from 2 normal cell lines, SVN1-67F and SVN3-65M; and 4 FECD cell lines, SVF1-73F, SVF5-54F, SVF2-61F and SVF3-76M by western blotting (Fig. 3C). The densitometry analysis of western blots demonstrated MAFB/β-actin and POU6F2/β-actin protein levels were markedly downregulated by 4-fold (p < 0.01) and 2-fold (p < 0.05) respectively, in FECD compared to normal cell lines (Fig. 3C). Lifelong exposure of the cornea to oxidative stress can play a role in the pathogenesis of FECD11. We have previously demonstrated that UVA light, an environmental stressor of the human cornea, causes macromolecular damage via the production of ROS, leading to FECD phenotype in a mouse model16,17,50. A recent study revealed a distinct regional variation in FECD progression in humans, with relative sparing of the superior cornea suggesting that UVA light could be an etiologic factor in FECD pathogenesis55. Therefore, we evaluated the effect of UVA on the expression of MAFB, TFAP2B and POU6F2 proteins. Interestingly, UVA-irradiation decreased the levels of MAFB and TFAP2B but did not affect POU6F2 in SVN-67F cells (Fig. 3C).

Fig. 3
Fig. 3
Full size image

Differential expression of transcriptional regulators in FECD compared to normal cell lines. (A) List of 9 downregulated and 1 upregulated transcription regulator genes in FECD as compared to normal controls. The log2 fold change values and gene functions for each of 10 differentially regulated genes are shown in the last two columns. The three interesting candidate genes shown to be involved in FECD related pathways or corneal endothelial cell development are boxed in red. (B) Expression profiles of 9 downregulated and 1 upregulated transcription regulator genes for normal and FECD cell lines (FDR < 0.05). (C) Western blot assay showing downregulation of MAFB, TFAP2B and POU6F2 in FECD as compared to control cell lines. UVA-irradiated normal cells showed downregulation of MAFB and TFAP2B, but not POU6F2 at 24-hours post UVA (5 J/cm2). Densitometry was analyzed using Image J software and relative protein expression levels were determined by normalizing band intensities to that of β-actin. * Indicates p < 0.05 applying two-tailed unpaired t-test with Welch’s correction.

Differentially expressed genes and altered pathways in FECD-R and FECD-NR cell lines

We next investigated whether we could further identify common DEGs between FECD CE cell lines with and without CTG repeat expansions in TCF4, compared to normal cell lines. We identified 77 DEGs (60 downregulated and 17 upregulated) between FECD with the CTG repeat expansion in TCF4 (FECD-R) and normal (Fig. 4A; Supplementary Fig. 4); and 484 DEGs (338 downregulated and 146 upregulated) between FECD without the CTG repeat expansion in TCF4 (FECD-NR) and normal cell lines (Fig. 4B; Supplementary Fig. 4). Overlapping the DEGs between Normal vs. FECD-R and Normal vs. FECD-NR, revealed forty-eight DEGs (45 downregulated and 3 upregulated) to be common among both the data sets (Fig. 4C-D). The 48 common gene list retained the transcription factors MAFB, and TAFB2P as downregulated genes and identified FAM196B, STEAP1 and ALDH1A3 as the three upregulated genes (Fig. 4D). Among the top canonical pathways revealed by IPA of the 48 common genes included some known FECD-associated signaling pathways like hepatic fibrosis, Wnt signaling, EMT pathway, mitochondrial dysfunction, NF-kB signaling and estrogen receptor signaling (Fig. 4E). Additionally, there were some interesting new pathways like serotonin and melatonin degradation, PTEN signaling, Aryl hydrocarbon signaling, and signaling by Rho GTPases, that need further investigation with respect to FECD pathogenesis. Furthermore, we performed an IPA upstream regulator analysis and identified three transcriptional regulators CEBPA, CEBPB and SMARCA4, along with β-estradiol, TGF-β1, ESR1 and AHR (aryl hydrocarbon receptor) (Fig. 4F). PANTHER analysis of 48 common DEGs demonstrated a diversity of molecular functions with the largest representation involving a binding function, and statistically significant catalytic and transporter activity (Supplementary Fig. 5).

Fig. 4
Fig. 4
Full size image

Analysis of differential gene expression and pathways in FECD-R, FECD-NR, and normal control cell lines. (A) Heatmap (log2 RPKM, FDR = 0.05) comparing the differential expression level of total 77 genes (60 down- and 17 up-regulated) in FECD-R and normal control cohorts. (B) Heatmap (log2 RPKM, FDR = 0.05) comparing the differential expression level of total 484 genes (338 down- and 146 up-regulated) in FECD-NR and normal control cohorts. Samples are indicated at the bottom. Red stripes represent high expression levels, while blue stripes represent low expression levels. (C) Venn diagram showing the overlap for gene expression of between Normal vs. FECD-R and Normal vs. FECD-NR. Numbers indicate the number of differentially expressed genes identified in each individual region. (D) Significant expression level changes in 48 common DEGs genes in FECD-R and FECD-NR cohorts. (E) Top 20 ingenuity canonical pathways with -log(p-value) for each pathway shown in the second column. (F) Top 10 upstream regulators that were identified as significantly inhibited (z-score <-0.2) or activated (z-score > = 0.2) as common between Normal vs. FECD-R and Normal vs. FECD-NR. The molecule type and p-values for each upstream regulator are shown in the last two columns.

Furthermore, we also investigated differences in gene expression profiles that were unique to FECD-R cell lines. We identified 29 (14 downregulated and 15 upregulated) DEGs in Normal vs. FECD-R but absent in Normal vs. FECD-NR (Fig. 4C; Table 1). The downregulated genes in FECD-R include MAPK4, NPY, ZBED2, TAC1, PRSS21, L1CAM, CDH11, TMTC1, APOD, DMKN and BOC, while some of the upregulated genes in FECD-R are NMT2, DRAM1, SLIT2, FGF5, ARL4C, DLC1, IL31RA, TNFGSF1B and IL1RL1. It will be interesting to further investigate if any of these genes or their altered pathways play a role in TCF4 repeat expansion, seen in FECD.

Table 1 List of 29 down- and up-regulated DEGs in FECD-R, but not in FECD-NR and normal cell lines, with FDR < 0.05.

Discussion

Identifying the drivers of late-onset diseases like FECD is important for understanding disease progression and developing therapeutics. Transcriptome analysis of corneal endothelium has been previously reported in vitro55 and ex vivo38,56,57. However, many of the studies were performed with low sample size, pooled samples from multiple corneas, limited statistical analysis, or lack of TCF4 repeat expansion status. Cell lines derived from FECD patient tissues serve as an in vitro model to not only study disease causing pathways but also screen therapeutic compounds. Therefore, it is essential to characterize them to identify known and unknown pathways implicated in FECD that are still preserved in the immortalized cell lines obtained from patient tissues. RNA-seq is a great tool for transcriptome profiling and allows examining the expression of all genes at once58. The present study was designed to utilize RNA-sequencing technique and evaluate the gene expression changes in the corneal endothelial cell-lines generated from FECD tissues with or without CTG repeat expansions in TCF4 and thereby highlight novel disease-causing molecular pathways/mechanisms of FECD. In this study, we identified a total of 214 genes that met our criteria as DEGs between FECD and normal cells with 193 downregulated (log2FC < = 1, FDR < 0.05) and 21 upregulated (log2FC > 1, FDR < 0.05) in FECD cell lines. 48 common DEGs were detected in both FECD-R and FECD-NR, compared to normal cells. Moreover, we also identified 29 DEGs (14 downregulated and 15 upregulated) that were specifically altered in FECD-R, but not in FECD-NR compared to normal cell lines. The molecular signatures identified for FECD-R would be useful for developing biomarkers to characterize FECD with and without TCF4 repeat expansions.

CEnCs secrete and deposit extracellular matrix (ECM) proteins that form a specialized basement membrane called Descemet’s membrane (DM). A normal DM contains collagen type VIII, collagen type IV (chains α1–α2), and fibronectin on its stromal side and entactin, laminin, perlecan, and collagen type IV (chains α3–α6) on its endothelial side59,60. The main feature of late-onset FECD is its irregular and thickened DM61,62 due to abnormal accumulation of ECM components including collagens, transforming growth factor-β induced (TGFβ-I) and clusterin9,63,64; resulting in guttae formation. ECM accumulation and tissue remodeling during fibrosis is often a result of continued endothelial-mesenchymal transition (EMT), a repair-associated process whereby endothelial cells lose their cell-cell adhesion, develop migratory and invasive properties leading to tissue fibrosis40. Among the top canonical pathways identified by IPA analysis of our RNAseq data confirmed the involvement of hepatic fibrosis and EMT pathways, consistent with the abnormal ECM regulation and EMT in FECD pathogenesis, and with previously reported RNA measurements using tissues from late-stage FECD9. Hepatic fibrosis genes showing significant downregulation in FECD cells included EGFR, CD14, FGFR2 and collagens like COL8A2, COL4A3, and COL12A1. Mutations in COL8A2 gene have consistently shown association with early-onset FECD, associated with thickening of Descemet membrane and subsequent increase in central corneal thickness (CCT)65,66. Corneal endothelial expression of COL4A3 due to loss of ZEB1 has been shown to result in an abnormal endothelial proliferation, corneal thickening, and iridocorneal adhesions observed in individuals affected with posterior polymorphous corneal dystrophy type 3 (PPCD3) and in Zeb1-heterozygous and -null mice67. Collagens XII have been shown to be critical for corneal function and their absence results in the delayed maturation of corneal endothelium and abnormal corneal compaction68. Among the EMT-related genes in our dataset, while FGF5 showed a significant increase, TCF4, EGFR, WNT7B, FGF10 and FGFR2 genes markedly decreased in FECD, as compared to normal cells. Gao et al., performed a genome-wide association study in 2016, and identified WNT7B as a novel locus for CCT in Latinos69. Interestingly, FGF10 (Fibroblast Growth Factor 10) has been recently shown to play a significant role in the repair of the corneal endothelium, primarily by promoting wound healing, inhibiting inflammatory responses, and enhancing the function of endothelial cells70.

Two transcription factors have been implicated in the pathogenesis of FECD: TCF4 (Transcription factor-4) and TCF8 (Transcription factor-8). An intronic CTG repeat in TCF4 gene is the most common mutation in FECD is that leads to its altered expression20,21,23,71. In the United States, about 70% of the FECD cases are caused by an intronic trinucleotide repeat expansion in TCF420,21,22,23. Additionally, mutations in TCF8 have been linked to late-onset FECD72. TCF8 encodes the ZEB1 protein and is upregulated by the E2-2 protein encoded by TCF472,73. Although TCF4 and TCF8 have similar biological functions, their specific role in the pathogenesis of FECD is unclear. Interestingly, we identified a heterozygous ZEB1 variant, c.1721 A > G; p.K553R (rs35753967), in SVF2-61 F, a FECD-NR cell line. This ZEB1 variant has been previously reported in 3 non-familial FECD cases74 (Supplementary Fig. 1A-B), however, functional studies are needed to determine its role in FECD pathogenesis. Since in this study our criterion of grouping the cell lines was the presence or absence of TCF4 repeat expansion, we do not think that our current transcriptomic analyses would be impacted even if ZEB1 mutation identified in SVF2-61 F cells is found to be pathogenic. To further explore the altered expression of additional transcription factors in FECD, we utilized our RNA-seq dataset and identified MAFB (MAF BZIP transcription factor B) to have a lower expression in FECD, compared to normal cells. Interestingly, small Maf proteins including Mafb have been known to heterodimerize with another transcription factor Nrf247,48,49,75. Suppression or loss of Nrf2 in FECD causes mitochondrial dysfunction, DNA damage, excessive lipid peroxidation and ultimately corneal endothelial cell death50,51. Additionally, Nrf2 is deficient in FECD and leads to loss of its transcriptional target NQO1, rendering ocular tissue to be highly susceptible to oxidative stress18,76,77,78. Furthermore, our dataset identified decreased expression of TFAP2B (transcription factor AP-2 beta) and POU6F2 (POU class 6 homeobox 2) genes in FECD, which was further validated by western blots. Both TFAP2B and POU6F2 are shown to be CEnC-specific transcription factors52,53. While TFAP2B transcriptionally regulates corneal endothelial-related genes52, POU6F2 expression is found to be upregulated during human CEnC differentiation, suggesting it to be pivotal in terminal differentiation of corneal endothelium from neural crest53. King et al. used the mouse genetic reference panel and identified Pou6f2 as one of the transcription factors that modulates corneal thickness during development and provides a potential link with the risk for glaucoma54. Further research on these corneal endothelium transcription factors may increase our knowledge on the corneal endothelial differentiation process and homeostatic conservation transcription and may also be important in regenerative medicine using cell-based therapy with CEnCs and pluripotent stem cells.

FECD has been shown to have greater incidence in females16. Our lab has recently demonstrated the role of estrogen genotoxicity in preferential development of FECD in females18. Specifically, we showed that injecting wild-type male mice with estrogen (E2, β-estradiol) increases CEnC loss, followed by increased production of estrogen metabolites and mitochondrial DNA damage18. Overwhelmingly, the IPA upstream regulator analysis of 214 DEGs identified β-estradiol, the primary female sex hormone, as the top upstream regulator with predicted inhibition (activation Z-score < -2.0), thus confirming our previous findings. ESR2 (Estrogen receptor-2), SMARCA4 (SWI/SNF Related, Matrix Associated, Actin Dependent Regulator of Chromatin, subfamily A, Member 4) and ARNT2 (aryl hydrocarbon receptor nuclear translocator 2) genes are also identified as the upstream regulators with predicted inhibition. Rosenbloom et al. utilized chromatin immunoprecipitation sequence (ChIP-seq) and determined that TCF4 rs613872 single nucleotide polymorphism (SNP) is present within the binding site for the SMARCA4 transcription factor79. It has been hypothesized that variation at rs613872 SNP affects the spatiotemporal expression of TCF4 through SMARCA4 and hence its targets. ARNT2 is a transcription factor that is known to dimerize with Aryl hydrocarbon receptor (AhR) and then translocate to nucleus where it binds with specific xenobiotic responsive elements to upregulate CYP-family of enzymes80. Interestingly, CYP1B1 is the key enzyme in the metabolism of estrogen, that is shown to be upregulated in FECD tissues16. Additionally, the upstream regulators with predicted activation included miR-155-5p. miR-155 (microRNA-155) are demonstrated to play a significant role in regulating the function of CEnCs, primarily by impacting their cell cycle and barrier function through the modulation of the E2F2 protein, potentially contributing to conditions like high myopia where endothelial damage is observed81.

We identified 6-fold more DEGs in normal vs. FECD-NR, compared to normal vs. FECD-R, suggesting that the FECD in patients with non-expanded TCF4 repeats (FECD-NR) may involve a multitude of genes and pathways that converge into disease-causing mechanism leading to FECD. We further investigated the 48 DEGs found in the cell lines with the previously published RNA-seq data set taken from FECD post-keratoplasty tissues38, comparing control vs. FECD tissues taken from pre-symptomatic (Pre-S), late stage FECD with TCF4 repeat expansions (FECD-REP), or FECD without TCF4 repeat expansions (FECD-NR). Notably, we found 3 out of 48 genes, namely ALDH1A3, MAOB and MAFB from our dataset to be also dysregulated in at least 2 of the 3 tissue groups from that study38. While ALDH1A3 was upregulated, the other 2 genes, MAOB and MAFB, were found to be downregulated in both immortalized FECD cell lines and ex vivo FECD tissues, indicating the consistent findings between both data sets.

The canonical pathway analysis for the 48 DEGs detected in both FECD-R and FECD-NR indicated that these common genes were significantly enriched in some of the pathways like fibrosis, Wnt signaling, EMT signaling, mitochondrial dysfunction82, and estrogen receptor signaling18 that are known to be involved in FECD pathogenesis. Additionally, some novel and interesting pathways like tryptophan degradation, serotonin/melatonin degradation, and AhR signaling, were also discovered that require further investigations with respect to FECD pathogenesis. Previous studies have shown that mitochondrial biogenesis, and autophagy are tightly regulated by melatonin, an evolutionarily conserved metabolite of tryptophan and serotonin83. Although the expression of melatonin receptors (MT1 and MT2) has been detected in corneal endothelium84,85, melatonin has been shown to stimulate antioxidative responses and DNA repair via receptor independent pathways86. Melatonin, along with its metabolites, has been shown to protect against UV-radiation87, and act as ligands for AhR88. Interestingly, in our dataset of the upstream regulators for these 48 common DEGs, we found AhR and TGF-β (transforming growth factor-β), the most potent fibrogenic cytokine. Recently, AhR is shown to be activated by environmental89 factors like smoking, leading to oxidative stress and corneal endothelial dysfunction in mice90,91. Additionally, TGF-β family genes are known to be involved in regulating cell differentiation, proliferation and extracellular matrix production in ocular tissue92. In fact, TGF-β2 null mice exhibit an absence of endothelium and fusion of the lens with the cornea93. Overall, our data supports the notion that these pathways including melatonin, AhR, and TGF-β signaling may be important for FECD pathogenesis, irrespective of their TCF4 repeat expansion status. Future studies are required to investigate (a) the non-receptor mediated, mitochondrial function of melatonin in the eye, especially in response to AhR activation, and (b) the role of TGF-β related genes in corneal endothelial dysfunction, seen in FECD. Use of immortalized cell lines offers advantages, including increased proliferative capacity and consistency and reproducibility of the results due to their homogenous population. However, cell lines pose limitations of reduced physiological relevance compared to primary cells due to the genetic manipulation and induction of replication that is not physiologic in post-mitotic cells such as corneal endothelium. Therefore, it will be important to further validate findings with patient specimen data and animal models. Although FECD more commonly affects females than males at a ratio 4 − 3:1, different genetic variants have variable sex-specific association. For example, COL8A2 mutations affect both men and women equally66, while LAMC1 carries greater risk in women94. A study by Zarouchlioti et al. showed TCF4 mutation carries a greater risk in men in white European individuals95; however, there is a recent study demonstrating no sex bias for the prevalence of TCF4 repeat expansion in a multiethnic population96. The goal of this study was to investigate the common causative pathways of FECD based on the most common genetic association (presence of TCF4 repeat expansions), irrespective of sex-differences. The current study is not powered for sex-related differences and future studies are needed to investigate the differential expression of genes in male vs. female FECD samples.

In conclusion, we have identified several differentially expressed genes, related canonical pathways enriched in these genes, and their upstream regulators, in FECD cell lines with or without TCF4 repeat expansions. These results shed light on the genes/pathways/mechanisms that are shared between FECD caused or not caused due to TCF4 repeat expansions. Our data also provided transcriptomic evidence that FECD caused by expanded TCF4 repeats possesses a set of distinct functional properties. However, further studies for the functional characterization of the genes identified in this study are necessary to discern the molecular mechanisms underlying FECD pathogenesis. The generation of stable corneal endothelial cell lines overexpressing or lacking genes of interest could facilitate the development of therapeutic strategies for FECD, thus reducing the need for corneal transplantation. Identification of signature pathways that trigger disease onset in individuals harboring the expanded mutation in FECD may also provide pointers for studying the pathogenesis of other oxidative stress and repeat expansion disorders.

Methods

Generation of Simian virus-40 T-antigen immortalized normal and FECD cell lines

All experiments were performed in accordance with relevant guidelines and regulations. The protocol for generating CE cell lines from FECD patient tissues was approved by the Massachusetts Eye and Ear Institutional Review Board. Written and informed consent was obtained from patients and donors97. The generation of the normal and FECD cell lines by simian virus (SV)-40 (SV-40) immortalization in our lab has been described previously97. Cell lines were prepared from corneal endothelial cells extracted from single corneas of cadaveric donors harvested at Eversight or SightLife eye banks or of FECD patients undergoing routine Descemet’s membrane endothelial keratoplasty (DMEK) at the Massachusetts Eye and Ear Infirmary (Boston, Massachusetts) and Price Vision Group (Indianapolis, Indiana). All FECD patients were at the end-stage of FECD requiring corneal transplantation, with Krachmer grade 5 (≥ 5 mm central confluent guttae without stromal edema) or 6 (≥ 5 mm central confluent guttae with stromal edema) assessed by slit lamp microscopy4,98. Specimens were incubated in complete Chen’s medium (OptiMEM-I; Invitrogen) containing 8% fetal bovine serum (HyClone), 5 ng/ml epidermal growth factor (Millipore), 100 mg/ml bovine pituitary extract (Invitrogen), 200 mg/l calcium chloride (Sigma-Aldrich), 0.08% chondroitin sulfate (Sigma-Aldrich), 50 mg/ml gentamicin (Invitrogen), and 1:100 diluted antibiotic and antimycotic solution (Sigma-Aldrich) for 24 to 48 h. Chen’s medium was removed from specimens and incubated at 37° C, 5% CO2, with 5 ml prewarmed 0.02% ethylenediaminetetraacetic acid (EDTA) solution (Sigma-Aldrich) for up to 45 min to disrupt cell-to-cell junctions. CEnCs were centrifuged for 3 min at 3000 rpm at room temperature to pellet DM and cells, which were resuspended in prewarmed Chen’s media with gentle mechanical trituration and cells and DM were plated in a 24-well tissue culture plate precoated with undiluted FNC Coating (AthenaES). CEnCs were incubated for approximately 48 h at 37° C, 5% CO2, before changing the media. Subculturing of CECs was performed using 0.05% trypsin (Invitrogen) for 5 min at 37° C, 5% CO2. Primary CECs were immortalized SV-40 T-antigen cell immortalization kit (Alstem Cell Advancements). At passage 1, CECs were divided into 2 wells and infected with 2 µl/ml lentivirus (SV-40 T antigen) with 1 µl/ml TransPlus reagent (Alstem Cell Advancements). After 24 h, viral supernatant was removed, and CECs were grown in Chen’s media for 72 h. Transduced CECs underwent puromycin selection (2 µg/ml) until passage 5, after which they were grown in Chen’s media without puromycin. Using this method, we generated 3 normal cell lines (SVN1-67F, SVN2-67F and SVN3-65M) and 7 FECD cells lines (SVF1-73F, SVF5-54F, SVF6-61M, SVF2-61F, SVF3-76M, SVF4-61F, and SVF7-74F) that were further analyzed for TCF4 CTG repeat expansion status.

TCF4 CTG18.1 polymorphism genotyping

The number of repeats in CTG18.1 in the TCF4 gene in the blood samples of patients and donors from which DM and endothelium were collected to generate cell lines, were estimated by short tandem repeat (STR) and triplet-primed PCR (TP-PCR) based fragment analysis as previously established99. For CTG repeat size estimation, genomic DNA was isolated from the peripheral blood mononuclear cells (PBMCs) obtained from same FECD cases. The fragment analysis was performed on data generated by an ABI3730 genetic analyzer in the Genomics Core Facility of Mass Eye and Ear. The literature on FECD suggests that a repeat length of 40 or more CTG repeats is accepted as an expanded allele associated with FECD20. Accordingly, the FECD patients and their respective immortalized cell lines were stratified as either those with mono- or bi-allelic CTG repeats greater than 40 (expanded or FECD-R) or those with bi-allelic CTG repeats less than 40 (non-expanded or FECD-NR) in TCF4. Genetic analysis in blood samples (normal and FECD) as well as newly generated cell lines revealed an expansion of the CTG repeat sequence in the TCF4 gene in 3 out 7 samples (SVF1-73F with 74 repeats, SVF5-54F with 73 repeats, and SVF6-61M with 80 repeats). FECD cell lines retained the TCF4 repeat expansion until at least passage 7.

Genotyping for mutations in other FECD-associated genes

The FECD cell lines (FECD-R and FECD-NR) were further tested for specific FECD-associated genetic variants in LOXHD1, SLC4A11, AGBL1, and KANK4 genes. PCR was performed using primers containing universal M13 tags to target the genetic variants listed in Supplementary Fig. 1. The primer sequences used are provided in Supplementary Fig. 6. PCR products were purified using ExoSAP-IT™ Express PCR Product Cleanup Reagent (Thermo Fisher, Grand Island, NY). Sanger sequencing reactions were then prepared using the purified PCR products and the BigDye® Terminator v3.1 Cycle Sequencing Kit (Thermo Fisher, Grand Island, NY). Bi-directional sequencing was carried out on the Applied Biosystems 3730xl DNA Analyzer (Thermo Fisher, Grand Island, NY).

RNA isolation and sequencing

Total RNA was extracted from 10 immortalized human corneal endothelial cell lines (3 normal and 7 FECD cell lines; Fig. 1B), with early passage numbers (p5-p7) (Fig. 1B), using RNeasy Micro kit (Qiagen) and sent to the Genomics Core Facility at the Massachusetts General Hospital (Boston, MA) for bulk RNA sequencing. Illumina sequencing libraries were constructed from total RNA using polyA selection followed by NEBNext Ultra II Directional RNA Library Prep protocol (New England Biolabs). Sequencing was performed on Illumina HiSeq 2500 instrument (Illumina) resulting in approximately 30 million of 50 bp reads per sample.

Analysis of differentially expressed gene

Sequencing reads (50 bp single-end reads) were mapped in a splice-aware fashion to the human reference transcriptome (hg19/GRCh37.75 assembly) using STAR100. Read counts over transcripts were calculated using HTSeq101 based on the Ensembl annotation for hg19/GRCh37.75 assembly. For differential expression analysis, we used the EdgeR method102 and classified genes as differentially expressed based on the cutoffs of 2-fold change in expression value and false discovery rates (FDR) below 0.05.

Ingenuity pathway analysis (IPA) and Panther analysis

The datasets of DEGs identified by RNA-seq after comparing (1) the 3 normal cell lines vs. 7 FECD cell lines; (2) the common genes between the 3 normal cell lines vs. 3 FECD cell lines with expanded CTG repeats and 3 normal cell lines vs. 4 FECD cell lines with normal CTG repeats; were uploaded to the server and analyzed through the use of IPA (Ingenuity Pathway Analysis, QIAGEN Inc., https://www.qiagenbioinformatics.com/products/ingenuitypathway-analysis)103. The canonical pathway and upstream regulator revealed by IPA were further analyzed as gene networks. Additionally, the datasets were uploaded to an online website for Protein Annotation Through Evolutionary Relationship (Panther, version 16.0, http://www.pantherdb.org/) analysis following instructions by Mi et al.104. The gene was grouped to different categories by molecular function and further analyzed by overrepresentation (enrichment) test by using PANTHER Go-Slim molecular function annotation dataset.

Validation of differential gene expression at protein level using Western blot

Whole cell lysates were prepared by lysis in RIPA buffer containing HALT protease and phosphatase inhibitors (#78440, Life Technologies) for 30 min on ice. Protein concentration was determined using the BCA assay kit (#23225, Life Technologies). Whole cell lysates were resolved by loading onto a 4–12% Bis-Tris gel for SDS-PAGE and blotted onto a polyvinylidene difluoride membrane (#IPVH00010, Millipore) which was blocked in 5% nonfat milk (#1706404, Bio-Rad) or 5% bovine serum albumin (#BP1600, Fisher Scientific) in tris-buffered saline with 0.1% Tween-20 (TBST) for 1 h and incubated overnight with primary antibody at 4 °C. The primary antibodies used were MAFB (1:1000, #ab243902, Abcam, USA), TFAP2B (1: 1000, #sc-376709, Santa Cruz Biotechnology, USA), POU6F2 (1: 1000, #2509, Cell Signaling, USA), β-actin (1: 5000, #A1978, Sigma, USA), and secondary antibodies were horseradish peroxidase-conjugated mouse anti-rabbit IgG (1:1000, #sc-2357), anti-mouse IgG (1:1000, #sc-516102), and mouse anti-goat (1:1000, sc-2354), from Santa Cruz Biotechnology Inc., USA. The membrane was developed with SuperSignal West Pico or Femto (#34577 or # 34096, Life Technologies, USA) plus chemiluminescent substrate. Densitometry was analyzed with ImageJ software (developed by Wayne Rasband, National Institutes of Health, Bethesda, MD), and protein content was normalized relative to β-actin protein content. Experiments were repeated a minimum of three times. Results were averaged and SEM values were calculated.

In vitro treatment with UVA light

The procedure for in vitro treatment with ultraviolet-A light has been previously described17. In short, 0.4 × 10^5 cells were seeded in modified Chen’s medium (phenol red-free OptiMEM, Life Technologies) and 8% charcoal-stripped fetal bovine serum (FBS, Gemini Bio-products) on FNC-coated 6-well plates overnight. For UVA irradiation, fresh growth medium was added, and cells were exposed to two 19.5-inch UVA tubes (Analytik Jena US LLC) emitting 365 nm light to deliver a fluence of 5 J/cm2 (6 min), followed by a 24 h recovery period in the same medium.

Statistical analysis

Statistical analysis was performed using a student t-test or either a one-way or two-way analysis of variance followed by a Welch’s correction. A p-value < 0.05 was considered statistically significant.