Integrative analysis of 115 transcriptomic studies decodes the molecular landscape of neurodevelopmental disorders

Koetsier, Jarno; Eijssen, Lars M. T.; Schurgers, Leon J.; Curfs, Leopold M. G.; Reutelingsperger, Chris P.; Bahram Sangani, Nasim

doi:10.1038/s42003-025-08330-2

Published: 12 June 2025

Communications Biology volume 8, Article number: 914 (2025)

Abstract

Due to the low disease prevalence, transcriptomic studies of neurodevelopmental disorders (NDDs) often face limited statistical power, constraining the depth of insights they can provide. To tackle this limitation, we integrated 151 human RNA sequencing datasets from 115 independent studies, and characterized the common and distinct molecular pathways of NDDs and their neurological phenotypes. In addition to revealing an aberrant expression profile of imprinted genes, our analysis identified transcriptomic changes in inflammatory, translational, mitochondrial, and synaptic processes across the different NDDs. We further highlight disorder-associated alterations, including upregulation of ITGB4 across Rett syndrome datasets. Moreover, gene expression changes in LHX1/5-mediated cerebellar Purkinje cell layer formation were found to be specific to seizure-associated NDDs. We combined the datasets into a publicly accessible NDD transcriptomic atlas: https://SyNUM.shinyapps.io/NDD-transcriptomic-atlas/. Together, our findings provide fundamental insights into the molecular pathophysiology of NDDs and highlight genes and pathways with aberrant transcriptomic profiles. This knowledge can guide future therapeutic development and precision medicine approaches.

Introduction

Neurodevelopmental disorders (NDDs) are characterized by disrupted brain development, leading to a wide range of psychiatric and neurological conditions that affect more than 4.7% of children worldwide¹,². These conditions typically emerge in childhood² and include rare genetic disorders, such as Rett syndrome (RTT), Fragile X syndrome (FXS), and Duchenne muscular dystrophy (DMD), as well as multifactorial conditions, such as attention deficit hyperactivity disorder (ADHD) and autism spectrum disorder (ASD)³. Despite the wide range of clinical manifestations, many neurological phenotypes are shared across distinct NDDs, indicating the involvement of common molecular pathways. Examples of such shared phenotypes include, among others, seizures, intellectual disability, microcephaly, and hypotonia.

High-throughput techniques, such as RNA sequencing, have significantly advanced our understanding of the molecular pathways involved in NDDs. By identifying genes and pathways with altered expression levels, transcriptomic profiling provides insights into the pathological mechanisms and enables the discovery of potential therapeutic targets. The importance of expression profiling for therapeutic target discovery is exemplified by the observation of brain-derived neurotrophic factor (BDNF) downregulation in RTT in the early 2000s⁴. Since BDNF cannot cross the blood-brain barrier, this discovery led to the prioritization of insulin-like growth factor 1 (IGF1), a factor with biological properties similar to those of BDNF, as a promising therapeutic candidate⁵. More recently, trofinetide, a synthetic analog of the N-terminal tripeptide of IGF1, has been FDA-approved for the treatment of RTT⁶, highlighting the potential of expression profiling for the identification of therapeutic targets.

Nevertheless, current transcriptomic studies of NDDs are often constrained by small sample sizes or high biological variability, particularly for rare genetic disorders and conditions with a complex disease etiology, respectively. Because of these limitations, these studies do not achieve the statistical power needed to fully characterize the disease transcriptome and to uncover novel molecular pathways. Integrating multiple datasets can increase statistical power, allowing for deeper insights into the molecular pathophysiology of NDDs. In our study, we therefore integrated 151 human RNA sequencing datasets from 115 independent studies to characterize common and distinct molecular pathways of NDDs and their neurological phenotypes.

Results

The NDD transcriptomic profile consists of 151 datasets from 115 independent studies

The Gene Expression Omnibus (GEO) was queried for RNA sequencing data of NDDs (Supplementary Text S1), identifying 188 studies with NCBI-generated raw counts available for at least six samples. Datasets without a case-control design, with fewer than three cases and/or controls, or without NDD cases were excluded. The 115 studies that remained after filtering were included in our analysis (Supplementary Data S1). Where possible, individual studies were stratified based on mutation type and/or cell type/tissue before performing differential expression analysis (i.e., case versus control), yielding a total of 151 distinct datasets/statistical comparisons. The differential expression estimates of the datasets were used to identify general transcriptomic changes that occur across the different NDDs and to find alterations that are associated with a specific disorder or neurological phenotype (Fig. 1A). The summary statistics of the datasets can be downloaded and interactively explored on our website (https://SyNUM.shinyapps.io/NDD-transcriptomic-atlas/).

Most of the 151 datasets encompassed only three or four cases and/or controls (Fig. 1B). The most common NDDs in our meta-analysis include RTT, FXS, DMD, and Down syndrome (DS), which together account for 43% of all datasets (Fig. 1C). The causative genes of these four NDDs (i.e., MECP2, FMR1, DMD, and chromosome 21 genes, respectively) exhibited the anticipated transcriptomic alterations (Supplementary Fig. S1). Moreover, datasets clustered mostly by cell type/tissue rather than by disease, highlighting the importance of tissue choice in transcriptomic experiments (Fig. 1D, Supplementary Fig. S2). This is exemplified by cholinergic receptor CHRNA4 and T-cell receptor TRGV4, which only reach high levels of significance in neural and immune cell types/tissues, respectively.

NDDs are characterized by inflammatory, translational, mitochondrial, and synaptic alterations

Our first aim was to identify transcriptomic alterations that are common across NDDs. In particular, gene set enrichment analysis (GSEA) was performed on the 151 datasets to find Gene Ontology–Biological Process (GO-BP) terms with a differential expression profile across the NDDs. The 30 GO-BP terms that reached statistical significance (i.e., false discovery rate-adjusted (FDR-adj.) P value < 0.05) across most NDD datasets encompassed processes related to inflammation, RNA translation, mitochondrial ATP synthesis, and synaptic signaling (Fig. 2A, Supplementary Fig. S3, and Supplementary Data S2). GSEA of the molecular function and cellular component terms showed similar results, which are provided in Supplementary Fig. S4.

**Fig. 2: Common alterations of the neurodevelopmental disorders (NDDs).**

Notably, enrichment of the GO-BP terms (i.e., -log₁₀ P value of the GSEA) was not associated with any specific NDD or neurological phenotype (independent two-group Mann-Whitney U test, FDR-adj. P value > 0.05, Supplementary Fig. S5), suggesting that these transcriptomic changes are not driven by a specific disorder or phenotype but instead are shared across the NDD spectrum. Nor were there any significant associations between the GO-BP enrichment and the sample type (i.e., immune, neural, and other cell types/tissues) or model system (i.e., in vitro and non-in vitro models) (Supplementary Fig. S5). Among the genes in the inflammation-, RNA translation-, ATP synthesis-, and synaptic signaling-related GO-BP terms, HLA-B, SH3BGRL, APP, and SNCA, respectively, were differentially expressed in the largest number of datasets, exemplifying gene-level transcriptomic alterations across NDDs (Fig. 2B-C and Supplementary Fig. S6). Moreover, changes in ATP synthesis were linked to a strongly dysregulated expression of mitochondrial-encoded genes (Supplementary Fig. S7).

Imprinted genes exhibit a dysregulated transcriptomic profile in NDDs

Since imprinted genes are known to have essential roles during brain development⁷, their dysregulation might be involved in NDD pathology. In line with this hypothesis, we observed large transcriptomic alterations in imprinted genes, such as PEG10 and MEST, which were differentially expressed in 27% and 24% of the NDD datasets, respectively (Supplementary Fig. S8). More generally, in 123 datasets, imprinted genes were found to have higher odds of differential expression than non-imprinted genes, which is more than expected by chance (permutation P value < 0.001, Fig. 2D). This preferential differential expression of imprinted genes was observed for datasets of in vitro (permutation P value < 0.001) and non-in vitro (permutation P value = 0.003) systems (Supplementary Fig. S9). Furthermore, imprinted genes exhibited higher odds of both upregulation and downregulation (Supplementary Fig. S10).

RTT, DMD, DS, and FXS have distinct transcriptomic alterations

To identify disorder-associated transcriptomic changes, separate transcriptome-wide meta-analyses on the log₂ fold change (log₂FC) estimates (random effect, inverse variance method, Hartung-Knapp adjustment) were performed for DMD, DS, FXS, and RTT. After excluding the causative genes, the most significantly differentially expressed genes in the meta-analysis for these four disorders were PFN2, ZNF22, SRBD1, and ITGB4, respectively (Fig. 3A-B, Supplementary Data S3). Of these genes, PFN2 is mostly expressed in hippocampal neurons (Human Brain Cell Atlas⁸ version 1.0, Fig. 3C). In contrast, ITGB4 is not expressed in neuronal cells, but is specific to glia such as astrocytes and ependymal cells. Additionally, by performing GSEA, several processes with differential expression profiles in DMD, DS, FXS, and RTT were found (Fig. 3D). For instance, axonogenesis was downregulated in RTT and humoral immune response was upregulated in DS.

**Fig. 3: Disorder-associated transcriptomic alterations.**

LHX1/5-related Purkinje cell layer formation is altered in seizure-associated NDDs

The final aim was to identify transcriptomic alterations in genes and biological processes that are associated with specific NDD phenotypes. Using the Human Phenotype Ontology⁹, the NDDs were linked to eight distinct phenotypes: intellectual disability, hypotonia, global developmental delay, microcephaly, gait ataxia, autism/autistic behavior, seizure, and scoliosis (Fig. 4A). Interestingly, the differential expression of cerebellar Purkinje cell layer formation was strongly correlated with seizure-associated NDDs (FDR-adj. P value = 0.01, Fig. 4B). Within this GO-BP term, the LIM homeobox genes LHX1 and LHX5 had a higher likelihood of differential expression (P value = 0.01 and 0.03, respectively) in NDDs associated with seizures compared to those without seizures (Fig. 4C-D, Supplementary Fig. S11).

**Fig. 4: Phenotype-specific transcriptomic alterations.**

Publicly available single-cell RNA sequencing (scRNA-seq) data from the first-trimester developing human brain¹⁰ showed that during early neurodevelopment, both LHX1 and LHX5 are mostly expressed in the hindbrain and midbrain regions (Supplementary Fig. S12). In the adult brain (Human Protein Atlas¹¹ v24.0), LHX1 expression is highly specific to the cerebellum (Fig. 4E). This is in contrast to LHX5, which is also abundantly expressed in the hypothalamus and medulla oblongata (Supplementary Fig. S13). Moreover, scRNA-seq data of the cerebellar vermis (Human Brain Cell Atlas⁸ v1.0) revealed that LHX1 is almost exclusively transcribed in cerebellar neurons, particularly in those of the cerebellar inhibitory supercluster (Fig. 4F).

Besides cerebellar Purkinje cell layer formation, no other GO-BP term’s differential expression profile was significantly associated with any of the eight neurological phenotypes (i.e., FDR-adj. Mann-Whitney U Test P value > 0.05). Nevertheless, several genes, including ETS2, MRPS6, PIGP, and UBE2G2, were found to be more likely upregulated (i.e., P value < 0.05 and log₂FC > 0) in hypotonia-associated NDDs than in NDDs without hypotonia (FDR-adj. Fisher’s exact test P value < 0.05, Supplementary Fig. S14 and Supplementary Data S4).

Discussion

In this study, we conducted a transcriptome-wide meta-analysis of 151 human RNA sequencing datasets from 115 independent studies. Through this analysis, we revealed the aberrant transcriptomic profile of imprinted genes across NDDs. Additionally, we identified NDD-wide expression changes in inflammatory, translational, mitochondrial, and synaptic functions. Disorder-specific alterations were also uncovered, with PFN2, ZNF22, SRBD1, and ITGB4 showing differential expression in DMD, DS, FXS, and RTT, respectively. Finally, we showed that transcriptomic changes related to cerebellar Purkinje cell layer formation—in particular the differential expression of LHX1 and LHX5—are specific to seizure-associated NDDs.

Across most NDD datasets, imprinted genes were found to be more likely to be differentially expressed than non-imprinted genes (Fig. 2D). Genetic defects in some imprinted genes are well-known causes of NDDs. For example, Angelman and Prader-Willi syndromes are linked to the imprinted 15q11-q13 region, while Temple and Kagami-Ogata syndromes are associated with mutations or uniparental disomy of the imprinted 14q32 region¹². Moreover, genes from the 14q32 region, including the chromosome 14 miRNA cluster, MEG3, and MEG8, have also been linked to RTT pathology. Particularly, previous studies on RTT found differential expression of 14q32 genes or their murine homologs in brain organoid-derived extracellular vesicles¹³ and mouse brains¹⁴,¹⁵. Besides RTT, imprinted genes are involved in the pathology of other NDDs, such as Williams syndrome¹⁶, fetal alcohol syndrome¹⁷, and ASD¹⁸. Until now, the link between NDDs and imprinted genes has been based on these specific examples. Our findings, however, provide the first systematic evidence for the dysregulation of imprinted genes across a wide range of NDDs. This widespread dysregulation can be explained by the essential roles of imprinted genes in the regulation of neural proliferation, differentiation, and migration⁷. Particularly, any alteration in these tightly regulated processes may impair normal brain development, giving rise to NDD phenotypes.

We further identified synaptic signaling as one of the processes with transcriptomic alterations across the various NDD datasets (Fig. 2A). Likewise, mutations in synaptic genes are known to give rise to a wide range of NDD phenotypes, including hypotonia, intellectual disability, ataxia, and developmental delay¹⁹. Among synaptic signaling-associated genes, we found large expression changes in APP and SNCA (Fig. 2B-C), which have been linked to familial Alzheimer’s²⁰ and Parkinson’s disease²¹, respectively. In line with previous observations²², these findings point towards similarities in synaptic dysfunction between neurodevelopmental and neurodegenerative disorders.

Transcriptomic alterations in oxidative phosphorylation (Fig. 2A) and mitochondrial-encoded genes (Supplementary Fig. S7) may indicate the presence of mitochondrial dysfunction across the various NDDs. Indeed, many NDDs, including RTT, FXS, Angelman syndrome, and tuberous sclerosis complex, are characterized by dysfunctional mitochondria²³. The shared involvement of mitochondria relates to the essential role of mitochondrial dynamics during neurogenesis and synaptogenesis²⁴. Moreover, mitochondrial function, in particular the shift to oxidative phosphorylation, is required for the induction of neuronal differentiation²⁵.

Through the excessive production of reactive oxygen species or the release of danger-associated molecular patterns, mitochondrial dysfunction can trigger (neuro)inflammation²⁶, which, just as oxidative phosphorylation, exhibited transcriptomic alterations across the NDD datasets (Fig. 2A). Like mitochondrial dysfunction, aberrant inflammatory responses during neurodevelopment are considered key processes in NDD pathology²⁷. Interestingly, inflammation can, in turn, compromise mitochondrial function, including oxidative phosphorylation²⁶.

It is interesting to note that processes exhibiting transcriptomic changes across NDDs, including mitochondrial and synaptic function, RNA translation, and inflammatory processes, closely resemble hallmarks of ageing (i.e., mitochondrial dysfunction, altered intercellular communication, loss of proteostasis, and chronic inflammation, respectively). This observation may suggest that, at the molecular level, NDDs represent a state of accelerated or early-onset ageing. This hypothesis is supported by the fact that many NDDs have an associated neurodegenerative phenotype: the neurodevelopmental-degenerative continuum²⁸. For instance, DS and FXS are strongly associated with the development of Alzheimer’s disease and movement disorders, respectively. Particularly, almost all individuals with DS will have been diagnosed with Alzheimer’s disease by the age of 70 years, with a median age of diagnosis of around 50 years²⁹. Moreover, at an age of 40 years or older, nearly 40% of male FXS patients will suffer from movement disorders, such as Parkinson’s disease, tremor, and/or bradykinesia³⁰. Together, our findings highlight the need for dedicated analyses assessing the possible overlap in the molecular pathophysiology of neurodevelopmental and neurodegenerative disorders, and its potential for therapeutic interventions.

In our disorder-specific meta-analysis, we identified several transcriptomic markers for DMD, DS, FXS, and RTT (Fig. 3). The identification of these transcriptionally dysregulated genes may provide insights into key pathways downstream of their underlying genetic alterations. Among the identified markers, the upregulation of PFN2 and ITGB4 across the DMD and RTT datasets, respectively, may relate to known disease mechanisms, which are elaborated upon in the paragraphs below.

ITGB4 is highly expressed in astrocytes (Fig. 3C). In these cells, ITGB4 mediates exosome secretion, which, in turn, enhances oligodendrocyte progenitor cell (OPC) proliferation³¹,³². Although OPC proliferation has not been studied in the context of RTT, previous studies did identify differential protein expression of oligodendrocyte markers in the brain of MeCP2-null mice³³. Interestingly, increased expression of the oligodendrocyte marker PLP could not be restored in MeCP2-null mice with MeCP2-expressing oligodendrocytes, possibly due to non-cell autonomous effects³³. An example of such a non-cell autonomous effect in RTT pathology could be the enhanced ITGB4-mediated exosome secretion by astrocytes.

In DMD, the interaction between actin filaments and the extracellular matrix is lost as a result of dystrophin deficiency³⁴. In smooth muscle cells, actin polymerization has been shown to promote dystrophin expression³⁵ and strengthen the response to mechanical stress³⁶. Hence, upregulation of PFN2 (Fig. 3A), an important regulator of actin polymerization³⁷, might be a compensatory response aiming at strengthening the weakened interaction between actin filaments and the extracellular matrix in DMD pathology. It should be noted that, although DMD is predominantly characterized by muscular degeneration, dystrophin deficiency also has a significant impact on the electrophysiological function of hippocampal neurons³⁸ where PFN2 is highly expressed (Fig. 3C).

Our phenotype-specific analysis suggests the presence of altered cerebellar Purkinje cell layer formation in seizure-associated NDDs (Fig. 4B). Within this process, we identified a prominent role of LHX1 and LHX5, which showed a dysregulated expression profile specific to seizure-associated NDDs (Fig. 4C, D). Previous studies have shown essential roles of LHX1 and LHX5 in the formation of a functional cerebellar Purkinje cell layer³⁹,⁴⁰. For instance, knockout of Lhx1 and Lhx5 during embryonic development was shown to inhibit differentiation of Purkinje cell precursors into their mature form, thereby reducing the Purkinje cell pool in the cerebellum³⁹. Furthermore, postnatal knockout of these genes limits dendrite development of cerebellar Purkinje cells without reducing the cell number⁴⁰. Although LHX1 and LHX5 have not been directly linked to seizures or epilepsy, cerebellar Purkinje cell numbers are decreased in epileptic disorders⁴¹. Specifically, Purkinje cells are GABAergic neurons that provide inhibitory stimuli to the deep cerebellar nuclei⁴². Through this inhibitory pathway, cerebellar Purkinje cells prevent a seizure-causing hyperexcitability state of the cerebellum. In neurodegenerative disorders, seizures are thought to result from Purkinje cell degeneration or disruptions in Purkinje cell activity⁴²,⁴³. Our analysis suggests an alternative mechanism in NDDs, where the dysregulated LHX1 and LHX5 expression may contribute to seizure onset by impairing the formation of the Purkinje cell pool. As the negative impact of Lhx1/5 dysregulation on cerebellar Purkinje cell function extends postnatally⁴⁰, therapies that aim at maintaining physiological levels of LHX1 and LHX5 in the cerebellum may be beneficial for the treatment or management of seizures in NDDs. However, it should be noted that LHX1 and LHX5 have multifaceted roles during the development of the brain, including regulation of axonal guidance and neural survival⁴⁴,⁴⁵. Hence, their dysregulation impacts processes beyond Purkinje cell layer formation. Some of these may also contribute to seizures in NDDs, an aspect that warrants further investigation in dedicated future studies.

By integrating and analyzing 151 RNA sequencing datasets from 115 NDD studies, our research achieved statistical power that surpassed that of conventional RNA sequencing experiments, enabling the identification of transcriptomic alterations that have not been described before. Particularly, we highlighted the presence of ageing-related transcriptomic alterations and the differential expression of imprinted genes across distinct NDDs. We further identified multiple disorder- and phenotype-specific changes, such as the upregulation of ITGB4 in RTT and the differential expression of LIM homeobox genes LHX1 and LHX5 in seizure-associated NDDs. The current study is, however, limited to insights at the RNA level. Therefore, future studies should investigate whether the identified alterations are persistent at the protein level and which epigenetic processes are responsible for driving these transcriptomic changes. Furthermore, to translate the insights into cause-consequence relationships and potential therapeutic targets, functional studies, such as knockout experiments, will be needed.

To help other studies build upon our findings, we combined our data into an atlas that summarizes the current state of knowledge from NDD transcriptomic experiments. The transcriptomic atlas is publicly accessible for exploration and download (https://SyNUM.shinyapps.io/NDD-transcriptomic-atlas/). This resource allows researchers to examine the differential expression profiles of genes of interest across multiple NDDs, serving as a tool to prioritize potential drug targets and streamline the drug discovery process.

Methods

Data collection

An overview of the methodological workflow is illustrated in Fig. 1A. The list of NDDs from D’Souza et al.³ was used to query the GEO⁴⁶ for RNA sequencing data (NCBI-generated raw count files; Homo sapiens; date of extraction: July 5, 2024). The exact search query is provided in Supplementary Text S1. GEO studies with fewer than six samples or without NCBI-generated raw count files were excluded from data processing.

Data processing and statistical analysis

The collected datasets were manually curated and processed. During curation, datasets without a case-control design, with fewer than three cases and/or controls, or without NDD cases were excluded. For the statistical analysis, the DESeq2⁴⁷ (version 1.42.0) R/Bioconductor package was applied to compare the RNA expression levels between NDD cases and controls. Samples were stratified by mutation type and/or tissue/cell type before the differential expression analysis, provided that there were at least three cases and three controls within each stratum. This means that for some GEO studies more than one statistical comparison was performed. Furthermore, when information about sex, age/developmental stage, donor, and/or cell type/tissue was available, these factors were included as covariates in the statistical model. The exact experimental design for each dataset is detailed in Supplementary Data S1.
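The stratification rule described above can be illustrated with a short sketch. This is not the authors' code (their analysis was performed in R with DESeq2); the field names `tissue`, `mutation`, and `status` are hypothetical, and the sketch assumes that controls are shared by all mutation strata within the same tissue, which the text does not specify.

```python
from collections import defaultdict

def eligible_comparisons(samples, min_per_group=3):
    """Return (tissue, mutation) strata with enough cases and controls.

    `samples` is a list of dicts with the hypothetical keys 'tissue',
    'mutation' (ignored for controls), and 'status' ('case' or 'control').
    Assumption: controls of a tissue serve every mutation stratum of that tissue.
    """
    cases = defaultdict(int)     # (tissue, mutation) -> number of cases
    controls = defaultdict(int)  # tissue -> number of controls
    for sample in samples:
        if sample["status"] == "case":
            cases[(sample["tissue"], sample["mutation"])] += 1
        else:
            controls[sample["tissue"]] += 1
    return sorted(
        stratum
        for stratum, n_cases in cases.items()
        if n_cases >= min_per_group
        and controls.get(stratum[0], 0) >= min_per_group
    )
```

Each returned stratum would correspond to one case-versus-control comparison, which is how a single GEO study can contribute more than one dataset.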

Disease-phenotype associations

The Human Phenotype Ontology⁹ (version 2.0.4) was used to determine whether the included NDDs were associated with each of the following phenotypes: intellectual disability, hypotonia, global developmental delay, microcephaly, gait ataxia, autism/autistic behavior, seizure, and scoliosis. Each disorder was linked to the relevant phenotypes using its OMIM record in the Human Phenotype Ontology database. When the OMIM record was not available, the ORPHA record was used as an alternative.

Principal coordinate analysis

Principal coordinate analysis (PCoA) was applied to assess the similarity between the different datasets. The similarity between two datasets i and j (S_i,j) was defined as the Spearman correlation between the datasets’ P values (Eq. 1). The distances (D_i,j) were calculated from the dataset similarities (Eq. 2), and the resulting distance matrix was double-centered before eigendecomposition.

$$S_{i,j}=\operatorname{Spearman}\left(P_{i},P_{j}\right) \tag{1}$$

$$D_{i,j}=1-S_{i,j} \tag{2}$$
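As an illustration of Eq. 1 and 2 and the double-centering step, the following standard-library sketch computes the Spearman similarity, the distance matrix, and the double-centered matrix. It is not the authors' implementation. Squaring the distances and scaling by -1/2 before centering follows the classical PCoA convention and is an assumption here, since the text only states that the distances were double-centered. The eigendecomposition itself is left to a linear algebra library.

```python
def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks (Eq. 1)."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)  # undefined (division by zero) for constant input

def distance_matrix(p_values):
    """D[i][j] = 1 - Spearman(P_i, P_j) (Eq. 2); one P-value vector per dataset."""
    n = len(p_values)
    return [[1.0 - spearman(p_values[i], p_values[j]) for j in range(n)]
            for i in range(n)]

def double_center(dist):
    """Double-center the squared distances (assumed classical PCoA convention)."""
    n = len(dist)
    sq = [[d * d for d in row] for row in dist]
    row_mean = [sum(row) / n for row in sq]  # equals column means (symmetric matrix)
    grand_mean = sum(row_mean) / n
    return [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
             for j in range(n)] for i in range(n)]
```

The eigenvectors of the centered matrix, scaled by the square roots of their eigenvalues, would give the principal coordinates of the datasets.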

Gene set enrichment analysis

For each statistical comparison, GSEA was performed using the clusterProfiler⁴⁸ (version 4.10.1) R/Bioconductor package. For this, GO terms were used as gene sets and the log₂ fold change (log₂FC) was used as the ranking variable. The rrvgo⁴⁹ (version 1.14.2) R/Bioconductor package was applied to cluster similar GO terms using Resnik similarity (threshold = 0.85). The GO term that reached statistical significance (FDR-adjusted P value < 0.05) in the largest number of datasets was selected as the representative term for the cluster. For each GO-BP term, the P value and enrichment score were used to calculate the signed -log₁₀ P value according to Eq. 3.

$$\text{Signed } {-\log_{10}} P \text{ value} = -\log_{10}\left(P \text{ value}\right)\cdot \operatorname{sign}\left(\text{enrichment score}\right) \tag{3}$$
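Eq. 3 translates directly into a small function. The sketch below is illustrative only; the function name is hypothetical, and a P value of exactly zero is not handled.

```python
import math

def signed_neg_log10_p(p_value, enrichment_score):
    """Signed -log10 P value (Eq. 3): magnitude from P, sign from the enrichment score."""
    sign = (enrichment_score > 0) - (enrichment_score < 0)  # -1, 0, or 1
    return -math.log10(p_value) * sign
```

For example, a term with P = 0.01 and a negative enrichment score maps to approximately -2, so downregulated and upregulated terms fall on opposite sides of zero.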

Identification of common NDD transcriptomic alterations

Commonly affected biological processes were identified by finding the GO-BP terms with the largest number of significant NDD datasets (FDR-adjusted P value < 0.05). The top 30 GO-BP terms were clustered based on their signed -log₁₀ P value using hierarchical clustering (Euclidean distance and Ward D2 linkage). An independent two-group Mann-Whitney U test was used to test whether the top 30 GO-BP terms (i.e., signed and unsigned -log₁₀ GSEA P value) were associated with any of the NDDs with at least ten included datasets (i.e., RTT, FXS, DMD, and DS) or common neurological phenotypes (i.e., intellectual disability, hypotonia, global developmental delay, microcephaly, gait ataxia, autism/autistic behavior, seizure, and scoliosis).
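The ranking step described above (counting, per GO-BP term, the datasets in which it is significant and keeping the top 30) can be sketched as follows. The input layout, a mapping from term to a list of FDR-adjusted P values, is a hypothetical choice for illustration, and ties are broken alphabetically, which the text does not specify.

```python
def top_terms(adjusted_p, n_top=30, alpha=0.05):
    """Rank terms by the number of datasets with FDR-adjusted P < alpha.

    `adjusted_p` maps a term name to its FDR-adjusted P values, one per dataset.
    Returns up to `n_top` (term, count) pairs, most frequently significant first.
    """
    counts = {term: sum(p < alpha for p in ps) for term, ps in adjusted_p.items()}
    ranked = sorted(counts, key=lambda term: (-counts[term], term))
    return [(term, counts[term]) for term in ranked[:n_top]]
```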

Furthermore, it was tested whether imprinted genes play a role in NDD pathology. For this, Fisher’s exact test was applied to assess whether previously reported or predicted imprinted genes (obtained from geneimprint.com on December 5, 2023) have higher odds of differential expression compared to non-imprinted genes. To establish the baseline significance, a 1000-permutation analysis was performed. In each permutation, Fisher’s exact test was applied to a random gene set with the same size (n = 195) and expression profile as the imprinted genes. To generate random gene sets with a matching expression profile, all genes were first ranked by their median expression level across all datasets and divided into quartiles. Random gene sets were then sampled from each quartile in the same proportion as the imprinted genes (Supplementary Text S2). The permutation P value was calculated by counting the permutations in which at least as many datasets showed preferential differential expression (i.e., log₂ OR > 0) of the random gene set as was observed for the imprinted genes.
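The permutation scheme can be sketched with the standard library as below. This is an illustration under stated assumptions, not the authors' code: random sets are drawn from all genes (including imprinted ones), each dataset is represented as a hypothetical pair of sets `(de_genes, tested_genes)` with `de_genes` a subset of `tested_genes`, only the sign of the log odds ratio is evaluated rather than the full Fisher's exact test, and the P value is taken as the fraction of permutations that reach the observed count.

```python
import random

def quartile_labels(median_expr):
    """Map each gene to an expression quartile (0-3) by rank of its median expression."""
    ranked = sorted(median_expr, key=median_expr.get)
    n = len(ranked)
    return {gene: min(3, 4 * i // n) for i, gene in enumerate(ranked)}

def sample_matched_set(target_genes, quartile, rng):
    """Draw a random gene set with as many genes per quartile as `target_genes`."""
    pools = {q: [] for q in range(4)}
    for gene, q in quartile.items():
        pools[q].append(gene)
    needed = {q: 0 for q in range(4)}
    for gene in target_genes:
        needed[quartile[gene]] += 1
    sampled = []
    for q in range(4):
        sampled.extend(rng.sample(pools[q], needed[q]))
    return sampled

def n_datasets_enriched(gene_set, datasets):
    """Count datasets in which the set's odds of differential expression exceed 1."""
    genes = set(gene_set)
    count = 0
    for de_genes, tested_genes in datasets:
        a = len(genes & de_genes)          # in set, differentially expressed
        b = len(genes & tested_genes) - a  # in set, not differentially expressed
        c = len(de_genes) - a              # outside set, differentially expressed
        d = len(tested_genes) - a - b - c  # outside set, not differentially expressed
        if a * d > b * c:  # same as log2 OR > 0, without dividing by zero
            count += 1
    return count

def permutation_p(target_genes, median_expr, datasets, n_perm=1000, seed=0):
    """Fraction of matched random sets enriched in at least as many datasets."""
    rng = random.Random(seed)
    quartile = quartile_labels(median_expr)
    observed = n_datasets_enriched(target_genes, datasets)
    hits = sum(
        n_datasets_enriched(sample_matched_set(target_genes, quartile, rng), datasets)
        >= observed
        for _ in range(n_perm)
    )
    return hits / n_perm
```

Matching on expression quartile guards against the possibility that imprinted genes appear more often among differentially expressed genes simply because of their expression level.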

Identification of disorder-associated transcriptomic alterations

To identify the genes that are associated with FXS, RTT, DS, and DMD, a meta-analysis of the log₂FC estimates from the differential expression analysis (random effect, inverse variance method, Hartung-Knapp adjustment) was performed using the meta⁵⁰ (version 7.0-0) R package. The genes were ranked based on their P value. To identify disorder-associated biological processes, the GO-BP terms (clustered by a Resnik similarity threshold of 0.85) were ranked based on the number of significant datasets (GSEA P value < 0.05) with a consistent direction of effect (i.e., sign of enrichment score).
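To make the meta-analytic model concrete, the following sketch pools per-dataset log₂FC estimates for one gene with inverse-variance weights and the Hartung-Knapp variance. It is a simplified stand-in for the meta R package, not a reproduction of it: the between-study variance is estimated with the DerSimonian-Laird method, chosen here for brevity, and the estimator the authors used is not stated in the text. The function name and return keys are hypothetical.

```python
import math

def random_effects_hk(effects, variances):
    """Random-effects pooled estimate with Hartung-Knapp standard error.

    `effects` are per-dataset log2 fold changes and `variances` their squared
    standard errors. Requires at least two datasets.
    """
    k = len(effects)
    # Fixed-effect (inverse-variance) weights, used to estimate heterogeneity.
    w = [1.0 / v for v in variances]
    sw = sum(w)
    mu_fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    q = sum(wi * (yi - mu_fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c)  # DerSimonian-Laird estimate (assumption)
    # Random-effects weights and pooled estimate.
    w_re = [1.0 / (v + tau2) for v in variances]
    sw_re = sum(w_re)
    mu = sum(wi * yi for wi, yi in zip(w_re, effects)) / sw_re
    # Hartung-Knapp variance: weighted residual variance divided by (k - 1).
    var_hk = sum(wi * (yi - mu) ** 2 for wi, yi in zip(w_re, effects)) / ((k - 1) * sw_re)
    return {"estimate": mu, "tau2": tau2, "se": math.sqrt(var_hk), "df": k - 1}
```

Under the Hartung-Knapp adjustment, the ratio of the estimate to its standard error is compared against a t distribution with k - 1 degrees of freedom rather than a normal distribution, which is more conservative when only a few datasets are available.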

Identification of phenotype-specific transcriptomic alterations

An independent two-group Mann-Whitney U test was used to find the GO-BP terms that are significantly associated with any of the selected phenotypes (i.e., intellectual disability, hypotonia, global developmental delay, microcephaly, gait ataxia, autism/autistic behavior, seizure, and scoliosis). This was done using both the signed and unsigned -log₁₀ P values of the GSEA as the dependent variables. Moreover, Fisher’s exact test was applied to identify which genes have a higher likelihood of being differentially expressed (i.e., P value < 0.05) for each of the selected phenotypes. Protein-protein and gene-gene relationships were retrieved from GeneMANIA⁵¹ (version 3.5.3).
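The gene-level phenotype test can be illustrated with a one-sided Fisher's exact test on a 2×2 table of dataset counts. The sketch below is an assumption-laden illustration: the table layout (rows: phenotype-associated versus other NDD datasets; columns: gene differentially expressed or not) and the one-sided alternative are choices made here, and the text does not state which alternative the authors used.

```python
from math import comb

def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact P value for the table [[a, b], [c, d]].

    a: phenotype-associated datasets where the gene is differentially expressed
    b: phenotype-associated datasets where it is not
    c: other datasets where the gene is differentially expressed
    d: other datasets where it is not
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    total = comb(n, col1)
    tail = sum(comb(row1, x) * comb(n - row1, col1 - x) for x in range(a, upper + 1))
    return tail / total
```

For instance, if a gene were differentially expressed in all three datasets of phenotype-associated NDDs and in none of three other datasets, `fisher_exact_greater(3, 0, 0, 3)` would evaluate to 1/20.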

Brain region and cell type-specificity

The brain region-specificity of genes of interest was assessed using the six available brain RNA expression datasets from the Human Protein Atlas¹¹ (version 24.0): HPA (Human), FANTOM, GTEx, HPA (Pig), HPA (Mouse), and Allen Mouse Brain Atlas. The RNA expression levels for each brain region were expressed as normalized transcripts per million. For the Allen Mouse Brain Atlas only, the expression energy was used as a measure of RNA expression levels. Moreover, the cell type-specific expression profile of genes of interest was assessed using the processed scRNA-seq data from the Human Brain Cell Atlas⁸ (version 1.0). Finally, the single-cell expression profile of LHX1 and LHX5 during the first trimester of the developing human brain was assessed using the processed scRNA-seq data from Braun et al.¹⁰.

Shiny app

An online application for the visualization of the collected NDD transcriptomics data was created using the Shiny (version 1.9.1) R package. The application is hosted on the shinyapps.io server and is accessible via the following weblink: https://SyNUM.shinyapps.io/NDD-transcriptomic-atlas/.

Statistics and reproducibility

All statistical analyses were performed using the R programming language version 4.4.2. The code used for the analyses is publicly available on GitHub (https://github.com/SyNUM-lab/NDD-transcriptomics) and Zenodo (https://doi.org/10.5281/zenodo.15528341).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The NDD transcriptomics datasets used in this study are deposited in GEO. A list of the GEO accession codes is provided in Supplementary Data S1. To facilitate interactive exploration of the transcriptomic profiles of the NDD datasets, we have made a Shiny application publicly available at https://SyNUM.shinyapps.io/NDD-transcriptomic-atlas/. Numerical source data for the main figures in the manuscript are available on Zenodo (https://doi.org/10.5281/zenodo.15516906).

Code availability

All R scripts used for the analysis are publicly available on GitHub (https://github.com/SyNUM-lab/NDD-transcriptomics) and Zenodo (https://doi.org/10.5281/zenodo.15528341).

References

1. Francés, L. et al. Current state of knowledge on the prevalence of neurodevelopmental disorders in childhood according to the DSM-5: a systematic review in accordance with the PRISMA criteria. Child Adolesc. Psychiatry Ment. Health 16, 27 (2022).
2. Thapar, A., Cooper, M. & Rutter, M. Neurodevelopmental disorders. Lancet Psychiatry 4, 339–346 (2017).
3. D’Souza, H. & Karmiloff-Smith, A. Neurodevelopmental disorders. Wiley Interdiscip. Rev. Cogn. Sci. 8, e1398 (2017).
4. Sun, Y. E. & Wu, H. The ups and downs of BDNF in Rett syndrome. Neuron 49, 321–323 (2006).
5. Pini, G. et al. IGF1 as a potential treatment for Rett syndrome: safety assessment in six Rett patients. Autism Res. Treat. 2012, 679801 (2012).
6. Percy, A. K., Ananth, A. & Neul, J. L. Rett syndrome: the emerging landscape of treatment strategies. CNS Drugs 38, 851–867 (2024).
7. Thamban, T., Agarwaal, V. & Khosla, S. Role of genomic imprinting in mammalian development. J. Biosci. 45, 1–21 (2020).
8. Siletti, K. et al. Transcriptomic diversity of cell types across the adult human brain. Science 382, eadd7046 (2023).
9. Gargano, M. A. et al. The human phenotype ontology in 2024: phenotypes around the world. Nucleic Acids Res. 52, D1333–D1346 (2024).
10. Braun, E. et al. Comprehensive cell atlas of the first-trimester developing human brain. Science 382, eadf1226 (2023).
11. Uhlén, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
12. Isles, A. R. The contribution of imprinted genes to neurodevelopmental and neuropsychiatric disorders. Transl. Psychiatry 12, 210 (2022).
13. Bahram Sangani, N. et al. Involvement of extracellular vesicle microRNA clusters in developing healthy and Rett syndrome brain organoids. Cell Mol. Life Sci. 81, 410 (2024).
14. Sharifi, O. et al. Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression. Commun. Biol. 7, 1292 (2024).
15. Wu, H. et al. Genome-wide analysis reveals methyl-CpG-binding protein 2-dependent regulation of microRNAs in a mouse model of Rett syndrome. Proc. Natl. Acad. Sci. USA 107, 18161–18166 (2010).
16. Crespi, B. J. & Procyshyn, T. L. Williams syndrome deletions and duplications: genetic windows to understanding anxiety, sociality, autism, and schizophrenia. Neurosci. Biobehav. Rev. 79, 14–26 (2017).
17. Gutherz, O. R. et al. Potential roles of imprinted genes in the teratogenic effects of alcohol on the placenta, somatic growth, and the developing brain. Exp. Neurol. 347, 113919 (2022).
18. Li, J. et al. Potential role of genomic imprinted genes and brain developmental related genes in autism. BMC Med. Genom. 13, 54 (2020).
19. Michetti, C., Falace, A., Benfenati, F. & Fassio, A. Synaptic genes and neurodevelopmental disorders: From molecular mechanisms to developmental strategies of behavioral testing. Neurobiol. Dis. 173, 105856 (2022).
20. Nilsberth, C. et al. The ‘Arctic’ APP mutation (E693G) causes Alzheimer’s disease by enhanced Abeta protofibril formation. Nat. Neurosci. 4, 887–893 (2001).
21. Xu, W., Tan, L. & Yu, J. T. Link between the SNCA gene and Parkinsonism. Neurobiol. Aging 36, 1505–1518 (2015).
22. Taoufik, E., Kouroupi, G., Zygogianni, O. & Matsas, R. Synaptic dysfunction in neurodegenerative and neurodevelopmental diseases: an overview of induced pluripotent stem-cell-based disease models. Open Biol. 8, 180138 (2018).
23. Ortiz-González, X. R. Mitochondrial dysfunction: a common denominator in neurodevelopmental disorders? Dev. Neurosci. 43, 222–229 (2021).
24. Anitha, A., Thanseem, I., Iype, M. & Thomas, S. V. Mitochondrial dysfunction in cognitive neurodevelopmental disorders: cause or effect? Mitochondrion 69, 18–32 (2023).
25. Zheng, X. et al. Metabolic reprogramming during neuronal differentiation from aerobic glycolysis to neuronal oxidative phosphorylation. Elife 5, e13374 (2016).
26. van Horssen, J., van Schaik, P. & Witte, M. Inflammation and mitochondrial dysfunction: a vicious circle in neurodegenerative disorders? Neurosci. Lett. 710, 132931 (2019).
27. Zengeler, K. E. & Lukens, J. R. Innate immunity at the crossroads of healthy brain maturation and neurodevelopmental disorders. Nat. Rev. Immunol. 21, 454–468 (2021).
28. Hickman, R. A., O’Shea, S. A., Mehler, M. F. & Chung, W. K. Neurogenetic disorders across the lifespan: from aberrant development to degeneration. Nat. Rev. Neurol. 18, 117–124 (2022).
29. Fortea, J. et al. Clinical and biomarker changes of Alzheimer’s disease in adults with Down syndrome: a cross-sectional study. Lancet 395, 1988–1997 (2020).
30. Utari, A. et al. Aging in fragile X syndrome. J. Neurodev. Disord. 2, 70–76 (2010).
31. Zhang, W. et al. Astrocytes increase exosomal secretion of oligodendrocyte precursor cells to promote their proliferation via integrin β4-mediated cell adhesion. Biochem. Biophys. Res. Commun. 526, 341–348 (2020).
32. Bahram Sangani, N., Gomes, A. R., Curfs, L. M. G. & Reutelingsperger, C. P. The role of extracellular vesicles during CNS development. Prog. Neurobiol. 205, 102124 (2021).
33. Nguyen, M. V. et al. Oligodendrocyte lineage cells contribute unique features to Rett syndrome neuropathology. J. Neurosci. 33, 18764–18774 (2013).
34. Duan, D., Goemans, N., Takeda, S., Mercuri, E. & Aartsma-Rus, A. Duchenne muscular dystrophy. Nat. Rev. Dis. Prim. 7, 13 (2021).
35. Turczyńska, K. M. et al. Regulation of smooth muscle dystrophin and synaptopodin 2 expression by actin polymerization and vascular injury. Arterioscler Thromb. Vasc. Biol. 35, 1489–1497 (2015).
36. Gunst, S. J. & Zhang, W. Actin cytoskeletal dynamics in smooth muscle: a new paradigm for the regulation of smooth muscle contraction. Am. J. Physiol. Cell Physiol. 295, C576–C587 (2008).
37. Murk, K., Ornaghi, M. & Schiweck, J. Profilin isoforms in health and disease - all the same but different. Front. Cell Dev. Biol. 9, 681122 (2021).
38. Bianchi, R. et al. Hippocampal synaptic and membrane function in the DBA/2J-mdx mouse model of Duchenne muscular dystrophy. Mol. Cell Neurosci. 104, 103482 (2020).
39. Zhao, Y. et al. LIM-homeodomain proteins Lhx1 and Lhx5, and their cofactor Ldb1, control Purkinje cell differentiation in the developing cerebellum. Proc. Natl. Acad. Sci. USA 104, 13182–13186 (2007).
40. Lui, N. C. et al. Lhx1/5 control dendritogenesis and spine morphogenesis of Purkinje cells via regulation of Espin. Nat. Commun. 8, 15079 (2017).
41. Ming, X., Prasad, N., Thulasi, V., Elkins, K. & Shivamurthy, V. K. N. Possible contribution of cerebellar disinhibition in epilepsy. Epilepsy Behav. 118, 107944 (2021).
42. Bernardi, S., Gemignani, F. & Marchese, M. The involvement of Purkinje cells in progressive myoclonic epilepsy: focus on neuronal ceroid lipofuscinosis. Neurobiol. Dis. 185, 106258 (2023).
43. Cook, A. A., Fields, E. & Watt, A. J. Losing the beat: contribution of Purkinje cell firing dysfunction to disease, and its reversal. Neuroscience 462, 247–261 (2021).
44. Leung, R. F. et al. Genetic regulation of vertebrate forebrain development by homeobox genes. Front. Neurosci. 16, 843794 (2022).
45. Hirsch, D., Kohl, A., Wang, Y. & Sela-Donenfeld, D. Axonal projection patterns of the dorsal interneuron populations in the embryonic hindbrain. Front. Neuroanat. 15, 793161 (2021).
46. Barrett, T. et al. NCBI GEO: archive for functional genomics data sets-update. Nucleic Acids Res. 41, D991–D995 (2013).
47. Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
48. Xu, S. et al. Using clusterProfiler to characterize multiomics data. Nat. Protoc. 19, 3292–3320 (2024).
49. Sayols, S. Rrvgo: a bioconductor package for interpreting lists of gene ontology terms. MicroPubl. Biol. 10, 000811 (2023).
50. Balduzzi, S., Rücker, G. & Schwarzer, G. How to perform a meta-analysis with R: a practical tutorial. Evid. Based Ment. Health 22, 153–160 (2019).
51. Warde-Farley, D. et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220 (2010).

Acknowledgements

This research was financially supported by Stichting Terre - the Dutch Rett Syndrome Foundation.

Author information

Authors and Affiliations

Department of Biochemistry, Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
Jarno Koetsier, Leon J. Schurgers, Chris P. Reutelingsperger & Nasim Bahram Sangani
GKC, Maastricht University Medical Centre, Maastricht, The Netherlands
Jarno Koetsier, Leopold M. G. Curfs, Chris P. Reutelingsperger & Nasim Bahram Sangani
Department of Psychiatry and Neuropsychology, School for Mental Health and Neuroscience (MHeNs), Maastricht University, Maastricht, The Netherlands
Lars M. T. Eijssen
Department of Translational Genomics, Maastricht University, Maastricht, The Netherlands
Lars M. T. Eijssen

Contributions

J.K. performed data collection and analysis and wrote the first version of the manuscript. L.E. and N.B.S. supervised the project and wrote the first version of the manuscript. C.R., L.S., and L.C. supervised the project and reviewed and edited the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jarno Koetsier.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks A.I. and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary handling editors: A.M. and B.B. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Information

Description of Additional Supplementary Materials

Supplementary Data S1

Supplementary Data S2

Supplementary Data S3

Supplementary Data S4

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Koetsier, J., Eijssen, L.M.T., Schurgers, L.J. et al. Integrative analysis of 115 transcriptomic studies decodes the molecular landscape of neurodevelopmental disorders. Commun Biol 8, 914 (2025). https://doi.org/10.1038/s42003-025-08330-2

Received: 07 February 2025
Accepted: 02 June 2025
Published: 12 June 2025
DOI: https://doi.org/10.1038/s42003-025-08330-2