Abstract
Although the efficacy of pharmacy in the treatment of attention deficit/hyperactivity disorder (ADHD) has been well established, the lack of predictors of treatment response poses great challenges for personalized treatment. The current study employed a comprehensive approach, combining genome-wide association analyses (GWAS) and deep learning (DL) methods, to elucidate the genetic underpinnings of pharmacological treatment response in ADHD. Based on genotype data of medication-naïve patients with ADHD who received pharmacological treatments for 12 weeks, the current study performed GWAS using the percentage changes in ADHD-RS score as phenotype. Then, DL models were constructed to predict percentage changes in symptom scores using genetic variants selected based on four different genome-wide P thresholds (E-02, E-03, E-04, E-05) as inputs. The current GWAS results identified two significant loci (rs10880574, P = 2.39E-09; rs2000900, P = 3.31E-09) which implicated two genes, TMEM117 and MYO5B, that were primarily associated with both brain- and gut-related disorders. The convolutional neural network (CNN) model, using variants with genome-wide P values less than E-02 (5516 SNPs), demonstrated the best performance with mean squared error (MSE) equals 0.012 (Accuracy = 0.83; Sensitivity = 0.90; Specificity = 0.75) in the validation dataset, 0.081 in an independent test dataset (Acc = 0.61, Sensitivity = 0.81; Specificity = 0.26). Notably, the variant that contributed most to the CNN model was NKAIN2, an ADHD-related gene, which is also associated with metabolic processes. To conclude, the integration of GWAS and DL methods revealed new genes contribute to ADHD pharmacological treatment responses, and underscored the interplay between neural systems and metabolic processes, potentially providing critical insights into precision treatment. Furthermore, our CNN model exhibited good performance in an independent dataset, encouraged future studies and implied potential clinical applications.
Similar content being viewed by others
Introduction
Attention deficit hyperactivity disorder (ADHD) is a highly prevalent and debilitating condition that affects individuals across the lifespan. Pharmacological treatment is often recommended as a key component of ADHD management [1, 2]. Although both stimulants methylphenidate (MPH) and non-stimulants atomoxetine (ATX) could reduce clinical symptoms of children with ADHD, the responses to medication vary widely among individuals. Nearly 30–40% of children with ADHD show poor response to stimulants and a higher proportion for non-stimulants [3, 4]. Therefore, identifying factors that contribute to treatment response may help clinicians tailor treatment to the individual patient and reduce unnecessary side effects, potentially enhancing treatment compliance and improving outcomes and quality of life [5].
ADHD is highly heritable, with a heritability between 70–80% [6], which suggested that Identifying genetic factors associated with pharmacological response in ADHD could shed light on the underlying neurobiological mechanisms of the condition [5]. Previous candidate gene-based studies showed that ADHD treatment responses were probably regulated by genes involved in neurotransmitter system [7], including dopamine pathway (DRD4, SLC6A3), norepinephrine pathway (SLC6A2, ADRA2A), and catecholamines metabolism (COMT); and cytochrome P450 isoenzyme [8, 9], including CYP2D6 and CES1. On the other hand, polygenetic risk scores (PRSs) based on our Han Chinese ADHD genome-wide association study (GWAS) only explained 2% variance of medication response [10]. Hegvik and his colleagues concluded that ADHD drugs might act on different mechanisms other than the etiology of ADHD [11].
For complex traits, such as medication response, association signals tend to spread across the genome, emphasizing the importance of genome-wide study [12]. A GWAS from Denmark, which utilized “switching medications from methylphenidate” as a phenotype, reported significant SNPs [13]. However, it’s hard to speculate whether ADHD patients discontinued or switched from stimulants due to poor response or other reasons.
While GWAS has significantly enhanced our knowledge of the genetic basis of complex traits, it also has inherent limitations, as “significant SNPs” often explain only a small fraction of the underlying complexity [12, 14]. Thus, considering the aforementioned findings, machine learning (ML) has the potential to further contribute to our understanding. ML models have shown promise in predicting ADHD treatment response using clinical information [15]. In other psychiatric disorders, such as Alzheimer’s disease, ML models have been successful in identifying new gene targets [16]. Moreover, deep learning (DL) models have demonstrated good performance in ADHD diagnosis and have the ability to capture nominal SNPs that may have been overlooked by GWAS studies [17].
In summary, previous studies have emphasized the genetic underpinnings of ADHD treatment response, particularly using candidate genes in neurotransmitter and drug metabolism systems. However, the limitations of candidate gene-based approaches necessitate a transition to GWAS and ML methodologies. GWAS provides a comprehensive genome-wide perspective, meanwhile ML models have the potential to complement GWAS findings and build clinic-informative prediction models.
Based on the above background, the current study aims to achieve two objectives: Firstly, identify SNPs associated with ADHD pharmacological treatment response using GWAS; secondly, develop a DL model to predict treatment response based on genomic data, validate the model using an independent dataset, and interpret the trained model to identify relevant SNPs. By combining GWAS and DL approaches, we hope to advance our understanding of ADHD treatment response and its underlying genetic architecture.
Patients and methods
Study population
Subject recruitment
Dataset 1 (discovery/training & validation set)
Dataset 1 was used in the GWAS analysis as well as training and validating the deep learning models. Two hundred and forty-one children with ADHD are included in this set and are from a previous clinical study conducted by our team between 2001 and 2010. The details about the inclusion and exclusion criteria for these participants can see our previous publications [10, 18].
Dataset 2 (replication/testing set)
This dataset was used to test the DL model. Sixty-four children with ADHD are included in this set, and part of the subjects overlapped with the subjects of our previous publication and can found the recruitment details [19].
Detailed procedure was shown in Fig. 1. The study was approved by the Institutional Review Board of the Peking University Health Science Center (2010–19; 2018–21). After complete description of the study to the subjects, written informed consent was obtained from parents of the ADHD individuals. The current research was registered in ClinicalTrials.gov (NCT01065259).
Study flow of the current research.
Treatment and efficacy evaluation
After baseline evaluation and blood collecting (T0), all the participants accepted open-label treatment with either intermediate-release MPH (Ritalin), oral osmotic MPH (OROS MPH, Concentra), or ATX (Strattera) for 8–12 weeks. All participants in the independent testing set accepted the pharmacological treatment under a naturalistic circumstance after baseline evaluation. In this part, the medication (OROS MPH or ATX) was chosen based on a discussion between the clinical child psychiatrist and parents. The details about titration process can be found in Fu et al. [19]. After standardized titration and maintenance treatment, the clinical symptoms of the subjects were evaluated again (T1), usually 12 weeks after initial treatment. The clinical symptoms were evaluated by parent-rated ADHD-rating scale (ADHD-RS) [20, 21], with each item on a four-grade scale of 0–3. The change rate of ADHD-RS score between follow up (T1) and baseline (T0) was adopted to evaluate the efficacy of the pharmacological treatment as following.
(The Score in this equation means parent-rated ADHD-RS overall score)
The current study classified participants into responders and non-responders based on a threshold of 25% [22], defined robust and not robust responders using 40% as threshold [23, 24].
Genetic analyses
Genotyping
The samples in dataset 1 were genotyped using the Affymetrix6.0 array from CapitalBio Ltd. (Beijing, China) following the standard Affymetrix protocol. The Affymetrix6.0 array consisted of 906,600 probes targeting single nucleotide polymorphisms (SNPs).
For quality control (QC), the current study implemented the following procedures: (1) Individuals with per-individual autosomal heterozygosity >3 SD larger than the mean, without age or sex information, or with a per-individual call rate <98% were excluded; (2) the SNPs with low call rate (<95%), significant deviation from Hardy–Weinberg equilibrium (P < 1e–04), or a minor allele frequency (MAF) lower than 5% were excluded. After quality control, 241 samples with 516,369 SNPs remained for subsequent analysis.
The dataset 2 was genotyped using different array, i.e., the InfiniumPsychArray-24 array by CapitalBio Ltd. (Beijing). Considering these two arrays covered different sets of SNPs, we performed pre-phase and imputation after QC in dataset 2 to cover as many SNPs detected by dataset 1 as possible. Finally, 64 samples and 2,727,614 variants were included.
GWAS analysis
The association study between SNPs and the pharmacological treatment outcomes (percentage changes of ADHD-RS) was conducted using the additive linear regression model implemented in PLINK version 1.9, controlling age, sex, and IQ as covariates [25,26,27]. The genome-wide significant threshold was set as 1E-08.
To fully assess the confounding effects of different medications, MPH and ATX, we performed GWAS separately for each medication, and utilized interaction analyses and Cochrane’s Q test to determine the potential heterogeneity across these medications.
Gene-based and pathway-based analyses
Gene-level and pathway-level analyses were implemented by Multi-marker Analysis of GenoMic Annotation (MAGMA 1.05) with the GWAS summary statistics [28]. MAGMA assigned SNPs to genes based on the NCBI 37.3 definitions, and generated gene-level P-values using a SNP-wise mean model in which the sum of –log (SNP P-value) is used as a test statistic, then calculated gene P-value using a known approximation of the sampling distribution. 1000 Genomes Project Phase 3 Asian samples were used as reference for correcting LD [29]. After SNP annotation, there were 13,157 genes that were covered by at least one SNP. To account for multiple testing, a Bonferroni correction was applied, with a significance threshold set at 3.80E-06.
We used Hi-C coupled MAGMA (H-MAGMA) [30], an analytic tool based on MAGMA which assigns noncoding SNPs to cognate genes based on the chromatin interaction profiles; and assigns exonic and promoter SNPs based on physical position. Five Hi-C datasets were used (Adult dorsolateral prefrontal cortex, fetal paracentral cortex, iPSC-derived astrocyte cells, cortical neuron, midbrain dopaminergic system) [12, 31,32,33,34,35,36]. Bonferroni method was applied for multiple testing.
After obtaining gene-based P values, enrichment of association signals in genes belonging to specific biological pathways or processes were tested. In the current study, we used the gene sets identified by Biological Pathways in the Gene-ontology (GO) datasets (Num of gene sets = 7300, Bonferroni corrected P = 6.85E-06) [37, 38].
Mapping risk genes and enrichment analyses
To link variants to potential genes, three methods for gene mapping were leveraged: positional mapping, eQTL mapping, and chromatin interaction mapping. For positional mapping, we used a maximum distance of 10 kb. The eQTL data and chromatin interaction data were utilized to identify genes through the implementation of FUMA [39] (details of the datasets utilized in this study can be found in the Supplementary materials).
To gain further insights into the biological implications, enrichment analyses were performed using the over-representation analysis in WebGestalt [40, 41], utilizing four datasets (GO, KEGG, DisGeNET, and GLAD4U) [37, 38, 42,43,44]. Phenome-wide association analyses (PheWAS) were performed for top genes using GWAS-ATLAS [45]. These analyses aimed to identify any significant enrichment of gene sets or functional categories associated with the identified variants.
DL model
Feature selection using GWAS p-value
SNPs were chosen based on the GWAS analysis described above. Four different P value thresholds were applied: P < 1E-02 (NSNP = 5516); P < 1E-03 (NSNP = 1055); P < 1E-04 NSNP = 282; P < 1E-05 NSNP = 99). The SNPs were input into the DL models following the location orders.
DL model training
We trained and evaluated four types of prediction DL models each: Convolutional neural networks (CNN), Long-short term memory (LSTM), CNN+LSTM, and Dense neural network (DNN) with training, validation, and test datasets, using four sets of SNPs selected by GWAS P-value which was described above as inputs. Similar to GWAS analysis, the change rates of ADHD-RS scores were used as predicted phenotype. We also compared the performance between our DL model and machine learning models, including Support Vector Regression (SVR) and the Random Forest (RF) model. All the codes were implemented using python with Keras and scikit-learn [46,47,48].
Each SNP was encoded to represent the number of major alleles (A) it contains, resulting into a one-dimensional vector (n × 1) with values of 1 (aa), 2 (Aa), 3 (AA), or 0 (missing). For CNN and CNN-LSTM model, we further transformed the (n ×1) vector to a (n × 4) one-hot vector representation, i.e., aa to (1, 0, 0, 0), Aa to (0, 1, 0, 0), AA to (0, 0, 1, 0), NA to (0, 0, 0, 1).
The samples were randomly divided into a training set (85%) and a validation set (15%). All the DL models were initialized using the Xavier method [49]; the mean squared loss (MSE) was employed as loss function [50]. For activation functions, the hidden layers were activated by the Rectified Linear Unit (ReLU), while the Sigmoid function was applied in the output layer. To further evaluate the performance of DL models in classification tasks, we assessed accuracy, sensitivity, and specificity metrics, using a 25% change in ADHD-RS scores as the threshold to distinguish between responders and non-responders.
The “Weights and biases” (Wandb) platform was used for hyperparameter tuning [51]. Sweeps with 200 runs was performed using a “random” strategy for every DL model, and the model with lowest validation loss was selected as the best.
According to the “sweep” results, A CNN model using E2 SNP-set as inputs outperformed other models. The hyperparameters were chosen as follows: Batch size = 64; Adam method for optimize [52]; learning rate = 1 × 10−2; 600 epochs; dropout layers with a dropout rate of 0.5; two convolutional layers both with 64 neurons, kernel size = 3. (Details were shown in the supplementary file).
Interpretation of the DL model
We interpreted the model using DeepExplain package [53]. Contributions of each SNP to the trained model were calculated using “Saliency” method. Then, gene variants with top 300 saliency scores were manually checked and mapped to putative genes using the methods described above (Section “Mapping risk genes and enrichment analyses”).
Independent dataset: test the consistency of GWAS results and test the performance of DL models
To validate the GWAS findings and assess the performance of the DL model, an independent replication/test dataset consisting of 64 ADHD samples (dataset 2) was utilized. The recruitment and data quality control procedures for this dataset were identical to those of the discovery set, with the exception of the genotype procedure (Details see Section “Study population”).
Referring to a previous GWAS study, we evaluated the consistency of GWAS locus by (1) examining the direction and magnitude of effect sizes [54]. Concordance of direction was assessed using a one-sample test of the proportion with Yates’ continuity correction, with a null hypothesis of a proportion of 0.50. Linear regression analyses were performed to determine the relationship between effect sizes in the two datasets, with a significance threshold set at α < 0.05. (2) perform association tests within the significant genes detected by GWAS, Monte Carlo permutation test was used with 500 times permutation, setting significance threshold as 0.05.
We also compared our GWAS findings with the results from a previous study focusing on stimulant-dependence by Cox et al. [55], which included Western and African populations, to compare consistencies across diverse ethnic groups.
The dataset 2 was also utilized as an independent test dataset to evaluate the performance of the trained DL mode. Due to disparities in genotyping procedures, only a subset of SNPs could be included from dataset 2. In this study, there are 4099 out of 5516 SNPs (E-02), 770 out of 1055 SNPs (E-03), 215 out of 282 SNPs (E-04), and 69 out of 99 SNPs (E-05) were included by dataset 2 for model testing, with any uncovered SNPs coded as ‘NA’.
Results
Demographic characteristics
As mentioned above (Section “Study population”), the current study leveraged two datasets (Table 1). The first dataset comprised 241 samples (39 girls and 202 boys), with a mean age of 9.16 years old (±2.14). In the current cohort, 130 samples comorbid with other psychiatric disorders (More details about comorbidity were shown in the Supplementary Table 1). The first dataset was used for GWAS analysis, training and validating the machine learning models. 169 (70%) participants were classified as responders (percentage changes >25%), 114 (49%) participants were robust responders (>40%).
The second data set comprised 64 samples (8 girls and 56 boys), with a mean age of 10.34 (±2.24). The second dataset was used for replication of GWAS results and testing the trained DL models. 43 (67%) individuals were responders, and 29 (44%) were classified as robust responders.
Genome-wide significant loci
Two SNPs surpassed the threshold for genome-wide significance (P < 1e-08, Fig. 2A, Table 2). The QQ plot is presented in Fig. 2B, with a genomic inflation factor of 1.00604, indicating a negligible influence of systematic bias on the obtained results. Notably, both loci maintained statistical significance even after adjustment using Genomic Control (PGC_adj < 1e-08). To assess the potential effects of comorbidities on our findings, we conducted supplementary analyses, incorporating comorbidity status as a covariate and separately analyzing data after removing participants with comorbid conditions. Both analyses consistently identified the significance of two loci (rs10880574 P = 3.28E-10, 3.54E-09; rs2000900 P = 3.35E-09, 1.30E-08), suggested no confounding effect of comorbidities on our results.
A Manhattan plot and B QQ plot of the current GWAS; C, D regional plots of two significant locus (rs10880574; rs2000900); E. PPI network of gene MYO5B.
The first locus is located on chromosome 12 (lead SNP: rs10880574, CHR12: 44235427 [GRCh37], P = 2.39E-09, beta = −0.5134, PGC_adj = 2.645E-09), located in the 5-prime UTR intron region of the gene TMEM117 (Transmembrane Protein 117). TMEM117 is a protein-coding gene, mainly expressed in the brain (LifeMap [56]). TMEM117 was involved in intrinsic apoptotic signaling pathway in response to endoplasmic reticulum stress, located in the endoplasmic reticulum and plasma membrane. Using GWAS catalog [57], we identified prior-reported associations between TMEM117 and electroencephalogram measurement and inflammatory bowel disease [58, 59]. Gene PUS7L (pseudouridine Synthase 7 Like) was identified as putative by eQTL data. Interestingly, gene PUS7L is also previously detected as responsible for both brain measurement and inflammatory bowel disease [59, 60].
The second locus is located on the chromosome 18 (lead SNP: rs2000900, CHR18:47502338, P = 3.31e-09, beta = −0.4648, PGC_adj = 3.664E-09). The nearest gene is MYO5B (Myosin 5 B), which is a protein-coding gene, involved in vesicular trafficking and is required in a complex with RAB11A and RAB11FIP2 for the transport of NPC1L1 to the plasma membrane, essential for several neural processes [61,62,63,64]. MYO5B is mainly expressed in digestive organs, including esophagus, minor salivary gland, colon, and intestine (GTEx) [65], as well as liver, and kidney (LifeMap). MYO5B was previously identified to be responsible for lipid metabolic processes, associated with several cholesterol measurements [66,67,68,69]. PheWAS results suggested that both TMEM117 and MYO5B were related to psychiatry and metabolism phenotypes (Pbon < 0.05), gene PUS7L potentially related to lipid metabolism (Supplementary Fig. 2).
To test the heterogenous effects between MPH and ATX, we separately performed GWAS for each medication and meta-analyzed them. The meta-analysis result confirmed two loci at P < 1E-06 level (rs10880574 P = 5.553E-08; rs2000900 P = 1.438E-07) (Supplementary Fig. 2). No significant heterogenous effects (P < 1E-05) between two medications were identified (Q = 0.0122, 0.0033), which were confirmed by interaction analyses (P = 0.06; 0.0163) (Supplementary Fig. 2).
Results identified rs10880574 and rs2000900 at P < 1E-06 level (P = 2.098E-07; 7.613E-08) in MPH GWAS (Supplementary Fig. 3). No SNP with P value less than 1E-06 was detected in ATX GWAS (rs10880574 P = 0.01439; rs2000900 P = 0.03735), which might suggest that the SNPs affected ATX responses to a smaller extent than MPH; and the more limited sample size of ATX (N = 80) might prevent us from identifying statistically significant loci.
Gene-based results
MAGMA identified the gene CTD-2561J22.3 (P = 3.67E-06; Pbon = 0.048; CHR19: 21587961-21674068) as significantly associated with ADHD pharmacological treatment (Supplementary Fig. 2). Nominal significance was identified for TMEM117 (P = 5.60E-05; Pbon = 0.737). Novel RNA gene ENSG00000226664 (P = 1.39E-06; Pbon = 0.0455) was tested using cortical neurons and iPSC-derived astrocytes based Hi-C data (Supplementary Fig. 3); gene ENSG00000222121 (P = 1.01E-07; Pbon = 0.0326) was suggested by Hi-C data in the fetal brain tissues (Supplementary Fig. 3).
Replication of the current GWAS results
We further tested the reliability of GWAS results by an independent dataset (dataset 2). Our results indicated some consistencies between two cohorts. A locus in TMEM117 was identified in the dataset 2 (12:44544253, rs117799886, Pperm = 0.036), which provide further validation of our GWAS results. Sign test suggested same direction of SNPs with a GWAS-P less than E-05 (P = 0.019, concordance rate = 0.833), however, SNPs under E-03 and E-04 threshold did not show significance (P = 0.19; P = 0.17, Supplementary Table 2).
Comparing with a previous GWAS results reported stimulant dependence associated loci [55], we found moderate consistencies. Specifically, we observed one SNP s61915073 (P = 7.35E-05, OR = 0.63, 12q12) with P-value less than 1E-04 locating within 500 kb of the current significant loci rs10880574; and five SNPs locating with P values less than 1E-03 (18q21.1) within the 250 kb of the loci rs2000900. Our results suggested there are consistencies of genetic background among different ethnic groups underlying stimulant associated response.
Deep learning prediction model
The CNN model using SNPs with GWAS-P value less than E-02 presented best performance (MSE = 0.043, 0.012, 0.081; Acc = 0.99, 0.83, 0.61; Sen = 1.0, 0.90, 0.81; Specificity = 0.98, 0.75, 0.26 in training, validation, and test datasets). The architecture of model was showed in the Fig. 3A, and the hyperparameters and architecture in other DL models were shown in Supplementary Table 2.
A architecture and B Training and validation loss of the E2-CNN model (N represents batch size).
Compared with other DL models and traditional machine learning method, our CNN model had a better performance (Table 3).
Interpret the DL model
The current study identified relevant variants through calculating saliency scores. Top 20 SNPs were shown in the Supplementary Table 4.
The variant contributed most to the DL model locate within the intronic region of the gene NKAIN2 (rs535492, 6:124863962, P = 0.00193), which is a brain-expressed gene associated with externalizing traits (ADHD, substance use disorder, and others [70]. NKAIN2 mainly expressed in multiple brain tissues, including substantial nigra; hippocampus; amygdala; anterior cingulate cortex; thalamus; cerebellum, and oligodendrocyte cells (GTE v8 & LifeMap). PheWAS analysis also indicated its involvement in both psychiatric disorder/cognitive functions and metabolic processes (Supplementary table 5 and Supplementary Fig. 5).
Then, FUMA was applied to explore the functional consequences of the top 300 SNPs, which lead to a total of 920 genes. Subsequent enrichment analysis was performed using WebGestalt.
The analyses based on GO Cellular Component (CC) terms suggested these genes were significantly enriched in the intrinsic component of plasma membrane (P = 1.92E-05; PFDR = 0.023). Analyses based on biological pathways (BP) and molecular function (MF) terms failed to show any significant results.
Analysis based on the GLAD4U database identified the phenotype Methamphetamine dependence (P = 1.78E-05; PFDR = 0.03) was significantly enriched in the mapped genes (Supplementary Fig. 5).
Discussion
Understanding the heterogeneity in pharmacological treatment response among individuals with ADHD has long been a challenging yet crucial issue in the field, as it holds significance for personalized treatment. Aiming to deepen our understanding of the underlying genetic basis by integrating genome data with innovative DL techniques, the current study revealed novel genes, including TMEM117, MYO5B, NKAIN2, PUS7L, and CTD-2561J22.3. Furthermore, we developed a CNN-based prediction model that demonstrated favorable performance in an independent test dataset, suggesting its potential utility in clinical settings. The current results contribute to advancing our knowledge of the genetic factors involved in ADHD treatment response and provide a basis for future individualized treatment.
In the current genome-wide analyses, we identified two significant loci, i.e., rs10880574 and rs2000900, located within the genes TMEM117 and MYO5B, respectively. The gene TMEM117 is primarily expressed in the brain, meanwhile, previous GWAS have linked TMEM117 to brain activity and digestive disorders [58, 59]. On the other hand, MYO5B is predominantly expressed in digestive organs, with its role in metabolic processes. GWAS have shown associations between MYO5B and various lipid and cholesterol measurements, such as total cholesterol, low-density lipoprotein, high-density lipoprotein, and apolipoprotein A1 [66,67,68,69]. Furthermore, we identified gene PUS7L as the target gene of rs10880574, which is also associated with brain measurement and digestive disorder [59, 60]. PheWAS results of TMEM117 and MYO5B suggested involvements in metabolism/nutrition and psychiatric disorders (Supplementary Fig. 1). Meanwhile, MYO5B was identified as related to schizophrenia in a Chinese sample and subsequently replicated in a later study [71, 72], which might be attributed to the essential role of MYO5B in BDNF/TrkB-induced dendritic branching and its involvement in various neurotransmitter circuits [62,63,64]. The PheWAS results suggested significant associations between MYO5B with lithium response in bipolar I patients, might support the possible role of this gene on psychopharmacological response (Supplementary Fig. 1). However, it also needs to notice that the signal detected in rs2000900 was somewhat isolated, and could not exclude possibilities of false positive, future investigations and validations would be warranted. Taken together, the GWAS findings suggested consistency with previous findings in psychopharmacology, highlighted the complex interplay between neural, gut and metabolic pathways in the context of ADHD pharmacological treatment response and provide new avenues for further investigation.
In the current study, we combined the treatment responses to ATX and MPH. It’s worth noting although there are different mechanisms, substantial shared pathways underlying two medications have also been identified [18, 73,74,75]. Both medications target monoaminergic circuits, especially NE pathway. Neuroimaging studies also underscored similar effects of both medications in normalizing compromised brain activation in fronto-striatal and fronto-parietal circuits [76,77,78]. Additionally, an animal study highlighted lipid metabolism as another shared pathway between MPH and ATX, corroborating our findings [67, 79,80,81]. The methodology of combing different phenotypes that share a genetic architecture has previously been shown to offer numerous benefits, such as augmenting the statistical power of GWAS, uncovering common genetic loci, and clarifying shared biological pathways. Thus, by combining the outcomes of MPH and ATX, our current results provided reliable evidence about common genetic basis underlying the effects of these two medications.
It’s worth mention that the two significant loci are both intronic variants. Although this kind of variants typically not directly cause protein variations, growing evidence has suggested their critical roles in gene regulation, including expression and splicing. The influence of intronic variants on pharmacological responses in mental disorders has also been identified, including associations with antidepressant responses and antipsychotic-induced weight gain through intronic variants rs4884091 in RNF219-AS1, rs4955665 in MECOM, and rs10422861 in PRPD, and others (Li et al., 2020; Liao et al., 2024). Our results further foregrounded the pivotal role of intronic variants in mediating the pharmacological treatment responses for ADHD, aligning with existing literature in this field.
Previous studies demonstrated several remarkable contributions of DL models in the field of psychiatry, including identifying “nominal” variants which tend to be ignored in GWAS and capturing the real-world complex non-linear relationship, which could lead to more accurate prediction model and new target genes. Here we proposed a CNN-based regression model using 5516 variants with P-values less than 1E-02 in genome-wide analyses as inputs (E2-CNN model), which outperformed other DL models and traditional ML models, also in the prediction of an independent test dataset, highlighting its promise in precise treatment. However, it is important to acknowledge that due to the limited size of the dataset and the heterogeneity in gene arrays between the training and testing datasets, our deep learning model achieved only moderate accuracy on the test dataset (61%). Further research is required to enhance the model’s generalization. Potential improvements could involve incorporating larger and more diverse datasets, integrating SNP data with other omics data, and utilizing transfer learning techniques to increase the clinical utility of DL models in predicting responses to ADHD pharmacological treatments.
Interpretations of the E2-CNN model highlighted a brain-expressed gene, NKAIN2, which is responsible for the externalizing disorders, including substance use disorder and ADHD. By linking the gene to functions, we found it mainly involved in brain disorders/measurements and metabolic processes (Supplementary table 5). Taking these findings together, our DL-identified genes further validated our above hypothesis, i.e., metabolism and brain-related genes collaboratively regulate the pharmacological treatment response.
Enrichment analyses of the genes identified by the DL model revealed the involvement of two pathways, i.e., the plasma membrane and Methamphetamine dependence. It is noteworthy that Methamphetamine, similar to MPH, is also a central nervous system (CNS) stimulant that mainly achieves its functions through competitively inhibiting dopamine and norepinephrine transporters, thereby increasing catecholamine transmission [1, 82]. This finding provides rational explanation for our results. Additionally, the stability of the plasma membrane is closely associated with lipid metabolic processes, suggesting a potential link between the identified genes and lipid metabolism.
Emerging evidence has highlighted the intricate connections between ADHD and metabolic processes. Individuals with ADHD often exhibit a higher prevalence of obesity and gastrointestinal symptoms; and the brain-gut axis was regarded as a crucial pathway in ADHD etiology [83,84,85]. Meanwhile, the administration of ADHD medications has also been shown to influence metabolic processes, with animal studies suggesting that MPH could impact the levels of lipoproteins in axon and glial cells [86]. Our findings identified that genes associated with treatment response, were meanwhile involved in the metabolic processes or digestive system, which implied a dynamic and reciprocal interplay between medication and metabolism. To conclude, our results encouraged further investigations to explore the impact of metabolic processes on the pharmacological treatment response in ADHD, especially regarding the lipid-related processes; and our results suggested that clinical practice should take the metabolic factors into consideration when tailor the specific treatment plans for individuals.
Conclusion
In summary, our study represents the first genome-wide analyses to identify significant variants associated with ADHD pharmacological treatment response. We discovered possible associated genes, included TMEM117, MYO5B, and PUS7L. Additionally, this is the first study to utilize genetic data constructing a DL model for ADHD pharmacological treatment prediction, which demonstrated promising performance. Through the interpretation of the DL model, we further identified NKAIN2 as a potential associated gene. Functional analyses further supported the involvement of genes related to neural and metabolism systems in the modulation of ADHD pharmacological treatment responses. These findings contribute to a better understanding of the genetic and biological mechanisms underlying medication response in ADHD.
Data availability
The data from this study are not publicly available.
Code availability
The GWAS analyses were conducted using the software tools specified in the Methods section. The code for developing the deep learning models was implemented in Python 3.9 using Keras. Detailed code can be provided upon request.
References
Posner J, Polanczyk GV, Sonuga-Barke E. Attention-deficit hyperactivity disorder. Lancet. 2020;395:450–62. https://doi.org/10.1016/S0140-6736(19)33004-1.
Faraone SV, Asherson P, Banaschewski T, Biederman J, Buitelaar JK, Ramos-Quiroga JA, et al. Attention-deficit/hyperactivity disorder. Nat Rev Dis Primers. 2015;1:15020. https://doi.org/10.1038/nrdp.2015.20.
Newcorn JH, Kratochvil CJ, Allen AJ, Casat CD, Ruff DD, Moore RJ, et al. Atomoxetine and osmotically released methylphenidate for the treatment of attention deficit hyperactivity disorder: acute comparison and differential response. Am J Psychiatry. 2008;165:721–30. https://doi.org/10.1176/appi.ajp.2007.05091676.
Su Y, Yang L, Stein MA, Cao Q, Wang Y. Osmotic release oral system methylphenidate versus atomoxetine for the treatment of attention-deficit/hyperactivity disorder in chinese youth: 8-week comparative efficacy and 1-year follow-up. J Child Adolesc Psychopharmacol. 2016;26:362–71. https://doi.org/10.1089/cap.2015.0031.
Michelini G, Norman LJ, Shaw P, Loo SK. Treatment biomarkers for ADHD: taking stock and moving forward. Transl Psychiatry. 2022;12:444. https://doi.org/10.1038/s41398-022-02207-2.
Faraone SV, Larsson H. Genetics of attention deficit hyperactivity disorder. Mol Psychiatry. 2019;24:562–75. https://doi.org/10.1038/s41380-018-0070-0.
Myer NM, Boland JR, Faraone SV. Pharmacogenetics predictors of methylphenidate efficacy in childhood ADHD. Mol Psychiatry. 2018;23:1929–36. https://doi.org/10.1038/mp.2017.234.
Michelson D, Read HA, Ruff DD, Witcher J, Zhang S, McCracken J. CYP2D6 and clinical response to atomoxetine in children and adolescents with ADHD. J Am Acad Child Adolesc Psychiatry. 2007;46:242–51. https://doi.org/10.1097/01.chi.0000246056.83791.b6.
Trzepacz PT, Williams DW, Feldman PD, Wrishko RE, Witcher JW, Buitelaar JK. CYP2D6 metabolizer status and atomoxetine dosing in children and adolescents with ADHD. Eur Neuropsychopharmacol. 2008;18:79–86. https://doi.org/10.1016/j.euroneuro.2007.06.002.
Zhong Y, Yang B, Su Y, Qian Y, Cao Q, Chang S, et al. The association with quantitative response to attention-deficit/hyperactivity disorder medication of the previously identified neurodevelopmental network genes. J Child Adolesc Psychopharmacol. 2020;30:348–54. https://doi.org/10.1089/cap.2018.0164.
Hegvik TA, Waløen K, Pandey SK, Faraone SV, Haavik J, Zayats T. Druggable genome in attention deficit/hyperactivity disorder and its co-morbid conditions. new avenues for treatment. Mol Psychiatry. 2021;26:4004–15. https://doi.org/10.1038/s41380-019-0540-z.
Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017;169:1177–86. https://doi.org/10.1016/j.cell.2017.05.038.
Brikell I, Wimberley T, Albiñana C, Pedersen EM, Vilhjálmsson BJ, Agerbo E, et al. Genetic, clinical, and sociodemographic factors associated with stimulant treatment outcomes in ADHD. Am J Psychiatry. 2021;178:854–64. https://doi.org/10.1176/appi.ajp.2020.20121686.
Tam V, Patel N, Turcotte M, Bossé Y, Paré G, Meyre D. Benefits and limitations of genome-wide association studies. Nat Rev Genet. 2019;20:467–84. https://doi.org/10.1038/s41576-019-0127-1.
Faraone SV, Gomeni R, Hull JT, Chaturvedi SA, Busse GD, Melyan Z, et al. Predicting efficacy of viloxazine extended-release treatment in adults with ADHD using an early change in ADHD symptoms: machine learning post Hoc analysis of a phase 3 clinical trial. Psychiatry Res. 2022;318:114922. https://doi.org/10.1016/j.psychres.2022.114922.
Rodriguez S, Hug C, Todorov P, Moret N, Boswell SA, Evans K, et al. Machine learning identifies candidates for drug repurposing in Alzheimer’s disease. Nat Commun. 2021;12:1033. https://doi.org/10.1038/s41467-021-21330-0.
Liu L, Feng X, Li H, Cheng Li S, Qian Q, Wang Y. Deep learning model reveals potential risk genes for ADHD, especially Ephrin receptor gene EPHA5. Brief Bioinforma. 2021;22:bbab207. https://doi.org/10.1093/bib/bbab207.
Yang L, Cao Q, Shuai L, Li H, Chan RCK, Wang Y. Comparative study of OROS-MPH and atomoxetine on executive function improvement in ADHD: a randomized controlled trial. Int J Neuropsychopharmacol. 2012;15:15–26. https://doi.org/10.1017/S1461145711001490.
Fu Z, Yuan J, Pei X, Zhang K, Xu C, Hu N, et al. Shared and unique effects of long-term administration of methylphenidate and atomoxetine on degree centrality in medication-naïve children with attention-deficit/hyperactive disorder. Int J Neuropsychopharmacol. 2022;25:709–19. https://doi.org/10.1093/ijnp/pyac028.
DuPaul GJ, Power TJ, Anastopoulos AD, Reid R. ADHD rating scale IV: checklists, norms, and clinical inter-pretation. NewYork: The Guilford Press; 1998.
Su YE, Wang H, Geng YG, Sun L, Du YS, Fan F, et al. Parent ratings of ADHD symptoms in Chinese urban schoolchildren: assessment with the Chinese ADHD rating scale-IV: home version. J Atten Disord. 2015;19:1022–33. https://doi.org/10.1177/1087054712461177.
Leuchter AF, McGough JJ, Korb AS, Hunter AM, Glaser PEA, Deldar A, et al. Neurophysiologic predictors of response to atomoxetine in young adults with attention deficit hyperactivity disorder: a pilot project. J Psychiatr Res. 2014;54:11–18. https://doi.org/10.1016/j.jpsychires.2014.03.009.
Dickson RA, Maki E, Gibbins C, Gutkin SW, Turgay A, Weiss MD. Time courses of improvement and symptom remission in children treated with atomoxetine for attention-deficit/hyperactivity disorder: analysis of Canadian open-label studies. Child Adolesc Psychiatry Ment Health. 2011;5:1–8. https://doi.org/10.1186/1753-2000-5-14.
Pelsser LM, Frankena K, Toorman J, Savelkoul HF, Dubois AE, Pereira RR, et al. Effects of a restricted elimination diet on the behaviour of children with attention-deficit hyperactivity disorder (INCA study): a randomised controlled trial. Lancet. 2011;377:494–503. https://doi.org/10.1016/S0140-6736(10)62227-1.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for Whole-Genome Association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75. https://doi.org/10.1086/519795.
Purcell S, Chang C. PLINK. 2007. www.cog-genomics.org/plink/1.9/.
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaSci. 2015;4:7. https://doi.org/10.1186/s13742-015-0047-8.
de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol. 2015;11:e1004219. https://doi.org/10.1371/journal.pcbi.1004219.
Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, et al. A global reference for human genetic variation. Nature. 2015;526:68–74. https://doi.org/10.1038/nature15393.
Sey NYA, Hu B, Mah W, Fauni H, McAfee JC, Rajarajan P, et al. A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles. Nat Neurosci. 2020;23:583–93. https://doi.org/10.1038/s41593-020-0603-0.
Jung I, Schmitt A, Diao Y, Lee AJ, Liu T, Yang D, et al. A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat Genet. 2019;51:1442–9. https://doi.org/10.1038/s41588-019-0494-8.
Sey NYA, Hu B, Iskhakova M, Lee S, Sun H, Shokrian N, et al. Chromatin architecture in addiction circuitry identifies risk genes and potential biological mechanisms underlying cigarette smoking and alcohol use traits. Mol Psychiatry. 2022;27:3085–94. https://doi.org/10.1038/s41380-022-01558-y.
Won H, de la Torre-Ubieta L, Stein JL, Parikshak NN, Huang J, Opland CK, et al. Chromosome conformation elucidates regulatory relationships in developing human brain. Nature. 2016;538:523–7. https://doi.org/10.1038/nature19847.
Wang D, Liu S, Warrell J, Won H, Shi X, Navarro FCP, et al. Comprehensive functional genomic resource and integrative model for the human brain. Science. 2018;362:eaat8464. https://doi.org/10.1126/science.aat8464.
Rajarajan P, Borrman T, Liao W, Schrode N, Flaherty E, Casiño C, et al. Neuron-specific signatures in the chromosomal connectome associated with schizophrenia risk. Dev Dynam. 2018;362:1269. https://doi.org/10.1126/science.aat4311.
Hu B, Won H, Mah W, Park RB, Kassim B, Spiess K, et al. Neuronal and glial 3D chromatin architecture informs the cellular etiology of brain disorders. Nat Commun. 2021;12:3968. https://doi.org/10.1038/s41467-021-24243-0.
Aleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, Ebert D, et al. The gene ontology knowledgebase in 2023. Genetics. 2023;224:iyad031. https://doi.org/10.1093/genetics/iyad031.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25:25–29. https://doi.org/10.1038/75556.
Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8:1826. https://doi.org/10.1038/s41467-017-01261-5.
Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47:W199–205. https://doi.org/10.1093/nar/gkz401.
Zhang B, Kirov S, Snoddy J. WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic Acids Res. 2005;33:W741–748. https://doi.org/10.1093/nar/gki475.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30. https://doi.org/10.1093/nar/28.1.27.
Piñero J, Bravo À, Queralt-Rosinach N, Gutiérrez-Sacristán A, Deu-Pons J, Centeno E, et al. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 2017;45:D833–39. https://doi.org/10.1093/nar/gkw943.
Jourquin J, Duncan D, Shi Z, Zhang B. GLAD4U: deriving and prioritizing gene lists from PubMed literature. BMC Genomics. 2012;13:S20. https://doi.org/10.1186/1471-2164-13-S8-S20.
Watanabe K, Stringer S, Frei O, Umićević Mirkov M, de Leeuw C, Polderman TJC, et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat Genet. 2019;51:1339–48. https://doi.org/10.1038/s41588-019-0481-0.
Chollet F, Others. Keras. 2015. GitHub. Retrieved from https://github.com/fchollet/keras.
Van Rossum G, Drake FL. Python 3 reference manual. Scotts Valley, CA: CreateSpace; 2009.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.
Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings; 249–56. https://proceedings.mlr.press/v9/glorot10a.html (2010). Accessed July 13 2023.
Das K, Jiang J, Rao JNK. Mean squared error of empirical predictor. Ann Stat. 2004;32:818–40. https://doi.org/10.1214/009053604000000201.
Biewald, L. Experiment tracking with weights and biases. Published online 2020. Software available from wandb.com.
Kingma D, Ba J. Adam: A Method for Stochastic Optimization[J]. Comput Sci. 2014. https://doi.org/10.48550/arXiv.1412.6980.
Ancona M, Ceolini E, Öztireli C, Gross M. Towards better understanding of gradient-based attribution methods for Deep Neural Networks. ICLR. 2018. https://openreview.net/forum?id=Sy21R9JAW.
Demontis D, Walters RK, Martin J, Mattheisen M, Als TD, Agerbo E, et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet. 2019;51:63–75. https://doi.org/10.1038/s41588-018-0269-7.
Cox J, Sherva R, Wetherill L, Foroud T, Edenberg HJ, Kranzler HR, et al. Genome-wide association study of stimulant dependence. Transl Psychiatry. 2021;11:1–10. https://doi.org/10.1038/s41398-021-01440-5.
Edgar R, Mazor Y, Rinon A, Blumenthal J, Golan Y, Buzhor E, et al. LifeMap discoveryTM: the embryonic development, stem cells, and regenerative medicine research portal. PLOS One. 2013;8:e66629. https://doi.org/10.1371/journal.pone.0066629.
MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, et al. The new NHGRI-EBI catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 2017;45:D896–901. https://doi.org/10.1093/nar/gkw1133.
Jawinski P, Kirsten H, Sander C, Spada J, Ulke C, Huang J, et al. Human brain arousal in the resting state: a genome-wide association study. Mol Psychiatry. 2019;24:1599–609. https://doi.org/10.1038/s41380-018-0052-2.
Yang SK, Hong M, Oh H, Low HQ, Jung S, Ahn S, et al. Identification of Loci at 1q21 and 16q23 that affect susceptibility to inflammatory bowel disease in Koreans. Gastroenterology. 2016;151:1096–99.e4. https://doi.org/10.1053/j.gastro.2016.08.025.
He W, Han X, Ong JS, Hewitt AW, Mackey DA, Gharahkhani P, et al. Association of Novel Loci With Keratoconus Susceptibility in a Multitrait Genome-Wide Association study of the UK Biobank database and canadian longitudinal study on Aging. JAMA Ophthalmol. 2022;140:568–76. https://doi.org/10.1001/jamaophthalmol.2022.0891.
Liu Y, Xu XH, Chen Q, Wang T, Deng CY, Song BL, et al. Myosin Vb controls biogenesis of post-Golgi Rab10 carriers during axon development. Nat Commun. 2013;4:2005. https://doi.org/10.1038/ncomms3005.
Gardner LA, Hajjhussein H, Frederick-Dyer KC, Bahouth SW. Rab11a and its binding partners regulate the recycling of the ß1-adrenergic receptor. Cell Signal. 2011;23:46–57. https://doi.org/10.1016/j.cellsig.2010.07.020.
Lisé MF, Wong TP, Trinh A, Hines RM, Liu L, Kang R, et al. Involvement of myosin Vb in glutamate receptor trafficking. J Biol Chem. 2006;281:3669–78. https://doi.org/10.1074/jbc.M511725200.
Lazo OM, Gonzalez A, Ascaño M, Kuruvilla R, Couve A, Bronfman FC. BDNF regulates Rab11-mediated recycling endosome dynamics to induce dendritic branching. J Neurosci. 2013;33:6112–22. https://doi.org/10.1523/JNEUROSCI.4630-12.2013.
Carithers LJ, Ardlie K, Barcus M, Branton PA, Britton A, Buia SA, et al. A novel approach to high-quality postmortem tissue procurement: The GTEx Project. Biopreserv Biobank. 2015;13:311–9. https://doi.org/10.1089/bio.2015.0032.
Davis JP, Huyghe JR, Locke AE, Jackson AU, Sim X, Stringham HM, et al. Common, low-frequency, and rare genetic variants associated with lipoprotein subclasses and triglyceride measures in Finnish men from the METSIM study. PLoS Genet. 2017;13:e1007079. https://doi.org/10.1371/journal.pgen.1007079.
Sinnott-Armstrong N, Tanigawa Y, Amar D, Mars N, Benner C, Aguirre M, et al. Genetics of 35 blood and urine biomarkers in the UK Biobank. Nat Genet. 2021;53:185–94. https://doi.org/10.1038/s41588-020-00757-z.
Koskeridis F, Evangelou E, Said S, Boyle JJ, Elliott P, Dehghan A, et al. Pleiotropic genetic architecture and novel loci for C-reactive protein levels. Nat Commun. 2022;13:6939. https://doi.org/10.1038/s41467-022-34688-6.
Graham SE, Clarke SL, Wu KHH, Kanoni S, Zajac GJM, Ramdas S, et al. The power of genetic diversity in genome-wide association studies of lipids. Nature. 2021;600:675–9. https://doi.org/10.1038/s41586-021-04064-3.
Karlsson Linnér R, Mallard TT, Barr PB, Sanchez-Roige S, Madole JW, Driver MN, et al. Multivariate analysis of 1.5 million people identifies genetic associations with traits related to self-regulation and addiction. Nat Neurosci. 2021;24:1367–76. https://doi.org/10.1038/s41593-021-00908-3.
Yue WH, Wang HF, Sun LD, Tang FL, Liu ZH, Zhang HX, et al. Genome-wide association study identifies a susceptibility locus for schizophrenia in Han Chinese at 11p11.2. Nat Genet. 2011;43:1228–31. https://doi.org/10.1038/ng.979.
Chen Y, Tian L, Zhang F, Liu C, Lu T, Ruan Y, et al. Myosin Vb gene is associated with schizophrenia in Chinese Han population. Psychiatry Res. 2013;207:13–18. https://doi.org/10.1016/j.psychres.2013.02.026.
Caye A, Swanson JM, Coghill D, Rohde LA. Treatment strategies for ADHD: an evidence-based guide to select optimal treatment. Mol Psychiatry. 2019;24:390–408. https://doi.org/10.1038/s41380-018-0116-3.
Cortese S, Adamo N, Del Giovane C, Mohr-Jensen C, Hayes AJ, Carucci S, et al. Comparative efficacy and tolerability of medications for attention-deficit hyperactivity disorder in children, adolescents, and adults: a systematic review and network meta-analysis. Lancet Psychiatry. 2018;5:727–38. https://doi.org/10.1016/S2215-0366(18)30269-4.
Yang L, Wang YF, Li J, Faraone SV. Association of norepinephrine transporter gene with methylphenidate response. J Am Acad Child Adolesc Psychiatry. 2004;43:1154–8. https://doi.org/10.1097/01.chi.0000131134.63368.46.
Cubillo A, Smith AB, Barrett N, Giampietro V, Brammer M, Simmons A, et al. Drug-specific laterality effects on frontal lobe activation of atomoxetine and methylphenidate in attention deficit hyperactivity disorder boys during working memory. Psychol Med. 2014;44:633–46. https://doi.org/10.1017/S0033291713000676.
Kowalczyk OS, Cubillo AI, Smith A, Barrett N, Giampietro V, Brammer M, et al. Methylphenidate and atomoxetine normalise fronto-parietal underactivation during sustained attention in ADHD adolescents. Eur Neuropsychopharmacol. 2019;29:1102–16. https://doi.org/10.1016/j.euroneuro.2019.07.139.
Schulz KP, Fan J, Bédard ACV, Clerkin SM, Ivanov I, Tang CY, et al. Common and unique therapeutic mechanisms of stimulant and nonstimulant treatments for attention-deficit/hyperactivity disorder. Arch Gen Psychiatry. 2012;69:952–61. https://doi.org/10.1001/archgenpsychiatry.2011.2053.
Suzuki S. Different effects of methylphenidate and atomoxetine on the behavior and brain transcriptome of zebrafish. Mol Brain. 2020;13:70. https://doi.org/10.1186/s13041-020-00614-4.
Mattheisen M, Grove J, Als TD, Martin J, Voloudakis G, Meier S, et al. Identification of shared and differentiating genetic architecture for autism spectrum disorder, attention-deficit hyperactivity disorder and case subgroups. Nat Genet. 2022;54:1470–8. https://doi.org/10.1038/s41588-022-01171-3.
O’Reilly PF, Hoggart CJ, Pomyen Y, Calboli FCF, Elliott P, Jarvelin MR, et al. MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS. PLoS One. 2012;7:e34861. https://doi.org/10.1371/journal.pone.0034861.
Calipari ES, Ferris MJ. Amphetamine mechanisms and actions at the dopamine terminal revisited. J Neurosci. 2013;33:8923–5. https://doi.org/10.1523/JNEUROSCI.1033-13.2013.
Kedem S, Yust-Katz S, Carter D, Levi Z, Kedem R, Dickstein A, et al. Attention deficit hyperactivity disorder and gastrointestinal morbidity in a large cohort of young adults. World J Gastroenterol. 2020;26:6626–37. https://doi.org/10.3748/wjg.v26.i42.6626.
Gong W, Guo P, Li Y, Liu L, Yan R, Liu S, et al. Role of the gut-brain axis in the shared genetic etiology between gastrointestinal tract diseases and psychiatric disorders: a genome-wide pleiotropic analysis. JAMA Psychiatry. 2023;80:360–70. https://doi.org/10.1001/jamapsychiatry.2022.4974.
Dam SA, Mostert JC, Szopinska-Tokov JW, Bloemendaal M, Amato M, Arias-Vasquez A. The role of the gut-brain axis in attention-deficit/hyperactivity disorder. Gastroenterol Clin North Am. 2019;48:407–31. https://doi.org/10.1016/j.gtc.2019.05.001.
Schmitz F, Pierozan P, Biasibetti-Brendler H, Ferreira FS, Dos Santos Petry F, Trindade VMT, et al. Methylphenidate disrupts cytoskeletal homeostasis and reduces membrane-associated lipid content in juvenile rat hippocampus. Metab Brain Dis. 2018;33:693–704. https://doi.org/10.1007/s11011-017-0177-z.
Acknowledgements
We would like to express sincere gratitude to all the participants and their families. We would like to thank Dr. Jiayi Cox, Prof. Lindsay A Farrer, Prof. Richard Sherva, and their team for generously providing us with their GWAS summary statistical files. And we thank the Beijing Key Laboratory for Big Data Innovative Application of Child and Adolescent Mental Disorders for the support and contribution to this study.
Funding
This work was supported by grants from the National Natural Science Foundation of China (81471381), the Major State Basic Research Development Program of China (973 Program, 2014CB846100), the National Natural Science Foundation of China (81671358), Beijing Municipal Science and Technology Commission (Z181100001518005), National Key R&D Program of China (2016YFC1306103), and the National Natural Science Foundation of China (31871259 to Suhua Chang).
Author information
Authors and Affiliations
Contributions
YL Zhao performed the main analyses and wrote the manuscript. Z Fu was responsible for data collection and cleaning. EJ. Barnett provided suggestions for data analysis and manuscript preparation. N Wang, KFX Zhang, XP Gao, XY Zheng, and JB Tian contributed to sample recruitment and data collection. H Zhang, XT Ding, SX Li, and SY Li assisted with data analysis. QJ Cao, Suhau Chang, and Yufeng Wang supervised the overall process. SV Faraone and Li Yang conceptualized the study and provided supervision.
Corresponding authors
Ethics declarations
Competing interests
In the past year, Dr. Faraone received income, potential income, travel expenses continuing education support and/or research support from Aardvark, Aardwolf, AIMH, Akili, Atentiv, Axsome, Genomind, Ironshore, Johnson & Johnson/Kenvue, Kanjo, KemPharm/Corium, Noven, Otsuka, Sky Therapeutics, Sandoz, Supernus, Tris, and Vallon. With his institution, he has US patent US20130217707 A1 for the use of sodium-hydrogen exchange inhibitors in the treatment of ADHD. He also receives royalties from books published by Guilford Press: Straight Talk about Your Child’s Mental Health, Oxford University Press: Schizophrenia: The Facts and Elsevier: ADHD: Non-Pharmacologic Interventions. He is Program Director of www.ADHDEvidence.org and www.ADHDinAdults.com. Dr. Faraone’s research is supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement 965381; NIH/NIMH grants U01AR076092, R01MH116037, 1R01NS128535, R01MH131685, 1R01MH130899, U01MH135970, and Supernus Pharmaceuticals. His continuing medical education programs are supported by The Upstate Foundation, Corium Pharmaceuticals, Tris Pharmaceuticals and Supernus Pharmaceuticals.
Ethics approval and consent to participate
The study was approved by the Institutional Review Board of the Peking University Health Science Center (2010–19; 2018–21). After complete description of the study to the subjects, written informed consent was obtained from parents of the ADHD individuals. The current research was registered in ClinicalTrials.gov (NCT01065259). All methods were performed in accordance with the relevant guidelines and regulations.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhao, Y., Fu, Z., Barnett, E.J. et al. Genome data based deep learning identified new genes predicting pharmacological treatment response of attention deficit hyperactivity disorder. Transl Psychiatry 15, 46 (2025). https://doi.org/10.1038/s41398-025-03250-5
Received:
Revised:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41398-025-03250-5





