Abstract
Autism spectrum disorders (ASD) are neurodevelopmental conditions that are for subsets of individuals, underpinned by dysregulated immune processes, including inflammation, autoimmunity, and dysbiosis. Consequently, the major histocompatibility complex (MHC)-hosted human leukocyte antigen (HLA) has been implicated in ASD risk, although seldom investigated. By utilizing a GWAS performed by the EU-AIMS consortium (LEAP cohort), we compared HLA and MHC genetic variants, single nucleotide polymorphisms (SNP), and haplotypes in ASD individuals, versus typically developing controls. We uncovered six SNPs, namely rs9268528, rs9268542, rs9268556, rs14004, rs9268557, and rs8084 that crossed the Bonferroni threshold, which form the underpinnings of 3 independent genetic pathways/blocks that differentially associate with ASD. Block 1 (rs9268528-G, rs9268542-G, rs9268556-C, and rs14004-A) afforded protection against ASD development, whilst the two remaining blocks, namely rs9268557-T, and rs8084-A, associated with heightened risk. rs8084 and rs14004 mapped to the HLA‐DRA gene, whilst the four other SNPs located in the BTNL2 locus. Different combinations amongst BTNL2 SNPs and HLA amino acid variants or classical alleles were found either to afford protection from or contribute to ASD risk, indicating a genetic interplay between BTNL2 and HLA. Interestingly, the detected variants had transcriptional and/or quantitative traits loci implications. As BTNL2 modulates gastrointestinal homeostasis and the identified HLA alleles regulate the gastrointestinal tract in celiac disease, it is proposed that the data on ASD risk may be linked to genetically regulated gut inflammatory processes. These findings might have implications for the prevention and treatment of ASD, via the targeting of gut-related processes.
Similar content being viewed by others
Introduction
Autism spectrum disorders (ASD) are complex and heterogeneous neurodevelopmental conditions characterized by problems in social interaction and verbal and non-verbal communication, repetitive behavior, and stereotyped behaviors and interests [1]. Along with co-occurring psychiatric symptoms, patients with ASD also present with several medical comorbidities including seizures, sleep disorders, and gastrointestinal dysfunctions, particularly with alterations in bowel habits and abdominal pain evident in 20–90% of ASD patients [2]. Various genetic and environmental factors associated with ASD have an impact on brain development [3, 4] and/or on the homeostasis of immune responses [5]. Environmental risks not only include prenatal factors such as maternal obesity, valproate use, and infections [6], but also gut microbial commensals dysregulation [7, 8]. Growing evidence in recent decades has highlighted the role of alterations in immune function, including heightened inflammation, anti-brain protein antibodies, as well as changes in T-cell and natural killer (NK) cell function in individuals diagnosed with ASD [5, 9, 10].
Immune dysregulation in ASD is primarily comprised of (i) an apparent increased susceptibility to prenatal infectious, possibly leading to maternal immune activation with consequent deleterious neurodevelopmental processes [11,12,13]; (ii) a pro-inflammatory status observed long after disease onset and possibly resulting from non-resolved infectious events [14, 15]; (iii) phenotype and functional alterations in immune cell subsets [16]; (iv) inflammation-mediated gut alterations [17,18,19]; and (v) increased ‘autoimmune’ disorders in ASD patients and their mothers, sometimes in association with raised circulating anti-brain protein autoantibodies [15, 20].
As with many other common immune-related disorders, these immune tags are believed to stem from complex interactions between inter-individual genetic backgrounds and environmental insults. Exploring the basis of their immunogenetic control is of interest not only for the understanding of the underlying immune processes but also for the discovery of predictive, diagnostic, and prognostic biomarkers that may help in the identification of homogeneous sub-groups of ASD patients, and thereby to preventative and symptom-targeted therapies.
One of the most promising immune genetic candidates is the human leucocyte antigen (HLA) system, which is hosted by the major histocompatibility complex (MHC) on the short arm of chromosome 6 (6p21.3). HLA genes encode cell surface molecules pivotal for cellular and humoral adaptive immune responses. The HLA/MHC cluster recently joined the psychiatric at-risk genomic regions either at pan-genomic or at a case-control level. More precisely, (i) a meta-analysis of genome-wide association studies (GWAS) performed in different psychiatric disorders, including ASD, detected the MHC region as a strong signal of risk [21], (ii) a more recent and targeted post-GWAS HLA imputation allowed to identify a protective effect conferred by an HLA-DPB1 variant against autism and intellectual disability [22] and (iii) we reported both at risk and protective HLA haplotypes associated with ASD through HLA haplotype-based analysis [23,24,25]. In the latter context, the uncovered risk modulator haplotypes are highly implicated in inflammatory processes during chronic inflammation and ‘autoimmunity’, including gut dysbiosis [25,26,27,28]. Moreover, HLA molecules are pivotal for central nervous system (CNS) development and function, including core neuronal functions such as neuronal/synaptic plasticity, pruning, learning, memory, and behavior [29,30,31,32], as well as more direct modulation of neuron-neuron interactions and neuro-signaling [33].
Despite the functional importance of the HLA system in patterned immune responses, it remains understudied in psychiatric and neurodevelopmental disorders, including ASD. This seems partly due to the high levels of polymorphisms that characterize the HLA loci, coupled with the complexity of the MHC-hosting HLA complex, which is one of the most gene-dense regions of the human genome [34]. The HLA gene cluster is physically divided into three functionally distinct sub-regions namely: (i) the HLA-class I region includes the classical HLA-A, HLA-B, and HLA-C genes as well as the non-classical HLA-E, HLA-F and HLA-G loci. These HLA genes differentially regulate antigen presentation to CD8+ T lymphocytes towards adaptive cellular immune responses, and immunomodulatory properties important for immune surveillance and inflammation homeostasis; (ii) the HLA class II region that is mainly comprised of HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQB1, HLA-DRA, HLA-DRB1, HLA-DRB3, HLA-DRB4 and HLA-DRB5 genes, which are involved in the presentation of antigens to CD4+ T cells and the initiation of adaptive humoral immune responses; and (iii) the class III region which contain gene loci involved in inflammatory responses, leukocyte maturation and in the complement cascade (for review see Tamouza et al., 2021 [25]).
HLA-class I and class II genes are also referred to as histocompatibility loci. These encode cell surface expressed molecules crucial to self or foreign antigen presentation to T-cell receptors (TCR) on effector cells. HLA-class I and class II genes also underpin the downstream modulation of adaptive immune responses, with their dysfunctions relevant to anti-infectious responses, inflammatory processes, ‘autoimmunity’ and/or intestinal dysbiosis [35]. The complexity of HLA loci is indicated by the presence of more than 35,000 alleles reported to date [36] and by their association with a range of diverse medical conditions, including inflammatory/’autoimmune’, cancers and, more recently, psychiatric disorders [37, 38]. HLA loci are therefore important in unraveling the complexity of pathophysiological processes across a range of diverse medical conditions. The high level of linkage disequilibrium present in the HLA/MHC region, together with the strong homogeneity of HLA histocompatibility loci, requires expensive and time-consuming high-resolution sequencing approaches as well as a high degree of expertize in the assignment of allelic polymorphisms and the association signals. Alternatively, imputation tools of the HLA diversity from GWAS chip data are now available and provide accurate information regarding single nucleotide polymorphisms (SNPs), alleles, genotypes, haplotypes, and amino-acids position of the implicated loci within the MHC region [39].
Accordingly, we took advantage of a GWAS performed by a large European consortium named “EU-AIMS” (A Multicentre Study for Developing New Medications)” to perform an in-depth analysis of the MHC/HLA genetic variants possibly influencing ASD risk and/or severity. We specifically investigated the distribution of HLA and MHC-imputed genetic variants corresponding to amino acid, SNP alleles, and haplotypes in ASD individuals, versus typically developing controls. Importantly, we narrowed the ASD phenotype by including in the analysis only patients of European descent with a strict diagnosis of ASD without intellectual disability and unrelated non-affected controls. We believed that applying such stringent selection would allow us to uncover variants otherwise difficult to be detected because of potential dilution effects.
Materials and methods
Longitudinal European Autism Project (LEAP): design, participant selection, and clinical characterization
The present study was performed under the framework of the “European Autism Interventions—A Multicentre Study for Developing New Medications (EU-AIMS)” project. Funded by the European community, EU-AIMS was designed to identify ASD markers to enhance earlier and more accurate diagnosis and prognosis as well as the development of new therapies. More specifically the EU-AIMS Longitudinal European Autism Project (LEAP) aims to identify risk factors that contribute to differences in brain development, difficulties in social behavior, and other core ASD symptoms. Consequently, an accelerated longitudinal design was used. Four sub-cohorts of ASD individuals and non-autistic controls, defined by age and intellectual quotient (IQ), were recruited concurrently i.e. (i) adults aged 18–30 years; (ii) adolescents aged 12–17 years; (iii) children aged 6–11 years with IQ in the typical range (75+); and (iv) adolescents and adults aged 12–30 years with mild intellectual disability (ID) (IQ 50–74). This approach allowed the investigation of the age range of interest in a shorter time period as compared to a single-cohort approach. The protocol and standard operation procedures (SOPs) of the LEAP study are accessible on https://www.eu-aims.eu/fileadmin/websites/eu-aims/ media/EU-AIMS_LEAP/EU-AIMS-LEAP_SOP_Study- Protocol.zip. The study was approved by national and local ethics review boards at each study site and is carried out under Good Clinical Practice (ICH GCP) standards.
Patients enrollment criteria were as follows: (i) a clinical diagnosis of ASD based on DSM-IV/ICD-10 [40, 41] or DSM-5 criteria [1], (ii) age between 6 and 30 years, (iii) confirmation of the ASD diagnosis using the Autism Diagnostic Interview-Revised (ADI-R) [42] and the Autism Diagnostic Observation Schedule 2 (ADOS-2) [43], (iv) eligibility of participants with psychiatric comorbidities with exception for psychosis or bipolar disorder, (v) eligibility of participants under stable medication. Assessment of symptom severity across core domains of ASD was conducted, with emphasis on social communication, repetitive and restricted behaviors, and sensory processing anomalies. Exclusion criteria consisted of: (i) significant hearing or visual impairments not corrected by glasses or hearing aids, (ii) history of alcohol and/or substance abuse or dependence within the year preceding the date of inclusion, (iii) presence of any MRI contraindications, and (iv) individuals with low IQ (<50) due to the difficulty of administering core measures and MRI scanning without sedation.
For the typically developing (TD) group, beside strict exclusion for any personal or familial history of psychiatric disorder, criteria used for the ASD group were applied. Additionally, TD individuals were required to have a T-score below 70 on the social responsiveness scale [44], a psychometric tool used to identify and quantify social impairment associated with ASD.
Inclusion and exclusion criteria for ASD patients as well as for the typically developing (TD) group were as described previously [45, 46].
Participants were recruited between January 2014 and March 2017 across six European specialized ASD centers namely the: (i) Institute of Psychiatry, Psychology and Neuroscience, King’s College London (IoPPN/KCL, UK), (ii) Autism Research Centre, University of Cambridge (UCAM, UK), (iii) University Medical Centre Utrecht (UMCU, Netherlands), (iv) Radboud University Medical Centre (RUMC, Netherlands), (v) Central Institute of Mental Health (CIMH, Germany) and (vi) the University Campus Bio-Medico (UCBM) in Rome, Italy. In addition, twins discordant with ASD were recruited at the Karolinska Institute in Sweden. A variety of recruitment sources were utilized, including existing volunteer databases, existing research cohorts, clinical referrals from local outpatient centers, special needs schools, mainstream schools, and local communities.
The study sample of the LEAP cohort is comprised of genetic data on 1566 individuals from 698 families, including 51 from twin families.
For the present investigation, we designed a case-control study including ASD patients (without intellectual disability) and controls belonging to the TD group. Eligible ASD cases were all participants who have an IQ greater than 75 without any exclusion for missing clinical data. Parents and relatives were excluded from the analysis even if classed as having ASD. Moreover, given the well-known restricted anthropogenic distribution of HLA diversity, only subjects of European descents were kept in the analysis.
Genotyping procedure and quality controls
Genetic data from the LEAP cohort participants and their families were obtained for 1566 individuals (coming from 698 families, including 51 twin families). Their genotyping was performed at the “Centre National de Recherche en Génomique Humaine” (CNRGH) using the Infinium OmniExpress-24v1 BeadChip (721114 markers) from Illumina.
We performed a genetic quality control on the whole genetic LEAP cohort and the following SNPs were removed: 41,889 with missingness rate >2%, 17,832 from sexual chromosomes, 61,701 with minor allele frequency <1% and 459 with departure from Hardy–Weinberg equilibrium (HWE) with p value < 10−6 for controls and p value < 10−10 for cases. For individual quality control of the 1566 ASD participants, 28 individuals were removed when data missingness rate per individual >2%, and 3 individuals removed with sex discrepancies, with no individuals removed for a heterozygosity rate issue. Finally, we obtained 599,233 SNPs for 1535 individuals.
Phenotype
Given the case-control design of our study, further exclusions were made from the 1535 individuals that passed the genetic quality control procedure, including all the parents remaining in the cohort (N = 818), even when diagnosed with ASD. In the remaining individuals (N = 717), we identified as cases, all LEAP individuals that passed the genetic quality control with an ASD diagnosis and with IQ > 75 (N = 328). Included controls were all LEAP individuals that passed the genetic quality control and classed according to the TD criteria (N = 282). This procedure of selection was used in order to narrow the well-known heterogeneity of ASD even if we have to mention that participants with IQs between 75 and 85 may exhibit some intellectual impairments. Moreover, we excluded 16 cases and 38 controls due to sibling relationships. Finally, 556 individuals, including 312 cases and 244 controls survived remained. We performed a principal component analysis (PCA) on genotype data of the 556 selected individuals, leading to 28 individuals being excluded due to non-European ethnic origins to avoid spurious associations. At the end, 298 ASD participants without ID and 230 unrelated controls, all of European descents were retained for further statistical analysis. The flow chart of the participants selection procedure is depicted in supplemental Fig. S1.
To evaluate the potential impact of the genetic diversity on ASD severity, we used the following psychometric tools from among those utilized in the LEAP clinical evaluation: (i) the Autism Diagnostic Interview (ADI) [42], with its 3 domains—social, restricted and repetitive behaviors (rrb) and communication difficulties; (ii) the Autism Diagnostic Observation Schedule (ADOS) [43] Calibrated Severity Scores (CSS) total, social affect and rrb; (iii) the SSP (Short Sensory Profile) [47] total score for quantifying sensory processing disorder in ASD patients; (iv) the Strength and Difficulties Questionnaire (SDQ) externalizing score [48]; (v) the Social Responsiveness Scale [44] T-score, as a quantitative phenotype of autistic traits; (vi) DAWBA scores [49] for depression, anxiety, Attention Deficit—Hyperactivity disorders (ADHD) and score for internalizing and externalizing behaviors; and (vii) the full-scale IQ.
Imputation of HLA variants and MHC haplotypes
HLA imputation was performed using the SNP2HLA software [39]. This bioinformatic tool allows researchers to impute from the chromosome 6 genotypic data of each patient, namely: (i) the HLA class I and class II-classical alleles at a 4-digit level; (ii) the amino acid variants from the imputed 4-digit HLA alleles; and (iii) the SNPs of the entire MHC region.
Specifically, we extracted 41,111 chromosome 6 variants from the chip genotype data and imputed a total of 8,961 HLA variants (424 classical HLA alleles and 1276 amino acid variations). We performed a second quality control on imputed markers, namely: the variants with minor allele frequency < 0.03 and/or p value for the Hardy–Weinberg departure test < 1 × 10−6 were removed (N = 1546). To ensure the best imputation quality, we applied a filter on the imputation score R2, and kept all variants with R2 > 0.8 with consequent removal of 12 additional variants. The resulting 7,403 variants, including 106 classical HLA alleles, 930 amino acid variations, and the 6367 SNPs were subjected to the statistical analysis pipeline.
As well as the single SNP effect, we also analyzed the potential impact of haplotypes involving the most significant SNPs. In addition, we looked for the possible effects of composite haplotypes involving the SNPs along the neighboring class II-classical HLA alleles genes, namely HLA-DQA1, HLA-DQB1, and HLA-DRB1. We also computed those having significant SNPs and suggestive HLA signals but failing to reach statistical significance. To this end, we used the package R HaploStats [50] to compute haplotypes. We obtained 508 haplotypes and as for the imputed variants, haplotypes with minor allele frequency < 0.03 were removed (N = 320) with PLINK [51].
Statistical analysis
ASD vulnerability
We looked for genetic associations as to the risk to develop ASD among the 7403 imputed HLA variants and the 188 computed haplotypes. We compared ASD and TD for all the variants using a logistic regression implemented in PLINK software [51] under the additive, dominant, and recessive models. To correct for possible population stratification, we used the top four principal components (PC) as covariates in the statistical analyses. We also added gender and recruitment centers as covariates in our model. We used the Bonferroni method to correct for multiple testing. The statistical threshold of significance was set at 6.75 × 10−6, which was derived by dividing 0.05 by 7403 markers tested. This conservative threshold holds particular significance in our investigation due to the prominent presence of linkage disequilibrium (LD) in the MHC region. Through LD calculations, encompassing all variants in the region, we found a limited number of independent SNPs, i.e., 1349 (with a pruning based on a r2 value below 0.5). When considering the analysis for three different models (3 × 1349 = 4047 tests), the threshold became 1.24 × 10−5. However, for the sake of simplicity, we only retained the lowest threshold of 6.75 × 10−6 for all tests.
ASD severity
To explore if variants or haplotypes implicated in ASD risk can influence ASD severity, we used the psychometric tools assessed in the LEAP cohort (see above) to perform a multivariate linear regression using PLINK software [51] for each studied clinical scale under additive and dominant models only for the cases. To avoid spurious associations, we included age, IQ, sex, recruitment centers, and the first 4 PC as covariates for each model with the exception of IQ stratification. In the latter case, we used only age, sex, recruitment centers, and the first 4 PC as covariates in the model.
Results
Initially, 1535 individuals (corresponding to 698 families) passed the genetic quality control procedures. After the selection of unrelated individuals from each family and stratification analysis (see Methods), the genotyping information from 528 unrelated individuals enrolled in the EU-AIMS/LEAP project (298 cases and 230 controls) were used for HLA imputation. Clinical and related information were available for 275 of the 298 cases. Socio-demographic characteristics are reported in Table 1 and showed a difference in the distribution according to gender, both in cases and in controls, where males were predominant (males: 71% for the ASD group and 63% for the controls). This distribution approached significance in comparison of cases and controls (p value = 0.053). Psychometric scores assessing clinical variables in ASD patients are reported in the supplementary Table S1.
SNPs and HLA association analyses
We compared the genetic distribution of MHC SNPs, HLA alleles, and amino acids in 298 individuals with ASD and without ID, versus 230 non-autistic controls in additive, dominant and recessive models. While no classical HLA allele passed the Bonferroni threshold, six SNPs i.e., rs9268528, rs9268542, rs9268556, rs14004, rs9268557, and rs8084 were significantly associated with ASD risk in the additive model (Fig. 1 and Table 2). Three of them, namely rs9268528, rs9268542, and rs9268556, also passed the Bonferroni threshold in the dominant model (Table 2). By analyzing the linkage disequilibrium (LD) among the 6 SNPs, we obtained the three following blocks of LD (Fig. 2): (1) rs9268528, rs9268542, rs9268556, rs14004; (2) rs9268557 and (3) rs8084. However, these 3 blocks were not completely independent, with comparisons among them showing an r2 greater than 0.4 and the D’ > 0.8 (Fig. 2). Block 1 (rs9268528-G, rs9268542-G, rs9268556-C, and rs14004-A) associated with a protective effect against ASD risk, whilst the two remaining two blocks, namely the rs9268557-T and the rs8084-A alleles, associated with enhanced ASD risk (Table 2). In the control group, the allelic frequencies of such ASD-associated SNPs were in agreement with that of the 1000 genomes European population diversity. Although the allelic frequencies of the 6 SNPs were not always the same across centers, the distribution between cases and controls was always in the same direction (supplementary Table S2). As to location, the rs8084 and rs14004 mapped to the HLA-DRA gene [respectively at the junction between the second intron and the third exon and in the 5’-untranslated region (UTR)], whilst the four others were intergenic variants, located in a region encompassing the BTNL2 (~10 kb) and HLA-DRA (~20 kb) loci (chr6: 32383108–32411035) (Fig. 1). Interestingly, beside these strongly associated variations, we also observed, in a recessive and a dominant manner respectively, two signals belonging to the HLA-DQB1 amino acids, namely a leucine at position 26 (DQB1-26-L) and a glycine at position 45 (DQB1-45–G) of the DQB1 heavy chain. While the two signals, although showing a strong association trend, failed initially to reach the established statistically significant threshold, we decided to investigate their possible implication in the modulation of ASD risk at haplotype level (see below).
The –log10(P value of association) were plotted according to the physical position of the SNPs. The genes of the region are annotated at the bottom of the figure. The dashed line corresponds to the Bonferroni threshold. The colors of the SNPs indicate the LD with the strongest SNP rs9268557 measured by r2 coefficient. The SNPs tagging the three blocks are indicated.
The analysis of the linkage disequilibrium (LD) among the 6 SNPs shows three blocks of LD namely: (1) rs9268528, rs9268542, rs9268556, rs14004; (2) rs9268557 and (3) rs8084 which are not completely independent with an r2 greater than 0.4 and a D’ > 0.8.
Haplotype analyses
Since the 3 SNPs tagging the 3 blocks (rs9268528-G, rs9268557-T, and rs8084-A) are close to each other (Fig. 1), we investigated for possible haplotype effects. Moreover, as located within the HLA class II region, we also searched for haplotype effects between them and the classical class II HLA alleles (i.e. belonging to the HLA-DQA1, DQB1, and DRB1 gene loci). Finally, we computed haplotypes involving the tag SNPs and the above-mentioned HLA-DQ beta-amino acids (DQB1-26-L and DQB1-45–G). We thus performed a logistic regression to compare the distribution of haplotypes in ASD patients and in controls with typical development, in the additive, dominant, and recessive models by means of the pipeline of analysis used for SNPs and HLA association analyses.
Interestingly, in the additive model, haplotypes consisting of DQB1-26-L or DQB1-45-G amino acids and rs9268528-A or rs9268557-T were strongly associated with ASD risk. Indeed, the comparison of their distribution between ASD cases and TD subjects yielded lower p values than the most significant SNP in the additive model (p values: 1.66 × 10−07 and 1.5 × 10−6, respectively). Consequently, the odds ratios were also increased (2.1 for the most significant haplotype vs 1.90 for the rs9268557-T alone). This effect is even stronger in the dominant model for the haplotypes bearing the DQB1-26-L (Table 3 and Fig. 3).
The bar graph displays the log odds ratios (OR) and their 95% confidence intervals (CI) for the main SNPs, haplotypes, and HLA (amino acid or classical alleles) based on the logistic regression analysis under both additive and dominant models. The vertical line represents the reference category. The horizontal bars represent the width of the 95% CI. The names of the variants are shown on the y-axis. The OR and p value are indicated on the right of the figure for each variant. As the p values of associations along odds ratio for the 3 tag SNPs haplotypes are comparable to that of the SNPs taken individually and since the 3 blocks of LD were not completely independent, we cannot conclude whether each variant by itself or their bearing haplotypes functionally drive ASD vulnerability.
Again, in the additive model, we observed that the two haplotypes, including the 3 tag SNPs (rs9268528, rs9268557, and rs8084) have two opposing effects: with (rs9268528-G, rs9268557-C, rs8084-C) affording protection against ASD, whilst (rs9268528-A, rs9268557-T, rs8084-A) associated with enhanced ASD risk (Table 3 and Fig. 3). As the p values of associations along odds ratio for the two haplotypes are comparable to that of the SNPs taken individually and since the 3 blocks of LD were not completely independent, we cannot conclude whether each variant by itself or their bearing haplotypes functionally drive ASD vulnerability.
Further analysis of correlations between combined SNPs and classical HLA class II alleles allowed the identification of the following haplotypes: (i) the HLA-DQA1*05 allele with rs9268528-G or rs8084-C or rs9268557-C, that confer a protective status against ASD; (ii) the HLA-DQB1*06 variant allele with rs9268557-T which associates with raised ASD risk; and (iii) the HLA-DQB1*03 allele either with the rs9268557-C or with the HLA-DQA1*05 variant both conferring a protective effect (Table 3).
In the dominant model, we observed a strong association between ASD risk and haplotypes bearing the amino acid DQB1-26-L variant, namely DQB1-26-L-rs9268557-T, DQB1-26-L-rs9268528-A, DQB1-26-L-rs8084-A, and DQB1-26-L- rs9268557-T-rs9268528-A-rs8084-A. The observed at-risk effect was found to be stronger than in the additive model, suggesting a dominant effect of these haplotypes. The p value and the odds ratio for the protective haplotype harboring both rs9268528-G and HLA-DQA1*05 were lower than in additive model, suggesting that the conferred protective status against ASD could be exerted in a dominant manner.
Finally, in the recessive model, the only haplotype yielding a p value comparable to that of the individual SNPs is the at-risk haplotype combining the HLA-DQB1-45-G variant with the rs9268528-A allele.
Overall, the observed haplotype-derived at-risk/protective effects strongly suggest a genetic interplay between the genetic diversity of MHC SNPs (BTNL2) and HLA class II alleles or amino acid variants.
Functional analysis of the associated SNPs
Functionally, it is notable that the rs8084 variation creates a splice acceptor site leading to a 25 amino acid deletion of the HLA-DR alpha chain, which affects its molecular conformation and cellular localization [52].
Further screening of the RegulomeDB database [53], which reports information concerning the potential transcriptomic properties of genomic SNPs, indicated that 3 SNPs (rs9268528, rs14004, and rs8084) had a good RegulomeDB rank of 1 f (Supplementary Table S3). This makes their location within regulatory elements plausible and therefore their involvement in gene transcription, given that the probability to have a RegulomeDB score better than 1 f is less than 5% (data not shown). Thus, from the haplotype blocks only rs9268528 and rs14004 (block 1) and rs8084 (block 3) reached this 1 f score (Supplementary Table 2).
We also used the GTex portal [54] to uncover possible quantitative trait loci (eQTL) linked to rs9268528, rs9268557, and rs8084 SNPs that tag the 3 identified blocks. We focused on three tissues known to be involved in ASD pathophysiology namely: brain, small intestine, and whole blood (PBMCs). Figure 4 depicts the most expressed genes and their correlations to these 3 SNPs in the brain, small intestine, and PBMCs. P values were more significant in whole blood, versus brain and small intestine. For all tissues and SNPs, we observed an overexpression of HLA-DQA2. Interestingly, the protective rs9268528-G allele associated with under-expression of the HLA-DQB1 gene in the whole blood and brain (Fig. 4). Similarly, the HLA-DQA1 gene is also under-expressed in the whole blood in correlation with the rs9268528-G allele (p = 10−12, beta = −0.15, not shown in Fig. 4). Conversely, BTNL2, well known for its involvement in inflammatory gastrointestinal disorders, was overexpressed in the small intestine in correlation with the protective allele rs9268528-G. Overall, these observations may reflect the above-mentioned genetic interplay between genetically determined low expression of the HLA DQ genes and high expression of BTNL2 gene in conferring protection against ASD risk. Moreover, HLA-DRB6 overexpression correlated with the rs8084-C variant in the whole blood and brain, whilst HLA-DBR5 was under-expressed when combined with the rs9268557-C variant in the same tissues (Fig. 4). These observations reinforce the hypothesis that the observed tag SNPs may be having regulatory effects on HLA class II gene expression.
Graphical representation of the possible GTex-derived quantitative trait loci (eQTL) linked to the SNPs rs8084 (A), rs9268528 (B), and rs9268557 (C), that tag the identified LD blocks in the brain, small intestine, and PBMCs. P values are more significant in whole blood, versus brain and small intestine. For all tissues and SNPs, we observed an overexpression of HLA-DQA2. The protective rs9268528-G allele is associated with the under-expression of the DQB1 gene in the whole blood and brain. Conversely, BTNL2, well known for its involvement in inflammatory gastrointestinal disorders, is overexpressed in the small intestine in correlation with the protective allele rs9268528-G. HLA-DRB6 overexpression correlated with the rs8084-C variant in the whole blood and brain, whilst HLA-DBR5 was under-expressed when combined with the rs9268557-C variant in the same tissues.
Regarding the DQB1 amino acids, it is notable that the leucine at position 26 is in the peptide binding groove of the HLA DQ heavy chain and is part of the pocket 4 known to be pivotal for antigen presentation and repeatedly implicated as a risk factor for the development of ‘autoimmune’ disorders, including those targeting the gastrointestinal tract [55], (Fig. 5). The DQB1-26L amino acid is shared by alleles belonging to the HLA-DQB1*02, HLA-DQB1*03, and HLA-DQB1*06 specificities, whilst the DQB1-45G amino acid tag alleles of the HLA-DQB1*02, HLA-DQB1*03, HLA-DQB1*04, HLA-DQB1*05, and HLA-DQB1*06 antigens.
The leucine at position 26 (DQB1-26L) which is located the pocket 4 is indicated by a red arrow. The DQB1-26L amino acid is shared by alleles belonging to the HLA-DQB1*02, HLA-DQB1* 03, and HLA-DQB1*06 specificities, whilst the DQB1-45G amino acid tag alleles of the HLA-DQB1*02, HLA-DQB1* 03, HLA-DQB1*04, HLA-DQB1* 05 and HLA-DQB1* 06 antigens.
SNP and haplotype influence on clinical characteristics in ASD patients
We then tested the potential influence of ASD-associated haplotypes and SNPs on ASD clinical presentation or severity for patients with available clinical annotations (N = 275). We, therefore, performed linear regression involving different psychometric tools assessed in the LEAP cohort, these scores were used to evaluate ASD characteristics and cognitive performance. Only trends toward association were observed between the genetic variants and the studied clinical scales (data not shown), highlighting the difficulties in comparing genetic traits with clinical symptoms that fluctuate over the course of a disorder.
Discussion
Mutational events affecting synaptic and neuronal processes only explain a relatively small proportion of ASD cases [3, 4]. While genomic linkage scan and cytogenetic analysis allowed the first descriptions of at-risk gene alterations scattered across the human genome [56, 57], further GWAS expand such genetic diversity by identifying causal copy number variations (CNVs) including large recurrent deletions or duplications disrupting either single gene or a chromosomal region containing multiple genes [57]. Although such mutational events affect important candidate genes, they remain relatively rare explaining around 1% of ASD cases [3, 4]. Besides, common variants were also reported, especially in a recent very large GWAS that identify 88 variants significatively associated with ASD [58]. In this context, loci or genetic variants regulating immune processes have emerged as a cutting-edge area of research in ASD [25]. Various aspects of immune dysfunction underpin ASD pathophysiology, including overexpression of pro-inflammatory molecules, ‘autoimmune’ processes, and gastrointestinal dysbiosis [59]. Such immune dysregulation implicates the human leukocyte antigen (HLA) system, which is an important aspect of both innate and adaptive immunity. However, partly as a consequence of its complexity, the HLA system has been under-investigated in ASD [23,24,25].
After imputation of the MHC/HLA variants from individuals included in a GWAS study, we analyzed their potential associations with ASD and identified six variants crossing the Bonferroni threshold. Four variants (rs9268528, rs9268542, rs9268556, and rs9268557) are located in close proximity to the MHC class II gene-linked butyrophilin-like 2 (BTNL2) locus. BTNL2 belongs to the B7 superfamily, which are crucial regulators of T-cell activation and tolerance (for review see Nguyen et al., 2006; Arnett & Viney, 2014 [60, 61]). BTNL2 is highly expressed both in the lymphoid tissues and in the intestine. BTNL2 inhibits T-cell proliferation, thereby linking BTNL2 polymorphisms with inflammatory/’autoimmune’ diseases such as sarcoidosis or myositis [62,63,64]. BTNL2 expression in lymphoid organs indicates a role in initiating immune responses, whilst the abundant expression of BTNL2 mRNA in the intestine indicates a pivotal role in the modulation of mucosal immunity and/or tolerance. Within the GI-tract, BTNL2 is a significant homeostatic regulator of tissue-resident gamma delta (γδ) intraepithelial lymphocytes (IELs). γδ IELs are a heterogeneous population of immune cells in the intestinal epithelium, which are crucial to the maintenance of gut homeostasis [65]. γδ IELs can strengthen tight junctions and orchestrate innate and adaptive immunity during homeostasis, inflammation, and infection, thereby having an important role in the regulation of intestinal epithelial barrier integrity [65]. BTNL2 acts to suppress γδ IELs proliferation, preferentially in the ileum, as demonstrated by transcriptomic characterization of murine site-specific BTNL2-KO γδ IELs, which showed BTNL2 to regulate the antimicrobial response of ileal γδ IELs [66]. Dextran sulfate sodium (DSS)-induced colitis in BTNL2-KO mice enhances inflammation and delays mucosal repair in the colon, highlighting the role of BTNL2 in regulating inflammatory signals. Since ileal γδ IEL motility along the villi-crypt axis is strictly dependent on the presence of microbiota [66], gut dysbiosis and an altered antibacterial profile may contribute to improper localization and/or ineffective surveillance by ileal BTNL2-KO γδ IELs. Suppressed BTNL2 levels may therefore contribute to the initiation and/or maintenance of inflammation in the ileum, with consequences for ASD pathophysiology and behaviors. This is supported by data from the GTex database showing BTNL2 to be overexpressed in the small intestine in correlation with the rs9268528-G SNP allele. This is likely to be the underpinnings of the protection against ASD afforded by this SNP and by the protective block 1 haplotype that carries it.
The gut-brain axis is intimately associated with many neuro-psychiatric and neurodevelopmental disorders, including ASD [67, 68]. This has relevance to the digestive symptoms commonly observed in a substantial subset of autistic people, which will often be concurrent to alterations in the gut microbiome in ASD [7], and supported by a recent meta-analysis [69]. However, in a study of pre-pubertal Chinese patients with ASD, gut microbiota alterations seem independent of comorbid functional gastrointestinal disorders, perhaps indicative of distinct genetic control of the observed dysbiosis in ASD patients [70].
The second top signal also comes from the HLA-class II region and consists of two SNPs (rs8084 and rs14004) both belonging to the HLA-DRA gene, which encodes the non-variable HLA-DR alpha subunit that can associate with any hypervariable HLA-DR beta chain to form mature HLA-DRB molecules. rs8084 creates a cryptic splice site in the DRA mRNA and is associated with immune conditions including psoriasis [71] and allergy to penicillin [72], whilst the 5-UTR rs14004 modulates both HLA-DR and DQ gene expression and may increase type 1 diabetes mellitus risk [73]. These two variations exhibit a good RegulomeDB rank, substantiating their involvement in transcriptomic processes. Accordingly, the rs8084-C SNP allele belongs to the protective rs9268528-G, rs9268557-C, rs8084-C haplotype while its rs8084-A counterpart allele is a part of the rs9268528-A, rs9268557-T, rs8084-A as a risk haplotype. Given the monomorphic status of the HLA-DRA molecule, it is likely that the uncovered association may reflect either the functional relevance of the two SNPs per se or possible linkage disequilibrium with other loci in the HLA class II region (Fig. 1).
The strongest signals in the current investigation are derived from the haplotype combinations between the DQB1-26-L and DQB1-45-G amino acids and the two SNPs tagging the BTNL2 region, namely rs9268528 and rs9268557. GTex portal screening showed DQA1 and DQB1 genes to be overexpressed in whole blood in correlation with the at-risk allele, rs9268528-A. This likely indicates an interaction between BTNL2 and HLA-DQ loci that increases ASD risk, whereby lower BTNL2 expression, coupled with HLA-DQA1 and HLA-DQB1 overexpression will exacerbate inflammation, perhaps especially in the GI-tract given the role of BTNL2 in GI-tract homeostasis. It is proposed that dysregulated intestinal inflammation will underpin that heightened ASD vulnerability in carriers of these haplotypes. Similarly, haplotypes carriers of the at-risk allele rs9268528-A and one of the two above-mentioned amino acids (leucine-26 and glycine-45) will also have an increased ASD risk. These two amino acids tag specifically the following HLA alleles: HLA-DQB1*02 (DQ2), HLA-DQB1*03:02 (DQ8), HLA-DQB1*04, HLA-DQB1*05 and HLA-DQB1*06, which have mostly been associated with ‘autoimmune’ and inflammatory disorders, including celiac disease, allergy to hydrolyzed wheat proteins, autoimmune pancreatitis, inflammatory bowel disease, and narcolepsy [55, 74,75,76,77,78]. This may be of particular relevance in celiac disease, where the HLA-DQB1*02 and HLA-DQB1*03:02 alleles are pivotal [74].
In line with the role of GI-tract inflammation in ASD risk, the dominant model also showed a protective status conferred by the composite haplotype HLA-DQA1*05 allele rs9268528-G. Such observations reinforces the parallel with celiac disease [74], where the most associated HLA allele is the HLA-DQ2 antigenic specificity, which consists of a molecular heterodimer encoded by the HLA-DQA1*05 and the HLA-DQB1*02 alleles. HLA-DQA1*05 has been associated with GI-tract inflammation for many years, including raised symptom severity in ulcerative colitis (UC) and Crohn’s disease at diagnosis, in association with a more rapid onset, and an earlier need for immunosuppressive or biological treatment [79]. The HLA-DQA1*05 allele is also associated with anti-drug antibody production against biotherapies in Crohn’s disease patients, further contributing to its pro-inflammatory properties [80]. Consequently, due to its involvement in GI-tract inflammatory and allergic processes, both of which are aspects of ASD pathophysiology [5, 81], the HLA-DQA1*0501 allele may be of particular relevance to ASD pathoetiology. The haplotypic analysis also highlight the possible involvement of other HLA alleles, namely HLA-DQB1*03, and HLA-DQB1*06, which have all previously been associated with celiac disease risk.
Overall, this investigation highlights an interplay between two distinct but intertwined genetic elements, namely BTNL2 and the HLA class II, which may underpin ASD risk. As well as the modulation of T-cell-related properties, BTNL2 may also modulate HLA class II allele expression, with consequences for immune/’autoimmune’ disorders. The present findings agree with and extend previous observations concerning the genetic association between ASD and variants related to GI tract homeostasis. In consecutive studies, we showed that: (i) the CLEC7A gene, which encodes the fungal sensor, Dectin-1, also modulate the risk of developing severe ulcerative colitis [82], and acts as a genetic disease specifier of cognition in high-functioning ASD [83]; (ii) an HLA haplotype previously associated with pediatric celiac disease, HLA-DRB1*11-HLA-DQB1*07, increase ASD risk, associating with higher social and non-verbal capacity [23]; and (iii) an HLA-class II sub-haplotype, which modulates regressive ASD risk, an autistic endophenotype characterized by intestinal anatomical alterations [24]. Notably, the MET receptor tyrosine kinase gene that encode molecules involved in gastrointestinal repair, immune function, neocortical, and cerebellar growth [84] was also repeatedly associated with ASD risk by independent studies [85]. The gastrointestinal component of ASD clearly has an immunogenetic aspect, relevant to both innate and adaptive immune processes. Gut microbiota diversity seems importantly determined by HLA diversity, as indicated by data on a large general population sample by Andeweg and colleagues showing HLA haplotypes to associate with gut microbiome composition [28]. Notably, the authors did not exclude the involvement of the non-conventional MHC class I-related gene A (MICA) and MICB loci, which are the main modulators of resident γδ IELs. γδ T cells represent a small population of T cells, being distinct from α/β TCR. γδ IELs are a subset of γδ T cells around the gut being concentrated in the intestinal mucosa and appear to have a prominent role in recognizing small bacterial phosphoantigens and undefined antigens presented by MICA/MICB proteins. γδ T cells have potent cytotoxic activity, being an important link between innate and adaptive immunity. γδ T cells may be especially important in the first 6 months postnatally, when the adaptive immune system is still developing, with this thought to be a crucial time in the pathoetiology of ASD. The role of early developmental processes is also highlighted by the association of the HLA-DQ2 genotype with the early developmental pathoetiology of celiac disease, with the HLA-DQ2 genotype associating with gut microbiota composition [86]. Finally, a pathophysiological link between the γδ IELs and the HLA-DQA1*0501 allele may be evident in a study analyzing the cytotoxic properties of PBMCs, which showed specific recognition of cells carrying the DQA1*0501 by lymphocytes bearing γδ TCR [87].
The data above indicate that genetically deficient BTNL2, coupled with high expression of associated HLA alleles can create favorable conditions for the initiation, maintenance, or exacerbation of inflammation in the ileum with downstream inflammatory processes possibly impacting ASD risk and/or symptom severity. Diminished expression of BTNL2 may dysregulate ileal γδ IELs, with consequent intestinal inflammation, whilst overexpression of the celiac disease-associated HLA-DQA1*05 or other pro-inflammatory HLA alleles will further contribute to inflammation. This study is the first to identify a genetic link between the GI tract and ASD.
It is important to note that the herein uncovered genetic variations were not previously identified in other Genome-Wide Association Studies, including a recent and extensive one conducted by Grove et al. [58]. This discrepancy may be explained by the heterogeneity of ASD, as pointed out by Warrier et al. [88, 89], which can arise from variations in diagnostic criteria and associated features such as IQ, adaptive behavior, and motor coordination. Our study focused specifically on ASD without intellectual disability, whereas Grove et al. [58] included patients with various types of ASD including Pervasive Developmental Disorder (PDD) and PDD-Not Otherwise Specified (PDD-NOS), atypical autism with or without ID, as well as some controls with ID. In our study, we specifically focused on ASD without intellectual disability, whereas Grove et al. [50] included patients with various types of ASD, such as Pervasive Developmental Disorder (PDD), PDD-Not Otherwise Specified (PDD-NOS), atypical autism with or without intellectual disability, as well as controls with intellectual disability. Furthermore, we rigorously selected participants of European descent from various countries, while Grove et al. [58] included a mix of populations from Denmark, North America, Europe, and Canada, which could affect the results of HLA analysis given the well-known anthropogenetic distribution of HLA alleles. Additionally, our study was performed under a classical case-control design, while Grove et al. conducted a meta-analysis of results from a case-control study (iPSYCH) and PGC family-based results obtained using pseudo-controls. Additionally, our study was performed under a classical case-control design, whereas Grove et al. conducted a meta-analysis of results from a case-control study (iPSYCH) and family-based results obtained using pseudo-controls from the Psychiatric Genomics Consortium (PGC). These differences in study design may contribute to the apparent inconsistency in the detection of genetic variations in the MHC/HLA region, which has been previously linked to ASD by the Cross-Disorder Group of the Psychiatric Genomics Consortium (2013) [21]. Such possibility emphasizes the critical importance of carefully considering the study design as well as patient population selection when investigating the genetic underpinnings of complex disorders like ASD, particularly within complex clusters such as the MHC/HLA. Another important aspect is the herein exclusive focus on the major histocompatibility complex/human leukocyte antigen (MHC/HLA) region, using a similar approach as that of the Nudel study [22]. In contrast to Nudel study [22], our investigation encompassed not only the classical HLA variants but also individual single nucleotide polymorphisms (SNPs) and amino acid variants within the MHC region, which may explain the new results we observed. The strongest signal rs9268757 comes with an OR reaching 1.9, which is quite high for a GWAS. We acknowledge that the population studied is relatively small (a few hundred patients), and it is unlikely that such a high OR will be obtained in a replication study. Nevertheless, in spite of the variability of the MHC allele distribution between populations, we have been able to check that the distribution of the main SNP alleles identified was similar and in the same direction in the four main European centers included in the study (with at least 20 cases and controls per center, see supp Table 2). Thus, these significant signals are good candidates, and they already provide an interesting biological interpretation for the molecular etiology of autism without ID, which is indeed the primary goal of a GWAS. Given the preliminary nature of our work with a globally relatively small sample size, further confirmation and replication of the findings are warranted before translation to clinical applications.
Some limitations deserve to be mentioned: (i) the non-availability of data concerning either history/stigma of infections or comorbid somatic symptoms, including those related to the gastrointestinal tract, (ii) the non-ability of creating additional study group with IQ greater than 85 as IQs between 75 and 85 may corresponds to patients with some intellectual impairments [90] and (iii) missing data pertaining to severity possibly explaining the absence of associations between the uncovered variations and the severity of ASD even if it is also possible that the genetic factors influencing susceptibility and those influencing the severity of autism may not necessarily coincide or overlap.
In summary, BTNL2, γδ IELs, and HLA class II variants and interactions may be an important mediator of how the gut-brain axis links to ASD pathoetiology and pathophysiology. Future studies will be required to replicate and extend the data above, including exploring how such susceptibility alleles may shape gut-targeted therapies.
Data availability
All the data generated in the present study are fully available.
References
Diagnostic and statistical manual of mental disorders. DSM-5. 5th edn. Washington, D.C: American Psychiatric Association; 2013.
Neuhaus E, Bernier RA, Tham SW, Webb SJ. Gastrointestinal and psychiatric symptoms among children and adolescents with autism spectrum disorder. Front Psychiatry 2018;9:515.
Anney R, Klei L, Pinto D, Regan R, Conroy J, Magalhaes TR, et al. A genome-wide scan for common alleles affecting risk for autism. Hum Mol Genet. 2010;19:4072–82.
Havdahl A, Niarchou M, Starnawska A, Uddin M, van der Merwe C, Warrier V. Genetic contributions to autism spectrum disorder. Psychol Med. 2021;51:2260–73.
Meltzer A, Van de Water J. The role of the immune system in autism spectrum disorder. Neuropsychopharmacology 2016;42:284–98.
Modabbernia A, Velthorst E, Reichenberg A. Environmental risk factors for autism: an evidence-based review of systematic reviews and meta-analyses. Mol Autism. 2017;8:13.
Fattorusso A, Di Genova L, Dell’Isola G, Mencaroni E, Esposito S. Autism spectrum disorders and the gut microbiota. Nutrients 2019;11:521.
Madra M, Ringel R, Margolis KG. Gastrointestinal issues and autism spectrum disorder. Child Adolesc Psychiatr Clin North Am. 2020;29:501–13.
Hsiao EY. Immune dysregulation in autism spectrum disorder. Int Rev Neurobiol. 2013;113:269–302.
Zerbo O, Traglia M, Yoshida C, Heuer LS, Ashwood P, Delorenze GN, et al. Maternal mid-pregnancy C-reactive protein and risk of autism spectrum disorders: the early markers for autism study. Transl Psychiatry. 2016;6:e783–e783.
Atladóttir HÓ, Thorsen P, Schendel DE, Østergaard L, Lemcke S, Parner ET. Association of hospitalization for infection in childhood with diagnosis of autism spectrum disorders. Arch Pediatr Adolesc Med. 2010;164:470–7.
Lee BK, Magnusson C, Gardner RM, Blomström Å, Newschaffer CJ, Burstyn I, et al. Maternal hospitalization with infection during pregnancy and risk of autism spectrum disorders. Brain Behav Immun. 2015;44:100–5.
Shuid AN, Jayusman PA, Shuid N, Ismail J, Kamal Nor N, Mohamed IN. Association between viral infections and risk of autistic disorder: an overview. Int J Environ Res Public Health. 2021;18:2817.
Jyonouchi H, Geng L, Davidow AL. Cytokine profiles by peripheral blood monocytes are associated with changes in behavioral symptoms following immune insults in a subset of ASD subjects: an inflammatory subtype? J Neuroinflamm. 2014;11:187.
McLellan J, Kim DHJ, Bruce M, Ramirez-Celis A, Van de Water J. Maternal immune dysregulation and autism–understanding the role of cytokines, chemokines and autoantibodies. Front Psychiatry. 2022;13:834910.
Bennabi M, Tarantino N, Gaman A, Scheid I, Krishnamoorthy R, Debré P, et al. Persistence of dysfunctional natural killer cells in adults with high-functioning autism spectrum disorders: stigma/consequence of unresolved early infectious events? Mol Autism. 2019;10:22.
Finegold SM, Dowd SE, Gontcharova V, Liu C, Henley KE, Wolcott RD, et al. Pyrosequencing study of fecal microflora of autistic and control children. Anaerobe 2010;16:444–53.
Xu M, Xu X, Li J, Li F. Association between gut microbiota and autism spectrum disorder: a systematic review and meta-analysis. Front Psychiatry. 2019;10:473.
Hughes HK, Rose D, Ashwood P. The gut microbiota and dysbiosis in autism spectrum disorders. Curr Neurol Neurosci Rep. 2018;18:81.
Wu S, Ding Y, Wu F, Li R, Xie G, Hou J, et al. Family history of autoimmune diseases is associated with an increased risk of autism in children: a systematic review and meta-analysis. Neurosci Biobehav Rev. 2015;55:322–32.
Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet. 2013;381:1371–9.
Nudel R, Benros ME, Krebs MD, Allesøe RL, Lemvigh CK, Bybjerg-Grauholm J, et al. Immunity and mental illness: findings from a Danish population-based immunogenetic study of seven psychiatric and neurodevelopmental disorders. Eur J Hum Genet. 2019;27:1445–55.
Bennabi M, Gaman A, Delorme R, Boukouaci W, Manier C, Scheid I, et al. HLA-class II haplotypes and autism spectrum disorders. Sci Rep. 2018;8:7639.
Tamouza R, Fernell E, Eriksson MA, Anderlid B-M, Manier C, Mariaselvam CM, et al. HLA polymorphism in regressive and non-regressive autism: a preliminary study. Autism Res. 2020;13:182–6.
Tamouza R, Krishnamoorthy R, Leboyer M. Understanding the genetic contribution of the human leukocyte antigen system to common major psychiatric disorders in a world pandemic context. Brain Behav Immun. 2021;91:731–9.
Price P, Witt C, Allock R, Sayer D, Garlepp M, Kok CC, et al. The genetic basis for the association of the 8.1 ancestral haplotype (A1, B8, DR3) with multiple immunopathological diseases. Immunol Rev. 1999;167:257–74.
Hov JR, Zhong H, Qin B, Anmarkrud JA, Holm K, Franke A, et al. The influence of the autoimmunity-associated ancestral HLA haplotype AH8.1 on the human gut microbiota: a cross-sectional study. PLoS One. 2015;10:e0133804.
Andeweg SP, Keşmir C, Dutilh BE. Quantifying the impact of human leukocyte antigen on the human gut microbiota. MSphere. 2021;6:e0047621.
Boulanger LM. Immune proteins in brain development and synaptic plasticity. Neuron 2009;64:93–109.
Kipnis J. Multifaceted interactions between adaptive immunity and the central nervous system. Science. 2016;353:766–71.
Norris GT, Kipnis J. Immune cells and CNS physiology: microglia and beyond. J Exp Med. 2018;216:60–70.
Filiano AJ, Gadani SP, Kipnis J. Interactions of innate and adaptive immunity in brain development and function. Brain Res. 2015;1617:18–27.
Sterner KN, Weckle A, Chugani HT, Tarca AL, Sherwood CC, Hof PR, et al. Dynamic gene expression in the human cerebral cortex distinguishes children from adults. PLoS ONE. 2012;7:e37714.
Trowsdale J, Knight JC. Major histocompatibility complex genomics and human disease. Annu Rev Genomics Hum Genet. 2013;14:301–23.
Klein J, Sato A. The HLA system. N. Engl J Med. 2000;343:702–9.
IPD-IMGT/HLA. Database. https://www.ebi.ac.uk/ipd/imgt/hla/. Accessed 27 March 2023.
Dendrou CA, Petersen J, Rossjohn J, Fugger L. HLA variation and disease. Nat Rev Immunol. 2018;18:325–39.
Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature 2014;511:421–7.
Jia X, Han B, Onengut-Gumuscu S, Chen W-M, Concannon PJ, Rich SS, et al. Imputing amino acid polymorphisms in human leukocyte antigens. PLOS ONE. 2013;8:e64683.
Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, Text Revision (DSM-IV-TR). 2000: vol. 1. 4th edn. Arlington, VA: American Psychiatric Association.
Organization WH. ICD-10: international statistical classification of diseases and related health problems: tenth revision. 2004: Spanish version, 1st edition published by PAHO as Publicación Científica 544.
Rutter M, Le Couteur A, Lord C. ADI-R: Autism diagnostic interview-revised: manual. Los Angeles, CA: Western Psychological Services; 2008.
Lord C, Rutter M, DiLavore P, Risi S, Gotham K, Bishop S, et al. Autism diagnostic observation schedule–2nd edition (ADOS-2). 284. Los Angeles, CA: Western Psychological Corporation; 2012.
Constantino JN. Social Responsiveness Scale. In: Volkmar FR, ed. Encyclopedia of Autism Spectrum Disorders. 2013, New York, NY: Springer New York; p. 2919–29.
Loth E, Charman T, Mason L, Tillmann J, Jones EJH, Wooldridge C, et al. The EU-AIMS Longitudinal European Autism Project (LEAP): design and methodologies to identify and validate stratification biomarkers for autism spectrum disorders. Mol Autism. 2017;8:24.
Charman T, Loth E, Tillmann J, Crawley D, Wooldridge C, Goyard D, et al. The EU-AIMS Longitudinal European Autism Project (LEAP): clinical characterisation. Mol Autism. 2017;8:27.
McIntosh D, Miller L, Shyu V, Dunn W. Development and validation of the short sensory profile. Sens Profile Man. 1999;61:59–73.
Goodman R. The strengths and difficulties questionnaire: a research note. J Child Psychol Psychiatry. 1997;38:581–6.
Goodman R, Ford T, Richards H, Gatward R, Meltzer H. The development and well-being assessment: description and initial validation of an integrated assessment of child and adolescent psychopathology. J Child Psychol Psychiatry. 2000;41:645–55.
Sinnwell JP, Schaid DJ. Haplo Stats (version 1.7. 7) statistical methods for haplotypes when linkage phase is ambiguous. Mayo Clinic Division of Health Sciences Research Rochester, MN, USA, 2016.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Didonna A, Damotte V, Shams H, Matsunaga A, Caillier SJ, Dandekar R, et al. A splice acceptor variant in HLA ‐ DRA affects the conformation and cellular localization of the class II DR alpha‐chain. Immunology 2021;162:194–207.
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012;22:1790–7.
GTEx Portal. https://gtexportal.org/home/. Accessed 5 November 2020.
Noguchi E, Akiyama M, Yagami A, Hirota T, Okada Y, Kato Z, et al. HLA-DQ and RBFOX1 as susceptibility genes for an outbreak of hydrolyzed wheat allergy. J Allergy Clin Immunol. 2019;144:1354–63.
Bourgeron T. From the genetic architecture to synaptic plasticity in autism spectrum disorder. Nat Rev Neurosci. 2015;16:551–63.
Xia K, Guo H, Hu Z, Xun G, Zuo L, Peng Y, et al. Common genetic variants on 1p13.2 associate with risk of autism. Mol Psychiatry. 2014;19:1212–9.
Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019;51:431–44.
Bjørklund G, Saad K, Chirumbolo S, K. Kern J, A. Geier D, R. Geier M, et al. Immune dysfunction and neuroinflammation in autism spectrum disorder. Acta Neurobiol Exp. 2016;76:257–68.
Nguyen T, Liu XK, Zhang Y, Dong C. BTNL2, a butyrophilin-like molecule that functions to inhibit T cell activation. J Immunol. 2006;176:7354–60.
Arnett HA, Viney JL. Immune modulation by butyrophilins. Nat Rev Immunol. 2014;14:559–69.
Rybicki BA, Walewski JL, Maliarik MJ, Kian H, Iannuzzi MC. The BTNL2 gene and sarcoidosis susceptibility in african americans and whites. Am J Hum Genet. 2005;77:491–9.
Valentonyte R, Hampe J, Huse K, Rosenstiel P, Albrecht M, Stenzel A, et al. Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat Genet. 2005;37:357–64.
Price P, Santoso L, Mastaglia F, Garlepp M, Kok CC, Allcock R, et al. Two major histocompatibility complex haplotypes influence susceptibility to sporadic inclusion body myositis: critical evaluation of an association with HLA-DR3. Tissue Antigens. 2004;64:575–80.
Arnett HA, Escobar SS, Gonzalez-Suarez E, Budelsky AL, Steffen LA, Boiani N, et al. BTNL2, a butyrophilin/B7-like molecule, is a negative costimulatory molecule modulated in intestinal inflammation. J Immunol. 2007;178:1523–33.
Panea C, Zhang R, VanValkenburgh J, Ni M, Adler C, Wei Y, et al. Butyrophilin-like 2 regulates site-specific adaptations of intestinal γδ intraepithelial lymphocytes. Commun Biol 2021;4:913.
Cryan JF, O’Riordan KJ, Sandhu K, Peterson V, Dinan TG. The gut microbiome in neurological disorders. Lancet Neurol. 2020;19:179–94.
Safadi JM, Quinton AMG, Lennox BR, Burnet PWJ, Minichino A. Gut dysbiosis in severe mental illness and chronic fatigue: a novel trans-diagnostic construct? A systematic review and meta-analysis. Mol Psychiatry. 2021;27:141–53.
Bezawada N, Phang TH, Hold GL, Hansen R. Autism spectrum disorder and the gut microbiota in children: a systematic review. Ann Nutr Metab. 2020;76:16–29.
Wong OWH, Lam AMW, Or BPN, Mo FYM, Shea CKS, Lai KYC, et al. Disentangling the relationship of gut microbiota, functional gastrointestinal disorders and autism: a case–control study on prepubertal Chinese boys. Sci Rep. 2022;12:10659.
Lee YH, Bae S-C, Choi SJ, Ji JD, Song GG. Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis. Mol Biol Rep. 2012;39:10627–35.
Guéant J-L, Romano A, Cornejo-Garcia J-A, Oussalah A, Chery C, Blanca-López N, et al. HLA-DRA variants predict penicillin allergy in genome-wide fine-mapping genotyping. J Allergy Clin Immunol. 2015;135:253.e10.
Piekos SN, Gaddam S, Bhardwaj P, Radhakrishnan P, Guha RV, Oro AE. Biomedical data commons (BMDC) prioritizes B-lymphocyte non-coding genetic variants in type 1 diabetes. PLOS Comput Biol. 2021;17:e1009382.
Espino L, Núñez C. The HLA complex and coeliac disease. Int Rev Cell Mol Biol. 2021;358:47–83.
Goni E, Regel I, Mahajan UM, Amodio A, De Marchi G, Beyer G, et al. HLA-DRB1∗16 and -DQB1∗05 alleles are strongly associated with autoimmune pancreatitis in a cohort of hundred patients. Pancreatology 2022;22:466–71.
Trachtenberg EA, Yang H, Hayes E, Vinson M, Lin C, Targan SR, et al. HLA class II haplotype associations with inflammatory bowel disease in Jewish (Ashkenazi) and non-Jewish Caucasian populations. Hum Immunol. 2000;61:326–33.
Kawa S, Ota M, Yoshizawa K, Horiuchi A, Hamano H, Ochi Y, et al. HLA DRB1*0405-DQB1*0401 haplotype is associated with autoimmune pancreatitis in the Japanese population. Gastroenterology 2002;122:1264–9.
Mahlios J, De la Herrán-Arita AK, Mignot E. The autoimmune basis of narcolepsy. Curr Opin Neurobiol. 2013;23:767–73.
Nowak JK, Glapa-Nowak A, Banaszkiewicz A, Iwańczak B, Kwiecień J, Szaflarska-Popławska A. et al. HLA-DQA1*05 associates with extensive ulcerative colitis at diagnosis: an observational study in children. Genes. 2021;12:1934
Sazonovs A, Ahmad T, Anderson CA. Underpowered PANTS: a response to the conclusions of “extended analysis identifies drug-specific association of two distinct HLA class II haplotypes for development of immunogenicity to adalimumab and infliximab”. Gastroenterology. 2021;160:470–1.
Xu G, Snetselaar LG, Jing J, Liu B, Strathearn L, Bao W. Association of food allergy and other allergic conditions with autism spectrum disorder in children. JAMA Netw Open. 2018;1:e180279.
Iliev ID, Funari VA, Taylor KD, Nguyen Q, Reyes CN, Strom SP, et al. Interactions between commensal fungi and the C-type lectin receptor dectin-1 influence colitis. Science 2012;336:1314–7.
Bennabi M, Delorme R, Oliveira J, Fortier C, Lajnef M, Boukouaci W, et al. Dectin-1 polymorphism: a genetic disease specifier in autism spectrum disorders? PLoS One. 2015;10:e0137339.
Campbell DB, Sutcliffe JS, Ebert PJ, Militerni R, Bravaccio C, Trillo S, et al. A genetic variant that disrupts \emphMET transcription is associated with autism. Proc Natl Acad Sci. 2006;103:16834–9.
Eagleson KL, Xie Z, Levitt P. The pleiotropic MET receptor network: circuit development and the neural-medical interface of autism. Biol Psychiatry. 2017;81:424–33.
Olivares M, Neef A, Castillejo G, Palma GD, Varea V, Capilla A, et al. The HLA-DQ2 genotype selects for early intestinal microbiota composition in infants at high risk of developing coeliac disease. Gut 2014;64:406–17.
Bosnes V, Qvigstad E, Lundin KEA, Thorsby E. Recognition of a particular HLA-DQ heterodimer by a human γ/δ T cell clone. Eur J Immunol. 1990;20:1429–33.
Warrier V, Zhang X, Reed P, Havdahl A, Moore TM, Cliquet F, et al. Genetic correlates of phenotypic heterogeneity in autism. Nat Genet. 2022;54:1293–304.
Uffelmann E, Huang QQ, Munung NS, de Vries J, Okada Y, Martin AR, et al. Genome-wide association studies. Nat Rev Methods Prim. 2021;1:1–21.
Regan R, Willatt L. Mental retardation: definition, classification and etiology. In:Knight SJL, ed. Monographs in Human Genetics. 18. S. Karger AG; 2010. pp. 16–30.
Acknowledgements
The work was supported by grants from the EU-AIMS (European Autism Interventions) and AIMS-2-TRIALS programs which receive support from Innovative Medicines Initiative Joint Undertaking Grant No. 115300 and 777394, the resources of which are composed of financial contributions from the European Union’s FP7 and Horizon2020 programs, and from the European Federation of Pharmaceutical Industries and Associations (EFPIA) companies’ in-kind contributions, and AUTISM SPEAKS, Autistica and SFARI; and by the Horizon2020 supported program CANDY Grant No. 847818). The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results. Any views expressed are those of the author(s) and not necessarily those of the funders (IHI-JU2). The work was further supported by the Fondation FondaMental and the LVMH group. Laura Lombardi benefits from a fellowship no. FDM202006011305 from Fondation de la Recherche Médicale (FRM). The data used for the analyses described in this manuscript were obtained from the GTEx Portal on 21/12/2022.
Author information
Authors and Affiliations
Contributions
Design and conceptualization of study: LL, SL, JFZ, ML, and RT. Design and conceptualization of analysis: LL, SL, JFZ, and RT. Data acquisition, recruitment, and logistics: CLW, JB, WB, SS, JRR, ML, MT, PLC, CB, TB, RH, SD, AMP, BO, EL, JB, and DM. Data processing and statistical analyses: LL, SL, and JFZ. Data interpretation: LL, SL, JFZ, ML, and RT. Drafting of the manuscript: LL, SL, JFZ, ML, and RT. All authors revised the manuscript critically for important intellectual context and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lombardi, L., Le Clerc, S., Wu, CL. et al. A human leukocyte antigen imputation study uncovers possible genetic interplay between gut inflammatory processes and autism spectrum disorders. Transl Psychiatry 13, 244 (2023). https://doi.org/10.1038/s41398-023-02550-y
Received:
Revised:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41398-023-02550-y







