Abstract
The pupillary light reflex (PLR), the automatic constriction of the pupil in response to increased luminance, is a candidate early intermediate phenotype associated with autism, with potential to help understand early neurodevelopmental differences because it is controlled by relatively simple neural circuitry. We conducted epigenome-wide association analyses of PLR onset latency and constriction amplitude at 9, 14, and 24 months, with 51 male infants enriched for familial autism likelihood (~ 80% with a first-degree autistic relative), using buccal DNA collected at 9 months. We identified four epigenome-wide differentially methylated probes (p < 2.4 × 10⁻⁷) significantly associated with PLR latency at 14 and 24 months, and 14- to 24-month developmental change in latency. Probes linked to PLR amplitude were identified at a discovery threshold (p < 5 × 10⁻⁵). Regional analyses revealed multiple differentially methylated regions associated with both latency and amplitude. Associated probes were enriched for neurodevelopmental processes and autism-associated genes, including NR4A2, HNRNPU, and NAV2. While the findings are most directly relevant to male infants in whom PLR variability may be associated with familial autism likelihood, they provide novel evidence that DNAm contributes to early variation in PLR. These insights into the biological underpinnings of this reflex support PLR as an early intermediate phenotype associated with autism.
Similar content being viewed by others
Introduction
Autism is a life-long, highly heterogeneous neurodevelopmental condition characterised by differences in social interactions and communication, alongside specialised, focused, or intense interests, repetitive behaviours and sensory differences1. Autism cannot be diagnosed reliably until two years, at the earliest, when behavioural characteristics are sufficiently developed for clinical observation2. Prospective investigation of early intermediate phenotypes, defined as pre-diagnosis measurable features associated with the later behavioural characteristics3, may help clarify early neurodevelopmental pathways underlying autistic traits4 and ultimately inform research into supportive strategies for autistic people.
The pupillary light reflex (PLR) is a candidate intermediate phenotype associated with autism5. The PLR is the reflexual constriction of the pupil in response to increased optical luminance6 and is primarily mediated by a relatively simple four-neuron visual sensorimotor circuitry7. This well-studied index of autonomic functioning is governed by parasympathetic and sympathetic pathways8 and its regulation involves neural signal transduction and cholinergic and norepinephrine neurotransmission8,9,10,11,12. Recent findings indicate that variability in infant PLR associates with both categorical autism diagnosis and dimensional variation in later autism traits, making it a useful measure for studying early differences in neurodevelopment. Cross-sectionally, 10-month-old infants with a family history of autism have a larger and faster PLR relative to infants with no family history13, with larger 10-month PLR constriction amplitude associating with later autism diagnosis14. Developmentally, from 6 to 24 months, PLR tends to become faster and larger15,16, but a decreasing pattern of pupil constriction amplitude over time in those who later receive an autism diagnosis has been demonstrated14. This finding was replicated in a partially overlapping sample in which a decrease in amplitude between 9 and 14 months was associated with autistic traits5. In the same sample, those with a later diagnosis demonstrated a significantly steeper decrease in latency from 14 to 24 months relative to those with other outcomes5. Additionally, a higher polygenic score for autism associated with a smaller decrease in latency in the first year5.
Increasing evidence suggests autism arises during early development from complex interactions between genetic and environmental factors17. Though the path through which these factors coalesce remains unclear, emerging evidence points to altered epigenetic signatures in autism18. Epigenetic modifications, such as DNA methylation (DNAm), are chemical markers that can influence gene expression without altering the underlying DNA sequence itself19. DNAm is a key biological mechanism involved in the timed up- or down-regulation of gene expression20,21. Recent epigenome-wide association studies (EWAS) using both post-mortem brain tissue and peripheral tissue (e.g. blood, buccal or saliva) from living participants have identified several differentially methylated sites and regions, as well as co-methylated modules, associated with both autism diagnosis22,23,24 and dimensional autism traits25. Autism-associated DNAm signatures have been identified in genes and pathways linked to critical functional processes, including neurodevelopment, immune response, synaptic functioning, and microscopic neural structures; highlighting their potential and relevance for providing insight into early neurodevelopmental variation related to autism22,23,24,26,27.
Notably, most prior studies have used samples taken post-diagnosis. Since autism is a neurodevelopmental condition, prospectively investigating DNAm before diagnosis could be fruitful in mapping early autism development. A recent EWAS28 identified differential peripheral tissue DNAm in 8-month-olds associated with 3-year autism diagnosis and dimensional traits. However, these associations were not statistically significant after stringent epigenome-wide multiple testing corrections. An alternative, potentially more statistically powerful, method of exploring the role of DNAm in early autism is to investigate the association of DNAm with early intermediate phenotypes4, which tend to be placed at an earlier stage in the developmental pathway3. Indeed, an EWAS with two social attention candidate 8-month intermediate phenotypes for autism (face looking and neural response to faces) was recently conducted, though, again, no probe-wise associations were significant after epigenome-wide multiple testing corrections28. Given that the pupillary light reflex (PLR) is a quantifiable sensorimotor reflex involving relatively simple neuro-circuitry, it is likely situated earlier in the developmental pathway than other candidate intermediate phenotypes. This offers promise for exploring how epigenetic variation relates to early neurodevelopmental differences.
In this study, we aimed to expand previous research indicating PLR as a candidate intermediate phenotype for autism5 by exploring the epigenetic variability associated with individual differences in early PLR development. We present findings from epigenome-wide DNAm association analysis of infant PLR onset latency and constriction amplitude, using buccal tissue taken from 51 male 9-month-old infants enriched for autism family history. A minority of infants (20%) without a family history of autism were included to broaden PLR variability and enable examination of epigenetic differences across the full range of this candidate intermediate phenotype. We applied a hypothesis-free epigenome-wide approach to explore the association of probe- and region-level DNAm with variability in cross-sectional (9-, 14-, and 24-month) and developmental (9- to 14-month and 14- to 24-month changes) PLR onset latency and constriction amplitude. We conducted downstream gene ontology enrichment analysis to uncover biological significance and performed functional exploratory analyses to investigate genetic and epigenetic associations with autism.
Results
DNAm analysis
Our study identified multiple novel significantly differentially methylated probes and regions associated with PLR latency and amplitude. A summary of significant results from the epigenome-wide association studies (EWAS) and differentially methylated region (DMR) analyses is presented in Table 1 (SM 4.1 includes Q-Q plots and EWAS Manhattan plots).
PLR latency
We identified four highly significant differentially methylated probes associated with three PLR latency phenotypes at a stringent p-value threshold (p < 2.4 × 10− 7, see Table 2). A broader list of probes associated with PLR latency phenotypes above the discovery p-value threshold (p < 5 × 10− 5) is provided in SM 4 Table 1.
In total, we reported 13 DMRs significantly associated with PLR latency; each latency phonotype had a single significantly associated DMR, except for 14- to 24-month PLR latency change, where nine significantly associated DMRs were found. The DMRs significantly associated with each PLR latency phenotype are listed in Table 3 and SM 4 Table 2.
PLR amplitude
In contrast to PLR latency, PLR amplitude did not show probe-level associations at epigenome-wide significance after stringent correction for multiple comparisons (p < 2.4 × 10− 7). Several differentially methylated PLR amplitude-associated probes reached the discovery p-value threshold (p < 5 × 10− 5; See full list in SM 4 Table 1). Region analyses revealed 17 DMRs significantly associated with PLR amplitude, with the topmost significantly associated DMR per phenotype listed in Table 4 (full list in SM 4 Table 2).
Downstream exploratory analysis
Gene ontology (GO) analysis was conducted for each phenotype using genes annotated with probes within the top 1000 EWAS p-values, and probes located within the significantly associated DMRs. Multiple significantly enriched GO terms were identified (Table 5). SM 5 Table 2 lists all significantly enriched GO terms.
For both PLR components, no probes above the discovery p-value threshold, nor probes located in any of the significantly associated DMRs, were listed in the MRC-IEU EWAS catalogue29. None of the stringently significant probes were annotated to genes listed in the SFARI database30. Nineteen probes significantly associated with PLR at the discovery p-value threshold were annotated to 18 distinct genes listed in the SFARI gene. Notably, two SFARI genes with ‘high confidence’ associations were annotated with probes identified to significantly associate with latency at 14 months (cg01123282; NR4A2) and latency at 24 months (cg02230180; HNRNPU). A full list of probes annotated to SFARI genes is summarised in SM 5 Table 3.
Three of the identified significantly associated DMRs contained probes annotated to three distinct SFARI genes, as summarised in Table 6.
Discussion
The infant pupillary light reflex (PLR) is increasingly recognised as a candidate early intermediate phenotype for autism, offering a potential window into the neurobiological pathways underlying autism. In this study, we aimed to expand previous work by exploring the epigenetic variability associated with the early development of the PLR in a sample of male infants enriched for familial autism likelihood. This sample included a small proportion (20%) of infants with no autism family history to introduce a modest degree of phenotypic heterogeneity. This was advantageous for capturing broader, but still relevant, variability in PLR to better assess associated DNA methylation (DNAm) variation in this candidate intermediate phenotype. Our hypothesis-free epigenome-wide scan identified four novel Bonferroni-significant (p < 2.4 × 10 − 7) differentially methylated probes and 13 differentially methylated regions (DMRs) associated with PLR latency phenotypes, particularly within a sensitive neurodevelopmental window between 14 and 24 months. These findings provide compelling evidence that differential epigenetic signatures contribute to the variability of early PLR onset latency.
In contrast, PLR amplitude did not show probe-level associations at epigenome-wide significance after Bonferroni corrections. This divergence between PLR components likely reflects their underlying biological differences. Specifically, while the PLR is driven by a core four-neuron reflex circuit7, increasing evidence suggests that PLR amplitude is modulated by additional top-down higher-order cognitive processes, including emotional regulation and executive control31,32,33,34. Supporting this, our downstream GO analysis indicated that probes associated with amplitude at discovery significance threshold (p < 5 × 10⁻⁵) were annotated to a broader and more complex biological network than latency-associated probes. While amplitude-related terms shared some overlap with those of latency, such as system development (GO:0048731) and homophilic cell adhesion via plasma membrane adhesion molecules (GO:0007156), amplitude-related terms also included categories related to cell signalling (e.g., cell-cell signalling, GO:0007267), metabolic processes (e.g., cellular glucuronidation, GO:0052695), and stress response (e.g., regulation of response to stimulus, GO:0048583). This more diffuse regulatory landscape may dilute the detectable effects of single-probe DNAm associations for amplitude. In contrast, if the PLR latency is controlled by a tightly focused, mostly bottom-up biological system, then DNAm changes in that system might have stronger and more specific effects. Because the effects would be concentrated in a small, well-defined pathway rather than spread across many brain systems, they’re more likely to be statistically significant even under the very strict Bonferroni correction used in our EWAS analysis.
Both PLR latency and DNAm have previously been linked to early autism characteristics, particularly sensory sensitivities5,13,28. To explore the connection of the identified epigenetic signatures of the PLR latency with autism, we undertook an in silico experiment to examine whether associated probes or regions were listed in the SFARI database30 or the EWAS catalogue29. While none of the Bonferonni-corrected probes or regions had prior links to autism, as reported in these two databases, several discovery-significant probes across the five latency EWASs were annotated to autism susceptibility genes. Among the most compelling genes were NR4A2 and HNRNPU, listed as ‘high confidence’ autism-associated genes in the SFARI database given the multiple separate reports identifying mutations in each of these genes in autistic people35,36,37,38,39. Specifically, probe cg01123282, located in intron 1 of NR4A2, was hypermethylated in association with slower latency at 14 months. Probe cg02230180, located in intron 1 of HNRNPU, was hypermethylated in association with faster PLR latency at 24 months. Both genes are critical for neuronal functioning and development; NR4A2 is implicated in dopaminergic neural cell differentiation40, and HNRNPU is linked to synaptic function41 and neurogenesis42. Further supporting the relevance of these findings, gene ontology (GO) enrichment analysis revealed that differentially methylated probes associated with all PLR latency phenotypes, except for cross-sectional 24-month, were significantly enriched for neurodevelopmental processes. These included neurogenesis (GO:0022008), nervous system development (GO:0007399), negative regulation of developmental processes (GO:0051093), and regulation of cellular component organisation (GO:0051128). These findings are notable given previous evidence pointing to the contributing role of DNAm associated with differences in neurodevelopmental processes linked to autism18. Collectively, the current findings support PLR latency as a candidate intermediate phenotype for autism, and highlight the association of differential DNAm (and potentially implicated neurodevelopmental processes) to the previously documented associations between PLR latency development and early-emerging autism characteristics5,13,14.
At a discovery threshold, differential DNAm of multiple probes significantly associated with PLR amplitude, as did several regions. PLR amplitude has previously been linked to autism5,13,14, and although no discovery significant probes nor probes within significantly associated regions were found to have direct associations with autism according to the EWAS catalogue29, 12 genes annotated with amplitude-associated probes were in the SFARI database30. Nine of these were categorised as ‘strong candidates’ for autism. These autism-associated genes reiterate the diversity of the biological pathways connected to PLR amplitude highlighted in the GO analysis, with gene functions ranging from immunity (CARD11)40, to metabolic pathways and circadian regulation (CSNK2B)40,43, to glycoprotein production (GALNT10)40, and calcium signalling pathways (CACNA1D)40. Several of the identified autism-associated genes linked to amplitude also play a role in neurodevelopment40. These findings add to previous work5,13,14 to provide further support for PLR as a candidate intermediate phenotype for autism, and suggest the involvement of differential DNAm across a relatively broad network of genes in infant PLR amplitude variability5,13,14.
Our findings must be interpreted in the light of certain limitations. The sample was relatively homogeneous; all participants were male, DNA methylation and phenotypic measures were collected within narrow developmental windows, and the sample was enriched for familial autism likelihood. This limits the generalisability to females, other developmental stages and individuals with different autism likelihoods (e.g., syndromic autism). Nonetheless, within this restricted sample, these findings may inform research on early neurodevelopment and the design/implementation of targeted supportive strategies, though broader application should be approached cautiously and requires further validation. Studying DNAm in a sample of children of the same sex and with DNA and phenotypic measures collected within a tight timeframe likely enhances power by reducing variability unrelated to PLR and associated DNA methylation, and improving signal-to-noise ratios28,44. Supporting this, although the sample size was relatively small, four epigenome-wide significantly associated probes were identified. Larger cohorts will be valuable for uncovering additional effects and further elucidating the interplay between epigenetics, PLR and later neurodevelopmental outcomes. Future studies could build on these findings by integrating parent-report and behavioural data to examine how PLR-related epigenetic variation relates to emerging developmental and behavioural profiles over time. While such data were available for the broader cohort, we did not include developmental or familial-likelihood measures as covariates in the current analysis because doing so would risk controlling for variability that is theoretically central to the PLR signal of interest. Our previous work in this sample has already characterised associations between PLR and later developmental outcomes5, but meaningfully incorporating epigenetic variation into those models would require substantially larger samples. Larger, well-powered cohorts will therefore be needed to test whether epigenetic variation mediates or moderates links between PLR and later autism-related development.
A second potential limitation is that quantifying DNAm in the PLR reflex neurocircuitry tissue, instead of buccal cells, may yield more specific results related to PLR. However, buccal cells, being more accessible, allow for a larger sample size from live individuals that are deeply phenotyped over time (like our current sample), and are more practical and accessible than post-mortem brain tissue. Further, empirical evidence shows that DNAm patterns in various brain regions are more similar to saliva samples (which contain many buccal cells) than blood samples45, and we accounted for variance attributable to differences in cell type proportions by including the proportion of common cell types in buccal samples as a covariate in our analysis.
Another limitation is that while the Illumina 450k array is a reliable tool for quantifying numerous variable DNAm sites, it only covers 2% of all DNAm sites across the genome46. Limited coverage may hinder conclusions as probes related to PLR or autism may be unmeasured. Additionally, although we aimed to explore the association of PLR with early altered DNAm (at 9 months), this window for observing the role of DNAm in regulating PLR is narrow. Investigating DNAm at other timepoints or longitudinal changes in DNAm across infancy may offer complementary insights. Finally, while our study provides an initial in silico assessment of autism relevance, the MRC-IEU EWAS Catalogue and SFARI Gene database are continuously expanding resources; therefore, the absence of overlap between our findings and these databases should be interpreted with caution. Replication using other available or emerging datasets could strengthen confidence in the observed associations and clarify their relevance to autism.
In conclusion, this study is, to our knowledge, the first to investigate the association of DNA methylation with the infant pupillary light reflex (PLR), an early-emerging candidate neurophysiological intermediate phenotype associated with autism. We identified four differentially methylated probes significantly associated with PLR latency phenotypes between 14 and 24 months of age, surpassing stringent epigenome-wide significance thresholds (p < 2.4 × 10⁻⁷). In addition, 13 differentially methylated regions (DMRs) were associated with PLR latency, particularly with developmental changes occurring over the 14- to 24-month period. While no probes reached epigenome-wide significance for PLR amplitude, several were associated at the discovery threshold (p < 5 × 10⁻⁵), along with multiple DMRs, some annotated to genes previously associated with autism. Gene ontology and pathway analyses implicated nervous system development and neurobiological processes in both latency and amplitude phenotypes, with results suggesting a more complex regulatory basis for PLR amplitude.
Together, our study findings not only shed light on the distinct biological underpinnings of early PLR latency and amplitude variability but also highlight the importance of considering both biological specificity and developmental timing when evaluating epigenetic contributions to early neurodevelopment. This work opens a new line of inquiry into how epigenetic signatures associate with neurophysiological processes in infancy - rather than higher-order cognitive processes - might forecast neurodevelopmental outcomes. By identifying methylation patterns linked to this reflex, we move closer to understanding individual differences in sensory-motor and autonomic regulation that have previously been associated with early autism, thereby contributing to a more nuanced understanding of neurodevelopmental diversity.
Further, support or services aimed at promoting well-being are often most effective when implemented during infancy, a period of high neurodevelopmental plasticity47. Insights from the current research may help guide the identification of individuals who could benefit most from early support, and/or inform the design of such support - particularly those addressing sensory, motor, or autonomic differences that some autistic individuals or their families report as highly impactful on daily life48,49,50. In this way, our findings contribute to a foundation for future research to explore biologically informed approaches that prioritise the well-being of neurodivergent individuals, as well as advance understanding of the biological diversity underlying early development.
Methodology
Sample
Participants in the current analysis were recruited into the British Autism Study of Infant Siblings (BASIS; www.basisnetwork.org; SM 1), a longitudinal study of the development of infants with familial autism likelihood due to having a first degree autistic relative. Full ethical approval was granted by NHS Research Ethics Committees (08/H0718/76 [BASIS] and 06/MRE02/73 [DNA collection, extraction and analysis]). All methods were performed in accordance with NHS research governance frameworks and relevant international ethical standards, including the Declaration of Helsinki. Informed written consent was provided by the parent(s).
The current sample consisted of a total 51 male infants who had DNA methylation data quantified at 9-months, and PLR data from at least one of the following time points: 9-months, 14-months or 24-months. The sample was enriched for infants with familial autism likelihood (N = 42); 9 infants had no familial autism likelihood, having no family history of autism. Including infants with and without a familial likelihood for autism introduced a modest degree of phenotypic heterogeneity, which was advantageous for capturing broader, but still relevant, variability in PLR to better assess associated epigenetic variation in this candidate intermediate phenotype.
We focused specifically on these age windows of PLR measurement because (1) previous investigations have reported an association of the PLR change across this period with a later autism diagnosis, and (2) this is the key early developmental timeframe in which behavioural autism characteristics gradually emerge51. As DNAm is dynamic over development18, we focused on DNAm at our first PLR timepoint (9 months) to investigate the developmental sequelae in PLR following earlier DNAm and to narrow the age range of DNAm as recommended by previous investigations52. We additionally included only male infants to minimise epigenetic heterogeneity related to sex.
Pupillary light reflex (PLR)
The PLR was induced while infants passively watched stimuli that transitioned from a black to white screen on a monitor53. Stimulus was presented to the cohort 32 times at 9 and 14 months and 16 times at 24 months (see SM 2 Fig. 1). Pupil diameter data was collected at 9-, 14- and 24-months using Tobii T120 eye tracker (sampling rate = 60 Hz; Tobii Technology AB, Danderyd, Sweden; millimetres to 2 decimal places). Pre-processing of raw pupil data (see SM 2.1.2) removed artefacts and extracted PLR latency (constriction onset time; ms) and amplitude (constriction magnitude relative to average pupil size before latency; %). Latency was defined as the minimum acceleration from 110-570ms after white slide onset. Amplitude was calculated as the maximum constriction within the interval 170–1450ms post-latency, relative to the average pupil diameter size at latency onset54. The parameter identification time windows were determined by a series of optimisation investigations (see SM 2.1.1). Three or more valid trials per time point were required for inclusion. The median latency and amplitude per individual per time point were calculated.
To account for the variance in the PLR parameters at each timepoint explained by the covariates of missingness (i.e., percentage of missing trials from the stimuli) and baseline pupil diameter (i.e., the average pupil diameter up to 100ms before PLR latency), we conducted linear multiple regression analysis with latency or amplitude as the dependent variables and missingness and baseline as the predictor variables (SM 2.2). Linear regressions were conducted within timepoint. Residuals were extracted and used in all subsequent analysis as the PLR latency and amplitude variables.
In total 10 PLR phenotypes were analysed in this study. This included 6 cross-sectional PLR measures (latency and amplitude at 9-, 14- and 24-month) and 4 PLR developmental change scores (9- to 14-month and 14- to 24-month for latency and amplitude). Change scores were calculated using subtraction of the older minus the younger measure. The inclusion of all 10 PLR phenotypes, encompassing both cross-sectional and developmental measures, is based on the current literature, which suggests that both latency and amplitude at various time points across infancy may be associated with autism either directly such as 10-month amplitude and latency13,55, or as part of an associated developmental trajectory5,13. The inclusion of these phenotypes enables a comprehensive exploration of the role of epigenetics in early PLR as a candidate intermediate phenotype for autism.
DNA methylation (DNAm)
Buccal samples were collected at 9 months from 51 infants and processed using a standard pipeline28. Genomic DNA (500ng) from each sample was extracted and treated with sodium bisulfite using the Zymo EZ DNA Methylation-Lightning Kit™ (Zymo Research, Irvine, CA, USA). The Illumina Infinium® HumanMethylation450 BeadChip kit (450 K array; Illumina, San Diego, CA, USA) was used to assess DNA methylation at 482,421 sites (∼2% o all CpG loci) throughout the genome. We then quantified DNAm using the HiScan System (Illumina, San Diego, CA, USA) and extracted signal intensities using Illumina GenomeStudio software (Illumina, San Diego, CA, USA) for each probe.
All epigenomic data processing and downstream analysis was conducted using R version 4.0.2. and R studio56,57 on the high-performance computing facilities at King’s College London (CREATE). Methylation signal intensity data was imported using methylumi R-package58. Data quality control and pre-processing was conducted using wateRmelon R-package59. The ‘pfilter’ function removed failed samples and probes (e.g., low bead counts (< 3) or detection p-values > 0.05 for more than 1% of probes). The ‘dasen’ function was then used to normalise the data using a standardised pipeline59 to address background correction, dye bias, probe adjustments, batch effects, and other non-biological variations related to external parameters (e.g., array order)60. Cross-reactive and polymorphic probes, as identified in Illumina annotation files and recent studies61,62, were excluded. The final dataset included 402,971 probes with methylation levels expressed as beta (β) values ranging from 0 (unmethylated) to 1 (fully methylated), as shown in SM 3.1 Fig. 1.
Statistical analysis
Epigenome-wide association study (EWAS) analysis
In total, 10 separate EWAS analyses were conducted using linear multiple regressions fitted individually for each CpG. Models were built with processed DNAm β as the dependent variable and the PLR phenotype as the predictor with age, batch, 10 principal components (identified using location-based principal component analysis63, SM 3.2) and estimated cell type proportion for 7 common cell-types in buccal samples (identified using EpiDISH R package64, SM 3.3) as covariates. To account for account for multiple testing in epigenome-wide association studies (EWAS), we applied a ‘stringent’ 450k significance threshold of p < 2.4 × 10− 765, and a more liberal ‘discovery’ significance threshold of p < 5 × 10− 5, as commonly utilised in EWAS of complex phenotype including traits associated with autism24,26,27,28,66.
Differential methylated region (DMR) analysis
We used dmrff R-package67 to identify DMRs associated with the 10 phenotypes of interest. DMRs were determined using EWAS summary statistics and defined as regions of two or more neighbouring probes within 500 base pairs with EWAS p-value < 0.05 and effect estimates in the same direction (negative or positive). The p-value threshold for significant DMR was set at 0.05 after Bonferroni adjustments.
Downstream exploratory analysis
To assess the biological significance of genes linked to probes associated with the PLR phenotypes, we performed Gene Ontology (GO) enrichment analysis using the top 1,000 probes (ranked by p-value) and significantly associated DMRs (SM 5 Table 1). Analysis focused on biological processes and was conducted with the PANTHER overrepresentation test68,69,70, referencing genes annotated to the 402,971 probes used in EWAS and DMR analyses. Gene annotations were based on Illumina UCSC hg19 annotations71. Significant enrichment required at least two genes per term and an FDR-adjusted p-value ≤ 0.05. We investigated whether probes significantly associated with PLR in the EWAS or DMR analysis were previously linked to autism in the MRC-IEU EWAS catalogue29 (October 2024) or located within autism-associated genes listed in the SFARI Gene database30 (September 2024; gene.sfari.org).
Data availability
The datasets analysed during the current study are not publicly available but can be requested from the Emily J. H. Jones (a co-author of this manuscript and author of the original studies).
References
American Pyschiatric Association. Diagnostic and Statistical Manual of Mental Disorders. (2013).
Lord, C. et al. Autism from 2 to 9 years of age. Arch. Gen. Psychiatry. https://doi.org/10.1001/archpsyc.63.6.694 (2006).
Lee Gregory, M., Burton, V. J. & Shapiro, B. K. ‘Developmental Disabilities and Metabolic Disorders. In Neurobiology of Brain Disorders: Biological Basis of Neurological and Psychiatric Disorders (2015). https://doi.org/10.1016/B978-0-12-398270-4.00003-3
Flint, J. & Munafò, M. R. The endophenotype concept in psychiatric genetics (2007). https://doi.org/10.1017/S0033291706008750
Fish, L. A. et al. Development of the pupillary light reflex from 9 to 24 months: association with common autism spectrum disorder (ASD) genetic liability and 3-year ASD diagnosis. J. Child. Psychol. Psychiatry. https://doi.org/10.1111/jcpp.13518 (2021).
Daluwatte, C., Miles, J. H., Sun, J. & Yao, G. Association between pupillary light reflex and sensory behaviors in children with autism spectrum disorders. Res. Dev. Disabil. https://doi.org/10.1016/j.ridd.2014.11.019 (2015).
McDougal, D. H. & Gamlin, P. D. Autonomic control of the eye. Compr. Physiol. https://doi.org/10.1002/cphy.c140014 (2015).
Bremner, F. D. The pupil: anatomy, physiology, and clinical applications: By Irene E. Loewenfeld. 1999. Oxford: Butterworth-Heinemann. Price pound180. Pp. 2278. ISBN 0-750-67143-2.. Brain https://doi.org/10.1093/brain/124.9.1881 (1999).
de Vries, L., Fouquaet, I., Boets, B., Naulaers, G. & Steyaert, J. Autism spectrum disorder and pupillometry: A systematic review and meta-analysis (2021). https://doi.org/10.1016/j.neubiorev.2020.09.032
Dinalankara, D. M. R., Miles, J. H., Takahashi, T. N. & Yao, G. Atypical pupillary light reflex in 2–6-year-old children with autism spectrum disorders. Autism Res. https://doi.org/10.1002/aur.1745 (2017).
Heller, P. H., Perry, F., Jewett, D. L. & Levine, J. D. ‘Autonomic components of the human pupillary light reflex’. Invest Ophthalmol. Vis. Sci (1990).
Lynch, G. Using pupillometry to assess the atypical pupillary light reflex and LC-Ne system in ASD (2018). https://doi.org/10.3390/bs8110108
Nyström, P., Gredebäck, G., Bölte, S. & Falck-Ytter, T. Hypersensitive pupillary light reflex in infants at risk for autism. Mol. Autism. https://doi.org/10.1186/s13229-015-0011-6 (2015).
Nyström, P. et al. Enhanced pupillary light reflex in infancy is associated with autism diagnosis in toddlerhood. Nat. Commun. https://doi.org/10.1038/s41467-018-03985-4 (2018).
de Vries, L. M. et al. Investigating the development of the autonomic nervous system in infancy through pupillometry. J. Neural Transm. https://doi.org/10.1007/s00702-023-02616-7 (2023).
Kercher, C. et al. A longitudinal study of pupillary light reflex in 6- to 24-month children. Sci. Rep. https://doi.org/10.1038/s41598-020-58254-6 (2020).
Nelson, C. A. et al. An integrative, multidisciplinary approach to the study of brain-behavior relations in the context of typical and atypical development. Dev. Psychopathol. https://doi.org/10.1017/S0954579402003061 (2002).
Ciernia, A. V. & LaSalle, J. The landscape of DNA methylation amid a perfect storm of autism aetiologies (2016). https://doi.org/10.1038/nrn.2016.41
Wolffe, A. P. & Matzke, M. A. Epigenetics: Regulation through repression (1999). https://doi.org/10.1126/science.286.5439.481
Jjingo, D., Conley, A. B., Yi, S. V. & Lunyak, V. V. King Jordan, ‘On the presence and role of human gene-body DNA methylation’. Oncotarget https://doi.org/10.18632/oncotarget.497 (2012).
Salinas, R. D., Connolly, D. R. & Song, H. Invited Review: Epigenetics Neurodevelopment. https://doi.org/10.1111/nan.12608 (2020).
Nardone, S. et al. DNA methylation analysis of the autistic brain reveals multiple dysregulated biological pathways. Transl Psychiatry. https://doi.org/10.1038/tp.2014.70 (2014).
Ladd-Acosta, C. et al. Common DNA methylation alterations in multiple brain regions in autism. Mol. Psychiatry. https://doi.org/10.1038/mp.2013.114 (2014).
Wong, C. C. Y. et al. Genome-wide DNA methylation profiling identifies convergent molecular signatures associated with idiopathic and syndromic autism in post-mortem human brain tissue. Hum. Mol. Genet. https://doi.org/10.1093/hmg/ddz052 (2019).
Wong, C. C. Y. et al. Methylomic analysis of monozygotic twins discordant for autism spectrum disorder and related behavioural traits. Mol. Psychiatry. https://doi.org/10.1038/mp.2013.41 (2014).
Andrews, S. V. et al. Case-control meta-analysis of blood DNA methylation and autism spectrum disorder. Mol. Autism. https://doi.org/10.1186/s13229-018-0224-6 (2018).
Hannon, E. et al. Elevated polygenic burden for autism is associated with differential DNA methylation at birth. Genome Med. https://doi.org/10.1186/s13073-018-0527-4 (2018).
Gui, A. et al. Leveraging epigenetics to examine differences in developmental trajectories of social attention: A proof-of-principle study of DNA methylation in infants with older siblings with autism. Infant Behav. Dev. https://doi.org/10.1016/j.infbeh.2019.101409 (2020).
Battram, T. et al. The EWAS catalog: A database of epigenome-wide association studies. Wellcome Open. Res. https://doi.org/10.12688/wellcomeopenres.17598.1 (2022).
Banerjee-Basu, S. & Packer, A. SFARI gene: an evolving database for the autism research community (2010). https://doi.org/10.1242/dmm.005439
Becket Ebitz, R. & Moore, T. Selective modulation of the pupil light reflex by microstimulation of prefrontal cortex. J. Neurosci. https://doi.org/10.1523/JNEUROSCI.2433-16.2017 (2017).
Mathôt, S., Grainger, J. & Strijkers, K. Pupillary responses to words that convey a sense of brightness or darkness. Psychol. Sci. https://doi.org/10.1177/0956797617702699 (2017).
Mathôt, S., Sundermann, L. & van Rijn, H. The effect of semantic brightness on pupil size: A replication with Dutch words’, bioRxiv (2019).
Mathôt, S., Dalmaijer, E., Grainger, J. & Van der Stigchel, S. The pupillary light response reflects exogenous attention and Inhibition of return. J. Vis. https://doi.org/10.1167/14.14.7 (2014).
Lévy, J. et al. NR4A2 haploinsufficiency is associated with intellectual disability and autism spectrum disorder. Clin. Genet. https://doi.org/10.1111/cge.13383 (2018).
Leppa, V. M. M. et al. Rare inherited and de Novo CNVs reveal complex contributions to ASD risk in multiplex families. Am. J. Hum. Genet. https://doi.org/10.1016/j.ajhg.2016.06.036 (2016).
O ’ Roak, B. J. et al. Sporadic autism exomes reveal a highly interconnected protein network of de Novo mutations. Nature https://doi.org/10.1038/nature10989 (2012).
Lim, E. T. et al. Rates, distribution and implications of postzygotic mosaic mutations in autism spectrum disorder. Nat. Neurosci. https://doi.org/10.1038/nn.4598 (2017).
Wang, T. et al. De Novo genic mutations among a Chinese autism spectrum disorder cohort. Nat. Commun. https://doi.org/10.1038/ncomms13316 (2016).
Stelzer, G. et al. The genecards suite: from gene data mining to disease genome sequence analyses. Curr. Protoc. Bioinf. https://doi.org/10.1002/cpbi.5 (2016).
Sapir, T. et al. Heterogeneous nuclear ribonucleoprotein U (HNRNPU) safeguards the developing mouse cortex. Nat. Commun. https://doi.org/10.1038/s41467-022-31752-z (2022).
Mastropasqua, F. et al. Deficiency of the heterogeneous nuclear ribonucleoprotein U locus leads to delayed hindbrain neurogenesis. Biol. Open. https://doi.org/10.1242/bio.060113 (2023).
Malik, M. Z. et al. Disruption in the regulation of casein kinase 2 in circadian rhythm leads to pathological states: cancer, diabetes and neurodegenerative disorders. Front. Mol. Neurosci. https://doi.org/10.3389/fnmol.2023.1217992 (2023).
Tsai, P. C. & Bell, J. T. Power and sample size Estimation for epigenome-wide association scans to detect differential DNA methylation. Int. J. Epidemiol. https://doi.org/10.1093/ije/dyv041 (2015).
Smith, A. K. et al. DNA extracted from saliva for methylation studies of psychiatric traits: evidence tissue specificity and relatedness to brain. Am. J. Med. Genet. Part. B: Neuropsychiatric Genet. https://doi.org/10.1002/ajmg.b.32278 (2015).
Morris, T. J. & Beck, S. Analysis pipelines and packages for infinium HumanMethylation450 BeadChip (450k) data. Methods https://doi.org/10.1016/j.ymeth.2014.08.011 (2015).
Johnson, M. H., Gliga, T., Jones, E. & Charman, T. Annual research review: infant development, autism, and ADHD - Early pathways to emerging disorders. J. Child. Psychol. Psychiatry. https://doi.org/10.1111/jcpp.12328 (2015).
Narzisi, A. et al. Sensory processing in autism: a call for research and action. Front. Media SA. https://doi.org/10.3389/fpsyt.2025.1584893 (2025).
Daly, G., Jackson, J. & Lynch, H. Family life and autistic children with sensory processing differences: A qualitative evidence synthesis of occupational participation (2022). https://doi.org/10.3389/fpsyg.2022.940478
Griffin, Z. A. M. et al. Atypical sensory processing features in children with autism, and their relationships with maladaptive behaviors and caregiver strain. Autism Res. https://doi.org/10.1002/aur.2700 (2022).
Szatmari, P. et al. Prospective Longitudinal Studies of Infant Siblings of Children with Autism: Lessons Learned and Future Directions (2016). https://doi.org/10.1016/j.jaac.2015.12.014
Michels, K. B. et al. Recommendations for the design and analysis of epigenome-wide association studies (2013). https://doi.org/10.1038/nmeth.2632
Gliga, T. et al. Enhanced visual search in infancy predicts emerging autism symptoms. Curr. Biol. https://doi.org/10.1016/j.cub.2015.05.011 (2015).
Fan, X., Miles, J. H., Takahashi, N. & Yao, G. Abnormal transient pupillary light reflex in individuals with autism spectrum disorders. J. Autism Dev. Disord. https://doi.org/10.1007/s10803-009-0767-7 (2009).
Hellmer, K. & Nyström, P. Infant acetylcholine, dopamine, and melatonin dysregulation: neonatal biomarkers and causal factors for ASD and ADHD phenotypes. Med. Hypotheses. https://doi.org/10.1016/j.mehy.2017.01.015 (2017).
R Core Team. R Development Core Team. R: A Language and Environment for Statistical Computing (2016).
RStudio Team, RStudio: Integrated Development for R (2022).
Davis, S. Du, P. & Bilke, S. Im Triche, and oiz Bootwalla, ‘Methylumi: Handle Illumina methylation data (2012).
Pidsley, R. et al. A data-driven approach to preprocessing illumina 450K methylation array data. BMC Genom. https://doi.org/10.1186/1471-2164-14-293 (2013).
Dedeurwaerder, S. et al. A comprehensive overview of infinium human Methylation450 data processing. Brief. Bioinform. https://doi.org/10.1093/bib/bbt054 (2013).
Chen, Y. A. et al. Discovery of cross-reactive probes and polymorphic CpGs in the illumina infinium HumanMethylation450 microarray. Epigenetics https://doi.org/10.4161/epi.23470 (2013).
Price, M. E. et al. Additional annotation enhances potential for biologically-relevant analysis of the illumina infinium HumanMethylation450 BeadChip array. Epigenetics Chromatin. https://doi.org/10.1186/1756-8935-6-4 (2013).
Barfield, R. T. et al. Accounting for population stratification in DNA methylation studies. Genet. Epidemiol. https://doi.org/10.1002/gepi.21789 (2014).
Zheng, S. C., Breeze, C. E., Beck, S. & Teschendorff, A. E. Identification of differentially methylated cell types in epigenome-wide association studies. Nat. Methods. https://doi.org/10.1038/s41592-018-0213-x (2018).
Saffari, A. et al. Estimation of a significance threshold for epigenome-wide association studies. Genet. Epidemiol. https://doi.org/10.1002/gepi.22086 (2018).
Kandaswamy, R. et al. DNA methylation signatures of adolescent victimization: analysis of a longitudinal monozygotic twin sample. Epigenetics https://doi.org/10.1080/15592294.2020.1853317 (2021).
Suderman, M. et al. dmrff: identifying differentially methylated regions efficiently with power and control, bioRxiv (2018).
Aleksander, S. A. et al. The gene ontology knowledgebase in 2023. Genetics https://doi.org/10.1093/genetics/iyad031 (2023).
Ashburner, M. et al. Gene ontology: Tool for the unification of biology (2000). https://doi.org/10.1038/75556
Thomas, P. D. et al. PANTHER: Making genome-scale phylogenetics accessible to all (2022). https://doi.org/10.1002/pro.4218
Karolchik, D. et al. The UCSC table browser data retrieval tool. Nucleic Acids Res. https://doi.org/10.1093/nar/gkh103 (2004).
Acknowledgements
We heartfully thank all the families who took part in our research. We also thank The BASIS Team members listed here in alphabetical order contributed to the design and/or data collection of the longitudinal BASIS study: Baron-Cohen S., Blasi A., Cheung C., Davies K., Elsabbagh M., Fernandes J., Gammer I., Ganea N., Gliga T., Guiraud J., Hendry A., Liersch U., Liew M., Maris H., O’Hara L., Pickles A., Ribeiro H., Salomone E., Taylor, C., Tucker L., Tye C., Wass S. The authors also acknowledge the contribution and use of the CREATE high-937 performance computing cluster at King’s College London (King’s College London (2022) (King’s Computational Research, Engineering and Technology Environment; CREATE; https://doi.org/10.18742/rnvf-m076.). The current work was funded by the Simons Foundation Autism Research Initiative (SFARI) Pilot Award grant no. 511504; European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant no.642996 (BRAINVIEW). Data collection was funded by MRC Programme [grant numbers G0701484 and MR/K021389/1], the BASIS funding consortium led by Autistica (www.basisnetwork.org), EU-AIMS (the Innovative Medicines Initiative joint undertaking grant agreement number 115300, resources of which are composed of financial contributions from the European Union’s Seventh Framework Programme [grant number FP7/2007-2013] and EFPIA companies’ in-kind contribution) and AIMS-2-TRIALS (Innovative Medicines Initiative 2 Joint Undertaking under grant agreement number 777394. This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA and AUTISM SPEAKS, Autistica, SFARI). Epigenetic data generation was funded by MRC Centenary Award to Chloe C. Y. Wong. Chloe C. Y. Wong was supported by the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London [NIHR203318]. Laurel A. Fish was supported by the SGDP PhD Studentship award. The views expressed are those of the authors and not necessarily those of the NIHR, the Department of Health, King’s College London, or other affiliated institutes.
Funding
The current work was funded by the Simons Foundation Autism Research Initiative (SFARI) Pilot Award grant no. 511504; European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant no.642996 (BRAINVIEW). Data collection was funded by MRC Programme [grant numbers G0701484 and MR/K021389/1], the BASIS funding consortium led by Autistica (www.basisnetwork.org), EU-AIMS (the Innovative Medicines Initiative joint undertaking grant agreement number 115300, resources of which are composed of financial contributions from the European Union’s Seventh Framework Programme [grant number FP7/2007–2013] and EFPIA companies’ in-kind contribution) and AIMS-2-TRIALS (Innovative Medicines Initiative 2 Joint Undertaking under grant agreement number 777394. This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA and AUTISM SPEAKS, Autistica, SFARI). Epigenetic data generation was funded by MRC Centenary Award to Chloe C. Y. Wong. Chloe C. Y. Wong was supported by the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London [NIHR203318]. Laurel A. Fish was supported by the SGDP PhD Studentship award funded by the UK Medical Research Council (MRC). The views expressed are those of the authors and not necessarily those of the NIHR, the Department of Health, King’s College London, or other affiliated institutes.
Author information
Authors and Affiliations
Contributions
Concept and design: L.A.F., T.G., C.C.Y.W., E.J.H.J., M.H.J., T.C.Acquisition of data: A.G., J.B.A., L.M.Analysis or interpretation of data: L.A.F., C.C.Y.W., T.G., M.H.J., T.C., T.F.-Y., E.J.H.J., R.K., F.H.Drafting of the manuscript: L.A.F., C.C.Y.W.Critical revision of the manuscript for important intellectual content: T.G., M.H.J., T.C., T.F.-Y., E.J.H.J., R.K., F.H., C.C.Y.W.Statistical analysis: L.A.F., C.C.Y.W.Obtained funding: M.H.J., T.C., E.J.H.J., C.C.Y.W., T.G.Data collection: A.G., J.B.A., L.M.Supervision: E.J.H.J., C.C.Y.W., F.H.Data processing: R.K., T.F.-Y.PLR processing pipeline conception: T.F.-Y.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fish, L.A., Gliga, T., Gui, A. et al. Epigenome-wide analysis identifies DNA methylation signatures associated with the infant pupillary light reflex, a candidate intermediate phenotype for autism. Sci Rep 16, 325 (2026). https://doi.org/10.1038/s41598-025-31651-5
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-31651-5


