EEG neural indicator of temporal integration in the human auditory brain with clinical implications

Xu, Haoxuan; Huang, Qianyue; Song, Peirun; Chen, Yanxin; Li, Qiuyu; Zhai, Yuying; Du, Xinyu; Ye, Hangting; Bao, Xuehui; Mehmood, Ishrat; Tanigawa, Hisashi; Niu, Wanqiu; Tu, Zhiyi; Chen, Pei; Zhang, Tingting; Zhang, Lingling; Zhao, Xuan; Zhang, Li; Wen, Wanshun; Cao, Liyu; Yu, Xiongjie

doi:10.1038/s42003-025-08540-8

Download PDF

Article
Open access
Published: 26 July 2025

EEG neural indicator of temporal integration in the human auditory brain with clinical implications

Haoxuan Xu^1,2,3,4^na1,
Qianyue Huang^1,2,3^na1,
Peirun Song^1,2^na1,
Yanxin Chen ORCID: orcid.org/0009-0005-3020-6805¹,
Qiuyu Li¹,
Yuying Zhai¹,
Xinyu Du¹,
Hangting Ye¹,
Xuehui Bao³,
Ishrat Mehmood³,
Hisashi Tanigawa ORCID: orcid.org/0000-0002-7736-0683³,
Wanqiu Niu⁵,
Zhiyi Tu⁵,
Pei Chen⁵,
Tingting Zhang⁵,
Lingling Zhang⁵,
Xuan Zhao⁵,
Li Zhang⁶,
Wanshun Wen⁶,
Liyu Cao ORCID: orcid.org/0000-0002-1124-9579^7,8 &
…
Xiongjie Yu ORCID: orcid.org/0000-0002-0040-2187^1,2,3,4,5

Communications Biology volume 8, Article number: 1109 (2025) Cite this article

1779 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Temporal integration, the process by which the auditory system combines sound information over a certain period to form a coherent auditory experience, is essential for auditory perception, yet its neural mechanisms remain underexplored. We use a “transitional click train” paradigm, which concatenates two click trains with slightly differing inter-click intervals (ICIs), to investigate temporal integration in the human brain. Using a 64-channel electroencephalogram (EEG), we recorded responses from healthy participants exposed to regular and irregular transitional click trains and conducted change detection tasks. Regular transitional click trains elicited significant change responses in the human brain, indicative of temporal integration, whereas irregular trains did not. These neural responses were modulated by length, contrast, and regularity of ICIs. Behavioral data mirrored EEG findings, showing enhanced detection for regular conditions compared to irregular conditions and pure tones. Furthermore, variations in change responses were associated with decision-making processes. Temporal continuity was critical, as introducing gaps between click trains diminished both behavioral and neural responses. In clinical assessments, 22 coma patients exhibited diminished or absent change responses, effectively distinguishing them from healthy individuals. Our findings identify distinct neural markers of temporal integration and highlight the potential of transitional click trains for clinical diagnostics.

A new function of offset response in the primate auditory cortex: marker of temporal integration

Article Open access 18 October 2024

Contrastive representation learning with transformers for robust auditory EEG decoding

Article Open access 06 August 2025

Utilization of temporal autoencoder for semi-supervised intracranial EEG clustering and classification

Article Open access 13 January 2023

Introduction

Sound, intrinsically bound to the temporal domain, necessitates temporal integration for a coherent perception. Traditional auditory research has predominantly concentrated on the frequency domain, in which the auditory system is tonotopically organized to segregate and process distinct frequencies along the auditory pathway¹. Nonetheless, the critical importance of the temporal dimension in auditory processing cannot be overstated. Temporal elements, including rhythm, timing, and the recognition of complex sound patterns, are fundamentally reliant on temporal cues². This temporal aspect underpins essential facets of speech and music perception, alongside the discrimination of environmental sounds³. An auditory object—a perceptual unit formed through integration of spectral, temporal, and spatial cues that the brain identifies as distinct from background sounds—emerges from the integration process. Auditory objects may span milliseconds to seconds (e.g., a spoken syllable, a musical note, or a complete phrase), and they exhibit hierarchical organization across different timescales. For instance, temporal integration in humans spans various timescales⁴, as demonstrated in oral language processing where the brain aligns with syllables, phrases, and sentences at differing temporal scales^5,6. A related but distinct process is auditory streaming, which refers to the brain’s ability to segregate different sound sources over time, forming separate perceptual streams. While auditory streaming explains how the brain organizes sequential sounds into distinct perceptual streams based on acoustic cues such as frequency or spatial location^7,8,9, temporal integration accounts for the neural process by which temporally proximal (discrete) auditory events are fused into a coherent perceptual unit. Despite its crucial role in auditory perception, the neural mechanisms underlying temporal integration—particularly the neural signatures related to the fusion process—remain poorly understood.

Addressing this gap necessitates the generation of sounds with distinct components to isolate the holistic response from those elicited by individual elements. Click trains, comprised of uniform pulses that vary only in temporal spacing, serve as an exemplary stimulus for delving into temporal auditory processing. Despite their prevalent use in auditory research, click trains have rarely been studied for their holistic representation, as most research has focused on encoding individual clicks^10,11,12. However, psychological studies have indicated that when the inter-click interval (ICI) of a click train is small (< 33 ms), it can be perceived as a continuous pitch-like sound^13,14, suggesting of temporal integration during the perception of the click train. This indicates that a click train, possessing a long-timescale structure but composed of individual clicks with short-timescale characteristics, can be perceived as a coherent auditory pitch rather than a collection of individual elements.

To investigate temporal integration in a controlled manner, we designed a novel “transitional click train” paradigm. A transitional click train consists of two consecutive segments of click sounds, where the ICI differs slightly between the first and second segments. The first segment establishes a stable temporal pattern (potentially yielding a perceived pitch), and the second segment introduces a new pitch, creating a temporal change midway through the sound. If the auditory system integrates the first segment into a single auditory object (e.g., a continuous pitch), then the transition to the new ICI would be perceived as a salient change—essentially marking the emergence of a new auditory object. This paradigm is particularly advantageous because it probes the brain’s ability to detect internal changes in an ongoing sound, rather than merely responding to the abrupt onset of a new sound. Moreover, when ICIs are in the millisecond range, prolonged stimulation leads to cortical adaptation following the initial onset response, such that individual clicks no longer evoke discernible responses^10,15. The transitional click train leverages this adaptation, maintaining consistent stimulation while introducing a perceptual shift—thereby isolating the neural signature associated with the perceptual transition between auditory objects.

Crucially, temporal integration is not only foundational for basic auditory cognition, but is also increasingly recognized as a marker of conscious sensory processing and higher cognitive functions. Recent studies indicate that disruptions in temporal integration are linked to disorders of consciousness^16,17 and various psychiatric conditions, including schizophrenia¹⁸, autism¹⁹, ADHD, and Parkinson’s disease²⁰. Neurophysiological work has further demonstrated that processing of temporal structure is impaired in patients with disordered consciousness^21,22. Accurate assessment of residual consciousness is critical for prognosis and clinical decision-making; however, behavioral assessments are often unreliable and prone to misdiagnosis. Neurophysiological methods offer objective tools to detect “covert cognition” and improve diagnostic accuracy. Despite this potential, few paradigms currently provide temporally precise, non-invasive markers of temporal integration that are well-suited for clinical applications.

In our study, we employed the “transitional click train” to explore the neural correlates of temporal integration within click trains. Our results demonstrated that the human brain indeed exhibited a specific change response closely linked to temporal integration when exposed to transitional click trains. This discovery provides an important neural indicator of how the auditory system constructs and modifies auditory objects over time. Furthermore, our findings open new directions for auditory research, including potential clinical applications, as deficits in temporal integration are known to be associated with various neurological and psychiatric disorders.

Results

Temporal merging into auditory objects with click trains

The human brain seamlessly integrates discrete sounds into a unified perceptual experience when the interval between these sounds is exceptionally short. For example, in the case of click trains, individuals cannot discern the gaps between clicks when the inter-click interval (ICI) falls below 29.6 ms²³ (gap detection task in Experiment 1; Supplementary Fig. 1). This phenomenon is an indication of temporal integration, whereby separate clicks merge into a single auditory object experienced psychologically. This process, where sounds with small auditory gaps integrate into a singular auditory object, is what we termed “temporal merging”. To investigate the neural representation of this temporal integration, we devised an experimental protocol featuring two types of click trains. The first type is a regular click train, characterized by uniform ICIs (the top row of Fig. 1a). The second type is an irregular click train, marked by variable ICIs (the bottom row of Fig. 1a). These timing variations disrupt the formation of a unified temporal pitch, thus providing a comparison condition to assess whether rhythmic consistency is essential for temporal merging. We created transitional trains by linking two 1-s regular click trains with different ICIs, one at 4 ms (train 1) and the other at 4.06 ms (train 2), and referred to these as Reg_4-4.06 (the top row of Fig. 1b). Similarly, we generated transitional trains using irregular click trains with two distinct average ICIs, again one at 4 ms (train 1) and the other at 4.06 ms (train 2), and labeled these as Irreg_4-4.06 (the bottom row of Fig. 1b). Unless otherwise specified, the standard deviation of the irregular click train (e.g., Irreg_4-4 and Irreg_4-8) is set to half of the mean value (μ/2). The two types of transitional trains were randomly presented to participants. For illustration, representative audio files of Reg_4-4.06 and Irreg_4-4.06 are provided as supplementary materials.

**Fig. 1: Change responses in transitional click trains.**

At the example electrode POz (Fig. 1c, d), a significant difference (p < 0.05, permutation test, 333−409 ms relative to the onset) emerged between the regular and irregular trains following the onset response, converging before the change point. The change responses within the time windows (92 to 148 ms and 163 to 280 ms relative to the change point) were significantly stronger for Reg_4-4.06 than for Irreg_4-4.06 (n = 42 from Session 1 of Experiment 2, p < 0.05, permutation test). The change response to Reg_4-4.06 in the temporal-parietal-occipital scalp regions consisted of a positive component peaking around 120 ms after the change, followed by a negative component peaking around 200 ms (Fig. 1d). To quantify this change response across the entire scalp, we designated the positive component as “change P1” (cP1) and the negative component as “change N2” (cN2), and calculated the root mean square (RMS) over 74 to 251 ms following the change point for each channel, covering the combined cP1-cN2 responses (yellow vertical bar, Supplementary Fig. 2c). Furthermore, among the 42 subjects, the majority exhibited higher response amplitudes in the time window of the change response for Reg_4-4.06 compared to Irreg_4-4.06 (t(41) = 6.63, p < 0.001, d = 1.02, paired t-test; Fig. 1e), highlighting the differential impact of regular versus irregular transitional trains on auditory processing. Across all the channels of the 42 subjects, pronounced change responses to Reg_4-4.06 were observed in the temporal, parietal, and occipital electrodes (Supplementary Fig. 2a, c). In contrast, the response to Irreg_4-4.06 primarily manifested as an onset response, with a weak or indistinct change response observed (Supplementary Fig. 2b, d). Source analysis further indicated that the origin of the change response was predominantly localized within the auditory cortex (Fig. 1f, g).

To ascertain whether the observed change response stemmed from true temporal integration or was merely a reaction to the transient change of ICIs, we systematically replaced subsets of the original 4 ms ICIs in the ongoing click train with 4.06 ms ICIs (1, 2, 4, 8, 16, or 32 replacements; Session 2 of Experiment 2; Fig. 2a). Crucially, these modified intervals were not silent but instead slightly lengthened ICIs, preserving continuous acoustic stimulation while altering temporal pattern. These modified click trains were compared to responses elicited by the Reg_4-4.06 transitional train and continuous the Reg_4-4 click train (fixed 4 ms ICIs). Remarkably, the introduction of a single interval failed to produce any noticeable change responses (t(23) = 0.52, p = 0.68, d = 0.09, BF₁₀ = 0.23, one-sample t-test), as evidenced in Fig. 2b, c. It was not until the insertion of 16 intervals that the change response saturated to the level of Reg_4-4.06 (Fig. 2b, c). Thus, the change response most likely corresponds to a perceptual shift between two pitches, coalesced through temporal merging.

**Fig. 2: Change responses depend on temporal integration.**

Factors affecting change response

Having established the link between the change response and temporal merging, we endeavored to delineate the temporal boundary for integration, we systematically varied the ICI length while maintaining a fixed ratio between click train 1 and click train 2 (Session 3 of Experiment 2; Fig. 3a). Our goal was to identify at what ICI length the change response diminishes, indicating a failure of temporal integration. We found that an increase in ICI resulted in a diminution of the change response (n = 42; Fig. 3b), with the response magnitude being inversely proportional to the ICI length (Fig. 3c). Specifically, a strong change response was elicited by both Reg_4-4.06 (t(41) = 6.31, p < 0.001, d = 0.97, one-sample t-test) and Reg_8-8.12 (t(41) = 5.54, p < 0.001, d = 0.86, one-sample t-test). In contrast, configurations with longer ICIs, such as Reg_32-32.48, failed to produce detectable change responses (t(41) = 0.89, p = 0.38, d = 0.14, BF₁₀ = 0.24, one-sample t-test). While Reg_16-16.24 did induce a significant change response in the occipital lobe (Supplementary Fig. 3), the overall average did not show a significant change response (t(41) = 0.16, p = 0.87, d = 0.03, BF₁₀ = 0.17, one-sample t-test; Fig. 3c). These findings suggest that the upper limit for the perceptual integration of individual clicks is between 16 and 32 ms, at least under the current experimental conditions.

**Fig. 3: The effect of three factors on change responses in transitional train.**

To assess the resolution of temporal integration, we kept the first train’s ICI constant (4 ms) and systematically increased the contrast (difference) between the two click train segments (Session 1 of Experiment 2; Fig. 3d). This was achieved through holding the ICI of train 1 constant (4 ms), whilst systematically modulating the ICI of train 2 so that it was longer than train 1 by a ratio between 0.25% and 1.5% (Fig. 3d). The results showed that a mere 0.5% difference in ICI (Reg_4-4.02) was sufficient to elicit significant change responses (t(41) = 4.10, p < 0.001, d = 0.63, one-sample t-test; Fig. 3e), thereby indicating a remarkably high temporal resolution in the process of auditory temporal integration. Larger ICI contrasts were associated with stronger change responses (Fig. 3e, f).

The initial phase of our investigation predominantly focused on click trains characterized by constant ICIs, designated as regular click trains. This prompted a subsequent inquiry into the perceptual implications of varying ICIs within click trains. To quantitatively assess the effect of regularity, we introduced varying degrees of variance to each train, with standard deviations of µ/400, µ/200, µ/100, µ/2, and 0 (where µ denotes the mean ICI, set at either 4 ms or 4.06 ms, and standard deviation of 0 corresponds to Reg_4-4.06) (Session 4 of Experiment 2; Fig. 3g). Significant change responses were only observed in 0 (t(23) = 3.61, p < 0.01, d = 0.74, one-sample t-test) and µ/400 conditions (t(23) = 3.16, p < 0.01, d = 0.64, one-sample t-test; Fig. 3h, i), with a negligible difference between the two conditions (i.e., 0 and μ/400; t(23) = 1.76, p = 0.09, d = 0.36, BF₁₀ = 0.82, paired t-test). The responses with µ/200 were significantly weaker than those with 0 standard deviation (i.e., Reg_4-4.06) (t(23) = 3.33, p < 0.01, d = 0.68, paired t-test).

Perception performance during transitional click train

We next investigated the perception of temporal merging using behavioral experiments together with EEG recording. The primary objective was to clarify the impact of click regularity on temporal merging (Fig. 1c–e and Fig. 3g–i) and to compare the perception of temporal merging auditory objects and pure tones. For this purpose, we developed an experimental paradigm that juxtaposed three distinct sets of stimuli to assess both the behavioral performance and change responses under various degrees of contrast (Experiment 3; Fig. 4a). The regular condition included transitional click trains transitioning from a regular click train with 4-ms ICI to another ICI (contrast levels: 0, 0.25%, 0.5%, 0.75%, 1.5%). The irregular condition comprised transitional click trains transitioning from an irregular click train (standard deviation: µ/2) with an average ICI of 4 ms to another average ICI (contrast levels: 0, 1.5%, 100%). The tone condition consisted of pure tones shifting from 250 Hz to another frequency (contrast levels: 0, 1.5%). Each block within the three conditions was designed to present a 1 s initial stimulus followed by a 1 s subsequent stimulus, concluding with a 2-s choice window. Participants were required to detect whether an auditory stimulus change had occurred (Fig. 4a).

**Fig. 4: Psychological and EEG results during change detection task.**

The results showed that the change detection performance progressed with the increase in the difference between the first and the second stimulus (Fig. 4b). Remarkably, a 1.5% contrast difference in the regular condition (Reg_4-4.06) led to a detection rate of 98.8% in correctly identifying changes (n = 36; Fig. 4c), in stark contrast to the detection rate of 35.7% observed in the irregular condition (Irreg_4-4.06), which did not significantly differ from its control, Irreg_4-4 (t(35) = 0.93, p = 0.36, d = 0.16, BF₁₀ = 0.27, paired t-test; Fig. 4c), suggesting no perceptual distinction between the two segments in the irregular transitional click train. The detection rate in the irregular condition reached 87.4% when the contrast was increased by 100% (i.e., Irreg_4-8, Fig. 4b). For pure tones, the detection rate was 90.1% when the tone shifted from 250 Hz to 246 Hz (Tone_250-246). Note that 250 Hz corresponds to 4 ms, and 246 Hz corresponds to 4.06 ms. The detection rate for Tone_250-246 was significantly higher than the control condition (Tone_250-250), in which the tone was always 250 Hz (t(35) = 25, p < 0.001, d = 4.17, paired t-test; Fig. 4c), yet lower than that observed for Reg_4-4.06 (t(35) = 3.55, p < 0.01, d = 0.59, paired t-test; Fig. 4c, e). Furthermore, subject-by-subject comparisons revealed most subjects had higher detection rate for Reg_4-4.06 than for Irreg_4-4.06 (t(35) = 11.59, p < 0.001, d = 1.93, paired t-test; Fig. 4d) and Tone_250-246 (Fig. 4e). In summary, these findings emphasize the enhanced performance in the regular condition in identifying contrast changes compared to both the irregular condition and the pure tone condition.

For the EEG change responses in the three conditions, Reg_4-4.06 evoked stronger change responses compared to Irreg_4-4.06 (Fig. 4f) and Tone_250-246 (Fig. 4g). Actually, no significant change response was observed in Irreg_4-4.06 (Fig. 4f). Individual results also indicated that most subjects demonstrate stronger changes responses for Reg_4-4.06 than for Irreg_4-4.06 (t(35) = 6.86, p < 0.001, d = 1.14, paired t-test; Fig. 4h) and Tone_250-246 (t(35) = 4.41, p < 0.001, d = 0.73, paired t-test; Fig. 4i), which is consistent with the behavior results (Fig. 4d and Fig. 4e). Interestingly, the variation in change response amplitude was correlated with decision-making in the more difficult condition, such as Reg_4-4.01. In Reg_4-4.01, the decision to detect a change was typically accompanied by a stronger change response compared to the decision of no change in the sound (Supplementary Fig. 4).

The effect of temporal continuity

To investigate the impact of temporal continuity on the change responses, we designed two sets of stimuli: one set without gaps of silence between click train 1 and click train 2 (No-gap) and the other set with a gap of 600-ms silence between the two click trains (Gap) (Experiment 4; Fig. 5a). Four transitional click trains were used: Reg_4-4.01, Reg_4-4.02, Reg_4-4.03, and Reg_4-4.06. Participants were asked to detect whether an auditory stimulus change had occurred. The behavioral performance was better for the No-gap click trains than for the Gap click trains (n = 34; Fig. 5b), with most participants showing this pattern (t(33) = 6.82, p < 0.001, d = 1.17, paired t-test; Fig. 5c).

Subsequently, we examined the EEG responses, superimposing all contrast conditions for both the No-gap (Fig. 5d) and Gap (Fig. 5e) stimuli. The responses to varying contrast conditions were distinguishable in the No-gap condition (Fig. 5d) but nearly indistinguishable in the Gap condition (Fig. 5e). Two components, P1 and N2, were identified from the onset response of the second click train, as shown in Fig. 5e. We plotted the tuning curves for P1 ([70, 120] ms) and N2 ([133, 183] ms), compared with cP1 ([90, 140] ms) and cN2 ([221, 271] ms). As the contrast increased, the average response magnitude (RM) also increased, displaying clear tuning in the No-gap condition for both peak window (F(3,132) = 6.46, p < 0.001, η² = 0.13, one-way ANOVA; red line in Fig. 5f) and trough window (F(3,132) = 19.52, p < 0.001, η² = 0.31, one-way ANOVA; red line in Fig. 5g), while remaining nearly constant in the Gap condition for both windows (P1/cP1 window: F(3,132) = 1.38, p = 0.25, η² = 0.03, BF₁₀ = 0.07; N2/cN2 window: F(3,132) = 2.78, p = 0.04, η² = 0.06, BF₁₀ = 0.43, one-way ANOVA test; blue line in Fig. 5f, g). These EEG findings align with the psychological results regarding thresholds (Fig. 5c), emphasizing the role of temporal continuity in both psychological perception and neural processing.

Potential clinical application

Considering the fundamental role of temporal integration in the brain^24,25 and its relevance to many psychiatric diseases^{19,26,27,28,29}, the change response serves as a promising tool for diagnosis. To explore the potential for clinical application of this paradigm, we conducted 64-channel EEG recordings in 22 coma subjects using transitional click trains stimuli: Reg_4-4 and Reg_4-5 (Experiment 5).

For coma patients, both onset and change responses were small, even in Reg_4-5, contrasting with the healthy subjects (Supplementary Fig. 5a). The scatter plots of onset vs. change responses showed significant overlap between coma subjects and healthy subjects. This overlap was probably due to the presence of slow oscillations with larger amplitudes localized to specific channels, along with the prolonged latency and extended duration of the auditory response in coma subjects (Supplementary Fig. 6a–c). To quantify change responses in subjects with impaired consciousness, global field power (GFP) was employed due to its robustness to spatial variability and enhanced sensitivity to response latency^30,31. GFP calculates the standard deviation of EEG data across all electrodes at each sampling point, thereby mitigating the influence of spatial variability in electrode placement or individual differences in brain anatomy. This is crucial when studying subjects with impaired consciousness, where localized differences in brain function might occur due to injury or pathology. A robust onset response was detected in one example coma subject using GFP (Supplementary Fig. 6c), although no visible onset response was detected in amplitude (Supplementary Fig. 6b). However, no change response was detected even using GFP (Supplementary Fig. 6c). In the population, no visible onset or change responses were detected using GFP in coma patients, whereas significant robust responses were observed in healthy subjects (Fig. 6a).

**Fig. 6: The effect of consciousness on change responses in transitional train.**

Furthermore, the scatter plot of onset vs. change responses in GFP effectively separated coma patients from healthy subjects, suggesting the transitional click train paradigm as a good tool for distinguishing between the two groups (Fig. 6b). More interestingly, the change response may gradually recover as the coma patient regains consciousness (Supplementary Fig. 7), indicating that the transitional click train paradigm could potentially monitor the entire recovery process of coma patients. However, no correlation was found between the CRS-R score, a standard method for quantifying the degree of coma, and either onset or change responses (Supplementary Fig. 8).

Discussion

Our study meticulously examined the mechanisms of temporal merging within auditory perception, elucidating how the human auditory system assimilates discrete sound elements into unified auditory objects. With temporal merging, a click train with minimal ICIs gives a distinct auditory experience. Specifically, regular click trains (Reg_4-4.06) prompted more pronounced change responses in the auditory brain than irregular click trains (Irreg_4-4.06), highlighting the significant impact of temporal regularity on auditory processing (Figs. 1 and 4). Further analysis demonstrated that the change response is intricately tied to the integration of multiple intervals, suggesting it as a marker for the perceptual transition between distinct auditory objects via temporal merging (Fig. 2). This response is notably affected by several factors: the length of ICI (Fig. 3a–c), the ICI ratio (IC1₂ vs. ICI₁) (Fig. 3d–f), and the regularity of the click train (Fig. 3g–i). Additionally, behavioral experiments showed enhanced change detection rates for regular click trains (Reg_4-4.06) compared to irregular click trains and pure tones, corroborated by stronger EEG change responses (Fig. 4). Temporal continuity significantly affected behavioral and EEG responses, with better performance and clear tuning curves for continuous click trains compared to those with gaps (Fig. 5). Finally, the GFP method effectively distinguished coma patients from healthy subjects, suggesting the potential clinical application of transitional click trains for diagnosing and monitoring recovery in impaired consciousness (Fig. 6).

Change response in transitional click train as an indicator of temporal integration

Click trains with ICIs less than ~33 ms are often perceived as pitch³², and it has been suggested that the analysis of regularity in click trains differs for ICIs above and below 40–60 ms³³. Traditional theories differentiate pitch perception based on the auditory system’s ability to segregate individual harmonic components, categorizing pitch as either resolved or unresolved harmonics^27,28. Resolved harmonics arise from distinct components processed by separate auditory filters, whereas unresolved harmonics involve closely spaced components within a single filter, relying on temporal coding for pitch extraction. Interestingly, sounds with the same repetition rate but highly different spectral compositions often evoke the same pitch, whereas sounds with similar spectra can produce significantly different pitch percepts^27,28. This demonstrates that the frequency-to-place mapping performed by the cochlea does not necessarily correspond to a frequency-to-pitch mapping²⁹. The temporal pitch induced by click trains is distinct because it relies solely on the temporal regularity of successive auditory events rather than spectral components^29,30,31,23. However, the neuronal mechanisms underlying temporal pitch perception remain unresolved, and the relationship between temporal pitch and temporal integration has not been clearly established. Our research provides direct evidence linking temporal pitch perception to temporal integration.

Auditory research utilizing click trains as stimuli has unveiled intricate neuronal responses in the auditory system on both single-neuron and systems neuroscience levels. On the single-neuron level, neurons display a remarkable capability for precise temporal coding where individual spike activities precisely align with specific intervals between the clicks³⁴. Despite the prominence of this temporal alignment, rate coding emerges as another vital mechanism, particularly at accelerated click rates³⁵. Lu et al.¹⁰ identified two distinct populations of neurons: one that synchronizes to slow sound sequences and another that encodes rapid events through firing rates. However, these studies mainly focused on how individual clicks within a train are represented, largely overlooking the holistic perception of the click train as a coherent object³⁶. At the macroscopic level, click trains have been extensively used to study auditory steady-state responses (ASSR)^37,38,39, where the neural response follows the same frequency of auditory stimuli, and the auditory response can be disrupted by an additional click¹². These studies, similar to those at the single-neuron level, concentrate on responses to individual clicks, leaving the mechanism of how the brain integrates regular clicks into pitch perception unresolved. Recently, the holistic representation of sound has been investigated in the frequency domain, and researchers have found that auditory cortex (AC) neurons may exhibit bursting responses specifically to the configuration of tones but not to any constituent tone^40,41. However, the holistic representation in the temporal domain, especially for sound through temporal integration, has been seldom addressed. This gap exists because disentangling neural responses to individual clicks from those induced by the holistic perception of the whole click train as pitch poses a significant challenge. Consequently, no brain signal has yet been identified that adequately represents auditory events through temporal merging with click trains in prior research^10,34,35, highlighting a crucial area for future investigation at both single-neuron and macroscopic levels.

To navigate this intricacy, we propose the innovative concept of a transitional train, as illustrated in Fig. 1. A typical onset response to transitional trains was observed in the first 300 ms of EEG signals, followed by an adaptation period from ~300 to 1000 ms, during which no discernible auditory response to individual clicks or the train was detected (Supplementary Fig. 2a). However, the introduction of a second click train with a slightly changed ICI (e.g., Reg_4-4.06) within a transitional train elicited a change response in the adapted auditory brain, followed by subsequent adaptation (Fig. 1d). Moreover, the transitional train also introduced a perceptual switch psychologically (Fig. 4). Since this change response in the EEG signal is not solely attributed to local temporal changes but is linked to temporal merging (Fig. 2), it most likely reflects a perceptual switch, signifying a transition between distinct temporal-merging auditory objects (Fig. 4). The key aspect underlying the transitional click train is that it maintains the presentation of individual clicks, which leads to consistent adaptation of the auditory brain (Fig. 1), while simultaneously introducing a perceptual switch (Fig. 4).

The critical innovation of our paradigm lies in maintaining a regular stream of clicks, ensuring ongoing cortical adaptation, while subtly shifting the temporal pattern to elicit a new percept without reintroducing stimulus onset artifacts. This provides a unique framework to examine the neural basis of short-timescale temporal integration—a process distinct from higher-order regularity detection. We acknowledge prior work by Barascud et al. (2016)⁴², who used tone-pip sequences to explore the brain’s sensitivity to transitions between random and regular acoustic patterns. While conceptually related in probing sensitivity to structure, their paradigm focused on complex, longer-timescale regularities, and statistical learning. In contrast, our transitional click train design targets integration at the millisecond scale, isolating the mechanisms by which the brain fuses discrete auditory events into a single perceptual object—a function central to pitch-like perception. Importantly, we emphasize that the change response identified here is not a direct neural correlate of pitch itself, but rather a marker of temporal integration—indicating when the brain transitions from one temporally fused percept to another. This distinction clarifies the functional role of the change response: it signals a perceptual shift between temporally structured auditory objects, rather than encoding the pitch per se.

Three key factors influence the change response: the length of the ICI (Fig. 3a–c), the ICI ratio (ICI₂ vs. ICI₁) (Fig. 3d–f), and the regularity of the click train (Fig. 3g–i). Our research found that when the ICI length exceeds 16 ms, the auditory system is unable to elicit the change response, potentially suggesting a limit of temporal merging in ICI. This threshold is notably lower than what psychological studies have suggested, where the perception of a unified sound begins to falter at ICIs >29.6 ms²³ (Supplementary Fig. 1), and it is also below the 33 ms threshold often associated with pitch perception³². This discrepancy might be attributed to the inherent limitations of EEG recordings, which typically have a poor signal-to-noise ratio, underscoring the need for further investigation into the ICI threshold for temporal merging using more sophisticated methodologies. The auditory brain exhibits hypersensitivity to ICI ratios; even a 0.5% difference (Reg_4–4.02) can evoke a robust change response (Fig. 3d–f). Additionally, the regularity of the click train, which not only characterizes the temporal structure but also requires extended time for integration to extract the train’s regularity, reflects context-dependent temporal merging (Fig. 3g–i).

The transitional click train paradigm presents significant opportunities for fundamental research in auditory science. Traditional auditory research has predominantly concentrated on the frequency domain, guided by the auditory system’s tonotopic organization, where distinct frequencies are processed separately along the auditory pathway¹. Nevertheless, the importance of the temporal dimension in auditory processing cannot not be stressed enough. This temporal aspect is critical for speech and music perception, as well as for distinguishing environmental sounds³. Recent advances in neuroimaging and electrophysiology have enhanced our understanding of temporal integration mechanisms in oral language, revealing a hierarchical structure of temporal integration in the human brain^43,44. However, there remains a significant gap in our understanding of temporal integration in non-human animals, primarily due to the lack of a clear neuronal signature for this process, which has impeded research at the neuronal level and in animal studies. The identification of a change response in transitional click trains in our study provides a promising pathway to investigate this complex area further. Future research could employ the transitional click train paradigm to delve into the neuronal mechanisms underpinning temporal integration at the neuronal level in animal subjects.

In addition to providing neural signals related to temporal integration, our study elucidates the neuronal mechanisms underlying pitch perception evoked by click trains. Our findings highlight the role of temporal integration as a key process in pitch perception. Traditional theories distinguish between resolved and unresolved harmonics based on the auditory system’s ability to segregate individual harmonic components^45,46. Resolved harmonics arise from distinct components processed by separate auditory filters, while unresolved harmonics involve closely spaced components processed by a single filter, relying on temporal coding for pitch extraction. Interestingly, sounds with the same repetition rate but very different spectral compositions often evoke the same pitch, while sounds with similar spectra can produce significantly different pitches. This observation demonstrates that the frequency-to-place mapping performed by the cochlea does not necessarily correspond to a frequency-to-pitch mapping⁴⁷. Temporal pitch, induced by click trains, is distinct in that it relies solely on the temporal regularity of successive auditory events rather than on the spectral components^47,48,49,50. Our study provides compelling neuronal evidence supporting this process, demonstrating that the change response reflects the integration of temporal information into a unified auditory pitch (Fig. 2). Previous research has used paradigms similar to transitional click trains to investigate temporal pitch sensitivity⁵¹ and observed the change response in EEG signals of cats⁵⁰. Our insertion experiments further explored the nature of the change response, with a focus on temporal integration (Fig. 2).

Behavior relevance during transitional click train

The alignment between psychological findings and EEG data underscores a notable facet of our research. On the one hand, our psychological data reveal the heightened sensitivity of regular transitional click trains compared to both pure tones (Fig. 4e, g, i) and irregular click trains (Fig. 4d, f, h). Concurrently, EEG signals exhibit stronger responses to regular click trains (Fig. 4f–i). Regular click trains, especially those with shorter ICIs, are often perceived to have pitch-like qualities; thus, change detection in regular click trains may reflect a perceptual shift in pitch rather than a simple temporal discontinuity. Traditionally, this pitch perception has been explained through theories and computational models focusing on the basilar membrane’s processing¹³. The pronounced sensitivity of regular click trains over pure tones underscores the critical role of temporal integration in the central nervous system for refining fine temporal structures. This suggests that the pitch perception associated with regular click trains might originate in the central auditory system rather than the basilar membrane. This hypothesis necessitates further exploration, particularly employing our innovative transitional click train in animal studies.

Additionally, both psychological and EEG responses demonstrate a dependency on temporal continuity. The introduction of a 600 ms gap adversely affects change detection capabilities (Fig. 5b, c) and alters the tuning of change differences (Fig. 5f, g). Given that the change response to the transitional click train systematically correlates with the contrast ratio (Fig. 5f, g), whereas responses to the second sound in the Gap condition remain relatively constant across different contrast ratios, it suggests that the change response primarily signifies the signal of perceptual switching between pitches rather than the perception of the second pitch. The influence of temporal discontinuity on both behavioral and neural responses accentuates the essential role of temporal integration within the auditory system, suggesting that seamless auditory perception relies on the continuous flow of temporal information.

Change response as a biomarker in clinical application

Three key factors influence the change response (Fig. 3) and are consequently related to temporal merging, offering diverse metrics for characterizing temporal integration, and potentially serving as valuable tools in clinical applications: the length of the ICI, the difference between ICIs, and the regularity in the click train. These factors hold promise as potential biomarkers for mental disorders. To further explore this possibility, we investigated the coma patients (Fig. 6). The change response dramatically vanished (Fig. 6a), even in some cases, the onset response exists while no visible change response (Supplementary Fig. 6c), suggesting that the signal may reflect a state-dependent marker of consciousness. We acknowledge that our current sample size is limited, and these findings alone do not establish the change response as a definitive diagnostic or prognostic indicator. However, they point toward its potential as a neural correlate of awareness. Future studies with larger cohorts and longitudinal tracking will be necessary to determine whether this EEG marker can reliably predict recovery trajectories in patients with impaired consciousness. Beyond disorders of consciousness, we propose the use of transitional trains for the assessment of psychiatric conditions. Interestingly, the change response may recover as the patient get recovery (Supplementary Fig. 7). The observed change response to transitional train provides an innovative pathway for refining coma monitoring techniques. Further extending the clinical applicability of our research, we propose the use of transitional trains for the assessment of psychiatric conditions. As temporal integration, a central component of brain functionality^24,25, has been found to be compromised in conditions like schizophrenia²⁶, autism spectrum disorders^19,27, attention deficit hyperactivity disorder²⁸, and Parkinson’s Disease²⁹. Given these findings, the signal of temporal integration might be poised to emerge as a pivotal biomarker for broader clinical diagnostics.

Materials and methods

Experimental procedure and participants

The study comprised four experiments, all conducted in accordance with the Declaration of Helsinki (2013)⁵². Experiment 1 involved a total of 22 participants (14 males and 8 females, mean age: 29.36, standard deviation: 3.19). A total of 42 participants (20 males and 22 females, mean age: 23.36 years old, standard deviation: 2.55) participated in Experiment 2 (Sessions 1 and 3), Experiment 3, and Experiment 4. A total of 24 participants (14 males and 10 females, mean age: 24.87 years old, standard deviation: 7.43) participated in Sessions 2 and 4 of Experiment 2, who also served as the healthy control in comparison with coma patients in Experiment 5. Participants maintained a stationary head position while listening to auditory stimuli and responding via keyboard presses. Experiments including healthy participants were approved by the Institutional Review Board (IRB-20230131-R), and informed consent was obtained from all participants. Experiment 5 involved 22 coma participants with impaired consciousness (16 males and 6 females, mean age: 56.52 years, standard deviation: 15.96), including one participant who was recorded again after recovery from coma. The level of consciousness in coma participants was measured using the Coma Recovery Scale-Revised (CRS-R) scores. This experiment was approved by Natural Science Foundation of Zhejiang Provincial (LGF22H170006). All ethical regulations relevant to human research participants were followed. Each stimulus was repeated a minimum of 40 times to each participant across all experimental sessions.

Experiment 1

This was a gap detection task (Supplementary Fig. 1a) involving click trains of 1024 ms duration with varying ICIs (4, 8, 16, 32, 64, 128, 256 ms). Participants were positioned in a chair facing a keyboard and speaker. After each click train, a 100 ms 1000 Hz cue (100 ms) was presented 800 ms after the end of the click train. Participants were instructed to press the right key on the keyboard if a gap was detected in the click train and the left key if the click train was perceived as continuous. Keyboard press was valid within 700 ms after the cue onset. Experiment 1 aimed to determine the upper psychological threshold for perceiving temporally integrated click trains. We found that when the inter-click interval (ICI) exceeded 29.6 ms, participants began to perceive individual clicks rather than a unified auditory object. This threshold, together with neuronal data from previous studies^10,15, was used to constrain the ICI values employed in Experiments 2, 3, and 4. This approach ensured that the tested intervals fell within the perceptual range for temporal integration, as well as within the adaptation range for eliciting neuronal responses to each click.

Experiment 2

This included four passive listening sessions. Experiment 2 was designed to systematically examine the parameters influencing the change response and to elucidate the underlying nature of this neural response.

Session 1 (ICI contrast): This session consisted of five regular transitional trains (Reg_4-4, Reg_4-4.01, Reg_4-4.02, Reg_4-4.03, and Reg_4-4.06) and two irregular transitional trains (Irreg_4-4 and Irreg_4-4.06), and a tone-pair (Tone_250-246). In Fig. 1 (showing Reg_4-4.06 and Irreg_4-4.06) and Supplementary Fig. 2 (displaying Reg_4-4, Reg_4-4.06, Irreg_4-4, and Irreg_4-4.06), we focus on four transitional click trains to introduce the paradigm and illustrate the fundamental neural responses evoked by both regular and irregular stimuli. Analyses presented in Fig. 3d–f encompass all five regular transitional trains from this session, providing a detailed assessment of neural sensitivity to subtle differences in inter-click interval.
Session 2 (Insertion experiment): 1, 2, 4, 8, 16, and 32 intervals (each with an interval of 4.06 ms, not silent) were inserted into a click train with a 4 ms ICI (Fig. 2a). The inserted intervals were indeed slightly lengthened ICIs rather than silent gaps, preserving acoustic stimulation while altering temporal pattern. Controls included Reg_4-4 and Reg_4-4.06.
Session 3 (ICI length): Four regular transitional trains (Fig. 3a) were presented (Reg_4-4.06, Reg_8-8.12, Reg_16-16.24, and Reg_32-32.48), which altered ICI of train 1 but maintained a fixed ratio of ICI between train 1 and train 2.
Session 4 (ICI variance): Four Irreg_4-4.06 transition trains with different standard deviations (σ = µ/400, µ/200, µ/100, and µ/2, where σ represents the standard deviation and μ represents the mean value of ICI) and one Reg_4-4.06 (σ = 0) were randomly presented (Fig. 3g).

Experiment 3

was a change detection task, aiming to investigate both the perceptual and neural correlates of temporal integration, with particular emphasis on how rhythmic regularity and ICI contrast contribute to the formation and detection of integrated auditory objects. Participants were required to report whether the sound changed by pressing two designated keys within 2 s after the end of the stimuli, with left key representing change in the transitional stimulation and right key for no change (Fig. 4a). This experiment included five regular transitional trains (Reg_4-4, Reg_4-4.01, Reg_4-4.02, Reg_4-4.03, and Reg_4-4.06) and three irregular transitional trains (Irreg_4-4, Irreg_4-4.06, and Irreg_4-8), and two tone-pairs (Tone_250-250 and Tone_250-246, with 5-ms rise-fall edges).

Experiment 4

(Gap effect) was also a behavioral experiment on the effect of temporal gaps. A 600-ms gap between click train 1 and click train 2 resulted in Gap transitional trains (Fig. 5a). Participants identified if the two click trains were different and reported following the same rule in Experiment 3.

Experiment 5

involved a passive listening session in which coma patients were presented with two transitional click trains (Reg_4-4 and Reg_4-5; Fig. 6). For healthy controls, the same two transitional trains were presented in a corresponding session, serving as a direct comparison group for Experiment 5 (Fig. 6).

Auditory stimuli

The experiments were conducted within a sound-proof room. A single click consisted of a 0.2-ms pulse. Click trains were categorized as either regular, with a fixed ICI, or irregular, with random ICIs. For irregular click trains, the ICIs were randomized using a Gaussian distribution, and satisfied the following formula:

$$0.3\times {\mu }_{i}\le {I}_{i,j}\le 1.7\times {\mu }_{i} \\ 1.2\times {\mu }_{i}\le {\sum }_{n=j}^{j+1}{I}_{i,n}\le 3.1\times {\mu }_{i} \\ 1.8\times {\mu }_{i}\le {\sum }_{n=j}^{j+2}{I}_{i,n}\le 4.6\times {\mu }_{i}$$

where ${\mu }_{i}$ represents the average ICI (in milliseconds) of the $i$^th train in the transitional click train and ${I}_{i,j}$ is the $j$^th ICI in the $i$^th train. The mean value of the Gaussian distribution matched the fixed ICI of a regular click train, while the standard deviation was a certain percentage of the fixed ICI (0.25%, 0.5%, 1%, or 50%). A transitional click train is formed by concatenating two click trains. For example, a Reg_4-4.06 transitional train denotes the combination of two regular click trains: regular click train 1 (with an ICI of 4 ms) is seamlessly followed by regular click train 2 (with an ICI of 4.06 ms). Similarly, an irregular transitional train is composed of two irregular click trains with the given average ICIs. For continuous click trains that seamlessly transitioned from ICI₁ to ICI₂, the transition time was defined as the onset time of the first click after the first ICI₂ interval (Fig. 1b). The ratio of ICI₁ to ICI₂ quantified the difference level between the two ICIs. Auditory stimuli were delivered through the Golden Field M23 sound player, driven by a Creative AE-7 Sound Blaster, with a sampling rate of 384 kHz. Sound delivery was controlled with Psychtoolbox 3 in MATLAB. Sound intensity was calibrated to maintain a constant level of 60 dB SPL (sound pressure level), using a ¼-inch condenser microphone (Brüel & Kjær 4954, Nærum, Denmark) and a PHOTON/RT analyzer (Brüel & Kjær, Nærum, Denmark).

Data acquisition

Electroencephalogram (EEG) data of Experiment 2 (Sessions 1 and 3) and Experiments 3 and 4 were acquired using a 64-channel NeuroScan system (Compumedics, Australia). EEG data of Experiments 2 (Sessions 2 and 4) and Experiment 5 were acquired using a 64-channel NeuSenW system (Neuracle, China). The EEG cap schematics are depicted in Supplementary Fig. 9. In practice, we only used 59 electrodes for NeuSenW system recordings and 60 electrodes for NeuroScan system recordings. The ground electrodes for both systems were placed between Fpz and Fz in the frontal area. The reference electrode of the 64-channel NeuSenW wireless EEG cap was positioned between Cz and Pz, replacing CPz, while the reference electrode of the 64-channel NeuroScan Quick-Cap was placed between Cz and CPz. The EEG data were sampled at 1 kHz, and electrode placement followed the international 10-20 system protocol.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data analysis

Preprocessing

The data analyses were performed using MATLAB R2021b (MathWorks) and the Fieldtrip toolbox⁵³. Monopolar referencing was employed in this study, using the default single reference electrode of the EEG cap (Supplementary Fig. 9). The multichannel EEG data underwent several preprocessing steps. First, the full EEG data were filtered using a band-pass filter in the frequency range of 0.5 to 40 Hz. Then, 4-s epochs were obtained, spanning from -1 to 3 s relative to trial onset. Independent component analysis (ICA) was then applied to the epochs to remove electrooculogram (EOG). After ICA, baseline correction was applied by subtracting the mean response within the baseline window from -200 to 0 ms relative to the onset of train stimulation for each trial. Following this, a relative threshold was used to evaluate motion artifacts for each trial, excluding those exceeding predefined thresholds. The relative threshold was determined based on the percentage of bad samples within a trial. A sample was flagged as a “bad sample” if it fell outside the range of “Mean”±3 × “SD” for a trial across specific channels. Trials with over 20% of bad samples were labeled as “bad trials”. Additionally, channels with over 10% of bad trials were labeled as “bad channels”. Bad channels were initially excluded by assessing bad samples using data from all channels, and subsequently, bad trials were excluded from all channels by computing bad samples using data from the remaining good channels. Finally, the event-related potential (ERP) data were obtained by averaging the epoch data for each experimental condition, channel, and subject. Prior to applying inter-subject analysis, ERP data were normalized by each channel’s standard deviation per subject to reduce inter-channel variability.

Permutation test

For time-sample level comparisons of ERP or global field power (GFP) between two conditions, a two-tailed cluster-based permutation test was conducted using the ‘ft_timelockstatistics’ function from the FieldTrip toolbox in MATLAB. The procedure was as follows: 1) A two-tailed independent samples t-test was performed at each time point and channel to compute the t-value comparing Dataset A and B. 2) A cluster-defining threshold (p < 0.05) was applied to identify significant time points. Time points exceeding this threshold were marked as “candidate significant points”. Clusters were then formed by grouping consecutive significant time points and neighboring significant electrodes (for GFP comparison, only in the time dimension), using a Statistical Parametric Mapping (SPM) labeling algorithm⁵⁴. 3) The condition labels for Dataset A and B were randomly permuted. For each permutation, a new t-value matrix was calculated, and clusters were re-formed using the same threshold. The largest cluster-level statistic (e.g., sum of t-values) from each permutation was retained, forming a null distribution. 4) Finally, the observed clusters from step 2 were compared to the null distribution. A cluster was deemed significant if its cluster-level statistic was >95% of permuted clusters (p < 0.05). This permutation test was conducted at the inter-subject level within [-200, 600] ms relative to change point. For example, Dataset A represented ERP/GFP data under Reg_4-4.06 from 42 subjects, while Dataset B represented ERP/GFP data under Reg_4-4 from the same subjects.

Quantification of change response

The change response comprised two major ERP components, cP1 and cN2 (Supplementary Fig. 2c and Fig. 1d), which exhibited opposite polarities in the frontal and temporal-parietal-occipital scalp regions (Supplementary Fig. 2a). We identified these EEG components based on peak detection in the GFP averaged across subjects. For instance, the first peak (118 ms) of the GFP following the change point of Reg_4-4.06 was identified as the cP1 component, with the second peak (198 ms) as the cN2 component (red curve, Supplementary Fig. 2c). The P1 component of the onset response was peaked at 95 ms after the onset of the click train with 4-ms ICI, and the N2 component was peaked at 158 ms. To quantify individual components (e.g., cP1 of Reg_4-4.06 in Supplementary Fig. 2, and P1/cP1/N2/cN2 in Fig. 5f, g) for each electrode, we calculated the mean ERP value within a [-25, 25] ms time window centered around the peak time identified from GFP, subtracting the baseline mean ([-200, 0] ms relative to change point or onset). For a global quantification of the entire change or onset response across the scalp, we used the relative response magnitude (RM), defined as the root mean square (RMS) of the ERP over a specific time window encompassing both peak and trough responses, with the baseline RMS subtracted. The RM calculation proceeded as follows:

$${{\rm{R}}}{{{\rm{M}}}}_{i}\left[{t}_{1},{t}_{2}\right]=\sqrt{\frac{\mathop{\sum }_{t={t}_{1}}^{{t}_{2}}{\left[{r}_{i}\left(t\right)\right]}^{2}}{n}}$$

where ${r}_{i}\left(t\right)$ represents the ERP of the ${i}^{{th}}$ channel at time $t$, $n$ represents the number of samples within the time window from ${t}_{1}$ and ${t}_{2}$, and ${{\rm{R}}}{{{\rm{M}}}}_{i}$ is the response magnitude of channel $i$. The relative RM was defined as:

$${{\rm{R}}}{{{\rm{M}}}}_{i}={{\rm{R}}}{{{\rm{M}}}}_{i}\left[{t}_{1},{t}_{2}\right]-{{\rm{R}}}{{{\rm{M}}}}_{i}\left[-200,0\right]$$

The 200-ms baseline before change represented the steady state response of click train 1. The analysis time window for the change response was determined using a two-tailed cluster-based permutation test of the GFP data between the transitional click trains Reg_4-4.06 and Reg_4-4 at the time-sample level (Supplementary Fig. 2c). Specifically, the change response time windows were [74, 251] ms relative to change point for all passive listening sessions. The relative RMs of all channels were then averaged for each subject to facilitate comparison across experimental conditions (e.g., tunings and scatterplots).

To quantify change responses in subjects with impaired consciousness, GFP was employed due to its robustness to spatial variability and enhanced sensitivity to response latency. GFP calculates the differences in potential across all electrodes at each sampling point, thereby mitigating the influence of spatial variability in electrode placement or individual differences in brain anatomy. The calculation of GFP follows:

$${{\rm{GFP}}}\left(t\right)=\sqrt{\frac{\mathop{\sum }_{i=1}^{N}{\left[{r}_{i}\left(t\right)-\bar{r}\left(t\right)\right]}^{2}}{N}}$$

where $N$ is the total number of channels, ${r}_{i}\left(t\right)\,$ represents the ERP of the ${i}^{{th}}$ channel at time $t$, and $\bar{r}\left(t\right)$ denotes the averaged response across all channels at time $t$. The difference between the maximum GFP value detected within 300 ms after the change point of the train stimulation and the mean GFP value across the 200-ms baseline response before change was used as an indicator of the change response.

Psychological threshold

The ratio of change detection for each group was calculated by dividing the number of trials in which the subject pressed the left arrow key (indicating change detection) by the total number of trials in that group. Subjects with a ratio of change detection exceeding 0.3 in the control group or less than 0.6 in the Reg_4-4.06 group were excluded from behavior-related analyses, including both No-gap and Gap conditions. In total, 36 subjects (out of 42 who participated in Experiments 3 and 4) were included in the stand-alone analyses of the No-gap change detection task. In the comparison of the Gap and No-gap change detection tasks, only the intersection of subjects (n = 34) was included. Psychometric functions were fitted to data using a cumulative Gaussian function^55,56:

$$p\left(r\right)=\frac{1}{\sigma \sqrt{2{{\rm{\pi}}}}}{\int}^{r}_{-\infty}{e}^{-\frac{{\left(x-\mu \right)}^{2}}{2{{{\rm{\sigma}}}}^{2}}}{dx}$$

where $p\left(r\right)$ represents the ratio of change detection as a function of ICI $r$. $\mu$ is the Gaussian mean, and $\sigma$ is the standard deviation (SD). The psychological threshold of change detection was defined as 0.6 of the Gaussian fit (Fig. 5c). This curve fitting procedure was achieved using ‘psignifit’ software package (see http://bootstrap-software.org/psignifit/) for MATLAB. Similarly, the psychological threshold of gap detection (0.4 of the Gaussian fit, Supplementary Fig. 1b) was obtained with the same fitting procedure.

Assessment of consciousness

The level of consciousness in coma participants was assessed using the Coma Recovery Scale–Revised (CRS-R)⁵⁷, a standardized behavioral assessment tool specifically designed to differentiate among disorders of consciousness (DoC), including coma, vegetative state/unresponsive wakefulness syndrome (VS/UWS), and minimally conscious state (MCS). The CRS-R comprises six subscales evaluating auditory, visual, motor, oromotor/verbal, communication, and arousal functions. Each subscale includes hierarchically arranged items, with scores reflecting the presence or absence of specific behavioral responses. The total CRS-R score ranges from 0 to 23, with higher scores indicating greater levels of behavioral responsiveness and awareness. Assessments were conducted by trained clinicians under standardized conditions to ensure reliability. The CRS-R score for each coma participant was obtained just before the EEG recording. In total, 20 out of 22 coma participants completed the CRS-R assessment.

Source reconstruction

EEG source reconstruction was performed to compare the neural generators of change responses between Reg_4-4 and Reg_4-4.06 within the [0, 300] ms window following the change point (Session 1, Experiment 2). Covariance matrices were computed from ERP data for each condition, using data from all 42 subjects. A standard boundary element method (BEM) head model (standard_bem.mat) and a standard MRI volume (single_subj_T1_1mm.nii)—both provided in the FieldTrip template directory—were used to model the head and brain anatomy. The MRI and head model were aligned to the MNI coordinate system and resliced with a spatial resolution of 1 mm. Electrode positions were based on the standard 10-20 system (NeuroScan 64-channel Quick-Cap), with a total of 60 electrodes included after excluding M1, M2, CB1, and CB2. A regular three-dimensional source grid with 10 mm resolution was constructed in MNI space. Regions of interest (ROIs) were selected based on the AAL atlas (ROI_MNI_V4.nii), excluding brainstem and most cerebellar areas. The leadfield matrix was computed using the electrode configuration and the BEM head model, with normalization applied. Source reconstruction was performed using exact low-resolution electromagnetic tomography (eLORETA), based on the ERP-derived covariance and thee precomputed leadfield. This procedure was repeated for each subject and condition.

To statistically locate the source of change responses, voxel-wise paired two-tailed t-tests were performed across subjects, with significance assessed using a cluster-based permutation test corrected at p < 0.05. The relative change in source power between conditions was quantified using the normalized difference:

$$\Delta {S}_{{norm}}=\frac{{S}_{2}-{S}_{1}}{{S}_{1}+{S}_{2}}$$

where ${S}_{1}$ and ${S}_{2}$ denote the average source power for Reg_4-4 and Reg_4-4.06, respectively.

Statistics and reproductivity

All statistical analyses were performed using custom scripts in MATLAB R2021b (MathWorks). Data are presented as mean $\pm \,$ standard error of the mean (SEM), unless otherwise stated. All statistical tests used in this study were two-tailed, and the threshold for statistical significance was set at p < 0.05.

For all analyses, “n” refers to the number of subjects, as specified in figure legends. Replicates were defined as independent biological replicates, corresponding to different subjects each performing the same experimental procedure.

For the identification of channels showing significant change responses (e.g., topographic plot of RM in Supplementary Fig. 2a), we computed the relative RM of change responses for each channel and subject and performed a two-tailed one-sample t-test against zero. P-values were corrected for multiple comparisons across channels using the Benjamini-Yekutieli false discovery rate (FDR) procedure⁵⁸. Significant channels were marked with black dots in topographic plots (e.g., Supplementary Fig. 2a and Fig. 3c, f, and i).

To examine the difference in cP1 responses under Reg_4-4.01 between correct and incorrect trials in the No-gap behavioral task (n = 10 for change detection ratio between 0.3 and 0.7), we used two-tailed Wilcoxon signed-rank tests on the average cP1 amplitude (calculated over [138, 168] ms relative to the change point and centered around the estimated cP1 peak; see Supplementary Fig. 4) for each channel.

For other pairwise comparisons (RM and behavioral comparisons), statistical significance was tested using two-tailed paired t-tests. Exact t-values, degrees of freedom, and effect sizes (Cohen’s d) are reported in the Results section. Cohen’s d was computed as:

$$d=\frac{\bar{X}-\bar{Y}}{\sqrt{\frac{1}{N-1}\mathop{\sum }_{i=1}^{N}{\left({X}_{i}-{Y}_{i}-\left(\bar{X}-\bar{Y}\right)\right)}^{2}}}$$

where $X$ and $Y$ are paired samples with a total number of $N$ (with $Y$ = 0 for one-sample t-test). ${X}_{i}$ and ${Y}_{i}$ are the $i$^th elements from the paired samples. $\bar{X}$ and $\bar{Y}$ are mean values of the samples.

To test for tuning effects of RM across conditions (Reg_4-4.01, Reg_4-4.02, Reg_4-4.03, and Reg_4-4.06) in the No-gap and Gap tasks, one-way ANOVAs were performed. Effect size for the one-way ANOVA was measured by eta-squared (${\eta }^{2}$), defined as:

$${\eta }^{2}=\frac{S{S}_{{between}}}{S{S}_{{total}}}$$

where $S{S}_{{between}}$ is the sum of squared variation due to the factor (between groups) and $S{S}_{{total}}$ is the total sum of squares.

To test the relationship between EEG responses and the level of consciousness, we computed Pearson’s correlation coefficients between CRS-R scores from 20 coma subjects and their GFP responses following sound onset and sound change. No significant correlations were observed for either onset or change responses.

In addition, Bayesian statistical analyses were conducted to quantify evidence for the null hypothesis across different types of comparisons. For pairwise comparisons, we used Bayesian paired-sample or independent-sample t-tests with a Jeffreys-Zellner-Siow (JZS) prior, as described by Rouder et al. (2009)⁵⁹. For group comparisons involving more than two levels, we employed Bayesian one-way ANOVA, using default Cauchy priors on effect sizes (Rouder et al.)⁶⁰. To assess associations between continuous variables, we conducted Bayesian correlation analyses based on a Jeffreys prior for Pearson’s correlation coefficient (Ly et al.)⁶¹. All Bayes factors (BF₁₀) were computed using the BayesFactor toolbox⁶² for MATLAB. A BF₁₀ > 1 indicates evidence in favor of the alternative hypothesis, whereas a BF₁₀ < 1 indicates evidence in favor of the null.

Data availability

All source data supporting the findings of this study are publicly available at Zenodo under the following https://doi.org/10.5281/zenodo.15795285⁶³.

References

Schnupp, J., Nelken, I. & King, A. J. Auditory Neuroscience: Making Sense of Sound. (The MIT Press, 2010).
ten Cate, C. & Spierings, M. Rules, rhythm and grouping: auditory pattern perception by birds. Animal Behav. 151, 249–257 (2019).
Article Google Scholar
Moore, B. C. J. Temporal integration and context effects in hearing. J. Phonetics 31, 563–574 (2003).
Article Google Scholar
Gao, R., van den Brink, R. L., Pfeffer, T. & Voytek, B. Neuronal timescales are functionally dynamic and shaped by cortical microarchitecture. Elife 9, e61277 (2020).
Ding, N., Melloni, L., Zhang, H., Tian, X. & Poeppel, D. Cortical tracking of hierarchical linguistic structures in connected speech. Nat. Neurosci. 19, 158–164 (2016).
Article CAS PubMed Google Scholar
Norman-Haignere, S. V. et al. Multiscale temporal integration organizes hierarchical computation in human auditory cortex. Nat. Hum. Behav. 6, 455–469 (2022).
Article PubMed PubMed Central Google Scholar
Cusack, R. The intraparietal sulcus and perceptual organization. J. Cogn. Neurosci. 17, 641–651 (2005).
Article PubMed Google Scholar
Gutschalk, A. et al. Neuromagnetic correlates of streaming in human auditory cortex. J. Neurosci. 25, 5382–5388 (2005).
Article CAS PubMed PubMed Central Google Scholar
Kondo, H. M. & Kashino, M. Involvement of the thalamocortical loop in the spontaneous switching of percepts in auditory streaming. J. Neurosci. 29, 12695–12701 (2009).
Article CAS PubMed PubMed Central Google Scholar
Lu, T., Liang, L. & Wang, X. Temporal and rate reresentations of time-varying signals in the auditory cortex of awake primates. Nat. Neurosci. 4, 1131–1138 (2001).
Article CAS PubMed Google Scholar
Steinschneider, M., Reser, D. H., Fishman, Y. I., Schroeder, C. E. & Arezzo, J. C. Click train encoding in primary auditory cortex of the awake monkey: evidence for two mechanisms subserving pitch perception. J. Acoust. Soc. Am. 104, 2935–2955 (1998).
Article CAS PubMed Google Scholar
Lutkenhoner, B. & Patterson, R. D. Disruption of the auditory response to a regular click train by a single, extra click. Exp. Brain Res. 233, 1875–1892 (2015).
Article PubMed Google Scholar
Balaguer-Ballester, E., Denham, S. L. & Meddis, R. A cascade autocorrelation model of pitch perception. J. Acoust. Soc. Am. 124, 2186–2195 (2008).
Article PubMed Google Scholar
Yost, W. A., Mapes-Riordan, D., Shofner, W., Dye, R. & Sheft, S. Pitch strength of regular-interval click trains with different length “runs” of regular intervals. J. Acoust. Soc. Am. 117, 3054–3068 (2005).
Article PubMed Google Scholar
Nourski, K. V. et al. Coding of repetitive transients by auditory cortex on posterolateral superior temporal gyrus in humans: an intracranial electrophysiology study. J. Neurophysiol. 109, 1283–1295 (2013).
Article PubMed Google Scholar
Kolvoort, I. R., Wainio-Theberge, S., Wolff, A. & Northoff, G. Temporal integration as “common currency” of brain and self-scale-free activity in resting-state EEG correlates with temporal delay effects on self-relatedness. Hum. Brain Mapp. 41, 4355–4374 (2020).
Article PubMed PubMed Central Google Scholar
Revach, D. & Salti, M. Consciousness as the temporal propagation of information. Front Syst Neurosci 16, 759683 (2022).
Article PubMed PubMed Central Google Scholar
Asai, R. et al. Abnormal temporal window of integration in auditory sensory memory in Schizophrenia. Clin. EEG Neurosci. 56, 100–105 (2025).
Article PubMed Google Scholar
Stevenson, R. A. et al. Multisensory temporal integration in autism spectrum disorders. The J. Neurosci. 34, 691–697 (2014).
Article CAS PubMed Google Scholar
Akshaya, R. et al. Temporal Order Judgment (TOJ) in Parkinson’s disease. Park. Relat. Disord. 134, 107721 (2025).
Bekinschtein, T. A. et al. Neural signature of the conscious processing of auditory regularities. Proc. Natl. Acad. Sci. USA 106, 1672–1677 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jones, S. J. et al. Auditory evoked potentials to spectro-temporal modulation of complex tones in normal subjects and patients with severe brain injury. Brain 123, 1007–1016 (2000).
Article PubMed Google Scholar
Song, P. et al. Temporal merging into pitch with click train in the macaque auditory cortex. Natl. Sci. Rev. 12, nwaf026 (2025).
Article PubMed PubMed Central Google Scholar
Mauk, M. D. & Buonomano, D. V. The neural basis of temporal processing. Annu. Rev. Neurosci. 27, 307–340 (2004).
Article CAS PubMed Google Scholar
D’Argembeau, A., Jeunehomme, O., Majerus, S., Bastin, C. & Salmon, E. The neural basis of temporal order processing in past and future thought. J. Cogn. Neurosci. 27, 185–197 (2015).
Article PubMed Google Scholar
Su, L. et al. Temporal perception deficits in schizophrenia: integration is the problem, not deployment of attentions. Sci. Rep. 5, 9745 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nakano, T., Ota, H., Kato, N. & Kitazawa, S. Deficit in visual temporal integration in autism spectrum disorders. Proc. Biol. Sci. 277, 1027–1030 (2010).
PubMed Google Scholar
Panagiotidi, M., Overton, P. G. & Stafford, T. Multisensory integration and ADHD-like traits: evidence for an abnormal temporal integration window in ADHD. Acta Psychol. (Amst) 181, 10–17 (2017).
Article PubMed Google Scholar
Tokushige, S. I. et al. Does the clock tick slower or faster in parkinson’s disease? - insights gained from the synchronized tapping task. Front. Psychol. 9, 1178 (2018).
Article PubMed PubMed Central Google Scholar
Skrandies, W. Global field power and topographic similarity. Brain Topogr. 3, 137–141 (1990).
Article CAS PubMed Google Scholar
Giannopoulos, A. E. et al. Early auditory-evoked potentials in body dysmorphic disorder: an ERP/sLORETA study. Psychiatry Res. 299, 113865 (2021).
Article PubMed Google Scholar
Krumbholz, K., Patterson, R. D. & Pressnitzer, D. The lower limit of pitch as determined by rate discrimination. J. Acoust. Soc. Am. 108, 1170–1180 (2000).
Article CAS PubMed Google Scholar
Phillips, D. P., Dingle, R. N., Hall, S. E. & Jang, M. Dual mechanisms in the perceptual processing of click train temporal regularity. J. Acoust. Soc. Am. 132, EL22–EL28 (2012).
Article PubMed Google Scholar
Oshurkova, E., Scheich, H. & Brosch, M. Click train encoding in primary and non-primary auditory cortex of anesthetized macaque monkeys. Neuroscience 153, 1289–1299 (2008).
Article CAS PubMed Google Scholar
Bendor, D. & Wang, X. Differential neural coding of acoustic flutter within primate auditory cortex. Nat. Neurosci. 10, 763–771 (2007).
Article CAS PubMed Google Scholar
Song, P. et al. A new function of offset response in the primate auditory cortex: marker of temporal integration. Commun. Biol. 7, 1350 (2024).
Article PubMed PubMed Central Google Scholar
Nakamura, T. et al. Characteristics of auditory steady-state responses to different click frequencies in awake intact macaques. BMC Neurosci. 23, 57 (2022).
Article PubMed PubMed Central Google Scholar
Presacco, A., Bohorquez, J., Yavuz, E. & Ozdamar, O. Auditory steady-state responses to 40 Hz click trains: relationship to middle latency, gamma band and beta band responses studied with deconvolution. Clin. Neurophysiol. 121, 1540–1550 (2010).
Article PubMed Google Scholar
Neklyudova, A., Kuramagomedova, R., Voinova, V. & Sysoeva, O. Atypical brain responses to 40-Hz click trains in girls with Rett syndrome: auditory steady-state response and sustained wave. Psychiatry Clin. Neurosci. 78, 282−290 (2024).
Li, R. et al. Holistic bursting cells store long-term memory in auditory cortex. Nat. Commun. 14, 8090 (2023).
Article CAS PubMed PubMed Central Google Scholar
Wang, M. et al. Single-neuron representation of learned complex sounds in the auditory cortex. Nat. Commun. 11, 4361 (2020).
Article CAS PubMed PubMed Central Google Scholar
Barascud, N., Pearce, M. T., Griffiths, T. D., Friston, K. J. & Chait, M. Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns. Proc. Natl. Acad. Sci. USA 113, E616–E625 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. H. C., Nastase, S. A. & Hasson, U. Information flow across the cortical timescale hierarchy during narrative construction. Proc. Natl. Acad. Sci. USA 119, e2209307119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lerner, Y., Honey, C. J., Silbert, L. J. & Hasson, U. Topographic mapping of a hierarchy of temporal receptive windows using a narrated story. J. Neurosci. 31, 2906–2915 (2011).
Article CAS PubMed PubMed Central Google Scholar
Carlyon, R. P. & Shackleton, T. M. Comparing the fundamental frequencies of resolved and unresolved harmonics: evidence for two pitch mechanisms? J. Acoustical Soc. Am. 95, 3541–3554 (1994).
Article Google Scholar
Shackleton, T. M. & Carlyon, R. P. The role of resolved and unresolved harmonics in pitch perception and frequency modulation discrimination. J. Acoust. Soc. Am. 95, 3529–3540 (1994).
Article CAS PubMed Google Scholar
Plack, C. J., Oxenham, A. J. & Fay, R. R. Pitch: Neural Coding and Perception, Vol. 24 (Springer Science & Business Media, 2006).
Carlyon, R. P. & Deeks, J. M. Limitations on rate discrimination. J. Acoust. Soc. Am. 112, 1009–1025 (2002).
Article PubMed Google Scholar
Macherey, O. & Carlyon, R. P. Re-examining the upper limit of temporal pitch. J. Acoust. Soc. Am. 136, 3186–3199 (2014).
Article PubMed Google Scholar
Richardson, M. L. et al. Temporal pitch sensitivity in an animal model: psychophysics and scalp recordings: temporal pitch sensitivity in cat. J. Assoc. Res. Otolaryngol. 23, 491–512 (2022).
Article PubMed PubMed Central Google Scholar
Ungan, P. & Yagcioglu, S. Significant variations in Weber fraction for changes in inter-onset interval of a click train over the range of intervals between 5 and 300 ms. Front. Psychol. 5, 1453 (2014).
Article PubMed PubMed Central Google Scholar
Duncan, N. Declaration of Helsinki. World Med. J. 310, 2191−4(2013).
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J. M. FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 156869 (2011).
Article PubMed Google Scholar
Thurfjell, L., Bengtsson, E. & Nordin, B. A new three-dimensional connected components labeling algorithm with simultaneous object feature extraction capability. CVGIP: Graph. Model Image Process. 54, 357–364 (1992).
Google Scholar
Yu, X. J., Dickman, J. D., DeAngelis, G. C. & Angelaki, D. E. Neuronal thresholds and choice-related activity of otolith afferent fibers during heading perception. Proc. Natl. Acad. Sci. USA 112, 6467–6472 (2015).
Article CAS PubMed PubMed Central Google Scholar
Xu, X. X., Zhai, Y. Y., Kou, X. K. & Yu, X. Adaptation facilitates spatial discrimination for deviant locations in the thalamic reticular nucleus of the rat. Neuroscience 365, 1–11 (2017).
Article CAS PubMed Google Scholar
Giacino, J. T., Kalmar, K. & Whyte, J. The JFK coma recovery scale-revised: measurement characteristics and diagnostic utility. Arch. Phys. Med. Rehabil. 85, 2020–2029 (2004).
Article PubMed Google Scholar
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Annal. Stat. 29, 1165–1188 (2001).
Article Google Scholar
Rouder, J. N., Speckman, P. L., Sun, D., Morey, R. D. & Iverson, G. Bayesian t tests for accepting and rejecting the null hypothesis. Psychon. Bull. Rev. 16, 225–237 (2009).
Article PubMed Google Scholar
Rouder, J. N., Morey, R. D., Speckman, P. L. & Province, J. M. Default Bayes factors for ANOVA designs. J. Math. Psychol. 56, 356–374 (2012).
Article Google Scholar
Ly, A., Verhagen, J. & Wagenmakers, E.-J. Harold Jeffreys’s default Bayes factor hypothesis tests: Explanation, extension, and application in psychology. J. Math. Psychol. 72, 19–32 (2016).
Article Google Scholar
Krekelberg, B. klabhub/bayesFactor: Bayes only (v3.0). Zenodo https://doi.org/10.5281/zenodo.13744717 (2024).
Xu, H. Dataset of Exps 2–5 – EEG neural indicator of temporal integration in the human auditory brain with clinical implications. Zenodo https://doi.org/10.5281/zenodo.15795285 (2025).

Download references

Acknowledgements

We are grateful to Xiaokai Kou and Fujin Gao and for their help with the experiments. This work was supported by STI2030-Major Projects (2022ZD0204600 and 2022ZD0204800) (to X.Y.); National Natural Science Foundation of China 32171044 (to X.Y.), 32100827 (to Y.Z.), and 32271078 (to L.C.); Key Support Discipline Construction Project of Shanghai Municipal Health Commission 2023ZDFC0203 (to X.Z.); Zhejiang Provincial Natural Science Foundation of China under Grant No. LGF20H020010 (to W.W.); Natural Science Foundation of Zhejiang Provincial LGF22H170006 (to Li Zhang).

Author information

These authors contributed equally: Haoxuan Xu, Qianyue Huang, Peirun Song.

Authors and Affiliations

Department of Anesthesia, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
Haoxuan Xu, Qianyue Huang, Peirun Song, Yanxin Chen, Qiuyu Li, Yuying Zhai, Xinyu Du, Hangting Ye & Xiongjie Yu
Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
Haoxuan Xu, Qianyue Huang, Peirun Song & Xiongjie Yu
College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
Haoxuan Xu, Qianyue Huang, Xuehui Bao, Ishrat Mehmood, Hisashi Tanigawa & Xiongjie Yu
Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, Hangzhou, China
Haoxuan Xu & Xiongjie Yu
Department of Anesthesiology, Shanghai Tenth People’s Hospital, Tongji University School of Medicine, Shanghai, China
Wanqiu Niu, Zhiyi Tu, Pei Chen, Tingting Zhang, Lingling Zhang, Xuan Zhao & Xiongjie Yu
Center for Rehabilitation Medicine, Rehabilitation and Sports Medicine Research Institute of Zhejiang Province, Department of Rehabilitation Medicine, Zhejiang Provincial People’s Hospital, Affiliated People’s Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Li Zhang & Wanshun Wen
Department of Psychology and Behavioral Sciences, Zhejiang University, Hangzhou, China
Liyu Cao
The State Key Lab of Brain-Machine Intelligence, Zhejiang University, Hangzhou, China
Liyu Cao

Authors

Haoxuan Xu
View author publications
Search author on:PubMed Google Scholar
Qianyue Huang
View author publications
Search author on:PubMed Google Scholar
Peirun Song
View author publications
Search author on:PubMed Google Scholar
Yanxin Chen
View author publications
Search author on:PubMed Google Scholar
Qiuyu Li
View author publications
Search author on:PubMed Google Scholar
Yuying Zhai
View author publications
Search author on:PubMed Google Scholar
Xinyu Du
View author publications
Search author on:PubMed Google Scholar
Hangting Ye
View author publications
Search author on:PubMed Google Scholar
Xuehui Bao
View author publications
Search author on:PubMed Google Scholar
Ishrat Mehmood
View author publications
Search author on:PubMed Google Scholar
Hisashi Tanigawa
View author publications
Search author on:PubMed Google Scholar
Wanqiu Niu
View author publications
Search author on:PubMed Google Scholar
Zhiyi Tu
View author publications
Search author on:PubMed Google Scholar
Pei Chen
View author publications
Search author on:PubMed Google Scholar
Tingting Zhang
View author publications
Search author on:PubMed Google Scholar
Lingling Zhang
View author publications
Search author on:PubMed Google Scholar
Xuan Zhao
View author publications
Search author on:PubMed Google Scholar
Li Zhang
View author publications
Search author on:PubMed Google Scholar
Wanshun Wen
View author publications
Search author on:PubMed Google Scholar
Liyu Cao
View author publications
Search author on:PubMed Google Scholar
Xiongjie Yu
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, X.Y.; Methodology, X.Y., P.S., H.X., and H.Y.; Investigation and Data Curation, X.Y., P.S., H.X., Y.C., Q.L., H.Y., X.D., Y.Z., X.B., Q.H., I.M., H.T., W.N., Z.T., P.C., T.Z., Lin.Z. and X.Z.; Writing—Original Draft, X.Y., P.S., H.X. and H.Y.; Writing—Review and Editing, X.Y., X.Z., P.S., H.X., H.Y., and L.C.; Funding Acquisition, L.C., X.Z., W.W., Li.Z., X.Y., and Y.Z.; Resources, W.W. and X.Y.; Software, P.S., H.X. and H.Y.; Validation and Visualization, P.S., H.X. and H.Y.; Supervision, X.Y.; L.C.; W.W. and X.Z.

Corresponding authors

Correspondence to Wanshun Wen, Liyu Cao or Xiongjie Yu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Yamil Vidal and the other, anonymous, reviewers for their contribution to the peer review of this work. Primary Handling Editors: Helen Blank and Jasmine Pan. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Information

Description of Additional Supplementary Files

Supplementary Audio 1

Reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Xu, H., Huang, Q., Song, P. et al. EEG neural indicator of temporal integration in the human auditory brain with clinical implications. Commun Biol 8, 1109 (2025). https://doi.org/10.1038/s42003-025-08540-8

Download citation

Received: 17 January 2025
Accepted: 15 July 2025
Published: 26 July 2025
DOI: https://doi.org/10.1038/s42003-025-08540-8

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Temporal merging into auditory objects with click trains

Factors affecting change response

Perception performance during transitional click train

The effect of temporal continuity

Potential clinical application

Discussion

Change response in transitional click train as an indicator of temporal integration

Behavior relevance during transitional click train

Change response as a biomarker in clinical application

Materials and methods

Experimental procedure and participants

Experiment 1

Experiment 2

Experiment 3

Experiment 4

Experiment 5

Auditory stimuli

Data acquisition

Reporting summary

Data analysis

Preprocessing

Permutation test

Quantification of change response

Psychological threshold

Assessment of consciousness

Source reconstruction

Statistics and reproductivity

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links