Abstract
We can often anticipate the precise moment when a stimulus will be relevant for our behavioral goals. Voluntary temporal attention, the prioritization of sensory information at task-relevant time points, enhances visual perception. However, the neural mechanisms of voluntary temporal attention have not been isolated from those of temporal expectation, which reflects timing predictability rather than relevance. Here we use time-resolved steady-state visual evoked responses (SSVER) to investigate how temporal attention dynamically modulates visual activity when temporal expectation is controlled. We recorded magnetoencephalography while participants directed temporal attention to one of two sequential grating targets with predictable timing. Meanwhile, a co-localized SSVER probe continuously tracked visual cortical modulations both before and after the target stimuli. We find that in the pre-target period, the SSVER gradually ramps up as the targets approach, reflecting temporal expectation. Furthermore, we find a low-frequency modulation of the SSVER, which shifts approximately half a cycle in phase according to which target is attended. In the post-target period, temporal attention to the first target transiently modulates the SSVER shortly after target onset. Thus, temporal attention dynamically modulates visual cortical responses via both periodic pre-target and transient post-target mechanisms to prioritize sensory information at precise moments.
Similar content being viewed by others
Introduction
We can often anticipate the points in time at which sensory events will be relevant for our behavior. For example, when playing table tennis, it is critical to see the ball at the moment it bounces on the table; only seeing the ball half a second earlier or later would be unhelpful1. Voluntary temporal attention is the deliberate prioritization of a point in time that we know in advance will be relevant for our behavioral goals2.
Voluntary temporal attention involves prioritizing a stimulus on the basis of both its timing and its relevance to the participant’s task3,4,5. These two features distinguish voluntary temporal attention from other dynamic phenomena6. For example, a task-relevant stimulus may appear at an unpredictable time, eliciting reactive attentional dynamics—as in the attentional blink7,8,9—rather than the proactive dynamics of voluntary temporal attention. Alternatively, an irrelevant stimulus may appear at a predictable time, eliciting temporal expectation but not attentional prioritization6,10,11,12.
Voluntary temporal attention has been shown to improve behavioral performance3,4,13,14 and affect microsaccades11,15 over and above the effects of temporal expectation, suggesting dissociable neural mechanisms. Likewise, in the domains of visual spatial and feature-based attention, as well as auditory temporal attention, relevance, and predictability can have distinct behavioral and neural effects10,16,17,18,19,20,21,22,23,24,25. Finally, spatial26,27 and feature-based28 attention have larger effects on visual responses to targets that appear at expected vs. unexpected times, further suggesting that temporal attention and expectation may have separable effects on visual cortex.
The neural mechanisms specific to voluntary temporal attention are unclear, however, because previous neural studies have typically manipulated visual temporal attention and temporal expectation together, by presenting a single task-relevant target at a more or less predictable time29,30,31,32,33,34. As a result, the neural modulations observed could be related to timing predictability, time point relevance, or a combination of the two. Such studies have found increased visual responses to temporally anticipated targets31,33 as well as slow ramping activity in dorsal stream cortical and subcortical areas35,36,37,38 that peaked around the time of an anticipated event. These ramping signals could be related to time-keeping and top-down control of anticipatory processes, including motor preparation. Temporal expectation has also been associated with neural oscillations in sensory and fronto-parietal cortices, particularly in the delta (~0.5–4 Hz) range, both during rhythmic stimulation at the delta frequency39,40,41 and in predictable interval timing tasks42.
Our normalization model of dynamic attention, which was fit to psychophysical data3,4,14, predicts that voluntary temporal attention will change the gain of visual responses rapidly and transiently, roughly coincident with the evoked sensory response to the visual stimulus3, rather than slowly and gradually before the stimulus appears. Importantly, sensory gain modulations that multiplicatively scale neural response amplitudes can only be measured in the presence of non-zero visual activity, which can be elicited by a probe stimulus. A few studies of temporal expectation presented probe stimuli during a pre-target period and found anticipatory modulation of sensory activity, including ramping up of gamma band signals in V143, ramping up of responses to auditory probe stimuli in A144, and spatial attentional modulation that depended on task timing in V427, consistent with anticipatory gain modulation3,45. However, it remains unclear whether voluntary temporal attention still modulates visual gain when expectation is controlled and what the dynamics of any such modulation may be.
Here we investigated how temporal attention dynamically modulates visual cortical responses using magnetoencephalography (MEG). Human participants performed a challenging orientation discrimination task in which voluntary temporal attention was manipulated independent of temporal expectation, which was constant. A precue directed temporal attention to one of two brief, sequential grating targets that were presented at fixed, predictable times. To measure attentional modulation continuously across time, we used time-resolved steady-state visual evoked responses (SSVER)46,47,48,49,50,51. The grating targets were superimposed on a 20 Hz flickering noise probe, which generated a continuous SSVER signal52,53. We tracked this signal throughout the trial—both leading up to and following the targets—by calculating the inter-trial phase coherence (ITPC) at the probe flicker frequency in a time-resolved fashion. ITPC is the primary driver of SSVER power52,53,54,55 and indexes the reliability with which the visual stimulus drives cortical responses: a more reliable neural response to the stimulus will yield a higher phase coherence across trials52,56.
In this work, we find dynamic task-related modulations of visual cortical responses leading up to and following target stimuli. Visual cortical ITPC slowly ramps up in anticipation of the predictable target stimuli regardless of the precue, an overall effect of temporal expectation. Furthermore, we find a low-frequency (2 Hz) modulation of the ITPC in the pre-target period, which shifts approximately half a cycle in phase according to which target is attended. In the post-target period, temporal attention to the first target transiently modulates ITPC shortly after target onset, increasing the peak response to the target when it is attended. The findings show how both pre- and post-target modulatory mechanisms contribute to voluntary temporal attention. The presence and time course of these dynamic modulations in visually-driven cortical activity suggest mechanisms for how temporal attention can selectively enhance visual perceptual sensitivity at specific moments in time.
Results
Temporal precueing improved perceptual sensitivity
To investigate the effects of voluntary temporal attention on visual cortical dynamics, we recorded MEG while participants performed a temporal attention task with fully predictable stimulus timing (Fig. 1). Participants (n = 10 × 2 sessions) directed temporal attention to one of two sequential grating targets separated by a 300 ms stimulus onset asynchrony (SOA) and discriminated the orientation of one grating on each trial. A precue tone before the targets instructed participants to attend to either the first target (T1) or second target (T2), and a response cue tone after the targets instructed participants to report the tilt (clockwise (CW) or counterclockwise (CCW)) of one of the targets. The response cue matched the precue with 75% validity, so participants had incentive to direct their attention to different points in time according to the precue.
Trial timeline showing stimulus durations and SOAs in ms. Two successive targets (T1, T2) were embedded in 20 Hz counterphase flickering noise (gray bar). A precue indicated which target to attend, and a response cue indicated which target to report. Precues and response cues were pure tones (high = cue T1, low = cue T2).
Temporal attention improved orientation discrimination performance, consistent with previous findings3,4,14,30,57. Perceptual sensitivity (d’) was higher overall for valid trials (when the response cue matched the precue) than invalid trials (when the response cue did not match) as shown by a repeated measures ANOVA (main effect of validity: F(1,9) = 8.03, p = 0.020, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.25, mean [95% confidence interval (CI)] d’valid = 1.49 [1.27, 1.72], d’invalid = 1.03 [0.54, 1.52]) (Fig. 2). Perceptual sensitivity for T1 was higher than that for T2 (main effect of target: F(1,9) = 16.47, p = 0.003, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.26, d’T1 = 1.50 [1.07, 1.93], d’T2 = 1.03 [0.76, 1.30]). Given the directional nature of time, we also tested how temporal attention affected each target separately. The improvement in d’ with temporal attention was significant for T1 individually (F(1,9) = 9.08, p = 0.015, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.38, d’valid = 1.79 [1.53, 2.05], d’invalid = 1.21 [0.58, 1.85]) with a trend for improvement for T2 (F(1,9) = 4.75, p = 0.057, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.20, d’valid = 1.20 [0.94, 1.46], d’invalid = 0.85 [0.48, 1.23]). Temporal attention had no statistically significant effect on participants’ discrimination criterion, or bias to report CW vs. CCW (F(1,9) = 0.67, p = 0.433, cvalid = −0.05 [−0.20, 0.11], cinvalid = −0.01 [−0.17, 0.15]). Reaction time (RT) was faster for valid versus invalid trials overall (F(1,9) = 39.87, p < 0.001, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.76, RTvalid = 0.62 s [0.45, 0.80], RTinvalid = 0.88 s [0.72, 1.05]) and for each target individually (T1: F(1,9) = 43.02, p < 0.001, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.79, RTvalid = 0.62 s [0.45, 0.79], RTinvalid = 0.89 s [0.72, 1.06]; T2: F(1,9) = 34.71, p < 0.001, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.74, RTvalid = 0.63 s [0.46, 0.80], RTinvalid = 0.88 s [0.72, 1.04]). No other main effects or interactions were found for d’ or RT (F(1,9) < 2.66, p > 0.1) (Fig. 2). Thus, we can rule out any speed-accuracy tradeoffs in the effect of the precue on performance.
Mean (a) perceptual sensitivity and (b) reaction time for each target (T1, T2) and attention condition (n = 10 participants). Sensitivity was higher for valid (green) than invalid (purple) trials overall (p = 0.020), and for T1 individually (p = 0.015). Sensitivity was higher for T1 than T2 (p = 0.003). Reaction time was faster for valid than invalid trials overall (p < 0.001), and for each target individually (p < 0.001). Small data points indicate individual participant data. Large data points indicate group means, with error bars ±1 SEM. ~p < 0.1; * p < 0.05; ** p < 0.01; *** p < 0.001. The statistical tests used were repeated measures ANOVAs. Source data are provided as a Source Data file.
Continuous tracking of visual responses using ITPC
To allow continuous tracking of visual responses as participants deployed temporal attention, we used time-resolved SSVER. The grating targets were embedded in a stream of 20 Hz counterphase flickering noise, filtered to have similar orientation and spatial frequency content as the targets. The targets were presented superimposed on the noise background for 100 ms each and also flickered at 20 Hz, in phase with the ongoing noise flicker. This protocol allowed visual cortical responses to be tracked throughout the trial by measuring the 20 Hz SSVER. The SSVER technique allows stimulus-driven activity to be isolated and measured with high signal-to-noise58. The use of a high-frequency flicker allows high temporal-resolution measurements of the SSVER and increases the localization of the signal to posterior sites49,59.
For each participant, we measured the 20 Hz SSVER power for each channel, calculated from the average time series across all trials (henceforth “trial-average power”), and selected the five most visually responsive channels for further analysis. As expected, the most visually responsive channels were located in the back of the head, consistent with occipital responses (Fig. 3a). We confirmed strong 20 Hz trial-average power in these channels and found that power was enhanced specifically at the 20 Hz stimulus frequency (Fig. 3c).
a Topography of visually responsive channels across participants (top five channels selected for each session based on 20 Hz trial-average SSVER power). Grayscale indicates the proportion of sessions across participants (n = 20 sessions) in which the channel was selected as visually responsive. b Illustration of cartoon time series from five trials (gray) and their average (black). High phase coherence across trials results in high ITPC and trial-average power (top); low phase coherence across trials results in low ITPC and trial-average power (bottom), despite the same single-trial power. c–e Time frequency spectrograms (group means of n = 10 participants) of (c) trial-average power reveals a strong 20 Hz signal at the probe flicker frequency. Trial-average power depends on two factors: (d) the power on single trials and (e) phase coherence across trials, or ITPC. The trial-average power at 20 Hz was more strongly reflected in ITPC than in single-trial power.
Trial-average power depends on two factors: (1) single-trial power and (2) phase coherence across trials (ITPC) at the frequency of interest (Fig. 3b). Therefore, we next asked how the 20 Hz SSVER depended on these two factors. As in previous studies that have pulled apart single-trial power and ITPC52,53,54,55, we found that ITPC was the dominant component of the trial-average SSVER, with a strong and specific ITPC response at the 20 Hz probe frequency (Fig. 3e). Single-trial power had a relatively weak 20 Hz component and was dominated by alpha power around 10 Hz (Fig. 3d). We therefore used ITPC as our primary measure to quantify responsiveness to visual stimulation in our subsequent analyses.
Temporal expectation was accompanied by a ramping increase in ITPC
We first asked how temporal expectation affected ITPC in the period between the precue and the first target. In all trials, a predictable interval of 1.05 s elapsed between the onset of the precue and the onset of T1, so participants could form an expectation about the timing of T1 onset regardless of which target was precued to be attended4,14. We observed an apparent neural signature of expectation in the form of a slow ramping increase of ITPC (mean slope = 0.047 \(\Delta\)ITPC/s [0.01, 0.077]) during the precue-to-T1 period (Fig. 4a, Supplementary Fig. 1). To quantify the rate of increase, we fit lines to each participant’s ITPC time series from the onset of the precue to 80 ms before T1 (a cushion before T1 was inserted to prevent responses to T1 from interfering with the fit, see Methods, Model fitting and parameter estimation). The slope of the ITPC time series was positive for 8/10 participants and significantly greater than zero across the group (F(1,9) = 12.02, p = 0.007, \({\eta \,}_{{\rm{G}}}^{2}\)= 0.40) (Fig. 4b). In contrast, single-trial power showed no statistically significant ramping increase in the precue-to-T1 period (Supplementary Fig. 1c). Thus, cortical response reliability gradually ramped up during the anticipatory period.
a Group mean 20 Hz ITPC time series across all trials. Lines were fit to each participant’s ITPC time series during the precue-to-T1 period (gray bar). Linear fit to group data (dashed purple line) shown for visualization. Shaded error region shows ±1 SEM, normalized by a 100 ms baseline period before the precue. b Fitted slopes were significantly greater than zero (repeated measures ANOVA p = 0.007). c ITPC ramp is time-locked to the precue. Mean 20 Hz ITPC time series for each jittered duration of SSVER stimulation before the precue. Colored vertical lines indicate the onset of the stimulus flicker for each duration. d ITPC slopes during the precue-to-T1 period did not show statistically significant differences across jitter durations (repeated measures ANOVA p = 0.303). Data in (b, d) are presented as mean ±1 SEM. n = 10 participants for all panels and statistical tests. **p < 0.01 compared to zero. Source data are provided as a Source Data file.
To ensure this ITPC ramp was locked to the precue and not simply the continuous build-up of the SSVER signal, we separated trials according to the duration of the SSVER stimulation period before the precue, which had been jittered from 500 to 1500 ms to prevent predictable precue timing. A one-way ANOVA revealed no statistically significant differences in ITPC slope across the six jitter durations (F(1,9) = 1.19, p = 0.303, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.03, slope500 = 0.064 [0.01, 0.11], slope700 = 0.033 [−0.02, 0.08], slope900 = 0.056 [0.03, 0.08], slope1100 = 0.039 [−0,01, 0.08], slope1300 = 0.053 [0.00, 0.10], slope1500 = 0.027 [−0.01, 0.06]) (Fig. 4c, d).
Delta modulation of anticipatory ITPC
Given previous findings of both linear ramping and low-frequency modulations related to temporal expectation42,60, we next investigated whether the pre-target ITPC time series contained any periodic activity above what is expected from the intrinsic scale-free properties of neural activity, which follows a 1/f power law61,62. We found that the power spectrum of the ITPC time series exceeded the 1/f expectation for a range of low frequencies, from 1 to 6 Hz (delta and low-theta), as shown by a nonparametric bootstrap test (all p < 0.001) (Fig. 5a) (see “Methods”, FFT). Fitting a linear plus periodic (sine wave) model in which frequency was a free parameter showed that frequencies between 0.5–3 Hz explained the most variance in the periodic component of the ITPC time series for each precue condition in every participant, with a mean of 1.69 Hz for precue condition fits to individual data and 2.13 Hz for the group fit to all trials (Fig. 5b, c). We confirmed that the power spectrum of the ITPC time series during a baseline period 1500–500 ms before the precue showed no statistically significant deviations from 1/f (Supplementary Fig. 2). Thus, visual response reliability during the predictable pre-target interval exhibited periodic modulation, primarily at delta frequencies, on top of the ramping modulation.
a The predictable precue-to-T1 period (gray bar in b) contains low frequency (1–6 Hz) periodicity above that expected from 1/f scale-free activity, determined by a linear fit to the power from 35–200 Hz (solid green line) in log-log space and extrapolated to lower frequencies (dashed green line). Data are presented as mean values ± 95% confidence intervals (CI) of fast Fourier transforms (FFT) bootstrapped across n = 20 sessions. * indicates integer frequencies at which power from the 95% CI of bootstrapped FFTs exceeded the 1/f expectation, p < 0.001. b Group mean (n = 10 participants) 20 Hz ITPC time series from all trials fit with a linear plus periodic model (dashed purple line), in which frequency was a free parameter, indicates a dominant modulatory frequency near 2 Hz. Shaded error region shows ±1 SEM, normalized by a 100 ms baseline period before the precue. c Fitted frequencies to individual participant 20 Hz ITPC time series did not show a statistically significant difference across precue conditions (n = 10 participants, repeated measures ANOVA p = 0.734). Data are presented as mean values ± 1 SEM. Source data are provided as a Source Data file.
Temporal attention phase-shifts the periodic modulation of anticipatory ITPC
Next, we tested the effect of temporal attention—over and above temporal expectation—on the ramping and periodic components of the pre-target ITPC time series. To do so, we fit linear plus periodic models to each precue condition and assessed differences in fitted parameters. To determine whether temporal attention impacted the phase of the periodic modulation, we fixed frequency across participants and conditions and tested each of the six frequencies (1–6 Hz) that had exceeded the 1/f expectation (Fig. 5a). The 2 Hz model fit (Fig. 6) showed a significant phase difference between precue conditions, with a mean phase shift of 157° [123, 190] (\(\kappa\) = 1.53, Rayleigh z = 5.14, p = 0.004 uncorrected, p = 0.021 after Bonferroni correction for multiple comparisons across 6 frequencies) (Fig. 6c). The 2 Hz model also showed significant phase clustering in precue T1 trials (mean phase = 183° [147, 219], \(\kappa\) = 1.83, Rayleigh z = 4.72, p = 0.006 uncorrected, p = 0.035 after Bonferroni correction). There was no statistically significant difference between the mean phase shifts for the two sessions (session 1: 171°; session 2: 128°; Hotelling paired-sample circular test, F = 1.11, p = 0.854) (Supplementary Fig. 3). The amplitude of the 2 Hz component and the slope and intercept of the linear component were all statistically unaffected by the precue condition (all p > 0.1 uncorrected) (Fig. 6b). The phase shift was specific to 2 Hz, which concords with the dominant frequency observed in the time series (Fig. 6a); none of the other models using different fixed frequencies showed statistically significant phase shifts (all p > 0.05 after Bonferroni correction). Interestingly, the 157° phase shift of a 2 Hz signal corresponds to a time interval of 218 ms, which is similar to the SOA between the two targets. It is also similar to the SOAs at which maximal attentional tradeoffs occur: in a voluntary temporal attention study that varied the interval between the two targets, the attentional precue had the largest effect on perceptual sensitivity when the targets were separated by 250 ms3. Taken together, it appears that temporal expectation elicits a periodic modulation of visual response reliability in the delta band, and temporal attention shifts the phase of this modulation depending on the precise timing of the task-relevant target.
a Group mean 20 Hz ITPC time series for each precue condition, fit with linear plus 2 Hz periodic models (dashed lines) to the precue-to-T1 period (gray bar). Shaded error regions indicate ±1 standard error of the difference (SED). b No statistically significant differences in the fitted intercept, slope, or amplitude between precue conditions (repeated measures ANOVA p > 0.150). Data are presented as mean values ± 1 SED. c The 2 Hz phase is significantly concentrated on precue T1 trials (left, Rayleigh z test p = 0.035 after Bonferroni correction), and temporal attention phase-shifts the periodic modulation (right, Rayleigh z test p = 0.021 after Bonferroni correction). Shaded histograms show phase distributions across individual participants. n = 10 participants for all panels and statistical tests. ** p < 0.01 uncorrected, p < 0.05 after Bonferroni correction. Source data are provided as a Source Data file.
Temporal attention transiently affected the target-evoked ITPC response
We next asked how temporal attention—over and above temporal expectation—affected the ITPC response to the targets. A target stimulus briefly superimposed on an SSVER stimulus can either enhance or suppress the ongoing SSVER, such that some participants show transient increases (upward peaks) in the SSVER signal following the target stimulus, whereas others show transient decreases (downward peaks) (Supplementary Note 1, Supplementary Figs. 4–6). Thus, we asked how temporal attention affected the magnitude of this target-evoked ITPC response for each participant using a normalized measure of ITPC (see Methods, Peak analysis). The group average normalized ITPC time series has clear peaks after each target (Fig. 7a).
a Group normalized 20 Hz ITPC time series for each precue condition. Gray bar indicates the 500 ms baseline period used for normalization. Data are presented as mean timeseries with ±1 SED shaded error regions. b Effect of temporal attention on normalized target-evoked ITPC responses measured at individual participant peak times. Temporal attention increased the target-evoked ITPC response for T1 (repeated measures ANOVA p < 0.001). Data are presented as mean values ± 1 SED. n = 9 participants for all panels and statistical tests. *** p < 0.001. Source data are provided as a Source Data file.
The magnitude of the target-evoked ITPC response to T1 was larger when T1 was precued than when T2 was precued (Fig. 7a). To quantify the magnitude of this difference, we measured the peak of the normalized target-evoked ITPC response for each individual participant for the two precue types (see “Methods”, Peak analysis). We found that temporal attention increased the target-evoked ITPC response to T1 (F(1,8) = 32.34, p < 0.001, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.23, peakPrecue T1 = 0.09 [0.02, 0.15], peakPrecue T2 = 0.06 [0.00, 0.12]) (Fig. 7b, Supplementary Fig. 7). We found no statistically significant effect of the precue on the response to T2 (F(1,8) = 0.23, p = 0.642, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.005, peakPrecue T1 = 0.08 [0.02, 0.014], peakPrecue T2 = 0.08 [0.01, 0.14]). The presence of an effect of the precue for T1 and absence for T2 was reflected in a significant interaction of target and validity (F(1,8) = 7.17, p = 0.028, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.055). See Supplementary Table 1 for all ANOVA results for the target-evoked response analyses.
We further investigated the effect of the precue on other measures of the evoked response to the targets. At the same peak times, trial-average normalized SSVER power behaved similarly to ITPC, whereas single-trial normalized 20 Hz power was unrelated to precue validity (Supplementary Table 1). Specifically, temporal attention increased the trial-average normalized SSVER power in response to T1 (F(1,8) = 5.80, p = 0.043, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.11, peakPrecue T1 = 5.60 [1.43, 9.77], peakPrecue T2 = 4.06 [0.60, 7.52]), but there was no statistically significant change for T2 (F(1,8) = 0.22, p = 0.648, \({\eta \,}_{{\rm{G}}}^{2}\) = 0.0026, peakPrecue T1 = 4.07 [1.24, 6.89], peakPrecue T2 = 3.77 [0.63, 6.91]) (Supplementary Fig. 1h). Single-trial normalized 20 Hz power, in contrast, showed no statistically significant effect of precue for either target (F(1,8) < 2.05, p > 0.1) (Supplementary Fig. 1i). Finally, we confirmed that the ITPC effect for T1 was specific to 20 Hz (Supplementary Fig. 8). Therefore, directing temporal attention to the first target changed the steady-state response to that target in a transient fashion.
Discussion
We investigated how temporal attention dynamically modulates visual processing by using time-resolved SSVER to continuously track visual cortical activity as participants attended to precise points in time. Our two-target temporal cueing task3,4,11,14 allowed us to measure specific effects of voluntary temporal attention to task-relevant time points, over and above the effects of temporal expectation due to predictable stimulus timing. Temporal attention affected cortical response reliability both in anticipation of and in response to target stimuli. Visual response reliability indexed by ITPC slowly ramped up as target onset approached, an overall effect of temporal expectation. In addition, response reliability exhibited low-frequency modulations before target onset, which shifted in phase according to which target was attended. Temporal attention to task-relevant time points also enhanced target-evoked responses, specifically for the first target. These findings demonstrate both anticipatory and evoked visual cortical signatures of voluntary temporal attention when temporal expectation is controlled.
Both ramping activity and neural oscillations have been proposed to serve as timing mechanisms in the brain60,63, and both have been observed in anticipation of a predictable stimulus. Studies of temporal expectation have found anticipatory ramps in EEG voltage known as the contingent negative variation (CNV), often when a speeded response to a target was required34,64,65,66,67. The CNV is usually associated with response preparation or interval timing68 and has been localized to frontal and parietal areas69. Ramping activity in motor and parietal areas has also been observed leading up to timed movements36,70. In visual cortex, gamma responses to an irrelevant grating ramped up over the course of ~1 s before the expected target time, and alpha power decreased43.
Temporal expectation has also been associated with low-frequency oscillations in the delta band (~0.5–4 Hz) in both sensory39,42,71,72 and fronto-parietal73,74 areas. A study in macaques39 found that interleaved rhythmic auditory and visual stimuli elicited V1 delta modulation, which phase-shifted to align with input in the task-relevant modality. Delta phase was related to the amplitude of visual responses trial by trial, suggesting that delta can modulate response gain. Alignment of rhythmic stimuli to a preferred phase of delta also has behavioral consequences, improving discrimination41,72,74 and speeding reaction times39,40,74. These reports of delta activity come from experiments that also presented rhythmic stimuli at delta frequencies. Thus, it is possible that the activity was due to stimulus-driven entrainment39,41,71. However, other expectation experiments found delta phase synchronization even in the absence of rhythmic input, using predictable cue interval timings42,74. None of the above-mentioned studies attempted to isolate the effects of voluntary temporal attention from those of temporal expectation.
Here, we found expectation-related ramping and low-frequency modulation in the reliability of stimulus-specific visual responses, which was phase-shifted by temporal attention. Given that the ITPC signal and attention-related modulations were narrow-band at the stimulus frequency of 20 Hz and phase-locked to the stimulus, these measurements very likely reflect neural responses driven by the stimulus flicker rather than endogenously generated activity in the beta band. Unlike in previous studies27,28,43 no speeded responses to targets were required, ruling out the possibility that the cortical modulations were associated with response preparation75. SSVER studies of spatial attention sustained across seconds have reported SSVER amplitudes that are relatively constant across a trial52,53, suggesting that the ramping plus periodic pattern we observed is specific to situations when the participant can predict the upcoming target time. Interestingly, an overall anticipatory effect of temporal expectation that is time-shifted by temporal attention is similar to the pattern we have observed for microsaccades, which also provide continuous measures of anticipatory processes11,15.
Unlike studies showing ramping64,66,67 or changes in delta phase39,42,74 of the raw neural time series, the time-resolved SSVER reflects ramping and periodic modulations of cortical responses to visual stimuli presented at each time point, demonstrating the continuous functional impact of these modulatory signals during target anticipation. The delta-band modulations of ITPC could have arisen from periodic fluctuations in gain (which would increase sensitivity to the SSVER flicker) or periodic fluctuations in the timing consistency of responses to the SSVER stimulus. Whereas ramping may heighten overall response reliability during an anticipated period, additional periodic modulation may allow for more temporally fine-grained adjustments, selectively enhancing reliability at one moment while suppressing it at moments just before and after. Such control over windows of increased response reliability could facilitate the communication of accurate stimulus information at precise moments when a task-relevant stimulus is likely to occur while providing some shielding from temporally proximal distracting stimuli76.
We also found that temporal attention enhanced the evoked ITPC response even for fully predictable stimuli, but this enhancement occurred only for T1 and not for T2. One possible explanation, albeit speculative, is that the neural mechanisms of temporal attention differ for T1 and T2 given the directionality of time. When stimuli appear in rapid succession, anticipatory signals that slowly build up over time would mainly affect the first stimulus in the sequence. It may be that at occipital stages, changing the relative responses to the two targets by modulating T1 alone is sufficient to bias downstream competition between the two target representations and produce attentional tradeoffs in behavior. In our data, consistent with a relative modulation of target responses by attention, the T2 evoked ITPC response was higher than the T1 response when T2 was precued and lower than the T1 response when T1 was precued.
A previous investigation of temporal expectation using SSVER yielded mixed results51: post-target SSVER amplitude was higher for predictable vs. unpredictable stimuli for some flicker frequencies but lower for other flicker frequencies, with no increase in SSVER amplitude in advance of a predictable target time. However, the SSVER stimulus had no spatial overlap with the target location, so any expectation-related modulation specific to the target location would not have been measurable in the SSVER signal. Here, with spatially co-localized SSVER and target stimuli, temporal attention increased the impact of the targets on the SSVER for all participants. However, the targets themselves increased the SSVER for some participants and decreased it for others. This directionality was stable across sessions for each participant. To our knowledge this individual variation is an unexplained observation in the field that deserves further study (Supplementary Note 1, Supplementary Table 2).
Our normalization model of dynamic attention predicts that temporal attention will modulate sensory responses3. Based on fits to psychophysical data, the model also predicts that the effects of temporal attention on neural gain will be transient and roughly coincident with the sensory responses. The current data are consistent with both model predictions: sensory modulation by temporal attention and modulation at a transient timescale. Specifically, the model predicts a change in gain at the neuronal level, which requires a linking hypothesis to make predictions for MEG measures. Higher gain would increase the probability or rate of neuronal spiking in response to each stimulus flicker, which in turn would be expected to lead to more reliable population responses to the SSVER stimulus across trials and thus higher ITPC. Such a mechanism also might be expected to increase single-trial power, which we did not observe. However, the SSVER stimulus itself did not substantially increase 20 Hz single-trial power above baseline. This raises the possibility that stimulus-driven 20 Hz activity interacted with ongoing endogenous (non-phase locked) 20 Hz power, which may have obscured gain changes in the single-trial power measure. Alternatively, changes in ITPC could have arisen from a mechanism other than gain changes, such as more consistent timing of stimulus-driven responses without changes in neuronal response amplitude. Invasive recordings, and perhaps modeling efforts, could distinguish between these possibilities.
This linking hypothesis between gain increases at the neuronal level and ITPC increases in the SSVER can account for the pre-target effects of attention and expectation we observed, as well as post-target effects for those participants showing a transient increase of the ITPC following each target. However, transient reductions of the ITPC in response to target stimuli, regardless of attention (Supplementary Note 1), appear to arise from participant-specific interactions between the ongoing SSVER and target-evoked responses and are thus not predictable from the normalization model of dynamic attention3, which does not include specific biophysical structure. The model also could not predict the anticipatory ramping and periodic modulation we observed here, because the behavioral data alone could not distinguish anticipatory from stimulus and post-stimulus processing; future versions of the model should incorporate these new experimental findings.
In summary, using time-resolved SSVER, we studied how voluntary attention to points in time continuously impacts the reliability of cortical responses to visual stimulation. Our measurements revealed two complementary neural mechanisms specific to voluntary temporal attention, when temporal expectation was controlled: (1) a slow, periodic fluctuation of visual cortical ITPC that anticipated predictable target times and shifted in time depending on which time point was to be attended, and (2) a transient, post-target increase of the evoked ITPC response to targets at attended time points, which was specific to the first target. This transient modulation changed the relative strengths of the two target responses as a function of attention. By revealing the full time course of temporal attentional modulations of sensory activity, the results show how attention dynamically alters visual signals to select and prioritize visual information at precise moments in time.
Methods
Participants
Ten participants (5 females) between the ages of 24–35 years, including authors RND and KJT, participated in the experiment. Each participant completed two MEG sessions, which allowed us to check for consistency across sessions for each participant while collecting 20 sessions of MEG data in total, comparable to previous MEG studies of vision77,78. The sample size was also similar to previous temporal attention studies that measured accuracy4,14,29 and to several studies on spatial attention79,80,81. No statistical method was used to predetermine sample size. No data were excluded from the analyses, except in one case where the measure of interest could not be calculated for one participant (see Peak analysis). All participants provided informed consent, and the University Committee on Activities involving Human Subjects at New York University approved the experimental protocols. All participants had normal or corrected-to-normal vision and were monetarily compensated for their participation. No sex or gender-based analyses were performed and we did not consider sex or gender in the study design, as neither sex nor gender played a role in our research questions. The sex of the participants was determined based on self-report.
Stimuli
Stimuli were generated on an Apple iMac using MATLAB and Psychophysics Toolbox version 3.0.1882,83 and were displayed using a gamma-corrected InFocus LP850 projector (Texas Instruments, Warren, NJ) with a resolution of 1024 × 768 pixels and a refresh rate of 60 Hz. Stimuli were projected via a mirror onto a translucent screen at a viewing distance of 42 cm. Stimulus timing accuracy was confirmed with photodiode measurements. Stimuli were presented on a medium gray background (206 cd/m2). A central fixation circle subtended 0.15° visual angle.
SSVER stimulus
Throughout the trial, a 20 Hz contrast-reversing flickering noise patch was presented foveally, with the fixation circle overlaid. The noise patch was 4° in diameter with a blurred outer edge subtending 0.4°, fading to the background gray. To create the noise patch, pixel noise was generated from a normal distribution then bandpass filtered in both orientation and spatial frequency around target values: in orientation 0° ± 5° and 90° ± 5° forming a vertical and horizontal crosshatch pattern, and in spatial frequency 1.5 cpd ±1 octave. A new noise patch was randomly generated on each trial and then flickered throughout the trial by contrast reversing at 20 Hz (50 ms of the original noise image in alternation with 50 ms of the contrast-reversed image). The contrast of the SSVER stimulus was adjusted to ensure high visibility of the target stimuli while still producing a clear SSVER signal (40–55% noise patch contrast across participants).
Targets
Visual targets were 1.5 cpd sinusoidal gratings of 4° in diameter with a blurred outer edge subtending 0.4°. Targets were also presented foveally and were added pixelwise to the noise patch stimulus. To match the temporal frequency of the targets to the SSVER stimulus, the targets were also contrast-reversed at 20 Hz, in phase with the noise patch flicker, for a total target duration of 100 ms. Target grating contrast was adjusted so that, when added to the noise patch, the total contrast was 100% (grating contrast 45–60% across participants).
Cues
Auditory cues were pure sine wave tones 100 ms in duration (with 10 ms cosine amplitude modulation ramps at onset and offset) presented through earbuds at a comfortable volume. There were two possible auditory cue tones, one high-pitched (1046.5 Hz, C6) and one low-pitched (440 Hz, A4).
Experimental procedure
All participants participated in two 2-h MEG sessions on separate days, on average 3.6 days apart. Each session included 12 experimental blocks ~6 min each in duration. Participants took breaks between blocks and pressed a button to indicate their readiness for the next block.
Task
On each trial, participants were asked to discriminate the orientation of one of two grating targets (Fig. 1). The two targets (T1 and T2) were each presented for 100 ms, separated by a SOA of 300 ms, based on previous psychophysical studies3,4,14. Each target was tilted slightly CW or CCW from either the vertical or horizontal axis, with independent tilts and axes for T1 and T2. Both vertical and horizontal axes were used to reduce the number of trials on which the two stimuli were identical and discourage participants from adopting a strategy of judging whether the stimuli were the same or different to aid in discrimination. An auditory precue 1050 ms before the first target instructed participants to attend to the first or the second target (high tone: attend T1; low tone: attend T2). An auditory response cue 950 ms after T2 indicated which of the two targets’ tilt to report (high tone: report T1; low tone: report T2). Participants pressed one of two buttons to indicate whether the tilt was CW or CCW relative to the main axis within a 1600 ms response window. The response cue matched the precue on 75% of the trials (valid trials) and mismatched the precue on the remaining 25% (invalid trials). At the end of each trial, feedback was provided by a change in color of the fixation circle (green: correct; red: incorrect; blue: response timeout). At the end of each block, performance accuracy (percent correct) was displayed. The timing of auditory and visual events was the same on every trial. From trial to trial, the allocation of temporal attention varied (depending on the precue), and the response selection varied (depending on the response cue).
So that only the precue would provide predictive timing information about the target onsets, and to allow time for the SSVER signal to stabilize before any trial events occurred, the interval between the SSVER onset and the precue was jittered from 500 to 1500 ms in steps of 200 ms. Between each trial, an inter-trial interval of 1500 ms was presented, during which only the fixation circle appeared on a gray background. Participants were encouraged to blink during this period. The fixation circle turned black at the start of the ITI, turned white 500 ms before the SSVER onset to indicate that the trial was about to begin, and remained white throughout the trial until the response feedback was provided.
Each session consisted of 12 blocks of 41 trials each with all combinations of cue type (valid: 75%, invalid: 25%), probed target (T1, T2), target tilt (CW, CCW; independent for T1 and T2), and target axis (horizontal, vertical; independent for T1 and T2) in a randomly shuffled order. To reduce adaptation to the SSVER stimulus, every fifth trial was a blank trial, with an additional blank trial at the beginning of each block. This procedure yielded a total of 768 temporal attention trials per participant across the two sessions.
Training and thresholding
Prior to MEG, participants completed at least one session of training to familiarize them with the task and to determine their orientation tilt thresholds. During the training session, a chin-and-head rest was used to hold head position and viewing distance constant. Stimuli were displayed on a gamma-corrected Sony Trinitron G520 CRT monitor with a refresh rate of 60 Hz at a viewing distance of 57 cm. A 3-up-1-down staircase procedure was used to estimate each participant’s tilt threshold to achieve ~79% accuracy across all trials. Threshold tilts ranged from 1.5–5°.
Eye tracking
Eye position was recorded using an EyeLink 1000 eye tracker (SR Research Ltd., Ottawa, ON, Canada) with a sampling rate of 1000 Hz. Raw gaze positions were converted into degrees of visual angle using the five-point-grid calibration, which was performed at the start of each MEG session.
MEG recording
Data were recorded continuously in each block with a 157-channel axial gradiometer system (Kanazawa Institute of Technology, Kanazawa, Japan) in the KIT/NYU facility at New York University. The magnetic fields were sampled at 1000 Hz with online DC filtering and a 200 Hz low-pass filter. Three orthogonally-oriented reference magnetometers placed ~20 cm away from the recording array recorded environmental noise.
Before recording, each participant’s head shape was digitized with a handheld FastSCAN laser scanner (Polhemus, VT, USA). Digital markers were placed on the forehead, nasion, and left and right tragus and peri-auricular points. To record marker locations with respect to the MEG channels, electrodes were placed on five of the locations identified by the digital markers (three points on the forehead and left and right peri-auricular points), and marker locations were measured at the beginning and end of the MEG recording session.
Data analysis
Preprocessing
Data were preprocessed in MATLAB using the FieldTrip toolbox (version 20220707) for EEG/MEG-analysis84. Environmental noise was removed from the data by regressing signals recorded from three orthogonally oriented magnetometers against the recorded data.
MEG channels in which there was no signal or excessive noise were interpolated from neighboring channels (the number of interpolated channels per recorded session ranged from 2 to 16 (1.3–10.2%), mean = 3.9 (2.5%), SD = 3.0). Trials were manually inspected and rejected for blinks and other artifacts. The number of rejected trials per session ranged from 26 to 73 (5.3–14.8%), mean = 45.4 (9.2%), SD = 14.3.
Channel selection
We selected visually responsive channels as the five channels per session with the strongest 20 Hz trial-average SSVER power. Topographies confirmed selected channels were in the posterior part of the head.
SSVER analysis
For the SSVER analyses, trials were epoched from 1500 ms before to 5200 ms after the precue. Time-frequency representations of the data were computed using wavelet convolution with the FieldTrip toolbox. For the 20 Hz SSVER frequency, Morlet wavelets had 8 cycles (127 ms temporal resolution, 5 Hz spectral resolution). For full time-frequency spectra, frequency was sampled every 1 Hz from 1–50 Hz, time was sampled every 10 ms, wavelets were tapered with Hanning windows, and the time windows scaled with frequency according to 10 s divided by the frequency.
Trial-average power
Trial-average power quantifies the power of a periodic signal that is phase-locked across trials and so survives averaging across trials. This measure has been commonly used in the SSVER literature46,85,86,87. To calculate the trial-average power, the time series of all trials (or all trials in a given experimental condition) were first averaged together, giving the mean time series across trials. Time-frequency analysis was then performed on this single time series to calculate the power across time at a frequency of interest. Trial-average power depends on two factors: single-trial power and ITPC.
Single-trial power
Single-trial power quantifies the power of a periodic signal present on individual trials. To calculate the single-trial power, time-frequency analysis was performed on each individual trial, and these time-frequency representations were then averaged together across trials. This procedure preserves oscillatory activity that is not phase-locked across trials but relies on having sufficient signal-to-noise at the single-trial level to measure the periodic signal.
ITPC
ITPC quantifies the phase alignment of a periodic signal across trials. The measure ranges from 0 (no phase alignment) to 1 (perfect phase alignment)88. To calculate ITPC, time-frequency analysis was performed on each individual trial, returning estimates of the instantaneous phase angle at each point in time for a frequency of interest. The following standard formula was then used to calculate the alignment of these phase angles across trials, providing an estimate of ITPC:
where \(\phi\)tf is the instantaneous phase angle in radians at a time point t and frequency f for each trial n∈[1,…,N].
Anticipatory ITPC analyses
FFT
To investigate whether there were any periodic modulations of the ITPC timeseries beyond the expected scale-free spectral content, we assessed whether the power spectrum of the ITPC timeseries in the precue-to-T1 period differed from a 1/f relationship. To do so, we converted the anticipatory ITPC time series per session into the frequency domain using fast Fourier transform (from 1 Hz to 200 Hz, with 1 Hz resolution). We bootstrapped the 20 resultant spectra 1000 times, resampling with replacement to generate a distribution of spectra. The 1/f power law—as frequency increases, power decays following a power-law function—simplifies to a linear relationship between power and frequency in log-log space89. Hence, we estimated the aperiodic components of the anticipatory ITPC period by fitting a line to the average empirical spectrum in log-log space from 35 to 200 Hz, while leaving out 60 Hz line noise and harmonics, following the procedure from ref. 90, and extrapolated the fit to estimate the power expected from 1/f scale-free activity at lower frequencies (<35 Hz). For each integer frequency, we calculated the 95% confidence interval from the bootstrapped distribution of spectra using the 2.5–97.5 percentiles. Deviation of the confidence interval above or below the fitted line would be evidence for significantly more or less periodicity, respectively, than expected from scale-free activity alone, at an alpha level of 0.05. As a control, we repeated this procedure for a baseline period (1500–500 ms before the precue).
Using the same bootstrapping procedure, we also confirmed that the spectra of the original time series, averaged across the top 5 visually responsive channels (see Channel selection) per session, showed no statistically significant deviations in 2 Hz power from 1/f expectations during the precue-to-T1 period.
Model fitting and parameter estimation
To quantify pre-target ITPC modulations for individual participants, we fit both linear and linear plus periodic models to the ITPC time series in the time window between the precue and first target. In all trials, a predictable interval of 1.05 s elapsed between the onset of the precue and the onset of T1, so participants could form an expectation about the timing of T1 onset. To prevent T1 from influencing ITPC measurements of pre-target activity due to the temporal bandwidth of the wavelet, we set a cushion before T1 based on the group average ITPC time series across all trials. The start of the cushion was the inflection point 80 ms before T1 when ITPC began to sharply decrease due to the T1 response. Thus the precue-to-T1 window was defined as the onset of the precue to 80 ms before T1 (0–970 ms).
To assess the overall effect of expectation on ITPC in this precue-to-T1 period, we fit the models to the ITPC time series calculated from all trials. To assess the effect of temporal attention, we fit the models to ITPC time series calculated from trials for each precue condition separately. In all cases, we fit the models for each participant and session and then statistically assessed the parameter estimates from these fits.
The linear plus periodic model was defined as:
where \(\hat{y}\) is the predicted ITPC, \(t\) is time, \({f}_{s}\) is the sampling frequency (1000 Hz), \(b\) is the intercept, \(m\) is the slope of the linear component, and \(A\), \(\omega\), and \(\phi\) are the amplitude, frequency, and phase of the periodic (sine wave) component. The parameters \(b\), \(m\), \(A\), and \(\phi\) were always free to vary during model fitting, and \(\omega\) could be free or fixed to a specific frequency, e.g., 2 Hz.
The linear only model was identical but contained only the intercept and slope terms.
We estimated the model parameters by minimizing the mean squared error between the predicted and true ITPC using the optimization algorithm Bayesian Adaptive Direct Search (BADS)91. The search bounds for each parameter were:
\(b:\,[\mathrm{0,1}]\)
\(m:\,\left[-2,\,2\right]\) \(\Delta\)ITPC/s
\(A:\,[\mathrm{0,1}]\)
\(\omega :\,[0.1,\,5]\) Hz
\(\phi :\,[-{pi},{pi}]\) radians
To avoid local minima, we randomly generated 100 sets of initial parameters within the search bounds. We performed BADS optimization for each set of initial parameters, then selected the set of fitted parameters associated with the minimum error across iterations as the best fitting parameters.
Phase analysis
Phase analyses were performed using the Circular Statistics Toolbox92. The effect of temporal attention on phase was assessed using Rayleigh’s z test, which tests for non-uniformity of a circular distribution.
Evoked ITPC analysis
Peak analysis
To quantify the evoked ITPC response for individual participants, we estimated the latency, direction, and amplitude of the most prominent deflection in the 20 Hz ITPC following each target, which we refer to as a “peak”. As ITPC data in the selected visual channels was highly consistent across sessions for a given participant, we identified peak times for each participant using the ITPC time series averaged across the two sessions. To find the peaks, we applied the MATLAB algorithm findpeaks.m to the time window 0–600 ms after T1 onset, which captured the evoked ITPC responses to both T1 and T2.
All participants but one had two peaks that were either upward or downward during this window. The participant without an identifiable pair of peaks was not considered further in the peak analysis. Two participants had both upward peaks and downward peaks during the window, so for these participants we selected the pair of peaks whose timing was best aligned with the peaks of the other participants. The direction of the pair of peaks identified by this algorithm determined the classification of a participant as having either upward or downward peaks.
We next assessed the effect of attention on the magnitude of the ITPC peaks, regardless of their direction. To combine evoked ITPC data across participants for this analysis, we first normalized the data so that peaks for all participants pointed upward. To do so, we measured a baseline ITPC value for each participant and session in the window 470–970 ms after the precue (a 500 ms window before target onset, excluding the 80 ms cushion before T1, see Model fitting and parameter estimation). We then subtracted that baseline value from the time series, and for participants with downward peaks, reversed the sign:
where s = 1 for participants with upward peaks and −1 for participants with downward peaks. This procedure allowed all evoked ITPC peaks to point upward while preserving all other features of the time series.
Finally, to quantify the ITPC magnitude at the peak times, for each participant, session, and precue condition, we took the mean normalized ITPC value in a 100 ms window centered on each peak time (i.e., following each target). Equivalent peak analyses were performed for trial-average 20 Hz power and single-trial 20 Hz power using the same peak times and normalization procedure.
Statistical analysis
The effects of temporal attention on b0ehavior (d’ and RT) and MEG time series (anticipatory model parameter estimates and evoked peak values) were assessed using repeated measures ANOVAs via the ezANOVA package in R. For behavioral data, the within-subject factors were session, target, and validity (valid or invalid, with respect to the match between the precue and response cue), with session as an observed factor. For MEG anticipatory analyses, within-subject factors were session and precue (precue T1 or precue T2), with session as an observed factor. For MEG peak analyses, within-subject factors were session, target, and attention (attended or unattended, with respect to the precue type, referenced to each target), with session as an observed factor. Planned ANOVAs were also conducted for each target separately. All statistical tests met normality and equal variance assumptions.
To assess the effect of the precue on the full time series or time-frequency spectrum, we used a nonparametric cluster-corrected test93. This involved a two-step procedure using data averaged across sessions for each participant. In the first step, two-tailed paired t-tests were performed at each time point or time-frequency bin (henceforth, “bin” for both) comparing precue T1 to precue T2 condition values. This operation yielded t-statistics for each bin for the empirical data. Contiguous bins passing the p < 0.05 threshold were considered to form a cluster, and a statistic was generated for each cluster by summing the absolute value of the t-values for bins within the cluster. The largest statistic out of all clusters was considered to be the maximum cluster statistic for the empirical data. Next, null distributions of the maximum cluster statistic were constructed by shuffling the labels for the precue conditions, computing the maximum cluster statistic for the shuffled data, and repeating this process 1000 times. Significance levels for the empirical data were determined by comparing the value of the maximum cluster statistic to the null distribution.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The neural and behavioral data generated in this study have been deposited in the Open Science Framework database at https://osf.io/796kb/94 and are publicly available. Source data are provided with this paper.
Code availability
The code for the presented analyses are available on GitHub at https://github.com/denisonlab/TANoise_MEG95.
References
Land, M. F. & Furneaux, S. The knowledge base of the oculomotor system. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 352, 1231–1239 (1997).
Nobre, A. C. & van Ede, F. Anticipated moments: temporal structure in attention. Nat. Rev. Neurosci. 19, 34–48 (2018).
Denison, R. N., Carrasco, M. & Heeger, D. J. A dynamic normalization model of temporal attention. Nat. Hum. Behav. 5, 1674-1685 (2021).
Denison, R. N., Heeger, D. J. & Carrasco, M. Attention flexibly trades off across points in time. Psychon. Bull. Rev. 24, 1142–1151 (2017).
Duyar, A., Ren, S. & Carrasco, M. When temporal attention interacts with expectation. Sci. Rep. 14, 4624 (2024).
Denison, R. N. Visual temporal attention from perception to computation. Nat. Rev. Psychol. 3, 261–274 (2024).
Raymond, J. E., Shapiro, K. L. & Arnell, K. M. Temporary suppression of visual processing in an RSVP task: An attentional blink? J. Exp. Psychol. Hum. Percept. Perform. 18, 849–860 (1992).
Tang, M. F. et al. Neural dynamics of the attentional blink revealed by encoding orientation selectivity during rapid visual presentation. Nat. Commun. 11, 434 (2020).
Marti, S. & Dehaene, S. Discrete and continuous mechanisms of temporal selection in rapid visual streams. Nat. Commun. 8, 1955 (2017).
Summerfield, C. & Egner, T. Feature-based attention and feature-based expectation. Trends Cogn. Sci. 20, 401–404 (2016).
Denison, R. N., Yuval-Greenberg, S. & Carrasco, M. Directing voluntary temporal attention increases fixational stability. J. Neurosci. 39, 353–363 (2019).
Duyar, A., Denison, R. N. & Carrasco, M. Exogenous temporal attention varies with temporal uncertainty. J. Vis. 23, 9 (2023).
Jing, C. et al. Temporal attention affects contrast response function by response gain. Front. Hum. Neurosci. 16, 1020260 (2023).
Fernández, A., Denison, R. N. & Carrasco, M. Temporal attention improves perception similarly at foveal and parafoveal locations. J. Vis. 19, 12 (2019).
Palmieri, H., Fernández, A. & Carrasco, M. Microsaccades and temporal attention at different locations of the visual field. J. Vis. 23, 6 (2023).
Hillyard, S. A., Vogel, E. K. & Luck, S. J. Sensory gain control (amplification) as a mechanism of selective attention: electrophysiological and neuroimaging evidence. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 353, 1257–1270 (1998).
Richter, D., Ekman, M. & de Lange, F. P. Suppressed sensory response to predictable object stimuli throughout the ventral visual stream. J. Neurosci. 38, 7452–7461 (2018).
Wyart, V., Nobre, A. C. & Summerfield, C. Dissociable prior influences of signal probability and relevance on visual contrast sensitivity. Proc. Natl. Acad. Sci. USA 109, 3593–3598 (2012).
Cheadle, S., Egner, T., Wyart, V., Wu, C. & Summerfield, C. Feature expectation heightens visual sensitivity during fine orientation discrimination. J. Vis. 15, 14 (2015).
Kok, P., Rahnev, D., Jehee, J. F. M., Lau, H. C. & de Lange, F. P. Attention reverses the effect of prediction in silencing sensory signals. Cereb. Cortex 22, 2197–2206 (2012).
Summerfield, C. & Egner, T. Expectation (and attention) in visual cognition. Trends Cogn. Sci. 13, 403–409 (2009).
Bang, J. W. & Rahnev, D. Stimulus expectation alters decision criterion but not sensory signal in perceptual decision making. Sci. Rep. 7, 17072 (2017).
Rungratsameetaweemana, N. & Serences, J. T. Dissociating the impact of attention and expectation on early sensory processing. Curr. Opin. Psychol. 29, 181–186 (2019).
Todorovic, A., Schoffelen, J.-M., van Ede, F., Maris, E. & de Lange, F. P. Temporal expectation and attention jointly modulate auditory oscillatory activity in the beta band. PloS ONE 10, e0120288 (2015).
Lange, K. & Röder, B. Orienting attention to points in time improves stimulus processing both within and across modalities. J. Cogn. Neurosci. 18, 715–729 (2006).
Sharma, J. et al. Spatial attention and temporal expectation under timed uncertainty predictably modulate neuronal responses in monkey V1. Cereb. Cortex 25, 2894–2906 (2015).
Ghose, G. M. & Maunsell, J. H. R. Attentional modulation in visual cortex depends on task timing. Nature 419, 616–620 (2002).
Warren, S., Yacoub, E. & Ghose, G. Featural and temporal attention selectively enhance task-appropriate representations in human V1. Nat. Commun. 5, 5643–5643 (2014).
Davranche, K., Nazarian, B., Vidal, F. & Coull, J. Orienting attention in time activates left intraparietal sulcus for both perceptual and motor task goals. J. Cogn. Neurosci. 23, 3318–3330 (2011).
Samaha, J., Bauer, P., Cimaroli, S. & Postle, B. R. Top-down control of the phase of alpha-band oscillations as a mechanism for temporal prediction. Proc. Natl. Acad. Sci. USA 112, 8439–8444 (2015).
Anderson, B. & Sheinberg, D. L. Effects of temporal context and temporal expectancy on neural activity in inferior temporal cortex. Neuropsychologia 46, 947–957 (2008).
van Ede, F., Chekroud, S. R., Stokes, M. G. & Nobre, A. C. Decoding the influence of anticipatory states on visual perception in the presence of temporal distractors. Nat. Commun. 9, 1449 (2018).
Correa, Á., Lupiáñez, J., Madrid, E. & Tudela, P. Temporal attention enhances early visual processing: a review and new evidence from event-related potentials. Brain Res. 1076, 116–128 (2006).
Miniussi, C., Wilding, E. L., Coull, J. T. & Nobre, A. C. Orienting attention in time: modulation of brain potentials. Brain 122, 1507–1518 (1999).
Leon, M. I. & Shadlen, M. N. Representation of time by neurons in the posterior parietal cortex of the macaque. Neuron 38, 317–327 (2003).
Maimon, G. & Assad, J. A. A cognitive signal for the proactive timing of action in macaque LIP. Nat. Neurosci. 9, 948–955 (2006).
Merchant, H., Harrington, D. L. & Meck, W. H. Neural basis of the perception and estimation of time. Annu. Rev. Neurosci. 36, 313–336 (2013).
Finnerty, G. T., Shadlen, M. N., Jazayeri, M., Nobre, A. C. & Buonomano, D. V. Time in cortical circuits. J. Neurosci. 35, 13912–13916 (2015).
Lakatos, P., Karmos, G., Mehta, A. D., Ulbert, I. & Schroeder, C. E. Entrainment of neuronal oscillations as a mechanism of attentional selection. Science 320, 110–113 (2008).
Schroeder, C. E. & Lakatos, P. Low-frequency neuronal oscillations as instruments of sensory selection. Trends Neurosci. 32, 9–18 (2009).
Cravo, A. M., Rohenkohl, G., Wyart, V. & Nobre, A. C. Temporal expectation enhances contrast sensitivity by phase entrainment of low-frequency oscillations in visual cortex. J. Neurosci. 33, 4002–4010 (2013).
Breska, A. & Deouell, L. Y. Neural mechanisms of rhythm-based temporal prediction: delta phase-locking reflects temporal predictability but not rhythmic entrainment. PLoS Biol. 15, e2001665 (2017).
Lima, B., Singer, W. & Neuenschwander, S. Gamma responses correlate with temporal expectation in monkey primary visual cortex. J. Neurosci. 31, 15919–15931 (2011).
Jaramillo, S. & Zador, A. M. The auditory cortex mediates the perceptual effects of acoustic temporal expectation. Nat. Neurosci. 14, 246–251 (2011).
Nobre, A. C. & van Ede, F. Attention in flux. Neuron 111, 971–986 (2023).
Morgan, S. T., Hansen, J. C. & Hillyard, S. A. Selective attention to stimulus location modulates the steady-state visual evoked potential. Proc. Natl. Acad. Sci. USA 93, 4770–4774 (1996).
Müller, M. M., Teder, W. & Hillyard, S. A. Magnetoencephalographic recording of steady-state visual evoked cortical activity. Brain Topogr. 9, 163–168 (1997).
Andersen, S. K. & Müller, M. M. Behavioral performance follows the time course of neural facilitation and suppression during cued shifts of feature-selective attention. Proc. Natl. Acad. Sci. USA 107, 13878–13882 (2010).
Garcia, J. O., Srinivasan, R. & Serences, J. T. Near-real-time feature-selective modulations in human cortex. Curr. Biol. 23, 515–522 (2013).
Adamian, N. & Andersen, S. K. Attentional modulation in early visual cortex: a focused reanalysis of steady-state visual evoked potential studies. J. Cogn. Neurosci. 36, 46–70 (2023).
Mora‐Cortes, A., Ridderinkhof, K. R. & Cohen, M. X. Evaluating the feasibility of the steady‐state visual evoked potential (SSVEP) to study temporal attention. Psychophysiology 55, e13029 (2017).
Kim, Y. J., Grabowecky, M., Paller, K. A., Muthu, K. & Suzuki, S. Attention induces synchronization-based response gain in steady-state visual evoked potentials. Nat. Neurosci. 10, 117–125 (2007).
Kashiwase, Y., Matsumiya, K., Kuriki, I. & Shioiri, S. Time courses of attentional modulation in neural amplification and synchronization measured with steady-state visual-evoked potentials. J. Cogn. Neurosci. 24, 1779–1793 (2012).
Moratti, S., Clementz, B. A., Gao, Y., Ortiz, T. & Keil, A. Neural mechanisms of evoked oscillations: stability and interaction with transient events. Hum. Brain Mapp. 28, 1318–1333 (2007).
Ding, J., Sperling, G. & Srinivasan, R. Attentional modulation of SSVEP power depends on the network tagged by the flicker frequency. Cereb. Cortex 16, 1016–1029 (2006).
Eidelman-Rothman, M. et al. Sleepless and desynchronized: Impaired inter trial phase coherence of steady-state potentials following sleep deprivation. Neuroimage 202, 116055 (2019).
Rohenkohl, G., Gould, I. C., Pessoa, J. & Nobre, A. C. Combining spatial and temporal expectations to improve visual perception. J. Vis. 14, 8 (2014).
Dmochowski, J. P., Greaves, A. S. & Norcia, A. M. Maximally reliable spatial filtering of steady state visual evoked potentials. NeuroImage 109, 63–72 (2015).
Srinivasan, R., Bibi, F. A. & Nunez, P. L. Steady-state visual evoked potentials: distributed local sources and wave-like dynamics are sensitive to flicker frequency. Brain Topogr. 18, 167–187 (2006).
Muller, T. & Nobre, A. C. Perceiving the passage of time: neural possibilities. Ann. N. Y. Acad. Sci. 1326, 60–71 (2014).
Donoghue, T. et al. Parameterizing neural power spectra into periodic and aperiodic components. Nat. Neurosci. 23, 1655–1665 (2020).
He, B. J. Scale-free brain activity: past, present, and future. Trends Cogn. Sci. 18, 480–487 (2014).
Buzsáki, G. & Draguhn, A. Neuronal oscillations in cortical networks. Science 304, 1926–1929 (2004).
Breska, A. & Ivry, R. B. Context-specific control over the neural dynamics of temporal attention by the human cerebellum. Sci. Adv. 6, eabb1141 (2020).
Amit, R., Abeles, D., Carrasco, M. & Yuval-Greenberg, S. Oculomotor inhibition reflects temporal expectations. Neuroimage 184, 279–292 (2019).
Mento, G., Tarantino, V., Vallesi, A. & Bisiacchi, P. S. Spatiotemporal neurodynamics underlying internally and externally driven temporal prediction: a high spatial resolution ERP study. J. Cogn. Neurosci. 27, 425–439 (2015).
Walter, G., Cooper, R., Aldrige, V. J., McCallum, W. C. & Winter, A. L. Contingent negative variation: an electric sign of sensorimotor association and expectancy in the human brain. Nature 203, 380–384 (1964).
Ng, K. K., Tobin, S. & Penney, T. B. Temporal accumulation and decision processes in the duration bisection task revealed by contingent negative variation. Front. Integr. Neurosci. 5, 77 (2011).
Mento, G. The role of the P3 and CNV components in voluntary and automatic temporal orienting: a high spatial-resolution ERP study. Neuropsychologia 107, 31–40 (2017).
Mita, A., Mushiake, H., Shima, K., Matsuzaka, Y. & Tanji, J. Interval time coding by neurons in the presupplementary and supplementary motor areas. Nat. Neurosci. 12, 502–507 (2009).
Lakatos, P. et al. The leading sense: supramodal control of neurophysiological context by attention. Neuron 64, 419–430 (2009).
Arnal, L. H., Doelling, K. B. & Poeppel, D. Delta–beta coupled oscillations underlie temporal prediction accuracy. Cereb. Cortex 25, 3077–3085 (2015).
Wyart, V., de Gardelle, V., Scholl, J. & Summerfield, C. Rhythmic fluctuations in evidence accumulation during decision making in the human brain. Neuron 76, 847–858 (2012).
Stefanics, G. et al. Phase entrainment of human delta oscillations can mediate the effects of expectation on reaction speed. J. Neurosci. 30, 13578–13585 (2010).
Correa, Á., Lupiáñez, J. & Tudela, P. Attentional preparation based on temporal expectancy modulates processing at the perceptual level. Psychon. Bull. Rev. 12, 328–334 (2005).
Bonnefond, M. & Jensen, O. Alpha oscillations serve to protect working memory maintenance against anticipated distracters. Curr. Biol. 22, 1969–1974 (2012).
Flounders, M. W., González-García, C., Hardstone, R. & He, B. J. Neural dynamics of visual ambiguity resolution by perceptual prior. eLife 8, e41861 (2019).
Kok, P., Mostert, P. & de Lange, F. P. Prior expectations induce prestimulus sensory templates. Proc. Natl. Acad. Sci. USA 114, 10473–10478 (2017).
Fernández, A. & Carrasco, M. Extinguishing exogenous attention via transcranial magnetic stimulation. Curr. Biol. 30, 4078–4084 (2020).
Jigo, M. & Carrasco, M. Differential impact of exogenous and endogenous attention on the contrast sensitivity function across eccentricity. J. Vis. 20, 11 (2020).
Anton-Erxleben, K., Herrmann, K. & Carrasco, M. Independent effects of adaptation and attention on perceived speed. Psychol. Sci. 24, 150–159 (2012).
Brainard, D. H. The psychophysics toolbox. Spat. Vis. 10, 433–436 (1997).
Pelli, D. G. The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spat. Vis. 10, 437–442 (1997).
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J.-M. FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 156869 (2011).
Müller, M. M., Teder-Sälejärvi, W. & Hillyard, S. A. The time course of cortical facilitation during cued shifts of spatial attention. Nat. Neurosci. 1, 631–634 (1998).
Müller, M. M. et al. Effects of spatial selective attention on the steady-state visual evoked potential in the 20–28 Hz range. Cogn. Brain Res. 6, 249–261 (1998).
Norcia, A. M., Appelbaum, L. G., Ales, J. M., Cottereau, B. R. & Rossion, B. The steady-state visual evoked potential in vision research: a review. J. Vis. 15, 4 (2015).
Lachaux, J., Rodriguez, E., Martinerie, J. & Varela, F. J. Measuring phase synchrony in brain signals. Hum. Brain Mapp. 8, 194–208 (1999).
Penttonen, M. & Buzsáki, G. Natural logarithmic relationship between brain oscillators. Thalamus Relat. Syst. 2, 145 (2003).
Hermes, D., Miller, K. J., Wandell, B. A. & Winawer, J. Stimulus dependence of gamma oscillations in human visual cortex. Cereb. Cortex 25, 2951–2959 (2015).
Acerbi, L. & Ma, W. J. Practical Bayesian optimization for model fitting with Bayesian adaptive direct search. Adv. Neural Inf. Process. Syst. 30, 1834–1844 (2017).
Berens, P. CircStat: a MATLAB toolbox for circular statistics. J. Stat. Softw. 10, 1–21 (2009).
Maris, E. & Oostenveld, R. Nonparametric statistical testing of EEG- and MEG-data. J. Neurosci. Meth 164, 177–190 (2007).
Tian, K. J. & Denison, R. N. Data from: Anticipatory and evoked visual cortical dynamics of voluntary temporal attention. Open Science Framework. https://doi.org/10.17605/osf.io/796kb (2024).
Tian, K. J. & Denison, R. N. Code from: Anticipatory and evoked visual cortical dynamics of voluntary temporal attention. GitHub. https://doi.org/10.5281/zenodo.13368267 (2024).
Acknowledgements
This research was supported by National Institutes of Health National Eye Institute R01 EY019691 to M.C., National Institutes of Health National Eye Institute F32 EY025533 to R.N.D., National Institutes of Health National Eye Institute T32 EY007136 to NYU, startup funds from Boston University to R.N.D. and National Defense Science and Engineering Graduate Fellowship to K.J.T. We thank Sirui Liu and Luis Ramirez for assistance with early versions of the experiment. We thank Hsin-Hung Li, Amit Yashar, and members of the Carrasco and Denison Labs for helpful comments. We thank Jeffrey Walker at the NYU MEG lab for technical assistance.
Author information
Authors and Affiliations
Contributions
Conceptualization: R.N.D., D.J.H., M.C. Methodology: R.N.D., K.J.T., D.J.H., M.C. Investigation: R.N.D., K.J.T. Visualization: R.N.D., K.J.T. Supervision: R.N.D., D.J.H., M.C. Writing—original draft: R.N.D., K.J.T. Writing—review & editing: R.N.D., K.J.T., D.J.H., M.C.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Matt Friedl, Freek van Ede, and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Denison, R.N., Tian, K.J., Heeger, D.J. et al. Anticipatory and evoked visual cortical dynamics of voluntary temporal attention. Nat Commun 15, 9061 (2024). https://doi.org/10.1038/s41467-024-53406-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-024-53406-y









