Abstract
Human memory is typically studied by direct questioning, and the recollection of events is investigated through verbal reports. Thus, current research confounds memory per se with its report. Critically, the ability to investigate memory retrieval in populations with deficient verbal ability is limited. Here, using the MEGA (Memory Episode Gaze Anticipation) paradigm, we show that monitoring anticipatory gaze using eye tracking can quantify memory retrieval without verbal report. Upon repeated viewing of movie clips, eye gaze patterns anticipating salient events can quantify their memory traces seconds before these events appear on the screen. A series of five experiments with a total of 145 participants, using either tailor-made animations or naturalistic movies, consistently reveals that accumulated gaze proximity to the event can index memory. Machine learning-based classification can identify whether a given viewing is associated with memory for the event based on single-trial data of gaze features. Detailed comparison to verbal reports establishes that anticipatory gaze marks recollection of associative memory about the event, whereas pupil dilation captures familiarity. Finally, anticipatory gaze reveals beneficial effects of sleep on memory retrieval without verbal report, illustrating its broad applicability across cognitive research and clinical domains.
Introduction
Traditionally, human memory assessment has predominantly relied on explicit retrieval tasks, where participants verbally categorize learned items as familiar or novel1,2,3. However, relying exclusively on verbal reports entails significant limitations. First, from a basic science standpoint, it confounds memory per se with the ability to access and report the memory. Accordingly, it remains unclear whether an underlying neural process, a deleterious effect of brain disease, or the benefit of sleep pertains to memory itself, or to the ability to access or report the memory engram. In addition, studying memory retrieval is limited (or completely impossible) in populations where explicit reports are unreliable or absent, such as in aphasia patients, newborns, or animals. A reliable and well-validated method to assess episodic-like memory of events without an explicit report would represent a major advance. By “episodic-like memory”, we refer to computational definitions (e.g., refs. 4,5,6) that emphasize a memory for an episode (an event of what, where, and when) irrespective of the autonoetic (personal) and conscious aspects of the memory (as in the full definition by Tulving3).
A similar challenge exists in the study of consciousness, where specialized paradigms have been developed to separate the neural correlates of consciousness from those related to its report7. Many such paradigms employ eye tracking, suggesting that an eye-tracking-based “no-report” paradigm could also be effective in studying memory retrieval. Additional motivation arises from rodent studies, where differences in exploratory behavior are routinely used to index memory8,9. Humans and other primates are visual-centric and primarily rely on eye movements to explore their environment10,11,12. Along this line, Kano & Hirata (2015)13 leveraged visual exploration to assess memory retrieval by tracking great apes’ eyes during a single movie clip. Inspired by this approach, we sought to use the exploration profiles of human gaze in a similar way, which could potentially allow us to capture individual memory traces independent of explicit reports.
Over the past two decades, eye tracking has been increasingly linked to human long-term memory14. In particular, a variety of studies in the fields of visual search15,16 and spatial orienting of attention17,18 have examined memory-guided eye movements. Contextual cueing paradigms19, visual expectation paradigms20, and “looking-at-nothing” paradigms21,22 have jointly shed light on how declarative23 and non-declarative18,24 memory can guide human gaze. It is therefore widely accepted that people visually explore static images in relation to the retrieval of memories25,26,27,28,29,30,31,32. For example, the recognition of photographs is associated with a decrease in distributed overt attention, and familiar faces elicit fewer fixations compared to unfamiliar ones29,30,32,33,34.
We aimed to leverage these gaze memory effects to develop a paradigm and analytical approach that uses eye tracking as an alternative to verbal reporting. To this end, we developed and validated a method that uses anticipatory gaze patterns to quantify memory of events. We hypothesized that predictive eye movements during video clip viewing could serve as a proxy for the retrieval of episodic-like memory in adult humans. Due to its non-verbal nature, anticipatory gaze has previously been used to study memory in preverbal infants20,35,36 and toddlers37. Recently, memory-guided predictions, reflected in anticipatory gaze behavior, have regained attention in memory research38,39,40,41,42. Such studies connect predictive eye movement patterns to memory updating41, to scenes with the potential to change39, or to tracking the transition from learning to memory-guided action38. However, it remains unclear how reliably anticipatory gaze can be used as an indicator of episodic-like memory in adults and how such effects relate to explicit reports of episodic-like memories.
Here we introduce and validate MEGA (Memory Episode Gaze Anticipation), a no-report method based on eye tracking during repeated movie viewing to assess memory for events. Participants watched movies with predefined surprising events (SE). When participants watch a movie for the second time and have already formed a memory of the event, their gaze is drawn to its location, anticipating its occurrence before it appears. The movies thus deliberately elicit an anticipatory gaze, similar to a cued recall task. Importantly, this design allows the movies to be presented twice in exactly the same way, with memory being the only difference between the two viewings. In a series of experiments, we validate MEGA as an approach to quantify memory and compare it to explicit memory retrieval. We introduce a distance-based metric on eye-tracking data to capture gaze characteristics probing memory at the single-trial level. We then compare its correspondence with multiple explicit memory reports and contrast it with other eye-tracking metrics, such as pupillometry. Finally, we demonstrate one application by studying the influence of sleep on memory consolidation in a no-report paradigm.
Methods
Participants
We tested a total of 145 participants across the five different experiments. Written informed consent was obtained from each participant prior to their involvement in the study as approved by the Institutional Review Board at Tel Aviv University (Experiments 1,2) or by the Medical Institutional Review Board at the Tel Aviv Sourasky Medical Center (Experiments 3,4). All participants were required to have normal or corrected-to-normal vision, reported overall good health, and confirmed the absence of any history of neurological or psychiatric disorders. Experiment 1 (animation movies) included a total of 34 participants (age range: 19-61, M ± SD = 26.2 ± 9.01 years; 23 (67%) female participants and 11 (33%) male participants). Experiment 2 (animation movies - with elaborate memory assessments) included 32 participants (age range: 18-40, M ± SD = 26.48 ± 4.3 years; 23 (72%) female participants and 9 (28%) male participants), of which we excluded two due to insufficient eye-tracking data. Experiment 3 (naturalistic movies) included 32 participants (age range: 19-44, M ± SD = 27.03 ± 5.09 years; 9 (27%) female participants and 23 (73%) male participants); one participant was excluded because eye tracking failed. For Experiment 4 (naturalistic movies - with sleep consolidation), 19 of 27 participants reached sufficient sleep (sleep efficiency > 50%) and were subsequently analyzed (age range: 20-34, M ± SD = 27.13 ± 3.44 years; 10 (53%) female participants and 9 (47%) male participants). For the control experiment (animation without events), 20 participants were subsequently analyzed (age range: 23-51, M ± SD = 28.8 ± 6.88 years; 14 (70%) female participants and 6 (30%) male participants). Thus, we collected 145 data sets and analyzed the data of 134 participants after the exclusion of 11 participants. Information about sex was provided by participants’ self-report; we did not collect gender.
Experimental procedures
Five different experiments were carried out, each including a first viewing session (encoding), a break (a consolidation period of varying duration), and a second viewing session (retrieval), which included the same movies and, in some cases, additional new movies.
Experiment 1: Tailor-made animation movies
Experiment 1 (Figs. 1–3) was conducted during a daytime session with a 2-h break between the first and second movie viewing sessions. Following the completion of consent forms and eye-tracking setup (see below), the first viewing session started around noon (11:48 AM on average) and lasted an average of 45 min. Participants watched 65 animated movies while pupils and gaze were monitored (see below). The movies were separated by a 2-s fixation cross presented on a blank gray screen. Then, the participants had a ~2-h break in which they were free to leave the lab unsupervised. During the second viewing session, each movie was followed by participants’ feedback indicating their explicit memory recall (“Have you seen this movie before?” with options Yes (1) or No (2)) and a second screen in which they rated their confidence (“How confident are you in your answer?” on a scale from “Not at all (1)” to “Very confident (4)”).
Fig. 1: a Visual stimuli. 34 participants watched 65 custom-made movie clips, each featuring a distinct surprising event. Each surprising event included a specific object at a specific location (red square) at a specific time (orange square). The defining characteristic of these events was their unexpected appearance. The animations ranged from 8 to 15 s. b Analysis rationale: to probe memory retrieval, gaze patterns were compared between the first and second viewings of the same movie. The example heatmap, based on one participant’s data for a single movie, illustrates this rationale. Top row: the average gaze location during the 1st viewing (blue); middle row: the same average gaze during the 2nd viewing (green); bottom row: the difference between the gaze during the 1st and 2nd viewings before the event appears on the screen. Participants were expected to gaze more often toward the location of the upcoming event, indicating their memory of the event. Heatmaps represent gaze locations of a single movie, averaged over time. c Experimental design: 65 movie clips were presented twice in randomized order, with a consolidation interval of 2 h between viewings. Eye-tracking data were collected during both viewings. During the 2nd viewing, after each movie, participants reported whether they recognized the movie clip from the first session and rated the confidence of their response.
Fig. 2: a Gaze proximity measurement: a participant’s gaze trajectory while watching a movie clip. The Euclidean distance to the center point of the surprising event is calculated at each time point in degrees of visual angle (DVA). The QR code leads to an animated illustration of the methodology (for example, a movie with superimposed eye movements and gaze distance measures). b Temporal gaze distance of a single participant during the first (blue) and second (green) viewings of a clip. The SE’s onset is marked by an orange frame, serving as a temporal marker for gaze behavior. c Gaze average distance (GAD) comparison within participants of Experiment 1 across the two viewings (N = 34). GAD is calculated relative to the event center prior to its appearance on the screen and averaged across movies. GAD significantly decreases during the second viewing compared to the first viewing, indicating memory for the event’s location. Black horizontal line: group average; colored area: density estimate of the GAD distribution; dots: individual participant averages; lines connect the 1st and 2nd viewings of the same participant. d Averaged gaze temporal dynamics: an aggregate view of gaze distance from the event across all participants and movies, time-locked to the event onset. The distances as a function of time for the first (blue) and second (green) viewings show the participants’ anticipation of the event based on their prior viewing experience. Participants were surprised and looked at the surprising event immediately after its appearance, visible as the steep slope after the event onset (orange line). ***p < 0.0001.
Fig. 3: a Model development and inference: we utilized an XGBoost classifier, leveraging 87 features derived from the event location and timing to perform single-trial-level classification. The model’s robustness was ensured through a leave-one-subject-out cross-validation technique, where the algorithm was iteratively trained on the dataset excluding one subject, which was then used for testing. b Confusion matrix: the average confusion matrix across all subjects (N = 34), summarizing the classification performance of the XGBoost models. Diagonal entries (top left and bottom right; 71%, 70%) represent the percentage of correct classifications, indicating true positives and true negatives, respectively. Off-diagonal entries (bottom left and top right; 30%, 29%) denote misclassification probabilities. The Receiver Operating Characteristic (ROC, on the right) curves for the XGBoost models, each tested on a distinct subject within our study cohort, plot the true positive rate (TPR) against the false positive rate (FPR) at various decision threshold levels. Each trace represents the ROC curve for a subject-specific model (different color for each participant), delineating the trade-off between sensitivity and specificity. The black line denotes the mean ROC curve across all models. c Performance across individual participants (bottom left): bar chart displaying the classification accuracy for each subject, with the average accuracy across all subjects (blue, 70%) and chance level (red, 50%). Error bars signify confidence intervals reflecting the precision of the model for each subject. d SHAP feature analysis: ranking of the influence of various gaze metrics on the model’s predictions. Out of 87 SE-based features, the distance-based features, particularly GAD, emerged as the most important. e A waterfall plot provides an example of how individual features contribute to a single trial’s classification.
Experiment 2: Animation movies with extensive explicit memory reports
This experiment (Fig. 4) examined in more detail the relation between anticipatory gaze and verbal reports, probing different aspects of memory. The first viewing employed 48 of the same animations used in Experiment 1. During the second session—held after a 2-h break—participants rewatched each of the 48 animations, interleaved with 12 novel films in a randomized order. To enable explicit verbal report, the second-viewing movies were edited to omit the surprising events. Before the second viewing, participants received task instructions and completed a single practice trial to confirm understanding. After each movie, the following five retrieval tasks were presented:
i) Movie recognition: “Do you remember watching this movie before?” (“Yes” or “No”).
ii) Free recall: “Please describe what was missing in the video and where it happened.” Participants were instructed to say their answer aloud (for example: “snake on the right” or “frog in the center”).
iii) Object recognition: two objects were presented, and the participant was asked to indicate, “What was missing in the movie?”. The lure object was chosen to be plausible in the presented environment. For example, if the scene was underwater, possible lures would be “fish” or “crab” but not “elephant”. The lure list was fixed such that each object appeared once as a correct and once as an incorrect answer.
iv) Event location recall: a frame from the movie was presented, and participants were instructed to click on the screen where they thought the event should have happened. An answer was considered correct if they clicked in the correct quadrant of the screen.
v) Temporal recall: participants were asked about the timing of the event within the movie: “When did the surprising event happen?”. Answer options were a) “In the middle of the movie” or b) “At the end of the movie”. Event timing was categorized as middle or end according to the distribution of all events: if an event appeared before the median time of all SEs, it was considered to be in the middle, and vice versa.
Fig. 4: a Experiment 1 (N = 34): gaze in the second viewing of movies explicitly reported as “seen” (green) showed greater proximity to the event location compared with the first viewing (blue) for both recognized and unrecognized movies, signifying memory retrieval. b The MEGA score quantifies the change in GAD between the first and second viewings, calculated using the formula above. The MEGA score is higher for movies that were explicitly remembered versus not remembered. c Procedure of Experiment 2 (N = 30): participants completed a similar task, except that surprising events were omitted from the second viewing, which was followed by an extensive explicit memory report. Memory was assessed to determine what exactly participants remembered. d Replication of the anticipatory gaze effect: consistent with Experiment 1, anticipatory gaze appears in the second viewing (green) and not in the first viewing (blue) across all 30 naïve participants. e Categorization of explicit report: participants did (i) not recognize the movie clip at all (orange), (ii) recognize the context alone (violet), or (iii) recognize the context and recollect the event (green). f The MEGA score is higher for movies in which participants remembered the event in full, compared to those with only scenery memory or completely forgotten movies. There is no significant difference between movies with scenery recognition and movies that were not recognized. g Pupil size variation and memory: the decrease in pre-event pupil size from the first to the second viewing, in relation to scene recognition and event recollection, highlighting smaller changes (1st vs 2nd viewing) in unrecognized cases. h The MEGA score correlates with participants’ event recollection in the free recall. Every dot represents a participant. i If the events are removed completely from the experiment, anticipation diminishes and gaze in the second viewing (green) shows comparable proximity to the event location compared with the first viewing (blue). j MEGA score for the control experiment without any events, in comparison to the MEGA score of Experiment 1 (same procedure for both experiments), revealing chance-level MEGA scores for movies without events. Plots represent the median (bold horizontal line), 95% confidence intervals (whiskers), density plots, and subject averages (dot plots). *p < 0.05, **p < 0.01, ***p < 0.001.
The free recall was self-paced, whereas the other four questions were limited to 5 s. The movie order in the first and second viewing was randomized, but which movies were presented once or twice was fixed and identical for all participants.
Experiment 3: Naturalistic movies
Experiment 3 (Fig. 5) investigated anticipatory gaze in our sleep lab using naturalistic videos from YouTube. After setting up the eye tracking and EEG, the first viewing session started around 2 PM and included watching 100 naturalistic movies. Then, the participants had a 2-h break and were instructed to remain awake. The second viewing session also included 100 movies (80 previously seen movies and 20 new ones) as well as a simple recognition task (“Have you seen this movie before?”, answered “Yes” or “No”) and a confidence rating (“How confident are you in your answer?” on a scale from “Not at all (1)” to “Very confident (4)”), as in Experiment 1. Since the movies were publicly available on YouTube, there was a possibility that the participants had already seen some of them before the experiment. To address this, a recognition task was also carried out during the first viewing, and any trials with positive responses were excluded.
Fig. 5: a Experimental procedure: 31 participants viewed 100 YouTube movies in two sessions spaced 2 h apart. In the 2nd session, 20 novel movies were included along with 80 of the original movies, and verbal reports were collected after each movie. b Average gaze distance to the surprising event for the 1st (blue) and 2nd (green) viewing. A drop in gaze distance that already begins before the event shows some anticipation of the surprising event in naturalistic movies, probably due to narrative and camera movements. c GAD decreases significantly in the 2nd viewing (green) compared to the 1st viewing (blue): paired t-test t(30) = 10.272, p = 2e−11, Cohen’s D = 1.52, 95% CI [0.98, 2.06]. d MEGA scores as a function of explicit memory reports, showing a higher MEGA score for movies that participants remember (brown, p < 0.0001).
Experiment 4: Naturalistic movies with nap or wake
In Experiment 4 (Fig. 6), the setup was identical to Experiment 3 but introduced new participants and a modification: participants were given a nap opportunity during the 2-h break while sleep was monitored with EEG, electrooculogram (EOG), and electromyogram (EMG). Data collection and sleep scoring were previously described43. Data from 8/27 participants were excluded due to short sleep duration (<50% sleep efficiency).
Fig. 6: a Experimental design: participants (N = 19 for the nap condition; N = 34 for the wakeful rest condition) watched naturalistic YouTube videos. The 2-h interval between the 1st and 2nd viewing included either a nap opportunity or wakeful rest. b GAD in the 1st (blue) and 2nd (green) viewing for the nap-condition participants reveals a significant decrease in GAD during the 2nd viewing (p < 0.0001). c A comparison of MEGA scores (normalized difference in GAD between 1st and 2nd viewings) in the wake (brown) and sleep (cyan) conditions reveals greater MEGA after a nap (p = 0.0439, Wilcoxon signed-rank test). d Comparison of the recognition rate in the wake (brown) and sleep (cyan) conditions does not show significant differences in the verbal report (BF10 = 0.390 ± 0%). Box plots represent the median (bold horizontal line), the interquartile range (box, 50% of data), and 95% confidence interval (whiskers).
Control experiment: Animations without an event
In a control experiment, we repeated Experiment 1, but the surprising event was removed from both the first and second viewings. Therefore, any change in gaze between the viewings would reflect changes based on familiarity with the scenery.
Movie stimuli and visual presentation
Overall, we used 185 different movie clips. We presented movies in full-screen mode to maximize engagement. Between the movie clips, a gray background with a fixation cross was displayed for 2 s to standardize the visual field and prepare the participants for the upcoming stimulus. In all experiments, the order of the movie clips was pseudo-randomized for each participant to avoid order effects that could influence memory encoding and retrieval processes. Previously unseen movies were incorporated in the second session to increase task difficulty and enhance performance variability but were not subsequently analyzed. The experiments were coded in Python, using the PsychoPy44 package and the PyLink package, which facilitates interfacing with EyeLink eye-trackers (see below).
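For illustration, a minimal sketch of what such a PsychoPy presentation loop could look like is shown below. The file names, window settings, and the specific MovieStim class are assumptions rather than the actual experiment code, and the pylink calls that control the EyeLink recording are omitted for brevity.

```python
# Minimal sketch of a movie-presentation loop (illustrative only; not the original code).
from psychopy import visual, core, event, constants

win = visual.Window(fullscr=True, color='gray')
fixation = visual.TextStim(win, text='+', color='black')

movie_files = ['clip_001.mp4', 'clip_002.mp4']  # hypothetical paths, pseudo-randomized per participant

for path in movie_files:
    # 2-s fixation cross on a gray background between clips
    fixation.draw()
    win.flip()
    core.wait(2.0)

    movie = visual.MovieStim3(win, path)  # the exact MovieStim class depends on the PsychoPy version
    while movie.status != constants.FINISHED:
        movie.draw()
        win.flip()
        if event.getKeys(['escape']):
            core.quit()

win.close()
```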
For Experiments 1 and 2, we used 65 silent animated colored movie clips. For the naturalistic movies, we included 120 silent black and white movie clips collected from YouTube. The movie clips for Experiment 1 and 2 were custom-designed animations depicting simple ecological scenarios. Precise event timings and locations were equally distributed across the movie length and screen space. Detailed description and examples can be found in Supplementary Information (Supplementary Movie 1 and 2 and Supplementary Fig. S3).
Eye tracking
Eye tracking employed an EyeLink 1000 Plus (SR Research), as in Sharon et al.45, with a sampling rate of 500 Hz. We first determined the dominant eye of each participant, utilizing a modified version of the Porta test46,47. Participants were then instructed to position their heads on a chin rest 50–70 cm from the screen to maximize eye-tracking quality. Next, a 9-point calibration and validation process was performed until the error was below 0.5° of visual angle.
Event-related analysis
The gaze coordinates and pupil size were computed for each movie seen by the participants. A custom validation tool was employed to ensure data integrity, running several checks, such as verifying the correct number of files and recording rates (code available). Movie trials with more than 30% tracking loss were excluded. Furthermore, all movies that were not shown in the first encoding session were excluded as well (12 animations & 20 naturalistic videos). Next, we computed the Euclidean gaze distance between each gaze coordinate and the center point of the event location (in degrees of visual angle, DVA). Data quality was visually verified; examples can be found in Supplementary Movies 1 and 2.
Gaze average distance (GAD)
GAD quantifies the mean Euclidean distance between each gaze data point and the SE center point, calculated separately for each time point prior to the appearance of the SE and averaged across all computed distances (not just fixations). SE center points were a priori defined by the animation studio that compiled the movies. Thus, GAD captures a cumulative estimate of how closely participants’ gazes approximate the SE. GAD was averaged for each subject across all movies within each session.
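As a minimal sketch (assuming gaze samples and the event center are already expressed in degrees of visual angle, and using hypothetical variable names), GAD could be computed as follows:

```python
import numpy as np

def gaze_average_distance(gaze_x, gaze_y, event_x, event_y, times, event_onset):
    """Mean Euclidean distance (in DVA) from every pre-event gaze sample to the
    surprising-event center point (all samples, not only fixations)."""
    pre = np.asarray(times) < event_onset                 # keep only samples before SE onset
    dist = np.hypot(np.asarray(gaze_x)[pre] - event_x,
                    np.asarray(gaze_y)[pre] - event_y)    # per-sample Euclidean distance
    return np.nanmean(dist)                               # NaNs mark blinks / tracking loss
```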
MEGA Score: a metric for gaze anticipation
The MEGA Score is a normalized metric reflecting how anticipatory gaze behaviors change across repeated viewings of the same movie: MEGA Score = (GAD1st viewing − GAD2nd viewing) / max(GAD1st viewing, GAD2nd viewing). Because the averaging of ratios is biased, specifically if the denominator changes, we divided by either the average of the 1st viewing or the average of the 2nd viewing, depending on which was larger. Importantly, central outcomes remain consistent regardless of whether normalization was applied or which specific normalization method was used (no normalization, normalization by the max, by the baseline, or by the mean). Higher MEGA Scores indicate that participants looked closer to upcoming SEs during the second viewing, reflecting enhanced anticipatory gaze behavior guided by memory.
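A correspondingly minimal sketch of the score, using hypothetical variable names:

```python
def mega_score(gad_first, gad_second):
    """Normalized anticipatory-gaze score for one movie:
    (GAD_1st - GAD_2nd) / max(GAD_1st, GAD_2nd).
    Positive values indicate gaze closer to the upcoming event during the 2nd viewing."""
    return (gad_first - gad_second) / max(gad_first, gad_second)

# e.g., mega_score(18.1, 16.9) ~= 0.066, on the order of the group averages reported below
```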
Pupillometry
Pupil size was extracted during all fixations prior to the surprising event using the EyeLink segmentation tool. Trials with >30% missing data were excluded. Pupil size prior to the events was used to compute a normalized score (as for the MEGA score): pupil size score = (pupil size1st viewing − pupil size2nd viewing) / max(pupil size1st viewing, pupil size2nd viewing). The pupil size score provides a nuanced measure of change in pupil size across repeated movie viewings, where a more negative pupil size score reflects a larger pupil size during the 2nd viewing.
Behavioral analysis
To disentangle the relationship between anticipatory gaze and explicit report, we collected verbal reports in all four experiments. For Experiments 1, 3, and 4, we asked after each movie “Have you seen this movie before?” (“Yes” or “No”) and collected a confidence rating. In Experiment 2, five retrieval tasks were presented: i - movie recognition, ii - free recall, iii - object recognition, iv - location recall, v - temporal recall. For the anticipatory gaze analysis in relation to the explicit report, we computed the MEGA score for each movie. To estimate the difference between the first and the second viewing, we averaged the MEGA score over all movies for each participant (‘new’ movies were excluded) and then tested the MEGA score against chance. To estimate the subsequent memory effect, we compared the average MEGA scores of recognized and unrecognized movies. This was done for naturalistic and animated movies. Moreover, we used the same procedure to compare the sleep and the wake groups. In Experiment 2, anticipatory gaze was evaluated in relation to event recollection. Based on the four forced-choice tasks (i – movie recognition, iii – object recognition, iv – location recognition, v – early or late SE), we defined and evaluated three labels of remembered memory content: (A) context and event recollection, (B) context recognition, and (C) not recognized. The free recall (ii) was coded and analyzed separately. An extensive description of the process and the result of each task is available as Supplementary Fig.
Single-trial decoding of movie viewing
Raw eye-tracking data were transformed into 243 engineered features comprising fixation-, saccade-, blink-, and pupil-related parameters. 87 of these parameters were defined in relation to the location and timing of the surprising event (see Supplementary Notes 1, 2). An ensemble of XGBoost classifiers was employed, optimized using grid search, and evaluated with leave-one-subject-out (LOSO) cross-validation to classify first and second viewings of the movies. Model performance was assessed using classification accuracy, confusion matrices, and ROC curves. The statistical significance of the classification was quantified against chance level (50%) by computing a one-sample t-test on the ROC AUC scores. SHAP analysis was applied to interpret the contributions of individual features to predictions, providing insights into the model’s decision-making process.
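A simplified sketch of this decoding pipeline is shown below. The feature matrix, labels, and participant IDs are assumed inputs, and the hyperparameters are placeholders rather than the grid-searched values used in the study.

```python
import numpy as np
from xgboost import XGBClassifier
from sklearn.model_selection import LeaveOneGroupOut
from sklearn.metrics import accuracy_score, roc_auc_score

def loso_decode(X, y, groups):
    """Leave-one-subject-out decoding of 1st vs 2nd viewing from single-trial gaze features.
    X: (n_trials, n_features) numpy array; y: 0 = 1st viewing, 1 = 2nd viewing;
    groups: participant ID per trial (all assumed to be prepared elsewhere)."""
    accs, aucs = [], []
    for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups):
        model = XGBClassifier(n_estimators=200, max_depth=3)   # placeholder hyperparameters
        model.fit(X[train_idx], y[train_idx])
        prob = model.predict_proba(X[test_idx])[:, 1]
        accs.append(accuracy_score(y[test_idx], (prob > 0.5).astype(int)))
        aucs.append(roc_auc_score(y[test_idx], prob))
    return np.array(accs), np.array(aucs)
```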
Statistical analysis
None of the reported experiments were preregistered. We used parametric methods for statistical testing when the data were normally distributed. Specifically, we computed the suitable t-tests and Cohen’s D (95% CI) for direct comparisons. For comparisons of three groups (event memory, scenery memory, no memory), one-way ANOVAs were computed, and effect sizes were estimated using eta-squared (η²), which represents the proportion of variance explained by the independent variable. Post hoc comparisons were conducted using Tukey’s Honestly Significant Difference (HSD) test following a one-way ANOVA to identify pairwise differences between group means. In Experiment 2, we corrected for multiple comparisons when examining all the verbal report tasks. Where applicable, normal distribution and equal variances were formally tested. For non-normally distributed data, we used the Wilcoxon signed-rank test and report rank-biserial effect sizes. The statistical significance of ML classification was quantified by applying a one-sample t-test to the ROC AUC scores compared to chance level (50%). To evaluate the evidence for the null hypothesis, we conducted Bayesian null hypothesis testing by computing Bayes factors (BF₁₀), quantifying the relative likelihood of the data under the alternative versus the null model. The prior distributions used to compute BF₁₀ were informed by effect sizes from the most methodologically similar experiment. Pearson’s correlation coefficient was used to assess the linear relationship between variables. All t-tests were two-sided, except for the sleep–wake comparison, which was one-sided due to a strong a priori expectation that sleep would enhance memory retrieval. An error probability of 5% was chosen for all statistical tests, which were computed in either R or Python. The sample size of Experiment 1 was similar to those generally used in the field. Sample sizes for all subsequent experiments were estimated with a power analysis based on the effect size of Experiment 1 (Cohen’s D = 1.8).
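For illustration, a minimal sketch of the core within-subject comparison in Python (hypothetical input arrays; Cohen's D is computed here on the paired differences, which is one common convention, since the specific variant is not stated in the text):

```python
import numpy as np
from scipy import stats

def paired_cohens_d(first, second):
    """Cohen's D for a within-subject contrast, computed on the paired differences."""
    diff = np.asarray(first) - np.asarray(second)
    return diff.mean() / diff.std(ddof=1)

# e.g., per-participant GAD averages from the 1st and 2nd viewings (hypothetical arrays):
# t, p = stats.ttest_rel(gad_first, gad_second)     # two-sided paired t-test
# w, p = stats.wilcoxon(gad_first, gad_second)      # non-parametric paired alternative
# d = paired_cohens_d(gad_first, gad_second)
```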
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Results
We constructed the Memory Episode Gaze Anticipation (MEGA) paradigm with the aim of capturing memory retrieval in its raw form, without the additional layer of verbal reports. Participants viewed short (8–23 s) movie clips in two sessions conducted a few hours apart while gaze was monitored using an infrared video-based eye-tracking system (see Methods). Each movie contained a surprising event that saliently occurred at an unexpected location and time (e.g., an animal suddenly appearing behind a rock, Fig. 1a; Supplementary Movies 1, 2). We hypothesized that in the first viewing, gaze patterns before the event occurs (‘pre-event’) would be exploratory, whereas, in the second viewing, after the formation of memory, gaze patterns would anticipate the event and preferentially occur around the event location (Fig. 1b). Accordingly, the memory for the surprising event may manifest as a difference between the 1st and 2nd viewings at pre-event intervals around the anticipated location of the event (red rectangle and heat maps in Fig. 1b). To test this, the first experiment included 34 participants who viewed 65 movie clips in two viewing sessions separated by a 2-h break (Fig. 1c and see detailed Methods). In the 2nd viewing session, verbal reports and confidence measures were collected after each movie to allow comparison with standard explicit reports.
Gaze anticipation indexes memory for events
To quantify anticipatory gaze towards the remembered location in each movie viewing, we assessed, for each time point separately, the Euclidean distance from the gaze location to the event location. Figure 2a illustrates this calculation, which utilized all data points to transform the multivariate eye-tracking data into a single time series that encapsulates anticipatory gaze behavior. During the 1st viewing, the distance from the event location was mostly large, but upon 2nd viewing, we observed that the gaze gravitated toward the expected location of the event before its onset (see a representative example of a single movie in Fig. 2b). We quantified this by the Gaze Average Distance (GAD), the mean distance from each gaze point to the event location from the movie’s beginning until the event onset. GAD is a simple indicator that can reveal a tendency to gaze closer to the event location before its onset. Computing the GAD across all movies for each participant (Fig. 2c) revealed a significant convergence to the event location upon 2nd viewing, observed in 31/34 (91%) of participants (mean GAD 1st viewing = 18.13 ± 0.62° vs. mean GAD 2nd viewing = 16.91 ± 1.05°; t(33) = 6.199, p = 5.377e-07; Cohen’s D = 1.4, 95% CI [0.87, 1.84]). To test if implicit learning may play a role, we compared the GAD for the first ten movies to the last movie in the 1st viewing session. GAD values were comparable (first ten vs last movie: BF10 = 0.20 ± 0.06%, prior according to subsequent memory effect), suggesting that effects were not driven by implicit learning of the surprising event’s appearance. Next, to observe the temporal dynamics of anticipatory gaze, GAD was averaged across all movies and participants without averaging across time (Fig. 2d). This analysis revealed a closer gaze towards the event location in the 2nd viewing that was present throughout the seconds leading to its appearance (Fig. 2d, green time-course), followed by a sudden drop in GAD after the event’s appearance (reflecting gaze towards the surprising event once it occurred). The anticipatory effect, measured as the difference in GAD between the first and second viewing, remained stable over the entire pre-event interval, without evidence for a temporal reinstatement specifically timed to the event (Supplementary Fig. S4).
Machine learning discriminates first and second viewings at a single trial resolution
We tested to what extent machine learning (ML)-based classification of multiple eye-tracking features could extend the intuitive GAD metric and accurately identify memory traces at the single-trial level. First, we focused on the reduction in gaze distance as captured by the GAD metric. We employed the XGBoost classification algorithm48 at the single-trial level, complemented by a leave-one-subject-out cross-validation method (Fig. 3a). Models achieved an average correct classification of 69 ± 10% (chance: 50%), demonstrating the model’s ability to distinguish, based only on GAD, whether a single trial’s data represented the first movie viewing or a movie that had been viewed before (1st or 2nd viewing; Fig. 3b). The Receiver Operating Characteristic (ROC) curves for each subject-specific model and the average ROC curve indicated consistent model performance across subjects (Fig. 3c). Accordingly, the area under the ROC curve (AUC-ROC) was 0.75 ± 0.26, quantifying the model’s overall effectiveness (one-sample t-test: t(33) = 10.76, p = 2.47e-12, Cohen’s D = 0.96, 95% CI [0.55, 1.36]). In 32/34 of the participants (94%), model accuracy was greater than chance level, with an average accuracy of 0.69 ± 0.1, attesting to its reliability in identifying memory traces (Fig. 3d).
Next, we employed an exploratory ‘bottom-up’ ML strategy by broadening our analytical scope to encompass a wider array of eye-tracking features, irrespective of our initial GAD metric. We manually engineered multiple features from the eye-tracking data centered around the event location, thereby minimizing potential session-specific confounds such as differences in time of day and cognitive load due to the tasks and reports in the 2nd viewing (Methods). These features (87 in total) included aspects such as fixation count ratios relative to the event location, the velocity and visual angle of saccades directed towards the event, and the change in pupil radius during fixations within the event location compared to prior fixations, thereby capturing event-related spatial and temporal eye-tracking features. With these features, models achieved an average classification accuracy of 71 ± 11.73% and 70 ± 12.34% for 1st and 2nd viewings, respectively. Associated AUC-ROC metrics exhibited an average of 0.75 ± 0.26 (one-sample t-test: t(33) = 10.85, p = 1.98e−12, Cohen’s D = 1.86, 95% CI [1.3, 2.41]). In 32/34 of the participants (94%), model accuracy was greater than chance level, with an average accuracy of 0.7 ± 0.11. Remarkably, our initial analyses based solely on the GAD metric already showed performance comparable to the extensive 87-feature model (see Supplementary Note 2). A logistic regression based on GAD alone, applied with the same leave-one-subject-out cross-validation method, was able to differentiate between first and second viewings in 29/34 of the participants (85%; average accuracy 0.65 ± 0.12). Associated AUC-ROC metrics exhibited an average of 0.68 ± 0.15 (one-sample t-test: t(33) = 7.22, p = 2.84e−08, Cohen’s D = 1.24, 95% CI [0.78, 1.68]). This underscores GAD’s predictive value and shows that distance metrics alone are sufficient for precise identification of episodic-like memory traces in the MEGA paradigm.
To further understand which eye-tracking features are most effective in capturing event memory, we incorporated SHAP (SHapley Additive exPlanations) feature importance analysis, a game theory-based method to distill and quantify each feature’s influence on the model’s output49. SHAP provides an average impact of each feature on model prediction by examining its performance with and without the presence of each feature across all possible feature combinations. This analysis identified distance-based features as particularly informative, with GAD ranking as the top feature (Fig. 3e). Along the same lines, a waterfall plot of a representative trial illustrates how individual features (especially GAD) cumulatively influence the model’s decision-making process for a single trial, highlighting the predictive power of GAD in indexing memory in the MEGA paradigm (Fig. 3f).
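For illustration, a minimal sketch of such a SHAP analysis with recent versions of the shap package, assuming a fitted XGBoost model and a data frame of named features (the variable names are illustrative, not the study's code):

```python
import shap

# `model` is assumed to be a fitted XGBClassifier and `X` a pandas DataFrame of the 87
# event-centered features with named columns (e.g., a 'GAD' column).
explainer = shap.TreeExplainer(model)
shap_values = explainer(X)                 # per-trial, per-feature contributions

shap.plots.bar(shap_values)                # global ranking by mean |SHAP| per feature
shap.plots.waterfall(shap_values[0])       # contribution breakdown for a single trial
```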
Anticipatory gaze marks event recollection, whereas pupil size indexes context recognition
What aspects of memory does anticipatory gaze capture? To what degree does it reflect episodic-like memory for the event versus familiarity with the general context? To address these questions, we first tested how MEGA relates to simple verbal reports (‘Have you seen this movie before?’). We compared GAD scores in movies reported as ‘seen before’ vs. ‘not seen before’, focusing only on trials with reports associated with high confidence ratings (Fig. 1b). We found a significant reduction in GAD upon 2nd viewing for movies that were reported as seen before (GAD1st v. = 18.1 ± 0.75°, GAD2nd v. = 16.9 ± 1.1°, t(33) = 5.79, p = 1.748e−06, Cohen’s D = 1.29, 95% CI [0.81, 1.76], Fig. 4a) but also a significant effect for movies that were (incorrectly) reported as ‘not seen before’ (GAD1st v. = 18.5 ± 2°, GAD2nd v. = 17.58 ± 1.85°, t(33) = 5.24, p = 9.105e−06, Cohen’s D = 0.48, 95% CI [0.11, 0.85], Fig. 4a). To directly compare the difference between the first and second viewings of the explicitly recognized and not-recognized movies, we computed the normalized decrease of GAD from 1st to 2nd viewing ((GAD1st viewing − GAD2nd viewing) / max(GAD1st viewing, GAD2nd viewing); Fig. 4b, Methods). Higher values reflect an anticipatory effect during the second viewing and thus reflect memory-guided behavior. This score exhibited a trend towards higher values for movies that were subsequently recognized than for not-recognized movies (MMEGA recognized = 0.07 ± 0.07, MMEGA not recognized = 0.05 ± 0.05, paired t-test: t(33) = 1.99, p = 0.055, Cohen’s D = 0.35, 95% CI [0.00, 0.71], Fig. 4b and time course in Supplementary Fig. S4).
To better understand what aspects of memory are captured by anticipatory gaze beyond ‘Have you seen this movie before?’, a second experiment was conducted to evaluate multiple dimensions of memory reports in detail, aiming to distinguish between event recollection and context recognition (Fig. 4c). Participants watched 48 animated movies depicting similar surprising events. After a 4-h break, they watched these movies again together with 12 novel movies in a randomized order. However, in this experiment, the 2nd session included the exact same movies, except the surprising event was omitted. Because the surprising event was not presented during the 2nd viewing, we could follow up with an array of retrieval tasks immediately after each movie. These tasks aimed to provide additional sensitivity to better investigate the relationship between explicit memory and anticipatory gaze. We collected the following five verbal reports: recognition, free recall, object recognition, event location recall, and temporal recall (for details see Methods).
Results reliably replicated the anticipatory gaze effect with a new group of participants: 30/30 participants (100%) demonstrated significantly greater gaze proximity to the event location during the second viewing (MMEGA-score = 0.086 ± 0.037, t(29) = 12.8, p = 2e−13, Cohen’s D = 2.34, 95% CI [1.61, 3.07], Fig. 4d). Next, we analyzed the GAD of each movie based on its explicit report in three categories (Methods): event recollection (correct movie recognition, as well as correct object recognition and event location recall), context recognition (scenery was recognized, but object recognition or event location recall was incorrect), and unrecognized movies (incorrect recognition response, independent of the answer in event location recall or object recognition). Temporal recall (when precisely the event occurred) was not further analyzed because retrieval performance was at chance (t(29) = 0.45, p = 0.65, Cohen’s D = 0.08, 95% CI [−0.27, 0.44]). Behaviorally, retrieval performance for movie recognition, object recognition, and location recall was significantly above chance (recognition: t(29) = 15.0, p = 3.21e−15, Cohen’s D = 2.74, 95% CI [1.95, 3.53]; object recognition: t(29) = 36.8, p < 2.2e−16, Cohen’s D = 1.58, 95% CI [1.03, 2.11]; location recall: t(29) = 11.8, p = 1.36e−12, Cohen’s D = 2.15, 95% CI [1.49, 2.81], Supplementary Fig. S1).
Analyses revealed that MEGA scores were significantly higher for movies where participants recollected the event in full, highlighting the sensitivity of MEGA to episodic-like recall (Fig. 4e, f). Accordingly, an ANOVA with the MEGA score as the dependent variable and memory content (event, context, and no memory) as a factor revealed a significant main effect (F(2,87) = 14.11, p = 4.9e−06, η² = 0.24, 95% CI [0.12, 1.00]). Post-hoc pairwise comparisons revealed a significantly higher MEGA score for full recollection of the event compared to MEGA scores of movies where only the context was recognized (Mevent = 0.12 ± 0.06, Mcontext = 0.08 ± 0.04, pTukey = 0.002) or compared to unrecognized movies (Mevent = 0.12 ± 0.06, Mno-recognition = 0.05 ± 0.05, pTukey < 0.001), but the latter two conditions did not differ significantly (context vs. no-recognition: pTukey = 0.2). In fact, Bayesian statistics suggest that the anticipatory gaze for movies where the scenery was familiar is similar to that for movies that were not recognized (BF10 = 0.32, prior set according to the effect size of the Mevent vs Mno-recognition comparison).
Moreover, the MEGA score of each movie correlated with participants’ precision in reporting the event location, such that stronger anticipatory gaze effects were associated with higher proximity to the event location (Pearson r = 0.22, p < 2.2e−16, 1438 movies, Cohen’s D = 0.45, 95% CI [0.35, 0.56]). Accordingly, in each trial, the higher the precision in explicitly reporting the event location, the closer the anticipatory gaze was to that location before it occurred on the screen. Likewise, the MEGA score of participants correlated with the number of trials categorized as event recollection (Pearson r = 0.37, p = 0.043, N = 30, Cohen’s D = 0.80, 95% CI [0.02, 1.69]) but did not correlate with the number of trials that were not recognized (r = −0.046, p = 0.81, N = 30, Cohen’s D = −0.09, 95% CI [−0.87, 0.68]). The correlation with the number of trials where only the context was recognized was marginally significant and negative (r = −0.36, p = 0.05, N = 30, Cohen’s D = −0.77, 95% CI [−1.66, 0]).
Finally, we investigated anticipatory gaze in relation to participants’ free recall of the surprising events, rather than location recall and forced-choice object recognition. As expected, the MEGA score was larger if the object of a surprising event was explicitly recalled, compared to movies for which participants could not remember the event’s object (free recall: MEGA score recalled = 0.14 ± 0.08, MEGA score not recalled = 0.07 ± 0.03, paired t-test: t(28) = 5.46, p = 7.8e−06, Cohen’s D = 1.22, 95% CI [0.71, 1.72]). Furthermore, this difference in anticipatory gaze increased linearly with each participant’s ratio of recalled to forgotten objects (Pearson r = 0.56, p = 0.0016, N = 29, Cohen’s D = 1.35, 95% CI [0.5, 2.4]).
To further distinguish MEGA from context recognition, we focused on pupil dilation as an index of familiarity upon repeated stimulus presentation50,51,52,53. Specifically, previous findings suggest that pupil dilation is increased for recognized words compared to unrecognized words54,55. In line with this literature, we also found that pupil size was larger for the second viewing compared to the first viewing (pupil size 1st v = 4436 ± 378, pupil size 2nd v = 4648 ± 378, t(29) = −5.5, p = 6.3e−06, Cohen’s D = −1, CI [−1.44, −0.56]). Next, we compared changes in pupil size across repeated movie viewings in relation to the verbal report. To this end, we computed a normalized pupil size score using the same formula used for the MEGA score (pre-event time window, negative values reflect a larger increase in pupil size). We found a larger increase in pupil size for movies that were recognized compared to movies that were not recognized (higher pupil size score for ‘not recognized’ in Fig. 4g). ANOVA with pupil dilation score as the dependent variable and the explicit memory questions as a factor revealed a significant main effect (F(2,87) = 5.05, p = 0.008, η² = 0.1, 95% CI [0.02, 1.00]). Post-hoc pairwise comparison revealed a significantly higher pupil score for movies where the event was recollected in full (Mevent = −0.059 ± 0.048, Mno-recognition = −0.021 ± 0.054, pTukey = 0.013) and for movies where the context was recognized (Mcontext = −0.054 ± 0.049, Mno-recognition = −0.021 ± 0.054, pTukey = 0.033) compared to unrecognized movies. Crucially, pupil dilation score did not differ between the movies with event recollection and context recognition, suggesting no specific relationship with event memory (event vs context: pTukey = 0.93, BF10 = 0.30 ± 0.03%, default prior = 0.707). This was stable over the whole pre-event time interval (Supplementary Fig. S5). These findings suggest that while the pupil is indicative of recognition of the context alone, the anticipatory gaze is guided by a richer memory that includes the recollection of event details, such as its location within the context.
Anticipatory gaze reflects relational memory rather than familiarity
Next, we conducted another control experiment to investigate the possibility that gaze differences during the second viewing might reflect familiarity with the scenery rather than relational memory. Specifically, in our example movie (Supplementary Movie 2), there is the possibility that after watching the giraffe for the second time, the viewer - due to familiarity or just being bored - looks systematically away from that location and/or closer to the upcoming surprising event. Could this account for the anticipatory gaze, independent of the relational retrieval of the SE? To test this, we ran a control experiment presenting the movies without surprising events in either the first or the second viewing (i.e., just the same background scenery). A reduction of GAD (distance to the SE location) between the first and second viewing without any events would suggest that familiarity can explain anticipatory gaze effects, while a lack of GAD difference between viewings would suggest that anticipatory gaze reflects relational memory. We found that the MEGA score was significantly smaller in this control compared to Experiment 1 (Mcontrol = 0.006 ± 0.028, t-test: t(52) = 4.186, p = 0.0001, Cohen’s D = 1.8, 95% CI [0.6, 1.8]) and Experiment 2 (t-test: t(48) = 8.247, p = 2.923e−09, Cohen’s D = 2.4, 95% CI [1.6, 3.1]), confirming that unexpected events strongly modulate anticipatory gaze behavior (Fig. 4i, j). Crucially, to test whether mere familiarity with the scenery could account for anticipatory gaze shifts, we compared GAD measures between the first and second viewings in this no-surprise control condition. We observed no significant change in GAD from the first to the second viewing (t-test: t(49) = 0.94726, p = 0.3554, 95% CI [0.5636226, 1.7956199]). We confirmed this using a Bayesian analysis, which supported the null hypothesis. When anticipation magnitudes from Experiment 2 (prior Cohen’s D = 1.34; BF10 = 0.22 ± 0.09%) and Experiment 1 (prior Cohen’s D = 0.88; BF10 = 0.31 ± 0.04%) were incorporated as informative priors, the Bayes factor indicated evidence against familiarity-driven effects. Thus, these results suggest that relational memory processes, rather than familiarity with the scene, underpin the observed anticipatory gaze patterns.
Anticipatory gaze is replicated in naturalistic movies
To what extent can anticipatory gaze be revealed using other movies, not necessarily animations compiled specifically for scientific research? We set out to test the degree to which anticipatory gaze captures episodic-like memory recall in settings that closely mimic real-world experiences, with the aim of bridging laboratory research and everyday memory. To this end, we performed a third experiment, where 32 naïve participants viewed 100 YouTube videos (Fig. 5a). First, it was necessary to define the surprising event location and timing, since these were not defined a priori as in the tailor-made animations. A group of 55 independent participants marked the spatial and temporal coordinates of the surprising event. Each movie’s event location was defined as the median of their chosen coordinates. 48 movies that exhibited a maximal level of consensus (within one standard deviation of the median time or location) were used for subsequent analysis. Once again, analysis of GAD preceding the surprising event replicated the anticipatory gaze effect, with a completely different set of stimuli and a different group of subjects. A significant increase in gaze proximity to the event location was observed upon 2nd viewing in all participants but one, robustly indexing memory without report (GAD1st v. = 10.96 ± 0.46°, GAD2nd v. = 10.04 ± 0.72°, paired t-test: t(30) = 10.272, p = 2e−11, Cohen’s D = 1.52, 95% CI [0.98, 2.06], Fig. 5b, c). Increased proximity of gaze to the event location was evident throughout the seconds leading to the event (Fig. 5b). GAD declined upon the event onset in both 1st and 2nd presentations, reflecting gazing towards the event once it occurred. This drop was less steep and, interestingly, began already prior to event onset, arguably because participants’ gaze was already “drawn” towards the event location by narrative cues in naturalistic movies, in contrast to the highly unexpected event appearance in tailor-made animations. Analyzing GAD and MEGA computed separately for recognized and unrecognized movies (according to verbal report, Fig. 5a) showed that the MEGA score significantly exceeded chance level for both remembered and forgotten movies, replicating the observation in the previous experiments (explicitly remembered trials: t(30) = 10.75, p = 8e−12, Cohen’s D = 1.93, 95% CI [1.30, 2.56]; explicitly forgotten trials: t(30) = 5.71, p = 3e−6, Cohen’s D = 1.02, 95% CI [0.57, 1.48], Fig. 5d). Together, the results show that the anticipatory gaze effect robustly replicates with naturalistic movies, attesting to the utility of this approach in diverse contexts, including real-life situations.
Anticipatory gaze reveals sleep’s benefit for memory consolidation without report
We demonstrate one potential application of the MEGA paradigm by examining sleep benefits for memory consolidation. Naïve participants were recruited for a fourth experiment, viewing naturalistic videos (identical to Experiment 3) in two sessions separated by a 2-h break that included either a nap opportunity for some individuals (n = 19) or an equally long interval of wakefulness for other individuals (n = 34). First, regardless of sleep or wake, we replicated the anticipatory gaze effect with a new group of participants. We observed significantly lower GAD reflecting anticipatory gaze (Fig. 6b) in 16/19 of participants (84%), substantiated by statistical analysis (GAD1st v. = 11.3 ± 0.62°, GAD2nd v. = 10.12 ± 0.87°, Wilcoxon signed-rank test: V(18) = 190, p = 4e−6; rank-biserial effect size = 0.88, 95% CI [0.88, 0.88]). This constitutes a third successful replication of the anticipatory gaze effect, reinforcing its reliability in capturing memory. Next, comparing sleep and wake, we found that the MEGA score in the nap condition was 23.6% higher than in the wake condition (Mwake = 0.07 ± 0.048, Mnap = 0.09 ± 0.048, Wilcoxon signed-rank test: z(N₁ = 19, N₂ = 34) = 1.71, p = 0.0439, Cohen’s D = 0.48, 95% CI [0.1, 1.1], Fig. 6c). In contrast, the verbal reports did not reflect this pattern. Bayesian statistics suggest that explicit recognition rates were comparable following sleep and wake consolidation (BF10 = 0.39 ± 0%, prior set according to the effect size of the anticipation effect, Fig. 6d). Participants recognized 76.8 ± 7.1% of the movies after sleep and 74.3 ± 7.5% after wakefulness (Wilcoxon signed-rank test: W = 401, p = 0.098). This discrepancy highlights the sensitivity of the MEGA paradigm to effects that may be masked when only explicit verbal reports are considered. Overall, these results indicate that the nap had a positive impact on memory consolidation as reflected in anticipatory gaze, demonstrating the potential of MEGA as a no-report paradigm for studying the relation between sleep and episodic-like memory consolidation.
Discussion
This study shows that tracking gaze during repeated viewings of movies with surprising events constitutes an effective method for investigating memory without verbal reports. The results establish that during the second movie viewing, the gaze gravitates towards the event location, exhibiting memory-guided prediction that anticipates its occurrence. The gaze average distance (GAD) can be used as an intuitive metric to capture the degree of this predictive anticipation, showing significantly higher proximity to the event location during the second viewing. The anticipatory gaze effect was captured before changes were visible in the movie and even when the surprising event was entirely absent from the second movie viewing. This establishes that it corresponds to memory-guided prediction irrespective of the visual cues that mark the event. Anticipatory gaze is a highly robust effect that was consistently observed and replicated several times across multiple stimulus types and in different naïve groups of participants (N = 134), attesting to its versatility and utility across settings. Machine learning classification of features extracted from gaze data identifies memory traces at a single-trial level in new participants. In a separate experiment where we collected verbal reports about the movie events, we found that anticipatory gaze effects were largest when the surprising event was fully recalled, showing a dissociation from pupil size, which was not associated with recalling the surprising event. Finally, we illustrate how applying MEGA without verbal reports can effectively replicate the classical beneficial effect of sleep on declarative memory43,56,57. MEGA can be successfully employed using either naturalistic movies or custom animations specifically designed for this purpose, a resource made available for any future follow-up study (https://yuvalnirlab.com/resources/).
The MEGA paradigm relies on one-shot encoding of movies containing scene-event pairs that are retrieved after 2 h. Anticipatory gaze reflects the recollection of specific events, positioning MEGA as a no-report variation of a cued recall task. By creating our own paradigm with tailor-made animations, we extend previous research by Kano & Hirata13 on great apes, as well as adaptations of their paradigm36,58. We combine elements from decades of research on visual exploration of images25,27,28,29,30,53, visual search15,16 and contextual cueing19,21,22 in picture scenes, as well as long-term-memory-guided anticipatory spatial orienting of attention23,59 and anticipatory viewing behaviors38,39,40,41,42. We integrated these elements into tailor-made animations designed to elicit memory-guided anticipatory gaze toward the location of the upcoming event, thereby maximizing single-trial accuracy of event recollection. Thus, our work adds to the growing literature on utilizing anticipatory gaze as a marker of relational memory26,27,60.
Two complementary approaches were used to quantify the anticipatory gaze effect. The first and most intuitive metric, GAD, captures the average Euclidean distance to the location of the surprising event from the beginning of the movie presentation until the event occurs, and enables identification of memory-guided gaze in ~90% of participants. A second, machine learning approach using the XGBoost classification algorithm assesses the predictive power of single-trial gaze data. Surprisingly, machine learning applied to merely 7-second intervals of gaze data during pre-event movie viewing, without any averaging across movies or participants, was sufficient to identify whether a given viewing was associated with memory for the event (first or second viewing) significantly above chance. Strikingly, classification performance was maintained even when gaze distance alone was used instead of the comprehensive set of 87 gaze-related eye-tracking features. A post-hoc comparison of feature importance confirmed that GAD was the most informative feature.
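To make these two quantification steps concrete, the sketch below shows (i) a GAD-style mean Euclidean gaze-to-event distance over the pre-event window and (ii) a single-trial XGBoost classifier evaluated on held-out participants. It is a minimal sketch under stated assumptions: the variable names, the 500 Hz sampling rate, the placeholder feature matrix, and the cross-validation scheme are illustrative rather than the published pipeline.

```python
# Minimal sketch of the two quantification approaches (illustrative assumptions,
# not the authors' published code).
import numpy as np
from xgboost import XGBClassifier
from sklearn.model_selection import GroupKFold
from sklearn.metrics import roc_auc_score

def gaze_anticipation_distance(gaze_xy, event_xy, deg_per_px):
    """Mean Euclidean distance (in visual degrees) between gaze samples and the
    event location, accumulated over the pre-event interval."""
    d_px = np.linalg.norm(gaze_xy - np.asarray(event_xy, dtype=float), axis=1)
    return np.nanmean(d_px) * deg_per_px

# Example: 7 s of gaze samples at an assumed 500 Hz, in screen pixels
rng = np.random.default_rng(1)
gaze = rng.normal(loc=[512.0, 384.0], scale=80.0, size=(3500, 2))
gad = gaze_anticipation_distance(gaze, event_xy=(512, 384), deg_per_px=0.03)
print(f"GAD = {gad:.2f} deg")

# Single-trial classification: one row per viewing, with pre-event gaze features
# (e.g., GAD, dwell time near the event, fixation counts); labels mark first (0)
# vs. second (1) viewing; groups hold participant IDs so that test folds contain
# unseen participants. All values below are placeholders.
X = rng.normal(size=(400, 10))
y = rng.integers(0, 2, size=400)
groups = np.repeat(np.arange(20), 20)

aucs = []
for train_idx, test_idx in GroupKFold(n_splits=5).split(X, y, groups):
    clf = XGBClassifier(n_estimators=200, max_depth=3, learning_rate=0.1,
                        eval_metric="logloss")
    clf.fit(X[train_idx], y[train_idx])
    aucs.append(roc_auc_score(y[test_idx], clf.predict_proba(X[test_idx])[:, 1]))
print(f"Mean held-out AUC: {np.mean(aucs):.2f}")
```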
We created MEGA to capture the memory of the 'what, where and when' of the surprising event. But does anticipatory gaze exclusively reflect episodic-like memory? Although the current dataset cannot conclusively establish that it does, several observations suggest that this is the case. First, anticipatory gaze correlated with event recollection, free recall, location recall, and object recall, thus reflecting recollection of the surprising event. Even at the single-trial level, anticipatory gaze correlated with the accuracy of participants' estimates of the event location. Second, the one-shot encoding of detailed long-term memories makes an explanation in terms of a non-declarative memory system unlikely, especially since some cues (scenes) are highly similar across different events. Third, we demonstrate that anticipatory gaze vanishes when only the scene is familiar (control experiment). Statistical learning also seems unlikely, because GAD did not decrease towards the end of the first (encoding) viewing session. Nonetheless, we acknowledge that robust (albeit reduced) anticipatory gaze was also observed for movies that were not consciously recognized. A more definitive answer regarding the selectivity of our task for episodic memory may be provided in time by neuroimaging and/or lesion studies assessing the role of medial temporal lobe systems in MEGA.
What can be gained by estimating memory with MEGA, independent of verbal reports? Foremost, our analytical approach demonstrates sufficient sensitivity to detect memory traces at the single-trial level. Specifically, our metric correlates with the accuracy of recalling the event's location at the movie level. This validates MEGA's potential as a no-report paradigm for episodic-like memories. Furthermore, MEGA provides an additional approach in clinical settings where verbal communication is challenging. For instance, individuals with cognitive impairments, aphasia, or developmental disorders often struggle with complex instructions or with language comprehension and production. Along the same lines, a stroke patient may be unable to speak, yet we may still wish to estimate their memory capacity. MEGA can also enhance consistency in memory research by increasing generalizability across participants who speak different languages. Finally, from a basic science perspective, current research employing reports confounds memory itself with the ability to articulate it. In this context, MEGA can help distill the brain activities and diseases affecting memory per se, beyond its access and report.
Another important advantage of MEGA is that it goes beyond a binary 'recognized' vs. 'non-recognized' report and represents memory as a continuous quantitative variable. The MEGA-score metric is sensitive to various anticipatory behaviors, whether a few prolonged fixations at the event location, multiple fixations around it, or even subtle proximity to the event location. This granularity reveals a linear relation between anticipatory gaze scores and individual precision in reporting the event's location for single movies, suggesting that anticipatory gaze captures more than the recognized/non-recognized distinction. Although aggregation over the pre-event interval proved robust, the temporal dynamics of anticipatory gaze remain inconclusive and warrant further investigation (see Fig. S5 and Supplementary Note 1).
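As a minimal illustration of using the score as a continuous memory readout, the sketch below correlates hypothetical per-movie anticipatory gaze scores with the error of the reported event location; all values are simulated, and the choice of a Spearman rank correlation is an assumption rather than the authors' exact procedure.

```python
# Minimal sketch (simulated data): relating a continuous per-movie anticipatory
# gaze score to the precision of the reported event location.
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
mega_score = rng.uniform(0.0, 0.3, size=24)                          # hypothetical per-movie scores
location_error_deg = 8.0 - 15.0 * mega_score + rng.normal(0, 1, 24)  # hypothetical report error (deg)

# Higher anticipatory gaze scores are expected to accompany smaller localization
# errors, i.e., a negative correlation.
rho, p = stats.spearmanr(mega_score, location_error_deg)
print(f"Spearman rho = {rho:.2f}, p = {p:.3f}")
```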
Limitations
At present, the extent to which MEGA depends on activity in the hippocampus and the medial temporal lobe (MTL) remains unknown. Are its neuronal underpinnings comparable to those associated with episodic memory, and/or with other implicit eye-movement-based memory effects?50,61,62 Additionally, it should be acknowledged that the current results involve a task in which participants were required to report whether the movies had been previously seen, thereby introducing an additional layer of cognitive processing that could influence the natural recall of events. Future studies should assess how anticipatory gaze unfolds in entirely passive viewing conditions without any instructions. The present findings already extend the literature on sleep consolidation, suggesting that the benefits of sleep for memory consolidation extend beyond verbally recalled memories. However, the direct relationship between memory consolidation, as captured by MEGA, and the brain activities supporting this process, such as slow waves and sleep spindles, remains unclear and requires further investigation.
Looking ahead, eye tracking may offer a promising avenue for understanding and diagnosing memory disorders58,63. In clinical settings, our findings in healthy adults hold promise for improving and refining existing non-verbal tasks17,64,65,66 for early diagnosis of memory deficits in mild cognitive impairment (MCI) and for monitoring Alzheimer's disease progression. Standard cognitive and neuropsychological assessments such as the MMSE67 or the MoCA68 are limited in detecting preclinical memory deficits. Beyond neurodegeneration, our approach may provide a new perspective on assessing memory after damage to the MTL69, where it is often unclear to what extent the damage affects mnemonic systems or their interface with other brain systems that enable conscious report. Another clinical application pertains to patients with motor or language disorders that limit verbal report, such as aphasia70,71, offering a non-verbal means of assessing memory integrity.
In conclusion, the MEGA paradigm offers a valuable analytical approach and introduces a significant advance in memory research. Utilizing eye tracking to study memory without verbal reports offers wide implications for both basic research in cognitive neuroscience and clinical fields.
Data availability
Source data underlying the results presented in figures can be found here72: https://osf.io/b64qk/. Additional data will be provided per IRB (institutional review board) guidelines upon request to the corresponding author.
Code availability
The code used to analyze the gaze data in this study is available at https://github.com/dyamin/MEGA and at https://osf.io/b64qk/. Specific code for detailed analyses is available from the authors upon reasonable request. The experiment code is available at https://github.com/dyamin/MEGA-Experiment. The stimuli used in the experiments are publicly available at https://yuvalnirlab.com/resources/.
References
Gardiner, J. M. Functional aspects of recollective experience. Mem. Cognit. 16, 309–313 (1988).
Migo, E. M., Mayes, A. R. & Montaldi, D. Measuring recollection and familiarity: Improving the remember/know procedure. Conscious. Cogn. Int. J. 21, 1435–1455 (2012).
Tulving, E. Memory and consciousness. Can. Psychol. Psychol. Can. 26, 1–12 (1985).
Henke, K. A model for memory systems based on processing modes rather than consciousness. Nat. Rev. Neurosci. 11, 523–532 (2010).
Cohen, N. J. & Eichenbaum, H. Memory, Amnesia, and the Hippocampal System. (MIT Press, 1993).
O’Reilly, R. C., Bhattacharyya, R., Howard, M. D. & Ketz, N. Complementary Learning Systems. Cogn. Sci. 38, 1229–1248 (2014).
Tsuchiya, N., Wilke, M., Frässle, S. & Lamme, V. A. F. No-Report Paradigms: Extracting the True Neural Correlates of Consciousness. Trends Cogn. Sci. 19, 757–770 (2015).
Morris, R. Developments of a water-maze procedure for studying spatial learning in the rat. J. Neurosci. Methods 11, 47–60 (1984).
O’Keefe, J. & Dostrovsky, J. The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. Brain Res 34, 171–175 (1971).
Schroeder, C. E., Wilson, D. A., Radman, T., Scharfman, H. & Lakatos, P. Dynamics of Active Sensing and perceptual selection. Curr. Opin. Neurobiol. 20, 172–176 (2010).
Henderson, J. M. & Hollingworth, A. Eye movements and visual memory: Detecting changes to saccade targets in scenes. Percept. Psychophys. 65, 58–71 (2003).
Hayhoe, M. & Ballard, D. Eye movements in natural behavior. Trends Cogn. Sci. 9, 188–194 (2005).
Kano, F. & Hirata, S. Great Apes Make Anticipatory Looks Based on Long-Term Memory of Single Events. Curr. Biol. 25, 2513–2517 (2015).
Voss, J. L., Bridge, D. J., Cohen, N. J. & Walker, J. A. A Closer Look at the Hippocampus and Memory. Trends Cogn. Sci. https://doi.org/10.1016/j.tics.2017.05.008 (2017).
Chau, V. L., Murphy, E. F., Rosenbaum, R. S., Ryan, J. D. & Hoffman, K. L. A flicker change detection task reveals object-in-scene memory across species. Front. Behav. Neurosci. 5 (2011).
Brockmole, J. R. & Henderson, J. M. Using real-world scenes as contextual cues for search. Vis. Cogn. 13, 99–108 (2006).
Haque, R. U. et al. VisMET: a passive, efficient, and sensitive assessment of visuospatial memory in healthy aging, mild cognitive impairment, and Alzheimer’s disease. Learn. Mem. Cold Spring Harb. N 26, 93–100 (2019).
Boettcher, S. E. P., Shalev, N., Wolfe, J. M. & Nobre, A. C. Right place, right time: Spatiotemporal predictions guide attention in dynamic visual search. J. Exp. Psychol. Gen. 151, 348–362 (2022).
Le-Hoa Võ, M. & Wolfe, J. M. The role of memory for visual search in scenes. Ann. N. Y. Acad. Sci. 1339, 72–81 (2015).
Wong-Kee-You, A. M. B. & Adler, S. A. Anticipatory eye movements and long-term memory in early infancy. Dev. Psychobiol. 58, 841–851 (2016).
Ferreira, F., Apel, J. & Henderson, J. M. Taking a new look at looking at nothing. Trends Cogn. Sci. 12, 405–410 (2008).
Wynn, J. S., Shen, K. & Ryan, J. D. Eye Movements Actively Reinstate Spatiotemporal Mnemonic Content. Vis. Basel Switz. 3, 21 (2019).
Summerfield, J. J., Lepsien, J., Gitelman, D. R., Mesulam, M. M. & Nobre, A. C. Orienting Attention Based on Long-Term Memory Experience. Neuron 49, 905–916 (2006).
Turk-Browne, N. B., Scholl, B. J., Johnson, M. K. & Chun, M. M. Implicit Perceptual Anticipation Triggered by Statistical Learning. J. Neurosci. 30, 11177–11187 (2010).
Johansson, R., Nyström, M., Dewhurst, R. & Johansson, M. Eye-movement replay supports episodic remembering. Proc. Biol. Sci. 289, 20220964 (2022).
Hannula, D. E. et al. Worth a Glance: Using Eye Movements to Investigate the Cognitive Neuroscience of Memory. Front. Hum. Neurosci. 4 (2010).
Ryan, J. D. & Shen, K. The eyes are a window into memory. Curr. Opin. Behav. Sci. 32, 1–6 (2020).
Hannula, D. E. & Ranganath, C. The eyes have it: hippocampal activity predicts expression of memory in eye movements. Neuron 63, 592–599 (2009).
Olsen, R. K. et al. The relationship between eye movements and subsequent recognition: Evidence from individual differences and amnesia. Cortex J. Devoted Study Nerv. Syst. Behav. 85, 182–193 (2016).
Sharot, T., Davidson, M. L., Carson, M. M. & Phelps, E. A. Eye Movements Predict Recollective Experience. PLOS ONE 3, e2884 (2008).
Urgolites, Z. J., Smith, C. N. & Squire, L. R. Eye movements support the link between conscious memory and medial temporal lobe function. Proc. Natl. Acad. Sci. USA. 115, 7599–7604 (2018).
Ryan, J. D., Althoff, R. R., Whitlow, S. & Cohen, N. J. Amnesia is a Deficit in Relational Memory. Psychol. Sci. 11, 454–461 (2000).
Heisz, J. J. & Ryan, J. D. The effects of prior exposure on face processing in younger and older adults. Front. Aging Neurosci. 3, 15 (2011).
Liu, Z.-X., Shen, K., Olsen, R. K. & Ryan, J. D. Visual Sampling Predicts Hippocampal Activity. J. Neurosci. 37, 599–609 (2017).
Cannon, E. N. & Woodward, A. L. Infants generate goal-based action predictions. Dev. Sci. 15, 292–298 (2012).
Nakano, T. & Kitazawa, S. Development of long-term event memory in preverbal infants: an eye-tracking study. Sci. Rep. 7, 44086 (2017).
Leckey, S. et al. Response latencies and eye gaze provide insight on how toddlers gather evidence under uncertainty. Nat. Hum. Behav. 4, 928–936 (2020).
Büchel, P. K., Klingspohr, J., Kehl, M. S. & Staresina, B. P. Brain and eye movement dynamics track the transition from learning to memory-guided action. Curr. Biol. 34, 5054–5061.e4 (2024).
Roth, N., McLaughlin, J., Obermayer, K. & Rolfs, M. Gaze Behavior Reveals Expectations of Potential Scene Changes. Psychol. Sci. 35, 1350–1363 (2024).
Hermann, M. M., Wahlheim, C. N., Alexander, T. R. & Zacks, J. M. The role of prior-event retrieval in encoding changed event features. Mem. Cognit. 49, 1387–1404 (2021).
Wahlheim, C. N., Eisenberg, M. L., Stawarczyk, D. & Zacks, J. M. Understanding Everyday Events: Predictive-Looking Errors Drive Memory Updating. Psychol. Sci. 33, 765–781 (2022).
Kazemi, A., Christopher-Hayes, N., Lee, J., Geng, J. & Ghetti, S. Looking Behavior and Relational Memory: Novel Approaches to Analyze Temporal Dynamics of Eye Movements. Preprint at https://doi.org/10.31234/osf.io/7qjad (2023).
Schmidig, F. J. et al. A visual paired associate learning (vPAL) paradigm to study memory consolidation during sleep. J. Sleep Res. e14151 https://doi.org/10.1111/jsr.14151 (2024).
Peirce, J. et al. PsychoPy2: Experiments in behavior made easy. Behav. Res. Methods 51, 195–203 (2019).
Sharon, O., Fahoum, F. & Nir, Y. Transcutaneous Vagus Nerve Stimulation in Humans Induces Pupil Dilation and Attenuates Alpha Oscillations. J. Neurosci. 41, 320–330 (2021).
Della Porta, G. B. De refractione optices parte: libri novem. (Horatius Salvianus for Joannes Jacobus Carlinus and Antonio Pace, Naples, 1593). https://www.christies.com/en/lot/lot-6069542.
Gronwall, D. M. & Sampson, H. Ocular dominance: A test of two hypotheses. Br. J. Psychol. 62, 175–185 (1971).
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (2016). https://doi.org/10.1145/2939672.2939785.
Lundberg, S. M. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. in Advances in Neural Information Processing Systems vol. 30 (Curran Associates, Inc., 2017).
Kragel, J. E. & Voss, J. L. Looking for the neural basis of memory. Trends Cogn. Sci. 26, 53–65 (2022).
Goldinger, S. D. & Papesh, M. H. Pupil dilation reflects the creation and retrieval of memories. Curr. Dir. Psychol. Sci. 21, 90–95 (2012).
Kucewicz, M. T. et al. Pupil size reflects successful encoding and recall of memory in humans. Sci. Rep. 8, 4949 (2018).
Kafkas, A. & Montaldi, D. Familiarity and recollection produce distinct eye movement, pupil and medial temporal lobe responses when memory strength is matched. Neuropsychologia 50, 3080–3093 (2012).
Papesh, M. H., Goldinger, S. D. & Hout, M. C. Memory strength and specificity revealed by pupillometry. Int. J. Psychophysiol. Off. J. Int. Organ. Psychophysiol. 83, 56–64 (2012).
Võ, M. L.-H. et al. The coupling of emotion and cognition in the eye: introducing the pupil old/new effect. Psychophysiology 45, 130–140 (2008).
Geva-Sagiv, M. et al. Augmenting hippocampal–prefrontal neuronal synchrony during sleep enhances memory consolidation in humans. Nat. Neurosci. 26, 1100–1110 (2023).
Rasch, B. & Born, J. About sleep’s role in memory. Physiol. Rev. 93, 681–766 (2013).
Hanazuka, Y. et al. The Eyes Are More Eloquent Than Words: Anticipatory Looking as an Index of Event Memory in Alzheimer’s Disease. Front. Neurol. 12 (2021).
Stokes, M. G., Atherton, K., Patai, E. Z. & Nobre, A. C. Long-term memory prepares neural activity for perception. Proc. Natl. Acad. Sci. USA. 109, E360–367 (2012).
Kragel, J. E. & Voss, J. L. Temporal context guides visual exploration during scene recognition. J. Exp. Psychol. Gen. 150, 873–889 (2020).
Ryals, A. J., Wang, J. X., Polnaszek, K. L. & Voss, J. L. Hippocampal contribution to implicit configuration memory expressed via eye movements during scene exploration. Hippocampus 25, 1028–1041 (2015).
Bone, M. B. et al. Eye movement reinstatement and neural reactivation during mental imagery. Cereb. Cortex 29, 1075–1089 (2019).
Senju, A., Southgate, V., White, S. & Frith, U. Mindblind eyes: an absence of spontaneous theory of mind in Asperger syndrome. Science 325, 883–885 (2009).
Antón-Méndez, I., Talk, A. & Johnston, S. Gaze direction reveals implicit item and source memory in older adults. PloS One 14, e0226018 (2019).
Wynn, J. S., Buchsbaum, B. R. & Ryan, J. D. Encoding and retrieval eye movements mediate age differences in pattern completion. Cognition 214, 104746 (2021).
Ryan, J. D., Wynn, J. S., Shen, K. & Liu, Z.-X. Aging changes the interactions between the oculomotor and memory systems. Aging Neuropsychol. Cogn. 29, 418–442 (2022).
Folstein, M. F., Folstein, S. E. & McHugh, P. R. Mini-mental state’. A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198 (1975).
Nasreddine, Z. S. et al. The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 53, 695–699 (2005).
Scoville, W. B. & Milner, B. Loss of recent memory after bilateral hippocampal lesions. J. Neurol. Neurosurg. Psychiatry 20, 11–21 (1957).
Robinson, G., Blair, J. & Cipolotti, L. Dynamic aphasia: an inability to select between competing verbal responses? Brain J. Neurol. 121, 77–89 (1998).
van der Meulen, I., van de Sandt-Koenderman, W. M. E., Duivenvoorden, H. J. & Ribbers, G. M. Measuring verbal and non-verbal communication in aphasia: reliability, validity, and sensitivity to change of the Scenario Test. Int. J. Lang. Commun. Disord. 45, 424–435 (2010).
Schmidig, F. & Yamin, D. Anticipatory Eye Gaze as a Marker of Memory [data set]. https://doi.org/10.17605/OSF.IO/B64QK (2025).
Acknowledgements
We extend our deepest gratitude to Dr. Noa Regev for her administrative assistance and unwavering support throughout this project. We are indebted to Yuval Shapira for their initial analyses and for developing a user interface that enabled participants to accurately select event coordinates in natural movies. Special thanks go to Odessa Goldberg, Alexandra Klein, Yael Gat, Rotem Falach, Or Ra’anan, and the additional research assistants for their invaluable help with data collection. We also want to express our heartfelt thanks to the participants of this study, whose involvement was crucial to the success of our research. Additionally, we are grateful to Studio Plonter® for their exceptional work in creating the tailor-made animations that played a pivotal role in our experiments. We are also thankful to Dr. Roy Amit for his advice on eye-tracking data collection and preprocessing in the initial stages of the project. This research was supported by ERC-2019-CoG 864353 (Y.N.). Additionally, F.J.S. was supported by a Tel Aviv University Sagol School of Neuroscience Postdoctoral Fellowship. This research was conducted while O.S. was a Glenn Foundation for Medical Research Postdoctoral Fellow. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
Conception and design of research: D.Y., O.S., F.J.S., and Y.N.; funding acquisition: Y.N. and O.S.; compilation of movie stimuli: O.S. and D.Y.; data collection: D.Y., F.J.S., O.S., and Y.N.; data analysis: D.Y., F.J.S., O.S., and J.N.; manuscript writing: D.Y., F.J.S., and Y.N.; advice on experimental design and data analysis: C.R.; critical review of results and manuscript comments: all authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Psychology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Marike Schiffer. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Schmidig, F.J., Yamin, D., Sharon, O. et al. Anticipatory eye gaze as a marker of memory. Commun Psychol 3, 122 (2025). https://doi.org/10.1038/s44271-025-00305-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s44271-025-00305-7