Cross-cultural analysis of eye-movement patterns in visual scene perception: a comparison of seven cultural samples

Čeněk, Jiří; Halámková, Daniela; Caha, Jan; Lacko, David; Kalenská, Petra; Stachoň, Zdeněk; Tsai, Jie-Li; Ahenkan, Albert; Dresler, Thomas; Lüdtke, Jana; Dostálová, Nicol; Šašinková, Alžběta; Ugwitz, Pavel; Šašinka, Čeněk

doi:10.1038/s41598-025-12724-x

Download PDF

Article
Open access
Published: 05 August 2025

Cross-cultural analysis of eye-movement patterns in visual scene perception: a comparison of seven cultural samples

Jiří Čeněk^1,2,3,
Daniela Halámková³,
Jan Caha^4,5,
David Lacko^2,3,
Petra Kalenská⁶,
Zdeněk Stachoň³,
Jie-Li Tsai^7,8,
Albert Ahenkan⁹,
Thomas Dresler^10,11,12,
Jana Lüdtke¹³,
Nicol Dostálová³,
Alžběta Šašinková³,
Pavel Ugwitz³ &
…
Čeněk Šašinka^2,3

Scientific Reports volume 15, Article number: 28574 (2025) Cite this article

1334 Accesses
Metrics details

Subjects

Abstract

This eye-tracking research investigates cross-cultural similarities and differences in visual attention in complex scenes free-viewing perception. The study utilizes 70 real-world photos with one or two focal objects as stimulus materials. The study examines the amount of time spent on focal objects, saccadic lengths, temporal changes in saccadic lengths and factors that influence these metrics. Data were collected between 2020 and 2022 from seven cultural samples in Africa, East Asia, Europe, and the Near East (N = 408). Contrary to initial hypotheses, the findings challenge the expected order of countries in terms of attention toward objects. Participants from Taiwan, assumed to exhibit holistic patterns, displayed the most holistic viewing pattern. Surprisingly, participants from Germany and Czechia did not significantly differ from those in Taiwan. Furthermore, participants from Ghana and Türkiye, expected to be moderate, showed the most analytic pattern. This challenges preconceived notions and contributes to understanding patterns of scene perception in underrepresented countries. Additional analyses explored the relationship between number and size of focal objects and dwell time, as well as the potential influence of sociodemographic variables, on dwell time.

Exploring cross-cultural variations in visual attention patterns inside and outside national borders using immersive virtual reality

Article Open access 01 November 2023

Distribution and associated factors of keratometry and corneal astigmatism in an elderly population

Article Open access 26 August 2024

Eye tracking based clustering using the Korean version of the reading the mind in the eyes test

Article Open access 31 January 2025

Introduction

Cross-cultural research has yielded valuable insights into the systematic variations in perceptual and cognitive processes across nations and cultural groups. This rich area of research encompasses a diverse range of cognitive processes, such as object categorization, attitudes toward contradictions, object-background separation, selective attention in complex scenes, global-local processing, and sensitivity to visual field changes¹.

Expanding on this foundation, a theory of holistic and analytic cognition has been formulated² which posits the existence of two distinct modes of cognitive processing: holistic and analytic cognition. It suggests that individuals tend to engage predominantly in one of these styles, shaping their cognitive processing. Holistic thinkers, compared to their analytic counterparts, rely more on intuitive strategies rather than rigid rules in object categorization^3,4, embrace dialectical thinking over formal logic, face challenges in object-background differentiation⁵, prioritize the contextual relationships between objects over salient focal objects and attributes⁶, exhibit a preference for global over local features⁷, demonstrate higher sensitivity to contextual changes rather than focal object changes⁸, and lower central fixation bias⁹. Most of the research mentioned above concludes that analytic cognitive mode is more typical for Western countries (USA, Western Europe) and holistic mode of perception is more typical for Asian countries.

Drawing inspiration from this theory, our current research focuses on one specific aspect of visual perception: selective attention to objects and their surrounding context when perceiving complex natural scenes, also referred to as context sensitivity or attention to the field¹⁰. Within this framework, “analytic” cognition has been linked to an object-centred attentional mode: people attend to focal objects, classify them via rule-based categories, and explain events by reference to internal object properties. By contrast, “holistic” cognition emphasises field-wide information; attention is distributed across background elements and inter-object relations, and reasoning is guided by similarity and contextual fit rather than abstract rules². Several previous studies have explored cross-cultural differences in context sensitivity using natural scenes and eye-tracking techniques within the free-viewing paradigm^11,12,13. More specifically, some researchers have discovered distinctive patterns of eye movements between Chinese and North American participants¹¹ and between Chinese and African participants¹², or between Taiwanese and Czech participants⁶. In this context, an analytic pattern empirically manifests as longer dwell time on focal objects and correspondingly shorter saccades within the first seconds of viewing, whereas a holistic pattern is characterised by broader scan paths and more time allocated to the background¹¹.

However, some studies failed to replicate or even contradicted the theory of holistic and analytic cognition. For instance, Rayner et al.¹⁴ recorded eye movements across six information-processing tasks and found no reliable East–West divergence in attention to objects versus backgrounds. Using the original Chua et al.¹¹ materials, Evans et al.¹³ likewise reported null cultural effects between American and Chinese in eye-movements and scene recognition. Furthermore, Rayner et al.¹⁵ showed that American and Chinese observers looked equally quickly at the “weird” components of manipulated scenes. Finally, Miellet et al.¹⁶ masked central vision dynamically during an animal-search task and found identical oculomotor strategies in Western and Eastern participants. Given the contradictory evidence on the systematic cultural differences in perception and cognition, the reseach in this field is far from concluded.

Despite the existing body of research, there are still some gaps that need to be addressed.Most existing eye-tracking studies contrast a single “Western” sample (typically the United States) with a single “East-Asian” sample (Japan, China, or Korea). This dichotomy leaves unanswered whether the purported holistic–analytic split is categorical or graded, and whether it covaries with specific socio-ecological variables such as urbanisation, education system, or relational mobility^17,18,19. We therefore adopt a gradient view: societies positioned closer to the historical core of Western Europe on indices of individualism, “Book religions”, and alphabetic literacy are expected to show the most object-focused (analytic) gaze, whereas societies further away on those dimensions should exhibit progressively more context-focused (holistic) patterns. Taiwan represents the East-Asian pole because its Confucian heritage, logographic writing system, and collectivistic value profile mirror those of the cultures in which holistic gaze patterns were first documented^10,11.

Furthermore, the limited sample diversity and the predominant focus on Western (read Anglo-Saxon, associated with analytic cognition) and Eastern countries (read East-Asian, associated with holistic cognition) when defining the theory of holistic and analytic cognition present obstacles to its potential advancement. To expand the body of knowledge on scene perception beyond the commonly compared countries, we gathered data from three European countries (Germany, Bulgaria, Czechia), two samples from a country on the border between Europe and Asia (Türkiye), an African country (Ghana), and one East-Asian country (Taiwan). The decision to collect two separate samples from Türkiye aimed not only to examine cross-country differences but also to explore within-country differences in the relatively largest and most diverse country in the sample. To achieve this, we deliberately gathered data from participants born and raised in both the Western and Eastern regions of Türkiye.

One of the central questions was how to determine the rank of the countries on the holistic-analytic spectrum prior to delving into the data, especially when faced with a lack of robust empirical evidence to substantiate the formulation of our hypotheses, at least in some of the comparisons (e.g. between Türkiye and Ghana). Based on the previous research, we expected the participants from Germany (a representative of the Western country) to be the most analytic¹¹, followed by participants from Czechia and Bulgaria (post-communist Europe)²⁰. On the other hand, Taiwan is expected to be the most holistic country. Participants from Ghana and Türkiye should fall between these countries¹². Finally, we expected no differences between the participants from Eastern Türkiye compared to their Western counterparts²¹, but decided to test them.

In the current research, we focus on cross-cultural similarities and differences in one of the aspects of visual perception - attention to (focal) objects and background - using complex visual scenes (naturalistic photographs). The main variables of interest (dependent variables) were the amount of time spent on objects (focal object dwell time; calculated as a sum of fixation durations) and saccadic length (calculated in pixels). Dwell time reflects how long a viewer attends to focal objects vs. contextual elements, with cultural differences between object-oriented (analytic) and context-sensitive (holistic) styles. Saccadic length and its temporal changes provides insight into whether a viewer is making smaller, detail-focused movements or larger, more context-focused shifts²². These measures are frequently used in eye-tracking studies of context sensitivity^6,11,21,23. We were also interested in the temporal changes in saccadic lengths during the stimulus presentation, and in factors that might have influenced the object-oriented attention. Therefore, we formulated several assumptions related to differences in eye-movements across cultural samples.

The hypotheses are categorized as major or minor (H_ma, H_mi). First, statistical equivalence of both Türkish samples in the dependent variables (dwell time on focal objects, saccadic length) was tested using a TOST procedure (two one-sided t-tests) proposed by Lakens²⁴. When statistical equivalence is not achieved, merging the samples and analyzing them as one group is not advisable. As already mentioned, we expected the same patterns of eye-movements in both Türkish samples (H_mi1).

Next, we tested for differences in the average dwell time on focal objects (main hypothesis H_ma1) and the average saccadic length (H_ma2). Analytic perceivers should according to the theory attend relatively more to the focal objects, therefore, the average dwell time on objects should be longer. Moreover, analytic perceivers should more prevalently use focal visual scanning patterns that are typical with relatively longer fixations and relatively shorter lengths of saccades in the first few seconds after the stimulus onset²². Therefore, we expected the following order of countries in the average dwell on focal objects: Taiwan < Ghana < Türkiye-East < Türkiye-West < Bulgaria < Czechia < Germany. The opposite pattern was expected in the saccadic length (Taiwan > Ghana > Türkiye-East > Türkiye-West > Bulgaria > Czechia > Germany).

Next, we tried to test several minor hypotheses. Follet et al.²² demonstrated a significant drop in saccadic length after three seconds from the stimulus onset when free-viewing complex visual scenes. This effect should be independent from the content of the scene. We have no specific cross-cultural hypothesis regarding this, but we would expect to replicate the drop in saccadic length in our study (H_mi2).

In the next set of hypotheses, we wanted to explore potential effects of stimulus characteristics, namely (a) the number of focal objects, and (b) the relative size of focal objects, on the average dwell time.The manipulation with the number of focal objects tests whether increasing object load demands additional visual processing: when two independent items are present, viewers must individuate, compare, and encode each one in turn, which should prolong the cumulative time their gaze remains within the object ROIs. We therefore expect that, as the number of focal objects (1 vs. 2) rises (Hmi3) and as their relative size increases (Hmi4), participants across countries will display longer mean dwell time on the focal objects.

The last exploratory hypothesis tries to investigate whether individual sociodemographic factors such as gender, age, or socioeconomic status²⁵ have some kind of effect on dwell time or saccadic length during free-viewing of complex visual scenes (H_mi5) as suggested by some previous research in the field²⁶.

Methods

Research ethics

The Research Ethics Committee of Masaryk University reviewed the application to conduct the research project and has approved this project (Proposal No.: 0257/2018) to be conducted on 13 March 2019. Nature of methods used and their administration was performed in accordance with the relevant guidelines and regulations of Masaryk University and national laws. Informed consent was obtained in writing from all participants.

Use of LLMs

Chat GPT (v. 4O, https://chatgpt.com/) LLM was used for English proofreading of all sections of the text, and for polishing the analytic code.

Materials

Real-world scenes

The stimulus materials consisted of complex scenes - real-world photos. The scenes were obtained from the online databases containing free-to-use for non-commercial purposes license photos (CC BY-NC-SA and more permissive licenses). We set several pre-selection criteria of the photos: (a) the image should contain one or two focal objects, (b) there should be no people or animals present in the image, (c) there should be no components of the image that would reveal the exact location, where the image was taken, present in the picture (no commonly known landmarks, no writing).

To verify that the scenes fit our criteria, we conducted a qualitative evaluation of the scenes. The preselected images were sent to four independent evaluators (members of the research team) from countries involved in this research. The evaluators were given the definitions of a focal object (“The main figure(s) or object(s) in the picture. It is salient and can be in the foreground.”) and background (“The part of a picture that forms a setting for the main figure(s) or object(s), or appears furthest from the viewer.”) and asked to: (a) indicate how many objects they identify in the image, (b) name those objects. Minimum of three out of four evaluators had to agree in the evaluated criteria. Scenes that passed this qualitative evaluation (i.e. they had the same number of objects, same components of the image are perceived as objects) were kept in the test battery, other pictures were discarded.

In total, 98 scenes out of the original 160 were retained after the qualitative evaluation. In the next step, we unified the size of each image and added a black frame occupying 10% of the entire stimulus (see Fig. 1; size without the frame: 922 × 692 px; size with the frame: 1024 × 768 px). This was done due to the lower accuracy of eye-tracking systems in the display periphery²⁷. Subsequently, for each stimulus, we defined the surface occupied by focal objects and computed an index of relative focal object size that was used both for further selection of stimuli and later on in the statistical analysis. In the next step, we ranked the images by the relative size of the focal object (min = 1%, max = 40%). Images with extremely small or large portions occupied by focal object(s) were discarded. The resulting range of relative focal object size was between 5% and 28%. This variable was transformed to z-score. Based on z-scores, the stimuli were placed in one of three categories: stimuli with small (N = 12), medium (N = 43), and large (N = 15) focal objects (cut-off points at z-score − 1 and 1). These 70 stimuli comprised the test stimuli (trials).

Each trial was presented for four seconds and was preceded by a central fixation cross displayed until the moment a participant fixated on it. The participants were instructed to “view a series of pictures” and evaluate “how much they liked each picture” on a scale ranging from 1 to 5 (1–very good, 5–very poor). The task of rating scene likability was included solely to maintain participant engagement and prevent boredom effects, contributing to enhanced attention and motivation during the experimental session^11,12,22.

Sociodemographic questionnaire

Apart from the scenes, sociodemographic data were collected by a questionnaire. Data on sex, age, nationality, marital status, number of siblings, mother tongue, socioeconomic status, and other variables were collected. This research was part of a larger test battery and not all of the variables collected were used in this research. For a full list of variables collected, including their coding, please see Supplement. All materials were translated by two independent translators using a back-translation method. The translation process for each language was overseen by native members of the research team from the respective countries, all of whom were well-acquainted with the research objectives.

Apparatus and procedure

The sociodemographic data were collected using the software Hypothesis^28,29 running on Google Chrome web browser between February 2020 and April 2022. The eye-tracking data were collected using SMI RED 250 mobile in all countries except for Taiwan. In Taiwan, an EyeLink 1000 was used. The sampling frequency was set to 250 Hz with 9-point calibration. The desired accuracy threshold of calibration was set to 1° of visual angle. The selection of eye-tracking systems was guided by their specifications and performance parameters to guarantee methodological comparability of the recorded data. The SMI RED 250 mobile is specified at 0.4° accuracy and 0.03° RMS precision by the manufacturer³⁰, whereas the EyeLink 1000 is specified at ≤ 0.15° (typical 0.25–0.50°) accuracy and 0.01° RMS precision³¹. Independent benchmark studies report mean empirical accuracies of ~ 0.5–0.9° for the SMI RED 250 under 9-point calibration³² and ~ 0.6° for the EyeLink 1000 in a 49-target fixation grid³³. These values confirm that both systems meet the ≤ 1° criterion recommended for dispersion-based scene-perception research³⁴. Both systems can be considered relatively accurate and precise, and also comparable in both parameters.

However, we conducted the task even with participants with lower accuracy to make the entire experimental procedure constant for all participants (the same order of tests in the test battery). In all countries, stimulus material was presented on 22” LCD monitors set to 60 Hz refresh rate and 1024 × 768 resolution). Participants viewed the stimuli using chin-rest positioned approximately 60 cm away from the monitor. The laboratory setup, data collection, as well as all the subsequent phases of data cleaning and analysis adhere to the guidelines for eye-tracking studies by Dunn et al.³⁵.

Data processing and cleaning

Sampling plan

The sampling plan and participant exclusion criteria for this study were pre-registered (see Data availability section). As part of a larger test battery, we collected data from university students across 7 countries (Ghana, Taiwan, Armenia, Bulgaria, Türkiye, Czechia, and Germany) in 8 independent samples (for contrast, Türkish participants born in Eastern and Western regions of the country were collected). Sample size was determined based on a priori power analysis with specifying small-to-medium effect size (Cohens’ f = 0.18). Using common settings (α = 0.05, β = 0.20) in G*Power (v3.1.9.7)³⁶ for One-Way ANOVA with 8 groups resulted in total 456 participants, i.e. 57 participants per a group (Noncentrality parameter λ = 14.77, Critical F = 2.03, Numerator df = 7, Denominator df = 448). Counting with a certain data loss that is quite common in eye-tracking studies we aimed to collect 60–70 participants per cultural group. Due to the low data quality of a sample collected in Armenia, the number of participants would drop far below the desired number (to only 15 participants out of 58 collected). All the data cleaning and sample information below are therefore reported without the Armenian sample (deviation from the pre-registration described in the Supplement).

Due to the fact that the number of groups changed, and the final number of participants was below the planned number, we calculated a post-hoc sensitivity analysis for the statistical tests that were used in this study. ANOVa (α = 0.05, β = 0.20) for 7 groups and 408 participants should be able to detect small-to-medium effect (2 = 0.03). Post-hoc t-tests with the same settings should detect Hedges’s medium to large effect (g = 0.59)³⁷. TOST procedure²⁴ used for the tests of equivalence between both Türkish samples was able to detect medium effect (d = 0.40).

Data processing—raw data

Since data collection was conducted using multiple eye-tracking systems and due to the incompatibility of their software (different native fixation detection algorithms, impossibility to import data gathered with one eye-tracker into another), we employed a meticulous data processing procedure described below. This procedure should eliminate or at least minimize any method-related artifacts. Additionally, to assure good data quality, several data quality checks were performed and data not meeting a sufficient quality were discarded from the further analysis. For a full list of discarded participants and reasons for their removal see the Supplement.

In the first step, data were exported from SMI BeGaze and SR Research Data Viewer. Also, the first phase of data cleaning was performed. We discarded all participants with incomplete administration, low accuracy, administration with incorrect sampling frequency, or impossibly short experiment duration. Thirty-two participants were discarded in this phase. Subsequently, we prepared the raw data to be imported to Ogama software (v5.0.1)³⁸. This included mainly changing file names, relabeling of trial names and IDs, removal of duplicate timestamps, relabeling of participant IDs, and reordering columns. The R-codes for these transformations are available in the OSF repository.

Data processing—Ogama

In Ogama, the ROIs were drawn for each test trial (Fig. 2). ROIs are defined around the contours of the focal objects. Both the picture slides and the ROI coordinates exported from Ogama are available at the OSF repository. Subsequently to the definition of ROIs, the eye-movement metrics were calculated. Ogama uses an I-DT dispersion-type algorithm for fixation detection³⁹. We used the settings recommended for 250 Hz data by Popelka⁴⁰: maximum distance of 15 px, minimum number of 20 samples, size of 31 px for the fixation detection ring, automated elimination of first fixation, and no merging of consecutive fixations. The following metrics were exported for each participant and trial: trial duration, data loss, fixation count (trial and each ROI), fixation duration (dtto), and saccade distance (dtto). Ogama exports saccade length in screen-pixel units, and we recalculated the saccade length to more commonly used degrees of visual angle. All sites used identical 22′′ monitors (1024 × 768) viewed from a chin-rest–fixed 60 cm distance. Under this geometry 1 pixel ≈ 0.042° of visual angle, so each pixel value was multiplied by 0.042 to yield saccade length in degrees.

Data processing—quality check after fixation calculation

After the ROI definition, fixation calculation, and export of eye-movement metrics we conducted a second phase of data cleaning (pre-registered). Trials were excluded if they met the following conditions: (1) the percentage of data loss was greater than 20%, which equated to more than 800 ms of lost data, based on the guidelines proposed by Komogortsev et al.⁴¹; or (2) the number of fixations was less than five, as recommended by Follet et al.²². The average number of trials excluded in this fashion varied between 4.66% in Czechia and 15.90% in Ghana. Furthermore, if a participant had more than 20% (i.e., 15 or more) of trials excluded based on these criteria, they were excluded from subsequent analyses to ensure data integrity. Fourty additional participants were deleted in this phase. The number of missing (deleted) trials after the removal of participants was 4.37% and the mean data loss as exported directly from Ogama was 3.05% across all participants with very similar values in all countries (minimum of 2.71% in Türkiye-West and maximum of 3.82% in Taiwan). Both numbers indicate high data quality after the data cleaning procedure. As an additional quality control, we also computed a median of intersample angular distances precision^34,42,43 for every trial, fixation, and participant, and compared these across countries using a mixed-effects model (see the Supplement). The S2S seems to be acceptable to make meaningful comparisons across cultural samples⁴⁴.

Tests of assumptions and equivalence

Assumptions of ANOVA (normality, homoscedasticity, sphericity) were tested before each of the presented analyses. In the case of violation of these assumptions, a robust variant of the statistical test was calculated (Welch’s ANOVA, bootstrap heteroscedastic ANOVA for trimmed means, Games-Howell post-hoc). Results of assumptions testing can be found in the Supplement. To test for equivalence of both Türkish samples, we used TOST)²⁴ with medium equivalence bounds of d = 0.40. The procedure allows - unlike null hypothesis significance testing - to interpret evidence in favor of null hypotheses (in our case, in favor of equivalence of the two cultural groups).

Data were analyzed using R, version 4.1.2⁴⁵ and the packages afex, effsize⁴⁶, ggforce, ggdist⁴⁷, gghalves, lme4⁴⁸, lsr, mclust⁴⁹, performance⁵⁰, psych⁵¹, rstatix, TOSTER⁵², and WRS2⁵³.

Research sample

After the data cleaning procedure, 408 cases were retained in the research sample (Table 1). The minimum size of a subsample is 42 (Ghana and Türkiye West), the maximum is 92 (Czechia). The percentage of females varies between 32 in Türkiye East to 66 in Taiwan. Regarding the age of participants, the mean age of participants varies around 21–22 years, with the exception of Germany that is represented by marginally older participants (25 years average). The total sample size is slightly below the desired sample size (see the Sampling plan section), which is primarily caused by lower number of participants retained in subsamples from Ghana and Türkiye West).

Table 1 Sample characteristics.

Full size table

Results

Equivalence of Türkish samples

To be able to determine, whether or not both Türkish samples can be considered equivalent (H_mi1), we performed the TOST with set d = 0.40 bounds²⁴. If found equivalent, the samples could be grouped and analyzed together (see the Supplement for details on the equivalence tests). Tests were computed for both main dependent variables, the average dwell on focal objects and the saccadic length. Regarding the mean dwell time on focal objects, there was no significant difference between the two Türkish samples, t(79.8) = 0.93, p = .358, mean difference = 56.52; Hedges’s g = 0.19, CI_95% [− 0.15, 0.53]. However, the TOST suggests that the groups are not equivalent; t(79.8) = 0.92, p = .82. Regarding the mean saccadic length, no significant difference was found between the two Türkish samples, t(95.6) = − 1.69, p = .094, mean difference = − 9.71; Hedges’s g = − 0.34, CI_95% [− 0.66, − 0.00] and the samples were non-equivalent, t(95.6) = − 1.60, p = .950.

In summary, the testing of sample equivalence in the two dependent variables shows no evidence of equivalence of neither the samples from Türkiye. This means that the grouping should not be performed and the samples will be further compared individually.

Dwell time on focal objects

One-way ANOVA with cultural group as an independent variable and average dwell on focal objects as a dependent variable was conducted to identify potential differences in dwell across cultural groups (H_ma1; Fig. 3). Effect of culture was significant, F(6, 401) = 12.77, p < .001, with large effect size ω² = 0.15, CI_95% [0.09, 1.00]. The mean focal object dwell time varied between 1,895 ms (SD = 304, SE = 46) in Ghana and 1,528 ms (SD = 281, SE = 34) in Taiwan. When inspected in the direction from longest to shortest average dwell, the first significant difference can be observed between Ghana and Czechia (mean difference = 247 ms, p < .001; Hedges’s g = − 0.92, CI_95% [− 1.30, − 0.54]). Inspecting the differences in the opposite direction, the first significant difference can be found between Taiwan and Bulgaria (mean difference = 205 ms, p = .001; Hedges’s g = 0.75, CI_95% [0.38, 1.11]). Full descriptives and all post-hoc tests are reported in the Supplement.

To further explore patterns of attention to object vs. attention to the background, we performed a series of t-tests on the size of differences between dwell time on objects and background in each of the cultural samples (Table 2). Each pairwise comparison is statistically significant. Yet, in the case of Taiwan, the effect size is notably lower (medium) compared to the other subsamples (high).

Table 2 Differences in dwell time on focal object (FO) and background (BG) by culture (in ms).

Full size table

Saccadic length

To analyze potential cross-cultural variations in saccadic length (H_ma2; Fig. 4), we conducted a Welsch one-way ANOVA. The effect of culture was significant, F(6, 163.2) = 7.43, p < .001, with large effect size ω² = 0.18, CI_95% [0.08, 1.00]. The mean saccadic length varied between 7.06° (SD = 1.09, SE = 0.17) in Ghana and 8.31° in Taiwan (SD = 1.45, SE = 0.18). When inspected in the direction from shortest to longest mean saccadic length, the first significant difference can be found between Ghana and Bulgaria (mean difference = 0.75°, p = .001; Hedges’s g = 0.81, CI_95% [0.43, 1.18]). When inspecting in the opposite direction, the first significant difference can be found between participants from Taiwan and Türkiye East (mean difference = 0.72°, p = .001; Hedges’s g = 0.44, CI_95% [0.44, 1.17]). Full descriptives and all post-hoc tests are reported in the Supplement.

Temporal saccadic length changes

To replicate the findings of Follet et al.²² that, using almost identical research design, identified a drop in saccadic length three seconds after the stimulus onset, we calculated mean saccadic lengths for each of four seconds of stimulus presentation (H_mi2). At the aggregate level, the mean saccadic length did not show the expected pattern, but fluctuated from 7.44° (SD = 1.48) in Second 1, through 7.86° (SD = 1.50) in Second 2 and 7.61° in Second 3 (SD = 1.60), to 8.07° (SD = 1.56) in Second 4. The results of robust (bootstrapped, with 0.2 trimmed means) repeated measures ANOVA showed a significant main effect of time interval after stimulus onset F(3, 404) = 18.03, p < .001, ξ² = 0.22. All differences between individual levels were significant at alpha 0.05, except of saccadic length in Second 1 and 3, p = .140, Hedges’s g = − 0.18, CI_95% [− 0.28, − 0.09] and in Second 2 and 4, p = .060, Hedges’s g = − 0.25, CI_95% [− 0.35, − 0.15]. For full results see the Supplement. This pattern of fluctuations in saccadic length was similar at the level of each individual cultural group (Fig. 5).

Effects of stimulus characteristics on dwell time

In the next step, we explored cross-cultural differences in dwell time on focal objects based on the characteristics of stimuli. More specifically, we were interested in the effects of number of focal objects and their relative size (percentage of stimulus occupied by focal object).

Effect of number of focal objects

Overall, the focal object dwell time was higher on stimuli with one focal object (H_mi3); mean = 1694 ms, SD = 305) compared to stimuli with two focal objects (mean = 1628, SD = 333). The difference was significant, t(407) = 6.14, p < .001; Hedges’s g = 0.30, CI_95% [0.20, 0.40]. At the cultural level (Table 3), the differences were significant in the case of Taiwan and both Türkish samples that exhibited longer dwell time on objects in the case of stimuli with one focal object compared to stimuli with two objects.

Table 3 Focal object dwell time by culture - stimuli with one and two focal objects (in ms).

Full size table

Effect of relative size of focal object

Furthermore, we conducted an analysis of effects of relative focal object sizes on dwell time (H_mi4). Across all samples, there is an apparent trend that the dwell time on objects increases together with their relative size. Focal object dwell time in trials with small objects (mean = 1523, SD = 316) was significantly shorter than in trials with medium (mean = 1686, SD = 309), t(407) = − 15.33, p < .001, Hedges’s g = − 0.76, CI_95% [− 0.87, − 0.65], and large objects (mean = 1800, SD = 350), t(407) = − 21.27, p < .001, Hedges’s g = − 1.05, CI_95% [− 1.17, − 0.93]. The differences between trials with medium and large objects were also significant, t(407) = − 12.13, p < .001, Hedges’s g = − 0.60, CI_95% [− 0.70, − 0.49]. The same pattern is observable at the level of individual countries (see Supplement for all descriptives and t-tests).

Effects of sociodemographic variables

To determine whether the inclusion of sociodemographic variables improves the model fit (H_mi5), we included gender, age, and socioeconomic status as covariates of cultural group, and dwell time on focal objects, resp. saccadic length as dependent variables. Subsequently, we performed likelihood ratio tests of the nested regression models. The inclusion of covariates did not improve the model neither in the case of dwell, χ²(5) = 379,530, p = .419, nor in the case of saccadic length χ²(5) = 4059.4, p = .372, none of the covariates was significant. Corresponding AIC and BIC values of the reported models are in the Supplement.

Discussion

In this paper, we present results of possibly the largest cross-cultural study on perception of complex visual scenes, in both the number of cultural samples and overall sample size. Our findings aim to expand the body of research on holistic and analytic cognition beyond the WEIRD samples. In line with the recent findings by Uskul et al.⁵⁴, our findings contradict the idea that while the “Western” samples are characteristic with analytic perception while the “Eastern” should perceive holistically. Much more complicated patterns of cultural differences emerge when we focus on characteristics of eye-movements such as dwell time on focal objects and background and saccadic length. Our results, therefore, support the idea that a much more differentiated approach to research on cross-cultural specifics of perception and cognition is needed. The insights gained from this study might enable researchers to formulate more refined and context-specific hypotheses in future investigations, thereby advancing the field’s knowledge base.

Most notably, the main hypotheses (H_ma1, H_ma2) that assumed a specific order of the countries regarding the attention to objects^11,12 were not fully confirmed in the presented study. Eventough, participants from Taiwan exhibited the most holistic pattern of scene viewing having the shortest average dwell time on focal objects and the longest average saccadic length suggesting relatively highest proportion of ambient processing. On the other hand, and quite surprisingly, participants from Germany and Czechia that were assumed to be the most analytic (longest dwell on objects and shortest saccades), were in fact not significantly different from the participants from Taiwan. Moreover, participants from Ghana and Türkiye that were hypothesized to be somehow moderate in both main metrics were in fact the most analytic with the highest focus on focal objects and the shortest saccades. While not aligning with our initial hypotheses, this finding represents a substantial contribution to the understanding of context sensitivity in scene perception in underrepresented countries, including Ghana, Bulgaria, and Türkiye. Upon examining the relative dwell time allocated to focal objects and the background, a clear conclusion emerges: participants consistently devoted significantly more time to observing the objects compared to countries like Taiwan, Czechia or Germany. Notably, the Taiwan sample exhibited distinct characteristics compared to the other samples, as it displayed comparatively smaller differences in dwell times between objects and background. This finding suggests the presence of holistic patterns in the scanning of complex scenes within the Taiwan sample.

In exploratory analysis of saccadic length changes (H_mi2) we investigated whether the scanning patterns of participants from individual countries develop during the time from stimulus onset. Follet et al.²² observed a significant drop in saccadic length after three seconds from the start of the trial. This change in saccadic length suggests a switch from the initial ambient scanning of the scene to the object oriented (focal) processing. Their findings were not replicated in the current study. Moreover, we had initially assumed that the decrease in saccadic length would vary across cultural groups studied, with variations in the timing of the drop following stimulus onset. We expected that in countries with a holistic mindset, the decrease would happen later or be less prominent. However, when examining the average dwell time on focal objects and the average saccadic length as key indicators of holistic and analytic perception in this study (i.e. Ghana representing the most analytic and Taiwan the most holistic), we did not observe any temporal changes in saccadic length that aligned with these expectations. Although it should be noted that such patterns might emerge in some participants if analyzed at the individual level.

Next, we explored the relationship between characterics of the scenes - number of focal objects (H_mi3) and relative size focal objects (H_mi4) and the dwell time spent on focal objects. Interestingly, participants from Taiwan and both Türkish samples spent significantly more time on objects in stimuli with one focal focal object compared to stimuli with two focal objects. It was not surprising that the dwell time on focal objects was rising together with their relative size, as the size of objects on the retina (visual saliency) is one of the factors (apart from other factors such as their centrality or semantic saliency) that draw visual attention⁵⁵.

The final analysis aimed to explore the potential effects of participant age, gender, and socioeconomic status on the dwell time allocated to focal objects (H_mi5). However, none of these variables demonstrated a significant impact on the observed dwell time.

While this research has contributed new insights into cross cultural similarities and differences in context sensitivity, it is important to acknowledge its limitations. Although we were able to collect a substantial cross-cultural dataset on scene perception, the final number of participants after data cleaning (N = 408, divided into seven groups) fell slightly below the target sample size determined through power analysis (N = 456, calculated for eight groups). Additionally, a sensitivity analysis revealed that we were able to detect small to medium effects in the case of ANOVA, medium to large effects in the case of post hoc tests, and medium effect in TOST. As a result, there is a possibility of an increased risk of Type II errors, which means we may have been less likely to detect small true effects⁵³. Future research with a similar number of coutries and with ambition to detect small effects should take this into account and increase sample size.

As mentioned above, the research on holistic and analytic perception commonly compares typical “Western” (North America, Western Europe) and typical “Eastern” (East Asia) countries. Despite our research sample covering a relatively diverse set of countries from Europe, the Near East, Africa, and East Asia, the “Western” end of the spectrum is missing as a potential benchmark for an analytic country. This omission was due to the “higher power” intervention of COVID-19 and the associated restrictions. Moreover, it should be noted that our sample consists solely of university students and the results should not be generalized to the broader populations of the studied countries. It is crucial to recognize that our participants are primarily from larger cities and represent individuals with relatively higher levels of education and wealth within their respective cultures. The future research should therefore focus on deeper exploration of within-country variations of scanning patterns during scene perception, especially in the relatively understudied countries.

Apart from possible data collections in other countries, cultures, or subpopulations, the future research in the field should focus on the following issues. The field of cross-cultural research of perception and cognition suffers from its inability to find individual- and cultural-level predictors of the observed differences. Despite the current attempts to link the differences in perception with psychological constructs such as interdependent and independent social orientation⁵⁶, social class²⁶, or means of sustenance⁵⁷, their results found no strong links or are limited to specific cultural contexts or have not been yet replicated elsewhere.

The progress is, in this respect, also hindered by psychometric and methodological issues connected with self- report scales that are widely used for measurement of the said psychological constructs. Specifically, the research on individualism/collectivism, resp. independence/interdependence encounters issues with adaptability of the self-report scales across cultures, more specifically with measurement invariance across cultures^58,59. Therefore, part of the problem is a lack of reliable scales that could be used as predictors for less common cultural samples. For example Vignoles et al.⁶⁰ attempted to validate a self-construal scale across 33 nations. Although this research is generally very impressive, the authors report only “marginally acceptable fit on all indices” which actually translates as a poor fit by the standards in the SEM research field⁶¹.

Besides these broader issues, future research should focus on cross-cultural variations of eye-movement patterns under different experimental tasks and designs. As demonstrated by Yarbus⁶² the nature of the instructions can solely influence the eye-movement patterns of participants. The obtained results were based on participants being instructed to view the pictures and assess their subjective liking. However, it is important to note that different instructions given to participants may lead to varying scanning patterns during scene perception. For example, Senzaki et al.⁶³ observed that cultural differences were strongly mitigated when participants merely observed animated vignettes compared to condition when they were to describe the content of the scene. Similar observation was made by Köster and Kärtner⁶⁴, who argue that context-sensitivity is socialized via a verbal route. Therefore, it would be valuable to replicate this research using alternative instructions or even different task paradigms, such as visual search, change detection, or moving object tracking. A further limitation is that we did not collect post-trial ratings of image familiarity or semantic clarity; cross-cultural differences in how easily a scene is recognised or interpreted could modulate the gaze patterns we report. Future studies should obtain such ratings to disentangle perceptual style from stimulus familiarity.

The timing of stimulus presentation is an additional factor that may have influenced the observed results. It is plausible that the duration of the trials could have impacted eye-movement patterns, such as attention to objects or saccadic lengths, leading to potential differences across the studied samples. For instance, it is possible that the transition from ambient to focal processing, which is associated with shortened saccadic lengths as described by Follet²², might occur at a later time following stimulus onset or vary in a culturally dependent manner. Moreover, longer presentation times would allow the calculation of eye-tracking metrics that are not illustrative in this research design such as transitions between various ROIs.

It is important to note that not all potentially plausible analyses were conducted or reported in this study. Exploring low-level characteristics of the scenes, such as centrality⁶⁵, visual saliency⁶⁶, or utilizing classification algorithms to assess background complexity, would be valuable additions to further enhance the understanding of the phenomenon. Moreover, relatively new and interesting methods of eye-movement analysis are currently available. Studies based on these methods employ a different approach than the analysis of fixations and saccades present in the current paper. For example Hsiao et al.⁶⁷ developed a statistical method of individual-level comparison of eye-movement patterns based on combination of hidden Markov models and co-clustering that allows for a concurrent clustering of participants in both temporal (position of fixations) and spatial (order of fixations) dimension of eye-movements. Another option might be using at the trial-level the Gini coefficient of dispersion⁶⁸.

Although we harmonised all acquisition parameters and showed that key data-quality metrics (data loss, maximum calibration accuracy threshold) did not differ by site, unmeasured device-specific artefacts can never be ruled out completely³⁴. Importantly, our dependent variables - dwell time and saccade length - are relatively insensitive to the spatial errors below 1°.

Data availability

Preregistration, data, analysis code, and other research materials are available at OSF: (https://osf.io/73arx/). Study design, sampling, and analysis plan of this study were pre-registered at OSF before conducting raw data cleaning and analysis. Due to some relevant critique, pre-registration was updated after peer review in previous submission. Deviations from the pre-registration are reported in the Supplement. Analyses reported here should be therefore considered as “exploratory”.

References

Nisbett, R. E. & Masuda, T. Culture and point of view. Proc. Natl. Acad. Sci. USA. 100, 11163–11170 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Nisbett, R. E., Peng, K., Choi, I. & Norenzayan, A. Culture and systems of thought: Holistic versus analytic cognition. Psychol. Rev. 108, 291–310 (2001).
Article CAS PubMed Google Scholar
Chiu, L. H. A cross-cultural comparison of cognitive styles in Chinese and American children. Int. J. Psychol. 7, 235–242 (1972).
Article Google Scholar
Norenzayan, A., Smith, E. E., Kim, B. J. & Nisbett, R. E. Cultural preferences for formal versus intuitive reasoning. Cogn. Sci. 26, 653–684 (2002).
Article Google Scholar
Kühnen, U. et al. Cross-cultural variations in identifying embedded figures: Comparisons from the united states, germany, russia, and Malaysia. J. Cross-Cult Psychol. 32, 366–372 (2001).
Article Google Scholar
Čeněk, J. et al. Cultural variations in global and local attention and eye-movement patterns during the perception of complex visual scenes: Comparison of Czech and Taiwanese university students. PLoS One. 15, 1–22 (2020).
Article Google Scholar
McKone, E. et al. Asia has the global advantage: Race and visual attention. Vis. Res. 50, 1540–1549 (2010).
Article PubMed Google Scholar
Masuda, T. & Nisbett, R. E. Culture and change blindness. Cogn. Sci. 30, 381–399 (2006).
Article PubMed Google Scholar
Boduroglu, A. et al. Cultural differences in allocation of attention in visual information processing. J. Cross-Cult Psychol. 40, 349–360 (2009).
Article PubMed PubMed Central Google Scholar
Masuda, T. & Nisbett, R. E. Attending holistically versus analytically: Comparing the context sensitivity of Japanese and Americans. J. Pers. Soc. Psychol. 81, 922 (2001).
Article CAS PubMed Google Scholar
Chua, H. F., Boland, J. E. & Nisbett, R. E. Cultural variation in eye movements during scene perception. Proc. Natl Acad. Sci. USA 102, 12629–12633 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Duan, Z. et al. Culture shapes how we look: Comparison between Chinese and African University students. J. Eye Move Res. 9, 1–10 (2016).
Google Scholar
Evans, K. et al. Scene perception and memory revealed by eye movements and receiver-operating characteristic analyses: Does a cultural difference truly exist? Q. J. Exp. Psychol. 62, 276–285 (2009).
Article Google Scholar
Rayner et al. Eye movements during information processing tasks: Individual differences and cultural effects. Vis. Res. 47, 2714–2726 (2007).
Article ADS PubMed Google Scholar
Rayner et al. Eye movements when looking at unusual/weird scenes: Are there cultural differences? J. Exp. Psychol.: Learn. Mem. Cogn. 35, 254–259 (2009).
Article PubMed PubMed Central Google Scholar
Miellet et al. Investigating cultural diversity for extrafoveal information use in visual scenes. J. Vis. 10, 21–21 (2010).
Article PubMed Google Scholar
Masuda, T., Batdorj, B. & Senzaki, S. Culture and attention: Future directions to expand research beyond the geographical regions of WEIRD cultures. Front. Psychol. 11, 1394 (2020).
Article PubMed PubMed Central Google Scholar
Henrich, J. et al. Most people are not WEIRD. Nature 466, 29 (2010).
Article ADS CAS PubMed Google Scholar
Muthukrishna, M. et al. Beyond western, educated, industrial, rich, and democratic (WEIRD) psychology: Measuring and mapping scales of cultural and psychological distance. Psychol. Sci. 31, 678–701 (2020).
Article PubMed Google Scholar
Varnum, M. et al. Holism in a European cultural context: Differences in cognitive style between central and East Europeans and Westerners. J. Cogn. Cult. 8, 321–333 (2008).
Article Google Scholar
Šašinková, A. et al. Exploring cross-cultural variations in visual attention patterns inside and outside National borders using immersive virtual reality. Sci. Rep. 13, 18852 (2023).
Article ADS PubMed PubMed Central Google Scholar
Follet, B. et al. New insights into ambient and focal visual fixations using an automatic classification algorithm. i-Perception 2, 592–610 (2011).
Article PubMed PubMed Central Google Scholar
Boland, J. E. et al. How we see it: Culturally different eye movement patterns over visual scenes. In Cognitive and Cultural Influences on Eye Movements 363–378 (Routledge, 2023).
Chapter Google Scholar
Lakens, D. Equivalence tests: A practical primer for t-tests, correlations, and meta-analyses. Soc. Psychol. Pers. Sci. 8, 355–362 (2017).
Article Google Scholar
Zhang, H. et al. High-status people are more individualistic and analytic-thinking in the West and wheat-farming areas, but not rice-farming areas. Eur. J. Soc. Psychol. 51, 878–895 (2021).
Article Google Scholar
Grossmann, I. & Varnum, M. E. Social class, culture, and cognition. Soc. Psychol. Pers. Sci. 2, 81–89 (2011).
Article Google Scholar
Holmqvist, K. Common predictors of accuracy, precision and data loss in 12 eye-trackers. In The 7th Scandinavian Workshop on Eye Tracking 1–25 (2017).
Šašinka, Č. et al. The hypothesis platform: An online tool for experimental research into work with maps and behavior in electronic environments. ISPRS Int. J. Geo-Inf. 6, 407 (2017).
Article Google Scholar
Popelka, S. et al. EyeTribe tracker data accuracy evaluation and its interconnection with hypothesis software for cartographic purposes. Comput. Intell. Neurosci. 2016, 9172506 (2016).
Article PubMed PubMed Central Google Scholar
SensoMotoric & Instruments SMI RED 250 Mobile: High-performance Remote Eye Tracking on your Laptop [Brochure] (SensoMotoric Instruments GmbH, 2014).
SR Research. EyeLink 1000 Plus Eye Tracker: Hardware Specifications [Technical Specification Sheet] (SR Research Ltd., 2023).
Blignaut, P. et al. Improving the accuracy of video-based eye tracking in real time through adaptive post-calibration regression. Beh Res. Methods. 46, 920–933 (2014).
Google Scholar
Ehinger, B. V. et al. A new comprehensive eye-tracker accuracy measurement method: The effect of head pose, calibration, and data filtering on accuracy. PeerJ 7, e7086 (2019).
Article PubMed PubMed Central Google Scholar
Holmqvist, K. et al. Eye tracker data quality: What it is and how to measure it. In Proceedings of the symposium on eye tracking research and applications 45–52. (2012).
Dunn, M. J. et al. Minimal reporting guideline for research involving eye tracking. Behav. Res. Methods 56, 4351 (2023).
Article PubMed PubMed Central Google Scholar
Faul, F. et al. G* power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods. 39, 175–191 (2007).
Article PubMed Google Scholar
Cohen, J. Statistical Power Analysis for the Behavioral Sciences (Routledge, 1988).
Google Scholar
Voßkühler, A. et al. OGAMA (Open gaze and mouse Analyzer): Opensource software designed to analyze eye and mouse movements in slideshow study designs. Behav. Res. Methods. 40, 1150–1162 (2008).
Article PubMed Google Scholar
Salvucci, D. D. & Goldberg, J. H. Identifying fixations and saccades in eye-tracking protocols. In Proceedings of the 2000 symposium on Eye tracking research & applications 71–78 (ACM Press, 2000).
Popelka, S. Eye-tracking (nejen) v kognitivní kartografii: Praktický průvodce tvorbou a vyhodnocením experimentu. [Eye-tracking (not only) in cognitive cartography: A practical guide for creation and analysis of an experiment]. UPOL (2018).
Komogortsev, O. V. et al. Standardization of automated analyses of oculomotor fixation and saccadic behaviors. IEEE Trans. Biomed. Eng. 57, 2635–2645 (2010).
Article Google Scholar
Chaudhary, A. K. & Pelz, J. B. Πt -Enhancing the precision of eye tracking using iris feature motion vectors. Preprint at https://arxiv.org/abs/2009.09348 (2020).
Ehinger, B. V. et al. A new comprehensive eye-tracking test battery concurrently evaluating the pupil labs glasses and the eyelink 1000. PeerJ 7, e7086 (2019).
Article PubMed PubMed Central Google Scholar
Blignaut, P. & Wium, D. Eye-tracking data quality as affected by ethnicity and experimental design. Behav. Res. Methods. 46, 67–80 (2014).
Article PubMed Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2021).
Torchiano, M. effsize: Efficient effect size computation. Zenodo (2020).
Kay, M. ggdist: Visualizations of distributions and uncertainty. Zenodo (2022).
Bates, D. et al. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Article ADS Google Scholar
Scrucca, L. et al. Mclust 5: Clustering, classification and density estimation using Gaussian finite mixture models. R J. 8, 289–317 (2016).
Article PubMed PubMed Central Google Scholar
Lüdecke, D. et al. Performance: An R package for assessment, comparison and testing of statistical models. J. Open. Source Softw. 6, 3139 (2021).
Article ADS Google Scholar
Revelle, W. Psych: Procedures for Psychological, Psychometric, and Personality Research (Northwestern University, 2022).
Caldwell, A. R. Exploring equivalence testing with the updated TOSTER R package. PsyArXiv (2022).
Mair, P. & Wilcox, R. Robust statistical methods in R using the WRS2 package. Behav. Res. Methods. 52, 464–488 (2020).
Article PubMed Google Scholar
Uskul, A. K. et al. Neither Eastern nor western: Patterns of independence and interdependence in mediterranean societies. J. Pers. Soc. Psychol. 125, 471 (2023).
Article PubMed Google Scholar
Wu, C. C. et al. Guidance of visual attention by semantic information in real-world scenes. Front. Psychol. 5, 54 (2014).
Article PubMed PubMed Central Google Scholar
Kitayama, S. & Imada, T. Implicit independence and interdependence. In Mind Context, 174–200 (2010).
Dong, X. et al. Teens in rice county are more interdependent and think more holistically than nearby wheat county. Soc. Psychol. Pers. Sci. 10, 966–976 (2019).
Article Google Scholar
Lacko, D. et al. The necessity of testing measurement invariance in cross-cultural research: Potential bias in cross-cultural comparisons with individualism–collectivism self-report scales. Cross-Cult Res. 56, 228–267 (2022).
Article Google Scholar
van de Vijver, F. J. R. & Leung, K. Equivalence and bias: A review of concepts, models, and data analytic procedures. In Cross-cultural Research Methods in Psychology. 17–45 (Cambridge University Press, 2011).
Vignoles, V. L. et al. Beyond the ‘east–west’ dichotomy: Global variation in cultural models of selfhood. J. Exp. Psychol. Gen. 145, 966–1000 (2016).
Article PubMed Google Scholar
Hu, L. T. & Bentler, P. M. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ Model. 6, 1–55 (1999).
Article Google Scholar
Yarbus, A. L. Eye Movements and Vision (Springer, 1967).
Book Google Scholar
Senzaki, S. et al. When is perception top-down and when is it not? Culture, narrative, and attention. Cogn. Sci. 38, 1493–1506 (2014).
Article PubMed Google Scholar
Köster, M. & Kärtner, J. Context-sensitive attention is socialized via a verbal route in the parent-child interaction. PLoS One. 13, e0207113 (2018).
Article PubMed PubMed Central Google Scholar
Porubanova-Norquist, M. & Sikl, R. Various categories of changes in the inducement of change blindness. Stud. Psychol. 55, 83–94 (2013).
Google Scholar
Itti, L. et al. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20, 1254–1259 (1998).
Article Google Scholar
Hsiao, J. H. et al. Eye movement analysis with hidden Markov models (EMHMM) with co-clustering. Behav. Res. Methods. 53, 2473–2486 (2021).
Article PubMed PubMed Central Google Scholar
Dunn, J. D. et al. Face-information sampling in super-recognizers. Psychol. Sci. 33, 1615–1630 (2022).
Article PubMed Google Scholar

Download references

Acknowledgements

We would also like to thank the HUME Lab–Experimental Humanities Laboratory, Masaryk University, for providing us with the necessary machine time and equipment. The data gathering was supported by the Czech Science Foundation (GC19-09265 J: The Influence of Socio-Cultural Factors and Writing Systems on the Perception and Cognition of Complex Visual Stimuli; https://gacr.cz/en/ and the data analysis and publication by the project LL2304 with financial support from the Ministry of Education, Youth and Sports within the targeted support of the ERC CZ program. The funders had no role in the study’s design, data collection and analysis, the decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

Department of Social Studies, Faculty of Regional Development and International Studies, Mendel University in Brno, Zemědělská 1, 61300, Brno, Czechia
Jiří Čeněk
Interdisciplinary Research Team on Internet and Society, Faculty of Social Studies, Masaryk University, Brno, Czechia
Jiří Čeněk, David Lacko & Čeněk Šašinka
Laboratory of Information and Cognitive Sciences, Faculty of Arts, Masaryk University, Brno, Czechia
Jiří Čeněk, Daniela Halámková, David Lacko, Zdeněk Stachoň, Nicol Dostálová, Alžběta Šašinková, Pavel Ugwitz & Čeněk Šašinka
Department of Regional Development, Faculty of Regional Development and International Studies, Mendel University in Brno, Brno, Czechia
Jan Caha
Department of Geoinformatics, Faculty of Mining and Geology, VŠB–Technical University of Ostrava, Ostrava, Czechia
Jan Caha
Institute of Mathematics, Faculty of Mechanical Engineering, Brno University of Technology, Brno, Czechia
Petra Kalenská
Department of Psychology, National Chengchi University, Taipei, Taiwan
Jie-Li Tsai
Research Center for Mind, Brain, and Learning, National Chengchi University, Taipei, Taiwan
Jie-Li Tsai
Department of Public Administration, University of Ghana Business School, Accra, Ghana
Albert Ahenkan
Department of Psychiatry & Psychotherapy, Tübingen Center for Mental Health, University Hospital Tübingen, Tübingen, Germany
Thomas Dresler
LEAD Graduate School & Research Network, University of Tübingen, Tübingen, Germany
Thomas Dresler
German Center for Mental Health (DZPG), partner site Tübingen, Tübingen, Germany
Thomas Dresler
Department of Education and Psychology, Experimental and Neurocognitive Psychology, Freie Universität Berlin, Berlin, Germany
Jana Lüdtke

Authors

Jiří Čeněk
View author publications
Search author on:PubMed Google Scholar
Daniela Halámková
View author publications
Search author on:PubMed Google Scholar
Jan Caha
View author publications
Search author on:PubMed Google Scholar
David Lacko
View author publications
Search author on:PubMed Google Scholar
Petra Kalenská
View author publications
Search author on:PubMed Google Scholar
Zdeněk Stachoň
View author publications
Search author on:PubMed Google Scholar
Jie-Li Tsai
View author publications
Search author on:PubMed Google Scholar
Albert Ahenkan
View author publications
Search author on:PubMed Google Scholar
Thomas Dresler
View author publications
Search author on:PubMed Google Scholar
Jana Lüdtke
View author publications
Search author on:PubMed Google Scholar
Nicol Dostálová
View author publications
Search author on:PubMed Google Scholar
Alžběta Šašinková
View author publications
Search author on:PubMed Google Scholar
Pavel Ugwitz
View author publications
Search author on:PubMed Google Scholar
Čeněk Šašinka
View author publications
Search author on:PubMed Google Scholar

Contributions

ČJ: Conceptualization, Methodology, Investigation, Formal Analysis, Data Curation, Writing – Original Draft, Writing – Review & Editing, Visualization; HD: Investigation, Methodology, Writing – Review & Editing; CJ: Formal Analysis, Writing – Review & Editing; LD: Investigation, Formal Analysis, Writing – Review & Editing; KP: Formal analysis, Writing – Review & Editing, Visualization; SZ: Investigation, Writing – Review & Editing; TJ-L: Investigation, Writing – Review & Editing; AA: Investigation, Writing – Review & Editing; DT: Investigation, Writing – Review & Editing; LJ: Investigation, Writing – Review & Editing; DN: Investigation, Writing – Review & Editing; ŠA: Investigation, Writing – Review & Editing; UP: Investigation, Writing – Review & Editing; ŠČ: Conceptualization, Methodology, Investigation, Writing – Review & Editing, Visualization, Supervision, Funding Acquisition.

Corresponding author

Correspondence to Jiří Čeněk.

Ethics declarations

Competing interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Čeněk, J., Halámková, D., Caha, J. et al. Cross-cultural analysis of eye-movement patterns in visual scene perception: a comparison of seven cultural samples. Sci Rep 15, 28574 (2025). https://doi.org/10.1038/s41598-025-12724-x

Download citation

Received: 15 October 2024
Accepted: 18 July 2025
Published: 05 August 2025
DOI: https://doi.org/10.1038/s41598-025-12724-x

Subjects

Abstract

Similar content being viewed by others

Exploring cross-cultural variations in visual attention patterns inside and outside national borders using immersive virtual reality

Distribution and associated factors of keratometry and corneal astigmatism in an elderly population

Eye tracking based clustering using the Korean version of the reading the mind in the eyes test

Introduction

Methods

Research ethics

Use of LLMs

Materials

Real-world scenes

Sociodemographic questionnaire

Apparatus and procedure

Data processing and cleaning

Sampling plan

Data processing—raw data

Data processing—Ogama

Data processing—quality check after fixation calculation

Tests of assumptions and equivalence

Research sample

Results

Equivalence of Türkish samples

Dwell time on focal objects

Saccadic length

Temporal saccadic length changes

Effects of stimulus characteristics on dwell time

Effect of number of focal objects

Effect of relative size of focal object

Effects of sociodemographic variables

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links