Abstract
Infants’ tendency to focus on socially relevant cues is commonly referred to as ‘social attention’ and is often conceptualized as a unified construct. However, few studies have examined the concordance among measures that fall under this label. In this study, we assessed 50 ten-month-old infants using three common measures of social visual attention: eye (versus mouth) preference, gaze following, and face preference (versus non-social objects). We also examined their association with concurrent parent-rated socio-communicative abilities. No significant associations were found among the gaze measures. Notably, only eye preference was uniquely associated with communicative abilities (β = 0.364, p = 0.018), while the other gaze measure showed no such relationship. These findings suggest that what is commonly referred to as ‘social attention’ reflects a set of functionally distinct and seemingly unrelated behaviors, highlighting the need to re-evaluate its conceptualization as a unified construct in early development.
Introduction
Selective visual attention represents one of the earliest mechanisms through which infants engage with and learn from their environment, and from a young age, children show a marked preference for social stimuli (e.g.1,2,3). Already at birth, newborns preferentially orient towards faces4, and this preference for faces increases continuously throughout the first year of life5. Individual differences in infants’ selective attention to faces are predictive of later outcomes. For example, a stronger preference for images of static faces over non-social objects at five months has been associated with later language comprehension6.
When looking at dynamic faces, infants generally prefer looking at the eyes of another person in the first six months of life (e.g.7), a preference that shifts toward the mouth in the latter half of the year (e.g.8). In line with this preferential attention, a tendency to look at the eyes at five months has been found to predict later language comprehension7, while increased attention to the mouth in the second half of the first year has been associated with concurrent and later social and communicative skills9,10,11.
Another milestone in social development is the ability to synchronize visual attention with others towards external objects, enabling infants to direct focus towards the most relevant information for their learning and environment. This ability, known as gaze following, is central to attention sharing and social cognition12,13 and has been linked to both language development (see14 for a systematic review) and later socialization behaviors15.
While face preference, early eye-vs-mouth looking, and the emergence of gaze following each follow distinct developmental trajectories, they share a common functional aim: to guide infants’ visual attention toward socially meaningful information. This tendency to focus on socially relevant cues is commonly referred to as ‘social attention’ and is often conceptualized as a single, unified construct (e.g.16). Framing these patterns of looking as facets of a broader social attentional system help explain their collective role in scaffolding later social and communicative development. In this sense, social attention in infancy is often regarded as early social competency.
Consistent with the view of social attention as a unified construct are studies demonstrating that visual attention to the mouth of a talking person and the tendency to follow gaze is moderately correlated (r = 0.26) in 12-month-olds10, suggesting that these behaviors may stem from a shared drive for social engagement. At the same time, individual differences indicate that this system is not monolithic as children vary widely in how they attend to social stimuli6,7,17. A recent twin study of more than 500 five-month-olds showed that both eye preference (versus looking at the mouth) and face preference (versus looking at non-social objects) are largely heritable, yet only weakly correlated (r = 0.11), and appear to be shaped by distinct genetic factors6. This challenges the view that all social looking behaviors stem from a single underlying trait and instead suggests they emerge from partially independent underlying mechanisms and are shaped by separate evolutionary pressures18.
Traits that are initially unrelated may become correlated over time19, which might explain the lack of association among social attention measures at 5 months6 and the moderate association found later at 12 months10. This view aligns with neurodevelopmental accounts suggesting that the social brain is not yet matured in infancy20,21. Although regions such as the STS, fusiform gyrus, and amygdala are active in infants during processing of social information, these responses are not yet fully integrated and socially specialized. At the same time, research on the concordance among measures frequently labeled as ‘social attention’ is scarce, highlighting the need for a more detailed investigation of this foundational construct. In this study, we aimed to examine the association between three commonly used eye-tracking measures of social attention: the eye-mouth-index (EMI; i.e., eye versus mouth preference), gaze following (GF), and face preference (versus non-social objects) in 10-month-old infants, an age at which they reliably exhibit a preference for faces5, demonstrate the ability to follow gaze22, show robust differentiation in attention to the eyes versus the mouth8, and are thought to possess a more advanced understanding of others as intentional social beings23,24. We hypothesized that we would find a statistically significant correlation among all three measures of social attention. We expected to find a positive association between gaze following and face preference and a negative association between EMI and gaze following (meaning that more gaze following is associated with more mouth-looking, in line with earlier findings in 12-month-olds10. However, we had no directional hypothesis regarding the association between EMI and face preference. Without correlations among these three measures, it is difficult to regard them as reflecting a unified construct of social attention in the way they are typically portrayed in the literature: as indicators of an early social competency.
A unified dimension of social attention would suggest that these measures are similarly associated with social behaviors. Therefore, we also examined whether these measures relate to concurrently measured parent-rated socio-communicative abilities. We hypothesized that we would find a significant association between all three measures of social attention and socio-communicative abilities. We expected the associations between socio-communicative abilities and gaze following/face preference to be positive, but we had no directional hypothesis regarding the association with EMI. All hypotheses were pre-registered at OSF (https://osf.io/dsvkf/).
Methods
Participants
Participants were recruited at the Child and Babylab at Uppsala University, from a list of families who has previously expressed interest in participating in research with their child. The families were contacted via email or by phone, and then invited to the lab for the eye tracking session. Infants were included only if their age was 10 months (+/- 2 weeks), they had no uncorrected visual or hearing impairment, they were born at week 37 or later, and they had no traumatic brain injury or neurological condition. After the eye tracking session, which lasted for approximately 8 min, the caregiver filled in questionnaires on demographic information and child behavior. In total, 53 infants participated in the study. Three infants were subsequently excluded due to technical issues with the eye tracker, and the final sample (with valid data in at least one of the conditions) was 50 children. Demographic information is provided in Table 1. The study was approved by the Swedish Ethical Review Authority and was conducted in accordance with the Declaration of Helsinki. Written informed consent was obtained from all caregivers.
Eye tracking measures
Gaze data was recorded using a Tobii TX300 eye tracker (120 Hz) and a Tobii Pro Spectrum (120 Hz), each with a standard screen (24”), 1920 × 1080 pixels. Of the total sample, 41 were tested using the TX300 eye tracker, and 9 were tested using the Tobii Pro Spectrum eye tracker (due to sudden technical issues with the TX300 towards the end of the data collection). The infant was seated in their parent’s lap, approximately 60 cm from the screen. Before the eye-tracking session, a 5-point calibration session was conducted.
Before analyzing the gaze data, each participant’s data at all instances of attention-grabbers (salient stimuli presented in the middle of the screen between trials) were evaluated via ocular inspection, and a simple linear transformation of data was performed when linear drifts were detected (using custom MATLAB scripts).
EMI condition
In this condition, infants were presented with 12 videos of two different women singing common Swedish nursery rhymes (each woman appeared in six videos). These videos have been previously used in studies of infant gaze patterns (e.g.7). AOIs were created by first analyzing the videos using the open-source software OpenFace, which detects faces via computer vision and extracts facial landmarks25. We then used the x- and y-coordinates in pixels for different facial landmarks to create dynamic AOIs for each frame of the videos. The eye AOI was 510 × 180 pixels, and the mouth AOI was 350 × 150 pixels (see Fig. 1). The horizontal radius of the face AOI was 300 pixels and the vertical radius of the face AOI was 400 pixels. The EMI was calculated based on total looking time at the eyes divided by total looking time at both eyes and mouth (scale 0–1, where 0 means looking exclusively at the mouth and 1 means looking exclusively at the eyes). Trials were excluded if the infant looked at the screen for less than 2.5 s, and at least four valid trials were needed in order to be included in the EMI analyses.
GF condition
In this condition, the infants viewed 6 videos depicting a woman looking at the camera before looking at one of the objects in front of her. A ‘beep’ sound was played before the woman looked at the camera. Three rectangular AOIs were created: one covering the actor’s face and two covering the toys (see Fig. 2). GF was assessed using the first look paradigm, where the infant’s initial gaze shift from the actor to either toy was recorded as either congruent or incongruent. GF was calculated by subtracting the number of incongruent trials from the number of congruent trials, resulting in a difference score. Over six trials, this score ranged from − 6 to 6, with a positive score indicating greater GF. A trial was classified as valid if the child looked at the face and then at either of the objects. At least two valid trials were needed in order to be included in further analyses.
Face preference condition
In this condition, infants were shown 8 arrays of complex displays (see Fig. 3), each shown for 8 s. These arrays included one face (with direct gaze, and with its location in the array counterbalanced between trials), and three non-face objects. Half of the arrays included a female face and half of them included a male face, every array included a unique face not present in any other stimuli. These images were created using the Chicago Face Database26. The non-face objects consisted of both objects that are familiar to most children (e.g., a ball, a clock, and a lamp) and objects that may be unfamiliar at this age (e.g., a beetle, a crystal, and an umbrella). Before each array, an attention grabber was presented in the middle of the screen. The AOIs were 530 × 470 pixels, and covered all objects in the array (Fig. 3). Face preference was calculated as the total looking time at the face divided by the total looking time at the face and all objects (scale 0–1, where 0 means no looking time at the face and 1 means that the child looked exclusively at the face). Trials were excluded if participants looked at the screen for less than 3 s. At least two valid trials were needed in order to be included in further analyses of the face preference condition.
Parent-rated measure
Socio-communicative abilities were measured using the Infant-Toddler Checklist (ITC;27), which is a 24-item parent-rated questionnaire, used to identify children with any type of socio-communicative delay. Higher scores indicate a higher degree of socio-communicative abilities. Items include, for example, questions on whether the parent knows when the child is happy or sad, and whether the child lets the parent know when they need help reaching an object. We used the total score as the primary measure of socio-communicative abilities. We also extracted composite scores on the communication, expressive speech, and symbolic subscales.
Statistical analyses
We examined the associations between each looking measure by calculating Pearson’s r. As per the analysis plan, a confirmatory factor analysis (CFA) and a calculation of Cronbach’s alpha for the three measures were planned. However, this would only be performed if significant correlations were found between the measures.
In addition, we examined the associations between each looking measure and concurrent socio-communicative abilities by calculating Pearson’s r for each association.
The EMI was not associated with sex (r = -0.007, p = 0.965), age (r = -0.007, p = 0.960), family income (r = -0.003, p = 0.984), maternal education level (r = -0.266, p = 0.071), paternal education level (r = 0.053, p = 0.722), or number of children in the household (r = 0.134, p = 0.370). The difference score in the gaze following condition was not associated with sex (r = 0.220, p = 0.157), age (r = -0.039, p = 0.806), family income (r = 0.089, p = 0.571), maternal education level (r = 0.084, p = 0.591), paternal education level (r = -0.006, p = 0.971) or number of children in the household (r = -0.089, p = 0.568). Similarly, face preference was not associated with sex (r = -0.119, p = 0.410), age (r = -0.275, p = 0.053), family income (r = 0.142, p = 0.324), maternal education level (r = 0.200, p = 0.163), paternal education level (r = 0.163, p = 0.259), or number of children in the household (r = -0.173, p = 0.229). Based on these results, no covariates were included in further analyses.
Results
Descriptive statistics on eye tracking measures and the ITC are presented in Table 2 (see Supplementary Information S1 for distributional plots of the primary gaze measures and Supplementary Information S2 for distributional plots of the ITC). The number of included trials for each condition was not associated with the EMI (r = -0.186, p = 0.344), GF (r = 0.208, p = 0.181), or face preference (r = 0.040, p = 0.783).
The mean EMI (0.62) was significantly different from 0.50 (t(46) = 3.267, p < 0.001, Cohen’s d = 0.477), meaning that, at a group level, the infants looked preferentially more at the eyes than at the mouth. The mean GF difference score (1.81) was significantly different from zero (t(42) = 5.534, p < 0.001, Cohen’s d = 0.844), meaning that, at a group level, the infants displayed a tendency to follow the gaze of the actor. The mean percentage of looking time at the face in the face preference condition was 0.53, which was significantly different from 0.25 (i.e., chance level; t(49) = 14.580, p < 0.001, Cohen’s d = 2.062), indicating a preference for looking at the face over the non-social objects.
There was no significant association between the EMI and GF (r = -0.196, p = 0.207), between EMI and face preference (r = -0.103, p = 0.489), or between GF and face preference (0.036, p = 0.818). Due to the lack of correlations across measures, we did not move on to a factor analysis or a Cronbach’s alpha. Bayesian analyses suggested moderate support for the null hypothesis with regards to all three correlations (EMI vs. GF: BF01 = 3.809, EMI vs. face preference: BF01 = 6.915, GF vs. face preference: BF01 = 8.183).
There was no significant association between the EMI and ITC (r = 0.276, p = 0.061), GF and ITC (r = 0.098, p = 0.533), or face preference and ITC (r = -0.244, p = 0.087). In light of these findings, we added a number of analyses to further examine potential associations between early gaze behaviors and socio-communicative abilities. None of the following analyses were pre-registered, starting with a linear regression model including all gaze measures as predictors and ITC total score as outcome variable. The regression model was significant (F(3,39) = 2.924, p = 0.046, R2 = 0.184), and the EMI was a significant positive predictor of ITC (β = 0.357, p = 0.022). Neither GF (β = 0.174, p = 0.245) nor face preference (β = -0.180, p = 0.228) were significant predictors of ITC. To further examine the association to socio-communicative abilities, we added all gaze measures to three linear regression models, with the three subscales of ITC as outcome variables. The regression model with the communication subscale as outcome was significant (F(3,39) = 3.370, p = 0.028, R2 = 0.206). The EMI was a significant predictor (β = 0.364, p = 0.018), but neither GF (β = 0.252, p = 0.092) nor face preference (β = -0.171, p = 0.245) were significant predictors. The regression model with expressive speech as outcome variable was not significant (F(3,39) = 0.787, p = 0.509) and neither was the regression model with the symbolic subscale as outcome (F(3,39) = 1.902, p = 0.145).
Additional analyses
As prompted by a reviewer, we additionally calculated the number of times each infant first fixated on the face in the face preference task. Fixations were extracted using the fixation detection algorithm I2MC28. Fixation durations were only considered valid if the temporal duration was 100 milliseconds or longer. We then calculated the ratio of first looks to the face (relative to the number of valid trials for each participant). The mean ratio was 0.512 (SD = 0.228, Min = 0.125, Max = 1.00). There was no statistically significant association between EMI and ratio of first looks to faces (r = -0.034, p = 0.822) or between gaze following and ratio of first looks to faces (r = -0.072, p = 0.646).
Discussion
In this study of 10-month-old infants, we examined whether three standard measures of visual social attention reflect a common dimension and whether they are related to concurrent socio-communicative behaviors as rated by their caregiver. We found no associations across the tendency to look at eyes versus mouth, the tendency to follow gaze, and the preference for looking at faces (versus non-social objects). Bayesian analyses suggested moderate support for the null hypothesis across these correlations. These findings, together with behavioral genetic evidence, support a broader theoretical reconsideration of social attention in infancy. Specifically, our results call for a more nuanced understanding of the behaviors that fall under this umbrella term. Rather than reflecting a unified construct (e.g.16), social attention may be better understood as an umbrella term for multiple functionally distinct behaviors with separate developmental trajectories (e.g.18). This interpretation aligns with recent behavioral genetic findings showing that different genetic factors influence eye and face preferences in early infancy6, indicating that different social attention measures reflect different underlying processes.
While these behaviors may draw on innate sensitivities to social agents as proposed by core knowledge theories29, the lack of coherence at 10 months suggests that these sensitivities are expressed through diverse pathways rather than a unified system. Future work should explore how and when these behaviors begin to align and whether such alignment is driven by neural maturation, social learning, or interactive feedback from communicative success. From a neurodevelopmental perspective, the social brain is still maturing at 10 months and is not yet functionally integrated20,21. Within this framework, early activations in regions such as the STS, fusiform gyrus, and amygdala reflect broadly tuned, perceptual responses to social stimuli that serve as precursors of later social specialization. Functional connectivity among these regions increases gradually across the first years of life, supporting a transition from domain-general perceptual responsiveness to selective, socially specific processing21. In other words, early activation in these regions does not necessarily imply shared representational meaning.
It is further possible that a lack of association among these gaze behaviors could be due to low-level domain-general visual processing styles, which affect these measures differently. For example, a recent study in adults found that the EMI is highly correlated with vertical looking preference on non-social stimuli30, suggesting that an eye or mouth preference merely reflect a preference for upper or lower stimuli, regardless of social saliency. Whether this pattern extends to infants remains to be tested. Gaze following, in turn, might be affected by sensitivity to certain perceptual cues, such as the spatial distance between the actor and the object31,32,33, or motion towards an object22,34,35. Domain-general processing across these measures does not imply that they should correlate, because the relevant perceptual or attentional processes may not rely on the same underlying mechanisms. If both domain-general and domain-specific mechanisms jointly govern these behaviors, then accounting for these perceptual biases or processing styles may reveal associations that are otherwise obscured. In other words, once low-level attentional tendencies are measured and controlled for, the relationship among social attention measures could strengthen and more clearly reflect socially driven attention. Identifying and modeling these domain-general influences therefore represents an important direction for future research.
While Tenenbaum et al.10 found a moderate association between mouth preference and gaze following at 12 months of age, it is important to note that both measures in their study were derived from responses to the same set of stimuli videos. The gaze measures in the present study were derived from separate sets of stimuli, each of which has either been repeatedly used in previous studies (EMI and gaze following) or is closely modeled on well-established stimuli (face preference) that have reliably produced robust effects6,7,36. While we did not find any significant associations across these measures at 10 months of age, it remains possible that different social attention behaviors become correlated over time19. Neurodevelopmental accounts emphasizing immature integration of the social brain network predict weak or absent correlations early in life, with stronger coherence emerging as connectivity strengthens. A key limitation is that the present study includes infants at only one age. Consequently, our results cannot determine whether the lack of concordance at 10 months reflects a persistent fractionation of social attention or whether stronger associations emerge later in development. Distinguishing between these alternatives requires either longitudinal designs or cross-sectional samples spanning a broader age range.
The developmental dissociation among gaze measures in our study was further reflected in how they related to concurrent socio-communicative abilities more broadly. We examined behaviors such as whether the infant tries to get their caregiver’s attention, whether they wave to greet people, and whether they try to make the caregiver laugh, as captured by the ITC scale. After controlling for gaze following and face preference, we found that only the eye-mouth-index (EMI) was uniquely related to concurrent communicative behaviors with a positive association between preference for eyes and concurrent communicative behaviors. Although gaze following has previously been linked to later language outcomes in toddlerhood(14, though see37 for a critical appraisal), the ITC items at this age reflect many behaviors related to initiating joint attention. This ability is, perhaps somewhat surprisingly, not associated with the ability of responding to joint attention, which gaze following is an example of38. The tendency to look at eyes versus the mouth may instead be more closely linked to the mechanisms underlying initiating joint attention, which could explain the lack of association between the EMI and gaze following. However, this interpretation remains speculative and warrants further investigation.
An alternative explanation to the lack of concordance among the social attention measures is that they reflect different expressions of social preferences rather than social competences. For instance, infants might prefer looking at the eyes or the mouth, both socially relevant stimuli, even if they differ in informativeness. While this argument may hold some merit for the EMI, it is more difficult to apply to the other measures: Infants do disengage from faces, and when they do, some synchronize visual attention through gaze following while others do not. At this point, there are no competing socially relevant targets comparable to the eye–mouth distinction, and thus no alternative “social” way of looking, only visual synchronization. The same reasoning applies to face preference, where no other social stimulus is present. Therefore, while the EMI could arguably reflect individual differences in social preference, this reasoning cannot explain the lack of association between face preference and gaze following. Moreover, the literature treats these social attention measures as indicators of early social competence. Under this conceptualization, they should reflect a shared underlying construct, and their lack of correlation is therefore theoretically problematic. Even if one interprets the three measures as tapping partially distinct facets of social processing, at least moderate correlations should be detectable under the assumption that they are indicators of a unified dimension implying social competence. Bayesian analyses instead provided evidence in favor of the null hypothesis for all pairwise associations, supporting the conclusion that these measures do not operate as a single unified construct at 10 months. However, it remains possible that the measures reflect genuinely distinct facets of social attention. These facets may vary in the extent to which they depend on domain-general versus domain-specific processes: some behaviors may be shaped by combinations of preference, perceptual biases, and processing speed, whereas others may more directly reflect social understanding or social motivation. Under this view, a specific social preference need not map onto social competence, and dissociation across measures would not necessarily imply a failure of any single construct but rather the presence of multiple, distinct components of early social attention. Consequently, it may be more appropriate to discuss these measures in terms of the specific processes they index, rather than treating them broadly as social attention.
The lack of concordance among these measures may also have implications for research on neurodevelopmental conditions, such as autism. If different gaze measures do not cohere even in typical development, it will be important for future work to examine whether atypical performance on one measure predicts difficulties on others, or whether different measures might identify distinct autistic subgroups with differing developmental pathways. Such work could also clarify whether interventions should target specific components of social attention rather than assuming a single underlying mechanism, and whether trajectories of concordance across measures differ between autistic and neurotypical children.
While we included three frequently used gaze measures of infant social attention, our study is not covering all measures that have been labelled as ‘social attention’. For example, earlier eye tracking studies have also focused on how infants view interaction among other children39, how they view complex social scenes40, and how they respond to bids for joint attention in a live situation41. A more comprehensive analysis of early social looking behaviors might shed light on whether there are other measures of social attention that may be intercorrelated. However, it is worth noting that the measures included in this study are well-known and widely used in the field of infant eye tracking.
A limitation of the current study is the relatively small sample size. While we would expect moderate-to-large associations among measures belonging to unified construct, we did not have power to detect potential small-to-moderate effects. We included Bayesian analyses suggesting moderate support for the null hypothesis, but future studies should recruit a larger and more diverse sample, in order to examine whether there might be small (but significant) associations across these social attention measures.
Conclusions
How infants orient to and scan faces, and how they follow the gaze of other people, are behaviors often referred to as ‘social attention’, though attempts to assess them as a unified construct remain scarce in the literature. In this study, we found no statistically significant associations between any of these measures. Furthermore, only the tendency to look at eyes (versus mouth) was uniquely related to concurrent communicative abilities, while no association to other gaze measures was found. These findings suggest that the concept of ‘social attention’, as currently defined, reflect fragmented behaviors, and a growing body of research suggests that much of what has been labeled as social attention may instead be driven by domain-general perceptual or attentional mechanisms. Together, these findings call for a re-evaluation of the notion that social attention represents a unified dimension early in development.
Data availability
The analyses presented here were preregistered at OSF (https://osf.io/dsvkf/). The data and code necessary to reproduce the analyses presented here are not publicly accessible, but will be made available upon reasonable request to the corresponding author. Note that sharing of pseudonymized personal data will require a data sharing agreement, according to Swedish and EU law.
References
Di Giorgio, E. et al. Face detection in complex visual displays: an eye-tracking study with 3- and 6-month-old infants and adults. J. Exp. Child. Psychol. 113 (1), 66–77 (2012).
Frank, M. C., Amso, D. & Johnson, S. P. Visual search and attention to faces during early infancy. J. Exp. Child Psychol. 118, 13–26 (2014).
Frank, M. C., Vul, E. & Johnson, S. P. Development of infants’ attention to faces during the first year. Cognition 110 (2), 160–170 (2009).
Farroni, T., Menon, E. & Johnson, M. H. Factors influencing newborns’ preference for faces with eye contact. J. Exp. Child Psychol. 95 (4), 298–308 (2006).
Belteki, Z. et al. How infants direct their gaze to faces in the presence of other objects: the development of face preference between 4 and 7 months after birth. Infancy 30 (1), e12633 (2025).
Portugal, A. M. et al. Infants’ looking preferences for social versus non-social objects reflect genetic variation. Nat. Hum. Behav. 8 (1), 115–124 (2024).
Viktorsson, C. et al. Preferential looking to eyes versus mouth in early infancy: heritability and link to concurrent and later development. J. Child. Psychol. Psychiatry (2022).
Lewkowicz, D. J. & Hansen-Tift, A. M. Infants deploy selective attention to the mouth of a talking face when learning speech. Proc. Natl. Acad. Sci. U.S.A. 109 (5), 1431–1436 (2012).
Pons, F., Bosch, L. & Lewkowicz, D. J. Twelve-month-old infants’ attention to the eyes of a talking face is associated with communication and social skills. Infant Behav. Dev. 54, 80–84 (2019).
Tenenbaum, E. J. et al. Attention to the mouth and gaze following in infancy predict Language development. J. Child. Lang. 42 (6), 1173–1190 (2015).
Young, G. S. et al. Gaze behavior and affect at 6 months: predicting clinical outcomes and Language development in typically developing infants and infants at risk for autism. Dev. Sci. 12 (5), 798–814 (2009).
Flom, R., Lee, K. & Muir, D. Gaze-following: its Development and Significance (Psychology Press, 2017).
Mundy, P. C. Autism and Joint Attention: Development, neuroscience, and Clinical Fundamentals (Guilford Publications, 2016).
Cetincelik, M., Rowland, C. F. & Snijders, T. M. Do the eyes have it? A systematic review on the role of eye gaze in infant Language development. Front. Psychol. 11, 589096 (2020).
Sheinkopf, S. J. et al. Infant joint attention skill and preschool behavioral outcomes in at-risk children. Dev. Psychopathol. 16 (2), 273–291 (2004).
Constantino, J. N. et al. Infant viewing of social scenes is under genetic control and is atypical in autism. Nature 547 (7663), 340– (2017).
Kennedy, D. P. et al. Genetic influence on eye movements to complex scenes at short timescales. Curr. Biol. 27 (22), 3554– (2017).
Shakeshaft, N. G. & Plomin, R. Genetic specificity of face recognition. Proc. Natl. Acad. Sci. U S A. 112 (41), 12887–12892 (2015).
van der Maas, H. L. J. et al. A dynamical model of general intelligence: the positive manifold of intelligence by mutualism. Psychol. Rev. 113 (4), 842–861 (2006).
Johnson, M. H. et al. The emergence of the social brain network: evidence from typical and atypical development. Dev. Psychopathol. 17 (3), 599–619 (2005).
Grossmann, T. The development of social brain functions in infancy. Psychol. Bull. 141 (6), 1266–1287 (2015).
Astor, K., Thiele, M. & Gredebäck, G. Gaze following emergence relies on both perceptual cues and social awareness. Cogn. Dev. 60, 101121 (2021).
Brooks, R. & Meltzoff, A. N. The development of gaze following and its relation to Language. Dev. Sci. 8 (6), 535–543 (2005).
Rochat, P. Early Social Cognition: Understanding Others in the First Months of Life. (Psychology Press, 2014).
Baltrušaitis, T., Robinson, P. & Morency, L. P. Openface: an open source facial behavior analysis toolkit. In IEEE Winter Conference on Applications of Computer Vision (WACV). (IEEE, 2016).
Ma, D. S., Correll, J. & Wittenbrink, B. The Chicago face database: A free stimulus set of faces and norming data. Behav. Res. Methods. 47, 1122–1135 (2015).
Wetherby, A. & Prizant, B. Communication and Symbolic Behaviour Scales Developmental Profile- First Normed Edition (Paul H. Brookes, 2002).
Hessels, R. S. et al. Noise-robust fixation detection in eye movement data: identification by two-means clustering (I2MC). Behav. Res. Methods. 49 (5), 1802–1823 (2017).
Spelke, E. What Babies Know: Core Knowledge and Composition 1 (Oxford University Press, 2022).
Broda, M. D. & de Haas, B. Individual differences in human gaze behavior generalize from faces to objects. Proc. Natl. Acad. Sci. U S A. 121 (12), e2322149121 (2024).
Astor, K. & Gredeback, G. Gaze following in 4.5- and 6-month-old infants: the impact of proximity on standard gaze following performance tests. Infancy 24 (1), 79–89 (2019).
Butterworth, G. & Jarrett, N. What Minds have in common is space: Spatial mechanisms serving joint visual attention in infancy. Br. J. Dev. Psychol. 9 (1), 55–72 (1991).
D’Entremont, B. A perceptual–attentional explanation of gaze following in 3-and 6‐month‐olds. Dev. Sci. 3 (3), 302–311 (2000).
Deák, G. O. When and where do infants follow gaze? In Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob). (IEEE, 2015).
Farroni, T. et al. Infants’ use of gaze direction to cue attention: the importance of perceived motion. Visual Cognition. 7 (6), 705–718 (2000).
Astor, K. et al. Social and emotional contexts predict the development of gaze following in early infancy. R Soc. Open. Sci. 7 (9), 201178 (2020).
Astor, K. & Gredeback, G. Gaze following in infancy: five big questions that the field should answer. Adv. Child. Dev. Behav. 63, 191–223 (2022).
Astor Währborg, K. Gaze following in infancy: mechanisms and developmental context. Acta Universitatis Upsaliensis (2024).
Viktorsson, C., Bolte, S. & Falck-Ytter, T. How 18-month-olds with later autism look at other children interacting: the timing of gaze allocation. J. Autism Dev. Disord. (2023).
Kelly, D. J. et al. Infants rapidly detect human faces in complex naturalistic visual scenes. Dev. Sci. 22 (6), e12829 (2019).
Thorup, E. et al. Altered gaze following during live interaction in infants at risk for autism: an eye tracking study. Mol. Autism. 7, 12 (2016).
Funding
Open access funding provided by Uppsala University.
Author information
Authors and Affiliations
Contributions
The hypotheses and goals of this study were conceptualized by C.V and K.A. Stimuli were created by C.V and K.A. Data were preprocessed by C.V and K.A, and analyzed by C.V. C.V drafted the manuscript, and both authors reviewed, edited, and approved the final manuscript for submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Viktorsson, C., Astor, K. Lack of concordance among infant social attention measures. Sci Rep 16, 2591 (2026). https://doi.org/10.1038/s41598-026-36807-5
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-026-36807-5


