Learning of the mean, but not variance, of color distributions cues target location probability

Blondé, Philippe; Hansmann-Roth, Sabrina; Pascucci, David; Kristjánsson, Árni

doi:10.1038/s41598-024-84750-0

Download PDF

Article
Open access
Published: 04 March 2025

Learning of the mean, but not variance, of color distributions cues target location probability

Philippe Blondé¹,
Sabrina Hansmann-Roth¹,
David Pascucci^2,3 &
…
Árni Kristjánsson¹

Scientific Reports volume 15, Article number: 7591 (2025) Cite this article

1626 Accesses
3 Altmetric
Metrics details

Subjects

Abstract

Humans are good at picking up statistical regularities in the environment. Probability cueing paradigms have demonstrated that the location of a target can be predicted based on spatial regularities. This is assumed to rely on flexible spatial priority maps that are influenced by visual context. We investigated whether stimulus features such as color distributions differing in mean and variance can cue location regularities. In experiment 1, participants searched for an oddly colored target diamond in a 6 × 6 set. On each trial, the distractors were drawn from one of two color distributions centered on different color averages. Each distribution was associated with different target location probabilities, one distribution where the target had an 80% chance to appear on the left (the rich location), while the rich location would be on the right for the other distribution. Participants were significantly faster at locating the target when it appeared in the rich location for both distributions, demonstrating learning of the relationship between color average and location probability. In experiments 2 and 3, observers performed a similar search task, but the distributions had different variances with the same average color. There was no evidence that search became faster when the target appeared in a rich location, suggesting that contingencies between target probabilities and color variance were not learned. These results demonstrate how statistical location learning is flexible, with different visual contexts leading to different spatial priority maps, but they also reveal important limits to such learning.

Statistical learning re-shapes the center-surround inhibition of the visuo-spatial attentional focus

Article Open access 05 March 2025

Attenuation of spatial bias with target template variation

Article Open access 03 April 2024

Target templates and the time course of distractor location learning

Article Open access 30 January 2023

Introduction

Imagine you are looking for your cat in the garden. You have noticed that on bright days, she tends to lounge on the lawn, while on cloudy or rainy days, she prefers to stay under the tree. Since the sky is bright today, you instinctively check the lawn first. This simple example illustrates our ability to learn contextual and statistical regularities in the environment and direct our attention to locations where objects of interest are likely to appear in a given context^1,2,3,4,5,6.

The last decades of research on visual statistical learning have revealed that we can extract a large amount of information from statistical regularities, often implicitly and unconsciously³. Such effects are evident both in behavioral tasks and neural activity patterns^{7,8,9,10,11,12,13,14,15} and can be observed when regularities occur in various stimulus features such as orientations^16,17, auditory inputs^18,19 and more complex stimuli such as written and spoken language^20,21,22,23.

A common method for studying the effects of statistical learning on visual attention is the probability cueing paradigm²⁴, where participants typically search for a target among distractors (usually a T among Ls) without prior knowledge about its location probability. Unbeknownst to the participants, the location probability of the target is biased so that it appears more frequently in a specific high-probability “rich” quadrant of the screen (e.g., the top-right part of the screen). Both behavioral and eye-tracking data show that participants quickly develop a search habit biased toward the rich quadrant^{25,26,27,28,29,30}, to the point that the bias persists even when the probabilities are equalized across all quadrants³¹ (even though the extent of this inflexibility is debated³²).

Although these biases tend to persist and are resilient to updating²⁶, there is evidence that humans can flexibly alternate their search strategies between multiple probabilistic representations, likely stored in parallel^33,34. Zhang and Carlisle³⁴ found that observers could learn the location probabilities of multiple objects and when they had to find a particular one, they found it faster if it appeared in the expected location. This flexibility is highly useful in daily life, allowing us to search for different objects in different places based on where they are most likely to be found²⁷. It can be hypothesized that statistical learning relies on multiple attentional priority maps — i.e., neural representations that rank different locations or features in the visual field according to their relevance^35,36 — that are formed through the combination of templates held in both long-term and short-term memory³⁷ which can be differentially activated depending on the learned association between the features of a target object and its context.

What types of associations support the multiple and parallel statistical learning of attentional priority maps? In classic probability cueing studies, statistical learning is assessed and manipulated by introducing regularities or biases in the location of target stimuli. Learning is therefore primarily dictated by first-order spatial associations — direct relationships between a specific location and the probability of finding a target there. However, learning can also be shaped by more complex associations, such as second-order relationships, where contextual features predict different target locations at different times. For example, varying luminance levels in a search display can be associated with different target location probabilities, leading to statistical learning effects where different priority maps are activated based on the luminance in the current trial³³. In this study by Hong et al., two ensembles were shown simultaneously: a black and a white stimulus set. Participants were tasked with finding a target in one or the other, but unbeknownst to them, different spatial probability biases were associated with the two ensembles (e.g., the rich quadrant would be the top-right one for the black ensemble but the bottom-left one for the white ensemble). Nonetheless, observers responded faster if the target appeared in a rich quadrant associated with the target ensemble, but only if the luminance was relevant to the task at hand. However, since Hong et al.³³ only examined the predictive properties of uniform black or white ensembles, this leaves unanswered whether other visual information can be used to predict target location.

The distribution of colors often characterizes and distinguishes environments where a target object might be located. For example, when searching for ripe fruit in a garden, ripe fruits are more commonly found among green foliage than among other colors, such as yellow or brown. Similar to other feature distributions^{38,39,40,41,42}, humans can rapidly and efficiently learn statistical properties from color distributions, including the mean, variance, and distribution shape^43,44,45,46. Thus, learning these statistical properties may facilitate second-order associations, where color distributions act as predictive cues for the likely target locations.

Investigations of the use of color to influence and guide search have notably been conducted in the domain of contextual cueing^47,48. Surprisingly, while the use of color as a marker to distinguish task-relevant and task-irrelevant context is well documented^{49,50,51,52,53,54}, few studies have investigated how color itself can be used as a cue for predicting a target’s location. Kunar et al.^55,56 found that repetition of color background speeded target processing but provided little or no visual search guidance, especially when other cues such as spatial layout were provided. Similarly, contextual cueing with real-world images rich in semantic information was unaffected by color changes⁵⁷ and colored cues led to a contextual cueing effect only when devoid of semantic information⁵⁸. When considering the stimuli themselves — not their background or any additional cues — only one study, to our knowledge, has investigated how color regularities influence visual search. Huang⁵⁹ found that repeated color arrangement (a combination of spatial and featural properties) can cue target position, but did not address whether color can, on its own, cue target location. Thus, to date, whether color features can serve as a cue for statistical regularities remains unclear.

The current study

We examined whether statistical properties of color distributions can be used to learn and flexibly exploit the most likely locations of a visual target. To this aim we employed a modified version of the classic probability cuing paradigm where the location probability of the target — a uniquely colored diamond shape — depended on distractor color distributions. In three experiments, we investigated second-order statistical learning effects, testing the predictive properties of average color (Experiment 1) and variance (Experiments 2–3). We focused on these two distribution parameters as they have been shown to be automatically extracted and have an influence on visual search^43,45,46.

Experiment 1 – Can the mean of a color distribution serve as a cue for likely target locations?

Method

Participants

Thirty-nine participants (53.85% ♀, 26.46 ± 5.49 years old) recruited at the University of Iceland took part in the experiment, completing a single experimental session of about 45 min, and were rewarded with 1,000 ISK. Participants signed an informed consent briefly describing the experiment and informing them that their data would be processed anonymously and that they were free to end their participation at any moment. After completing the experiment, they received their reward and were debriefed about the goals of the experiment. The experiments were approved by the National Bioethics committee in Iceland (Vísindasiðanefnd, http://vsn.is) and performed in accordance with the Helsinki Declaration.

Material and procedure

Participants were seated in a dark room, positioned approximately 60 cm from a 27-inch monitor. All participants underwent color blindness screening using the Ishihara test⁶⁰. No participants were excluded based on this test.

The experimental task was programmed in Python 3.6 ⁶¹ and executed with Spyder⁶². Search displays consisted of 36 colored diamonds, each with a randomly selected corner cut off (see Fig. 1). These diamonds were arranged on a 6 × 6 invisible grid with fixed positions, and a small positional jitter of 0.5 degrees of visual angle was applied to the x- and y-coordinates. The overall search display spanned a visual angle range from − 10 to + 10 degrees on both the X and Y axes. The diamond colors were drawn from a color space consisting of 48 isoluminant hues, designed so that each color difference represented approximately 1 group-averaged Just Noticeable Difference (JND)^43,63.

The main manipulation was that the distractors could be drawn from two color distributions, each with a different target location probability. These distributions were truncated normal distributions with a standard deviation of 3 JND and cut off at 1.5 SD⁴⁵. The average color values of the two distractor distributions were selected from opposite points on the color wheel, corresponding to a 24 JND difference between them (see Fig. 2).

For the first participant, the initial mean values for the two distributions were set at 0 and 24. With each subsequent participant, the means were increased by 8 JND, so that after 6 participants, the distributions had completed a full rotation on the color wheel. The target color was equiprobably located 4 to 5 JNDs away from the distractor color that was the furthest from the mean of the distribution, either to the left or to the right of the color wheel.

Unbeknownst to participants, the target had an 80% chance of appearing on the left side of the screen for one distribution (Distribution A) and 80% chance of appearing on the right side for the other distribution (Distribution B). We labelled the screen side with the higher probability (80%) as the rich side, and the side with the lower probability (20%) as the scarce side.

Participants performed an odd-one-out search task, in which they were required to find the oddly colored target and indicate the location of its missing corner using the corresponding directional arrow keys as quickly as possible. If participants made a mistake, the word “ERROR” appeared briefly in red in the center of the screen.

Trials were separated by a 1000 ms intertrial interval, during which a fixation cross was presented at the center of the screen. Participants were instructed to focus on this fixation cross to ensure consistent eye position at the start of each trial.

The experiment consisted of 5 blocks of 200 trials, with short breaks between blocks. Each block contained an equal number of trials for each distribution, randomly intermixed, resulting in 500 trials for each distribution. Participants also completed a training task involving 20 trials with color distributions having the same properties as in the main task but with random color and no location bias. Participants were invited to perform the training again or immediately start the experiment.

Analysis methods

Statistical analyses were conducted using R 4.2.2 ⁶⁴. Given that RTs generally decrease nonlinearly over time^65,66, we modelled RT changes using a nonlinear approach. Specifically, we examined how RTs changes with time-on-task and predictions of the color distribution about target position. To capture a potential nonlinear trend, we employed an exponential decay model⁶⁷ described by the following formula:

$$\:Y=\:{Y}_{0}{\:+\:(Y}_{f\:}{-\:Y}_{0})\:\times\:\:{e}^{-\alpha\:X}$$

(1)

RTs $(\text{Y})$ are predicted based on three key parameters: (1) Y₀: the upper asymptote, representing RTs at the initial trial; (2) Y_f: the lower asymptote, representing RTs at infinite trial durations; (3) α: the rate of decay, reflecting the speed at which RTs approach the lower asymptote, with X as the time-on-task variable, here the trial number.

We used the nlme package⁶⁸ to fit this model, incorporating both fixed and random effects. Specifically, we tested fixed effects of predictability of target position and accounted for random effects of individual participants on the Y₀ and Y_f parameters. The hierarchical framework was implemented in two stages. We first created a baseline model where we tested the parameters against 0, including random effects to account for between-participant variability. Next, we updated the baseline model with a nested model by introducing a fixed effect representing whether the target appeared on the rich (80% likelihood) or the scarce side of the array (20% likelihood). To optimize the model parameters, we used the getInitial function, which automatically selects initial parameter values based on the formula. This provided starting points for the optimization algorithm to estimate the best-fitting parameters. The initial values for the fixed effects in the nested model were based on estimates from the baseline model.

In addition to the nonlinear modelling, we also applied a mixed linear model to analyze the data, using the lme4 package⁶⁹ to better highlight potential interactions between predictors, as interaction terms are more straightforward to interpret in the general linear model framework. Additionally, this approach allowed us to treat time-on-task as a categorical factor, to compare effects of target position during the first and second halves of the experiment, using both frequentist and Bayesian indicators.

Our mixed linear model had the following form:

$$\:Y={\beta\:}_{0}+{\beta\:}_{1}\text{}{X}_{1}\text{}\text{}+{\beta\:}_{2}\text{}{X}_{2}+{\beta\:}_{3}\text{}3({X}_{1}\text{}\text{}\times\:{X}_{2})+{\beta\:}_{4}\text{}\text{}C+P+\epsilon$$

(2)

This formula predicts RTs (Y) with time-on-task (X₁; a categorical factor comparing the first and second halves of the experiment), predictive property of the target’s position (X₂; a categorical factor distinguishing the rich and scarce sides), their interaction (X₁ × X₂), and the distribution (C; a controlled categorical variable representing the two color distributions, A or B) and participants (P) treated as a random effect. The target was more likely to appear on the left for Distribution A while for Distribution B it was more likely on the right. Including the distribution variable helped control for any side-specific RT effects.

The assumptions of linear regression were evaluated using the performance package⁷⁰ assessing linearity, homogeneity of variance, influential observations, normality of residuals and normality of random effects. A comparison of model predictions with the observed data indicated that log-transformed RTs provided a better fit than raw RTs, so we applied this transformation in our linear models (see Lo & Andrews, 2015 ⁷¹ for a discussion on the relevance of this transformation on RTs). Data was not transformed for the non-linear model.

Trials with incorrect responses or RTs longer than 3 s were excluded from analyses. To assess the strength of evidence for or against the null hypothesis, we computed Bayes Factors (BF) in addition to frequentist statistical indicators.

Results

Overall mean accuracy was 96.29% (± 2.6) and average RT was 1098.32 ms (± 364.23). A non-linear regression analysis assessed how reaction times were impacted by time-on-task and target location. First, we fitted a base exponential decay model predicting RTs with time-on-task (Eq. 1). The results indicated that all model parameters differed significantly from zero: the upper asymptote was Y₀ = 1230.68 ms (SE = 59.3 ; t(36457) = 20.75, p < .001), the lower asymptote was Y_f = 909.93 ms (SE = 26.48 ; t(36457) = 34.36, p < .001), and the rate of decay was α = -5.69 (SE = 0.06 ; t(36457) = -101.01, p < .001).

These results confirm that RTs followed an exponential decay pattern over time. Next, we fitted a nested model by including the predictive property of distribution (rich vs. scarce locations) as a fixed effect. Again, all model parameters were significantly different from zero. The upper asymptote did not differ between rich and scarce locations (Y_{0 diff} = -5.19 ms, SE = 16.37; t(36454) = -0.32, p = .75), but the lower asymptote did (Y_{f diff} = -31.71 ms, SE = 8.83 ; t(36454) = -3.59, p < .001). However, the rate of change did not differ between the two (α_diff = -0.08, SE = 0.07; t(36454) = -1.1, p = .27).

This suggests that RTs for rich and scarce conditions started similarly and decreased at the same rate throughout the task, which is a time-course similar to the one described in previous studies²⁶ despite differences in experimental and statistical designs. However, by the end of the experiment, RTs stabilized at different levels, with participants responding faster when targets appeared in the rich location (Y_{f Rich} = 903.24, SE = 26.57) than the scarce location (Y_{f Scarce} = 934.95, SE = 27.29). In other words, participants’ performance improved over time, with a larger advantage for rich over scarce target locations as the experiment progressed (see Fig. 2).

To further assess the interaction between time-on-task and target location, we recoded time-on-task to a categorical factor (first vs. second half of the experiment) and fitted a linear mixed model (Eq. 2). This model explained a substantial portion of the variance (conditional R² = 0.27), with the fixed effects alone accounting for R² = 0.02.

The effect of time-on-task was significant and negative (β = -0.11, 95% CI [-0.12, -0.10], t(36455) = -30.03, p < .001), indicating that RTs decreased over time. However, the predictive property of the target location was not significant (β = 0.08^− 1, 95% CI [-0.03^− 1, 0.02], t(36455) = -1.38, p = .17). Crucially, the interaction between time-on-task and the predictive property was statistically significant and positive (β = 0.02, 95% CI [-0.03^− 1, 0.04], t(36455) = 2.39, p < .05).

Marginal contrast analyses revealed no significant difference in average reaction times between rich and scarce locations during the first half of the experiment (x̄_diff = -0.08⁻¹, 95% CI [-0.02, 0.07⁻¹], p = .17) while a significant difference was observed for the second half (x̄_diff = -0.03, 95% CI [-0.04, -0.01], p < .001), with faster RTs for the rich (x̄ = 925 ± 385 ms) than for the scarce location (x̄ = 955 ± 400 ms, see Fig. 2).

To complement this contrast analysis, we calculated Bayes Factors (BF₁₀) for the difference between rich and scarce locations in both halves of the experiment. There was very strong evidence in favor of the null hypothesis for the first half (BF₁₀ = 0.03), while strong evidence against the null was found in the second half (BF₁₀ = 14.17), confirming that participants responded faster to targets in rich locations as the experiment progressed.

Experiment 2a – can differences in color variance cue target location?

Experiment 1 demonstrated that participants learned to associate target location probabilities with the color distribution of distractors during visual search. In the first experiment, only the mean of the color distribution varied, while all other parameters remained constant. Previous research has shown that humans can extract not only the mean from a distribution but also its variance and even its shape^41,45,46. Subjectively, color distributions with high variance appear more heterogenous than those with low variance.

In Experiment 2a and 2b we investigated whether changes in the homogeneity of color distributions (reflected in their variance) would allow participants to distinguish between two distributions and assign different priority maps based on target location probability. Specifically, we tested whether different variances in the distractor color distributions could lead to statistical learning of location probabilities while the mean of the distractor color distribution remained constant.

Experiments 2a and 2b were identical except for the increased variance difference between the two distributions in Experiment 2b, to assess whether larger variance differences would lead to stronger learning of location probabilities from distributional variance.