Topological modelling of urban air pollution and cognition

Engleitner, Holger; Suárez Pinilla, Marta; Rossor, Martin; Nachev, Parashkev

doi:10.1038/s44482-025-00009-z

Download PDF

Article
Open access
Published: 08 April 2026

Topological modelling of urban air pollution and cognition

Holger Engleitner¹,
Marta Suárez Pinilla²,
Martin Rossor¹ &
…
Parashkev Nachev¹

npj Digital Public Health volume 1, Article number: 7 (2026) Cite this article

224 Accesses
Metrics details

Subjects

Epidemiology

Abstract

The impact of air pollution on cognitive function has attracted repeated scrutiny, and current results point to a potentially detrimental role. The purpose of this study was to examine this association in geographic space. Cross-sectional, complete observational data from UK Biobank were extracted for the four metropolitan regions of Birmingham, Leeds, Liverpool and Manchester in England, UK, including three pollution indicators and two measures of cognitive performance. A set of additional covariates served to adjust for potential confounders. Spatial analyses for each region and combination of pollution indicator and cognition measure were conducted using mass-univariate linear regression in GeoSPM. Conventional non-spatial Bayesian regression models were used for comparison. A significant interaction between air pollution and cognitive performance was identified in 51 areas based on a two-tailed t-test at p < 0.05 FEW (voxel-level family-wise correction). In 29 of those areas, increased pollution and reduced cognition co-occur, and a pattern of central locations and primary roads emerges, suggesting a potentially harmful effect predicated by geography.

Effect of air pollutants particulate matter (PM_2.5, PM₁₀), sulfur dioxide (SO₂) and ozone (O₃) on cognitive health

Article Open access 23 August 2024

Air pollution and cognitive function: the potential protective effect of physical activity

Article Open access 23 November 2025

Reframing air pollution as a cognitive and socioeconomic risk

Article Open access 09 March 2026

Introduction

There is growing recognition of a potential adverse link between exposure to air pollution and human cognition, supported by a steadily accumulating number of studies^1,2,3. Amongst the potential effects of air pollution are structural changes in the neonatal brain⁴, impaired cognitive performance in children as well as adults^5,6 and the onset of dementia⁷, to name a few recent examples. The connection between air pollution, cognitive decline and dementia in older people has also recently been independently reviewed and acknowledged by an advisory group to the UK government, the Committee on the Medical Effects of Air Pollutants⁸.

Cognitive impairment in general and the specific question of how it can be traced to air pollution refers back to a larger debate about cognitive health in society, in which we have advocated for the notion of a cognitive footprint⁹: As cognitive capital over the human lifespan can be affected by actions and factors (or their absence) in different domains and situations, their collective and cumulative impact on cognitive function should be recognised and accounted for.

Here, we intend to examine the cognitive footprint of air pollution spatially: not only is spatial variability a key aspect of environmental exposure that requires appropriate treatment from a methodological point of view, but even more so because of what spatial analysis affords: Insights into patterns, associations and disparities that would be hard to detect and describe otherwise. Beyond its immediate utility for research, where one might be interested in establishing if a spatial pattern is random or represents a substantial deviation from a null hypothesis, spatial analysis is relevant in public health because it can guide policy formation and supply decision makers with actionable evidence. This might be in the form of identifying areas of elevated risk, by examining the spatial structure of environmental hazards or aiding in how resources are allocated based on establishing a spatial distribution of needs. As an added benefit, its visual outputs can help to communicate findings to a wider audience more effectively.

Surprisingly, spatial aspects appear to play a role in only a fraction of studies that examine air pollution and cognition. Based on a series of PubMed queries we conducted on titles and abstracts as of 15 September 2025, out of 2581 articles that matched at least one relevant term for air pollution and cognition, less than 10% also mentioned a term referring to a spatial aspect, for example residential location or terms suggestive of a spatial analysis (a more detailed description is provided in supplementary materials, Tables S.1–S.8).

We identified UK Biobank¹⁰, internationally the largest multi-modal repository of rich data in public health and medicine, as a suitable resource for this study, since it provides home locations as well as cognitive assessments for its large cohort, and spatially covers several larger cities in the UK. Interestingly, as with the earlier PubMed search, a dearth of spatially oriented analyses seems to persist within UK Biobank, as far as air pollution and cognition are concerned: A PubMed search of “air pollution” and “UK Biobank” in the title or abstract yields 320 articles as of 15 September 2025. 29 studies were concerned with cognition (Table S.9), mainly in terms of dementia, or brain-morphological effects, but none of them had a discernible spatial component—such as spatial covariates, spatial autocorrelation, spatial errors or spatially varying or weighted models—or spatially-structured outputs. Conversely, 11 studies (Table S.10) outside cognition incorporated some spatial aspect as part of their analysis: areal differentiation, heat maps, spatial interpolation or point plots, or geographic maps. As can be seen from Table S.11, UK Biobank is by far the largest cohort among potential cohorts in the UK that hold information on cognition and residential location.

Reaction time is a commonly used measure of cognitive performance, especially in the context of aging and age-related effects, and assumed to be representative of the speed and efficiency of underlying cognitive processes^11,12. It is known to be susceptible to short term fluctuations: observable changes in this intraindividual variability have garnered interest as a potential indicator of age and cognitive health¹³. For the purposes of this study, with a firmly middle-aged cohort, reaction time seems a suitable measure of cognitive function. In UK Biobank, cognitive performance is assessed by several touchscreen-based tests during the initial assessment centre visit, and we chose mean reaction time to identify a match in a simple visual card game as one of our measures. Because this is the only reaction time-based variable in UK Biobank, we also included completion time for matching memorised pairs in a grid of cards as another speed-dependent variable.

Our aim for this study is to explore the relationship between air pollution and cognition spatially, as this seems to be an aspect that has not been well explored, neither in the context of UK Biobank nor beyond. The objective is to determine whether a significant effect of air pollution on cognition can be ascertained in geographic space while controlling for relevant variables, and to describe its spatial structure in an urban environment, which is where the majority of the world’s population now live.

We conducted our analysis using GeoSPM¹⁴, a software tool we previously developed for efficient spatial inference on point data. GeoSPM is a genuine spatial method: It relies on the asymptotic topological properties of the geometry of excursion sets of (Gaussian) random fields to strictly control type 1 errors when thresholding the statistic parametric maps of regression coefficients. It trades off the greater expressivity afforded by more powerful non-linear multivariate methods¹⁵ for greater robustness to noise, conceptual simplicity, modest computational complexity, and flexibility to examine interaction effects, which is the means by which we will determine effect sizes.

To the best of our knowledge, as of 15 September 2025 no previously published study in UK Biobank has explored the relationship between air pollution and cognition using spatial methods or determined the significance of a corresponding spatial effect based on interactions.

Results

Main analysis

Computation of the Bayesian multiple regression models with a ridge prior produced the posterior estimates summarised in Tables 1–2, for the effect of pollution on reaction time, and for the effect of pollution on completion time, respectively. In the latter setting, in 11 out of 12 models, pollution shows a significant effect, but the rank within each model is low. In the case of reaction time, the number of models with a significant pollution effect decreases to six out of 12, with equally low model ranks. Unsurprisingly in view of their simplicity, these models do not represent a good fit, leaving most of the variation unexplained. However, they do show an association between cognition and pollution that is quantitively not negligible.

Table 1 Posterior estimates of the effect of pollution on reaction time for Bayesian regression models

Full size table

Table 2 Posterior estimates of the effect of pollution on completion time for Bayesian regression models

Full size table

For GeoSPM, there were also 24 models in total, one model for each combination of the cognition and pollution variables across the four cities. The corresponding regression coefficient maps are reproduced in Figs. 1–4 for each city. Significant regions were determined by a two-tailed t-test at p < 0.05 FWE (voxel-level family-wise correction) and omitted when they included less than 10 participants in the underlying cohort (of which there were only a few). The square areas span 26 by 26 km at the original 1 km² resolution of the data and reflect the uneven distribution of samples in UK Biobank.

**Fig. 1: Geographic regression maps showing interaction and individual effects for cognition and air pollution in Birmingham with smoothing applied (95% of kernel density within a 5 km diameter).**

**Fig. 2: Geographic regression maps showing interaction and individual effects for cognition and air pollution in Leeds with smoothing applied (95% of kernel density within a 5 km diameter).**

**Fig. 3: Geographic regression maps showing interaction and individual effects for cognition and air pollution in Liverpool with smoothing applied (95% of kernel density within a 5 km diameter).**

**Fig. 4: Geographic regression maps showing interaction and individual effects for cognition and air pollution in Manchester with smoothing applied (95% of kernel density within a 5 km diameter).**

These maps present a spatially de-confounded view of the interaction between cognitive performance and air pollution as far as the other covariates are concerned. We obtain regions of significant interactions for every single combination and city. Out of 51 regions in total, the interaction is positive and the signs of the individual effects afford a consistent interpretation in 37: In 29 of these areas (Birmingham: 1, 4, 6, 7, 9, 10, 12; Leeds: 2, 3, 5, 6, 7, 8; Liverpool: 2, 3, 4, 6, 7, 8; Manchester: 3, 4, 10, 12, 13, 16, 17, 18, 21, 22), we find above average pollution and reduced cognition, while in 8 areas (Birmingham: 2, 3, 5; Leeds: 1, Liverpool: 5, Manchester: 8, 19, 20), below average pollution co-occurs with increased cognition. Another 6 of the 51 regions show negative interactions, where the direction of individual effects is in opposition and the interpretation of the modifying effect of pollution is counter-intuitive, so that above average pollution is spatially associated with increased cognition and vice-versa (Birmingham: 8, 11; Leeds: 4; Liverpool: 1; Manchester: 2, 7). The remaining group of eight regions does not show consistent signs across the different effects, so their interpretation in aggregate is inconclusive: 7 of those are located in Manchester (1, 5, 6, 9, 11, 14, 15) and one in Birmingham (13).

The strongest pattern emerges for Leeds, Liverpool—and to a slightly lesser degree—Birmingham, where a positive interaction between reduced cognition and increased pollution persists in mainly central areas and for all three pollutants. The pattern applies to both reaction and completion time: in Leeds, the extent of the effect is weaker for reaction time, while in Birmingham the distinction is even more pronounced, in extent as well as localisation. For the latter city, we can also distinguish more peripheral areas showing an inversion of the pattern: increased cognition appears alongside reduced pollution.

Manchester reveals a more heterogeneous structure: an increase in variation between NO₂, NO_x and PM_2.5, a tendency towards the periphery, two regions—both for reaction time—where the interaction is negative and the interpretation counter-intuitive, and seven regions without consistent signs between interaction and individual effects. Notable among these areas without a conclusive interpretation is a set of regions with a large share of samples (reaction time: 609 for NO₂, area 1; 599 for PM_2.5, area 9; completion time: 889 for NO₂, area 11; 385 for NO_x, area 15) that roughly coincide with the suburb of Chorlton-cum-Hardy in the southwest. There, the interaction is negative, both pollution measure effects are positive, but the area straddles cognition effects that are either fully or partially positive. For area 9, this is clearly an artefact of two smaller significant interaction areas in direct adjacency: when looking at the individual effects for the northern and southern half in isolation, the signs are consistent.

Nonetheless, the pattern of reduced cognition and increased pollution observed in the centre of the other three cities survives somewhat diminished in Manchester for completion time and NO₂ and NO_x, and again for NO₂ and reaction time. In addition, the inverse relationship of increased cognition and less pollution manifests approximately in Brooklands, southwest of Manchester. The corresponding area contains 150 samples for PM_2.5 (area 8) and reaction time and PM_2.5 and completion time (area 20).

Tables S.15 and S.16 summarise parts (A) and (B), respectively, of Figs. 1–4, providing the number of the region in the figures, number of participants (N) and 1 km² cells (C), as well as place labels (where possible) from Open Street Map data.

Sensitivity Analyses

The robustness of these results in terms of our choice of parameters—kernel size, region size and region location—are demonstrated by the sensitivity analyses reported in supplementary materials in Figs. S3–S23.

In general, as kernel size varies, significant areas remain located around the same positions, while their spatial extent and contiguity changes, as expected: The choice of kernel size involves a trade-off between sensitivity and spatial resolution, but does not impact the validity of the results. The full range of kernel sizes is reported only for Liverpool (Figs. S3–S9), whereas two kernel variations are reported for Birmingham (Figs. S10 and S11), Leeds (Figs. S12 and S13) and Manchester (Figs. S14 and S15) in the interest of brevity.

Similarly, the results of the primary analysis reported here are stable with respect to an enlarged size of the analysis region, as can be seen in Figs. S16, S18, S20 and S22. Minor variations do occur, which is not surprising, giving that the cohorts for three out of four regions increase considerably (Leeds 44%, Liverpool 32%, Manchester 38%).

A greater impact on the stability of results can be observed when the area of analysis is shifted, which is in line with our expectation, as the composition of each cohort underlying the shifted area changes considerably: Isolated single-voxel areas might be lost, and the boundaries of larger clusters might vary. Nevertheless, positions remain stable, and areas maintain similarity to their unshifted analogues (cf. Figs. S17, S19, S21 and S23).

We observe that in the reduced cohort case, which only includes participants that remained at the same residential address at the 1 km by 1 km grid cell level, results are largely comparable to the main results reported here. The similarity is greater for Leeds and Manchester, and less pronounced for Liverpool and Birmingham, especially with reaction time. The reduction in cohort sizes is substantial, amounting to 25% per city on average.

Discussion

We examined the spatial association between cognitive performance (reaction time and completion time) and air pollution (NO₂, NO_x and PM_2.5) in Birmingham, Leeds, Liverpool and Manchester by estimating the corresponding interaction and individual effects in a spatially de-confounding GeoSPM regression model, using non-spatial Bayesian regression models for comparison. Our study found positive interactions between increased pollution and reduced cognition in central areas of all four cities and for all three pollutants. For a small number of mostly peripheral instances this relationship is expressed in its inverted form, indicating a benefit of cleaner air for increased cognition. Some areas where the interaction cannot be easily interpreted were also identified as statistically significant: a negative interaction is counter-intuitive, because it suggests that pollution has a beneficial effect on cognition or that the lack of it is cognitively detrimental, contrary to the biological evidence^16,17. This suggests hidden confounders not captured in the model. An inconclusive interaction on the other hand has more local variability in terms of its individual effects and their signs, so that the region, while still statistically significant, has no unique interpretation in aggregate: The interaction can still be interpreted for each grid cell separately. Such an interaction can sometimes indicate a structure of subregions with consistent interpretations that happen to be in proximity. In both cases, further investigation is merited, but outside the scope of this analysis. The detection of positive interactions between increased pollution and reduced cognition is consistent with previous findings reported in the literature (see the reviews by Thompson et al.¹ and Delgado-Saborit et al.³).

A recent study¹⁸ of the French CONSTANCES cohort used a non-spatial regression analysis between cognitive performance and air pollutants of a sub-sample of 61462 participants 45 years and older: Their results showed significant associations of worse cognitive function and increased pollution, mostly for black carbon and NO₂, whereas exposure to PM_2.5 was mainly associated with decreased performance in a semantic fluency test in suburban areas. The comparison to this work seems particularly relevant. CONSTANCES and UK Biobank share a similar approach in their design. They are based on a random sample derived from health care records, with a stratified sampling scheme to be representative of the local population, and enrolment through several regional assessment centres. In both studies, cohorts are middle-aged and large, exposure was assessed for a 2010 baseline based on similar LUR modelling and used a participant’s residential address at the time of enrolment, with an overlapping set of air pollutants (NO₂ and PM_2.5). The varying results of stratified analyses based on four categories of residential area in the CONSTANCES study hint at the importance of the spatial context in this type of analysis. Key differences to this study concern the imputation vs exclusion of missing data and the age cut-off of people over 65, based on the assumption that with age the likelihood of pathological decrease in cognitive function increases.

Another recent study by Wood et al.¹⁹ examined air pollution and cognition in national and London-only cohorts (n = 8883 and n = 768) derived from the English Longitudinal Study of Ageing (ELSA) using linear mixed-effects models. They found significant decreases in test scores of composite memory and executive function when increasing long-term exposure for NO₂, PM_2.5, and PM₁₀ based on repeated cognitive measurements. Exposure assessment employed annual average concentrations from the 2012 Community Multiscale Air Quality Urban dispersion model and the assignment to participants’ postcodes was based on deciles due to privacy concerns. Address changes were accounted for by assigning pollution estimates based on the year of the (follow-up) interview and the postcode available for that year.

A Bayesian spatial survival analysis of a smaller cohort (n = 1572) is reported by Sullivan et al.²⁰ for the Monongahela valley of south-western Pennsylvania in the United States, a region with an industrial history of steel production and high air pollution. Sullivan et al.²⁰ investigated the relationship between PM_2.5 exposure in later life and the incidence of mild cognitive impairment as well as dementia among a community-based sample of adult, mainly life-long, residents of this region. They found significant higher adjusted risk of incident mild cognitive impairment and dementia for increases in PM_2.5 assessed at the census-tract level. The interest here lies in the fact that this relationship was detected by a spatial analysis based on a much smaller sample, and the overlap with our results for PM_2.5.

For a Chinese cohort (n = 15163), Gao et al.²¹ analyse the effect of PM_2.5, NO₂ and O₃ on cognitive function, including a spatial modelling approach using a spatial error model, which treats spatial correlations as a nuisance factor (they determined a low level presence of spatial correlation in the data). They apply a more basic exposure assessment solely based on kriging of 1389 air monitoring sites across China for annual pollution concentrations. Their results show PM_2.5 as the dominant air pollutant affecting cognition over 1-, 2-, 3- and 4-year exposure periods, while NO₂ only produced a significant effect at the 4-year period, which was more significant than the corresponding 4-year PM_2.5 effect.

Interestingly, a non-spatial study (Cullen et al.²²) based on the same UK Biobank dataset underlying this work, only found small negative relationships between pollutant exposure (PM_2.5, PM₁₀, PM_2.5–10, NO₂ and NO_x) and cognitive test performance in unadjusted regression models. These associations shrank further and became inconsistent in terms of direction, when adjusted for confounders. Out of 25 regression models, only 5 had a p-value less than 0.05 (false discovery rate), and 2 of those showed a counter-intuitive direction of the pollution effect on a reasoning score. The other 3 significant results showed significant effects for PM_2.5 and NO_x on reaction time and for NO₂ on pairs matching completion time, which were found to be consistently significant in this work, as described above. Cullen et al.²² do not consider changes in address for their main analysis but found slightly stronger associations in their sensitivity analysis using alternative exposure measurements that included the history of residential addresses for each participant. The number of observations per analysis varied but was over 70000 for reaction time and pairs matching completion time for PM_2.5, NO₂ and NO_x. These results contrast with our findings here and could be an indication that the spatial nature of exposure (and potentially other covariates) cannot be ignored.

Local geography offers additional context for the interpretation of individual regions for which a co-occurrence of reduced cognition and increased pollution was found, even if the coarse spatial resolution provided by UK Biobank renders matching effects to local features difficult. However, we can constrain ourselves to an examination of the primary road structure, as a major source of the pollutants discussed here is road traffic.

In Birmingham, we see that significant grid cells tend to concentrate in the city centre and in proximity to primary roads (Fig. 5). Most of the A4540 Middleway ring road appears in regions 7, 9 and 12 for completion time, as well as the inner A38 ring road and its Queensway section that flank the historic centre. The southern part of the A4540 near Attwood Green also appears in regions 1, 4 and 6. These roads have been previously identified by the UK government as exceeding emission limits in its 2017 air quality plan for nitrogen dioxide in UK (p. 13)²³, with modelled concentrations ranging between 41–60 µgm/m³ for most sections, and a part of Queensway in exceedance of 60 µgm/m³. Measured pollution levels matched these exceedances, as published in a status report by Birmingham city council in 2019²⁴. The considerable majority of stations along those main roads reported measurements in the range between 40–79 µgm/m³. This is acknowledged in the report’s conclusion, stating that “The City continues to have air quality breaches against the annual mean objective for NO₂ with known exceedance areas being within the city centre. The primary source of air quality issues within Birmingham is road transport.” A similar statement can be found in the city’s Air Quality Action Plan 2021–2026²⁵. Birmingham implemented a clean air zone inside the Middleway ring road, excluding the road itself in June 2021²⁶. A recent assessment of the change concluded significant but modest reductions in NO₂ and NO_x concentrations, but none for PM_2.5²⁷. In view of the exceedances reported by Birmingham council, it is important to remember that the World Health Organisation updated their air quality guidelines in 2021²⁸ with stricter limits than the air quality objectives currently mandated in the UK: The new recommendations are 10 µg/m³ for NO₂ (currently 40 µg/m³) and 5 µg/m³ for PM_2.5 (currently 20 µg/m³, 12 µg/m³ from 2028).

**Fig. 5: Detail of geographic context for Birmingham for a subset of the significant areas shown in Fig. 1.**

Additional discussion of the geographic context for the other cities is provided in Supplementary Materials (Figs. S28–S30) and shows comparable correspondences.

The results of this study contribute to the growing signs that indicate a negative effect of air pollution on cognition: at the policy level, these findings underline the continued, pressing need to reduce emissions of air pollutants because of their cognitive footprint. A diminishment of cognitive capital because of factors that can be controlled through policy is not only devastating from an individual perspective but should be avoided for its negative repercussions from a societal and economic perspective: Increased cognitive impairment, especially in old age, puts additional strain on health care systems and families. Separately, in the context of a global transition to knowledge-based economies, societies through their policies make a choice about how much they value the collective cognitive capital of their populations. Independently, the contrast between central and peripheral areas and the change in interpretation of the effect hints at an inequality of the burden of air pollution in terms of its geographical distribution that must be of concern to policy makers. At a technical level, the presence of spatial structure in the relationship between air pollution and cognition implies the need to consider spatially informed approaches to analysis, as non-spatial approaches might underestimate or even fail to detect associations mediated in space. In addition, a finer grained mesh of measurement sites – not only along major traffic routes but also in residential areas and other areas of interest – could give future studies an immense boost in spatial precision.

The strength of this study lies in applying a spatial methodology to a large sample size, which allows to analyse multiple cities in a shared geographic context and enables observations of a common spatial pattern—including centrality and correlations with the primary road network—addressing an underrepresented aspect in the existing literature.

Some limitations must be considered. The group most affected by air pollution, babies and small children, is regrettably not represented in UK Biobank at all, so the conclusions here are not transferrable to that age group. Evidence of selection bias in UK Biobank has been reported in the literature²⁹: participants appear to be healthier and live in less socio-economically deprived areas than non-participants. This suggests that our results are more likely to underestimate potentially harmful interactions. Nevertheless, we expect our results to be generalisable to other comparable datasets given the size of the sample, the stratification of UK Biobank’s recruitment, and the direction of selection bias. Our analysis could also be improved with a more detailed approach to accounting for exposure instead of relying on a fixed baseline of pollution at a single address. Lastly, although sex is a variable in this analysis, no conclusions can be drawn about pollution-cognition interaction differences between sexes. This would require a comparison between the interaction-related effect maps produced by separate analyses for each sex and could be an important component of future studies, as other work in this area seems to suggest^30,31,32.

We regard this study as a first step towards determining the cognitive footprint of air pollution, and we hope it can provide additional impetus to research its effect on cognition in more detail, using finer grained and more complex means of analysis.

Methods

Study design and participants

The cohort used for this study is from UK Biobank, which holds a wide range of health-related information from over half a million participants recruited across the United Kingdom between 2006 and 2010. Potential participants were invited based on their age (40–69 years) and reasonable travel distance from one of the 22 assessment centres through the National Health Service (NHS)³³. The baseline visit at an assessment centre required participants to give their written consent, complete a touch-screen questionnaire, take part in a brief interview, have their physical measurements taken and provide samples of their blood, urine and saliva. UK Biobank holds the sex of participants as recorded by the NHS but allows corrections upon request.

Crucially, UK Biobank provides residential location information for each participant, derived from the address the recruitment invitation was sent to, and geocoded to a reference grid with a cell size of 1 km². A higher resolution level of 100 m² is available, but access is only granted on a case-by-case basis and when deemed essential for the purposes of a study. For this reason, the present analysis was performed at the lower resolution level.

Participants are primarily clustered around major cities in Great Britain, and we choose four cities based on their population size, comparable geographic extent and large UK Biobank participation rate: Birmingham, Leeds, Liverpool and Manchester. All four have familiar histories of heavy industry beginning with the Industrial Revolution, followed by industrial decline and modern regeneration based on service and information-oriented economic activity. To facilitate inter-city comparisons, we adopted a common spatial scale and settled on a square analysis region spanning 26 km by 26 km, centred at the geographic centre of each city, which allowed us to capture as many participants as possible while maintaining a coherent geographic context for each place. A map of the four study regions within the UK is shown in Figure S1.

Out of a total number of 502543 in the full UK Biobank cohort we extracted 17,829 participants (f = 9945, m = 8275) in Birmingham, 19161 (f = 10760, m = 8401) in Leeds, 15830 (f = 8837, m = 6993) in Liverpool and 19601 (f = 10,474, m = 9127) in Manchester without missing values at baseline. Participants with extreme (outside the 1 or 99th percentile) or missing values were removed from their respective region. A diagrammatic overview of the study design is shown in Fig. S2.

The full set of variables included in the analysis and their derivation is shown in Table S.12 and comprises two measures of cognitive performance (reaction time and pair matching completion time), three different indicators for air pollution (NO₂, NO_x and PM_2.5, based on estimates for 2010) as well as demographic and non-demographic factors of potential relevance. The characteristics of participants in the four city sub-cohorts are shown in Table S.13, environmental variables are summarised in Table S.14. UK Biobank stratified key demographic parameters (age, sex and postcode as a measure of social deprivation) at the invitation stage, over-sampling certain groups as deemed required³³. No sampling was applied to the data other than through geographic location and excluding missingness. In all four city-based sub-cohorts, there are slightly more women than men. Further details on how variables were collected by UK Biobank are available at the online showcase (https://biobank.ndph.ox.ac.uk/showcase/). A detailed description of the UK Biobank enrolment process is described in Sudlow et al.¹⁰.

Exposure assessment

We chose annual average concentration estimates of NO₂, NO_x and PM_2.5 as pollution indicators because of their suspected harmful influence on cognitive health³⁴. Particulate matter smaller than 2.5 µm is considered particularly harmful because it can cross the alveolar-capillary barrier³⁵.

UK Biobank linked each participant’s home address to estimates from land use regression (LUR) models developed by the ESCAPE (European Study of Cohorts for Air Pollution Effects, http://www.escapeproject.eu/) project based on monitoring in the year 2010 (https://biobank.ndph.ox.ac.uk/ukb/ukb/docs/EnviroExposEst.pdf)^36,37. These models predict pollutants using traffic intensity, topography and land use among other factors.

A known limitation of ESCAPE estimates exists for particulate matter: These estimates are considered reliable only up to a distance of 400 km from the original monitoring area in Greater London, so measures for 33,935 addresses beyond this cut-off are coded as missing in UK Biobank. However, all four cities are within the given distance and therefore not affected.

Accurate exposure assessments are difficult to achieve, so we follow the common practice of using the environmental pollution at a participant’s residential address at baseline as an approximation, which is naturally limited. Other contributing individual factors are not considered. We do not take changes of residential address into account for the main analysis but consider its impact in a sensitivity analysis. Our motivation in doing so stems from our focus on the spatial aspect of the analysis and maximising the available sample.

Statistical analysis

The variables in Table S.12 were selected from a larger set based on minimising co-linearities while retaining those that showed a degree of correlation with our cognitive variables of interest—reaction time and completion time—by inspecting a covariance matrix. We excluded fluid intelligence as a variable of interest because of its significantly lower coverage in UK Biobank compared to the other two measures (N = 221,104 vs. N = 496,765 for reaction time and N = 498,838 for completion time). All non-binary variables, discrete or continuous, were standardised (µ = 0, std = 1) prior to their evaluation in any of the analyses, except for the cognition time variables, which we transformed via ordered quantile normalisation³⁸.

As a benchmark for the spatially confounded case, we assessed the association between cognition and the other covariates, including pollution, by performing Bayesian multiple regression with a ridge prior with reaction time or completion time as the response for each of the cities. The predictors in these models were age, sex, ethnic minority, formal qualifications, subjective health score, basal metabolic rate, alcohol consumption level, anxious feelings, walking pace, frequency of social visits, greenspace percentage and one of the three pollution measures: NO₂, NO_x or PM_2.5. Our tool for evaluating these models was the MATLAB version of BayesReg (version 1.9.1), which evaluates the required Markov chains using Gibb’s sampling. The posterior computations for each chain were based on 20,000 samples, after discarding an initial 100,000 samples for burn-in and thinning a further 100,000 samples by a factor of 5.

Many observable features in epidemiology, environmental medicine, healthcare policy, and public health have a natural expression in a spatial frame of reference and could be described by a joint spatial distribution. We all live, work, exist somewhere, after all, and our biological and behavioural characteristics interact with and are shaped by the physical environment that surrounds us. The analysis of spatially varying or spatially confounded associations between these features, observed at discrete points, necessitates suitable methods that take account of spatial correlations, are suitable for surfacing spatial structure and are robust when faced with noise and sparseness. The method we are employing here, GeoSPM, offers interpretability, flexibility, robustness and does not require intricate parametric specification, instead operating on a fixed assumption of Gaussian random fields. Its relative simplicity and accessibility contrasts with the greater complexity of multivariate spatial methods and thus is particularly suitable for exploring datasets in a spatial manner.

GeoSPM (https://github.com/high-dimensional/geospm) is our previously published extension of SPM12 to the geo-spatial domain. SPM12³⁹ (https://www.fil.ion.ucl.ac.uk/spm/) is the dominant regression analysis framework for spatial inference in brain imaging that has found wide-spread use over a period of more than 30 years. GeoSPM makes point-based data amenable to the mass-univariate paradigm of SPM12. It estimates the regression of a local response on a global matrix of predictors, which yields maps of localised parameter estimates that give an indication of the marginalised distribution of the underlying covariate. A key feature of this approach is the topologically derived correction for multiple comparisons when testing regression coefficients or their contrasts. It allows to identify the spatial extent of significant parameters in a controlled manner. Additional information about the wider context of statistical analysis in neuroimaging can be found in the provided references^40,41,42.

In GeoSPM, sample concentration takes up the role of the response. The sample concentration for each observation in GeoSPM is derived by placing a Gaussian kernel of pre-determined size at its location. For the present study, we concentrated 95% of the kernel density within a radial diameter of 5 km. At each location, this arrangement of the response can be interpreted as an indicator for the closeness of each individual observation to that point in space: the further away an observation occurred from a particular location, the lower the response.

As in a conventional regression model, we use interaction terms to examine any modifying effects between the covariates we are interested in – air pollution and cognition. A significant interaction in GeoSPM suggests a form of dependence or association of covariates within the spatial setting. The sign of the interaction indicates the direction of the association: for example, a positive interaction indicates either more pollution and reduced cognition or less pollution and increased cognition in a place. To distinguish between the two, we also examine—within the same region—the signs of the corresponding individual effects in aggregate. In cases where the signs of the individual effects are not consistent with the sign of the interaction, we refrain from such an interpretation. The inconsistency can arise because there is no constraint on the sign of effects, but also because the signs within individual effects can vary, as the region we examine is taken from the interaction. In practice, we find that signs remain largely homogenous and consistent. We determine the sign of a region based on its median.

Our model structure for GeoSPM follows from the preceding observation about interactions. With the stated objective of examining cognitive performance and air pollution spatially—while controlling for plausible confounders available to us in UK Biobank—we extend the shared set of covariates used in the Bayesian regression models with two additional covariates for cognition and pollution, as well as interaction terms for the cognition covariate and each of the other covariates.

We thus obtained six models per city that only differ in the respective cognition and pollution variables (either reaction time or completion time, and one of NO₂, NO_x or PM_2.5) and the corresponding cognition interactions but are otherwise identical. GeoSPM produced estimates for the regression coefficient maps as well as corresponding maps of two-tailed t-tests for these coefficients. GeoSPM determines the significance of these maps using its voxel-level family-wise correction for multiple comparison based on the topology of a Gaussian random process, as outlined earlier, and our chosen level of significance. For each significant region, place names were retrieved from a list of labels extracted from Open Street Map data and a summary describing the size and number of participants in it was automatically generated.

Sensitivity analysis

We conducted a series of sensitivity analyses to gauge the effect of the kernel size, region size and region location on the results of the main analysis for all four cities. We varied kernel size from 1.25 km to 10 km in 1.25 km steps, to gain insight into the robustness of significant areas reported for the chosen kernel size of 5 km. In addition, for each city, we examined two alternative study regions: an expanded region spanning 36 km by 36 km that contains the primary region and adds a buffer of 5 km in each cardinal direction around it and a translated region, of the same size as the primary region. The shift consisted of a translation of 8 km in the north and east directions—roughly a third of the corresponding edges of the primary region—and represents a trade-off between maintaining enough overlap with the primary region and injecting a meaningful change in position.

Another aspect that requires consideration is the effect of changes in residency. For the main analysis, we included all participants regardless of their residential history with their baseline address. To evaluate how changes in residency affect the main results, we conducted a separate analysis for each city in which participants with changes in their residential address were excluded.

Data availability

The data analysed in this study is available on application to UK Biobank.

Code availability

The open-source software implementation of GeoSPM used in this study is available on GitHub: https://github.com/high-dimensional/geospm (https://doi.org/10.5281/zenodo.7258970).

References

Thompson, R. et al. Air pollution and human cognition: a systematic review and meta-analysis. Sci. Total Environ. 859, 160234 (2023).
Article CAS PubMed Google Scholar
Woodward, N., Finch, C. E. & Morgan, T. E. Traffic-related air pollution and brain development. AIMS Environ. Sci. 2, 353–373 (2015).
Article CAS PubMed PubMed Central Google Scholar
Delgado-Saborit, J. M. et al. A critical review of the epidemiological evidence of effects of air pollution on dementia, cognitive function and cognitive decline in adult population. Sci. Total Environ. 757, 143734 (2021).
Article CAS PubMed Google Scholar
Bos, B. et al. Prenatal exposure to air pollution is associated with structural changes in the neonatal brain. Environ. Int 174, 107921 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gui, Z. et al. Exposure to ambient air pollution and executive function among Chinese primary schoolchildren. Int J. Hyg. Environ. Health 229, 113583 (2020).
Article CAS PubMed Google Scholar
Chen, M.-C. et al. Air pollution is associated with poor cognitive function in Taiwanese adults. Int J. Environ. Res Public Health 18, 316 (2021).
Article CAS PubMed PubMed Central Google Scholar
Carey, I. M. et al. Are noise and air pollution related to the incidence of dementia? A cohort study in London, England. BMJ Open 8, e022404 (2018).
Article PubMed PubMed Central Google Scholar
Maynard, R. et al. 53 cognitive decline, dementia and air pollution: a report by the committee on the medical effects of air pollutants. Ann. Work Exposures Health 67, i80–i81 (2023).
Article Google Scholar
Rossor, M. & Knapp, M. Can we model a cognitive footprint of interventions and policies to help to meet the global challenge of dementia? Lancet 386, 1008–1010 (2015).
Article PubMed Google Scholar
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Salthouse, T. A. Aging and measures of processing speed. Biol. Psychol. 54, 35–54 (2000).
Article CAS PubMed Google Scholar
Deary, I. J. & Der, G. Reaction time, age, and cognitive ability: longitudinal findings from age 16 to 63 years in representative population samples. Aging Neuropsychol. Cogn. 12, 187–215 (2005).
Article Google Scholar
Haynes, B. I., Bauermeister, S. & Bunce, D. A systematic review of longitudinal associations between reaction time intraindividual variability and age-related cognitive decline or impairment, dementia, and mortality. J. Int. Neuropsychol. Soc. 23, 431–445 (2017).
Article PubMed Google Scholar
Engleitner, H. et al. GeoSPM: geostatistical parametric mapping for medicine. Patterns 3, 100656 (2022).
Article PubMed PubMed Central Google Scholar
Lawson A. B. Bayesian Disease Mapping: Hierarchical Modeling in Spatial Epidemiology, Third Edition, 3rd edn. New York: Chapman and Hall/CRC, https://doi.org/10.1201/9781351271769 (2018).
Arias-Pérez, R. D. et al. Inflammatory effects of particulate matter air pollution. Environ. Sci. Pollut. Res. 27, 42390–42404 (2020).
Article Google Scholar
Cho, J. et al. Long-term ambient air pollution exposures and brain imaging markers in Korean adults: the environmental pollution-induced neurological effects (EPINEF) study. Environ. Health Perspect. 128, 117006 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sakhvidi, M. J. Z. et al. Outdoor air pollution exposure and cognitive performance: findings from the enrolment phase of the constances cohort. Lancet Planet. Health 6, e219–e229 (2022).
Article Google Scholar
Wood, D. et al. Exposure to ambient air pollution and cognitive function: an analysis of the English longitudinal study of ageing cohort. Environ. Health 23, 35 (2024).
Article CAS PubMed PubMed Central Google Scholar
Sullivan, K. J. et al. Ambient fine particulate matter exposure and incident mild cognitive impairment and dementia. J. Am. Geriatrics Soc. 69, 2185–2194 (2021).
Article Google Scholar
Gao, H., Shi, J., Cheng, H., Zhang, Y. & Zhang, Y. The impact of long- and short-term exposure to different ambient air pollutants on cognitive function in China. Environ. Int. 151, 106416 (2021).
Article CAS PubMed Google Scholar
Cullen, B. et al. Cross-sectional and longitudinal analyses of outdoor air pollution exposure and cognitive function in UK Biobank. Sci. Rep. 8, 12089 (2018).
Article PubMed PubMed Central Google Scholar
Department for Environment Food & Rural Affairs. Air Quality Plan for tackling roadside nitrogen dioxide concentrations in West Midlands Urban Area (UK0002). 2017; published online July. https://uk-air.defra.gov.uk/assets/documents/no2ten/2017-zone-plans/AQplans_UK0002.pdf (accessed Jan 21, 2025).
Birmingham City Council. 2019 Air Quality Annual Status Report (ASR). 2019; published online Nov 30. https://www.birmingham.gov.uk/downloads/file/15061/air_quality_annual_status_report_2019_containing_data_for_2018.
Birmingham City Council. Birmingham City Council Air Quality Action Plan. 2021; published online Feb. https://www.birmingham.gov.uk/downloads/download/4061/birmingham_city_council_air_quality_action_plan_2021-2026 (accessed Jan 21, 2025).
Birmingham City Council. Payments for the Clean Air Zone go live. https://www.birmingham.gov.uk/news/article/888/payments_for_the_clean_air_zone_go_live (accessed Jan 21, 2025).
Liu, B. et al. Assessing the impacts of Birmingham’s clean air zone on air quality: estimates from a machine learning and synthetic control approach. Environ. Resour. Econ. 86, 203–231 (2023).
Article Google Scholar
World Health Organization (WHO). WHO global air quality guidelines: Particulate matter (PM2.5 and PM10), ozone, nitrogen dioxide, sulfur dioxide and carbon monoxide. Geneva: World Health Organization, 2021 https://www.who.int/publications/i/item/9789240034228 (WHO, 2025).
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017.
Article PubMed PubMed Central Google Scholar
Mo, S. et al. Sex disparity in cognitive aging related to later-life exposure to ambient air pollution. Sci. Total Environ. 886, 163980 (2023).
Article CAS PubMed Google Scholar
Kim, H. et al. Gender difference in the effects of outdoor air pollution on cognitive function among elderly in Korea. Front Public Health 7, 375 (2019).
Article PubMed PubMed Central Google Scholar
Rodríguez-Fernández, B. et al. Sex differential effects on the joint contribution of air pollution and biological aging on cognitive performance in individuals at risk of Alzheimer’s disease. Alzheimer's Dement. 19, e078156 (2023).
Article Google Scholar
UK Biobank. UK Biobank: Protocol for a large-scale prospective epidemiological resource: protocol No: UKBB-PROT-09-06 (Main Phase). 2007; published online March 21. https://www.ukbiobank.ac.uk/media/gnkeyh2q/study-rationale.pdf (accessed Jan 21, 2025).
Parra, K. L., Alexander, G. E., Raichlen, D. A., Klimentidis, Y. C. & Furlong, M. A. Exposure to air pollution and risk of incident dementia in the UK Biobank. Environ. Res. 209, 112895 (2022).
Article CAS PubMed PubMed Central Google Scholar
You, R., Ho, Y.-S. & Chang, R. C.-C. The pathogenic effects of particulate matter on neurodegeneration: a review. J. Biomed. Sci. 29, 15 (2022).
Article PubMed PubMed Central Google Scholar
Beelen, R. et al. Development of NO2 and NOx land use regression models for estimating air pollution exposure in 36 study areas in Europe–The ESCAPE project. Atmos. Environ. 72, 10–23 (2013).
Article CAS Google Scholar
Eeftens, M. et al. Development of land use regression models for PM2.5, PM2.5 absorbance, PM10 and PMcoarse in 20 European study areas; results of the ESCAPE project. Environ. Sci. Technol. 46, 11195–11205 (2012).
Article CAS PubMed Google Scholar
Peterson, R. A. & Cavanaugh, J. E. Ordered quantile normalization: a semiparametric transformation built for the cross-validation era. J. Appl. Stat. 47, 2312–2327 (2020).
Article PubMed Google Scholar
Friston, K. J., Worsley, K. J., Frackowiak, R. S., Mazziotta, J. C. & Evans, A. C. Assessing the significance of focal activations using their spatial extent. Hum. Brain Mapp. 1, 210–220 (1994).
Article CAS PubMed Google Scholar
Penny W. D., Friston K. J., Ashburner J. T., Kiebel S. J. Nichols T. E., editors. Statistical Parametric Mapping: The Analysis of Functional Brain Images, 1st edn. (Academic Press, 2007).
Frackowiak R. S. J. et al. editors. Human Brain Function, 2nd edn. (Academic Press, 2004).
Huettel S. A., Song A. W. & McCarthy G. Functional Magnetic Resonance Imaging, 3rd edn. (Sinauer Associates, 2014).

Download references

Acknowledgements

Holger Engleitner and Parashkev Nachev are supported by the NIHR UCLH Biomedical Research Centre. PN is funded by Wellcome. Marta Suárez Pinilla is supported by the Health Foundation. Martin Rossor is supported by the National Institute for Health Research (NIHR) Senior Investigator Award/NF-SI-0512-10033. The views expressed are those of the authors and not necessarily those of the NIHR, the Department of Health and Social Care, or the Health Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

UCL Queen Square Institute of Neurology, University College London, London, UK
Holger Engleitner, Martin Rossor & Parashkev Nachev
Clinical Neuroscience Laboratory, Centre for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
Marta Suárez Pinilla

Authors

Holger Engleitner
View author publications
Search author on:PubMed Google Scholar
Marta Suárez Pinilla
View author publications
Search author on:PubMed Google Scholar
Martin Rossor
View author publications
Search author on:PubMed Google Scholar
Parashkev Nachev
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualisation, H.E., P.N., and M.R. Methodology, H.E., and P.N. Software, H.E. Validation, H.E., P.N., and M.S.P. Formal analysis, H.E. Investigation, H.E., and M.S.P. Resources, P.N. Data curation and verification, H.E., and M.S.P. Writing – original draft, H.E. Writing – review & editing, H.E., P.N., M.S.P., and M.R. Visualisation, H.E. Supervision, P.N. Funding acquisition, P.N. and M.R. All authors had full access to the data and reviewed the manuscript.

Corresponding authors

Correspondence to Holger Engleitner or Parashkev Nachev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Engleitner, H., Suárez Pinilla, M., Rossor, M. et al. Topological modelling of urban air pollution and cognition. npj Digit. Public Health 1, 7 (2026). https://doi.org/10.1038/s44482-025-00009-z

Download citation

Received: 29 April 2025
Accepted: 30 November 2025
Published: 08 April 2026
Version of record: 08 April 2026
DOI: https://doi.org/10.1038/s44482-025-00009-z