Detecting urban PM2.5 hotspots with mobile sensing and Gaussian process regression

Perry, Niál; Pedersen, Peter P.; Christensen, Charles N.; Nussli, Emanuel; Heinonen, Sanelma; Dagallier, Lorena Gordillo; Jacquat, Raphaël; Horstmann, Sebastian; Franck, Christoph

doi:10.1038/s44407-025-00038-1

Download PDF

Article
Open access
Published: 04 December 2025

Detecting urban PM_2.5 hotspots with mobile sensing and Gaussian process regression

Niál Perry¹,
Peter P. Pedersen^1,2,
Charles N. Christensen²,
Emanuel Nussli¹,
Sanelma Heinonen¹,
Lorena Gordillo Dagallier²,
Raphaël Jacquat²,
Sebastian Horstmann² &
…
Christoph Franck²

npj Clean Air volume 1, Article number: 38 (2025) Cite this article

1424 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Low-cost mobile sensors can be used to collect PM_2.5 concentration data throughout an entire city. However, identifying air pollution hotspots from the data is challenging due to the uneven spatial sampling, temporal variations in the background air quality, and the dynamism of urban air pollution sources. This study proposes a method to identify urban PM_2.5 hotspots that addresses these challenges, involving four steps: (1) equip citizen scientists with mobile PM_2.5 sensors while they travel; (2) normalise the raw data to remove the influence of background ambient pollution levels; (3) fit a Gaussian process regression model to the normalised values; (4) calculate spatially explicit ‘hotspot scores’ using the probabilistic framework of Gaussian processes, which summarise the relative pollution levels throughout the city. We apply our method to create the first ever map of PM_2.5 pollution in Kigali, Rwanda, at a 200m resolution, where we uncover several pollution hotspots. We also evaluate our method using simulated mobile sensing data for Beijing, China, where we show that the hotspot scores capture the ground-truth spatial PM_2.5 distribution. Thanks to the use of open-source software and low-cost sensors, our method can be re-applied globally to help fill the gap in urban air quality information.

Machine learning-guided integration of fixed and mobile sensors for high resolution urban PM_2.5 mapping

Article Open access 10 March 2025

A novel spatiotemporal prediction approach to fill air pollution data gaps using mobile sensors, machine learning and citizen science techniques

Article Open access 19 December 2024

The role of fine and submicron aerosol particles in urban air pollution in the context of meeting new air quality standards

Article Open access 02 December 2025

Introduction

Air pollution is considered ‘the single biggest environmental threat to human health’ by the World Health Organisation¹. A particularly harmful category of pollutants is PM_2.5, defined as particulate matter with an aerodynamic diameter ≤2.5 μm. PM_2.5 particulates are small enough to penetrate deep into the lungs and sometimes even the bloodstream, and are associated with a wide range of adverse respiratory², cardiovascular³, and cerebrovascular health outcomes⁴. Increased exposure to ambient PM_2.5 has also been linked to adverse birth outcomes⁵ and increased incidence of diabetes mellitus⁶. An estimated 4.1 million deaths in 2019 can be attributed to long-term ambient PM_2.5 exposure⁷, and it is estimated that PM_2.5-attributable deaths increased in all global regions except Europe and the Americas from 2000 to 2019⁸, a concerning trend.

In 2015, the 193 member states of the United Nations ratified the Sustainable Development Goals (SDGs), including SDG 11.6 which seeks to improve urban air quality by 2030. Despite receiving global support, current PM_2.5 monitoring infrastructure is insufficient to measure whether SDG 11.6 is achieved. As of 2019, the majority (141) of the world’s countries had no regular PM_2.5 monitoring at all, and the global mean population distance to a PM_2.5 monitor was 220 km⁹. A more recent analysis estimates that in-situ PM_2.5 measurements are missing for over 50% of the world’s urban population¹⁰. PM_2.5 estimates based on satellite data are a welcome contribution to filling the data gap (e.g., ref. ¹¹). However, satellite estimates often have a granularity of hundreds of meters or even kilometers, which is too coarse to capture the full variability of PM_2.5 within a city and accurately assess citizens’ personal exposure. Furthermore, satellite estimates can exhibit errors in the range 22–85% when they are not calibrated by ground PM_2.5 measurements¹².

Low-cost sensors are a promising alternative to satellite data, and several studies have already explored using the technology to estimate the concentration of PM_2.5 throughout an urban area. Low-cost sensors can be installed at fixed sites to increase the spatial coverage of the city’s monitoring network^13,14,15. An alternative approach that more easily scales for city-wide data capture is mobile sensing, wherein the sensors are transported while they record geolocated air quality measurements. Mobile sensors can complement a stationary monitoring network to support spatial air quality estimation^16,17 or can operate as a standalone system for mapping pollution levels. Several studies have deployed mobile sensors along predetermined routes on multiple different days and/or at different times of the day^18,19,20. This approach allows one to analyse how pollution levels vary along a fixed route, as well as how pollution varies across time when the location is controlled. However, deploying sensors along prescribed routes is not optimal for assessing pollution exposure because the routes may differ from routes that people usually take, and the spatial coverage is often limited. Other studies have instead fit mobile sensors to people²¹ or vehicles such as taxis^22,23 or trams²⁴ while going about their daily business. This leads to PM_2.5 concentrations data that are more informative about people’s day-to-day exposure and the data will often cover a larger spatial area than predetermined routes.

Once pollutant concentrations have been recorded with mobile sensing, a key question is where the air pollution hotspots are in the city. A hotspot is a location where the average pollution level exceeds that of the surrounding area. Identifying hotspots is relevant from a health perspective because short-term exposure to high levels of air pollution has been positively associated with increased inflammatory markers in the lungs²⁵ and with increased all-cause mortality for the pollutants PM_2.5, PM₁₀, NO₂ and O₃²⁶. The evidence suggests that health outcomes are not just determined by average PM_2.5 exposure, but also by the maximum or near-maximum PM_2.5 exposure. Additionally, hotspots are relevant from a policymaking perspective, as they constitute locations where policymakers can focus their clean air initiatives to achieve the maximal improvement in health outcomes. Hotspot detection can also help to pinpoint PM_2.5 pollution sources that were previously unknown²⁷.

In this study we propose an urban PM_2.5 hotspot-detection method that combines mobile sensing data with Gaussian process regression. Gaussian process regression has three particularly attractive properties for modelling PM_2.5 concentrations data: (1) it allows for a rich specification of the target function²⁸; (2) it is flexible, accommodating diverse data types and (3) compared to many geospatial models, specifying a Gaussian process requires little expert intervention^29,30. These advantageous properties have motivated increased research into applying Gaussian process regression to predict PM_2.5 concentrations^31,32,33 and to interpolate sparse PM_2.5 concentrations data^34,35. However, to the best of our knowledge, this study is the first to apply Gaussian process regression with the explicit aim of PM_2.5 hotspot detection from mobile sensing data.

Our method contributes to the growing body of literature on identifying urban PM_2.5 hotspots. One study³⁶ considers data from stationary PM_2.5 sensors and identify hotspots based on the frequency, scale, and consistency with which pollution levels exceed a given threshold (see also ref. ³⁷). However, their approach is designed for fixed PM_2.5 monitoring stations rather than for the uneven sampling pattern and wide spatial coverage of mobile sensing. Another approach which is more compatible with mobile sensing considers a ‘window’ around every observation, consisting either of neighbouring observations³⁸ or neighbouring pixels of gridded observations³⁹, and identifies PM_2.5 spikes within these windows as hotspots. Although effective for localized hotspot detection, these window-based approaches lack a shared baseline across windows, making them less suitable for city-wide hotspot identification. The ${G}_{i}^{* }$ statistic^40,41 is another popular choice for PM_2.5 hotspot identification^18,42. For every location i in a spatial grid, ${G}_{i}^{* }$ is calculated as the weighted sum of the PM_2.5 values at the other locations j, with the weights w_i,j being inversely proportional to the distance d_i,j between the location pairs^40,41. A location i is considered a hotspot if its ${G}_{i}^{* }$ statistic value is significantly larger than one would expect under a spatially-random distribution of the PM_2.5 observations. A limitation is that the ${G}_{i}^{* }$ are merely descriptive statistics about the clustering of observed data, and they convey less information than probabilistic approaches which also report the localised variability in the observed data.

To address these limitations, we propose a Gaussian process-based approach that is compatible with spatially- and temporally-uneven mobile sensing data, identifies hotspots relative to the whole city, and outputs spatially-explicit hotspot scores that capture the variability in the observed data. We proceed to evaluate the PM_2.5 hotspot-detection method on two datasets. The first is a mobile sensing dataset that we gathered in Kigali, Rwanda, in September 2021. The population of Kigali has grown from tens of thousands of people to over 1 million in the last 50 years, yet as of 2021 the city had no publicly-accessible reference PM_2.5 monitoring station. Kigali is therefore a good representation of a city in a low-income country with an air quality data gap that mobile sensing can help to resolve, in pursuit of SDG 11.6. The second dataset comprises seasonal estimates of the average PM_2.5 concentrations for Beijing. Published by Wang et al.⁴³, the dataset is based on several information sources, such as satellite top-of-atmosphere data, traffic emissions and land use data, and contains average PM_2.5 estimates at a spatial resolution of 30 m. Treating the Beijing dataset as a ‘ground truth’ for hotspot locations, we first simulate mobile sensing data, and then evaluate whether our method can successfully reconstruct the spatial PM_2.5 distribution from the simulated mobile sensing data.

Results

Kigali data

After processing, the Kigali dataset consists of 2.02 million observations of the PM_2.5 concentration in and around Kigali. With one measurement per second, this corresponds to >562 h of data. In general, tiles with the greatest number of measurements encompass a stretch of a major road (Fig. 1a), and the measurements are concentrated in the city centre more than the outskirts. The median number of observations per tile is 100 and there is a large gulf between the minimum (1) and maximum (48,670), reflecting the uneven spatial sampling. Regarding the temporal density, PM_2.5 observations are available at every hour of the day (Figure 1b). The least sampled hours are 01:00-03:00 where fewer than 1000 observations were recorded across the twenty days considered. Meanwhile, between 06:00 and 21:00 there is an abundance of data, with at least 50K observations per hourly interval. The temporal distribution reflects that the drivers carried out most of their work during daylight hours. Between calendar days the sampling intensity was fairly consistent: the mean number of measurements was 101.2K per day, with a minimum of 71K on Sunday 5th September 2021 and a maximum of 134.3K on Monday 13th September 2021.

The average PM_2.5 concentration recorded exhibits a diurnal pattern (Fig. 2), with the mean concentration peaking in the early morning and again in the late afternoon and evening (Fig. 2a). The standard deviation of the PM_2.5 measurements is smallest at night (00:00–04:00) and larger throughout the rest of the day, although given the lack of nighttime observations, the nighttime standard deviation estimates are less reliable than those in the daytime. This diurnal pattern is similar on every day of the week (Fig. 2b), and closely matches the pattern observed in previous PM_2.5 measurement studies in Kigali^44,45.

**Fig. 2: Diurnal PM_2.5 profile in Kigali.**

An interesting feature is that during the morning (07:00–11:00), average PM_2.5 is clearly lower on Sundays than on any other weekday, but from 12:00 onwards the PM_2.5 level on Sunday resembles the other weekdays again. This coincides with the timing of the ‘car-free Sundays’ scheme in Kigali ⁴⁴. On the first and third Sunday of every month, several main roads in Kigali are closed to motorized vehicles between 07:00 and 10:00, and pedestrians are encouraged to gather in the vacant streets for group exercise sessions. At 10:00 all roads are re-opened. Two out of the three Sundays within our data collection campaign were ‘car-free’, which could plausibly affect the air quality encountered by the mobile sensors, although more robust evidence would be needed to establish a causal link between the scheme and decreased ambient pollution levels. Figure 2b also suggests that the mean PM_2.5 concentration increases sharply on Friday nights and dips during the early hours of Wednesday morning. However, these extremes both correspond to a small number of measurements, so probably do not reflect a typical trend.

We argue that because our data is gathered while the drivers carry out their usual jobs - as opposed to driving along pre-determined routes - the average PM_2.5 concentrations recorded are representative of the PM_2.5 exposure of Kigali’s road users. It is therefore alarming to see that the mean PM_2.5 concentration exceeds the WHO’s recommended level of 5 μg/m³ at all hours of the day. The overall mean PM_2.5 concentration in the Kigali data is 44.61 μg/m³ with a standard deviation of 39.47 μg/m³. Furthermore, 91.9% of PM_2.5 concentration observations exceed 15 μg/m³, whereas the WHO guidelines recommend that PM_2.5 exposure should not exceed 15 μg/m³ for any more than a handful of days in the year. Our data supports previous studies finding that air pollution in Kigali often exceeds the safe limits set by the WHO^44,45,46, and provides evidence that the air quality encountered by road users is dangerously unhealthy as a result.

Hotspot detection in Kigali

We then calculated the hotspot scores (h(j))_j∈J from the Kigali mobile sensing data. For the spatial resolution we chose 200 × 200 m as a compromise between a high spatial granularity and a large number of observations per tile. After the scores had been computed, we classified every tile j ∈ J as follows: if h(j) > 0.8 then j is a hotspot; if h(j) < 0.2 then j is a coldspot; otherwise, j is neutral, although these thresholds can naturally be changed in different mobile sensing studies. Using this classification, 184 of the tiles are hotspots, 60 of the tiles are coldspots, and 3919 of the tiles are neutral. The mean and standard deviation of observed PM_2.5 concentrations in hotspot tiles is 46.3 ± 22.3 μg/m³; in coldspot tiles, 26.5 ± 0.8 μg/m³; and in neutral tiles, 39.1 ± 42.3 μg/m³. Although hotspots only account for 4.4% of the gridded area, over 21.1% of observations are recorded in tiles classified as hotspots. This implies that the drivers spent a disproportionate amount of time in the most polluted locations in the city, highlighting the importance of measures to tackle Kigali’s urban hotspots.

The hotspot scores are variegated (Fig. 3a), reflecting the flexibility of Gaussian process regression models. A large hotspot in Kigali is centered around 1°56’31”S, 30°2’36”E, which is the intersection between four major roads: the KN1, KN7, KN20 and Kigali-Gatuna. Here, the hotspot scores are among the largest for the whole city, with hotspot scores >0.9 in several neighbouring tiles, indicating that ambient PM_2.5 concentrations in this area often exceed the city-wide median. Google Street View images of this neighbourhood (Fig. 3b) reveal that there are multiple lanes of traffic, several petrol stations, and pockets of exposed sand (e.g., at the intersection between the KN1 and Kigali-Gatuna roads). High hotspot scores are continually observed along the four major roads for at least a kilometer as they depart from this intersection. A second prominent hotspot is in the South of the Kicukiro neighbourhood, around 1°59’ 01.9 ”S 30°06’08.7”E. Two neighbouring tiles have hotspot scores > 0.9, encompassing a stretch of the multi-lane KK15 highway and a local tyre retreading company. For comparison, Google Street View images were also inspected at a coldspot along the KG 649 street (Fig. 3c), where the model assigned low hotspot scores (<0.2). The Google Street View images suggest that this is a low-traffic area, with few visible vehicles. Open fields directly border the road rather than denser developments like residential housing, and the road is lined with trees.

We also compared the hotspot locations with three public geographic datasets (Table 1): the road network from the OpenStreetMap⁴⁷, the 10 m-resolution map of built-up surface area from the Global Human Settlement Layer⁴⁸ - which estimates the proportion of land occupied by human settlements - and the 10m-resolution Rwanda digital elevation model (DEM) from the United Nations Development Programme⁴⁹. A binary variable was calculated for every tile indicating whether or not it contains a stretch of a main road, and the built-up surface and DEM were upsampled to 200 m resolution by taking the mean. We found that 60% of hotspot tiles contain a main road compared to 33% of neutral and coldspot tiles, which suggests that traffic volume is a major contributor to spatial PM_2.5 variability in Kigali. Additionally, the average built-up surface area among hotspot tiles is 27% versus 22% in neutral tiles and just 11% in coldspot tiles, and the rank correlation between hotspot scores and built-up surface area is ρ = 0.22. This implies that a higher density of human settlements is also associated with worse air quality. Meanwhile, the average elevation (m.a.sl) of hotspot tiles is around 60 m lower than non-hotspot tiles, and the rank correlation between mean elevation and the hotspot scores is negative ρ = − 0.08. Kigali is a notoriously hilly city and this result suggests that Kigali’s air quality is worse at lower altitudes, although more work would be needed to identify to what extent topographical factors are responsible versus socioeconomic and urban planning factors⁵⁰.

Table 1 Hotspot co-occurrence with geographic features

Full size table

Hotspot detection in Beijing

We then applied our hotspot-detection method on the simulated mobile sensing data in the two evaluation areas in Beijing and evaluated the results as described in Section “Validating the method”. The hotspot scores correspond closely with the true spatial PM_2.5 profiles. In Beijing LP (Fig. 4a, b), the figure-of-eight shape in the Northwest is clearly identifiable and the major hotspot along the right and bottom of this quadrant is also evident. One also sees that the model delineates the boundaries between high and low pollution, although a caveat is that the estimated boundaries are much smoother than the reality. The boundary between the hotspot on the right and the average pollution region in the middle is only a single tile wide (30 m). However, the method outputs a boundary seven tiles wide (210 m). This indicates that the Gaussian process regression surface is overly smooth, and does not effectively model the variations in PM_2.5 concentration across small spatial scales. In Beijing CLT (Fig. 4c, d), the hotspot scores also match the main spatial contours of PM_2.5 pollution. However, the spatial profile of the hotspot scores is again very smooth. In particular, the small patches of varying PM_2.5 levels in the left of the quadrant are smoothed over in the hotspot scores.

**Fig. 4: Validation of the hotspot-detection method on the Beijing dataset.**

The Spearman rank correlation coefficient value ρ between the hotspot scores and the spatial PM_2.5 averages is high in both evaluation areas. When all grid tiles are included, ρ = 0.8361 in Beijing CLT and ρ = 0.7451 in Beijing LP. These values are close to the maximum achievable correlation of 1, which supports the fact that our method effectively reconstructs the spatial profile of PM_2.5 pollution from spatiotemporally sparse mobile sensing data. We also experimented with re-computing the correlation while filtering to the grid tiles where n_measurements > t and varying the minimum number t, to investigate whether the spatial distribution of the hotspot scores is more accurate among the more frequently sampled grid tiles. Figure 5 shows that as t increases, the correlation increases markedly in Beijing LP but remains stable in Beijing CLT. This suggests that increasing the frequency with which the mobile sensors visit all the tiles could improve the accuracy of the resulting hotspot classification. Future mobile monitoring studies should therefore attempt to ensure that all areas of interest in the city are visited several times by the drivers carrying the sensors.

**Fig. 5: The effect of sampling frequency on the accuracy of the hotspot scores.**

Probabilistic calibration

Overall, the hotspot scores are reasonably well-calibrated as posterior probabilities (Fig. 6). In both areas, the exceedance proportions in the holdout test set were found to increase monotonically with the hotspot scores estimated on the training set. I.e., the greater the hotspot score in a tile, the more frequently the PM_2.5 concentration is indeed exceeding the city-wide median. In Beijing CLT, the calibration is particularly good in the top three bins of hotspot scores, where the calibration curve aligns with the line of perfect calibration. However, for lower bins, the model was generally overconfident: e.g., among tiles where the hotspot score was in the interval (0.3, 0.4], f(x_i) exceeds median(y) for only 9.3% of the observations. In Beijing LP, the calibration curve has an inverse C-shape: the hotspot scores and the observed exceedance proportions are close together at lower and higher values of the hotspot scores, whereas in between the extremities, the calibration curve diverges from the ideal and the model is severely over-confident. Overall, the calibration curves suggest that the hotspot scores are more accurate at extreme hotspots and coldspots, and less accurate in the tiles where the average pollution is close to the city-wide average.

**Fig. 6: Calibration analysis for the hotspot scores calculated on the Beijing dataset.**

The calibration procedure corrects for the estimation errors (Fig. 6b, d). The correction is particularly acute among the least-polluted tiles, where the uncalibrated model tends to over-estimate the exceedance proportion. For example, in Beijing CLT (Fig. 6b), tiles with a hotspot score up to 0.3307 have a calibrated hotspot score of 0.0855, while in Beijing LP (Fig. 6d), tiles with a hotspot score in the range 0.3968–0.4228 have a calibrated hotspot score of 0.1618. The calibration procedure greatly improves key metrics computed on the holdout test set. In Beijing CLT, calibration reduces the ECE from 0.1527 to 0.0081 and the Brier score from 0.2085 to 0.1637. In Beijing LP, calibration reduces the ECE from 0.2294 to 0.0441 and the Brier score from 0.2270 to 0.1688. After transformation with the isotonic regressor, the reliability curve aligns very well with the ideal diagonal curve in both areas (Fig. 6a, c).

Discussion

Short-term exposure to high PM_2.5 concentrations is associated with increased PM-related mortality and a wide range of adverse health outcomes^51,52, hence identifying and cleaning up PM_2.5 hotspots is a major opportunity to improve public health, in pursuit of the UN’s Sustainable Development Goal 11.6. This study presents a method for identifying PM_2.5 hotspots in an urban area, combining mobile sensing with Gaussian process regression. The method consists of four steps: (1) equip a fleet of vehicles with low-cost PM_2.5 sensors to measure the air quality; (2) normalise the raw observations for background concentrations by subtracting the median concentration from the previous x_w minutes (in this case study, x_w = 15); (3) fit a Gaussian process regression surface to the normalised observations, and (4) compute posterior probabilities across a spatial grid that the normalised PM_2.5 concentration exceeds the city-wide median, which are the ‘hotspot scores’. Using observed data from Kigali, Rwanda, we showed that our method is compatible with the data from real-world mobile sensing campaigns, and that the hotspots detected by the method coincide with main roads, denser urban settlements, and lower altitude regions. Although previous air quality studies have been performed in Kigali^44,45,46, to the best of our knowledge, ours is the first study to present the spatial distribution of PM_2.5 pollution throughout the entire city of Kigali.

Using simulated mobile sensing data from Beijing, China, we also showed that the hotspot scores preserve the spatial profile of PM_2.5 concentrations in a city. I.e., if an area of the city is more polluted than average, then its ‘hotspot score’ is typically higher, and vice versa. We also showed that the hotspot scores are probabilistically well-calibrated, and that the probabilities can be made even more accurate with isotonic regression, an out-of-the-box calibration regressor. These properties suggest that our hotspot scores are apt for identifying PM_2.5 hotspots in an urban area, which corroborates the growing enthusiasm for applying Gaussian process regression in spatial statistics problems^28,29.

Our hotspot-detection method is designed for urban areas in low-income countries, where PM_2.5 pollution is among the worst in the world and yet fixed PM_2.5 monitoring equipment is often lacking¹⁰. To this end, only low-cost sensors are used for the data collection, and a background normalisation procedure is proposed that does not require measurements from a fixed PM_2.5 monitoring station. Additionally, all computations and analysis were performed with free, open-source software (R, Python, QGIS) and data (OSM) so that the method can be recreated without expensive software licences. These considerations should remove some of the barriers for practitioners in lower-income countries to apply our method for their own case study. Furthermore, although Gaussian process modelling is typically computationally intensive, we propose an implementation that greatly reduces the computational requirements of the calculations based on simple bagging with subsamples of the data, as explained in Section “Practical implementation”. For larger urban areas than Kigali or denser mobile sensing campaigns where more data is collected, it is possible that further steps would still be needed to reduce the computational runtime. In that case, future practitioners could explore combining the bagging procedure with other methods for simplifying Gaussian process calculations such as inducing-point methods⁵³ or methods of distributed computing⁵⁴.

In our method, PM_2.5 data is collected by equipping a fleet of on-road vehicles with low-cost sensors, then measuring the ambient air quality while the vehicles drive around as usual. We argue that the hotspots identified by our method are therefore highly relevant to the pollution exposure of local road users. In the Kigali case study, we found that the drivers spent a disproportionate amount of time in the most polluted locations in the city. There is growing research to suggest that the air quality within cars correlates closely with the outer air quality, particularly when the car windows are open⁵⁵. Moreover, motorbike users typically breathe more polluted air than car users because they are less insulated from the outer air⁵⁶. Accordingly, improving on-road air quality would directly improve the air quality encountered by road users, and our method suggests some priority locations for such an effort.

A challenge in mobile sensing campaigns is data sparsity, which can reduce the accuracy of air quality estimates^17,57,58. Although this study benefitted from a large dataset of 2.02 million observations in Kigali with a wide spatiotemporal coverage, it is therefore worth considering how our method would perform in a data-sparse context. If the data coverage is low during certain hours of the data acquisition, then the main pitfall is that the rolling baseline may become biased and misrepresent the ambient pollution level, introducing error into the background-normalised values y_i. Adjusting the window length x_w according to the temporal resolution of the data is therefore important. As argued in ref. ⁵⁹ and implemented in this study, a good criterion is that the background normalisation should produce a smooth temporal baseline. On the other hand, if a location has sparse data coverage, the pitfall is that the hotspot score may be less accurate there. Indeed, the results from Beijing LP (Fig. 5) suggest that the hotspot scores perform better in tiles with more mobile sensing observations. Adjusting the grid size to the spatial density of the data is therefore important, to ensure that the tiles are generally well-sampled. It is also important to choose an appropriate kernel for the Gaussian process regression model, because the kernel determines the degree of spatial interpolation in under-sampled areas. In the future, additional spatial covariates could be added to the model such as land-use⁶⁰ to improve predictions in data-sparse tiles.

A shortcoming of the validation procedure is that the simulation process for the synthetic mobile sensing data in Beijing is overly simplified. Synthetic PM_2.5 measurements were generated according to the process described in Section “Validating the method”, considering the road network, the spatial PM_2.5 distribution in Beijing, the hourly trends in real-world monitoring station data, and two sources of stochastic measurement interference. State-of-the-art air pollution simulations generally incorporate atmospheric dispersion, explicit pollution sources, meteorological conditions, and sometimes traffic patterns^61,62. However, the main purpose of the mobile sensing data simulation is to generate data with a high spatial and temporal resolution where the spatial PM_2.5 profile is known, so that we can assess the accuracy of our method with reference to a ground truth. The realism of the air pollution dynamics is therefore of secondary importance. Nevertheless, a mobile sensing simulator with high spatiotemporal resolution and a greater incorporation of real-world air pollution dynamics would enable a more convincing validation.

In Section “Probabilistic calibration” we showed that calibrating the hotspot scores with an isotonic regression fit makes the posterior probability estimates more accurate. Therefore, an interesting direction for future work is to devise a calibration procedure that also works on real, non-simulated mobile sensing data. The calibration procedure here relies on the fact that noise-free observations of f(x) are available when the dataset is simulated, but this is not the case in a real-world mobile sensing campaign, where observations y_i are theorised to be corrupted by some random noise. Future studies could consider combining the measurements from multiple sensors or triangulating the mobile measurements with a reference station in order to estimate f(x), and then using the estimates $\widehat{f}({\bf{x}})$ to calibrate the hotspot scores. Nevertheless, we reiterate that the calibration transformation does not change the relative ranking of the grid tiles, so the hotspot scores can still be useful to identify hotspots even if poorly calibrated.

Methods

Mobile sensors

The air quality monitors used as part of the Kigali deployment, Fig. 7, were designed by open-seneca to be low-cost and portable, measuring geotagged PM_2.5 data at a cadence of 1 Hz. The air quality monitors are based on an integrated circuit board and embedded software design, based on an ARM Cortex M4 microcontroller (STM32F405RGT6). The integrated circuit board controls an external Sensirion SPS30 particulate matter sensor and includes an on-board GNSS module (u-blox MAX-M8Q), a Sensirion SHTC3 temperature and humidity sensor, a display for data visualisation, and microSD card for data logging. All sensor readings and metadata are written as comma-separated values to an onboard microSD card. Further information on the hardware and software can be found on open-seneca GitHub page (https://github.com/open-seneca).

**Fig. 7: The open-seneca air quality monitor mounted on an electric motorbike, part of Ampersand’s delivery fleet.**

The Sensirion SPS30 particulate matter sensor is factory pre-calibrated (https://sensirion.com/products/catalog/SPS30). Prior to this study, a co-location of 50 sensors from the same batch as those used in Kigali showed excellent inter-sensor comparability. They achieved ±0.38 μg/m³ standard deviation from the mean over a 2 week period of co-location, as shown in Fig. 8, between the range of 0.1 and 28.0 μg/m³. The co-location performance was only evaluated while the sensors were stationary. However, during this period the sensors were exposed to wind - akin to moving on a bike - which makes us confident in their performance during mobile deployment.

**Fig. 8: Co-location of 50 sensors over a 2 week period, taking 60 s averages over the whole period, performed in Lisbon, Portugal.**

Calibration against a reference station was not possible in Kigali due to the absence of a public reference station. However, given the nature of this study and the excellent out-of-the-box inter-sensor comparability, this was not necessary. For reference, the Supplementary Information presents a previous calibration of the same sensor type as those used in Kigali with a reference station at a different location. These calibration data were not used in this study, as the focus was on sensor inter-comparability rather than absolute values.

Experiment setup in Kigali

The study conducted in Kigali was designed to address air pollution concerns in Rwanda’s capital city. It employed 16 open-seneca air quality monitors, mounted on electric delivery motorbikes, part of Ampersand’s Rwanda Ltd fleet. The data collection occurred from 1st of September 2021 to the 1st of October 2021, as part of the Urban Pathways project, supported by the Wuppertal Institute, UN-Habitat, and the United Nations Environment Programme (UNEP), and funded by the International Climate Protection Initiative (IKI) of the German Ministry of the Environment.

The study was facilitated through collaboration with the University of Rwanda, the City of Kigali, the Urban Electric Mobility Initiative (UEMI), and Ampersand Rwanda Ltd. The method involved engaging professional delivery drivers as key participants, who carried sensors during their daily delivery routes. This approach ensured efficient data collection and coverage while adhering to the study’s time constraints.

The sensors were mounted at the front of the vehicle, behind the handlebars, and were continuously powered via the electric motorbike’s onboard USB outlet, as shown in Fig. 7. In the open-seneca air quality monitor, the inlet of the Sensirion SPS30 PM_2.5 sensor faced opposite to the direction of travel to minimise the impact of direct wind on measurements, as the sensors are sensitive to airflow within the sampling channel.

At the conclusion of the data collection period, the microSD cards were manually retrieved from each of the 16 devices. The raw data from each sensor was stored in individual .csv files. These files contained time-series data for PM_2.5, temperature, relative humidity, latitude, longitude, and GPS accuracy metrics, recorded at the 1 Hz frequency. The collected raw data was subsequently aggregated, cleaned, and processed to prepare it for spatial and temporal analysis.

Data processing

Duplicate rows in the data were removed, as well as any rows where the date, time, latitude or longitude are missing. The data were also filtered to a spatial bounding box encompassing Kigali, with latitude between 2° 9’36”S and 1°51’00”S, and longitude between 29°54’00”E and 30°15’00”E. Measurements were filtered to the 1st–20th of September 2021, the most active days of the study, and the temporal granularity of the data was reduced to one-second intervals: for all seconds where a device registered more than one measurement, the first measurement was retained and the rest were discarded. Measurements where speed equals zero were removed, because there were instances where a device remained switched on for several hours while the vehicle was stationary, possibly collecting measurements inside the driver’s house rather than the ambient pollution. Observations where PM_2.5 > 500 μg/m³ were deemed outliers and removed, with the cut-off value of 500 chosen after consulting typical PM_2.5 ranges in polluted cities (e.g., ref. ⁶³). The processed dataset is publicly available at ref. ⁶⁴.

Hotspot detection method

Our method for detecting urban PM_2.5 hotspots consists of four main steps (Fig. 9). The first step is to collect PM_2.5 concentrations data by equipping a fleet of vehicles with mobile PM_2.5 sensors. The collected data should be appropriately processed, e.g., to remove outliers, as in Section “Data processing”. However, we do not prescribe the processing stages because the guidelines should be as general as possible, whereas the data processing required depends on the context of the mobile sensing deployment. After the data are collected and processed, the second step is background normalisation, which entails correcting for the hour-to-hour meteorological variations in ambient PM_2.5 concentrations. Mobile monitoring studies often normalise for background concentrations by scaling the measured values according to the measurements from a local PM_2.5 monitoring station^20,65,66. However, in September 2021, Kigali had no public reference PM_2.5 monitoring station, and moreover, we want our method to be applicable in other cities with no fixed pollution monitoring apparatus. We therefore propose a background-normalisation procedure that only depends on the mobile sensing data. The raw PM_2.5 observations are normalised by selecting a window length x_w and subtracting the median from the previous x_w minutes of observed PM_2.5 concentrations (Eq. (1)), an approach similar to the subtraction of a rolling background value in refs. ^59,67,68.

$${y}_{i}={y}_{i}^{{\text{raw}}}-{\text{median}}(\{{y}_{k}^{{\text{raw}}}:{t}_{i}-{t}_{k}\in [0,{x}_{w}],k\in I\})$$

(1)

where

$$\begin{array}{rcl}{y}_{i}^{{\text{raw}}} & = & {\text{raw}}\,({\text{observed}}){{\text{PM}}}_{2.5}\,{\text{concentration}}\\ {y}_{i} & = & {\text{normalised}}\,{{\text{PM}}}_{2.5}\,{\text{concentration}}\\ {t}_{i} & = & {\text{timestamp}}\,{\text{of}}\,{i}^{{{\prime} }}\,{\text{th}}\,{\text{observation}}\\ I & = & {\text{index}}\,{\text{set}}\,{\text{for}}\,{\text{mobile}}\,{\text{sensing}}\,{\text{observations}}\end{array}$$

x_w is an important hyperparameter in our method. A longer window (e.g., x_w = 60 min) means more observations and a greater number of different drivers are included in the median calculation, so the subtracted baseline is more robust to outliers and noise. On the other hand, a shorter window (e.g., x_w = 1 min) enables normalising for variation at a higher temporal resolution. We compared several different window lengths empirically on the Kigali data (x_w = 1, 5, 15, 30, 60 min), and eventually selected x_w = 15 min because, when grouping observations into 15-min buckets, the resulting average diurnal profile is smoother than for other choices of x_w. Here smoothness is measured as minimising the absolute deviation between measurements in consecutive intervals. While x_w = 15 is appropriate for the Kigali data, we stress that the parameter should be recalibrated on future datasets, either through the same empirical procedure or by considering factors such as local meteorology, traffic dynamics, and the temporal coverage of the dataset.

**Fig. 9: The PM_2.5 hotspot identification method proposed in this paper.**

The third step is to fit a Gaussian process regression model to the spatial extent of the city. As in the general regression setting, the objective in Gaussian process regression is to learn a function f relating a quantity of interest y_i to a known quantity x_i in the presence of random noise ε_i:

$${y}_{i}=f({{\bf{x}}}_{i})+{\varepsilon }_{i}$$

(2)

In our context, i ∈ I indexes an observation from the data, ${{\bf{x}}}_{i}\in {\mathcal{X}}\subset {{\mathbb{R}}}^{2}$ is the vector of latitude and longitude coordinates, y_i denotes the normalised PM_2.5, and f relates the pair of coordinates to its PM_2.5 concentration. The Gaussian process model thus only considers the spatial location of an observation, with the background normalisation procedure designed to enable the direct comparison of observations made at different times throughout the month. In Gaussian process regression, the key assumption is that f has a Gaussian process as prior distribution. At any location x* in the city extent ${\mathcal{X}}$, the Gaussian process regression model thus represents the pollution f(x*) as a probability distribution as opposed to a static value, which we argue is a faithful representation of the real-world dynamics of PM_2.5. At any location in an urban environment, the ambient PM_2.5 concentration is determined by a multitude of factors such as meteorological conditions, passing vehicles and nearby construction activities, all of which vary in intensity across short time scales. Representing the pollution as a probability distribution rather than a static value quantifies this variability. Additionally, Gaussian process regression outputs spatially-explicit variance estimates as opposed to one single variance estimate for the city, so the model captures the fact that some areas experience a greater variability in pollution levels than others (e.g., a busy intersection versus a rural backstreet).

The fourth and final step is to divide the urban area into a grid of tiles and calculate tile-wise hotspot scores. We consider as hotspots the locations where the pollution levels are frequently higher than the average pollution level across the whole city. This definition is compatible with the fact that pollution sources are dynamic, so any location in a city could theoretically experience high pollution at a given time. Therefore, what makes a location a hotspot is that the pollution is regularly greater than the average for the whole city. We propose ‘hotspot scores’ derived from the Gaussian process model to quantify the hotspots. Formally, for a tile j with a centroid at x_j = (lat_j, lon_j), we define the hotspot score h(j) as the estimated posterior probability that f(x_j) exceeds the empirical median of normalised PM_2.5 for the city, median(y):

$$h(j):=\widehat{{\rm{{\mathbb{P}}}}}[f({{\bf{x}}}_{j}) > {\text{median}}(y)]$$

(3)

The hotspot score at x_j quantifies our belief that an observation of the PM_2.5 concentration there would exceed the empirical median for the city. For example, a hotspot score of 0.3 means that we expect 30% of noise-free PM_2.5 observations to exceed the city-wide median. All quantities are background-normalised, so the hotspot scores exclude the hour-to-hour variation in ambient PM_2.5 concentrations and summarise the spatial variability between locations. A common baseline median(y) is used for every tile so that the hotspots are relative to the whole city, as opposed to the window-based approaches discussed in the Introduction. The tile size regulates the granularity of the air quality estimates. Urban air quality studies with gridded observations typically choose a tile length of tens or hundreds of meters^43,66,69. We recommend adjusting the tile size according to the sparsity of the observed data, since sampling density has been shown to affect the accuracy of spatial air pollutant estimates⁶⁵.

Practical implementation

Throughout this work we apply the exponential kernel as the covariance function k:

$$k({{\bf{x}}}_{i},{{\bf{x}}}_{j})={\sigma }_{k}^{2}\exp \left(-\,\frac{\parallel {{\bf{x}}}_{i}-{{\bf{x}}}_{j}{\parallel }_{1}}{\ell }\right)$$

ℓ is the length-scale parameter which models the ‘spikiness’ of the kernel, and ${\sigma }_{k}^{2}$ is the kernel variance parameter, which are both optimised during model training by gpflow⁷⁰. Other researchers have typically preferred the squared exponential kernel for modelling PM_2.5 concentrations with Gaussian process regression^31,35,71. However, during empirical testing on the Kigali dataset, we found that the squared exponential kernel smoothed over the spatial variation in PM_2.5 levels, producing near-equal hotspot scores throughout most of the city and then very high hotspot scores at a single location. By contrast, the exponential kernel produced variegated hotspot scores.

As described by Rasmussen and Williams⁷², ‘One issue with Gaussian process prediction methods is that their basic complexity is ${\mathcal{O}}({n}^{3})$, due to the inversion of a n × n matrix. For large datasets this is prohibitive (in both time and space)’. With mobile sensing datasets typically comprising a large number of observations, fitting a single Gaussian process to all of the mobile sensing observations can be computationally infeasible. Hence we instead apply simple bagging with subsamples of the data⁷³, as detailed in the Gaussian process context by Chen and Ren⁷⁴. Instead of estimating one spatial PM_2.5 distribution for the city extent, we take B random samples of size m < < n from the mobile sensing data and fit a separate model ${\widehat{f}}_{b}$ to each sample b = 1, …, B. Predicted means and variances at a point x_i are computed by averaging the B individual predictions according to:

$$\widehat{\mu }({{\bf{x}}}_{i})=\frac{1}{B}\mathop{\sum }\limits_{b=1}^{B}\widehat{{\mu }_{b}}({{\bf{x}}}_{i})$$

(4)

$${\widehat{\sigma }}^{2}({{\bf{x}}}_{i})=\frac{1}{B}\mathop{\sum }\limits_{b=1}^{B}{\widehat{{\sigma }_{b}}}^{2}({{\bf{x}}}_{i})+\frac{1}{B}\mathop{\sum }\limits_{b=1}^{B}{(\widehat{{\mu }_{b}}({{\bf{x}}}_{i})-\widehat{\mu }({{\bf{x}}}_{i}))}^{2}$$

(5)

where $\widehat{{\mu }_{b}}({{\bf{x}}}_{i})$ and ${\widehat{{\sigma }_{b}}}^{2}({{\bf{x}}}_{i})$ are respectively the mean and variance values of f(x_i) predicted by the b’th model. However, unlike Chen and Ren⁷⁴, we do not select observations for the B samples uniformly at random. Instead, we employ a scheme to disproportionately sample from the most polluted and least polluted grid tiles, to ensure that more data is gathered about tiles suspected to be hotspots or coldspots. Our sampling scheme orders the grid tiles according to their median normalised PM_2.5 concentration. Then, we specify probabilities p_high and p_low and thresholds r_high and r_low, and proceed to sample observations from the top r_high proportion of tiles with probability p_high and from the bottom r_low proportion of tiles with a probability proportional to p_low. In our experiments we chose p_high = p_low = 0.3 and r_high = r_low = 0.2, meaning observations belonging to the most fifth and the least polluted fifth of tiles are 50% more likely to appear in a bootstrapped sample than the rest of the observations.

Once the B gaussian process regression surfaces have been fit, we predict the mean and variance of f at every tile centroid x_j in the grid according to Eqs. (4) and (5). By the marginalisation property of Gaussian processes⁷², the normalised PM_2.5 f(x_j) at a tile centroid x_j is a random variable with the distribution N(μ(x_j), σ²(x_j)) which we estimate by $N(\widehat{\mu }({{\bf{x}}}_{j}),{\widehat{\sigma }}^{2}({{\bf{x}}}_{j}))$. Denoting by median(y) the estimated median of normalised PM_2.5 throughout the city, the estimated probability that f(x_j) exceeds the city-wide median PM_2.5:

$$\widehat{{\rm{{\mathbb{P}}}}}[f({{\bf{x}}}_{j}) > {\text{median}}(y)]$$

thus equals:

$${\rm{{\mathbb{P}}}}\left[Z > \frac{{\text{median}}(y)-\widehat{\mu }({{\bf{x}}}_{j})}{\widehat{\sigma }({{\bf{x}}}_{j})}\right]$$

where Z ~ N(0, 1) is a standard Gaussian variable. This probability is exactly the hotspot score h(j) reported in this study.

All experiments were run on a machine with a 12th Gen Intel(R) Core i7-1260P CPU (12 cores, 16 processors), 16GB RAM, running Microsoft Windows 11. The code was executed using Python 3.11.7 with version 2.9.1 of package gpflow. We bagged B = 100 models fit to subsets of m = 2000 datapoints in the experiments. The process of fitting the models, predicting the mean and variance across the spatial grid and then combining the results typically took around 3 hours to execute.

Validating the method

In addition to the PM_2.5 concentrations data gathered in Kigali, we evaluate our method using another dataset of spatial average PM_2.5 concentrations in Beijing, China. Published by Wang et al.⁴³, this dataset consists of estimated seasonal average PM_2.5 concentrations at a 30 × 30 m spatial resolution. The authors estimated PM_2.5 concentrations by combining the WRF-CMAQ atmospheric dispersion model⁷⁵ with several data sources, including top-of-atmosphere satellite data, land-use data, and terrain data like elevation above sea level. We evaluate our hotspot-detection method by treating their spatial PM_2.5 estimates in Beijing as a ground-truth spatial PM_2.5 distribution, then simulating a mobile sensing dataset and applying our hotspot-detection method to the simulated data.

Simulating the mobile sensing PM_2.5 measurements began with simulating the vehicle routes. Using road network data from the Open Street Map (OSM), we selected ten thousand pairs of random on-road points within the evaluation area. Every pair of points represents a start and end location of a driver’s journey: $\{(({\text{lat}}_{s}^{{\text{start}}},{\rm{l}}{\rm{o}}{{\rm{n}}}_{s}^{{\text{start}}}),({\text{lat}}_{s}^{{\text{end}}},{{\text{lon}}}_{s}^{{\text{end}}})){\text{for}}\,s=1,2,\ldots ,10,000\}$. The networkx package in Python was then used to compute the shortest on-road path between every pair of points⁷⁶. This produces ten thousand on-road routes, which roughly corresponds to sixteen delivery drivers making thirty trips per day for twenty days (20 × 16 × 30 = 9600).

The next step was to generate observation locations and associated timestamps along all the routes. Recall that the mobile sensors measure the PM_2.5 concentration once per second. Assuming a constant speed of 30 km/h, this corresponds to one observation every 8.33 m, so observation locations were generated at uniform intervals of 8.33 m along the on-road routes. Every start location was assigned a random timestamp in the period of September 2019, then timestamps were assigned in increments of one second to the subsequent observations within the same journey. This resulted in a total of 2.7 million observation locations of the form (lat, lon, t) for each evaluation area, equivalent to ~750 h of data.

Finally, a PM_2.5 measurement was assigned to every (lat, lon, t) tuple according to the following formula:

$${y}_{i}^{raw}=h{m}_{{t}_{i}}\times {{\text{PM}}}_{2.5}^{j}+{\kappa }_{i}{\gamma }_{i}+{\varepsilon }_{i}$$

(6)

where

$$\begin{array}{rcl}j & = & {\text{index}}\,{\text{of}}\,{\text{tile}}\,{\text{containing}}\,{\text{observation}}\,i\\ {\text{PM}}_{2.5}^{j} & = & {\text{average}}\,{{\text{PM}}}_{2.5}\,{\text{concentration}}\,{\text{in}}\,{\text{tile}}\,{\rm{j}}\\ h{m}_{{t}_{i}} & = & {\text{hourly}}\,{\text{multiplier}}\,{\text{for}}\,{\text{time}}\,{t}_{i}\\ {\varepsilon }_{i},{\kappa }_{i},\,{\text{and}}\,{\gamma }_{i} & = & {\text{stochastic}}\,{\text{noise}}\,{\text{variables}}\end{array}$$

The hourly multiplier term $h{m}_{{t}_{i}}$ captures the fact that depending on the time of day, a driver passing through the same location can experience drastically different pollution levels due to traffic patterns or meteorological variations. Hourly multipliers were computed using data from the US embassy’s stationary PM_2.5 sensor in Beijing. The sensor measures the ambient PM_2.5 once every hour, and we converted the embassy measurements to hourly multipliers by dividing each measurement by the overall median of the embassy measurements for September 2019. As an example, an hourly multiplier of 1.2 means that, in a given hour on a given day, the PM_2.5 concentration was 20% higher than the overall median for September 2019. By scaling according to the embassy measurements, we introduce realistic temporal variation into the synthetic observations.

Remaining variables in Eq. (6) are stochastic: ε_i ~ N(0, 1), κ_i ~ Bern(0.05), and γ_i ~ Gamma(5, 5). κ_iγ_i is a stochastic term that only affects 5% of the measurements, and corrupts them with an additive PM_2.5 value drawn from the Gamma(5, 5) distribution. The Gamma(5, 5) distribution has a positive support and a long tail, so this term represents sudden pollution spikes like the exhaust emissions from a passing truck or dust from unpaved roads, a phenomenon observed in our Kigali data and also reported in previous mobile sensing campaigns⁶⁵. ε is always present and represents random noise, including possible measurement error. These stochastic interferences further complicate the recovery of the spatial PM_2.5 hotspots.

Two evaluation areas ~3 × 3 km in size were selected. The first evaluation area is in the East of Beijing and is referred to as ‘CLT’, because the area encompasses a landmark building called the China Life Tower (39°55’26”N, 116°26’22”E). The second evaluation area is in the Southeast of the city, denoted ‘LP’ because it encompasses the Longtan Park (39°52’47”N, 116°26’25”E). Although Wang et al.⁴³ published PM_2.5 concentrations estimates for all four seasons, we only used their Autumn estimates.

We address two main questions when evaluating our method on the simulated Beijing data:

Do the hotspot scores reflect the spatial profile of average PM_2.5 concentrations throughout the city?
Are the hotspot scores probabilistically well-calibrated?

To address the first question, we plot the spatial distributions of the hotspot scores and of the gridded average PM_2.5 concentrations side by side.

We also compute the Spearman rank correlation coefficient ρ(h, PM_2.5) between the hotspot scores h ≔ (h(j))_j∈J and the spatial average pollution values, ${\text{PM}}_{2.5}:={({\text{PM}}_{2.5}^{j})}_{j\in J}$:

$$\rho (h,{{\text{PM}}}_{2.5}):=\frac{{\text{Cov}}(R(h),R({\text{PM}}_{2.5}))}{{\sigma }_{R(h)}\cdot {\sigma }_{R({\text{PM}}_{2.5})}}$$

(7)

Here R(h) and R(PM_2.5) are the rank vectors of the hotspot scores and the spatial average PM_2.5 values, respectively, while the denominator terms are the standard deviations of these rank vectors. Spearman’s rank correlation coefficient ρ is a non-parametric measure of correlation which is robust to non-linearity. It is bounded between 1, which means a perfect positive correlation, and −1, which means a perfect negative correlation. The closer ρ is to 1, the better the hotspot scores are at capturing the spatial pollution profile.

To address the second question: recall that the hotspot scores are estimated posterior probabilities for the event {f(x_i) > median(y)} (Eq. (3)). In the simulated data, f(x_i) is known for every observation x_i. Hence we can perform a calibration analysis, comparing the occurrence of the event {f(x_i) > median(y)} against the hotspot scores, which are the estimated probabilities of this event. Perfect calibration would mean that in every tile, the hotspot score equals the proportion of observations where the event {f(x_i) > median(y)} occurs, which we henceforth call the exceedance proportion.

To perform a calibration analysis, we split the mobile sensing data into a test set, containing observations from six randomly-selected days in September 2019, and a training set, containing the observations from the remaining twenty four days. Hotspot scores are computed using only the training data, then the calibrations are evaluated on the test data. Test observations are grouped into ten uniformly-spaced bins according to the hotspot score of their tile, i.e.,

$$\begin{array}{l}{\text{bin}}_{1}=\{({{\bf{x}}}_{i},{y}_{i})\in {\mathcal{X}}\times {\mathcal{Y}}:h({{\bf{x}}}_{i})\in [0,0.1]\}\\ {\text{bin}}_{2}=\{({{\bf{x}}}_{i},{y}_{i})\in {\mathcal{X}}\times {\mathcal{Y}}:h({{\bf{x}}}_{i})\in (0.1,0.2]\}\\ \vdots \\ {\text{bin}}_{10}=\{({{\bf{x}}}_{i},{y}_{i})\in {\mathcal{X}}\times {\mathcal{Y}}:h({{\bf{x}}}_{i})\in (0.9,1.0]\}\end{array}$$

(8)

which enables plotting reliability diagrams to show the actual exceedance proportion among observations in each bin. We also report the Brier score:

$$\begin{array}{rcl}{\text{Brier}}\,{\text{score}} & = & \frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}{(h({{\bf{x}}}_{i})-{b}_{i})}^{2}\\ {\text{where}}\,{b}_{i} & = & \left\{\begin{array}{l}\begin{array}{cc}1 & f({{\bf{x}}}_{i}) > {\text{median}}(y)\end{array}\\ \begin{array}{cc}0 & f({{\bf{x}}}_{i})\le {\text{median}}(y)\end{array}\end{array}\right.\\ {\text{and}}\,h({{\bf{x}}}_{i}) & = & {\text{hotspot}}\,{\text{score}}\,{\text{of}}\,{\text{tile}}\,{\text{containing}}\,{{\bf{x}}}_{i}\end{array}$$

(9)

which is the average squared error between the hotspot scores and the binary variable b_i for whether the i’th observation exceeds the median. Additionally, we report the expected calibration error (ECE):

$$\begin{array}{rcl}{\text{ECE}} & = & \mathop{\sum }\limits_{k=1}^{10}\frac{| {\text{bin}}_{k}| }{10}| {\text{exc}}({\text{bin}}_{k})-{\text{conf}}({\text{bin}}_{k})| \\ {\text{where}}\,{\text{exc}}({\text{bin}}_{k}) & = & \frac{| \{({{\bf{x}}}_{i},{y}_{i})\in {{\text{bin}}}_{k}:f({{\bf{x}}}_{j}) > {\text{median}}(y)\}| }{| {\text{bin}}_{k}| }\\ {\text{and}}\,{\text{conf}}({\text{bin}}_{k}) & = & \frac{1}{| {\text{bin}}_{k}| }\mathop{\sum }\limits_{i:({{\bf{x}}}_{i},{y}_{i})\in {\text{bin}}_{k}}h({{\bf{x}}}_{i})\end{array}$$

(10)

which is the mean absolute difference between the exceedance proportion and the average hotspot score across the ten bins⁷⁷.

Calibration regression

As part of the analysis we also evaluate whether a calibration regression can improve the probabilistic calibration. A calibration regression is a model regressing the observed frequencies in the data on the predicted probabilities. It is commonly applied to transform the outputs of a model that predicts probabilities in the supervised learning setting, with the aim to improve the accuracy of the predicted probabilities^78,79.

As argued by Dormann⁸⁰, a calibration regression is not necessary when the goal of the hotspot-detection is merely to rank the tiles by their average pollution, because calibration regression procedures are generally monotone transformations. However, a calibration regression is important when one wants to interpret the hotspot scores as posterior probabilities. In Section “Results” we investigate whether an out-of-the-box isotonic regression transformation improves the calibration^78,81. Isotonic regression entails fitting an isotonic (monotonic-increasing) function m to transform the hotspot scores h_i into calibrated hotspot scores ${h}_{i}^{{\text{cal}}}$:

$${h}_{i}^{{\text{cal}}}=m({h}_{i})+{\varepsilon }_{i}$$

(11)

Using the sklearn implementation of the isotonic regressor⁸², we investigate whether this transformation reduces the expected calibration error of the hotspot scores in the simulated data.

Data availability

The collected and processed Kigali data is available at https://zenodo.org/records/15206350. The underlying code for this study is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author.

Code availability

The underlying code for this study is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author.

References

World Health Organization. WHO Global Air Quality Guidelines: Particulate Matter (PM_2.5 and PM₁₀), Ozone, Nitrogen Dioxide, Sulfur Dioxide and Carbon Monoxide (World Health Organization, 2021).
Xing, Y.-F., Xu, Y.-H., Shi, M.-H. & Lian, Y.-X. The impact of PM_2.5 on the human respiratory system. J. Thorac. Dis. 8, E69–74 (2016).
Google Scholar
Zhang, D. et al. Air pollution exposure and heart failure: a systematic review and meta-analysis. Sci. Total Environ. 872, 162191 (2023).
Article CAS Google Scholar
Gutiérrez-Avila, I. et al. Cardiovascular and cerebrovascular mortality associated with acute exposure to PM_2.5 in Mexico City. Stroke 49, 1734–1736 (2018).
Article Google Scholar
Stieb, D. M. et al. Associations of pregnancy outcomes and PM_2.5 in a National Canadian Study. Environ. Health Perspect. 124, 243–249 (2016).
Article CAS Google Scholar
Bowe, B. et al. The 2016 global and national burden of Diabetes Mellitus attributable to PM_2.5 air pollution. Lancet Planet. Health 2, e301–e312 (2018).
Article Google Scholar
GBD 2019 Diseases and Injuries Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet 396, 1204–1222 (2020).
Southerland, V. A. et al. Global urban temporal trends in fine particulate matter (PM_2.5) and attributable health burdens: estimates from global datasets. Lancet Planet. Health 6, e139–e146 (2022).
Article Google Scholar
Martin, R. V. et al. No one knows which city has the highest concentration of fine particulate matter. Atmos. Environ. X 3, 100040 (2019).
CAS Google Scholar
Apte, J. et al. Air inequality: global divergence in urban fine particulate matter trends https://chemrxiv.org/engage/chemrxiv/article-details/60c75932702a9baa0818ce61 (2021).
Wei, J. et al. First close insight into global daily gapless 1 km PM_2.5 pollution, variability, and health impact. Nat. Commun. 14, 8349 (2023).
Article CAS Google Scholar
Alvarado, M. J. et al. Evaluating the use of satellite observations to supplement ground-level air quality data in selected cities in low- and middle-income countries. Atmos. Environ. 218, 117016 (2019).
Article CAS Google Scholar
Gao, M., Cao, J. & Seto, E. A distributed network of low-cost continuous reading sensors to measure spatiotemporal variations of PM_2.5 in Xi’an, China. Environ. Pollut. 199, 56–65 (2015).
Article CAS Google Scholar
Lu, Y., Giuliano, G. & Habre, R. Estimating hourly PM_2.5 concentrations at the neighborhood scale using a low-cost air sensor network: a Los Angeles case study. Environ. Res. 195, 110653 (2021).
Article CAS Google Scholar
Considine, E. M., Reid, C. E., Ogletree, M. R. & Dye, T. Improving accuracy of air pollution exposure measurements: statistical correction of a municipal low-cost airborne particulate matter sensor network. Environ. Pollut. 268, 115833 (2021).
Article CAS Google Scholar
Song, J., Han, K. & Stettler, M. E. J. Deep-maps: machine-learning-based mobile air pollution sensing. IEEE Internet Things J. 8, 7649–7660 (2021).
Article Google Scholar
Song, J. et al. Toward high-performance map-recovery of air pollution using machine learning. ACS EST Eng. 3, 73–85 (2022).
Article Google Scholar
Cummings, L. E., Stewart, J. D., Reist, R., Shakya, K. M. & Kremer, P. Mobile monitoring of air pollution reveals spatial and temporal variation in an urban landscape. Front. Built Environ. 7, 648620 (2021).
Article Google Scholar
Hart, R., Liang, L. & Dong, P. Monitoring, mapping, and modeling spatial-temporal patterns of PM_2.5 for improved understanding of air pollution dynamics using portable sensing technologies. Int. J. Environ. Res. Public Health 17, 4914 (2020).
Article Google Scholar
Lim, C. C. et al. Mapping urban air quality using mobile sampling with low-cost sensors and machine learning in Seoul, South Korea. Environ. Int. 131, 105022 (2019).
Article CAS Google Scholar
Bousiotis, D. et al. Pinpointing sources of pollution using citizen science and hyperlocal low-cost mobile source apportionment. Environ. Int. 193, 109069 (2024).
Article CAS Google Scholar
Leung, Y. et al. Integration of air pollution data collected by mobile sensors and ground-based stations to derive a spatiotemporal air pollution profile of a city. Geogr. Inf. Syst. 33, 2218–2240 (2019).
Article Google Scholar
Noulas, A. et al. Modeling urban air quality using taxis as sensors. Preprint at https://arxiv.org/abs/2506.11720 (2025).
Li, J. J., Faltings, B., Saukh, O., Hasenfratz, D. & Beutel, J. Sensing the air we breathe — the OpenSense Zurich dataset. Proc. Conf. AAAI Artif. Intell. 26, 323–325 (2021).
Google Scholar
Dauchet, L. et al. Short-term exposure to air pollution: associations with lung function and inflammatory markers in non-smoking, healthy adults. Environ. Int. 121, 610–619 (2018).
Article CAS Google Scholar
Orellano, P., Reynoso, J., Quaranta, N., Bardach, A. & Ciapponi, A. Short-term exposure to particulate matter (PM₁₀ and PM_2.5), nitrogen dioxide (NO₂), and ozone (O₃) and all-cause and cause-specific mortality: systematic review and meta-analysis. Environ. Int. 142, 105876 (2020).
Article CAS Google Scholar
Collado, J. T. et al. Spatiotemporal assessment of PM_2.5 exposure of a high-risk occupational group in a Southeast Asian megacity. Aerosol Air Qual. Res. 23, 220134 (2023).
Article CAS Google Scholar
Gelfand, A. E. & Schliep, E. M. Spatial statistics and Gaussian processes: a beautiful marriage. Spat. Stat. 18, 86–104 (2016).
Article Google Scholar
Christianson, R. B., Pollyea, R. M. & Gramacy, R. B. Traditional kriging versus modern Gaussian processes for large-scale mining data. Stat. Anal. Data Min. 16, 488–506 (2023).
Article Google Scholar
Oliver, M. A. & Webster, R. A tutorial guide to geostatistics: computing and modelling variograms and kriging. Catena 113, 56–69 (2014).
Article Google Scholar
Stoddart, C. et al. Gaussian processes for monitoring air-quality in Kampala. Preprint at https://arxiv.org/abs/2311.16625 (2023).
Liu, H., Yang, C., Huang, M., Wang, D. & Yoo, C. Modeling of subway indoor air quality using Gaussian process regression. J. Hazard. Mater. 359, 266–273 (2018).
Article CAS Google Scholar
Wang, P. et al. A Gaussian process method with uncertainty quantification for air quality monitoring. Atmosphere 12, 1344 (2021).
Article CAS Google Scholar
Tang, M., Wu, X., Agrawal, P., Pongpaichet, S. & Jain, R. Integration of diverse data sources for spatial PM_2.5 data interpolation. IEEE Trans. Multimed. 19, 408–417 (2017).
Article Google Scholar
Cheng, Y., Li, X., Li, Z., Jiang, S. & Jiang, X. Fine-grained air quality monitoring based on Gaussian process regression. In International Conference on Neural Information Processing, 126–134, https://link.springer.com/chapter/10.1007/978-3-319-12640-1_16 (2014).
Goyal, P., Gulia, S. & Goyal, S. K. Identification of air pollution hotspots in urban areas - an innovative approach using monitored concentrations data. Sci. Total Environ. 798, 149143 (2021).
Article CAS Google Scholar
Bhardwaj, A. et al. Comprehensive monitoring of air pollution hotspots using sparse sensor networks. ACM J. Comput. Sustain. Soc. https://doi.org/10.1145/3748821 (2025).
Zhang, Y., Hannigan, M. & Lv, Q. Air pollution hotspot detection and source feature analysis using cross-domain urban data. In SIGSPATIAL '21: Proceedings of the 29th International Conference on Advances in Geographic Information Systems, 592−595 (2021).
Zheng, T., Bergin, M., Wang, G. & Carlson, D. Local PM_2.5 hotspot detector at 300 m resolution: a random forest–convolutional neural network joint model jointly trained on satellite images and meteorology. Remote Sens. 13, 1356 (2021).
Article Google Scholar
Getis, A. & Ord, J. K. The analysis of spatial association by use of distance statistics. Geogr. Anal. 24, 189–206 (1992).
Article Google Scholar
Ord, J. K. & Getis, A. Local spatial autocorrelation statistics: distributional issues and an application. Geographical Anal. 27, 286–306 (1995).
Article Google Scholar
Ruidas, D. & Pal, S. C. Potential hotspot modeling and monitoring of PM_2.5 concentration for sustainable environmental health in Maharashtra, India. Sustain. Water Resour. Manag. 8, 98 (2022).
Article Google Scholar
Wang, Y. et al. Ultra-high-resolution mapping of ambient fine particulate matter to estimate human exposure in Beijing. Commun. Earth Environ. 4, 451 (2023).
Article Google Scholar
Subramanian, R. et al. Air pollution in Kigali, Rwanda: spatial and temporal variability, source contributions, and the impact of car-free sundays. Clean Air J. 30 https://www.researchgate.net/publication/346916819_Air_pollution_in_Kigali_Rwanda_spatial_and_temporal_variability_source_contributions_and_the_impact_of_car-free_Sundays Accessed: 2025-03-28 (2020).
Kagabo, A. S. Spatial-Temporal variation of particulate matter less than 2.5 microns (PM_2.5) concentrations in Kigali https://www.researchgate.net/publication/343971560_Spatial-Temporal_Variation_of_Particulate_Matter_less_than_25_microns_PM25_concentrations_in_Kigali (2018).
Henninger, S. M. When air quality becomes deleterious—a case study for Kigali, Rwanda. J. Environ. Prot. 04, 1–7 (2013).
Article CAS Google Scholar
OpenStreetMap contributors. Planet dump retrieved from https://planet.osm.org (2017).
Pesaresi, M. et al. Advances on the global human settlement layer by joint assessment of Earth observation and population survey data. Int. J. Digital Earth 17 https://doi.org/10.1080/17538947.2024.2390454 (2024).
Igarashi, J. Digital elevation model (DEM), 10 m resolution, Rwanda. https://geohub.data.undp.org/data/00d5add9be37e465398b081683c3ec03#Info (2024).
Manirakiza, V., Mugabe, L., Nsabimana, A. & Nzayirambaho, M. City profile: Kigali, Rwanda. Environ. Urbanization ASIA 10, 290–307 (2019).
Article Google Scholar
Kloog, I., Ridgway, B., Koutrakis, P., Coull, B. A. & Schwartz, J. D. Long- and short-term exposure to PM_2.5 and mortality: using novel exposure models. Epidemiology 24, 555–561 (2013).
Article Google Scholar
Yu, W. et al. Estimates of global mortality burden associated with short-term exposure to fine particulate matter (PM_2.5). Lancet Planet. Health 8, e146–e155 (2024).
Article Google Scholar
Hensman, J., Fusi, N. & Lawrence, N. D. Gaussian processes for big data. In UAI'13: Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 282−290 (2013).
Deisenroth, M. P. & Ng, J. W. Distributed Gaussian processes. In ICML'15: Proceedings of the 32nd International Conference on Machine Learning, Vol. 37, 1481−1490 (2015).
Kumar, P., Omidvarborna, H., Tiwari, A. & Morawska, L. The nexus between in-car aerosol concentrations, ventilation and the risk of respiratory infection. Environ. Int. 157, 106814 (2021).
Article CAS Google Scholar
Grana, M., Toschi, N., Vicentini, L., Pietroiusti, A. & Magrini, A. Exposure to ultrafine particles in different transport modes in the city of Rome. Environ. Pollut. 228, 201–210 (2017).
Article CAS Google Scholar
Messier, K. P. et al. Mapping air pollution with Google Street View cars: efficient approaches with mobile monitoring and land use regression. Environ. Sci. Technol. 52, 12563–12572 (2018).
Article CAS Google Scholar
Hofman, J. et al. Opportunistic mobile air quality mapping using sensors on postal service vehicles: from point clouds to actionable insights. Front. Environ. Health 2 https://doi.org/10.3389/fenvh.2023.1232867 (2023).
Brantley, H. L. et al. Mobile air monitoring data-processing strategies and effects on spatial air pollution trends. Atmos. Meas. Tech. 7, 2169–2183 (2014).
Article Google Scholar
Wu, J. et al. Applying land use regression model to estimate spatial variation of PM_2.5 in Beijing, China. Environ. Sci. Pollut. Res. 22, 7045–7061 (2014).
Article Google Scholar
U.S. Environmental Protection Agency. Air quality models https://www.epa.gov/scram/air-quality-models (2024).
Baek, B. H. et al. The comprehensive automobile research system (CARS) – a Python-based automobile emissions inventory model. Geosci. Model Dev. 15, 4757–4781 (2022).
Article CAS Google Scholar
Sun, Y. et al. Impact of heavy PM_2.5 pollution events on mortality in 250 Chinese counties. Environ. Sci. Technol. 56, 8299–8307 (2022).
Article CAS Google Scholar
Perry, N. et al. Kigali PM_2.5 dataset openseneca September 2021. Zenodo https://doi.org/10.5281/zenodo.15206350 (2025).
Van den Bossche, J. et al. Mobile monitoring for mapping spatial variation in urban air quality: development and validation of a methodology based on an extensive dataset. Atmos. Environ. 105, 148–161 (2015).
Article Google Scholar
Apte, J. S. et al. High-resolution air pollution mapping with Google Street View cars: exploiting big data. Environ. Sci. Technol. 51, 6999–7008 (2017).
Article CAS Google Scholar
Tessum, M. W. et al. Mobile and fixed-site measurements to identify spatial distributions of traffic-related pollution sources in Los Angeles. Environ. Sci. Technol. 52, 2844–2853 (2018).
Article CAS Google Scholar
Bukowiecki, N. et al. A mobile pollutant measurement laboratory-measuring gas phase and aerosol ambient concentrations with high spatial and temporal resolution. Atmos. Environ. 36, 5569–5579 (2002).
Article CAS Google Scholar
Hasenfratz, D. et al. Deriving high-resolution urban air pollution maps using mobile sensor nodes. Pervasive Mob. Comput. 16, 268–285 (2015).
Article Google Scholar
Matthews, A. G. d. G. et al. GPflow: a Gaussian process library using TensorFlow. J. Mach. Learn. Res. 18, 1299–1304 (2017).
Google Scholar
Patel, Z. B., Purohit, P., Patel, H. M., Sahni, S. & Batra, N. Accurate and scalable Gaussian processes for fine-grained air quality inference. Proc. Conf. AAAI Artif. Intell. 36, 12080–12088 (2022).
Google Scholar
Rasmussen, C. E. & Williams, C. K. Regression, 7–32 (The MIT Press, 2005).
Breiman, L. Bagging predictors. Mach. Learn. 24, 123–140 (1996).
Article Google Scholar
Chen, T. & Ren, J. Bagging for Gaussian process regression. Neurocomputing 72, 1605–1610 (2009).
Article Google Scholar
Wong, D. C. et al. WRF-CMAQ two-way coupled system with aerosol feedback: software development and preliminary results. Geosci. Model Dev. 5, 299–312 (2012).
Article Google Scholar
Hagberg, A. A., Schult, D. A. & Swart, P. J. Exploring network structure, dynamics, and function using NetworkX. In Proceedings SciPy,11–15 (2008).
Pakdaman Naeini, M., Cooper, G. & Hauskrecht, M. Obtaining well calibrated probabilities using Bayesian binning. In Proceedings of the AAAI Conference on Artificial Intelligence, 29, https://doi.org/10.1609/aaai.v29i1.9602 (2015).
Niculescu-Mizil, A. & Caruana, R. Predicting good probabilities with supervised learning. In Proceedings of the 22nd international conference on Machine learning 625–632 https://doi.org/10.1145/1102351.1102430 (2005).
Zadrozny, B. & Elkan, C. Transforming classifier scores into accurate multiclass probability estimates. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 694–699 https://doi.org/10.1145/775047.775151 (2002).
Dormann, C. F. Calibration of probability predictions from machine-learning and statistical models. Glob. Ecol. Biogeogr. 29, 760–765 (2020).
Article Google Scholar
Barlow, R. E. & Brunk, H. D. The isotonic regression problem and its dual. J. Am. Stat. Assoc. 67, 140–147 (1972).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Google Scholar

Download references

Acknowledgements

We would like to thank Krzysztof Cybulski and Lukas Graz from the Statistical Consulting team at ETH Zürich for their advice on the methodology. We also thank the following partners for funding the data collection in Kigali: the Wuppertal Institute, UN-Habitat, and the United Nations Environment Programme (UNEP), and the International Climate Protection Initiative (IKI) of the German Ministry of the Environment. Additionally, we thank the following partners who assisted with collecting the data: the University of Rwanda, the City of Kigali, the Urban Electric Mobility Initiative, and Ampersand Rwanda Ltd. Those affiliated with the University of Cambridge acknowledges funding by the Engineering and Physical Sciences Research Council Centre for Doctoral Training in Sensor Technologies and Applications (EP/L015889/1). We thank the Centre for Global Equality (CGE) for its continued support of open-seneca and its involvement in the Urban Pathways project. We thank Yongyue Wang from Tsinghua University for kindly sharing the Beijing data.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich.

Author information

Authors and Affiliations

Eidgenössische Technische Hochschule Zürich, Zürich, Switzerland
Niál Perry, Peter P. Pedersen, Emanuel Nussli & Sanelma Heinonen
University of Cambridge, Philippa Fawcett Drive, Cambridge, UK
Peter P. Pedersen, Charles N. Christensen, Lorena Gordillo Dagallier, Raphaël Jacquat, Sebastian Horstmann & Christoph Franck

Authors

Niál Perry
View author publications
Search author on:PubMed Google Scholar
Peter P. Pedersen
View author publications
Search author on:PubMed Google Scholar
Charles N. Christensen
View author publications
Search author on:PubMed Google Scholar
Emanuel Nussli
View author publications
Search author on:PubMed Google Scholar
Sanelma Heinonen
View author publications
Search author on:PubMed Google Scholar
Lorena Gordillo Dagallier
View author publications
Search author on:PubMed Google Scholar
Raphaël Jacquat
View author publications
Search author on:PubMed Google Scholar
Sebastian Horstmann
View author publications
Search author on:PubMed Google Scholar
Christoph Franck
View author publications
Search author on:PubMed Google Scholar

Contributions

N.P.: Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing - original draft. P.P.P.: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing - original draft. C.N.C.: Investigation, Project administration, Supervision, Writing - original draft. E.N.: Data curation, Methodology, Software, Writing - review & editing. S.He.: Data curation, Methodology, Software, Writing - review & editing. L.D.G.: Funding acquisition, Investigation, Writing - review & editing. R.J.: Funding acquisition, Investigation, Writing - review & editing. S.Ho.: Funding acquisition, Investigation, Writing - review & editing. C.F.: Funding acquisition, Investigation, Writing - review & editing.

Corresponding author

Correspondence to Niál Perry.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Perry, N., Pedersen, P.P., Christensen, C.N. et al. Detecting urban PM_2.5 hotspots with mobile sensing and Gaussian process regression. npj Clean Air 1, 38 (2025). https://doi.org/10.1038/s44407-025-00038-1

Download citation

Received: 05 September 2025
Accepted: 06 November 2025
Published: 04 December 2025
Version of record: 04 December 2025
DOI: https://doi.org/10.1038/s44407-025-00038-1