Introduction

General background of study

Air pollution is a major issue worldwide due to its negative effects on human health, the environment, and the climate1. Among the criteria pollutants used to determine levels of air pollution, six parameters are generally considered most concerning: SO2, NO2, CO, O3, PM2.5, and PM102. According to the World Health Organisation (WHO), nine out of ten individuals breathe air in which these pollutants exceed the WHO guideline limits, and air pollution causes roughly seven million premature deaths globally each year. In addition to impairing visibility, air pollution can alter the balance of solar radiation directly or indirectly3 and may even trigger more severe weather conditions such as drought and flooding4. Meteorological conditions influence the concentration of air pollutants in ambient air and contribute to their transport, emission, chemical formation, and deposition; accounting for these effects is therefore crucial to any action or management initiative to reduce air pollution5.

Variations in meteorological conditions may lead to inaccurate conclusions about the effectiveness of management or intervention measures and can make it difficult to determine trends in different air contaminants accurately6. It is therefore essential to separate weather effects from air quality data trends and to identify precisely which changes in air quality are policy-driven4. Estimating air pollution parameters with theoretical, algorithm-based models is a reliable alternative; regression models, for example, mathematically represent statistical correlations, quantifying the influence of multiple independent variables on a single dependent variable. Big data is gradually affecting every aspect of daily life, and data resources will become increasingly valuable. The use of big data thinking and artificial intelligence (AI) diagnostic tools can benefit environmental governance7. In addition, online sensor networks providing real-time data, combined with citizen participation in environmental governance, offer a novel scientific basis for government decision-making in public ecological tracking and early warning8,9. The number of countries monitoring air quality has increased dramatically in recent years10,11. These developments in monitoring infrastructure can be attributed to recently constructed or expanded government monitoring networks and to the crucial contributions of non-governmental groups and concerned citizens worldwide. Although progress has been achieved, several regions still lack air quality monitoring, leaving a substantial portion of the population without the information needed to manage pollution exposure and make informed health decisions.

Literature review

Deep learning (DL) and machine learning (ML) models have significantly contributed to recent developments in air quality monitoring and forecasting. These models have demonstrated extraordinary potential in predicting air pollution levels and identifying contamination sources12,13.

A co-training framework proposed for real-time air quality monitoring in Beijing and Shanghai is noteworthy. By integrating spatial and temporal classifiers such as artificial neural networks (ANN) and conditional random fields (CRF), this approach outperformed traditional models such as decision trees and linear interpolation. The use of real-time meteorological and traffic flow data proved beneficial in improving accuracy14. The key technological advancement here was the integration of various data sources for real-time monitoring. However, a significant challenge remains in the complex integration of these models, particularly when scaling to larger urban areas.

In addition, wavelet-ANN models were applied to short-term air pollution forecasting in Xi’an and Lanzhou. The wavelet-ANN (WANN) model demonstrated superior performance in predicting air pollution indices (API), providing a higher R-value (0.8906) than traditional ANN models. This improvement was attributed to the WANN’s ability to capture non-linear patterns in pollution data. However, the computational cost and the complexity of processing large datasets remained a challenge15.

The use of mobile air quality monitoring systems has increased, as illustrated by a study in Beijing in which electric vehicles equipped with real-time sensors collected PM2.5 data. These mobile measurements were mapped using decision tree models, which significantly outperformed fixed monitoring stations. The advantage of mobile sensors is their ability to provide high-resolution air quality data, but the approach poses challenges in fleet management and sensor calibration across a wide area16.

In terms of cost-effective solutions, the Deep-MAPS framework utilized mobile and fixed air quality sensors to estimate PM2.5 concentrations, delivering results at a resolution of 1 km × 1 km and 1 h. This model reduced hardware costs by up to 90% compared to conventional fixed sensor methods, providing a more economical way to monitor urban air quality. However, challenges persist in ensuring secure network coverage and in expanding the sensor network for broader geographic coverage17.

The MCST-Tree model proposed for space–time learning of air quality in Chengdu incorporated both mobile and fixed sensor data, achieving high accuracy (R2 = 0.94 for PM2.5) even with sparse data. This model can handle space–time data gaps, but maintaining high accuracy with limited sensor data remains an ongoing challenge18. Also in Chengdu, the Multi-AP learning system was introduced for high-resolution pollutant mapping. This method produced detailed hourly pollution maps, though at some cost to computational efficiency and accuracy. Managing the complexity of real-time predictions, particularly for large-scale urban areas, remains a challenge19.

For long-term forecasting, an ANN-based model for PM2.5 concentrations in Liaocheng demonstrated high accuracy (R = 0.9570) using Bayesian regularization. Although this approach was successful in long-term predictions, mitigating overfitting and guaranteeing consistent performance across a variety of conditions remain challenges20. For forecasting volatility, a hybrid XGBoost-GARCH-MLP model was employed for PM2.5 volatility prediction, providing better long-term prediction accuracy. The hybrid model’s strength was in incorporating volatility into the forecasting process, yet the complexity and high computational requirements of such models limit their practical application21. In addition, AI-based models such as ANN, CNN, and LSTM have been employed for climate and air quality forecasting in cities such as Jinan and Hohhot, where CNN-LSTM models showed superior performance. These multimodal forecasting techniques provide a great opportunity to improve air quality predictions, though challenges persist in model generalization and integration across diverse regions22. ST-Exposure, a promising model, utilizes fixed and mobile sensors to predict PM2.5 exposure on a pixel-wise basis. This model achieved an SMAPE below 15%, indicating its potential in high-resolution exposure predictions. However, sparse sensor deployment and data integration remain obstacles to achieving optimal accuracy23.

While significant efforts have been made in applying ML and DL techniques to air quality forecasting, challenges related to data quality, computational complexity, model integration, and coverage persist. Future advances in sensor technology, data fusion, and model optimization will be crucial for extending these methods to broader geographical applications and improving their practical utility.

Research objectives and novelty

While deep learning models such as ANN, CNN, and LSTM have shown strong performance in predicting air pollution, each comes with its own set of challenges. ANN models are susceptible to overfitting and often struggle to generalize across different geographical areas. CNNs are good at identifying spatial features, but they typically need large datasets and may fail to capture time-based patterns. LSTMs handle temporal data well, but they require significant computational resources and can perform poorly when data is noisy or incomplete. Additionally, many deep learning approaches have trouble integrating diverse data types—like weather, traffic, and sensor inputs—and scaling efficiently in complex urban settings. These limitations point to the importance of enhanced data preprocessing, hybrid model approaches, and transfer learning techniques. This research therefore addresses the forecasting of six fundamental pollutants—Carbon Monoxide (CO), Nitrogen Dioxide (NO2), Ozone (O3), Sulphur Dioxide (SO2), Fine Particulate Matter (PM2.5), and Coarse Particulate Matter (PM10)—using state-of-the-art machine learning approaches, specifically Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs). These pollutants are particularly important due to their adverse effects on human health and the environment.

The research uses data from the Beijing Municipal Environmental Monitoring Center (BMEMC) from March 2013 to February 2017, which includes meteorological data and pollutant levels from 10 nationally controlled monitoring sites. This study intends to provide hourly predictions that enable a more accurate assessment of the health-related impacts of air pollution, unlike traditional models, which often offer poorer temporal resolution.

This study presents a unique approach to air pollution prediction using DNNs and CNNs based on multivariate time series analysis. Until now, only a few ML studies have attempted to apply such high temporal resolution to pollutant concentration prediction using feature-encoded DNN and CNN frameworks with both encoded and unencoded features. The contribution lies in capturing intricate spatiotemporal structures that tend to be masked in heavily polluted urban settings, where pollution levels fluctuate dramatically over time. The study utilizes sophisticated data pre-processing, including interpolation of missing values, as well as thorough exploratory analysis using box plots to ensure the accuracy and integrity of the data. This strategy enhances the accuracy of air quality predictions and the model’s scalability for real-time forecasting; likewise, urban planners and public health regulators can monitor hourly pollutant concentrations in their areas to implement effective pollution management methods. This work aims to provide reliable emission prediction models that could guide decisions regarding human health protection and environmental sustainability.

Study area and data

In January 2013, Beijing established 36 air-quality monitoring sites, 35 of which are Beijing Municipal Environmental Monitoring Center (BMEMC) sites, with one at the US Embassy in Beijing24. The current dataset comprises hourly air pollutant data from 10 national air quality monitoring stations: Aotizhongxin, Changping, Dongsi, Guanyuan, Huairou, Nongzhanguan, Shunyi, Tiantan, Wanliu, and Wanshouxigong. These ten stations were chosen because their data are freely accessible. The meteorological data for each air-quality site are matched with the nearest weather station of the China Meteorological Administration (CMA). The data were recorded hourly from 01/03/2013 to 28/02/2017. The datasets include four time attributes (year, month, day, and hour), six principal air pollutants (PM2.5, PM10, SO2, NO2, CO, and O3 concentrations, all in µg/m3), and six relevant meteorological variables (temperature (°C), pressure (hPa), dew point temperature (°C), rainfall (mm), wind direction, and wind speed (m/s)).

Artificial intelligence models

Deep neural networks (DNN)

Artificial neural networks (ANNs) are an effective machine learning technique inspired by the structure of the human brain. Through self-learning, ANNs can identify patterns and hidden correlations in datasets25. Furthermore, a particular type of ANN called a deep neural network (DNN) has numerous layers of connected nodes, which enables it to represent more complex data relationships and perform better than traditional ANNs26.

The DNN model typically has an input layer, three or more hidden layers, and an output layer. The input layer receives the data, which is then transformed by the hidden layers, and the output layer generates a forecast. Figure 1a displays the DNN architecture employed in this investigation. A single neuron receives inputs, multiplies each input by the corresponding weight (\(W\)), adds a bias (\(b\)), and then passes the sum through an activation function (\(f(x)\)) to produce an output. The weights and biases determine the impact of each input, while the activation function provides nonlinearity and enables the model to learn complex patterns27.

Fig. 1

Schematic diagrams of deep learning models; (a) DNN, (b) CNN.

A backpropagation approach is employed to adjust the weights between the nodes when training a DNN model. This strategy reduces the disparity between the forecasted and actual output by updating the weights until the network can reliably predict new data. Given sufficient training, a DNN model can identify complex relationships in data and generate precise predictions when presented with new data28.
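To make this architecture concrete, the following is a minimal sketch assuming a Keras/TensorFlow implementation (the paper does not name the framework); the 32 → 16 → 8 hidden-layer sizes follow the configuration reported later in Table 2, while the input shape and feature count are illustrative.

```python
# Minimal DNN sketch for one station's next-hour pollutant forecast.
# Assumptions: Keras/TensorFlow backend; inputs are windows of shape
# (window_size, n_features) that are flattened before the dense layers.
import tensorflow as tf
from tensorflow.keras import layers

def build_dnn(window_size: int = 10, n_features: int = 16) -> tf.keras.Model:
    """Fully connected network mapping a flattened input window to one pollutant value."""
    model = tf.keras.Sequential([
        layers.Input(shape=(window_size, n_features)),
        layers.Flatten(),                     # flatten the 10-hour window into one feature vector
        layers.Dense(32, activation="relu"),  # hidden layer 1
        layers.Dense(16, activation="relu"),  # hidden layer 2
        layers.Dense(8, activation="relu"),   # hidden layer 3
        layers.Dense(1),                      # output: next-hour concentration
    ])
    return model
```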

Convolutional neural network (CNN)

The convolutional and pooling layers are the essential elements of feature extraction in a CNN29. 1D-CNNs are typically applied to time series data owing to their strong feature extraction capabilities30. Alternating convolutional and pooling layers in the 1D-CNN enable the extraction of non-linear features from raw data, and the fully connected layer completes adaptive feature learning31.

The basic architecture of the CNN is outlined in Fig. 1b, comprising an input layer, several convolutional layers, several pooling layers, a fully connected layer, and an output layer. The convolutional and pooling layers are connected in an alternating fashion. In this arrangement, the CNN feature extraction module comprises the input, convolutional, and pooling layers, while the output module includes the fully connected and output layers32. The complete calculation formula is outlined in Eq. (1).

$$y_{j} = f\left( \sum_{i \in M_{j}} x_{i}^{l-1} \otimes w_{i,j}^{l} + b_{j}^{l} \right)$$
(1)

where \(f\) is the activation function, \(\otimes\) is the convolution operator, \(w\) is the weight matrix, and \(b\) is the bias term.
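A corresponding minimal 1D-CNN sketch is given below, again assuming Keras/TensorFlow; the 32 filters and kernel size of 2 follow the configuration reported later in Table 2, while the pooling and dense-layer sizes are illustrative.

```python
# Minimal 1D-CNN sketch following Fig. 1b and Eq. (1).
# Assumptions: Keras/TensorFlow backend; input windows of shape (window_size, n_features).
import tensorflow as tf
from tensorflow.keras import layers

def build_cnn(window_size: int = 10, n_features: int = 16) -> tf.keras.Model:
    """Conv1D + pooling feature extraction module followed by a fully connected output module."""
    model = tf.keras.Sequential([
        layers.Input(shape=(window_size, n_features)),
        layers.Conv1D(filters=32, kernel_size=2, activation="relu"),  # convolutional feature extraction
        layers.MaxPooling1D(pool_size=2),                             # pooling layer
        layers.Flatten(),
        layers.Dense(16, activation="relu"),                          # fully connected layer
        layers.Dense(1),                                              # output: next-hour concentration
    ])
    return model
```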

Model development and configuration

Ten nationally controlled air-quality monitoring sites—Aotizhongxin, Changping, Dongsi, Guanyuan, Huairou, Nongzhanguan, Shunyi, Tiantan, Wanliu, and Wanshouxigong—provided hourly air quality data for this study. The dataset spans from March 1, 2013, to February 28, 2017, and includes four temporal attributes (year, month, day, and hour), six major air pollutants (PM2.5, PM10, SO2, NO2, CO, and O3 in µg/m3), and six meteorological variables (dew point temperature (°C), air temperature (°C), pressure (hPa), wind direction, wind speed (m/s), and precipitation (mm)).

In preprocessing, each station’s dataset was analyzed to identify and impute missing values using linear interpolation (Table 1). The final cleaned dataset contained 35,064 instances per station. These were split into training, validation, and testing sets using a time-series window size of 10, yielding input/target shapes of (25,000, 10, number of features)/(25,000) for training, (5000, 10, number of features)/(5000) for validation, and (5054, 10, number of features)/(5054) for testing, as sketched below.

Table 1 Distribution of missing values (NaN) across columns in multiple stations.
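The interpolation, windowing, and chronological split can be sketched as follows with pandas/NumPy; the target column name and exact split indices are assumptions chosen to reproduce the reported (25,000 / 5,000 / 5,054) shapes rather than the authors' actual code.

```python
# Sketch of per-station preprocessing: impute gaps, build sliding windows, split chronologically.
import numpy as np
import pandas as pd

def make_windows(df: pd.DataFrame, target: str = "PM2.5", window_size: int = 10):
    """Linear interpolation of missing values, then windows of `window_size` past hours."""
    df = df.interpolate(method="linear").bfill()       # impute NaNs (bfill covers leading gaps)
    values = df.to_numpy(dtype="float32")
    y_col = df.columns.get_loc(target)
    X, y = [], []
    for i in range(len(df) - window_size):
        X.append(values[i:i + window_size])             # past 10 hours of all features
        y.append(values[i + window_size, y_col])        # next-hour target concentration
    X, y = np.array(X), np.array(y)
    # chronological split matching the reported shapes (35,064 rows -> 35,054 windows)
    X_train, y_train = X[:25000], y[:25000]
    X_val, y_val = X[25000:30000], y[25000:30000]
    X_test, y_test = X[30000:], y[30000:]
    return (X_train, y_train), (X_val, y_val), (X_test, y_test)
```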

To improve prediction accuracy (Fig. 2), the proposed method incorporates both temporal and spatial features. Temporal features (e.g., hour, day, month) are inherently cyclical. To model this periodicity, each cyclical feature was encoded using sine and cosine transformations, allowing the model to capture repeating patterns, specifically Eqs. (2)–(4):

Fig. 2

Model development and configuration for forecasting hourly air pollutants in 10 Beijing stations.

$$hour_{sin}=\sin\left(\frac{2\pi \cdot hour}{24}\right),\quad hour_{cos}=\cos\left(\frac{2\pi \cdot hour}{24}\right)$$
(2)
$$day_{sin}=\sin\left(\frac{2\pi \cdot day}{31}\right),\quad day_{cos}=\cos\left(\frac{2\pi \cdot day}{31}\right)$$
(3)
$$month_{sin}=\sin\left(\frac{2\pi \cdot month}{12}\right),\quad month_{cos}=\cos\left(\frac{2\pi \cdot month}{12}\right)$$
(4)

These transformations help preserve the cyclical continuity (e.g., hour 23 to hour 0) and support the model in learning seasonal or diurnal effects, especially during transitions such as dawn/dusk or seasonal changes.
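A possible implementation of this encoding is shown below, assuming integer hour, day, and month columns in a pandas DataFrame; dropping the raw columns afterwards is an illustrative choice, not something the paper specifies.

```python
# Sketch of the cyclical sine/cosine encoding in Eqs. (2)-(4).
import numpy as np
import pandas as pd

def encode_cyclical(df: pd.DataFrame) -> pd.DataFrame:
    """Add sine/cosine pairs for hour (period 24), day (31), and month (12)."""
    df = df.copy()
    for col, period in [("hour", 24), ("day", 31), ("month", 12)]:
        df[f"{col}_sin"] = np.sin(2 * np.pi * df[col] / period)
        df[f"{col}_cos"] = np.cos(2 * np.pi * df[col] / period)
    return df.drop(columns=["hour", "day", "month"])  # keep only the encoded versions (optional)
```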

Spatial features are derived from the geographical and industrial characteristics unique to each monitoring site. These include proximity to traffic, industrial zones, and residential areas, which introduce local dependencies into pollution patterns. Rather than generalizing across all locations, the model trains separately for each station to account for such location-specific dynamics.

The rationale for selecting DNN and CNN lies in their respective strengths:

  i. DNNs are effective for learning non-linear feature interactions, especially when the dataset includes mixed data types (e.g., meteorological and pollutant data).

  ii. CNNs are chosen for their ability to capture local patterns across the time dimension, as convolutional filters can detect trends and abrupt changes in short sequences—an important feature in hourly air pollution data.

Both models were implemented for each station independently to capture station-specific pollution dynamics. Furthermore, in both models we set window_size = 10, which refers to the number of consecutive time steps (or rows) used to construct a single input sample for the time series forecasting model. The encoded approach outperformed the unencoded baseline in capturing temporal fluctuations and spatial heterogeneity. This approach led to improved air quality forecasting accuracy and provided insights into region-specific pollution trends.

As per Table 2, to ensure fair and optimal performance of both DNN and CNN architectures across unencoded and encoded feature sets, a systematic hyperparameter tuning process was conducted using grid search and fivefold cross-validation on the training data, aiming to minimize validation RMSE. The model architectures were designed to balance complexity and generalization, with DNNs using three dense layers (32 → 16 → 8 units) for progressive feature abstraction and CNNs employing a kernel size of 2 with 32 filters for efficient local pattern extraction. The Adam optimizer with a learning rate of 0.005—selected from the range {0.001, 0.003, 0.005, 0.01}—offered stable convergence and the lowest validation error. MSE was used as the loss function, while RMSE served as the evaluation metric for its interpretability. Models were trained for 100 epochs, with early stopping applied to prevent overfitting.

Table 2 Components and descriptions for DNN and CNN models with Unencoded and Encoded features.
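The training configuration described above might be implemented roughly as follows in Keras/TensorFlow; the batch size and early-stopping patience are not reported in the paper and are therefore illustrative assumptions.

```python
# Sketch of the training setup: Adam (lr = 0.005), MSE loss, RMSE metric,
# 100 epochs with early stopping on the validation loss.
import tensorflow as tf

def train(model, X_train, y_train, X_val, y_val):
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=0.005),
        loss="mse",                                          # training loss
        metrics=[tf.keras.metrics.RootMeanSquaredError()],   # evaluation metric
    )
    early_stop = tf.keras.callbacks.EarlyStopping(
        monitor="val_loss",
        patience=10,                  # patience not reported in the paper; illustrative
        restore_best_weights=True,
    )
    return model.fit(
        X_train, y_train,
        validation_data=(X_val, y_val),
        epochs=100,
        batch_size=64,                # batch size not reported in the paper; illustrative
        callbacks=[early_stop],
        verbose=0,
    )
```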

Forecasting metrics

In this paper, the following metrics were applied to evaluate the performance of the unencoded and encoded deep learning models:

  i. Mean Absolute Error (MAE)

     $$MAE=\frac{1}{N}\times \sum_{i=1}^{N}\left|{P}_{i}-{O}_{i}\right|$$
     (5)

  ii. Mean Squared Error (MSE)

     $$MSE=\frac{1}{N}\times \sum_{i=1}^{N}{\left({P}_{i}-{O}_{i}\right)}^{2}$$
     (6)

  iii. Root Mean Square Error (RMSE)

     $$RMSE=\sqrt{MSE}=\sqrt{\frac{1}{N}\times \sum_{i=1}^{N}{\left({P}_{i}-{O}_{i}\right)}^{2}}$$
     (7)
  iv. Coefficient of Determination (R2)

     $${R}^{2}= 1-\frac{\sum_{i=1}^{N}{\left({P}_{i}-{O}_{i}\right)}^{2}}{\sum_{i=1}^{N}{\left({O}_{i}-\overline{O}\right)}^{2}}$$
     (8)
  v. Willmott Index (WI)

     $$WI= 1-\frac{{\sum }_{i=1}^{N}{({O}_{i}-{P}_{i})}^{2}}{{\sum }_{i=1}^{N}{(\left|{P}_{i}-\overline{O}\right|+\left|{O}_{i}-\overline{O}\right|)}^{2}}$$
     (9)
  vi. Kling-Gupta Efficiency (KGE)

     $$KGE= 1-\sqrt{{\left(PCC-1\right)}^{2}+{\left(\frac{std}{rd}-1\right)}^{2}+{\left(\frac{\overline{P}}{\overline{O}}-1\right)}^{2}}$$
     (10)

In the above equations, \({O}_{i}\) is the observed (actual) value of air pollutants. \({P}_{i}\) is the forecasted value of air pollutants. \(\overline{O }\) and \(\overline{P }\) are the average values of the observed and forecasted values of air pollutants, respectively. \(PCC\), \(std,\) and \(rd\) are the Pearson correlation coefficient, the standard deviation of forecasted values, and the standard deviation of observation values, respectively.
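For reference, Eqs. (5)–(10) can be computed directly with NumPy; the sketch below assumes P and O are one-dimensional arrays of forecasted and observed concentrations and follows the standard Kling-Gupta form of Eq. (10).

```python
# Sketch of the forecasting metrics in Eqs. (5)-(10).
import numpy as np

def forecast_metrics(P: np.ndarray, O: np.ndarray) -> dict:
    """Compute MAE, MSE, RMSE, R2, Willmott Index, and KGE for forecasts P against observations O."""
    mae = np.mean(np.abs(P - O))
    mse = np.mean((P - O) ** 2)
    rmse = np.sqrt(mse)
    r2 = 1 - np.sum((P - O) ** 2) / np.sum((O - O.mean()) ** 2)
    wi = 1 - np.sum((O - P) ** 2) / np.sum((np.abs(P - O.mean()) + np.abs(O - O.mean())) ** 2)
    pcc = np.corrcoef(P, O)[0, 1]                      # Pearson correlation coefficient
    kge = 1 - np.sqrt((pcc - 1) ** 2
                      + (P.std() / O.std() - 1) ** 2   # variability ratio (std / rd)
                      + (P.mean() / O.mean() - 1) ** 2)  # bias ratio
    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "R2": r2, "WI": wi, "KGE": kge}
```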

Results and discussion

Aotizhongxin station

At Aotizhongxin station (Fig. 3), DNN and CNN models showed distinct strengths across pollutants. CNN-Unencoded performed best for CO (RMSE: 483.5 µg/m3) and PM10 (KGE: 0.921), while DNN-Encoded led in NO2 (KGE: 0.914), O3 (RMSE: 12.4 µg/m3), and SO2 (KGE: 0.952). The dew point was the dominant predictor in ~ 83% of top-performing models. Rainfall and hourly features contributed minimally (< 10%). PM2.5 forecasts exhibited the highest variability. Overall, performance was driven by broad temporal and environmental patterns, with each model excelling in specific pollutant contexts.

Fig. 3

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Aotizhongxin Station.

Changping station

At Changping Station (Fig. 4), CNN-Encoded models showed consistently strong performance across pollutants. They achieved the lowest MAE and RMSE in 67% of cases and ranked highest in R2 or KGE for ~ 50%. For CO, CNN-Encoded had R2 = 0.849 and MAE = 279.9 µg/m3, outperforming CNN-Unencoded despite a slightly lower KGE. In NO2, CNN-Encoded reduced MAE by ~ 21% compared to DNN-Unencoded. O3 forecasts showed close performance: DNN-Encoded had the highest KGE (0.943), while CNN-Encoded achieved the lowest MAE (7.3 µg/m3). For PM2.5 and PM10, CNN-Unencoded slightly outperformed in KGE and R2, but CNN-Encoded had lower error metrics. SO2 results were mixed, with CNN-Unencoded leading in R2 (0.983), while DNN-Encoded topped in KGE (0.852). Dew point, month_cosine, and month were key predictors in over 80% of models, while rainfall and hourly features had < 10% impact.

Fig. 4

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Changping Station.

Dongsi station

At Dongsi Station (Fig. 5), CNN models—particularly CNN-Encoded—demonstrated superior forecasting accuracy for most pollutants. CNN-Encoded achieved the lowest MAE and RMSE in 60% of pollutants, excelling in CO (MAE: 244.2 µg/m3), NO2 (MAE: 6.93 µg/m3; R2: 0.912), and SO2 (MAE: 2.21 µg/m3). CNN-Unencoded led in KGE and WI for PM2.5 (KGE: 0.959) and PM10 (KGE: 0.934), while DNN-Encoded had the lowest RMSE for PM2.5 (22.05 µg/m3) and the highest KGE for O3 (0.934). Dew point, month_cosine, and temperature were key predictors in > 80% of cases, while rainfall had < 10% influence. Overall, CNN models outperformed others in MAE/RMSE in most cases, confirming their robustness in pollutant forecasting.

Fig. 5

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Dongsi Station.

Guanyuan station

At Guanyuan Station (Fig. 6), DNN-Encoded models outperformed in 60% of pollutants, achieving top KGE and NSE for CO (KGE: 0.949; NSE: 0.9), PM2.5, and SO2, along with the lowest MAE/RMSE for CO and PM2.5. CNN-Encoded led in O3 (KGE: 0.936; WI: 0.97) and PM10 (KGE: 0.949; WI: 0.982). DNN-Unencoded had the highest R2 for NO2 (0.916) and lowest MAE/RMSE for O3, despite lower KGE/NSE. Key predictors across models included dew point, month_cosine, and temperature (relevant in > 80% of top-performing models), while rainfall and hour_sine had < 10% impact. Overall, encoded models performed better on KGE/NSE/WI, while unencoded models excelled in MAE/RMSE for select pollutants, emphasizing pollutant-specific model suitability.

Fig. 6

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Guanyuan Station.

Huairou station

At Huairou Station (Fig. 7), CNN-Encoded models outperformed others in ~ 70% of pollutant forecasts, achieving top KGE (0.951) and WI (0.976) for CO, and lowest errors for CO, NO2, and O3. DNN-Encoded excelled for PM2.5 (KGE: 0.949), PM10, and SO2, showing better KGE and NSE in ~ 30% of cases. Encoding improved performance across all pollutants, particularly for CO, NO2, and O3. Key predictors—dew point, month, and temperature—were influential in over 80% of top-performing models, while rainfall and hourly features had minimal impact (< 10%). Overall, encoded models consistently delivered superior accuracy by effectively capturing temporal and environmental patterns.

Fig. 7

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Huairou Station.

Nongzhanguan station

At Nongzhanguan Station (Fig. 8), CNN-Encoded models outperformed others in ~ 70% of pollutants, achieving top KGE for CO (0.960), O3 (0.962), and NO2 (0.943), with the lowest MAE/RMSE (e.g., CO MAE: 209.2 μg/m3; O3 MAE: 7.1 μg/m3). DNN-Unencoded excelled in PM2.5 with the highest KGE (0.976) and lowest RMSE (19.7 μg/m3). Forecasts for CO, NO2, and O3 had strong accuracy (KGE > 0.92; R2 > 0.91), while SO2 had the lowest performance (max KGE: 0.933; R2: 0.88). Dew point, temperature, and pressure were key predictors in > 80% of cases; rainfall had a < 10% impact. Overall, CNN-Encoded models proved most effective for pollutant forecasting, driven by strong meteorological and temporal features.

Fig. 8

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Nongzhanguan Station.

Shunyi station

At Shunyi Station (Fig. 9), CNN-Encoded models outperformed in ~ 60% of pollutants, achieving top metrics for CO (KGE: 0.946, R2: 0.907), PM10 (KGE: 0.945, R2: 0.937), and SO2 (MAE: 2.62 µg/m3, RMSE: 6.56 µg/m3). DNN-Encoded led in NO2 (KGE: 0.928, R2: 0.908), O3 (KGE: 0.938), and PM2.5 (KGE: 0.969, R2: 0.953). CO and O3 had the strongest model performance (KGE > 0.94), linked to high meteorological sensitivity. Dew point and month_cosine were key drivers in > 80% of cases, while rainfall had a < 10% impact. Overall, CNN-Encoded models provided superior forecasts when pollutant levels were strongly meteorology-dependent.

Fig. 9

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Shunyi Station.

Tiantan station

At Tiantan Station (Fig. 10), encoded models outperformed unencoded ones in ~ 70% of cases. CNN-Encoded achieved top results for CO, PM2.5, and SO2 (KGE > 0.93, R2 > 0.88, MAE: 210.2 µg/m3 for CO; 10.53 µg/m3 for PM2.5). DNN-Encoded led in NO2 (KGE: 0.941, MAE: 7.32 µg/m3) and O3 (NSE: 0.917, R2: 0.92). PM10 was best predicted by CNN-Unencoded (KGE: 0.938). Key drivers (dew point, temperature, pressure, wind) influenced forecasts in > 80% of cases, while rainfall had a < 10% impact. Overall, encoded models improved accuracy by effectively capturing seasonal and atmospheric patterns.

Fig. 10

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Tiantan Station.

Wanliu station

At Wanliu Station (Fig. 11), CNN-Encoded models outperformed unencoded ones in ~ 75% of cases, achieving top accuracy for CO, PM2.5, and SO2 (KGE > 0.93, R2 > 0.88, MAE: 240.55 μg/m3 for CO; 9.78 μg/m3 for PM2.5). DNN-Encoded led NO2 forecasting (KGE: 0.938, MAE: 7.22 μg/m3) and O3 (NSE: 0.912, R2: 0.923). PM10 predictions were best by CNN-Unencoded (KGE: 0.931, MAE: 15.64 μg/m3). Key meteorological drivers (dew point, temperature, pressure, wind) influenced > 80% of results, while rainfall had minimal (< 10%) impact. Encoded features improved forecast accuracy by effectively capturing complex temporal and atmospheric patterns.

Fig. 11

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Wanliu Station.

Wanshouxigong station

At Wanshouxigong Station (Fig. 12), CNN-Encoded models led in ~ 80% of cases, achieving the highest accuracy for CO (KGE: 0.964, R2: 0.938, MAE: 192.96 µg/m3) and PM2.5 (KGE: 0.958, R2: 0.948, MAE: 1.85 µg/m3). O3 was driven by temperature, wind, and hourly cycles; NO2 and SO2 showed strong seasonal (month) effects. PM10 performed best with DNN-Encoded (R2: 0.935, RMSE: 28.20 µg/m3). SO2 had lower accuracy but CNN-Encoded still improved errors (RMSE: 4.02 µg/m3). Rainfall had a minimal (< 10%) impact. Encoding boosted forecast reliability by capturing seasonal and short-term variations and reducing errors.

Fig. 12

Forecasting results of six air pollutants using Unencoded and Encoded deep learning models at Wanshouxigong Station.

Remarks and comparison

As per Tables 3, 4, 5, 6, 7 and 8, model performance across pollutants and locations is generally high, with R2 values mostly exceeding 0.85, reflecting strong predictive accuracy. PM2.5 and PM10 exhibit the highest and most consistent R2 scores, often above 0.94, indicating excellent model fit across all sites and methods. O3 predictions also show robust results, generally above 0.89. CO and NO2 show slightly more variation but still maintain strong performance, with CNN models—especially those using encoded inputs—tending to have a slight edge over DNNs. SO2 predictions are the most variable and generally lower, with some locations such as Changping and Guanyuan showing R2 values closer to 0.76–0.78, suggesting more complexity or noise in the data. Locations such as Nongzhanguan, Wanshouxigong, and Tiantan consistently yield higher R2 values across pollutants and models, indicating more stable data or better model generalization, whereas Changping and Guanyuan often show comparatively lower performance. Overall, CNN architectures with encoded inputs generally offer marginal improvements, particularly for more challenging pollutants like SO2 and CO. Moreover, based on Table 9, CNN achieves the highest R2 in 70% of the cases (14 out of 20 combinations) across both unencoded and encoded features. DNN follows, ranking highest in 25% of cases, while LSTM leads only once (5%). ANN consistently underperforms, with the lowest R2 in 90% of the stations when features are encoded. Top-performing stations like Nongzhanguan, Tiantan, and Wanliu record R2 values above 0.96 with CNN, while lower-performing stations like Changping and Huairou have values around 0.94 or below, highlighting site-specific variability in model accuracy.

Table 3 Comparison of R2 values for pollutant CO across locations.
Table 4 Comparison of R2 values for pollutant NO2 across locations.
Table 5 Comparison of R2 values for pollutant O3 across locations.
Table 6 Comparison of R2 values for pollutant PM2.5 across locations.
Table 7 Comparison of R2 values for pollutant PM10 across locations.
Table 8 Comparison of R2 values for pollutant SO2 across locations.
Table 9 Comparison of R2 values for PM2.5 prediction across stations using deep learning models with unencoded and encoded features.

Conclusion, limitations and future directions

This research highlights how effective deep learning models—specifically DNN and CNN frameworks—are at predicting major urban air pollutants across several monitoring locations using four years of hourly data. Both models delivered strong predictive performance, showing a high level of alignment between real and predicted pollutant values. Notably, the inclusion of feature encoding greatly boosted model accuracy, leading to steady gains of about 2–5% in key evaluation metrics like R2, NSE, and KGE.

The findings show that CNNs excelled at detecting spatial and temporal pollution patterns, particularly for pollutants such as CO and PM2.5. Meanwhile, DNNs demonstrated strong results across a wider range of pollutants. Feature encoding proved essential in enhancing the models’ ability to generalize and reduce prediction errors, underscoring the value of preprocessing in forecasting air quality over time.

Differences between monitoring sites showed the models could adapt to varying pollution trends and levels, reinforcing their usefulness in a variety of urban contexts. These insights suggest that deep learning, especially when supported by encoded features, holds significant potential for delivering accurate and scalable air quality predictions—tools that could be crucial for city planning and public health efforts.

Despite these strong results, the study is not without limitations. The deep learning models require significant computational resources—GPU or cloud-based infrastructure—which may limit their application in resource-constrained settings. Moreover, external drivers beyond the available meteorological variables, such as traffic emissions, industrial output, and humidity, were not included in the modeling pipeline. Incorporating these factors could potentially improve model accuracy by up to 10–15%, based on evidence from related literature. Additionally, the geographic scope of the study was limited to Beijing, reducing the model’s generalizability to regions with different climatic and socio-economic profiles.

The practical implications of this research are remarkable. High-accuracy pollutant forecasting, with R2 values above 0.90 in many cases, can support early warning systems, enabling city authorities to issue timely health advisories and reduce exposure risks. The integration of these models into smart city infrastructure could lead to more efficient urban planning, including dynamic traffic control and targeted industrial regulation. Furthermore, the framework demonstrated in this study provides a scalable foundation for AI-driven air quality management, capable of being deployed in various urban areas.

For future research, expanding the input feature space to include additional meteorological and socioeconomic variables is essential. Preliminary studies indicate that adding weather-related variables can increase forecasting accuracy by 8–12%. Model interpretability should also be prioritized using tools such as SHAP or attention mechanisms to uncover the influence of specific features on predictions. Furthermore, incorporating data from multiple cities with different pollution profiles would enhance the framework’s adaptability and general applicability. Finally, transitioning to real-time, cloud-based deployment can provide scalable, on-demand predictions. Hybrid models that combine deep learning with physical or statistical modeling may improve prediction robustness by 10–20%, offering a promising direction for next-generation environmental forecasting systems.