Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Air quality index prediction using a hybrid CEEMDAN-CNN-IGWO-BiGRU-Attention model
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 03 April 2026

Air quality index prediction using a hybrid CEEMDAN-CNN-IGWO-BiGRU-Attention model

  • Yong Fang1,
  • Suping Liu1 &
  • Zhihao Su1 

Scientific Reports , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Computational biology and bioinformatics
  • Engineering
  • Mathematics and computing

Abstract

Precise forecasting of the Air Quality Index (AQI) is essential for environmental management and public health protection. However, the non-linear and non-stationary nature of AQI time series presents a significant challenge for traditional predictive models. Most current deep learning approaches still face limitations in feature extraction and rely on inefficient manual hyperparameter tuning. To address these constraints, this study proposes an integrated forecasting framework combining Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), Convolutional Neural Networks (CNN), Bidirectional Gated Recurrent Units (BiGRU), and an Attention mechanism. The methodology begins by using CEEMDAN to decompose the complex AQI signals into multiple stable frequency components, which effectively reduces the impact of data noise. Each component is then processed by a hybrid sub-network where CNNs extract local features and BiGRU units capture long-term temporal dependencies. An attention layer is incorporated to dynamically assign weights to critical time steps. Furthermore, an Improved Grey Wolf Optimizer (IGWO) is introduced to automate the hyperparameter search, ensuring optimal network performance without manual intervention. Experimental results on a long-term dataset from Guangzhou (2014–2024) show that the proposed model achieves an MSE of 10.2456 and a coefficient of determination (R2 ) of 0.9615. These findings, supported by detailed ablation studies and cross-city generalization tests, demonstrate that the model is both robust and accurate for real-world air quality early-warning systems.

Similar content being viewed by others

Air quality prediction model based on deep learning hybrid framework

Article Open access 03 February 2026

Air quality prediction using multi-source remote sensing data integration with hybrid deep learning framework

Article Open access 15 December 2025

Improving air quality prediction using hybrid BPSO with BWAO for feature selection and hyperparameters optimization

Article Open access 16 April 2025

Data availability

The datasets used in this study are publicly available from the Tianqi Houbao (AQI) database athttps:/www.tianqihoubao.com. Descriptive statistics, preprocessing details, sensitivity analyses, and additional results are provided in the SupplementaryMaterial (Tables S1–S7, Figures S1–S4).

References

  1. Hu, J., Chen, J., Ying, Q. & Zhang, H. One-year simulation of ozone and particulate matter in China using WRF/CMAQ modeling system. Atmos. Chem. Phys. 16, 10333–10350. https://doi.org/10.5194/acp-16-10333-2016 (2016).

    Google Scholar 

  2. Zhang, L. Y., Qiu, R. Z. & Hu, X. S. Trend analysis and forecast of PM2.5 in Fuzhou, China using the ARIMA model. Ecol. Indic. 95, 702–710. https://doi.org/10.1016/j.ecolind.2018.08.032 (2018).

    Google Scholar 

  3. Mani, G., Viswanadhapalli, J. K. & Soni, A. Prediction and forecasting of air quality index in chennai using regression and arima time series models. J. Eng. Res. 9 https://doi.org/10.36909/jer.9517 (2021).

  4. Gourav, Rekhi, J. K., Nagrath, P. & Jain, R. Springer, Forecasting air quality of delhi using arima model. In Advances in Data Science, Security and Applications, vol. 612 of Lecture Notes in Electrical Engineering, 315–325, (2019). https://doi.org/10.1007/978-981-15-0372-6_25

  5. Bhatti, U. A. et al. Time series analysis and forecasting of air pollution particulate matter (PM2.5): An SARIMA and factor analysis approach. IEEE Access 9, 41019–41031. https://doi.org/10.1109/ACCESS.2021.3060744 (2021).

    Google Scholar 

  6. Dong, Y., Li, Z. & Zhong, W. Analysis of average air temperature in New York based on SARIMA model. Highlights Sci. Eng. Technol. 49, 148–156. https://doi.org/10.54097/hset.v49i.8495 (2023).

    Google Scholar 

  7. Cecen, R. K. Flight regimes and environmental impact: A linear regression analysis of CO2 emissions for en-route air traffic operations. Int. J. Aeronaut. Space Sci. 26, 2058–2069. https://doi.org/10.1007/s42405-024-00860-z (2025).

    Google Scholar 

  8. Sethi, J. K. & Mittal, M. An efficient correlation based adaptive lasso regression method for air quality index prediction. Earth Sci. Informatics 14, 1777–1786. https://doi.org/10.1007/s12145-021-00618-1 (2021).

    Google Scholar 

  9. Kant, S. From data to decision-making: Utilizing decision tree for air quality monitoring in smart urban areas. Int. J. Inf. Technol. 17, 665–672. https://doi.org/10.1007/s41870-024-02208-y (2025).

    Google Scholar 

  10. Anuradha, Y., Mishra, V. P., Bali, M. & Ara, T. Explainable forecasting of air quality index using a hybrid random forest and arima model. MethodsX 15, 103517. https://doi.org/10.1016/j.mex.2025.103517 (2025).

    Google Scholar 

  11. Chao, B. & Guangqiu, H. Innovative svm optimization with differential gravitational fireworks for superior air pollution classification. Sci. Rep. 14(1), 24450. https://doi.org/10.1038/s41598-024-75839-7 (2024).

    Google Scholar 

  12. Liu, B.-C., Binaykia, A., Chang, P.-C., Tiwari, M. K. & Tsao, C.-C. Urban air quality forecasting based on multi- dimensional collaborative support vector regression (svr): A case study of Beijing-Tianjin-Shijiazhuang. PLoS One 12, e0179763. https://doi.org/10.1371/journal.pone.0179763 (2017).

    Google Scholar 

  13. Borge, R., Jung, D., Lejarraga, I., de la Paz, D. & Cordero, J. M. Assessment of the Madrid region air quality zoning based on mesoscale modelling and k-means clustering. Atmos. Environ. 287, 119258. https://doi.org/10.1016/j.atmosenv.2022.119258 (2022).

    Google Scholar 

  14. Kazemi, Z. et al. Assessment of the risk of exposure to air pollutants and identifying the affecting factors on making pollution by pca, cfa. Int. J. Environ. Anal. Chem. 104, 2128–2147. https://doi.org/10.1080/03067319.2022.2059364 (2022).

    Google Scholar 

  15. Naz, F. et al. Air quality and healthy ageing: Predictive modeling of pollutants using cnn quantum-lstm. IEEE Access 13, 94212–94223. https://doi.org/10.1109/ACCESS.2025.3570526 (2025).

    Google Scholar 

  16. Wang, S. & Zhang, Y. An attention-based cnn model integrating observational and simulation data for high-resolution spatial estimation of urban air quality. Atmos. Environ. 340, 120921. https://doi.org/10.1016/j.atmosenv.2024.120921 (2025).

    Google Scholar 

  17. Kushwah, V. & Agrawal, P. Hybrid model for air quality prediction based on lstm with random search and bayesian optimization techniques. Earth Sci. Inform. 18, 32. https://doi.org/10.1007/s12145-024-01514-0 (2025).

    Google Scholar 

  18. Wang, X., Yan, J., Wang, X. & Wang, Y. Air quality forecasting using the gru model based on multiple sensors nodes. IEEE Sens. Lett. 7, 1–4. https://doi.org/10.1109/LSENS.2023.3290144 (2023).

    Google Scholar 

  19. Wang, B., Kong, W. & Zhao, P. An air quality forecasting model based on improved convnet and rnn. Soft Comput. 25, 9209–9218. https://doi.org/10.1007/s00500-021-05843-w (2021).

    Google Scholar 

  20. Qi, Z. et al. Deep-air: A hybrid cnn-lstm framework for fine-grained air pollution estimation and forecast in metropolitan cities. IEEE Access. 10, 55818–55841. https://doi.org/10.1109/ACCESS.2022.3177893 (2022).

    Google Scholar 

  21. Yongkang, Z. et al. Air quality forecasting with hybrid lstm and extended stationary wavelet transform. Build. Environ. 213, 108822. https://doi.org/10.1016/j.buildenv.2022.108822 (2022).

    Google Scholar 

  22. Wani, T. et al. Hybridization of air quality forecasting models using machine learning and clustering: an original approach to detect pollutant peaks. Aerosol Air Qual. Res. 24, 405–416. https://doi.org/10.4209/aaqr.230181 (2024).

    Google Scholar 

  23. Jin, X.-B. et al. Deep hybrid model based on emd with classification by frequency characteristics for long-term air quality prediction. Mathematics 8(2), 214. https://doi.org/10.3390/math8020214 (2020).

    Google Scholar 

  24. Jiang, X., Wei, P., Luo, Y. & Li, Y. Air pollutant concentration prediction based on a ceemdan-fe-bilstm model. Atmosphere 12, 1452. https://doi.org/10.3390/atmos12111452 (2021).

    Google Scholar 

  25. Tang, C., Wang, Z., Wei, Y., Zhao, Z. & Li, W. A novel hybrid prediction model of air quality index based on variational modal decomposition and ceemdan-se-gru. Process. Saf. Environ. Prot. https://doi.org/10.1016/j.psep.2024.11.061 (2024).

    Google Scholar 

  26. Zhao, Z. & Yap, D. S. Research on air quality prediction model based on ceemdan-cnn-bilstm-attention. In Proceedings of the 2024 8th International Conference on Electronic Information Technology and Computer Engineering, (2024). https://doi.org/10.1145/3689969.3689987

  27. Xiaokai, Q. et al. Prediction of air leakage rate of sintering furnace based on bp neural network optimized by pso. Wirel. Commun. Mob. Comput. 1–9, (2022). https://doi.org/10.1155/2022/5128993 (2022).

  28. Hongzhuo, W. et al. Prediction of air quality index (aqi) based on optimized gwo-lstm neural network. In 2022 IEEE 8th International Conference on Computer and Communications (ICCC), 1445–1449, (2022). https://doi.org/10.1109/ICCC56324.2022.10065731

  29. Jing, C., Jieshu, H., He, G. & Yu, S. Optimization of cnn-lstm air quality prediction based on the poa algorithm. Sustainability 17 https://doi.org/10.3390/su17125006 (2025).

  30. Zhou, H. et al. Informer: Beyond efficient transformer for long sequence time-series forecasting. Proc. AAAI Conf. on Artif. Intell. 35, 11106–11115, (2021). https://doi.org/10.1609/aaai.v35i12.17325

  31. Wu, N., Green, B., Ben, X. & O’Banion, S. Transformer for time series: A survey 2009.07685. (2020).

  32. Torres, M. E., Colominas, M. A., Schlotthauer, G. & Flandrin, P. A complete ensemble empirical mode decomposition with adaptive noise. In 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4144–4147, https://doi.org/10.1109/ICASSP.2011.5947265 (IEEE, 2011).

  33. Cho, K. et al. Learning phrase representations using rnn encoder-decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1724–1734, (2014). https://doi.org/10.3115/v1/D14-1179

  34. Mirjalili, S., Mirjalili, S. M. & Lewis, A. Grey wolf optimizer. Adv Eng. Softw 69, 46–61, (2014). https://doi.org/10.1016/j.advengsoft.2013.12.007

Download references

Funding

This work has been supported by Guangdong University of Science and Technology Doctoral Startup Fund Project (No. GKY-2025BSQDK-14).

Author information

Authors and Affiliations

  1. Guangdong University of Science and Technology, Dongguan, 523000, China

    Yong Fang, Suping Liu & Zhihao Su

Authors
  1. Yong Fang
    View author publications

    Search author on:PubMed Google Scholar

  2. Suping Liu
    View author publications

    Search author on:PubMed Google Scholar

  3. Zhihao Su
    View author publications

    Search author on:PubMed Google Scholar

Contributions

YF conceived the study, proposed the hybrid forecasting framework, and led the overall implementation. SPL and ZHS contributed to the methodological design and the development of the optimization strategy. YF and SPL conducted the experiments and performed the formal analysis of the results. YF drafted the manuscript and handled project correspondence. All authors reviewed and approved the final version of the paper.

Corresponding author

Correspondence to Yong Fang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fang, Y., Liu, S. & Su, Z. Air quality index prediction using a hybrid CEEMDAN-CNN-IGWO-BiGRU-Attention model. Sci Rep (2026). https://doi.org/10.1038/s41598-026-46978-w

Download citation

  • Received: 06 January 2026

  • Accepted: 28 March 2026

  • Published: 03 April 2026

  • DOI: https://doi.org/10.1038/s41598-026-46978-w

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics