Introduction

Over the past twenty years, meta-heuristic optimization approaches have gained immense popularity. Many of them, such as SSA1, EO2, HHO3, GWO4, BMO5, MRFO6, WOA7, AO8, AOA9, and HGSO10, are well recognized by scientists across multiple disciplines as well as by machine learning experts. These approaches have been applied to a wide range of research topics and used to solve numerous optimization problems, including tasks that are non-linear, non-differentiable, or computationally demanding with many local minima. In addition, a substantial number of scientific studies have been devoted to these methodologies. The remarkable popularity of metaheuristics can be attributed to four main factors: simplicity, flexibility, derivative-free operation, and the ability to avoid local optima4,11. Typically, these techniques can be categorized into four distinct groups12: evolution-based, physics-inspired, swarm-based13,14, and human-based algorithms.

Evolution-based models: integrate mechanisms such as chemical sensing and movement, reproductive processes, removal, distribution, and movement patterns15. Among these, the Genetic Algorithm (GA) stands out as a common and powerful evolutionary technique16. In particular, GA does not require derivatives, unlike many mathematical optimization methods. By emulating successful individuals, GA improves the population over generations and can escape local optima. Over time, different approaches have been suggested to improve the efficiency of GA. Furthermore, other evolutionary techniques have emerged following the success of GA17, including Evolutionary Programming (EP)18, Differential Evolution (DE)19, Evolution Strategies (ES)20, and the Artificial Algae Algorithm (AAA)21.

Physics-based: simulate the physical laws governing our world. One of the most recognized algorithms in this category is Simulated Annealing (SA)22. SA mimics the thermodynamics of physical materials: annealing, the process of heating and then slowly cooling a metal so that it crystallizes, is used to reach a low-energy state. Additionally, several newer physics-inspired algorithms have been established, such as the Gravitational Search Algorithm (GSA)23, Lévy Flight Distribution (LFD)24, and the Archimedes Optimization Algorithm (AOA)25.

Swarm-based algorithms: strive to replicate the social behaviors observed in creatures, such as self-organizing mechanisms and the division of labor26. Two notable examples in this domain are Particle Swarm Optimization (PSO)27 and Ant Colony Optimization (ACO)28. PSO, inspired by bird flocking, adjusts each agent according to both its personal best performance and the global best within the group. ACO, on the other hand, draws inspiration from the foraging habits of ant colonies and the decay of pheromone trails over time, which ants exploit to find the most efficient path from their nest to a food source. In addition, other swarm-inspired techniques include Glowworm Swarm Optimization (GSO)29, Harris Hawks Optimization (HHO)3, and cuckoo search (CS)30, as well as Artificial Ecosystem-based Optimization (AEO)31.

Human-based algorithms: mainly derived from human behavior, where each individual has a unique way of accomplishing tasks that can affect overall performance, which motivates researchers to build corresponding models32. The most well-known human-based algorithm is Teaching-Learning-Based Optimization (TLBO), developed to simulate classroom interactions between an instructor and students33. Human Mental Search (HMS)34 was designed by simulating human behavior in online auction platforms. The Doctor and Patient Optimization algorithm (DPO)35 was designed around the interactions between healthcare providers and patients, including illness prevention, examination, and therapy.

The No Free Lunch Theorem36 in optimization states that no single optimizer performs optimally across all optimization scenarios. Consequently, the pursuit of robust swarm-inspired optimizers has become a driving force for researchers aiming to tackle intricate real-world problems37,38. In this study, we propose eight hybrid frameworks that incorporate modern metaheuristic techniques. These frameworks are specifically designed to fine-tune support vector regression parameters for forecasting the daily maximum concentration of Particulate Matter (\(PM_{2.5}\)).

According to the literature, various methods with different characteristics have been employed. Among these, optimization methods have proven their efficiency in solving \(PM_{2.5}\) forecasting problems compared to traditional approaches. However, SVR combined with optimization methods has been underutilized, despite its potential to provide more reliable forecasting solutions. Existing search methods often face limitations in performance, model complexity, and the time required to build and solve the problem; consequently, achieving accurate results can be challenging. Furthermore, as highlighted in39, a significant gap in this problem lies in the complex process of model establishment, which necessitates a comprehensive understanding of each variable’s impact on the target value; unfortunately, some factors may be overlooked during implementation40. Although most current studies focus on non-linear models for \(PM_{2.5}\) forecasting, only a few have explored advanced machine learning and optimization techniques. This claim is reinforced by41, in which a cutting-edge optimization technique is utilized as a prediction system relying on unstructured data, leading to more accurate and coherent forecasts. In general, the consensus in the literature is that the \(PM_{2.5}\) forecasting problem is highly intricate and requires an efficient approach42.

This study compares the proposed HHO hybrid model with other recognized metaheuristic optimization techniques. These encompass GWO, WOA, SSA, BMO, HGSO, MRFO, and EO. Table 1 shows a summary of each of the algorithms, detailing their core principles, strengths, and limitations.

Table 1 Summary of comparative algorithms used in this study.

The paper presents the following contributions:

  • The study introduces a hybrid model that employs Support Vector Regression (SVR) with Harris Hawks Optimization (HHO) for the accurate prediction of \(PM_{2.5}\) concentrations.

  • The effectiveness of the suggested approach is evaluated by the Mean Absolute Percentage Error (MAPE), Average, Standard Deviation (SD), Best Fit, Worst Fit, and CPU time.

  • All models were trained on recent real-world data from the Centers for Disease Control and Prevention’s National Environmental Public Health Tracking Network, a governmental data source, using five county-level datasets (FIPS codes 1001, 1003, 1005, 1007, and 1009).

The paper is organized as follows: “Materials and methods” section provides an in-depth discussion of the general principles underlying the SVR model, along with an exploration of the HHO algorithm. In “The proposed HHO-SVR model” section, we delve into the proposed HHO-SVR model, detailing its configuration and settings. Moving on to “Experimental results analysis and discussion” section, we present the definition, analysis, and measurement criteria employed to assess precision, as well as an interpretation and thorough discussion of the obtained outcomes. Finally, “Conclusion and future directions” section offers the conclusion, summarizing the key findings and implications (Table 2).

Table 2 Sample of the studies related to \(PM_{2.5}\) forecasting.

Materials and methods

In the following section, the fundamental concepts of SVR along with HHO are addressed.

Support Vector Regression (SVR)

SVR is a data-driven approach derived from the Support Vector Machine (SVM). SVR is used for regression tasks by employing the \(\varepsilon\)–insensitive loss function; for further information and a detailed description of SVM, refer to66. Suppose that the training samples are given by \(D=\left\{ {\left( {{x_i},{y_i}}\right) }\right\}\), where the input is \({x_i}\in R\) and the output is \({y_i} \in R\) for \(i=1,2,3, \cdots , N\), with N denoting the number of samples. The goal of SVR is to determine a functional relationship, denoted as f(x), that links the input variables \(x_{i}\) to the output variable \(y_{i}\). This is done without any prior knowledge of the joint distribution P of the variables (x, y). In the linear case the formula is \(f(x)= \langle {w, x}\rangle + b\), where w is the weight vector and b is the bias term. A non-linear mapping, denoted \(\Phi\), is utilized to convert a hard non-linear task into a more tractable linear one. The regression function is presented in Eq. (1).

$$\begin{aligned} f(x) = \left\langle {w,x} \right\rangle + b \end{aligned}$$
(1)

The function f(x) should fit the training set with flexibility while aiming for a minimal slope, which is achieved by reducing the norm of w to avoid overfitting. To handle constraints that would otherwise be infeasible, two slack variables, denoted \(\xi_i\) and \({\xi }_{i}^{*}\), are introduced. The feasibility of convex optimization in this context relies on the existence of a function that approximates all data pairs \(({x_i},{y_i})\) with a suitable accuracy level, denoted \(\varepsilon\). The problem is then formulated as a convex optimization task, as depicted in Eq. (2).

$$\begin{aligned} \begin{array}{l} {\textrm{minimize}}\,\,\frac{1}{2}{\left\| w \right\| ^2} + C\sum \limits _{i = 1}^l {\left( {{\xi _i} + \xi _{i}^{*} } \right) } \\ {\textrm{subject to}}\,\,\left\{ {\begin{array}{*{20}{c}} {\left\langle {w,\Phi \left( {{x_i}} \right) } \right\rangle + b - {y_i} \le \varepsilon + {\xi _i}}\\ {{y_i} - \left\langle {w,\Phi \left( {{x_i}} \right) } \right\rangle - b \le \varepsilon + \xi _{i}^{*} }\\ {{\xi _i},\xi _{i}^{*} \ge 0} \end{array}} \right. \end{array} \end{aligned}$$
(2)

where C is the penalty factor constant, and \({\xi _i},{\xi }_{i}^{*}\) denote the deviations between the predicted and the target values outside the \(\varepsilon\)-tube.

The optimization problem is more easily solved in its dual form. Using \(K\left( {{x_i},{x_j}} \right) = {\Phi ^T}\left( {{x_i}} \right) \Phi \left( {{x_j}} \right)\) as a direct substitute in the saddle-point condition instead of \(\Phi (\cdot )\) explicitly, Eq. (3) yields the kernel version of the dual optimization problem after eliminating the dual variables \({{\eta }_i},\eta _{i}^{*}\). The kernel function that satisfies the Mercer condition is denoted as \(K\left( {x,x'} \right)\)67.

$$\begin{aligned} \begin{array}{l} {\textrm{minimize}}\,\,\frac{1}{2}\sum \limits _{i,j = 1}^l {\left( {\alpha _{i}^{*} - {\alpha _i}} \right) \left( {\alpha _{j}^{*} - {\alpha _j}} \right) } K\left( {{x_i} \cdot {x_j}} \right) \\ + \varepsilon \sum \limits _{i = 1}^l {\left( {\alpha _{i}^{*} + {\alpha _i}} \right) - \sum \limits _{i = 1}^l {{y_i}\left( {\alpha _{i}^{*} - {\alpha _i}} \right) } } \\ {\textrm{subject to}}\,\,\left\{ {\begin{array}{*{20}{c}} {\sum \limits _{i = 1}^l {\left( {\alpha _{i}^{*} - {\alpha _i}} \right) = 0,} }\\ {0 \le {\alpha _i},\alpha _{i}^{*} \le C} \end{array}} \right. \,\, \end{array} \end{aligned}$$
(3)

Because \(\alpha _i\) and \(\alpha _{i}^{*}\) are the Lagrange multipliers, w is obtained directly in Eq. (4) after solving the dual optimization problem. Support Vectors (SVs) are the samples whose multipliers \(\alpha _i\) or \(\alpha _{i}^{*}\) are non-zero. At the optimal solution, the product of the dual variables and the constraints must vanish, which is enforced through the Karush-Kuhn-Tucker (KKT) conditions; these conditions define the necessary and sufficient requirements for a global optimum. The parameter b is determined in Eq. (5), and the function f(x) is expressed in the support vector expansion shown in Eq. (6). The complexity of the function depends solely on the number of SVs and is independent of the dimensionality of the input space.

$$\begin{aligned} w= & \sum \limits _{i = 1}^l {\left( {\alpha _{i}^{*} - {\alpha _i}} \right) {x_i}} \end{aligned}$$
(4)
$$\begin{aligned} b= & {y_i} - \left\langle {w,\Phi \left( {{x_i}} \right) } \right\rangle + \varepsilon \,\,{\textrm{for}}\,\,\,0< {\alpha _i}< C\nonumber \\ b= & {y_i} - \left\langle {w,\Phi \left( {{x_i}} \right) } \right\rangle - \varepsilon \,\,{\textrm{for}}\,\,\,0< \alpha _{i}^{*}< C \end{aligned}$$
(5)
$$\begin{aligned} f\left( x \right)= & \sum \limits _{i = 1}^l {\left( {\alpha _{i}^{*} - {\alpha _i}} \right) } K\left( {{x_i},x} \right) + b \end{aligned}$$
(6)

The SVR technique raises an interesting question about the seemingly arbitrary process of selecting a kernel for specific data patterns68,69. The Gaussian RBF kernel works better than other kernel functions in terms of simplicity of use and effective mapping capability. Therefore, in this article, \(K\left( {x,x'} \right) = \exp \left( { - \frac{{{{\left\| {x - x'} \right\| }^2}}}{{2{\sigma ^2}}}} \right)\) represents the Gaussian RBF kernel. Two parameters are involved in the SVR method: the C parameter controls the trade-off between the complexity of the function and the frequency with which errors are tolerated, while the \(\sigma\) parameter controls the complexity of the model and governs the mapping of the input variables into the feature space. As mentioned in70, it is consequently important to determine appropriate parameters, and the value of \(\sigma\) should be chosen even more carefully than C. The SVR pseudo-code is displayed in Algorithm 1.

Algorithm 1
figure a

The pseudo-code of SVR model.
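
To make the roles of the C and \(\sigma\) parameters concrete, the following minimal Python sketch fits an RBF-kernel SVR with scikit-learn (an illustration only, not the MATLAB implementation used in this study). scikit-learn parameterizes the RBF kernel as \(\exp (-\gamma \left\| x-x' \right\| ^2)\), so the \(\sigma\) above is mapped to \(\gamma = 1/(2\sigma ^2)\); the data and parameter values are placeholders.

```python
import numpy as np
from sklearn.svm import SVR

# Toy 1-D regression data (illustrative placeholders only).
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 1))
y = np.sin(X).ravel() + 0.1 * rng.standard_normal(200)

sigma = 1.0      # RBF width discussed in the text (assumed value)
C = 10.0         # penalty factor (assumed value)
epsilon = 0.1    # width of the epsilon-insensitive tube

# scikit-learn's RBF kernel is exp(-gamma * ||x - x'||^2),
# so gamma = 1 / (2 * sigma^2) reproduces the Gaussian kernel above.
model = SVR(kernel="rbf", C=C, epsilon=epsilon, gamma=1.0 / (2.0 * sigma**2))
model.fit(X, y)

print("number of support vectors:", len(model.support_))
print("prediction at x = 2.5:", model.predict([[2.5]]))
```

In this setting, a larger C tolerates fewer training errors, while a smaller \(\sigma\) makes the kernel narrower and the model more flexible, which is why \(\sigma\) must be chosen carefully.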

Harris Hawks Optimization (HHO)

Figure 1 shows all phases of HHO, which are described in the next subsections.

Fig. 1
figure 1

Different phases of HHO3.

Exploration phase

In HHO, the Harris’ hawks exhibit random perching behavior at various locations, employing two distinct strategies to detect and capture their prey.

$$\begin{aligned} X(t+1) = \left\{ \begin{matrix} X_{rand}(t)-r_{1}\left| X_{rand}(t)-2r_{2}X(t) \right| & q\ge 0.5 \\ (X_{rabbit}(t)-X_{m}(t))-r_{3}(LB+r_{4}(UB-LB)) & q<0.5 \end{matrix}\right. \end{aligned}$$
(7)

where \(X(t+1)\) represents the position vector of the hawks in the next iteration, \(X_{rabbit}(t)\) denotes the position of the rabbit, and X(t) is the current position vector of the hawks. Additionally, \(r_{1}\), \(r_{2}\), \(r_{3}\), \(r_{4}\), and q are random numbers uniformly distributed between 0 and 1, updated in each iteration. The variables LB and UB represent the lower and upper bounds of the hawk positions. Furthermore, \(X_{rand}(t)\) corresponds to a randomly selected hawk from the current population, and \(X_{m}\) represents the average position of the current hawk population. The average position of the hawks is obtained using Eq. (8):

$$\begin{aligned} X_{m}(t)=\frac{1}{N}\sum _{i=1}^{N}X_{i}(t) \end{aligned}$$
(8)

where \(X_{i}(t)\) represents the position of each hawk at iteration t, whereas N represents the total number of hawks.
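
The exploration update of Eqs. (7) and (8) can be summarized in a few lines of NumPy; the sketch below is a simplified transcription of the published update rules (not the authors' MATLAB code), with boundary clipping added so the new positions stay inside [LB, UB].

```python
import numpy as np

def hho_exploration_step(X, X_rabbit, lb, ub, rng):
    """One exploration update of the hawk population X (N x D array), Eq. (7)."""
    N, _ = X.shape
    X_mean = X.mean(axis=0)                       # Eq. (8): average hawk position
    X_new = np.empty_like(X)
    for i in range(N):
        q, r1, r2, r3, r4 = rng.random(5)
        if q >= 0.5:                              # perch relative to a random hawk
            X_rand = X[rng.integers(N)]
            X_new[i] = X_rand - r1 * np.abs(X_rand - 2.0 * r2 * X[i])
        else:                                     # perch relative to rabbit and mean position
            X_new[i] = (X_rabbit - X_mean) - r3 * (lb + r4 * (ub - lb))
    return np.clip(X_new, lb, ub)                 # keep hawks inside [LB, UB]
```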

Exploration to exploitation transition

To model this transition, the escaping energy of the rabbit is defined as:

$$\begin{aligned} E=2E_{0}\left( 1-\frac{t}{T}\right) \end{aligned}$$
(9)

where E represents the escaping energy of the prey. The variable T denotes the maximum number of iterations, while \(E_{0}\) represents the initial energy state of the prey. The temporal dynamics of E are also illustrated in Fig. 2.

Fig. 2
figure 2

An illustration of the variable E during the execution of two runs and 500 rounds3.
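
As a brief illustration of Eq. (9), the escaping energy decays linearly with the iteration counter; following the reference description of HHO, the initial energy \(E_{0}\) is assumed here to be redrawn uniformly from (-1, 1) at every iteration.

```python
import numpy as np

def escaping_energy(t, T, rng):
    """Eq. (9): escaping energy of the prey at iteration t out of T."""
    E0 = rng.uniform(-1.0, 1.0)       # initial energy, assumed redrawn each iteration
    return 2.0 * E0 * (1.0 - t / T)   # |E| shrinks linearly as t approaches T
```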

Exploitation phase

In this phase, Harris’ hawks execute a surprise pounce on the prey identified in the preceding stage. Prey typically attempts to evade capture, resulting in diverse chasing behaviors in real-world scenarios. To model the attack phase, HHO incorporates four potential strategies based on the prey’s escape behaviors and the hawks’ pursuit tactics. The prey consistently strives to escape from threats, with the probability \(r\) representing the likelihood of successful evasion (\(r < 0.5\)) or failure (\(r \ge 0.5\)) before the pounce. Regardless of the prey’s actions, the hawks employ either a hard or soft besiege to capture it, encircling the prey from multiple directions based on its remaining energy. In natural settings, the hawks progressively close in on the prey, enhancing their cooperative hunting success via a surprise pounce. Over time, the escaping prey becomes increasingly fatigued, allowing the hawks to intensify their besiege and capture the exhausted prey with greater ease. To simulate this strategy within the HHO algorithm, the \(E\) parameter is utilized: a soft besiege occurs when \(|E| \ge 0.5\), while a hard besiege is employed when \(|E| < 0.5\).

Soft besiege The following rules are used to model this behavior:

$$\begin{aligned} X(t+1)= & \Delta X(t)-E\left| JX_{rabbit}(t)-X(t)\right| \end{aligned}$$
(10)
$$\begin{aligned} \Delta X(t)= & X_{rabbit}(t)-X(t) \end{aligned}$$
(11)

where \(\Delta X(t)\) represents the difference between the position vector of the rabbit and the hawk’s current location at iteration t, while \(r_{5}\) is a random number between 0 and 1. The variable J, defined as \(2(1-r_{5})\), represents the random jump strength of the rabbit during the escape, and its value changes randomly in each iteration to mimic the unpredictable nature of rabbit movements.

Hard besiege In this situation, the current positions are updated using Eq. (12):

$$\begin{aligned} X(t+1)=X_{rabbit}(t)-E \left| \Delta X(t) \right| \end{aligned}$$
(12)

A simple illustration of this step with one hawk is depicted in Fig. 3.

Fig. 3
figure 3

An illustration of all vectors in the context of hard besiege3.
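
The two basic besiege updates of Eqs. (10)–(12) reduce to simple vector operations; the sketch below mirrors those formulas for a single hawk position x (a NumPy illustration under the same notation, not the original implementation).

```python
import numpy as np

def soft_besiege(x, x_rabbit, E, rng):
    """Eqs. (10)-(11): soft besiege (|E| >= 0.5 and the prey fails to escape)."""
    J = 2.0 * (1.0 - rng.random())    # random jump strength of the rabbit
    delta = x_rabbit - x              # Eq. (11)
    return delta - E * np.abs(J * x_rabbit - x)

def hard_besiege(x, x_rabbit, E):
    """Eq. (12): hard besiege (|E| < 0.5 and the prey fails to escape)."""
    return x_rabbit - E * np.abs(x_rabbit - x)
```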

Soft besiege with progressive rapid dives We assume that in order to execute a soft besiege, the hawks can assess (decide) their next step in accordance with the following rule in Eq. (13):

$$\begin{aligned} Y=X_{rabbit}(t)-E\left| JX_{rabbit}(t)-X(t)\right| \end{aligned}$$
(13)

We assume that the hawks dive according to the following rule, based on Lévy flight (LF) patterns:

$$\begin{aligned} Z=Y+S\times LF(D) \end{aligned}$$
(14)

where LF is the Lévy flight function, determined by applying Eq. (15), D represents the problem’s dimension, and S is a random vector of size \(1\times D\).

$$\begin{aligned} LF(x)=0.01\times \frac{u\times \sigma }{\left| v \right| ^{\frac{1}{\beta }}}, \quad \sigma =\left( \frac{\Gamma (1+\beta )\times \sin (\frac{\pi \beta }{2})}{\Gamma (\frac{1+\beta }{2})\times \beta \times 2^{(\frac{\beta -1}{2})}} \right) ^{\frac{1}{\beta }} \end{aligned}$$
(15)

where u and v represent random values between 0 and 1, and \(\beta\) denotes a default constant set to 1.5.
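
Equation (15) is a Mantegna-style Lévy step and can be transcribed directly; note that, while the text above describes u and v as random values between 0 and 1, the widely used reference implementation of HHO draws them from standard normal distributions, which is the assumption made in this sketch.

```python
import numpy as np
from math import gamma, pi, sin

def levy_flight(D, beta=1.5, rng=None):
    """Lévy step of dimension D following Eq. (15) (Mantegna-style sampling)."""
    if rng is None:
        rng = np.random.default_rng()
    sigma = (gamma(1 + beta) * sin(pi * beta / 2) /
             (gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))) ** (1 / beta)
    u = rng.standard_normal(D) * sigma   # numerator noise, scaled by sigma
    v = rng.standard_normal(D)           # denominator noise
    return 0.01 * u / np.abs(v) ** (1 / beta)
```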

Therefore, Eq. (16) can be used as the final strategy for updating the positions of hawks during the soft besiege phase.

$$\begin{aligned} X(t+1)=\left\{ \begin{matrix} Y & if F(Y)<F(X(t)) \\ Z & if F(Z)<F(X(t)) \\ \end{matrix}\right. \end{aligned}$$
(16)

where Y and Z are obtained using Eqs. (13) and (14).

A simple illustration of this step for one hawk is demonstrated in Fig. 4.

Fig. 4
figure 4

An illustration of all vectors in the context of soft besiege with progressive rapid dives3.
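
The greedy selection in Eq. (16) keeps whichever of the two candidate moves Y and Z improves the fitness; a compact sketch is given below, reusing the levy_flight helper defined earlier and assuming a minimization objective fobj.

```python
import numpy as np

def soft_besiege_rapid_dives(x, x_rabbit, E, fobj, rng, beta=1.5):
    """Eqs. (13)-(14) combined with the greedy selection of Eq. (16)."""
    D = x.size
    J = 2.0 * (1.0 - rng.random())
    Y = x_rabbit - E * np.abs(J * x_rabbit - x)   # Eq. (13): candidate soft-besiege move
    S = rng.random(D)                             # random 1 x D vector
    Z = Y + S * levy_flight(D, beta, rng)         # Eq. (14): Levy-flight rapid dive
    if fobj(Y) < fobj(x):                         # Eq. (16): keep an improving candidate
        return Y
    if fobj(Z) < fobj(x):
        return Z
    return x                                      # otherwise stay at the current position
```

The hard besiege with progressive rapid dives described next follows the same pattern, with the mean position \(X_{m}(t)\) replacing X(t) in the first candidate move.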

Hard besiege with progressive rapid dives The following rule is applied in the hard besiege condition:

$$\begin{aligned} X(t+1)=\left\{ \begin{matrix} Y & if F(Y)<F(X(t)) \\ Z & if F(Z)<F(X(t)) \\ \end{matrix}\right. \end{aligned}$$
(17)

where Y and Z are obtained using the new rules in Eqs. (18) and (19).

$$\begin{aligned} Y= & X_{rabbit}(t)-E\left| JX_{rabbit}(t)-X_{m}(t)\right| \end{aligned}$$
(18)
$$\begin{aligned} Z= & Y+S\times LF(D) \end{aligned}$$
(19)

where \(X_{m}(t)\) is obtained using Eq. (8).

A simple illustration of this step is demonstrated in Fig. 5.

Fig. 5
figure 5

An illustration of all vectors in the context of hard besiege with progressive rapid dives in 2-D and 3-D spaces.

The proposed HHO-SVR model

The HHO algorithm is combined with SVR to tune the SVR parameters. Figure 6 presents the workflow of the proposed HHO-SVR model, which divides the procedure into three main phases: (1) pre-processing, (2) parameter tuning, and (3) prediction and evaluation. Additionally, the pseudo-code for the HHO-SVR algorithm is provided in Algorithm 2.

Fig. 6
figure 6

The Workflow of the suggested HHO-SVR model.

Algorithm 2
figure b

Pseudo-code of HHO-SVR.
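
To connect the two components, each hawk position is interpreted as a candidate pair of SVR hyperparameters and scored by the forecasting error. The sketch below shows one plausible fitness evaluation in Python with scikit-learn; the (C, \(\sigma\)) encoding, the fixed \(\varepsilon\), and the hold-out split are illustrative assumptions rather than the exact MATLAB configuration used in the experiments, which relied on 10-fold cross-validation.

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.metrics import mean_absolute_percentage_error
from sklearn.model_selection import train_test_split

def svr_fitness(position, X, y):
    """Decode a 2-D hawk position into SVR hyperparameters and return the MAPE objective."""
    C, sigma = np.clip(position, 1.0, 1000.0)     # search bounds used in the experiments
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
    model = SVR(kernel="rbf", C=C, gamma=1.0 / (2.0 * sigma**2), epsilon=0.1)
    model.fit(X_tr, y_tr)
    return mean_absolute_percentage_error(y_te, model.predict(X_te))   # Eq. (20)
```

Each metaheuristic in the comparison minimizes such a fitness function; only the position-update rules differ.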

Objective function

To assess the efficacy of the proposed model, the HHO solutions are rigorously evaluated throughout the iterative process. During prediction, the data is partitioned into a training subset and a testing subset. The MAPE serves as the objective function employed by HHO, acting as a statistical gauge of the precision of the prediction model. MAPE averages the absolute percentage errors over the test samples, providing a straightforward, scale-independent measure; its advantage lies in offering a clear view of the actual prediction error, making it easily interpretable and quantifiable.

$$\begin{aligned} MAPE = \frac{1}{T}\sum \limits _{i = 1}^T {\left| {\frac{{{real_i} - {pred_i}}}{{{real_i}}}} \right| } \end{aligned}$$
(20)

where \({real_i}\) and \({pred_i}\) denote the observed and predicted values, respectively, and T is the number of test samples.
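
For reference, Eq. (20) can be written directly as a short NumPy function (a straightforward transcription that assumes the observed values are non-zero):

```python
import numpy as np

def mape(real, pred):
    """Mean absolute percentage error, Eq. (20)."""
    real = np.asarray(real, dtype=float)
    pred = np.asarray(pred, dtype=float)
    return np.mean(np.abs((real - pred) / real))
```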

Computational complexity of HHO-SVR

To evaluate the computational complexity of the proposed HHO-SVR hybrid model, we analyze the time and space requirements. For the HHO, the complexity of each iteration is O(ND), where N is the population size and D is the problem dimension. The SVR model involves solving a quadratic optimization problem with complexity \(O(n^3)\), where n is the number of training samples. Thus, the overall complexity of HHO-SVR is:

$$\begin{aligned} O(T \cdot (ND + n^3)), \end{aligned}$$
(21)

where T is the number of iterations. This makes HHO-SVR computationally efficient for moderate data sizes.

Theoretical justification for the proposed HHO-SVR

The improved performance of the HHO-SVR model can be attributed to the synergistic integration of HHO and SVR. The key features include:

  • The exploration and exploitation mechanisms in HHO, such as soft and hard besiege strategies, enhance the search for optimal SVR parameters.

  • The use of Levy flight-based randomization allows HHO to escape local minima, ensuring robust convergence.

  • The regularization and kernelization capabilities of SVR improve model flexibility and generalization, leading to higher predictive accuracy.

The theoretical studies in3 and71 confirm that such hybridization can effectively mitigate overfitting and improve computational efficiency.

Experimental results analysis and discussion

To mitigate potential bias in the selection of testing and training sets, this study employs 10-fold cross-validation for SVR. The proposed model’s efficiency is evaluated by comparing it with seven other well-known algorithms, all implemented in Matlab according to their original, thoroughly documented studies. Specifically, each comparative algorithm is combined with the same well-established machine learning model, SVR. Five \(PM_{2.5}\) datasets are utilized to evaluate the proposed model’s effectiveness, and each comparative model is executed 10 times with 30 agents and 50 iterations.

The experiments are conducted on a machine with an Intel(R) Xeon(R) CPU E5-2698 v3 @ 2.30GHz (32 CPUs), 2.3GHz, 524288MB RAM, running Windows Server 2016 Datacenter, and MathWorks Matlab. The parameters for all evaluated approaches are defined as follows: 30 agents, a dimensionality of 2, 50 cycles, 10 independent trials, a minimum bound of 1, and a maximum bound of 1000. A selection of contemporary meta-heuristic algorithms, such as WOA, GWO, SSA, EO, HHO, HGSO, BMO, and MRFO, are considered for assessing the proposed technique. Each comparative algorithm employed an identical number of stochastic solutions. The objective function (fobj) is elaborated in earlier sections.

It is important to note that the parameter settings for all methods are summarized in Table 3. All tests were executed with an equivalent number of iterations (i.e., 50) and independent executions (i.e., 10).

Table 3 Parameter settings.
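
For reproducibility, the shared experimental settings listed above can be collected in a small configuration structure; the dictionary below simply restates those values (the key names, and the assumption that the two decision variables are the SVR hyperparameters, are illustrative).

```python
# Shared settings applied to every metaheuristic-SVR hybrid in the comparison.
EXPERIMENT_CONFIG = {
    "n_agents": 30,         # population size
    "dimension": 2,         # two decision variables (assumed to be the SVR hyperparameters)
    "n_iterations": 50,     # cycles per run
    "n_runs": 10,           # independent trials per algorithm and dataset
    "lower_bound": 1.0,     # minimum value of each decision variable
    "upper_bound": 1000.0,  # maximum value of each decision variable
    "cv_folds": 10,         # 10-fold cross-validation for SVR
}
```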

Data description

The dataset includes values for particulate matter levels (\(PM_{2.5}\)) generated by the Environmental Protection Agency’s (EPA) Downscaler model. These data are used by the Centers for Disease Control and Prevention’s (CDC) National Environmental Public Health Tracking Network to calculate air quality metrics. The dataset provides county-level information from 2001 to 2014, including maximum, median, mean, and population-weighted mean concentrations of \(PM_{2.5}\). The Downscaler \(PM_{2.5}\) dataset is derived from a Bayesian downscaling fusion model, combining \(PM_{2.5}\) observations from the EPA’s Air Quality System with simulated data from the Models-3/Community Multiscale Air Quality (CMAQ) deterministic prediction model. Raw data processing involved extracting air quality monitoring data from the NAMS/SLAMS network, limited to Federal Reference Method (FRM) samplers. CMAQ data, from version 4.7.1 using the Carbon Bond Mechanism05 (CB-05), provides daily 24-hour average \(PM_{2.5}\) concentrations on a 12 km x 12 km grid for the continental U.S. Additional processing standardized variable names and expanded FIPS variables into statefips and countyfips. Daily maximum, mean, median, and population-weighted values were computed for each county based on census tract estimates and 2010 U.S. Census tract-level population data. The Downscaler model synthesizes monitoring data and estimated \(PM_{2.5}\) concentration surfaces from CMAQ to predict \(PM_{2.5}\) levels across space and time, using optimal linear relationships to derive predictions and associated standard errors. Data can be found here (https://data.cdc.gov/Environmental-Health-Toxicology/Daily-County-Level-PM2-5-Concentrations-2001-2014/qjju-smys/about_data).

The data utilized in this experiment are summarized in Table 4, and the descriptive statistics are summarized in Table 5. Potential limitations of the used dataset include:

  • Geographic bias: the data do not represent all regions.

  • Temporal bias: seasonal variations may affect model performance across years.

Future studies should incorporate more diverse datasets to improve model generalizability.

Table 4 Description of dataset variables (24-hour average).
Table 5 Descriptive Statistics on datasets.

Data preprocessing

To improve forecasting results, we normalize the data using the min-max scaling technique as described by72, following Eq. (22):

$$\begin{aligned} \hat{x_{i}} = \frac{x_{i}-\min (x)}{\max (x)-\min (x)}, \quad i\in [1,2,...,n] \end{aligned}$$
(22)

where \(\hat{x_{i}}\) represents the normalized value within the n samples at index i.
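
Equation (22) is standard min-max scaling; a short NumPy version is shown here for reference (per-feature scaling is assumed).

```python
import numpy as np

def min_max_normalize(x):
    """Scale each column of x to [0, 1] following Eq. (22)."""
    x = np.asarray(x, dtype=float)
    return (x - x.min(axis=0)) / (x.max(axis=0) - x.min(axis=0))
```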

Evaluation metrics

The proposed approach is validated and assessed using the following metrics, computed from the best objective value fobj achieved in each run i (a compact sketch that computes these statistics is given after the list):

  • Average value of the objective function achieved by running the method M times, computed using Eq. (23).

    $$\begin{aligned} Average = \frac{{\sum \nolimits _{i = 1}^M {fobj_i} }}{M} \end{aligned}$$
    (23)
  • Standard Deviation (SD) is used to measure the variance of the objective function calculated from running the method M times. It indicates the integrity and robustness of the model. Large SD values suggest inconsistent results, whereas smaller values indicate convergence of the algorithm to similar results across runs. SD is calculated using Eq. (24).

    $$\begin{aligned} SD = \sqrt{\frac{1}{{M - 1}}\sum \nolimits _{i = 1}^M {{{\left( {fobj_i - mean} \right) }^2}} } \end{aligned}$$
    (24)
  • The best objective function corresponds to the minimum objective value achieved by the method over M runs. This value is computed using Eq. (25).

    $$\begin{aligned} Best = \mathop {\min }\limits _{i = 1}^M \left( {fobj_i} \right) \end{aligned}$$
    (25)
  • The worst objective function is the highest objective value obtained from running the algorithm M times. This value is calculated using Eq. (26).

    $$\begin{aligned} Worst = \mathop {\max }\limits _{i = 1}^M \left( {fobj_i} \right) \end{aligned}$$
    (26)
  • CPU Time refers to the total time the central processing unit (CPU) utilizes to run the model M times.
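
The run statistics of Eqs. (23)–(26), together with the accumulated CPU time, can be computed as follows (a small NumPy sketch; the dictionary keys are illustrative).

```python
import numpy as np

def run_statistics(fobj_values, cpu_times):
    """Summarize M independent runs: Eqs. (23)-(26) plus total CPU time."""
    fobj_values = np.asarray(fobj_values, dtype=float)
    return {
        "Average": fobj_values.mean(),         # Eq. (23)
        "SD": fobj_values.std(ddof=1),         # Eq. (24), sample standard deviation
        "Best": fobj_values.min(),             # Eq. (25)
        "Worst": fobj_values.max(),            # Eq. (26)
        "CPU time": float(np.sum(cpu_times)),  # total CPU time over the M runs
    }
```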

Results analysis and discussion

The empirical results of HHO are compared with those of the other hybrid approaches. Table 6 presents the results of the proposed model using Friedman’s ANOVA test on all datasets. The rank column indicates each algorithm’s ranking among the comparative methods based on Friedman’s ANOVA. The proposed approach demonstrated significant improvements, achieving the best results in three of the eight experiments.

Table 6 Results of the Friedman Test on the datasets and approaches.

The algorithms were ranked on the basis of their performance on several criteria: best, worst, average, standard deviation, and CPU time. The best-performing algorithm is highlighted in boldface. Tables 7, 8, 9, 10, and 11 present the MAPE results, comparing the proposed approach with the seven recent approaches. The first column indicates the run number, while the subsequent columns represent the compared algorithms. Additionally, Tables 12, 13, 14, 15, and 16 list the measured values for all methods, encompassing the best, worst, average, and standard deviation values, alongside CPU time, the C value, and the Alpha value.

In Table 9, it is evident that the hybrid SVR models achieved superior results with HHO compared to other comparative algorithms. The proposed approach displayed robust search capabilities, consistently nearing optimal solutions in various runs according to the MAPE metric. Similarly, Tables 7 and 10 underscore the consistent performance of the proposed hybrid approach in securing the best solution (Fig. 7).

Results varied across datasets. For instance, GWO demonstrated promising outcomes in the initial four runs, as depicted in Tables 7 and 8, whereas MRFO consistently outperformed other methods in all runs of dataset 1003. BMO closely followed HHO in achieving favorable results, as evident in Table 11, and exhibited superior performance in the initial six runs, as indicated in Table 10.

Figure 8 illustrates the accuracy of the different algorithms across the datasets, showing the value obtained in each run. HHO consistently outperformed all other methods in Fig. 8c, as well as in some runs depicted in Fig. 8a and d.

In addition, radar charts of MAPE across all time horizons for the different models are depicted in Fig. 7. These charts summarize the error of all selected models for each dataset. HHO consistently demonstrated superior performance across almost all datasets, while WOA produced inferior results for most runs, as indicated by its radar lines enclosing those of the other methods.

Furthermore, Fig. 9 illustrates the convergence curves of various methods. As training iterations increase, the MAPE values calculated from different metaheuristics decrease. In Fig. 9a, WOA initially exhibited the highest MAPE results, while GWO outperformed other metaheuristic methods on the 1001 dataset in subsequent iterations. Similarly, in Fig. 9b, HHO initially showed the highest MAPE results, while EO outperformed other methods on dataset 1003 in subsequent iterations. Figure 9c–e also depict the convergence behaviour of different methods in the datasets 1005, 1007, and 1009, respectively.

Table 7 MAPE Comparative performance metric results between the proposed HHO-SVR model and other hybrid SVR approaches for dataset 1001 (50 Iterations and 10 Runs).
Table 8 MAPE Comparative performance metric results between the proposed HHO-SVR model and other hybrid SVR approaches for dataset 1003 (50 Iterations and 10 Runs).
Table 9 MAPE comparative performance metric results between the proposed HHO-SVR model and other hybrid SVR approaches for dataset 1005 (50 Iterations and 10 Runs).
Table 10 MAPE comparative performance metric results between the proposed HHO-SVR model and other hybrid SVR approaches for dataset 1007 (50 Iterations and 10 Runs).
Table 11 MAPE comparative performance metric results between the proposed HHO-SVR model and other hybrid SVR approaches for dataset 1009 (50 Iterations and 10 Runs).
Table 12 MAPE statistics for 1001 dataset.
Table 13 MAPE statistics for 1003 dataset.
Table 14 MAPE statistics for 1005 dataset.
Table 15 MAPE statistics for 1007 dataset.
Table 16 MAPE statistics for 1009 dataset.
Fig. 7
figure 7

MAPE values on the proposed datasets.

Fig. 8
figure 8

Accuracy on the proposed datasets.

Fig. 9
figure 9

Sample of convergence curves on the proposed datasets.

The HHO-SVR model exhibits enhanced performance relative to alternative optimization algorithms, attributed to its distinctive integration of the HHO and SVR methodologies. The HHO algorithm efficiently balances exploration and exploitation in parameter tuning by employing mechanisms such as soft and hard besiege strategies, along with Lévy flight-based randomization. These mechanisms enable HHO to explore diverse areas of the search space while concentrating the search on promising solutions, thereby ensuring effective optimization of the SVR hyperparameters. The adaptability of HHO-SVR facilitates enhanced prediction accuracy, especially in high-dimensional parameter spaces where conventional optimization algorithms frequently stagnate. HHO effectively addresses the complexities of the \(PM_{2.5}\) forecasting problem through dynamic adjustment of the escape energy and jump strength.

The model’s performance undergoes further validation via statistical tests and supplementary metrics. Friedman’s ANOVA test indicates that HHO-SVR consistently achieves the highest predictive accuracy across various datasets, with statistically significant p-values (<0.01) noted in the majority of instances. The comparative analysis of convergence curves indicates that HHO-SVR demonstrates a faster convergence rate than alternative algorithms, reaching optimal solutions in fewer iterations. Furthermore, HHO-SVR exhibits enhanced robustness, characterized by diminished variability in MAPE across various iterations, signifying a reduction in overfitting and consistent performance. The findings, along with HHO-SVR’s superior performance in MAPE and CPU time metrics compared to competing algorithms, highlight its efficacy in air quality forecasting and its potential for wider environmental applications.

Conclusion and future directions

In this study, we introduced an innovative hybrid approach for predicting \(PM_{2.5}\) concentrations, the HHO-SVR model, which combines Support Vector Regression (SVR) with the Harris Hawks Optimization (HHO) algorithm. Through experimentation and comparison with seven other optimization algorithms, the proposed HHO-SVR demonstrated promising performance in specific scenarios. Furthermore, the proposed HHO-SVR model consistently showed superior predictive accuracy, achieving the best results in three of the eight experiments across five distinct \(PM_{2.5}\) datasets. Statistical analysis using Friedman’s ANOVA test affirmed the robustness and high ranking of the HHO-SVR model, highlighting its effectiveness in diverse environmental contexts. Overall, the hybrid HHO-SVR model outperformed competing approaches, demonstrating its potential for practical application in environmental monitoring and management. In conclusion, this study presented a promising avenue for improving \(PM_{2.5}\) prediction precision by exploiting the synergistic benefits of SVR and HHO. The demonstrated superiority of the proposed HHO-SVR model underscores its potential to advance environmental forecasting capabilities, enabling informed decision-making for sustainable environmental management. In future work, the proposed HHO-SVR model will be applied to other climate-related problems, such as forecasting temperature increases and other climate change factors.