Introduction

With the advancement of drone technology, aerial devices have become faster, more user-friendly, and cost-effective. This has enabled drones to find widespread application across various fields. When integrated with wireless sensor networks, they can collaborate to offer a solution that is both reliable and resilient. This synergy not only enhances the efficiency and scope of data collection but also strengthens the system’s stability and adaptability, ensuring high performance even in complex or harsh environments. These advantages have resulted in the frequent deployment of UAVs in applications such as long-distance data collection1, route inspection2, and logistics distribution3. However, determining the optimal path for asynchronous UAVs to maximize the acquisition of sensing information while adhering to energy constraints remain a significant challenge4,5,6.

Initially, several scholars focused on optimizing the path of individual UAVs7,8,9. Mignardi et al.7 treated UAVs as mobile base stations, optimizing their paths under the constraints of limited wireless resources. Xie et al.8 addressed the problem of planning an optimal path for a single UAV to efficiently cover multiple, separated convex polygonal regions. Alemayehu et al.9 investigated the impact of sensor failures due to node or link faults, aiming to enhance the fault tolerance and data acquisition efficiency of a single UAV. As UAV applications expanded, the issue of multi-UAV collaboration attracted increasing scholarly attention10,11,12. For example, Ren et al.10 suggested that each UAV could be launched from one node and retrieved at another, employing heuristic dynamic programming to address UAV scheduling on fixed routes. Chen et al.11 explored the use of multiple UAVs or a UAV network for remote sensing applications, focusing on the rapid acquisition of large-scale, ultra-high-resolution data. Jiang et al.12 considered factors such as UAV energy consumption, payload, and distribution center conditions to minimize the total distribution cost, developing a multi-type, multi-distribution UAV model to enhance UAV delivery capabilities. Given the limited battery capacity of UAVs, some researchers shifted their focus to the energy consumption of UAV flights in path planning13,14,15. For instance, Stodola13 addressed the path planning of UAV formations for sustained surveillance of specific areas. Kyriakakis et al.14 proposed a hybrid algorithm variant to tackle UAV path planning for pickup and delivery tasks. Lee et al.15 developed a multi-agent system capable of multi-path planning, incorporating the energy consumption of UAV logistics deliveries. Huang et al.16 established a model for multi-UAV inspection route planning in wind farms, aimed at minimizing total flight energy consumption. Although these studies accounted for energy consumption during steady flight, UAVs, in practice, operate in a variety of flight states, each with distinct energy demands.

To ensure UAVs can return to base smoothly, it is crucial to consider the energy consumption of different flight states and design precise flight energy consumption models. Dong et al.17 derived accurate UAV energy consumption calculations under various conditions of turning angles and flight speeds, and based on this model, designed a path planning model for individual UAVs. Zhu et al.18 considered the maximum curvature constraints of UAV flight, proposing a rapid assessment path model for multiple UAVs post-earthquake. To reduce the total energy consumption of shipborne UAVs collecting sensor data, Huang et al.19 considered factors such as wind-hover, data collection, and wireless communication, proposing a UAV path planning model; Huang et al.20 also developed an automatic inspection system model for UAVs in offshore wind farms assisted by MEC servers and near-Earth orbit satellites, optimizing the total inspection energy consumption. However, previous studies did not consider multiple energy consumption due to steady level flight, acceleration, deceleration, turning and climbing in multiple unmanned aerial vehicles path planning. To construct an accurate flight energy consumption model, this paper considers the energy consumed during steady horizontal flight, acceleration, deceleration, turning, and climbing, aiming to minimize the total cruising time of UAV paths, and proposes a Multiple unmanned aerial vehicles path planning model marked as MUAVPP-MEC.

Particle Swarm Optimization (PSO) is widely employed to solve combinatorial optimization problems, such as the path planning of multiple unmanned aerial vehicles21,22,23. However, traditional PSO faces challenges with slow convergence and low precision when tackling large-scale discrete problems. In response to these limitations, Chen et al.24 introduced a more effective Bee-Foraging Learning Particle Swarm Optimization in 2021. Building upon this, the present paper proposes an improved Bee-Foraging Learning Particle Swarm Optimization for MUAVPP-MEC, incorporating an efficient encoding and decoding strategy to transform problem-solving spaces into algorithmic search spaces, and utilizing an energy-constrained 2-opt local search strategy to enhance the algorithm’s optimization capabilities.

In summary, the contributions of this paper are as follows.

In the context of UAV data collection, this study considers the energy consumption associated with UAVs ascending to various altitudes at different collection points. Additionally, as rotary UAVs alter their flight direction to target different angles, angular energy consumption must also be calculated. Consequently, this paper introduces a model for Multi-UAV Path Planning Considering Multiple Energy Consumptions (MUAVPP-MEC). This model takes into account energy consumption during steady-level flight, acceleration, deceleration, ascent, and turning, all under the constraint of maximum energy storage, with the aim of minimizing the total cruising time of UAVs. The MUAVPP-MEC model is validated through multiple instance sizes in Experiment 1.

Tailored to the characteristics of the MUAVPP-MEC, this paper designs an Improved Bee Foraging Learning Particle Swarm Optimization algorithm (IBFLPSO) to solve the problem. The algorithm builds upon Bee Foraging Learning Particle Swarm Optimization25,26, incorporating an efficient encoding and decoding strategy that translates problem solution spaces into algorithmic search spaces. The integration of an energy-constrained 2-opt local search further enhances the algorithm’s optimization capabilities. In Experiment 2, IBFLPSO was compared with four other algorithms, demonstrating superior performance in both the optimal and average solutions.

The structure of this paper is as follows: “Mathematical models” section introduces the MUAVPP-MEC model. “The proposed algorithm” section details the IBFLPSO algorithm used to solve the MUAVPP-MEC. “Experiment and analysis” section presents multiple cases of varying scales to demonstrate the effectiveness of the proposed model and solution method. Finally, “Conclusions and future works” section concludes the paper and discusses potential directions for future research.

Mathematical models

In this section, the details of the model are presented, along with the definition of the optimization objectives and associated constraints.

Problem description

The MUAVPP-MEC involves several nodes, with one serving as the central point and the others as data collection points, each positioned at different altitudes. A limited number of UAVs are stationed at the central point, each having a maximum battery capacity. These UAVs are tasked with flying a route through multiple data collection points to complete data acquisition tasks. The cruising process begins with departure from the central point, followed by acceleration to cruising speed and subsequent steady flight. Upon reaching each data collection point, IoT sensor data is gathered via onboard equipment. If energy permits, the UAV maintains its speed and turns toward the next collection point; otherwise, it returns to the central point, decelerating to a stop as it approaches the destination. The schematic diagram, shown in Fig. 1, illustrates the paths of two UAVs traversing nine data collection points. In this process, the varying altitudes of each node are considered, and the ascent and descent of UAV1 are depicted in Fig. 2. In this figure, UAV1 collects data from four collection points before returning to the starting point.

Fig. 1
figure 1

Schematic of the flight paths for UAV1 and UAV2.

Fig. 2
figure 2

Graphical representation of UAV1’s altitude changes.

To precisely define the data model for this problem, the following assumptions are made:

  1. 1.

    UAVs start from rest, accelerating at 1 m/s2 to a predetermined speed, after which they transition to steady flight. The distance covered during the acceleration phase (DAcc) and the time required (TAcc) are fixed values. As UAVs approach their targets, they decelerate from the set speed to zero at a rate of 1 m/s2. The distance covered during the deceleration phase (DDec) and the time required (TDec) are also fixed values.

  2. 2.

    The energy consumed during acceleration, steady horizontal flight, and deceleration is determined by polynomial equations17 derived from empirical data. Since both the acceleration and deceleration phases are fixed, the energy consumed during these phases (EAcc and EDec) are set values. The energy consumed during steady flight is the product of the power required and the flight time.

  3. 3.

    In addition to the distance-based energy, the energy consumed during flight also accounts for angular energy and climbing energy. Angular energy is a linear function17 of the flight turn angle. The climbing energy consumed when a UAV ascends from a lower to a higher node depends on the ascent height, UAV mass, and gravitational acceleration. No energy is consumed when descending from a higher to a lower node.

Objective function

The objective of the MUAVPP-MEC model is to minimize the total cruising time of the UAVs, as outlined in Eq. (1), which functions as the optimization criterion.

$$\begin{aligned} \min \quad Z = \sum _{k = 1}^K\left( \frac{\sum _{i\in V}\sum _{j\in V}c_{ij}x_{ijk}-DAcc-DDec}{SConst}+TAcc+TDec\right) . \end{aligned}$$
(1)

In Eq. (1), the UAV accelerates to a predetermined speed before transitioning into steady flight and decelerates to zero speed as it approaches the target. The distances flown during the acceleration and deceleration phases are denoted by DAcc and DDec, respectively, with the corresponding times required being TAcc and TDec. The cij represents the distance from node i to node j, and \(x_{ijk}\) are the decision variables, where 1 indicates that UAV k flies from node i to node j, and 0 otherwise. While Eq. (1) serves as the objective function of our Mixed-Integer Nonlinear Programming (MINLP) problem, in the IBFLPSO algorithm we propose, it forms part of the fitness function. Additional penalty terms, as outlined in Eq. (14) in “Fitness function” section, exist to suppress infeasible solutions during the search process in the solution space of IBFLPSO.

Constraint conditions

Path continuity constraints

A data collection point can be serviced by only one UAV, as specified in Eq. (2). No more than K UAVs may depart from the central point, as defined in Eq. (3). The number of UAVs arriving at and departing from each node must be balanced, as indicated in Eq. (4). Once a UAV reaches a node, it provides services as described in Eq. (5). Equation (6) is employed to prevent the formation of sub-loops.

$$\begin{aligned}&\mathrm {s.t.~}\sum _{k = 1}^{K}y_{ik} \forall i\in V\setminus \{0\}, \end{aligned}$$
(2)
$$\begin{aligned}&\sum _{k = 1}^{K}y_{1k}\le K, \end{aligned}$$
(3)
$$\begin{aligned}&\sum _{j\in \textrm{V}}x_{ijk} = \sum _{j\in \textrm{V}}x_{jik},\quad \forall i\in V,k = 1,...,K, \end{aligned}$$
(4)
$$\begin{aligned}&\sum _{j\in \textrm{V}}x_{ijk} = y_{ik},\quad \forall i\in V,k = 1,...,K, \end{aligned}$$
(5)
$$\begin{aligned}&\sum _{i\in S}\sum _{j\in S}x_{ijk}\le \begin{vmatrix}S\end{vmatrix}\text {-l}\quad \forall S\subseteq V\setminus \{0\},\begin{vmatrix}S\end{vmatrix}\ge 2,k = 1,...,K. \end{aligned}$$
(6)

Energy consumption constraints

Similar to literature27, the total energy consumption for UAV k comprises its distance energy consumption \(EDist_{k}\), angular energy consumption \(EAngl_{k}\), and climbing energy consumption \(EClimb_{k}\). The total energy consumption of a UAV must not exceed the maximum capacity of its battery, as stipulated in Eq. (7).

$$\begin{aligned} EDist_k+EAngl_k+EClimb_k\le Q,\forall k = 1,...,K. \end{aligned}$$
(7)
Distance energy consumption

After accelerating to the designated speed and entering steady flight, the UAV decelerates to zero speed as it approaches the target. The distances covered during the acceleration and deceleration phases are DAcc and DDec, respectively, with the energy required for these phases being EAcc and EDec, respectively. The distance remaining between the acceleration and deceleration phases of the UAV’s journey is covered at a constant cruising speed. SConst represents the constant cruising speed of the UAV, and PConst denotes the power required by the UAV during the steady flight phase. The energy consumption during steady flight is the product of the steady flight time and the required power, as specified in Eq. (8).

$$\begin{aligned} EDist_{k} = \sum _{k = 1}^{x}\left( \frac{\sum _{i\in V}\sum _{j\in V}c_{ij}x_{jk}-DAcc-DDec}{SConst}\times PConst+EAcc+EDec\right) . \end{aligned}$$
(8)
Angular energy consumption

Assume that nodes i, j, and l are not aligned linearly. The UAV flies from node i to node j and then turns towards node l. The angle between the extension line of i and j and the line segment jl is the rotation angle of the ijl flight, denoted as \(\theta _{ijl}\), as illustrated in Fig. 3.

Fig. 3
figure 3

Diagram of the flight angle.

According to literature28, fixed-wing UAVs incur additional energy consumption when changing direction, compared to straight flight at the same speed. To alter its flight direction, a fixed-wing UAV must roll into a banked position to counteract the centrifugal acceleration. The instantaneous power required for a banked level turn is determined by the speed, tangential acceleration, and centrifugal acceleration28. In contrast, rotary-wing UAVs can change direction with a constant angular velocity. According to literature17, when a rotary-wing UAV turns at an angular velocity \({\omega _t}\), the relationship between the energy consumption during the turn and the turning angle can be expressed as Eq. (9):

$$\begin{aligned} E_t\left( \theta \right) = P_t\frac{\theta }{\omega _t}. \end{aligned}$$
(9)

In Eq. (9), \(E_t\left( \theta \right)\) represents the energy consumption of the UAV at a turning angle \(\theta\), and \(P_t\) denotes the power consumed by the UAV during the turn. In literature17, experimental designs were used to determine the UAV’s power \(P_t\) during turning. The experimental results reveal that the energy consumption during flight is linearly related to the turning angle, with angular acceleration having a minimal effect on the energy consumption. This linear relationship is represented by \(CAng_{1}\theta _{ijl}+CAng_{2}\), where \(CAng_{1}\) and \(CAng_{2}\) are two constants obtained by linear fitting. It is assumed that for UAV k traveling from i to j to l, the energy consumed during the turn at node j is a linear function of the flight’s turning angle \(\theta _{ijl}\)17. The angular energy consumption for UAV k is outlined in Eq. (10).

$$\begin{aligned} EAngl_\kappa = \sum _{i\in V}\sum _{j\in V}\sum _{l\in V}x_{ijk}x_{jlk}(CAng_1\theta _{ijl}+CAng_2). \end{aligned}$$
(10)
Climbing energy consumption

When a UAV ascends from a lower to a higher node, it incurs additional energy consumption related to the altitude change. According to literature29,30, climbing energy consumption, i.e., the vertical power consumption, can be modeled as \(E_\textrm{v} = M\times G\times \Delta h\), where \(\Delta h\) represents the upward altitude change. In terms of the drone’s power output, as stated in literature31,32, power transfer efficiency must be considered to represent the ratio between the power output from the battery and the power supplied to the rotors of the drone. The total energy consumption for climbing during the flight is computed as shown in Eq. (11)32.

$$\begin{aligned} EClimb_k = \sum _{i\in V}\sum _{j\in V}x_{ijk}\frac{M\times G\times \max ((h_j-h_i),0)}{\eta }. \end{aligned}$$
(11)

What needs to be illustrated is that when a UAV carries a payload, such as an edge computing motherboard, the mass of the UAV increases, thereby incurring higher consumption of distance energy, angular energy, and climbing energy. In such cases, the values of PConst, EAcc and EDec in Eq. (8), the values of \(CAng_{1}\) and \(CAng_{2}\) in Eq. (10), and the value of M in Eq. (11) may need to be adjusted to higher corresponding values.

The proposed algorithm

This chapter initially outlines the fundamental concepts and framework of the improved bee foraging learning particle swarm optimization algorithm (IBFLPSO) employed in this study. It then details the encoding and decoding strategies implemented, along with the methodology for calculating the fitness function. Finally, the procedural flow of the IBFLPSO is illustrated through a flowchart.

Bee-foraging learning particle swarm optimization

Traditional particle swarm optimization (PSO)33 is a stochastic, parallel algorithm that leverages both the global best and individual historical bests to update the velocity and position of particles. At the outset, PSO initializes a set of particles, each characterized by its position, velocity, and personal best. These particles iteratively adjust their velocity and position by learning from both their individual best solutions and the global best solutions, progressively moving toward the optimal solution. The position update formula is expressed as follows:

$$\begin{aligned} {\left\{ \begin{array}{ll}p\nu _{i}(t+1) = wp\nu _{i}(t)+c_{1}pr_{1}(t)(pbest_{i}(t)-px_{i}(t))+c_{2}pr_{2}(t)(gbest_{i}(t)-px_{i}(t))\\ px_{i}(t+1) = px_{i}(t)+p\nu _{i}(t+1).\end{array}\right. } \end{aligned}$$
(12)

The variables \(pv_{i}(t)\) and \(px_{i}(t)\) denote the velocity and position of the i-th particle at generation t, respectively, while \(pv_{i}(t+1)\) and \(px_{i}(t+1)\) represent the velocity and position in the subsequent generation, \(t+1\). The terms \(pbest_{i}(t)\) and \(gbest_{i}(t)\) refer to the individual best position and global best position, respectively. The parameter w signifies the inertia factor, whereas \(c_{1}\) and \(c_{2}\) are learning coefficients, and \(pr_{1}\) and \(pr_{2}\) are random numbers in the range [0, 1]. It is important to note that the population in PSO tends to exhibit homogeneity, which often results in premature convergence, thereby trapping the algorithm in local optima. To mitigate this issue, Chen et al.24 introduced the Bee-Foraging Learning model (BFL) into PSO, culminating in the creation of the BFL-PSO algorithm. This algorithm executes a three-stage learning and optimization process: the initial learning stage, the dominance reinforcement stage, and the scout stage.

In brief, the initial learning stage resembles the traditional PSO algorithm, where each particle moves according to its velocity and position update rules to find a better solution. This process is primarily optimized based on the particle’s personal best position (\(pbest_{i}\)) and the global best position (\(gbest_{i}\)), and its mathematical formulation aligns with Eq. (12). The dominance reinforcement stage strengthens the search in the regions around particles with higher fitness, meaning that particles with higher fitness are more likely to be selected for further exploration, allowing for a deeper investigation of these promising areas to uncover potential high-quality solutions, thereby improving the solution quality. This step enhances the algorithm’s development capability, with the mathematical formula provided in Eq. (13). First, the fitness value \(fit(px_{i})\) of each particle is calculated, followed by \(f(pbest_{i})\), based on the fitness value of its \(pbest_{i}\). Then, the selection probability of the i-th particle is computed based on the fitness values. The scout stage, on the other hand, targets particles that have failed to improve their \(pbest_{i}\) after multiple iterations and are thus stuck in a stagnation state. These particles enter the scout learning phase, where they are reinitialized, meaning their velocity and position are randomly generated with new values, introducing new diversity into the system. This step provides the algorithm with the ability to escape local optima and ensures that, after prolonged stagnation, the algorithm can explore new solution spaces, thereby enhancing the global search efficiency.

$$\begin{aligned} \textrm{fit}(px_i) = {\left\{ \begin{array}{ll} \frac{1}{1+f(pbest_i)}, & \textrm{if }f(pbest_i)\ge 0, \\ \frac{1}{1+|f(pbest_i)|}, & \textrm{otherwise}. \end{array}\right. } \end{aligned}$$
(13)

In conclusion, the three stages work together to enable the BFL-PSO algorithm to achieve an effective balance between global and local search in complex optimization problems. The initial learning stage guides the particle search using both individual and global best information. The dominance reinforcement stage improves solution quality by thoroughly exploring high-potential areas. The scout stage introduces new diversity by reinitializing stagnant particles, preventing premature convergence of the algorithm. This multi-stage search strategy significantly enhances the performance of the BFL-PSO algorithm in solving complex optimization problems, making it highly suitable for solving the proposed MUAVPP-MEC model.

Improved bee foraging learning particle swarm optimization

Encoding and decoding strategies

The particle navigates the \(n-1\) dimensional search space in pursuit of the global optimum. This search space is derived from literature34, spanning the range \([-100,100]^{n-1}\). The encoding rule established in this paper specifies that each particle is represented as a real-number vector in \(n-1\) dimensions, corresponding to the number of collection points, with each component confined within the range [− 100, 100]. Figure 4 illustrates this configuration, where each component \(x_{i, 1},x_{i, 2},\ldots ,x_{i, k},\ldots ,x_{i,n-1}\) of particle i lies within the bounds \(-100\le x_{i, k}\le 100, 1\le k\le n-1\), as depicted in Fig. 4.

Fig. 4
figure 4

Diagram of the position vector of particle i.

The decoding procedure is outlined in detail as follows. The process consists of obtaining the sequence, segmenting the sequence, and optimizing the path.

Obtaining the sequence

The particle position vector is a real-number vector. The first step in the decoding process is to convert this real-number sequence into an integer sequence representing the collection points. The components of the real-number vector are sequentially numbered and then sorted according to their values, resulting in a sequence of component indices that correspond to the collection points. The node sequence obtained through this sorting process is shown in Fig. 5.

Fig. 5
figure 5

Diagram for the process of obtaining a sequence of nodes by sorting.

Segmenting the sequence

The preceding step converts the individual’s real-number encoding into an \(n-1\) dimensional integer node sequence. In the solution space, the route of a single UAV is represented using a path representation method35. The node sequence can be viewed as the interconnected collection point sequence of the cruising paths of multiple UAVs. For instance, the collection point sequence (4, 1, 5, 2, 8, 7, 3, 9, 6) represents the interconnected inspection paths of two UAVs, requiring the algorithm to divide the sequence into two segments, such as \(R_{1}\) = (0, 4, 1, 5, 2, 8, 0) and \(R_{2}\) = (0, 7, 3, 9, 6, 0). To finalize the multi-UAV cruising distance planning, the algorithm employs a greedy sequence segmentation decoding method. This strategy is regulated by the energy consumption required for distance, angle, and altitude changes, which must not exceed the UAV’s total battery capacity. Additionally, the calculation of distance-based energy consumption accounts for the distinct phases of acceleration, steady flight, and deceleration.

To illustrate the segmentation process, a 5-node example is constructed, where node information is as follows: (0,1850, 1750, 0), (1, 1050, 2900, 47), (2, 900, 1650, 105), (3, 500, 350, 8), (4, 2400, 1300, 132). Each node’s four numbers represent the index, x-coordinate, y-coordinate, and node altitude, respectively, with node 0 serving as the central point and the others as collection points. Constants for distance energy consumption are as follows: SConst = 15 m/s, PConst = 429.8705 VA, DAcc = 112.5 m, EAcc = 6096.3 W, DDec = 112.5 m, EDec = 4944.1875 W. Parameters for angular energy consumption are as follows: \(CAng_{1}\) = 5.3316, \(CAng_{2}\) = 104.65. Parameters for climbing energy consumption are as follows: M = 3.8 kg, G = 9.8 \(\text {m/s}^{2}\), \(\mu\) = 0.74, and the UAV battery’s maximum capacity Q = 200,000.0 J. s is defined as the current position index in the node sequence. A path matrix, denoted as Route, is used to store multiple cruising routes, with the number of rows equal to the number of available UAVs and the number of columns equal to \(n+1\).In summary, IBFLPSO achieves the transformation from the problem-solving space to the algorithm’s search space through an effective encoding strategy. This strategy maps the UAV’s route combinations to the particle position vectors in the algorithm’s search space, enabling the algorithm to efficiently handle and optimize these solutions.

The detailed process of the greedy sequence segmentation decoding method is illustrated in Fig. 6.

Fig. 6
figure 6

Schematic of the greedy sequence segmentation decoding.

Given the node sequence 2, 1, 3, 4, the decoding steps are as follows.

STEP 1 The UAV departs from node 0 and proceeds to node 2, calculating the remaining energy consumption. The current node is now node 2.

STEP 2 Node 1 is designated as the target. The energy required from node 2 to node 1 is first calculated, followed by the remaining energy from node 1 to node 0. If the remaining energy is greater than 0, node 1 becomes the current node.

STEP 3 Node 3 is designated as the target. The energy required from node 1 to node 3 is calculated, followed by the remaining energy from node 3 to node 0. If the remaining energy is less than 0, indicating that the UAV cannot reach node 3, it directly returns to the central point from node 1, generating the path Route[0] = 0,2,1,0.

STEP 4 The UAV departs from node 0 with its remaining energy equal to the maximum storage capacity of the battery and proceeds to node 3.

STEP 5 Steps 1 through 4 are repeated, resulting in the generation of the path Route[1] = 0,3,4,0.

Energy-constrain 2-opt path optimization

The results are subjected to energy-constrained 2-opt local optimization36. Each generated path undergoes 2-opt local optimization, taking into account that energy consumption for angle changes and altitude adjustments may increase on a shorter path. If the energy consumption of the optimized path exceeds the UAV’s maximum battery capacity, the new path is discarded, and the original path is retained.

Fitness function

In accordance with the problem definition presented in “Mathematical models” section, the MUPP-AEC algorithm utilizes a corresponding fitness function to evaluate the particle swarm, thereby selecting the most optimal individuals. The fitness function is defined as:

$$\begin{aligned} \begin{aligned}&fit = \sum _{k = 1}^K\left( \frac{\sum _{i\in V}\sum _{j\in V}c_{ij}x_{ijk}-DAcc-DDec}{SConst}+TAcc+TDec\right) \\&\quad +Penalty\times \sum _{k = 1}^K\max (EDist_k+EAngl_k+EClimb_k+EMgn_{k}-Q,0).\end{aligned} \end{aligned}$$
(14)

The Penalty in Eq. (14) is a large integer. \(EMgn_{k}\) represents the Energy Margin for UAV k to handle operational challenges, which increases the energy consumption of the drones due to factors such as varying weather conditions or sensor malfunctions. Therefore, the lower the cost of MUPP-AEC, the better the fitness, providing the particle with a higher probability of replacing the existing particle during the algorithm’s iterative process.

Algorithm flow

The flow diagram of the IBFLPSO algorithm is presented in Fig. 7. The first stage, initialization, involves randomly initializing particles, evaluating their fitness, and updating the swarm’s global optimum based on this fitness. Subsequently, BFLPSO is used to optimize the particles. The aforementioned encoding strategy is applied, followed by decoding using the 2-opt algorithm. The algorithm checks whether 500 consecutive iterations have been performed, and if so, it outputs the optimal solution and terminates. The complexity of the approach mainly depends on the complexity of the BFL discussed in “Bee-foraging learning particle swarm optimization” section, the decoding strategies in “Encoding and decoding strategies” section, and the energy-constrained 2-opt path optimization in “Energy-constrain 2-opt path optimization” section, with complexities of O(n), O(\(n^2\)) and O(\(n^2\)), respectively. Therefore, the overall complexity of the IBFLPSO algorithm is O(\(n^2\)).

Fig. 7
figure 7

Algorithm flow diagram of IBFLPSO.

Experiment and analysis

Experimental cases and simulation environment

For the simulation experiments, this study employed a hexacopter model X4108, outfitted with a battery capacity of 10,000 mAh and a voltage of 11.1 V, yielding an effective energy capacity of 399,600 J. The experimental dataset incorporates case studies from the literature37, with coordinate values scaled by a factor of 50, and altitude values generated via a random strategy, with the range of random numbers set between 0 and 150 m. The model parameters are presented in Table 1, with angular energy consumption data drawn from literature17, and climbing energy consumption referenced from literature38,39. The simulation environment was configured on a Windows 10 platform, supported by an Intel(R) Core(TM) i5-10400F CPU @ 2.90GHz, complemented by 16 GB of RAM. To ensure consistency in analysis conditions, the population size for each algorithm was set to 40, with each algorithm undergoing 500 iterations. In pursuit of robustness and dependable results, each algorithm was independently executed 10 times across various case scenarios.

Table 1 Model parameters.

Experimental results and analysis

Experiment 1: model analysis

To assess the efficacy of the MUAVPP-MEC model, this experiment incorporated three case studies, each with 30, 40, and 50 collection points, respectively. The outcomes of these cases, resolved using the IBFLPSO, are presented in Table 2. The data presented in this table indicates that, when solving Case Study 1, the MUAVPP-MEC model exhibited the most consistent total cruising time and energy consumption. As the number of collection points increased to 40, compared to Case Study 1, the average total cruising time rose by 66.47%, while the average total energy consumption increased by 65.87%. In the larger Case Study 3, the average total cruising time amounted to 3831.86 seconds, with a total energy consumption of 1,728,929 Joules. Figures 8 and 9 visually illustrate these findings, providing comparisons of the optimal solution times and total energy consumption across different case studies over ten iterations.

Table 2 Results of model testing.
Fig. 8
figure 8

Comparative graph of total cruising times.

Fig. 9
figure 9

Comparative graph of total energy consumption.

As depicted in Figs. 8 and 9, with the expansion of the collection points, both the total cruising time and energy consumption of the UAVs exhibited a substantial increase. This aligns with real-world UAV flight scenarios, thereby indirectly confirming the accuracy of our model. The optimal solutions for the three cases are presented in Fig. 10.

Fig. 10
figure 10

Optimal path graphs for the three case studies.

As shown in Fig. 10, Case Studies 1, 2, and 3 require 2, 3, and 5 UAVs, respectively, to collect data, with each UAV following a distinct, non-intersecting flight path. To evaluate the scheduling effectiveness of the MUAVPP-MEC model, this experiment centers on the optimal solution from Case Study 1, as outlined in Table 3.

Table 3 Specific details of the optimal solution for Case Study 1.

As shown in Table 3, UAV1 departs from the starting point, visits 26 distinct collection points, and returns to the origin, while UAV2 follows a similar route, visiting 4 unique points. Both UAVs efficiently manage their distance, angular, and climbing energy consumption, returning before their battery reserves are exhausted, thereby adhering to the model’s constraints. These results affirm that IBFLPSO is capable of effectively solving the MUAVPP-MEC model.

Experiment 2: comparison of algorithms

To evaluate the optimization performance of the proposed algorithm, five algorithms were employed to solve the MUAVPP-MEC model, and the resultant data were compared. The first algorithm is a traditional PSO, the second is a traditional PSO enhanced with the energy-constrained 2-opt modification (denoted as PSO-2OPT), the third is the GA algorithm, the fourth is the BFLPSO algorithm, and the fifth is the proposed method. All five algorithms were tested on a 51-sample point case. The iterative processes for obtaining optimal solutions with these algorithms are shown in Fig. 11.

Fig. 11
figure 11

Iterative graphs showing optimal solutions obtained by the five algorithms.

Figure 11 shows that IBFLPSO outperforms the other four algorithms in terms of convergence speed and effectiveness. Additionally, Figs. 12 and 13 provide a detailed comparison of the results from ten independent experiments involving the five algorithms.

Fig. 12
figure 12

Comparison graph of total energy consumptions from 10 independent experiments.

Fig. 13
figure 13

Comparison graph of cruising times from 10 independent experiments.

As shown in Figs. 12 and 13, IBFLPSO consistently outperforms PSO-2OPT, PSO, and BFLPSO across ten independent experiments. In comparison with the GA algorithm, IBFLPSO demonstrates clear superiority in seven out of the ten experiments. The specific experimental data are detailed in Table 4.

Table 4 Statistical results.

As detailed in Table 4, in the optimal solution, IBFLPSO saved 1,596,280 joules compared to the PSO (53.21% savings), Cruise time was reduced by 3704.15 s (54.64% superior); IBFLPSO saved total 1,297,972 joules from PSO-2OPT (48.04% savings), cruise time was reduced by 3008.44 s (49.45% superior); IBFLPSO saved 463,062 (24.80% savings) joules compared to GA, Cruise time was reduced by 1068.01 s (25.78% superior); Compared to BFLPSO, IBFLPSO saved 395,642 joules (21.99% savings), Cruise time was reduced by 914.24 s (22.92%superior). In the average solution, IBFLPSO saved 1,440,177 joules and 3332.65 s in cruise time compared to PSO-2OPT, 1,776,046 joules and 4155.02 s compared to PSO, 374,981 joules and 857.62 s in cruise time compared to GA, and 298,444 joules and 683.76 s compared to BFLPSO. Finally, in terms of the running time of the optimal solution, PSO-2OPT increases by 6.07 s compared to PSO, IBFLPSO increases by 85.88 s compared to BFLPSO, IBFLPSO needs longer running time than the other four algorithms, but will get better iterative results. These data confirm the validity of the 2-opt algorithm. Finally, IBFLPSO outperforms the other four algorithms in both optimal and average solutions, showing superior optimization capabilities and stability.

Conclusions and future works

This paper aims to minimize the combined cruising time of UAVs while considering energy expenditures associated with steady level flight, acceleration, deceleration, turning, and climbing maneuvers, by proposing the MUAVPP-MEC model. In IBFLPSO proposed, BFLPSO was integrated with energy-constrained 2-opt and an efficient encoding and decoding strategy was developed. The model and algorithm were validated across three distinct case studies, confirming the stability and efficacy of the proposed methods. Compared to four other algorithms, IBFLPSO achieved superior optimal and average solutions. The optimal solution of IBFLPSO is 54.64%, 49.45%, 25.78%, and 22.92% superior to those of the traditional PSO algorithm, PSO-2OPT algorithm, GA, and BFLPSO, respectively. Future work will further explore factors such as UAV payload to optimize the model and develop new solution algorithms.