Introduction

Unmanned platforms offer advantages of low cost, high convenience, and strong concealment, enabling efficient work and reduced personnel risk. They have wide applications across aerospace, maritime exploration and environmental monitoring1. Among them, path tracking2, trajectory tracking3,4 and target tracking5 are the core technologies for autonomous navigation and mission planning of unmanned platforms, and there are significant differences in the technical connotations of the three. Path tracking emphasizes time-constraint-free tracking of predefined geometric paths, trajectory tracking requires synergistic control of paths and temporal parameters, and target tracking focuses on real-time target state estimation. In this paper, we take target tracking as an entry point and focus on the trajectory optimization problem of unmanned platforms.

Methods to improve target tracking accuracy are divided into two main categories: the first is data processing optimization. Error in airborne unmanned platform marine target observation stems from sensor detection and attitude errors, which are generally composed of random error and systematic error6. Random error can be filtered out using algorithms like Kalman Filter (KF), Extended Kalman Filter (EKF) and Cubature Kalman Filter (CKF)7,8,9. Systematic error requires error reduction algorithms including real-time quality control, least squares10 , maximum likelihood11 , exact maximum likelihood estimation12 and other algorithms. In addition, some researchers13 proposed a target localization method using auxiliary beacons. The unmanned platform observes the auxiliary beacons and the target simultaneously to reduce systematic error.

The second is observation geometry optimization. The positional relationship between the platforms and the target affects tracking accuracy. This is referred to as the optimal observation configuration for an unmanned platform, determined by target observation. Fisher Information Matrix (FIM) is commonly used to analyze optimal observation configurations, which can be divided into two categories: A-optimal and D-optimal. A-optimal aims to minimize the inverse trace of the FIM, while D-optimal seeks to maximize the determinant of the FIM.SHI et al.14 extended the optimal observation configuration for UAVs based on distance and angle measurements to 3D space. Hung et al.15 proposed a multi-UAV system using heterogeneous sensors for tracking, with a cost function based on the D-optimal criterion for the problem of unknown target tracking. Chen et al.16 focused on sensor deployment optimization for arrival time direct localization in shallow water multipath environments, considering multipath signal loss and angle and distance constraints. They proposed an optimization problem based on the A-optimal criterion to determine the optimal sensor location. Wu et al.17 proposed a sensor deployment optimization method for the simultaneous time-of-arrival based multi-objective localization problem using A-optimal criterion, combining alternating minimization, the multiplier alternating direction method and maximizing minimization. However, most existing optimal observation configuration analysis based on FIM only considers the effect of observation random error on target tracking, with few considering the effect of observation systematic error.

Cooperative observation optimization is essentially a trajectory optimization problem, belonging to the control methods of airborne unmanned platforms. Common trajectory optimization methods for unmanned airborne platforms include optimal control18, guidance19 and intelligent algorithms20. Optimal control is a control strategy that optimizes the system performance index, and commonly uses dynamic programming, model predictive control and reinforcement learning. Literature21. proposed an end-to-end artificial intelligence framework to deal with the trajectory optimization problem for airborne unmanned platforms, significantly improving performance and efficiency. Literature22 proposed a dual-deep Q-network based trajectory optimization method for airborne unmanned platforms considering reconfigurable intelligent surfaces. Literature23 used a distributed model predictive control method to optimize the trajectory of an airborne unmanned platform in a dynamic environment. Although optimal control can achieve optimality and high accuracy, it has high computational complexity, strict model constraints, sensitivity to initial conditions, poor real-time performance and difficulty in dealing with complex constraints. Lyapunov potential field guidance method controls an airborne unmanned platform by forming a potential field around the guidance path, which has low computational complexity and real-time potential field changes, for rapid control and guidance. Literature24 controlled the airborne unmanned platform heading to complete tracking of straight lines, arcs and arbitrary paths based on the vector field. Literature25 designed the outer loop controller of the airborne unmanned platform to generate the required heading rate command via vector field . Although the guidance method has good real-time performance, it requires designing the guidance law based on the known optimal observation configuration. It cannot solve the trajectory optimization problem under complex unknown conditions. Compared to the guidance method, intelligent algorithms do not require an exact mathematical model. They can be solved by evaluating the objective function value, suitable for problems lacking accurate models. Intelligent algorithms have strong versatility and adaptability, enabling effective solutions to complex practical problems. Intelligent algorithms are widely used in unmanned platforms. Literature26 applied swarm intelligence algorithms to search and rescue scenarios of airborne unmanned platforms. Literature27 utilized a distributed control framework based on homing pigeon hierarchical strategy to address airborne unmanned platform swarming problems. Swarm intelligence algorithms are also widely used in trajectory optimization problems. Literature28 combined adaptive differential evolution with Nash optimization for trajectory optimization, improving target tracking accuracy and reducing complexity. The results showed that the new method performed better in real target trajectory tracking, airborne unmanned platform safety, and stability. Literature29 proposed an improved bat algorithm to optimize tracking trajectory of an airborne unmanned platform for dynamic intrusion target tracking in oilfield inspection. Literature30 proposed a new multi-objective trajectory optimization algorithm using a cut-and-fill coding strategy to optimize populations where individuals may have different lengths.

DBO31 is a new swarm intelligence algorithm proposed by XUE et al. in 2022. Compared to PSO32 , GWO33, SSA34, SCA35, MVO36 and HHO37, results proved the superiority of its performance. Once proposed, DBO had wide applications across various fields. Literature38 used DBO to determine optimal feature color combination for olfactory sensors to help determine fatty acids in stored wheat. Literature39 combined DBO with KELM (Extreme Learning Machine) to improve accuracy of recognizing internal leaks in unilateral and bilateral ball valves. Like other swarm intelligence algorithms. DBO also easily falls into local optima and has weak global exploration ability. To fully develop its performance, Literature40 improved IDBO to select hyperparameters for BiLSTM41,42,43 DBO algorithm is optimized in terms of population initialization, search strategy, and population diversity, effectively improving DBO performance. Existing studies have not used DBO for trajectory optimization of airborne unmanned platforms.

In summary, current research on optimizing unmanned platform cooperative observation of marine targets has two deficiencies: one is that the current observation optimization problem focuses on homogeneous platforms; the other is that previous research on trajectory optimization mainly considers the influence of random error, and rarely considers the influence of systematic error on tracking accuracy. In this paper, we address these problems, introduce the orientation system error of sensor observation, collaborate airborne unmanned platforms with sea unmanned platforms, and study the optimization method of multi-unmanned platforms collaborative observation of maritime targets.

The main contributions of this paper compared to previous studies are as follows:

  • (1) Based on the error reduction algorithm, first-order Taylor expansion of error is used to analyze azimuthal systematic error’s impact on tracking error. Optimal observation configurations for the airborne unmanned platform, maritime cooperative unmanned platform, and target are derived;

  • (2) AEDBO is proposed using improved Tent chaos mapping and center-of-mass contrastive inverse learning for population initialization, a new adaptive convergence factor, an adaptive nonlinear ball-rolling dung beetle tree-volume decreasing model, and an adaptive Cauchy-Gaussian mutation strategy.

  • (3) Taking airborne and maritime unmanned platforms as cooperative objects, AEDBO is used to optimize the flight trajectory of airborne unmanned platforms in safe and hazardous area environments to obtain the optimal observation configurations of the airborne unmanned platform, maritime cooperative unmanned platform and target, thereby improving target tracking accuracy under a receding horizon optimization framework.

The paper is organized as follows. Section II describes and models the problem. Section III analyzes the trajectory optimization of the airborne unmanned platform, based on the error reduction algorithm to analyze the optimal observation configuration under azimuthal system error. Section IV proposes an optimal control method for airborne unmanned platforms based on AEDBO, which consists of four improved methods: population initialization incorporating improved Tent chaos mapping and center-of-mass dyadic inverse learning, adaptive convergence factor, adaptive nonlinear rolling dung beetle decreasing number model, and adaptive Cauchy-Gaussian mutation strategy with complexity analysis. AEDBO solving steps for the optimal control method of airborne unmanned platforms are given. Section V conducts simulation experiment verification. The performance analysis of AEDBO is carried out firstly, and then the optimal trajectory optimization of the airborne unmanned platform in different environments is carried out using AEDBO. Section VI concludes with future outlook.

Problem description and models

Problem description

The marine target localization and tracking scenario for an airborne unmanned platform based on error reduction of a maritime cooperative unmanned platform is shown in Fig. 1. The airborne unmanned platform observes the maritime cooperative unmanned platform and target simultaneously via its radar, obtaining position and velocity information. It then eliminates systematic error effects on target localization and tracking through an error reduction algorithm. Finally, the platform obtains relatively accurate target state information via filtering estimation. In this case, observation configuration affects the performance of the error reduction algorithm. Due to the unmanned platform’s random initial position, ensuring an optimal observation configuration initially is challenging. The key of accurate target tracking via the maritime cooperative unmanned platform for airborne unmanned platform is determining the optimal observation geometry of the airborne unmanned platform, maritime cooperative unmanned platform, and marine target. The goal is to control the airborne unmanned platform to the optimal observation position.

Fig. 1
figure 1

Marine target tracking scenario by airborne and maritime unmanned platforms cooperation.

Unmanned platform motion model

Airborne unmanned platforms are typically controlled using a mathematical model that incorporates an autopilot, a practical approach for engineering applications. Each platform is assumed to have a low-order flight controller that adjusts speed and heading angular velocity to regulate the platform’s flight state. The fixed-height model of an airborne unmanned platform in three-dimensional space is formulated as:

$$\left( {\begin{array}{*{20}c} {\dot{x}} \\ \begin{gathered} {\dot{y}} \hfill \\ {\dot{h}} \hfill \\ \end{gathered} \\ {\dot{\psi }} \\ {\dot{v}} \\ {\dot{\omega }} \\ \end{array} } \right) = f({\varvec{x}}{\mathbf{,}}{\varvec{u}}) = \left( {\begin{array}{*{20}c} {v\cos \psi } \\ \begin{gathered} v\sin \psi \hfill \\ \, 0 \hfill \\ \end{gathered} \\ \omega \\ { - \frac{1}{{\tau_{v} }}v + \frac{1}{{\tau_{v} }}u_{v} } \\ { - \frac{1}{{\tau_{\omega } }}\omega + \frac{1}{{\tau_{\omega } }}u_{\omega } } \\ \end{array} } \right)$$
(1)

where,\({\varvec{x}} = [x,y,h,\psi ,v,\omega ]^{T}\) denotes the state of the airborne unmanned platform, including its position, heading angle, velocity, and heading angular velocity of the airborne unmanned platform. \(\tau_{v}\) and \(\tau_{\omega }\) are velocity and angular velocity time delay constants related to the platform and its flight state. \({\varvec{u}} = [u_{v} ,u_{\omega } ]^{T}\) constitutes the control input, indicating the autopilot speed control command and heading angular velocity control command. These commands adhere to the following constraints:

$$\left| {u_{v} - v_{0} } \right| \le v_{\max }$$
(2)
$$\left| {u_{\omega } } \right| \le \omega_{\max }$$
(3)

Here,\(v_{0}\) denotes the cruise speed of the platform.\(v_{\max }\) and \(\omega_{\max }\) correspond to the maximum range of the cruise speed variation and the maximum heading angular velocity respectively.

In order to facilitate receding horizon optimization control of the airborne unmanned platform, the continuous-time model in Eq. (1) is converted into the following discrete-time model using Euler integration:

$${\varvec{x}}_{k + 1} = f_{d} ({\varvec{x}}_{k} {\mathbf{,}}{\varvec{u}}_{k} ) = {\varvec{x}}_{k} + T_{s} f({\varvec{x}}_{k} {\mathbf{,}}{\varvec{u}}_{k} )$$
(4)

where \({\varvec{x}}_{k} = [x_{k} ,y_{k} ,\psi_{k} ,v_{k} ,\omega_{k} ]^{T}\), \({\varvec{u}}_{k} = [u_{vk} ,u_{\omega k} ]^{T}\) and \(T_{s}\) is the sampling time.

The motion of a maritime unmanned platform can be approximated as two-dimensional space motion, mainly involving speed and heading control. This motion model is similar to that of the airborne unmanned platform.

Airborne unmanned platform observation model

In the three-dimensional space, the state of the marine target at time k is denoted as \({\mathbf{X}}_{k} = [x_{t} ,y_{t} ,z_{t} ]_{k}^{T}\) , and the state of the airborne unmanned platform is \({\mathbf{U}}_{k} = [x_{u} ,y_{u} ,z_{u} ]_{k}^{T}\) . Relative vectors of the airborne unmanned platform and the target are \({\mathbf{r}}_{k} = [r_{x} ,r_{y} ,r_{z} ]_{k}^{T}\) ,\(r_{x} = x_{u} - x_{t}\) ,\(r_{y} = y_{u} - y_{t}\) ,\(r_{z} = z_{u} - z_{t}\) . The target information observed by the airborne unmanned platform is:

$${\mathbf{z}}\left( k \right) = {\mathbf{h}}\left( {{\mathbf{X}}_{k} ,{\mathbf{U}}_{k} } \right) + {\mathbf{v}}\left( k \right) = \left[ {\begin{array}{*{20}c} {\sqrt {r_{x}^{2} + r_{y}^{2} + r_{z}^{2} } } \\ {\tan^{ - 1} \left( {\frac{{r_{x} }}{{r_{y} }}} \right)} \\ {\tan^{ - 1} \left( {\frac{{r_{z} }}{{\sqrt {r_{x}^{2} + r_{y}^{2} } }}} \right)} \\ \end{array} } \right] + {\mathbf{v}}\left( k \right)$$
(5)

where \({\mathbf{v}}\left( k \right)\) denotes the observation noise at time k, obeying a zero-mean Gaussian distribution. The error covariance matrix is \(R = {\text{diag}}\left( {\sigma_{r}^{2} ,\sigma_{\alpha }^{2} ,\sigma_{\beta }^{2} } \right)\) , where \(\sigma_{r}\) ,\(\sigma_{\alpha }\) , and \(\sigma_{\beta }\) denote the standard deviation of the sensor measurement of distance, bearing, and pitch respectively.

Hazardous area model

Airborne unmanned platforms operating in complex environments at sea may face threats from other airborne platforms and incoming targets. Therefore, it is necessary to consider the impact of hazardous areas. Based on the literature13, hazardous areas are divided into collision avoidance and collision zones, and the hazardous area function is given by

$$O_{i} = \left\{ \begin{gathered} \left( {\frac{{R_{a}^{2} - \left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\|_{2}^{2} }}{{\left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\|_{2}^{2} - R_{c}^{2} }}} \right)^{2} ,R_{c} < \left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\| \le R_{a} \hfill \\ {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} 0{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} ,{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} \left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\| \ge R_{a} \hfill \\ \end{gathered} \right.$$
(6)

where \({{\varvec{\upchi}}}\) denotes the position of the airborne unmanned platform and \({\mathbf{P}}\) denotes the position of the obstacle. The collision radius \({{\varvec{\Phi}}}_{c} = \left\{ {{{\varvec{\upchi}}} \in R^{2} |\left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\| < R_{c} } \right\}\) and obstacle avoidance radius \({{\varvec{\Phi}}}_{a} = \left\{ {{{\varvec{\upchi}}} \in R^{2} |R_{c} < \left\| {{{\varvec{\upchi}}} - {\mathbf{P}}} \right\| \le R_{a} } \right\}\) are defined as shown in Fig. 2.

Fig. 2
figure 2

Hazardous area for the unmanned platform.

Error reduction model

This paper utilizes an error reduction algorithm based on a maritime cooperative unmanned platform for target tracking. Assuming the observation value of the airborne unmanned platform on the maritime cooperative unmanned platform is \(\left( {r + \Delta r,\alpha + \Delta \alpha ,\beta + \Delta \beta } \right)\) , where \(r\) ,\(\alpha\) and \(\beta\) represent the true values of distance, azimuth and elevation angle observed by the airborne unmanned platform respectively, and \(\Delta r\) ,\(\Delta \alpha\) and \(\Delta \beta\) are the corresponding observation errors. The position of the maritime cooperative unmanned platform in the airborne unmanned platform’s unstable carrier coordinate system is obtained

$${\mathbf{X}}_{co,u} = \left( {\begin{array}{*{20}c} {\left( {r + \Delta r} \right)\cos \left( {\alpha + \Delta \alpha } \right)\cos \left( {\beta + \Delta \beta } \right)} \\ {\left( {r + \Delta r} \right)\sin \left( {\alpha + \Delta \alpha } \right)\cos \left( {\beta + \Delta \beta } \right)} \\ {\left( {r + \Delta r} \right)\sin \left( {\beta + \Delta \beta } \right)} \\ \end{array} } \right)$$
(7)

In an ideal environment, there exists

$${\mathbf{X}}_{c} = {\mathbf{X}}_{f} + {\mathbf{T}}_{uts} ({\mathbf{P}}){\mathbf{X}}_{co,u}$$
(8)

where: \({\mathbf{X}}_{f}\) and \({\mathbf{X}}_{c}\) are the positions of the maritime cooperative and airborne unmanned platform in the Earth coordinate system,\({\mathbf{T}}_{uts}\) is the transformation from unstable to stable coordinate system, \({\mathbf{P}}\) is the attitude of the airborne unmanned platform.

However, in the real environment, Eq. (8) is difficult to establish due to attitude error, observation error and airborne unmanned platform position error. Therefore, Eq. (9) is constructed for error reduction

$${\text{f}} \left( {\mathbf{P}} \right) = \left| {{\mathbf{X}}_{f} + {\mathbf{T}}_{uts} ({\mathbf{P}}){\mathbf{X}}_{co,u} - {\mathbf{X}}_{c} } \right|$$
(9)

where:\({\text{f}} \left( {\mathbf{P}} \right)\) denotes the error between the position of the airborne unmanned platform observing the maritime cooperative unmanned platform and the true position of the maritime cooperative unmanned platform.

We solve \({\mathbf{P}}_{\min }\) to make \(\min {\text{f}} \left( {\mathbf{P}} \right) = {\text{f}} \left( {{\mathbf{P}}_{\min } } \right)\) . Then \({\mathbf{P}}_{\min }\) is used for attitude value for marine target tracking to improve tracking accuracy.

Trajectory optimization

Tracking error analysis based on error reduction

Airborne unmanned platform observation model approximation

Airborne unmanned platforms observing maritime cooperative unmanned platform without accounting for random error can be described as \({\mathbf{Z}}_{c,m}\)

$${\mathbf{Z}}_{c,m} { = }{\mathbf{Z}}_{c,t} + {\mathbf{E}}_{c,se}$$
(10)

where: \({\mathbf{Z}}_{c,t}\) and \({\mathbf{E}}_{c,se}\) are the truth value and systematic error of observing the maritime cooperative unmanned platform.

Converted to Cartesian Coordinates

$${\mathbf{X}}_{c,um} { = }{\mathbf{h}}_{str} ({\mathbf{Z}}_{c,t} + {\mathbf{E}}_{c,se} )$$
(11)

where: \({\mathbf{h}}_{str}\) denotes spherical coordinates in rectangular coordinates.

First-order Taylor expansion is at the truth value

$${\mathbf{X}}_{c,um} \approx {\mathbf{h}}_{str} ({\mathbf{Z}}_{c,t} ) + {\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se}$$
(12)

where: \({\mathbf{H}}_{str}\) is the Jacobian matrix of \({\mathbf{h}}_{str}\).

Without considering random error, the corrected attitude of the airborne unmanned platform is

$${\mathbf{P}}_{r} { = }{\mathbf{P}}_{t} + {\mathbf{P}}_{rs}$$
(13)

where: \({\mathbf{P}}_{t}\) and \({\mathbf{P}}_{rs}\) are the truth and correction value of the attitude.

Unstable coordinate system transformed to stable coordinate system

$${\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ){ = }{\mathbf{T}}_{uts} ({\mathbf{P}}_{t} + {\mathbf{P}}_{rs} )$$
(14)

First-order Taylor expansion is at the truth value

$${\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ) \approx {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ) + {\mathbf{dT}}(\varphi ){\mathbf{T}}(\theta ){\mathbf{T}}(\gamma )\varphi_{rs} + {\mathbf{T}}(\varphi ){\mathbf{dT}}(\theta ){\mathbf{T}}(\gamma )\theta_{rs} + {\mathbf{T}}(\varphi ){\mathbf{T}}(\theta ){\mathbf{dT}}(\gamma )\gamma_{rs}$$
(15)

where: \(\varphi\),\(\theta\),\(\gamma\) are yaw ,pitch and roll,\(\varphi_{rs}\)\(\theta_{rs}\)\(\gamma_{rs}\) are the systematic error of yaw ,pitch and roll.

$${\mathbf{T}}(\varphi ){ = }\left[ {\begin{array}{*{20}c} {{\text{cos}}\left( \varphi \right)} & {{\text{ - sin}}\left( \varphi \right)} & 0 \\ {{\text{sin}}\left( \varphi \right)} & {{\text{cos}}\left( \varphi \right)} & 0 \\ 0 & 0 & 1 \\ \end{array} } \right]\;{\mathbf{T}}(\theta ){ = }\left[ {\begin{array}{*{20}c} 1 & 0 & 0 \\ 0 & {{\text{cos}}\left( \theta \right)} & {{\text{ - sin}}\left( \theta \right)} \\ 0 & {{\text{sin}}\left( \theta \right)} & {{\text{cos}}\left( \theta \right)} \\ \end{array} } \right]\;{\mathbf{T}}(\gamma ){ = }\left[ {\begin{array}{*{20}c} {{\text{cos}}\left( \gamma \right)} & 0 & {{\text{sin}}\left( \gamma \right)} \\ 0 & 1 & 0 \\ {{\text{sin}}\left( \gamma \right)} & 0 & {{\text{cos}}\left( \gamma \right)} \\ \end{array} } \right]$$
$${\mathbf{dT}}(\varphi ){ = }\left[ {\begin{array}{*{20}c} {{\text{ - sin}}\left( \varphi \right)} & {{\text{ - cos}}\left( \varphi \right)} & 0 \\ {{\text{cos}}\left( \varphi \right)} & {{\text{ - sin}}\left( \varphi \right)} & 0 \\ 0 & 0 & 0 \\ \end{array} } \right]\;{\mathbf{dT}}(\theta ){ = }\left[ {\begin{array}{*{20}c} 0 & 0 & 0 \\ 0 & {{\text{ - sin}}\left( \theta \right)} & {{\text{ - cos}}\left( \theta \right)} \\ 0 & {{\text{cos}}\left( \theta \right)} & {{\text{ - sin}}\left( \theta \right)} \\ \end{array} } \right]\;{\mathbf{dT}}(\gamma ){ = }\left[ {\begin{array}{*{20}c} {{\text{ - sin}}\left( \gamma \right)} & 0 & {{\text{cos}}\left( \gamma \right)} \\ 0 & 0 & 0 \\ {{\text{cos}}\left( \gamma \right)} & 0 & {{\text{ - sin}}\left( \gamma \right)} \\ \end{array} } \right]$$

Let

$${\mathbf{dT}}({\mathbf{P}}_{r} ) = {\mathbf{dT}}(\varphi ){\mathbf{T}}(\theta ){\mathbf{T}}(\gamma )\varphi_{rs} + {\mathbf{T}}(\varphi ){\mathbf{dT}}(\theta ){\mathbf{T}}(\gamma )\theta_{rs} + {\mathbf{T}}(\varphi ){\mathbf{T}}(\theta ){\mathbf{dT}}(\gamma )\gamma_{rs}$$
(16)

Therefore:

$${\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ) \approx {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ) + {\mathbf{dT}}({\mathbf{P}}_{r} )$$
(17)

True positions of the maritime cooperative unmanned platform and target in the stable coordinate system with the airborne unmanned platform as the origin and considering the airborne unmanned platform’s sway, are respectively

$${\mathbf{X}}_{c,t} = {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ){\mathbf{X}}_{c,ut}$$
(18)
$${\mathbf{X}}_{t,t} = {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ){\mathbf{X}}_{t,ut}$$
(19)

where \({\mathbf{X}}_{c,ut}\), \({\mathbf{X}}_{t,ut}\) represent true positions of the maritime cooperative unmanned platform and target in the unstable coordinate system with the airborne unmanned platform as the origin.

Error calculation based on error reduction

Errors present after obtaining the corrected attitude \({\mathbf{P}}_{r}\) , theoretically

$${\mathbf{X}}_{c,t} = {\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ){\mathbf{X}}_{c,um}$$
(20)
$${\mathbf{X}}_{t,t} = {\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ){\mathbf{X}}_{t,um}$$
(21)

where \({\mathbf{X}}_{c,um}\), \({\mathbf{X}}_{t,um}\) represent observation value of the maritime cooperative unmanned platform and target in the unstable coordinate system with the airborne unmanned platform as the origin.

Integration of (18), (19), (20), (21)

$${\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ){\mathbf{X}}_{c,ut} - {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ){\mathbf{X}}_{t,ut} = {\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ){\mathbf{X}}_{c,um} - {\mathbf{T}}_{uts} ({\mathbf{P}}_{r} ){\mathbf{X}}_{t,um}$$
(22)

Simplified, it can be obtained that:

$${\mathbf{T}}_{uts} ({\mathbf{P}}_{t} )({\mathbf{X}}_{c,ut} - {\mathbf{X}}_{t,ut} ) = {\mathbf{T}}_{uts} ({\mathbf{P}}_{r} )({\mathbf{X}}_{c,um} - {\mathbf{X}}_{t,um} )$$
(23)

First order Taylor expansion

$$\begin{gathered} {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} )({\mathbf{X}}_{c,ut} - {\mathbf{X}}_{t,ut} ) \hfill \\ \approx ({\mathbf{T}}_{uts} ({\mathbf{P}}_{t} ) + {\mathbf{dT}}_{uts} ({\mathbf{P}}_{r} ))({\mathbf{X}}_{c,ut} + {\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se} - {\mathbf{X}}_{t,ut} - {\mathbf{H}}_{str} ({\mathbf{Z}}_{t,t} ){\mathbf{E}}_{t,se} ) \hfill \\ = {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} )({\mathbf{X}}_{c,ut} - {\mathbf{X}}_{t,ut} ) + {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} )({\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se} - {\mathbf{H}}_{str} ({\mathbf{Z}}_{t,t} ){\mathbf{E}}_{t,se} ) + \hfill \\ \, {\mathbf{dT}}_{uts} ({\mathbf{P}}_{r} )({\mathbf{X}}_{c,ut} - {\mathbf{X}}_{t,ut} ) + {\mathbf{dT}}_{uts} ({\mathbf{P}}_{r} )({\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se} - {\mathbf{H}}_{str} ({\mathbf{Z}}_{t,t} ){\mathbf{E}}_{t,se} ) \hfill \\ \end{gathered}$$
(24)

Let

$${\mathbf{e}}1 = {\mathbf{T}}_{uts} ({\mathbf{P}}_{t} )({\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se} - {\mathbf{H}}_{str} ({\mathbf{Z}}_{t,t} ){\mathbf{E}}_{t,se} )$$
(25)
$${\mathbf{e}}2 = {\mathbf{dT}}_{uts} ({\mathbf{P}}_{r} )({\mathbf{X}}_{c,ut} - {\mathbf{X}}_{t,ut} )$$
(26)
$${\mathbf{e}}3 = \, {\mathbf{dT}}_{uts} ({\mathbf{P}}_{r} )({\mathbf{H}}_{str} ({\mathbf{Z}}_{c,t} ){\mathbf{E}}_{c,se} - {\mathbf{H}}_{str} ({\mathbf{Z}}_{t,t} ){\mathbf{E}}_{t,se} )$$
(27)

Among them,\({\mathbf{e}}1\) is the main error caused by the sensor, \({\mathbf{e}}2\) is the main error caused by the attitude and \({\mathbf{e}}3\) is the coupling error between the sensor and the attitude, which is small and negligible compared to the former.

The final error mainly consists of

$${\mathbf{e}} = {\mathbf{e}}1 + {\mathbf{e}}2$$
(28)
$${\mathbf{e}}1 = \left[ {\begin{array}{*{20}c} {(\cos \varphi_{t} \cos \gamma_{t} + \sin \varphi_{t} \sin \theta_{t} \sin \gamma_{t} )\sigma_{1}^{e1} + (\cos \varphi_{t} \sin \gamma_{t} + \sin \varphi_{t} \sin \theta_{t} \cos \gamma_{t} )\sigma_{1}^{e1} - \sin \varphi_{t} \cos \theta_{t} \sigma_{3}^{e1} } \\ {(\sin \varphi_{t} \cos \gamma_{t} - \cos \varphi_{t} \sin \theta_{t} \sin \gamma_{t} )\sigma_{1}^{e1} + (\sin \varphi_{t} \sin \gamma_{t} - \cos \varphi_{t} \sin \theta_{t} \cos \gamma_{t} )\sigma_{2}^{e1} + \cos \varphi_{t} \cos \theta_{t} \sigma_{3}^{e1} } \\ {\sin \theta_{t} \sigma_{1}^{e1} + \cos \theta_{t} \sin \gamma_{t} \sigma_{2}^{e1} + \cos \theta_{t} \cos \gamma_{t} \sigma_{3}^{e1} } \\ \end{array} } \right]$$
(29)

Among them:

$$\begin{gathered} \sigma_{1}^{e1} = r_{c,se} \cos \alpha_{c,t} \cos \beta_{c,t} - r_{t,se} \cos \alpha_{t,t} \cos \beta_{t,t} - \alpha_{c,se} r_{c,t} \sin \alpha_{c,t} \cos \beta_{c,t} + \hfill \\ \, \alpha_{t,se} r_{t,t} \sin \alpha_{t,t} \cos \beta_{t,t} - \beta_{c,se} r_{c,t} \cos \alpha_{c,t} \sin \beta_{c,t} + \beta_{t,se} r_{t,t} \cos \alpha_{t,t} \sin \beta_{t,t} \hfill \\ \end{gathered}$$
(30)
$$\begin{gathered} \sigma_{2}^{e1} = r_{c,se} \sin \alpha_{c,t} \cos \beta_{c,t} - r_{t,se} \sin \alpha_{t,t} \cos \beta_{t,t} + \alpha_{c,se} r_{c,t} \cos \alpha_{c,t} \cos \beta_{c,t} - \hfill \\ \, \alpha_{t,se} r_{t,t} \cos \alpha_{t,t} \cos \beta_{t,t} - \beta_{c,se} r_{c,t} \sin \alpha_{c,t} \sin \beta_{c,t} + \beta_{t,se} r_{t,t} \sin \alpha_{t,t} \sin \beta_{t,t} \hfill \\ \end{gathered}$$
(31)
$$\sigma_{3}^{e1} = r_{c,se} \sin \beta_{c,t} - r_{t,se} \sin \beta_{t,t} + \beta_{c,se} r_{c,t} \cos \beta_{c,t} - \beta_{t,se} r_{t,t} \cos \beta_{t,t}$$
(32)
$${\mathbf{e}}2 = \left[ {\begin{array}{*{20}c} {e_{21} } \\ {e_{22} } \\ {e_{23} } \\ \end{array} } \right]$$
(33)

Among them:

$$\begin{gathered} e_{21} = \sigma_{3}^{e2} {\mkern 1mu} \left( {\gamma_{rs} {\mkern 1mu} \left( {\sigma_{11}^{e2} - \sigma_{4}^{e2} } \right) - \varphi_{rs} {\mkern 1mu} \left( {\sigma_{8}^{e2} - \sigma_{7}^{e2} } \right) + \theta_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\mkern 1mu} {\text{cos}}\gamma_{t} {\text{sin}}\varphi_{t} } \right) - \hfill \\ \left( {\varphi_{rs} {\mkern 1mu} {\text{cos}}\varphi_{t} {\text{cos}}\theta_{t} - \theta_{rs} {\mkern 1mu} {\text{sin}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} } \right){\mkern 1mu} \sigma_{1}^{e2} - \sigma_{2}^{e2} \left( {\varphi_{rs} {\mkern 1mu} \left( {\sigma_{9}^{e2} - \sigma_{6}^{e2} } \right) + \gamma_{rs} {\mkern 1mu} \left( {\sigma_{10}^{e2} - \sigma_{5}^{e2} } \right) - \theta_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\text{sin}}\varphi_{t} {\mkern 1mu} {\text{sin}}\gamma_{t} } \right) \hfill \\ \end{gathered}$$
(34)
$$\begin{gathered} e_{22} = \sigma_{3}^{e2} {\mkern 1mu} \left( {\varphi_{rs} {\mkern 1mu} \left( {\sigma_{10}^{e2} + \sigma_{5}^{e2} } \right) + \gamma_{rs} {\mkern 1mu} \left( {\sigma_{9}^{e2} + \sigma_{6}^{e2} } \right) - \theta_{rs} {\text{cos}}\varphi_{t} {\text{cos}}\theta_{t} {\mkern 1mu} {\text{cos}}\gamma_{t} } \right) - \hfill \\ \left( {\varphi_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\mkern 1mu} {\text{sin}}\varphi_{t} + \theta_{rs} {\mkern 1mu} {\text{cos}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} } \right){\mkern 1mu} \sigma_{1}^{e2} - \sigma_{2}^{e2} {\mkern 1mu} \left( {\gamma_{rs} \left( {\sigma_{8}^{e2} + \sigma_{8}^{e2} } \right) - \varphi_{rs} \left( {\sigma_{11}^{e2} + \sigma_{4}^{e2} } \right) + \theta_{rs} {\mkern 1mu} {\text{cos}}\varphi_{t} {\mkern 1mu} {\text{cos}}\theta_{t} {\mkern 1mu} {\text{sin}}\gamma_{t} } \right) \hfill \\ \end{gathered}$$
(35)
$$e_{23} = \sigma_{2}^{e2} {\mkern 1mu} \left( {\gamma_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\text{cos}}\gamma_{t} - \theta_{rs} {\mkern 1mu} {\text{sin}}\theta_{t} {\text{sin}}\gamma_{t} } \right) - \left( {\theta_{rs} {\mkern 1mu} {\text{cos}}\gamma_{t} {\mkern 1mu} {\text{sin}}\theta_{t} + \gamma_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\mkern 1mu} {\text{sin}}\gamma_{t} } \right){\mkern 1mu} \sigma_{3}^{e2} + \theta_{rs} {\mkern 1mu} {\text{cos}}\theta_{t} {\mkern 1mu} \sigma_{1}^{e2}$$
(36)
$$\sigma_{1}^{e2} { = }r_{c,t} {\mkern 1mu} {\text{cos}}\beta_{c,t} {\mkern 1mu} {\text{sin}}\alpha_{c,t} - r_{t,t} {\mkern 1mu} {\text{cos}}\beta_{t,t} {\mkern 1mu} {\text{sin}}\alpha_{t,t}$$
(37)
$$\sigma_{2}^{e2} = r_{c,t} {\mkern 1mu} {\text{cos}}\beta_{c,t} {\mkern 1mu} {\text{cos}}\alpha_{c,t} - r_{t,t} {\mkern 1mu} {\text{cos}}\beta_{t,t} {\mkern 1mu} {\text{cos}}\alpha_{t,t}$$
(38)
$$\sigma_{3}^{e2} = r_{c,t} {\mkern 1mu} {\text{sin}}\beta_{c,t} {\mkern 1mu} - r_{t,t} {\mkern 1mu} {\text{sin}}\beta_{t,t} {\mkern 1mu}$$
(39)
$$\sigma_{4}^{e2} = {\text{sin}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} {\mkern 1mu} {\text{sin}}\gamma_{t}$$
(40)
$$\sigma_{5}^{e2} = {\text{sin}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} {\mkern 1mu} {\text{cos}}\gamma_{t}$$
(41)
$$\sigma_{6}^{e2} = {\text{cos}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} {\mkern 1mu} {\text{sin}}\gamma_{t}$$
(42)
$$\sigma_{7}^{e2} = {\text{cos}}\varphi_{t} {\mkern 1mu} {\text{sin}}\theta_{t} {\mkern 1mu} {\text{cos}}\gamma_{t}$$
(43)
$$\sigma_{8}^{e2} = {\text{sin}}\varphi_{t} {\mkern 1mu} {\mkern 1mu} {\text{sin}}\gamma_{t}$$
(44)
$$\sigma_{9}^{e2} = \sin \varphi_{t} {\mkern 1mu} \cos \gamma_{t}$$
(45)
$$\sigma_{10}^{e2} = {\text{cos}}\varphi_{t} {\mkern 1mu} {\text{sin}}\gamma_{t}$$
(46)
$$\sigma_{11}^{e2} {\text{ = cos}}\varphi_{t} {\mkern 1mu} {\text{cos}}\gamma_{t}$$
(47)

where \(\varphi_{t}\),\(\theta_{t}\),\(\gamma_{t}\) are true value of yaw ,pitch and roll; \(r_{c,t}\),\(\alpha_{c,t}\),\(\beta_{c,t}\) represent the true values of distance, azimuth and elevation angle of the airborne unmanned platform to the cooperative platform respectively; \(r_{c,se}\),\(\alpha_{c,se}\),\(\beta_{c,se}\) represent corresponding systematic errors of the airborne unmanned platform to the cooperative platform; \(r_{t,t}\),\(\alpha_{t,t}\),\(\beta_{t,t}\) represent the true values of distance, azimuth and elevation angle of the airborne unmanned platform to the maritime target respectively; \(r_{t,se}\),\(\alpha_{t,se}\),\(\beta_{t,se}\) represent corresponding systematic errors of the airborne unmanned platform to the maritime target.

Analysis of optimal observation configuration

Azimuth error is the main source of sensor error for long-range marine target detection. Therefore, this subsection focuses on azimuth systematic error to discuss optimal observation configurations for unmanned platforms.

Assuming the airborne unmanned platform flies at a fixed height, it can be regarded as moving in a two-dimensional plane. The azimuths of the marine target observed by both the airborne and maritime cooperative unmanned platforms serve as reference quantities for the relative positions of these platforms, and the target.

For convenience in discussion, we set the distance, elevation and attitude systematic errors in Eq. (28) to 0. Equation (28) can be simplified to

$${\mathbf{e}}_{\alpha } = \left[ {\begin{array}{*{20}c} {r_{c,t} {\mkern 1mu} \alpha_{c,se} {\mkern 1mu} \cos \beta_{c,t} {\text{sin}}\alpha_{c,t} - r_{t,t} \alpha_{t,se} \cos \beta_{t,t} {\text{sin}}\alpha_{t,t} } \\ {r_{t,t} \alpha_{t,se} \cos \beta_{t,t} {\text{cos}}\alpha_{t,t} - r_{c,t} \alpha_{c,se} {\mkern 1mu} \cos \beta_{c,t} {\text{cos}}\alpha_{c,t} } \\ 0 \\ \end{array} } \right]$$
(48)

The systematic error in the azimuths of the airborne unmanned platform observing the target and the maritime cooperative unmanned platform during a cycle is so small that it can be approximated as the same value for both. This value can be denoted as \(\alpha_{se}\) .

Solving the optimal observation configuration problem can be translated into minimizing the marine target position error. An objective function \(G_{\alpha }\) is constructed as follows

$$G_{\alpha } = \sqrt {\left( {r_{c,t} {\mkern 1mu} \alpha_{c,se} {\mkern 1mu} {\mkern 1mu} \cos \beta_{c,t} {\text{sin}}\alpha_{c,t} - r_{t,t} \alpha_{t,se} {\mkern 1mu} \cos \beta_{t,t} {\text{sin}}\alpha_{t,t} } \right)^{2} + \,\left( {r_{t,t} \alpha_{t,se} {\mkern 1mu} \cos \beta_{t,t} {\text{cos}}\alpha_{t,t} - r_{c,t} \alpha_{c,se} {\mkern 1mu} {\mkern 1mu} \cos \beta_{c,t} {\text{cos}}\alpha_{c,t} } \right)^{2} }$$
(49)

It can be simplified

$$G_{\alpha } = \sqrt {\alpha_{{_{se} }}^{2} \left( {r_{{_{c,t} }}^{2} \cos^{2} \beta_{c,t} + r_{{_{t,t} }}^{2} \cos^{2} \beta_{t,t} } \right){\mkern 1mu} - 2r_{t,t} r_{c,t} \alpha_{{_{se} }}^{2} \cos \beta_{c,t} \cos \beta_{t,t} \cos \left( {\alpha_{c,t} - \alpha_{t,t} } \right)}$$
(50)

Let \(r_{c,xyt} = r_{c,t} \cos \beta_{c,t}\) ,\(r_{t,xyt} = r_{t,t} \cos \beta_{t,t}\) ,\(\delta = \alpha_{c,t} - \alpha_{t,t}\) , assume \(r_{c,xyt} = \left| {c_{r} } \right|r_{t,xyt}\) , where \(c_{r}\) is the scale factor, and \(\left| {c_{r} } \right| < 1\) (for the safety of the detection platforms, it is assumed that the airborne unmanned platform is always closer to the maritime cooperative unmanned platform. Equation (50) is simplified

$$\begin{gathered} G_{\alpha } = \sqrt {\alpha_{{_{se} }}^{2} \left( {c_{r}^{2} r_{{_{t,xyt} }}^{2} + r_{{_{t,xyt} }}^{2} } \right){\mkern 1mu} - 2\alpha_{{_{se} }}^{2} \left| {c_{r} } \right|r_{{_{t,xyt} }}^{2} \cos \delta } \hfill \\ \, = \sqrt {\alpha_{{_{se} }}^{2} r_{{_{t,xyt} }}^{2} \left( {c_{r}^{2} - 2\cos \delta \left| {c_{r} } \right| + 1} \right){\mkern 1mu} } \hfill \\ \end{gathered}$$
(51)

From Eq. (51), we can conclude that there are two factors affecting the observation position \(c_{r}\) and \(\delta\) .\(c_{r}\) denotes the distance position relationship between the airborne unmanned platform, maritime cooperative unmanned platform and target.\(\delta\) denotes the angle position relationship between the airborne unmanned platform and maritime cooperative unmanned platform.

A two-stage analysis is used for the above two factors, with \(c_{r}\) considered in the first stage and \(\delta\) in the second stage.

The phase1 begins with the analysis of \(c_{r}\) .

Equation (51) is rewritten as

$$G_{\alpha } = \left\{ {\begin{array}{*{20}c} {\sqrt {\alpha_{{_{se} }}^{2} r_{{_{t,xyt} }}^{2} \left( {c_{r}^{2} + 2\cos \delta c_{r} + 1} \right){\mkern 1mu} } } & {c_{r} < 0} \\ {\sqrt {\alpha_{{_{se} }}^{2} r_{{_{t,xyt} }}^{2} \left( {c_{r}^{2} - 2\cos \delta c_{r} + 1} \right){\mkern 1mu} } } & {c_{r} \ge 0} \\ \end{array} } \right.$$
(52)

It is easy to get the minimum value of \(G_{\alpha }\) when \(c_{r} = - \cos \delta {\text{ or }}\cos \delta\)\(G_{{_{\alpha \min } }}^{{c_{r} }}\)

$$G_{{_{\alpha \min } }}^{{c_{r} }} = \sqrt {\alpha_{{_{se} }}^{2} r_{{_{t,xyt} }}^{2} \left( {1 - \cos^{2} \delta } \right){\mkern 1mu} }$$
(53)

In phase2, we analyze \(\delta\) .

It is easy to obtain that when \(\cos \delta {\text{ = 1or}} - {1}\) ,\(G_{{_{\alpha \min } }}^{{c_{r} }}\) obtains the minimum value \(G_{{_{\alpha \min } }}^{\delta }\) .

When \(\cos \delta {\text{ = 1or}} - {1}\) is \(\delta {\text{ = 0or}}\pi\) , the angular positional relationship is such that the airborne unmanned platform, maritime cooperative unmanned platform, and target are in a straight line.

When \(\cos \delta {\text{ = 1or}} - {1}\) is \(r_{c,xyt} = r_{t,xyt}\) , the distance positional relationship is that the horizontal projection from the airborne unmanned platform to the maritime cooperative unmanned platform is equal to the horizontal projection from the airborne unmanned platform to the marine target.

Among the above positional relationships, the angular positional relationship is easy to satisfy and should be satisfied as the primary position, and the distance positional relationship should be considered according to the actual observation environment. Therefore, the angular positional relationship is a primary consideration for trajectory optimization.

Objective function based on receding horizon optimization

Receding horizon optimization is an optimization method for dealing with dynamic systems. Based on the receding horizon optimization and the error analysis in 3.1, the performance index of unmanned platforms cooperatively in observing marine targets at moment k is obtained as:

$$I_{o,k} = \sum\limits_{m = k}^{{k + N_{p} }} {\left\| {{\mathbf{e}}_{m} } \right\|}$$
(54)

where \(N_{p}\) is the number of predicted steps and \({\mathbf{e}}_{m}\) is the tracking error of the unmanned platform cooperative observation at time \(m\) .

The hazardous area indicator \(J_{h}\) measures the ability of the airborne unmanned platform to avoid the hazardous area, and the value of \(J_{h}\) is inversely related to the distance from the hazardous area, i.e. The larger \(J_{h}\) is, the closer the airborne unmanned platform is to the obstacle, and the smaller \(J_{h}\) is, the further the airborne unmanned platform is from the obstacle. In order to make the airborne unmanned platform avoid the hazardous area, combined with the hazardous area model, we can get the hazardous area indicator at time k is:

$$J_{h,k} = \sum\limits_{n = k}^{{k + N_{p} - 1}} {O_{n} }$$
(55)

Among them:

$$O_{n} = \left\{ \begin{gathered} \sum\limits_{m = 1}^{M} {\left( {\frac{{R_{a}^{2} - \left\| {{{\varvec{\upchi}}}_{n} - {\mathbf{P}}_{m} } \right\|_{2}^{2} }}{{\left\| {{{\varvec{\upchi}}}_{n} - {\mathbf{P}}_{m} } \right\|_{2}^{2} - R_{c}^{2} }}} \right)^{2} } ,R_{c} < \left\| {{{\varvec{\upchi}}}_{n} - {\mathbf{P}}_{m} } \right\| \le R_{a} \hfill \\ {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} 0{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} ,{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} \left\| {{{\varvec{\upchi}}}_{n} - {\mathbf{P}}_{m} } \right\| \ge R_{a} \hfill \\ \end{gathered} \right.$$
(56)

where:\({\mathbf{P}}_{m}\) denotes the location of the \(m{\text{th}}\) obstacle observed, and \(M\) denotes the number of obstacles observed.

Therefore, when considering the hazardous area, the optimization problem of cooperative marine target observation by multiple unmanned platforms can be transformed into a nonlinear model optimal control problem of minimizing the objective function under constraints, and the mathematical model is as follows:

$$\begin{gathered} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} \hfill \\ \left\{ \begin{gathered} {\kern 1pt} \, \mathop {\min }\limits_{{{\varvec{u}}_{k}^{*} }} I_{o,k} + I_{h,k} \hfill \\ s.t.{\kern 1pt} {\kern 1pt} {\varvec{x}}_{i,k + 1} - f_{d} ({\varvec{x}}_{i,k} {\mathbf{,}}{\varvec{u}}_{i,k} ) = 0 \hfill \\ \, L_{v} ({\varvec{u}}_{i,k} ){\kern 1pt} = \frac{{\left| {u_{i,vk} - v_{0} } \right| - v_{\max } }}{{v_{\max } }} \le 0 \hfill \\ \, L_{\omega } ({\varvec{u}}_{i,k} ){\kern 1pt} = {\kern 1pt} \frac{{{\kern 1pt} \left| {u_{i,\omega k} } \right| - \omega_{\max } }}{{\omega_{\max } }} \le 0 \hfill \\ \end{gathered} \right. \hfill \\ \end{gathered}$$
(57)

where:\({\varvec{u}}_{k}\) is the control input variable of the airborne unmanned platform at time \(k\).\(v_{0}\) is the cruising speed of the airborne unmanned platform. \(v_{\max }\) is the maximum speed value.\(\omega_{\max }\) is the maximum value of the angular velocity of the heading.

Optimal control of the airborne unmanned platform based on AEDBO

Adaptive enhanced dung beetle optimization

Improved population initialization

DBO employs low-randomness, high-uncertainty initialization, which often results in uneven initial populations with poor diversity and narrow search ranges. To ensure the convergence accuracy and speed of the algorithm, it is necessary to make the population distribution more uniform and increase population diversity during initialization. We propose an improved adaptive Tent chaotic mapping to achieve more uniform distributions and enhanced diversity.

The original Tent chaotic mapping expression is:

$$xtm_{i + 1} = f_{Tent} \left( {xtm_{i} } \right) = \left\{ {\begin{array}{*{20}c} {xtm_{i} /\kappa } & {0 < xtm_{i} < \kappa } \\ {\left( {1 - xtm_{i} } \right)/\left( {1 - \kappa } \right)} & {\kappa \le xtm_{i} \le 1} \\ \end{array} } \right.$$
(58)

An improved version is:

$$xtm_{i + 1} = f_{Tent} \left( {xtm_{i} } \right) = \left\{ {\begin{array}{*{20}c} {1 - \zeta * rand\left( {0,1} \right) + rand\left( {0,1} \right)/N} & {xtm_{i} = 0} \\ {1 - xtm_{i} /\kappa + rand\left( {0,1} \right)/N} & {0 < xtm_{i} < \kappa } \\ {\left( {1 - xtm_{i} } \right)/\left( {1 - \kappa } \right) + rand\left( {0,1} \right)/N} & {\kappa \le xtm_{i} \le 1} \\ \end{array} } \right.$$
(59)

where:\(\zeta \in \left( {0,1} \right]\) is the adjustment factor.\(\kappa \in \left( {0,2} \right]\) is the chaos factor. This paper sets \(\zeta\) to 1and \(\kappa\) to 0.5.

Mapped populations are initialized as

$$x_{i,j} = lb + \left( {ub - lb} \right) \times xtm_{i}$$
(60)

where:\(lb\) and \(ub\) denote lower and upper bound of search space respectively.

Figure 3 shows frequency distributions of different Tent chaotic sequences. Tent chaotic mapping 1 is from Ref.44 The Tent chaotic mapping 2 is from Ref.45 . As seen in Fig. 3, the improved chaotic mapping in this paper generates more uniform initial population distributions versus other common improved Tent chaotic mappings, enhancing initial search ability and the likelihood of obtaining the global optimum.

Fig. 3
figure 3

Frequency distribution of different Tent chaotic sequences. (a) Tent chaotic mapping1, (b) Tent chaotic mapping2, (c) Improved Tent chaotic mapping.

Moreover, algorithm development can entail significant meaningless search from chaotic inverse learning, increasing computational cost and impeding convergence. To address this, we introduce center-of-mass adversarial inverse learning to enhance initial population quality using fitness values.

The center of mass \({\mathbf{C}}\) is defined as:

$${\mathbf{C}} = \frac{{{\mathbf{X}}_{1} + {\mathbf{X}}_{2} + \cdots + {\mathbf{X}}_{n} }}{n}$$
(61)

where:

$$M_{j} = \frac{{\sum\limits_{j = 1}^{D} {x_{i,j} } }}{n},i = 1,2, \cdots ,n$$
(62)

The center of gravity reversal point \({\mathbf{\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{X} }}_{i}\) is defined as

$${\mathbf{\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{X} }}_{i} = 2 \times {\mathbf{C}} - {\mathbf{X}}_{i} ,i = 1,2, \cdots ,n$$
(63)

The reversal point lies in a dynamic boundary search space, denoted \(x_{i,j} \in \left[ {a_{j} ,b_{j} } \right]\) . A change in the dynamic boundary places the reversal point in a shrinking and constantly-changing space. The dynamic boundary expression is:

$$a_{j} = \min \left( {x_{i,j} } \right)$$
(64)
$$b_{j} = \max \left( {x_{i,j} } \right)$$
(65)

If the reversal point falls outside the dynamic boundary range, it is recalculated with the expression:

$$\overline{x}_{i,j} = \left\{ {\begin{array}{*{20}c} {a_{j} + rand\left( {0,1} \right) \times \left( {M_{j} - a_{j} } \right)} & {\overline{x}_{i,j} < a_{j} } \\ {M_{j} + rand\left( {0,1} \right) \times \left( {b_{j} - M_{j} } \right)} & {\overline{x}_{i,j} > b_{j} } \\ \end{array} } \right.$$
(66)

Adaptive convergence factor

The original convergence factor decreases linearly, and the area of the optimal spawning region decreases gradually, so the algorithm focuses on global search in the early stage and local search in the later stage. To enhance the search ability of the algorithm, a new adaptive convergence factor is proposed. When \(0 \le t \le T_{\max } /2\), \(R\) decreases first slowly and then quickly, and the optimal spawning area decreases slowly, allowing the dung beetle to explore more unknown areas and avoid falling into the local optimum, improving the algorithm’s search ability in the early stage; when \(T_{\max } /2 \le t \le T_{\max }\), \(R\) decreases first quickly and then slowly, and the optimal spawning decreases fast, forcing the dung beetle to conduct a fine search in the vicinity of the current optimal solution to increase solution precision, improving the algorithm’s local search ability in the later stage. The algorithm can improve the local search ability in the later stage. In summary, new adaptive convergence factor improves the global search ability of the algorithm in the early stage and the local search ability in the late stage, thus enhancing the algorithm’s ability to obtain the optimal solution and convergence speed. Comparison of convergence factor is shown in Fig. 4. The expression of the new adaptive convergence factor is:

$$R = \left\{ {\begin{array}{*{20}c} {1 - \frac{{e^{{2t/T_{\max } }} - 1}}{{2\left( {e - 1} \right)}}} & {0 \le t \le \frac{{T_{\max } }}{2}} \\ {\frac{{e^{{2\left( {1 - t/T_{\max } } \right)}} - 1}}{{2\left( {e - 1} \right)}}} & {\frac{{T_{\max } }}{2} \le t \le T_{\max } } \\ \end{array} } \right.$$
(67)
Fig. 4
figure 4

Comparison of convergence factor.

Adaptive nonlinear rolling dung beetle quantity decreasing model

The number of ball-rolling dung beetles in DBO was kept constant. Ball-rolling dung beetles provide global search capability to the algorithm. To further enhance the algorithm’s search capability in the early stage, an adaptive nonlinear ball-rolling dung beetle number decreasing model is designed. This model increases the proportion of ball-rolling dung beetles early, then gradually decreases it over iterations. The expression for rolling dung beetle quantity is:

$$P = \left\{ {\begin{array}{*{20}c} {P_{1} + P_{2} *\left[ {\cos \left( {\frac{t}{{T_{\max } }}\pi } \right)} \right]^{q} } & {0 \le t \le \frac{{T_{\max } }}{2}} \\ {P_{1} - P_{2} *\left[ { - \cos \left( {\frac{t}{{T_{\max } }}\pi } \right)} \right]^{q} } & {\frac{{T_{\max } }}{2} \le t \le T_{\max } } \\ \end{array} } \right.$$
(68)
$$P_{1} = \frac{{P_{initial} + P_{end} }}{2}$$
(69)
$$P_{2} = \frac{{P_{initial} - P_{end} }}{2}$$
(70)

where:\(P_{initial}\) is the initial rolling dung beetle population proportion. \(P_{end}\) is the final rolling dung beetle population proportion. \(q \in \left( {0,1} \right]\) is the decay index. In this paper,\(P_{initial}\) is taken as 0.4,\(P_{end}\) is taken as 0.2and \(q\) is taken as 1.

The proportion of the number of dung beetles stolen is

$$P_{t} = 0.6 - P$$
(71)

Theft dung beetles contribute to the algorithm’s local search capability. As iterations proceed, the number of theft dung beetles increases, enhancing algorithm’s local search capability.

When \(0 \le t \le T_{\max } /2\), the number of rolling dung beetles is kept higher in the early stages to enhance global search; when \(T_{\max } /2 \le t \le T_{\max }\), the number of rolling dung beetles is reduced in the later stages and the number of stealing dung beetles is increased, freeing up resources to be used to improve localized search.

Adaptive cauchy-gaussian mutation strategy

In the later stage of DBO, dung beetles gradually move closer to optimal individuals, potentially resulting in lack of population diversity and premature convergence. To address this, a Cauchy-Gaussian mutation strategy is introduced. The Cauchy distribution has heavy-tailed characteristics and is suitable for long-distance jumps, dominating global exploration in the early stage; the Gaussian distribution is suitable for local fine-tuning, promoting local development in the late stage. The algorithm selects the current best individual for mutation, compares positions before and after mutation, and selects the better position for the next iteration. The Cauchy-Gaussian mutation strategy is mathematically defined as:

$$U_{best}^{t + 1} = X_{best}^{t} \left[ {1 + \lambda_{1} Cauchy\left( {0,\sigma^{2} } \right) + \lambda_{2} Gauss\left( {0,\sigma^{2} } \right)} \right]$$
(72)
$$\sigma = \left\{ {\begin{array}{*{20}c} 1 & {f\left( {X_{best} } \right) < f\left( {X_{i} } \right)} \\ {\exp \left( {\frac{{f\left( {X_{best} } \right) - f\left( {X_{i} } \right)}}{{\left| {f\left( {X_{best} } \right)} \right|}}} \right)} & {otherwise} \\ \end{array} } \right.$$
(73)
$$\lambda_{{1}} { = 1 - }\frac{{t^{2} }}{{T_{\max }^{2} }}$$
(74)
$$\lambda_{2} { = }\frac{{t^{2} }}{{T_{\max }^{2} }}$$
(75)

where:\(X_{best}\) is the position of the elite individual.\(U_{best}^{t + 1}\) is the position of the elite individual after mutation.\(\sigma\) is the standard deviation of the Cauchy-Gaussian mutation strategy. \(Cauchy\left( {0,\sigma^{2} } \right)\) is a random variable satisfying the Cauchy distribution.\(Gauss\left( {0,\sigma^{2} } \right)\) is a random variable satisfying the Gaussian distribution. \(\lambda_{1}\) and \(\lambda_{2}\) are dynamic parameters adaptively adjusted with iterations. Initially,\(\lambda_{1}\) is larger so the algorithm can explore the optimal solution over a larger range with larger mutation steps, and \(\lambda_{2}\) is smaller so the algorithm can search near the optimal solution. During the search,\(\lambda_{1}\) gradually decreases and \(\lambda_{2}\) increases.

AEDBO framework is shown Algorithm.

Algorithm
figure a

AEDBO framework.

Computational complexity analysis

It is assumed that \(N\),\(D\),\(Iter\) and represent the population size, optimization problem dimension and number of iterations respectively. The time complexity of DBO is \({\text{O}} \left( {N*D*Iter} \right)\) and the space complexity is \({\text{O}} \left( {N*D} \right)\).

Analyze the time complexity of AEDBO. AEDBO adopts the improved Tent chaotic mapping in the population initialization stage to generate a chaotic sequence with a time complexity of \({\text{O}} \left( {N*D} \right)\), the time complexity of the center-of-mass adversarial inverse learning strategy to compute the center-of-mass and generate the adversarial solution is \({\text{O}} \left( {N*D} \right)\), and the total time complexity of initialization is \({\text{O}} \left( {N*D} \right)\). In the single iteration stage, the adaptive convergence factor changes the calculation method of the convergence factor without introducing additional loop commands, which does not affect the computational complexity; the adaptive nonlinear rolling ball dung beetle number decreasing model only changes the proportion of different species of dung beetles, not the total number of dung beetles, and the computational time complexity of a single position update for the whole population is \({\text{O}} \left( {N*D} \right)\); the introduction of the adaptive Cauchy-Gaussian mutation strategy increases the computation of the elite individuals, and the time complexity is \({\text{O}} \left( D \right)\), and the single iteration time complexity is \({\text{O}} \left( {N*D} \right)\). In summary, the total time complexity is \(O\left( {N*D*Iter} \right)\).

Analyze the space complexity of AEDBO. When storing the population, the original population stores \(N*D\) parameters; different dung beetles in the population generate temporary solutions during iteration, requiring \({\text{O}} \left( {N*D} \right)\) space for temporary storage, and the total population space is \({\text{O}} \left( {N*D} \right)\). When calculating intermediate variables, \({\text{O}} \left( 1 \right)\) space is needed to store the \(D\) dimensional center-of-mass vector, the fitness values of \(N\) individual dung beetles, and other parameters such as the convergence and scaling, and the total intermediate variable space is . The adaptive Cauchy-Gaussian mutation strategy increases elite individual’s positions and fitness, requiring \({\text{O}} \left( {N + D} \right)\) space. In summary, the total space complexity is \({\text{O}} \left( {N*D} \right)\).

The above analysis reveals that AEDBO and DBO have the same computational time complexity and space complexity, indicating that their superior performance does not depend on higher complexity, proving their feasibility in practical applications.

Optimal control of the airborne unmanned platform Based on AEDBO

Based on the above analysis and improvements, AEDBO is utilized to solve the optimal control problem of an airborne unmanned platform. Specific solution steps are as follows:

Step 1: Determine the initial parameters, including the unmanned platform system parameters (initial position of the airborne unmanned platform, initial position of the maritime unmanned platform), AEDBO parameters (population number, maximum number of iterations, etc.), and error reduction algorithm parameters;

Step 2: Real-time generation of observation information of the airborne unmanned platform on the maritime cooperative unmanned platform and target, and use AEDBO to obtain the optimal control sequence at the current moment;

Step 3: Control the movement of the airborne unmanned platform using the obtained control quantity to obtain the position of the airborne unmanned platform at the next moment;

Step 4: Repeat step 2 and 3 to obtain the optimal motion trajectory of the airborne unmanned platform.

Figure 5 shows the flowchart of optimal control of an airborne unmanned platform based on AEDBO.

Fig. 5
figure 5

Optimal control of the airborne unmanned platform based on AEDBO.

Simulation results and discussions

Performance analysis of AEDBO

The performance of AEDBO is first analyzed, including comparisons with other swarm intelligence algorithms and ablation experiments.

Comparison of AEDBO with Other swarm intelligence optimization algorithms

To evaluate AEDBO performance, we test it on 15 benchmark function sets and compare it with 7 other swarm intelligence optimization algorithms. The 15 benchmark function sets are shown in Table 1. \(f_{1} (x)\) ~ \(f_{4} (x)\) are single-peak test functions for assessing algorithmic optimization performance; \(f_{5} (x)\) ~ \(f_{7} (x)\) are multi-peak test functions for evaluating local optimum avoidance ability; \(f_{8} (x)\) ~ \(f_{15} (x)\) are fixed-dimension multi-peak test functions for assessing local optimum avoidance and high-dimension problem handling performance. The 7 swarm intelligence optimization algorithms include both classical and contemporary algorithms, specifically PSO, FOA, GWO, WOA, HHO, SSA and DBO to provide a comprehensive performance comparison. Evaluation metrics include optimal value, mean value and standard deviation of calculation results. To ensure the experimental validity, the initial population size of all algorithms is set to 30, and maximum number of iterations is 500.To eliminate experimental chance, each algorithm is run independently on test functions for 100 times.

Table 1 Benchmark function.

Figure 6 shows the convergence curves of AEDBO with other 7 swarm intelligence algorithms on 15 test functions. Table 2 shows the metrics statistics of AEDBO and other 7 swarm intelligence algorithms on 15 test functions. Figure 7 shows the performance ranking of different algorithms in comparison experiments on the test functions.

Fig. 6
figure 6

Convergence curves for different test functions.

Table 2 Statistical results of different algorithms.
Fig. 7
figure 7

Performance comparison ranking.

The following analysis can be made from the experimental results:

  • (1) The experimental results on the single-peak test functions ~ show that both AEDBO have better global optimality seeking ability and robustness. In the early stage, the ability of optimization seeking of AEDBO is slightly weaker than that of HHO and SSA, but in the middle and late stage, both can obtain optimal performance result. The experimental results verify the stronger global exploration ability of AEDBO compared to other algorithms.

  • (2) Experimental results on the multi-peak test functions ~ show that AEDBO has better performance compared to the other seven algorithms. In \(f_{5} (x)\) and \(f_{6} (x)\) , AEDBO converges to the optimum in the fastest speed, and the convergence speed on \(f_{7} (x)\) is slightly behind SSA. The results show that AEDBO performs better in terms of optimization seeking accuracy and robustness. It can converge to the global optimum quickly and effectively avoiding the problem of local optimality.

  • (3) The experimental results on the fixed dimensional multi-peaked test functions \(f_{8} (x)\) ~ \(f_{15} (x)\) show that AEDBO has better performance in dealing with high dimensional problems. Except for the lagging convergence speed on \(f_{10} (x)\) , AEDBO has a faster convergence speed on the other test functions. Except for the overall performance on \(f_{11} (x)\) which is not as good as SSA, AEDBO has better performance, and especially on \(f_{13} (x)\) , \(f_{14} (x)\),\(f_{15} (x)\) , AEDBO has much better performance than other algorithms. The results show that search capability of AEDBO is greatly improved.

Ablation experimental analysis.

To further validate the algorithm’s effectiveness, ablation experiments were conducted on AEDBO. There are four variants in the ablation experiments, DBO1, DBO2, DBO3, and DBO4. DBO1 combines improved Tent chaotic mapping and center-of-mass inverse learning with DBO. DBO2 combines adaptive inertia weight factors with DBO. DBO3 combines population adaptive scaling with DBO. DBO4 combines adaptive Cauchy-Gaussian mutation with DBO. The rest of the settings are the same as in Section "Comparison of AEDBO with Other swarm intelligence optimization algorithms".

Figure 8 shows the convergence curves of AEDBO with the 4 algorithm variants on 15 test functions. Table 3 shows the metrics statistics of AEDBO and the 4 algorithm variants on 15 test functions. Figure 9 shows the performance ranking of different algorithms in ablation experiments on the test functions.

Fig. 8
figure 8figure 8figure 8

Convergence curves of different test functions in ablation experiments.

Table 3 Statistical results in ablation experiments.
Fig. 9
figure 9

Ranking of ablation experiments.

Based on the experimental results the following analysis can be performed:

  • (1) The results of the ablation experiments on the single-peak test functions \(f_{1} (x)\) ~ \(f_{4} (x)\) show that AEDBO improves the algorithm’s global optimization-seeking ability, and the population adaptive ratio has the greatest impact on the algorithm’s global optimization-seeking ability. In the pre-iteration period of \(f_{1} (x)\) , DBO3 and AEDBO have similar performance, and in the pre-iteration period of \(f_{4} (x)\) , AEDBO is slightly weaker than DBO3, but in the late period of the iteration, AEDBO performs better than DBO3 on all test functions. The experimental results verify the improvement of the population adaptive scaling strategy on the optimization seeking ability of DBO.

  • (2) The experimental results on multi-peak test functions \(f_{5} (x)\) ~ \(f_{7} (x)\) show that AEDBO effectively solves the problem that DBO easily falls into local optimality. In \(f_{5} (x)\) , DBO, DBO3 and DBO4 all fall into the local optimum, and DBO1 and AEDBO can quickly jump out of the local optimum and converge to the optimum. In \(f_{6} (x)\) and \(f_{7} (x)\) , DBO3 has similar performance to AEDBO. The results show that the population initialization method and the population adaptive scaling strategy based on improved Tent chaotic mapping and center-of-mass inverse learning improve the ability of DBO to avoid local optimum, followed by adaptive inertia weighting.

  • (3) The experimental results on fixed-dimensional multi-peak test functions \(f_{8} (x)\) ~ \(f_{15} (x)\) show that AEDBO improves the ability of DBO to handle high-dimensional problems. On \(f_{8} (x)\) ,\(f_{9} (x)\) ,\(f_{12} (x)\) ,\(f_{13} (x)\) , and \(f_{15} (x)\) , the optimization seeking ability of DBO1 is comparable to that of AEDBO. On \(f_{10} (x)\) , DBO3 and DBO4 have better optimization accuracy than AEDBO in the early stage. The results show that AEDBO’s ability to handle high dimensions is enhanced.

  • (4) The experimental results show that in most of the test functions, each strategy improves the algorithm’s optimization accuracy. Only in \(f_{4} (x)\) and \(f_{6} (x)\), the population initialization problem resulted in a decrease in the accuracy of the algorithm.

  • (5) Except \(f_{4}\) , DBO1 outperforms DBO in all test functions, and the results show that the population initialization strategy incorporating improved Tent chaos mapping and center-of-mass adversarial inverse learning can effectively improve the performance of the algorithm; Except \(f_{9}\)\(f_{12}\)\(f_{13}\), DBO2 outperforms DBO in all test functions, and the results show that the proposed adaptive convergence factor can effectively improve the performance of the algorithm; Except \(f_{5}\)\(f_{11}\)\(f_{12}\)\(f_{15}\), DBO3 outperforms DBO, and the results show that the proposed adaptive nonlinear rolling dung beetle decreasing number model can effectively improve the performance of the algorithm; Except \(f_{6}\)\(f_{11}\)\(f_{12}\)\(f_{13}\), DBO4 outperforms DBO in the test functions, and the results show that the proposed adaptive Cauchy-Gaussian mutation strategy can effectively improve the performance of the algorithm.

Optimal observation trajectory of the airborne unmanned platform

Non-hazardous area

In this part, AEDBO is used to realize the control of the airborne unmanned platform to obtain the optimal flight trajectory. Table 4 shows the relevant parameters set of the unmanned platform during the simulation experiment.

Table 4 Parameters related to unmanned platforms.

Two typical postures are selected in a no-hazardous-area environment: Posture 1- the maritime cooperative and airborne unmanned platform and the target are on the same side, with the airborne initial position at [-5000,-10,000,3500]m; Posture 2 -the maritime cooperative and airborne unmanned platform and target are oppositely positioned, with the airborne initial position at [-5000,10,000,3500]m.

Figure 10 shows airborne unmanned platform trajectory optimization under Posture 1, including the overall posture, line-of-sight angle change, tracking error, and state vector / control volume change of the airborne unmanned platform under Posture 1, respectively. Figure 11 shows airborne unmanned platform trajectory optimization under Posture 2, including the overall posture, line-of-sight angle change, tracking error, and state vector/control volume change of the airborne unmanned platform under Posture 2, respectively.

Fig. 10
figure 10

Simulation result of the airborne unmanned platform optimization trajectory in posture 1, (a) Overall posture, (b)Line-of-sight angle, (c) Tracking error, (d) State vector and control.

Fig. 11
figure 11

Simulation result of the airborne unmanned platform trajectory optimization in posture 2. (a) Overall posture, (b) Line-of-sight angle, (c) Tracking error, (d) State vector and control.

From Fig. 10, the airborne unmanned platform quickly flies rearward of the maritime cooperative unmanned platform at maximum speed, with the line-of-sight angle gradually approaching 0 aligning the three on a horizontal plane. It maintains this observation configuration; thereafter, forming an optimal observation configuration around 77s; Compared to the initial tracking error of 50.51m, the error is 12.29m by 77s improving accuracy over 4 times compared to stationary observation. From Fig. 11, the airborne unmanned platform flies towards the marine target at maximum speed, with the line-of-sight angle gradually approaching 180 aligning the three on a horizontal plane. It maintains this observation configuration; thereafter, forming an optimal observation configuration around 200s. Compared to the initial tracking error of 36.88m, the error is 3.58m by 200s improving accuracy over 10 times compared to stationary observation.

From Fig. 10 and Fig. 11, the marine target tracking error decreases rapidly during observation configuration formation, and is smaller and slower after formation proving suitable observation configuration directly affects tracking accuracy. The above results show AEDBO has good trajectory optimization ability and optimal observation configuration formation effectively improves tracking accuracy.

Hazardous area

Given the airborne unmanned platform may be impacted by hazardous areas during flight, this section simulates and analyzes the airborne unmanned platform in environments with hazardous areas. Two typical postures are selected in the environment with hazardous area: Posture 3- the maritime cooperative unmanned platform and target are on the same side of the unmanned platform and there is a hazardous area from 1 to 4. The initial position of the unmanned platform is [-5000,-10,000,3500]m; Posture 4- the maritime cooperative unmanned platform and target are on the opposite side of the unmanned platform and there is a hazardous area from 5 to 8. The initial position of the unmanned platform is [ -5000,10,000,3500]m.

Hazardous area 1 is an ellipse with center point at (-2700m, -12000m), and horizontal radius is 400m, vertical radius is 500m; Hazardous area 2 is an ellipse with center point at (-4000m, -14000m), and horizontal radius is 1000m, vertical radius is 1200m; Hazardous area 3 is a rectangular area with side lengths of 1000m and 2,000m rectangular area, the lower right point is located at (500m, -15,000m); Hazardous area 4 is an elliptical area, the center point is located at (0m, -20,000m), the horizontal radius is 400m, and the vertical radius is 500m; Hazardous area 5 is an elliptical area, the center point is located at (-4,000m, 13,000m), the horizontal radius is 400m, the vertical radius is 500m; Hazardous area 6 is an elliptical area, the center point is located at (-3000m, 16000m), the horizontal radius is 1000m, and the vertical radius is 1200m; Hazardous area 7 is a rectangular area with the side lengths of 1000m and 2000m respectively, and the lower right point is located at (0m, 14000m); Hazardous area 8 is an elliptical area with center point at (0m, 21000m), and horizontal radius is 400m, vertical radius is 500m. The marine target is moving in the north direction with a speed of 5m/s, and the other simulation parameters are the same as in 5.2.1.

Figure 12 shows the simulation result of airborne unmanned platform trajectory optimization under Posture 3, including the overall posture diagram, line-of-sight angle change diagram, tracking error diagram, and the change process of state vector and control quantities of the airborne unmanned platform respectively. Figure 13 shows the simulation result of the airborne unmanned platform trajectory optimization under posture 4, including the overall posture diagram, the line-of-sight angle change diagram, the tracking error diagram, and the state vector and control volume change process of the airborne unmanned platform trajectory optimization under posture 4 respectively.

Fig. 12
figure 12

Simulation result of the airborne unmanned platform trajectory optimization in posture 3. (a) Overall posture, (b) Line-of-sight angle, (c) Tracking error, (d) State vector and control.

Fig. 13
figure 13

Simulation result of the airborne unmanned platform trajectory optimization in posture 4. (a) Overall posture, (b) Line-of-sight angle, (c) Tracking error, (d) State vector and control.

From Fig. 12, it can be seen that the airborne unmanned platform can effectively avoid hazardous area and still form the optimal observation configuration under the influence of hazardous area. In the influence of hazardous areas in 21 ~ 25s, 53 ~ 57s, 102 ~ 153s, the line-of-sight angle of the airborne unmanned platform tends to slow down towards 0, and even increases in the opposite direction in 102 ~ 153s, and the reduction speed of the tracking error slows down accordingly, but the overall still shows a decreasing trend, because in the process of obstacle avoidance, the tracking error has a smaller influence in the objective function, and the airborne unmanned platform makes obstacle avoidance as the main objective. From Fig. 13, it can be seen that the airborne unmanned platform can effectively avoid the hazardous area and still form the optimal observation configuration in the influence of hazardous area. In the influence of hazardous area, in 33 ~ 45s and 108 ~ 121s, the line-of-sight angle of the airborne unmanned platform tends to slow down towards 180, and even increases in the opposite direction in 108 ~ 121s, and the reduction of tracking error slows down accordingly, but the overall still shows a decreasing trend, because in the process of obstacle avoidance, the tracking error has a smaller influence in the objective function, and the airborne unmanned platform takes obstacle avoidance as the primary objective.

Overall, after considering the hazardous area, the airborne unmanned platform still forms the optimal observation configuration, and the tracking error keeps decreasing, proving the effectiveness of the algorithms in complex environments.

Practical experiment

We developed a realistic scenario to collect data for verifying the practical application value. Due to the high cost of UAV radar observation, we chose inertial guidance plus radar to simulate UAV observation Two small boats served as the cooperation platform and the target moving on the lake, respectively. The high-precision positions of the observation platform and small boats were acquired by RTK measurements, as shown in Fig. 14 The processed data results are shown in Figs. 15, 16, 17.

Fig. 14
figure 14

Schematic diagram of the practical experiment.

Fig. 15
figure 15

Data 1 result. (a) Line-of-sight angle, (b) Tracking error.

Fig. 16
figure 16

Data 2 result. (a) Line-of-sight angle, (b) Tracking error.

Fig. 17
figure 17

Data 3 result. (a) Line-of-sight angle, (b) Tracking error.

The line-of-sight angle changes of the three datasets range from 0 to 90°, and the overall trend of line-of-sight angle changes is consistent with the trend of tracking error, which is manifested in the fact that the closer to 0° the smaller the tracking error is, and the closer to 90° the larger the tracking error is. However, there are certain fluctuations present, which are caused by external factors such as environmental conditions and the hardware conditions of the equipment. In summary, the conclusion of the practical experiment is consistent with the conclusion of the simulation experiment.

Conclusion and future work

This paper takes air unmanned platform and maritime unmanned cooperative platform synergy as the research body to improve target tracking accuracy. It analyzes the optimal observation configuration based on error reduction algorithm and proposes AEDBO for optimal control of airborne unmanned platform. The following conclusions are drawn:

  • (1) Error analysis based on error reduction shows the tracking error mainly consists of three parts: sensor error, attitude error, and sensor and attitude coupling error. The latter is small compared to the first two errors, and can be ignored with limited computational resource.

  • (2) Optimal observation configuration considering azimuthal systematic error as the main error indicates all three platforms- the air unmanned platform, maritime unmanned cooperative platform, and target should be in a horizontal straight line. This means the observation line-of-sight angle of the airborne unmanned platform is 0 or \(\pi\) .

  • (3) AEDBO is proposed for control of unmanned airborne platform. We propose a population initialization method integrating improved Tent chaos mapping and center-of-mass contrastive inverse learning, an adaptive convergence factor, an adaptive nonlinear ball-rolling dung beetle number decreasing model, and an adaptive Cauchy-Gaussian mutation strategy to enhance DBO. This allows realization of the optimal flight trajectory of the airborne unmanned platform. Comparison with other swarm intelligence algorithms and ablation experiments prove the effectiveness of each improvement method and the superiority of algorithm performance;

  • (4) The proposed AEDBO effectively optimizes the trajectory of airborne unmanned platforms in different initial conditions and environments to achieve optimal observation configuration.

Future steps will focus on:

  • (1) Introducing more systematic error to further investigate optimal observation configuration under complex error conditions;

  • (2) Introducing more unmanned platforms to study trajectory optimization and marine target tracking under complex environments with multiple airborne unmanned platforms, multiple maritime cooperative unmanned platforms, and multiple marine targets.