Table 6 Overall performance comparison across algorithms. DQN-based methods report mean ± standard deviation over 12 independent runs with 20 test episodes per run. T-APF reports mean ± standard deviation over 20 test episodes due to stochastic escape behavior. E-APF is deterministic and reports single-run values.

From: A hybrid APF-DQN framework with transformer-based current prediction for USV path planning in dynamic ocean environments

Algorithm

Success rate (%)

Path length (m)

Decision steps

Energy consumption

T-APF

95.0

\(348.18 \pm 63.35\)

\(92.50 \pm 16.26\)

\(116.99 \pm 30.37\)

E-APF

100.0

270.09

47.00

53.67

DQN

100.00

\(294.69 \pm 20.86\)

\(50.85 \pm 6.17\)

\(76.04 \pm 13.30\)

DQN-EG

100.00

\(307.91 \pm 5.43\)

\(47.05 \pm 1.47\)

\(61.81 \pm 3.37\)

APF-DQN-NOC

100.00

\(312.35 \pm 11.01\)

\(55.55 \pm 3.41\)

\(79.53 \pm 7.46\)

APF-DQN

100.00

\(262.85 \pm 4.18\)

\(41.30 \pm 0.84\)

\(49.55 \pm 2.05\)