Table 2 Algorithm result index comparison in “long corridor”.
From: Improved double DQN with deep reinforcement learning for UAV indoor autonomous obstacle avoidance
Double DQN | Improved Double DQN | Improvement | |
|---|---|---|---|
Average cumulative reward | 17.66 | 21.7 | 22.88% |
Maximum cumulative reward | 51.81 | 104.43 | 101.56% |
Average safe flight distance(m) | 22.7 | 27.96 | 23.17% |
Maximum safe flight distance(m) | 64 | 131.6 | 105.62% |