Table 2 Parameter settings for simulation experiment 1.
From: Autonomous air combat decision making via graph neural networks and reinforcement learning
| Training parameters | Values |
|---|---|
| act hidden size | 128 128 |
| recurrent hidden layers | 1 |
| recurrent hidden size | 128 |
| buffer size | 3,000 |
| clip param | 0.2 |
| entropy coef | 0.001 |
| gae lambda | 0.95 |
| gamma | 0.99 |
| lr | 0.0003 |
| max grad norm | 2 |
| num mini batch | 5 |
| num env steps | 100,000,000 |
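
For readers who want to reuse these settings, the table can be captured as a small configuration object. The following is a minimal sketch, not the authors' code: the class name `TrainingConfig`, the field names, and the helper `mini_batch_size` are hypothetical, and reading "128 128" as two hidden layers of 128 units each is an assumption. The helper also assumes the buffer is split evenly across mini-batches, which the table does not state.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TrainingConfig:
    """Values copied from Table 2; class and field names are illustrative."""

    # Assumption: "128 128" means two hidden layers with 128 units each.
    act_hidden_size: Tuple[int, ...] = (128, 128)
    recurrent_hidden_layers: int = 1
    recurrent_hidden_size: int = 128
    buffer_size: int = 3_000
    clip_param: float = 0.2
    entropy_coef: float = 0.001
    gae_lambda: float = 0.95
    gamma: float = 0.99
    lr: float = 0.0003
    max_grad_norm: float = 2.0
    num_mini_batch: int = 5
    num_env_steps: int = 100_000_000

    def mini_batch_size(self) -> int:
        # Assumption: the buffer is divided evenly among the mini-batches.
        return self.buffer_size // self.num_mini_batch


config = TrainingConfig()
print(config.mini_batch_size())
```

Freezing the dataclass keeps the settings immutable once an experiment starts, so any change to a value requires constructing a new configuration explicitly.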