Table 1 PPO hyper parameters.
From: Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms
32 | Nb. epochs |
8 | Nb. environments |
2 | Size of mini-batches |
\(5\times 10^{-3}\) | Learning rate |
0.3 | Clipping range |
From: Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms
32 | Nb. epochs |
8 | Nb. environments |
2 | Size of mini-batches |
\(5\times 10^{-3}\) | Learning rate |
0.3 | Clipping range |