Table 1. PPO hyperparameters.

From: Reinforcement learning for patient-specific optimal stenting of intracranial aneurysms

| Parameter | Value |
|---|---|
| Nb. epochs | 32 |
| Nb. environments | 8 |
| Size of mini-batches | 2 |
| Learning rate | \(5\times 10^{-3}\) |
| Clipping range | 0.3 |
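
As a rough illustration of how these settings might be carried in code, the sketch below stores the five listed values in a small configuration object, reading each value as paired with the label printed next to it in the table. The class name `PPOConfig`, its field names, and the helper `clipped_surrogate` are hypothetical and do not come from the article; the helper shows only the standard PPO clipped surrogate term, which is where a clipping range of 0.3 would enter, and is not claimed to be the authors' implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PPOConfig:
    """Hypothetical container for the values listed in Table 1."""
    n_epochs: int = 32          # Nb. epochs
    n_envs: int = 8             # Nb. environments
    minibatch_size: int = 2     # Size of mini-batches
    learning_rate: float = 5e-3
    clip_range: float = 0.3


def clipped_surrogate(ratio: float, advantage: float, clip_range: float) -> float:
    """Standard PPO clipped surrogate for a single sample.

    ratio is the new-to-old policy probability ratio; the ratio is
    clamped to [1 - clip_range, 1 + clip_range] and the more
    pessimistic of the two candidate terms is kept.
    """
    clipped_ratio = max(1.0 - clip_range, min(ratio, 1.0 + clip_range))
    return min(ratio * advantage, clipped_ratio * advantage)


config = PPOConfig()
# With clip_range = 0.3, a ratio of 1.5 is clamped to 1.3 when the advantage is positive.
example_term = clipped_surrogate(1.5, 1.0, config.clip_range)
```

With a clipping range of 0.3, the probability ratio only contributes to the objective within the interval [0.7, 1.3] whenever moving further would increase the objective, which limits the size of each policy update.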