Table 1 Hyperparameters.

From: Efficient crowd simulation in complex environment using deep reinforcement learning

Parameter

Value

Description

\(\gamma\)

0.99

Discount factor

\(A_rl\)

1e–4

Actor learning rate

\(C_rl\)

1e–4

Critic learning rate

\(B_{size}\)

256

Batch size

\(RB_{size}\)

1e6

Reply buffer size

\(\tau\)

0.005

Target update rate

\(P_n\)

0.2

Policy noise

\(E_n\)

0.25

Exploration noise

\(w_1\)

0.15

 

\(w_2\)

0.08

 

\(w_3\)

1.2

 

\(w_4\)

0.4

 

\(r_{arrv}\)

10

 

\(r_{move}\)

– 0.6

Â