Table 1 Parameters of the computational model

From: Leveraging agent-based models and deep reinforcement learning to predict taxis in cell migration

Hyperparameter

Value/Description

Batch size

512 (number of transitions sampled from the replay buffer)

Discount factor (γ)

0.1 (importance of future rewards)

Epsilon (ϵ)

0.9, decays to 0.0001 over 1000 episodes

Learning rate

1 × 10−4 (for the AdamW optimizer)

Soft update rate (Ï„)

0.005 (rate at which the target network is updated)

Number of observations (No)

72 (NN input)

Number of actions (Na)

16 (NN output)

Time step (Δt)

10 min

Cell velocity (vc)

0.2 μm/min (mean velocity of the cell from17)

Radius of the cell (Rc)

10 μm

Radius of the cell nucleus (RN)

4 μm