Table 3 Hyperparameters and settings.
From: An optimized resource allocation in cloud using prediction enabled reinforcement learning
| Parameter | Value | Description |
|---|---|---|
| Learning rate (α) | 0.01 | Q-learning rate |
| Discount factor (γ) | 0.9 | Future reward importance |
| No. of episodes | 500 | RL training loops |
| FSWOA population size | 30 | Number of whales |
| FSWOA iterations | 50 | Optimization cycles |
| K in KNN | 5 | Neighbors for prediction |
| RT tree depth | 6 | Max tree depth |
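
As a rough illustration of how these settings might be carried in code, the sketch below collects the table's values in a single configuration object and applies the learning rate (α) and discount factor (γ) in one standard tabular Q-learning update. The class name, field names, `q_update` helper, and toy Q-table are all hypothetical and chosen only for illustration. The table does not specify the paper's states, actions, or reward function, so none of those are modeled here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperparameters:
    """Values copied from Table 3; the field names are illustrative assumptions."""
    learning_rate: float = 0.01      # α, Q-learning rate
    discount_factor: float = 0.9     # γ, future reward importance
    num_episodes: int = 500          # RL training loops
    fswoa_population_size: int = 30  # number of whales
    fswoa_iterations: int = 50       # optimization cycles
    knn_k: int = 5                   # neighbors for prediction
    rt_max_depth: int = 6            # max tree depth


def q_update(q, state, action, reward, next_state, hp):
    """Apply one standard tabular Q-learning update in place and return the new value."""
    best_next = max(q[next_state])                      # greedy value of the next state
    td_target = reward + hp.discount_factor * best_next
    q[state][action] += hp.learning_rate * (td_target - q[state][action])
    return q[state][action]


hp = Hyperparameters()

# Toy 2-state, 2-action Q-table (hypothetical, not from the paper).
q_table = [[0.0, 0.0], [0.0, 1.0]]
new_value = q_update(q_table, state=0, action=1, reward=1.0, next_state=1, hp=hp)
print(new_value)  # approximately 0.019, i.e. 0.01 * (1.0 + 0.9 * 1.0)
```

With α = 0.01, each update moves the stored value only 1% of the way toward the target, which is in keeping with the relatively large episode count of 500 listed in the table.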