Table 3 Hyperparameters and settings.

From: An optimized resource allocation in cloud using prediction enabled reinforcement learning

Parameter

Value

Description

Learning rate (α)

0.01

Q-learning rate

Discount factor (γ)

0.9

Future reward importance

No. of episodes

500

RL training loops

FSWOA population size

30

Number of whales

FSWOA iterations

50

Optimization cycles

K in KNN

5

Neighbors for prediction

RT tree depth

6

Max tree depth