Scientific Reports

Table 3 Hyperparameters and settings.

From: An optimized resource allocation in cloud using prediction enabled reinforcement learning

Parameter	Value	Description
Learning rate (α)	0.01	Q-learning rate
Discount factor (γ)	0.9	Future reward importance
No. of episodes	500	RL training loops
FSWOA population size	30	Number of whales
FSWOA iterations	50	Optimization cycles
K in KNN	5	Neighbors for prediction
RT tree depth	6	Max tree depth

Back to article page

Search

Advanced search

Quick links