Table 1 Summary of deep RL components.

DRL component	Notation	Description
State	s_t	Current functional group locations on the GBP at time step t
State space	S	All possible functional group locations on the GBP
Action	a_t	Assignment of a functional group to an open functional group spot on the GBP
Action space	\(\mathcal{A}(\mathbf{s}_t)\)	All available functional group spots left given s_t
Reward	r_t	Standardized toughness at the terminal step; 0 at all other steps
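
The components in Table 1 can be read as an episodic environment with a sparse terminal reward. The following is a minimal sketch of that reading, not the authors' implementation. Several details are assumptions made for illustration: the class name `GBPAssignmentEnv`, the encoding of a state as a tuple with one entry per spot, the encoding of an action as a (spot, group) pair, the episode ending once every spot is filled, and the `toughness_fn` callable, which stands in for whatever procedure returns the standardized toughness of a completed design.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Assumed encodings (not specified in the table):
State = Tuple[Optional[str], ...]  # one entry per spot; None means still empty
Action = Tuple[int, str]           # (spot index, functional group label)


class GBPAssignmentEnv:
    """Hypothetical environment mirroring the components of Table 1."""

    def __init__(
        self,
        n_spots: int,
        groups: Sequence[str],
        toughness_fn: Callable[[State], float],
    ) -> None:
        self.n_spots = n_spots
        self.groups = list(groups)
        # Placeholder for the evaluation that yields standardized toughness.
        self.toughness_fn = toughness_fn
        self.state: State = (None,) * n_spots

    def reset(self) -> State:
        """Start a new episode with every spot empty."""
        self.state = (None,) * self.n_spots
        return self.state

    def available_actions(self, state: Optional[State] = None) -> List[Action]:
        """A(s_t): every (empty spot, group) pair still open in the state."""
        state = self.state if state is None else state
        return [
            (i, g)
            for i, slot in enumerate(state)
            if slot is None
            for g in self.groups
        ]

    def step(self, action: Action) -> Tuple[State, float, bool]:
        """Apply a_t and return (s_{t+1}, r_t, done)."""
        spot, group = action
        if group not in self.groups:
            raise ValueError(f"unknown functional group: {group!r}")
        if self.state[spot] is not None:
            raise ValueError(f"spot {spot} is already occupied")
        new_state = list(self.state)
        new_state[spot] = group
        self.state = tuple(new_state)
        # Assumption: the episode terminates once no empty spot remains.
        done = all(slot is not None for slot in self.state)
        # Reward is zero everywhere except at the terminal step.
        reward = float(self.toughness_fn(self.state)) if done else 0.0
        return self.state, reward, done
```

Under these assumptions, an agent calls `reset()`, repeatedly picks an element of `available_actions()`, and passes it to `step()` until `done` is true; only that final call returns a nonzero reward.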
