Table 1 Summary of deep RL components.
From: Designing mechanically tough graphene oxide materials using deep reinforcement learning
DRL component | Notation | Description |
|---|---|---|
State | st | Current functional group locations on the GBP at time step t |
State space | S | All possible functional group locations on the GBP |
Action | at | To assign a functional group to a functional group spot on the GBP |
Action space | \({{{\mathcal{A}}}}({{{\mathbf{s}}}}_t)\) | All available functional group spots left given st |
Reward | rt | Standardized toughness if at terminal step; otherwise, 0 |