Fig. 4

The algorithm architecture. Both the main Q-network and the target Q-network consist of one LSTM layer and two fully connected layers.
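
As a rough illustration of the layer stack named in the caption, the following is a minimal forward-pass sketch in plain Python. The class name `RecurrentQNetwork`, all layer sizes, the ReLU between the two fully connected layers, and the creation of the target network as a copy of the main network are assumptions made for illustration; the caption specifies only one LSTM layer followed by two fully connected layers.

```python
import copy
import math
import random


def _matrix(rows, cols, rng):
    """Small random weight matrix stored as a list of rows."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def _affine(weights, bias, x):
    """Compute weights @ x + bias for plain Python lists."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def _sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class RecurrentQNetwork:
    """One LSTM layer followed by two fully connected layers (forward pass only)."""

    def __init__(self, obs_dim, hidden_dim, fc_dim, n_actions, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # LSTM weights for the input, forget, candidate, and output gates, stacked row-wise.
        self.w_lstm = _matrix(4 * hidden_dim, obs_dim + hidden_dim, rng)
        self.b_lstm = [0.0] * (4 * hidden_dim)
        # First fully connected layer.
        self.w_fc1 = _matrix(fc_dim, hidden_dim, rng)
        self.b_fc1 = [0.0] * fc_dim
        # Second fully connected layer: one output per action.
        self.w_fc2 = _matrix(n_actions, fc_dim, rng)
        self.b_fc2 = [0.0] * n_actions

    def q_values(self, observations):
        """Return one Q-value per action for a sequence of observation vectors."""
        n = self.hidden_dim
        h = [0.0] * n  # hidden state
        c = [0.0] * n  # cell state
        for x in observations:
            z = _affine(self.w_lstm, self.b_lstm, list(x) + h)
            i = [_sigmoid(v) for v in z[:n]]
            f = [_sigmoid(v) for v in z[n:2 * n]]
            g = [math.tanh(v) for v in z[2 * n:3 * n]]
            o = [_sigmoid(v) for v in z[3 * n:]]
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        # ReLU between the two fully connected layers is an assumption.
        hidden = [max(0.0, v) for v in _affine(self.w_fc1, self.b_fc1, h)]
        return _affine(self.w_fc2, self.b_fc2, hidden)


# Hypothetical sizes; the target network starts as an independent copy (assumed).
main_net = RecurrentQNetwork(obs_dim=3, hidden_dim=8, fc_dim=16, n_actions=4)
target_net = copy.deepcopy(main_net)

episode = [[0.1, -0.2, 0.3], [0.0, 0.5, -0.1]]
q_main = main_net.q_values(episode)
q_target = target_net.q_values(episode)
```

Because the two networks share the same structure, the target network can be refreshed by copying the main network's weights; how often that happens is not stated in the caption.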
