Extended Data Fig. 3: Measured potentiation and depression curves of the memristor devices used for the hardware runs of the T-maze navigation task (Fig. 4 of the main text and Supplementary Note 7).
From: Actor–critic networks with analogue memristors mimicking reward-based learning

The 9 critic weights are denoted as wj and the 18 actor weights as θij.