Figure 3
From: Scalable photonic reinforcement learning by time-division multiplexing of laser chaos

Decision-making performance as a function of inter-decision sampling interval. CDR comparison at cycles (a) 10 and (b) 100. In this analysis, the reward probabilities of Machine 0 and 1 are 0.1 and 0.5, respectively.