Fig. 3: Influence of connectivity.

Characteristics of the 13 families of [[9, 3, 3]] codes found with our framework, clustered according to families distinguished by their quantum weight enumerators (13). Families 9 and 13 are degenerate, while the rest are non-degenerate. We have trained a total of 10240 agents for each of both cases. In the all-to-all (directed: CNOT(i < j)) connectivity, 9574 agents were successful, while this number went down to 3808 in the other case. The bars display how these codes are distributed across different families. Codes in the same family found by different agents are not necessarily distinct, so the bars are rather an indication of the likelihood of a training run to find a code within the family. The points show the mean circuit size, averaged within each family, while the error bar is its standard deviation. It is interesting to see that even with different connectivities, families occur with similar likelihoods during training. We explicitly list the corresponding quantum weight enumerators computed with (13) in the Supplementary.