Extended Data Fig. 2: ProDomino correctly identifies insertion-tolerant regions in AraC.

a, The insertion score for the bacterial transcription factor AraC is shown for each amino acid position. The scores of models trained for different numbers of steps are shown in different shades of gray. Five individual subplots are presented for clarity. Green regions indicate experimentally validated insertion tolerant sites. The two sites previously used to engineer light-regulated AraC variants, I113 and S170, are indicated in dark green. b, ROC curves based on the predictions in a are shown. The area under the curve (AUC) is given for each model variant.