Fig. 2: Motif analysis of OPTI-CGBEs.

a The C-to-G transversion efficiency induced by eOPTI-CGBE or cOPTI-CGBE of targeted Cs bearing different nucleotides 1nt upstream. N = A, T, G, or C. P values above each group were calculated between the group with “GCN” group. b The C-to-G transversion efficiency induced by eOPTI-CGBE or cOPTI-CGBE of targeted Cs bearing different nucleotides 1nt downstream. P values above each group were calculated between the group with “NCG” group. c Comparison of base editing efficiency of eOPTI-CGBE or cOPTI-CGBE at “WCW” or other motif of the 34 original and 20 additional target sites. W = A or T. d Indel frequency of eOPTI-CGBE or cOPTI-CGBE at “WCW” or other motif of the 34 original and 20 additional target sites. e Comparison of C-to-G editing efficiency of OPTI-CGBEs with eA3A deaminase at “TCW” or other motif of the 34 original and 20 additional target sites. f Comparison of C-to-G editing efficiency of OPTI-CGBEs with hA3G or hA3G-CTD deaminase at “CCN” or other motif of the 34 original target sites and 26 additional target sites. The center line indicates the median, and the bottom and top lines of the box represent the first quartile and third quartile of the values, respectively. Tails extend to the minimum and maximum values. n = 3 biological replicates for each site. All P values were calculated by two-sided Wilcoxon rank sum tests. g C-to-G editing efficiency of each C induced by OPTI-CGBEs with hA3G or hA3G-CTD deaminase when the target sites had more than 2 Cs. n = 3 biological replicates for each site. Data are presented as mean values ± SEM. h Motif logo detected by a logistic regression model developed with a training dataset (80%) sampled from the detected base editing activities with the paired sgRNA library. The y-axis represents learned weights from the regression model for each nucleotide.