Extended Data Fig. 5: Average SSM sequence entropy for different regions of binders. | Nature


From: Design of protein-binding proteins from the target structure alone


The sequence entropy of each position was calculated from the amino acid counts in the sort whose concentration was closest to 10-fold below the estimated parent SC50, by computing the Shannon entropy over all amino acids observed at that position. For each binder, the plotted values are the average entropies over all positions within each of the three zones. "Validated" and "Not Validated" refer to the outcome of the SSM validation procedure with a cutoff of 0.005 (see Methods and Extended Data Figure 15). Core residues of the monomer and core residues of the interface are expected to be conserved, whereas substitutions at surface residues are expected to have little effect; consistent with this expectation, the validated binders trend above the line. Points on the line show no difference in entropy between their surfaces and cores, potentially indicating unfolded or misfolded proteins. Points below the line may be misfolded or may bind using alternate residues.
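The per-position entropy and per-zone averaging described in the caption can be sketched as follows. This is a minimal illustration, not the authors' code: the logarithm base (base 2, giving bits), the function names, the input layout, and the zone labels `monomer_core`, `interface_core`, and `surface` are all assumptions made for the example.

```python
import math
from collections import defaultdict


def position_entropy(counts):
    """Shannon entropy of one position, in bits (base 2 is an assumption).

    `counts` maps amino acid letter -> read count from the chosen sort.
    Only amino acids with a nonzero count contribute to the sum.
    """
    observed = [c for c in counts.values() if c > 0]
    total = sum(observed)
    if total == 0:
        return 0.0
    entropy = 0.0
    for c in observed:
        p = c / total
        entropy -= p * math.log2(p)
    return entropy


def zone_average_entropy(position_counts, position_zone):
    """Average the per-position entropies within each zone.

    `position_counts` maps position -> {amino acid: count}.
    `position_zone` maps position -> zone label (hypothetical labels).
    """
    by_zone = defaultdict(list)
    for pos, counts in position_counts.items():
        by_zone[position_zone[pos]].append(position_entropy(counts))
    return {zone: sum(vals) / len(vals) for zone, vals in by_zone.items()}


# Toy data: a fully conserved core position and two variable surface positions.
position_counts = {
    1: {"L": 100},
    2: {"A": 50, "G": 50},
    3: {"A": 25, "G": 25, "S": 25, "T": 25},
}
position_zone = {1: "monomer_core", 2: "surface", 3: "surface"}
averages = zone_average_entropy(position_counts, position_zone)
```

In this toy input the conserved core position has zero entropy while the surface positions average 1.5 bits, which is the pattern the caption associates with a folded binder whose core is constrained and whose surface tolerates substitution.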
