Fig. 6: The feature screening results.

Correlation screening and subsequent recursive elimination process for the input molecular feature descriptors in a strong base, b strong acid and c mixed enzyme dataset. The red star represented the number of the feature descriptors corresponding to the highest accurate model. Corresponding detailed process of feature screening via correlation analysis and recursive elimination in d strong base, e strong acid and f mixed enzyme dataset. The Y-axis was the name of the feature descriptors while the X-axis meant the round of the recursion elimination process (round 0 represented the results after the correlation screening. Yellow, blue and red rhombi represented the feature descriptors at round 0, remaining feature descriptors after each round of screening, and main feature descriptors obtained by the recursive elimination process, respectively.