Table 1 Feature set. Weighted means of all assessed features, and whether they were used in the final model.
From: Machine-learning approach expands the repertoire of anti-CRISPR protein families
Feature name | Acr mean | Non-Acr mean | Used in final model |
|---|---|---|---|
Containing Genome is Prokaryote | 0.8956 | 0.8958 | No |
Containing Genome is Self-Targeting | 0.3344 | 0.1919 | Yes |
Directon Annotated Protein Fraction | 0.22 | 0.69 | Yes |
Directon Protein Lengths Mean | 119.27 | 251.71 | Yes |
Directon Predicted Membrane-Associated Fraction | 0.06 | 0.26 | No |
Directon Size | 3.49 | 3.5 | Yes |
Protein is Annotated | 0.0641 | 0.6731 | Yes |
Protein has HTH-Downstream | 0.4008 | 0.1181 | Yes |
Protein is Predicted Membrane Associated (TMHMM, SignalP) | 0.0256 | 0.2781 | No |
Directon Spacing | 18.37 | 13.7 | No |
Protein Length | 104.11 | 245.54 | Yes |
Mean Hydrophobicity (Kyte and Doolittle) | −0.48 | −0.15 | Yes |