Figure 6 | Scientific Reports

Figure 6

From: Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target

Figure 6

Maximizing guides with positive signals and recommended feature ranges (Watson–Crick). Decision tree for maximizing the number of guides with positive signals. Percentage of positive guides for each set of conditions is highlighted in red. Using the rules output by the RuleFit classifier, conditional control statements were used to filter the initial 84 positive guides from the total set of 192. The features used are n, IQR, min, and PFS_1. Linearized, the decision rules for maximizing positive signals in our set of guides are 1) if IQR is ≥ 6.25, n is between 0 to 3 and pfs_1 is A, U, or C, 2) if IQR is < 6.25, n is between, 0 to 2, and 3) if n = 4, IQR is between 9 and 28 and min is between 1 and 3. These rules achieve 95% positive guides from an initial pool that is 44% positive, including guides with up to four mismatches.

Back to article page