Table 7 Screening success of the general scoring functions trained with MLR, SMOreg and RF, evaluated on the AKT2, KIT and MK01 datasets from DUD-E. ac, dec and tot are the numbers of active compounds, decoy compounds and total molecules in the final dataset (i.e., compounds that were docked with DockThor and rescored with DockTScore). Only the top-scored protonation state of each compound according to each scoring function (SF) was kept.

From: New machine learning and physics-based scoring functions for drug discovery

| Target (dataset size) | Metric | MLR (general SF) | SMOreg (general SF) | RF (general SF) |
|---|---|---|---|---|
| AKT2 (ac = 116; dec = 6,892; tot = 7,008) | AUC | 0.769 | 0.800 | 0.814 |
| | EF1% (max = 60.414) | 24.166 | 15.535 | 13.809 |
| | BEDROC20 | 0.421 | 0.378 | 0.379 |
| | BEDROC100 | 0.394 | 0.288 | 0.269 |
| KIT (ac = 166; dec = 10,447; tot = 10,613) | AUC | 0.640 | 0.635 | 0.657 |
| | EF1% (max = 63.934) | 3.016 | 2.413 | 5.428 |
| | BEDROC20 | 0.148 | 0.146 | 0.176 |
| | BEDROC100 | 0.063 | 0.043 | 0.090 |
| MK01 (ac = 78; dec = 4,548; tot = 4,626) | AUC | 0.786 | 0.766 | 0.745 |
| | EF1% (max = 59.308) | 10.314 | 12.893 | 7.736 |
| | BEDROC20 | 0.352 | 0.364 | 0.340 |
| | BEDROC100 | 0.153 | 0.220 | 0.193 |
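
The maximum EF1% values quoted for each target appear to match tot/ac, which is the enrichment factor obtained when every compound in the top 1% of the ranked list is an active. The sketch below reproduces those bounds from the ac and dec counts in the table. The function name `max_enrichment_factor` is hypothetical, and flooring the size of the top 1% is an assumption; the source does not state how these values were computed.

```python
def max_enrichment_factor(n_actives: int, n_total: int, fraction: float = 0.01) -> float:
    """Upper bound on the enrichment factor at a given fraction of the ranked list.

    EF = (actives in top fraction / size of top fraction) / (n_actives / n_total).
    The bound is reached when the top fraction holds as many actives as it can.
    """
    # Assumption: the size of the top fraction is rounded down (at least 1).
    n_top = max(1, int(n_total * fraction))
    best_hits = min(n_actives, n_top)
    return (best_hits / n_top) / (n_actives / n_total)


# (ac, dec) counts taken from the table above.
datasets = {"AKT2": (116, 6892), "KIT": (166, 10447), "MK01": (78, 4548)}

for name, (ac, dec) in datasets.items():
    tot = ac + dec
    print(name, tot, round(max_enrichment_factor(ac, tot), 3))
```

For all three targets the number of actives exceeds 1% of the dataset, so the bound reduces to tot/ac and does not depend on the rounding assumption.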