Machine learning (ML) is a powerful tool in the field of drug discovery, with the continuous development of new models, however, rational selection of the most appropriate model based on the task remains challenging. Here, the authors explore the capabilities of classical ML algorithms and newer models over a range of dataset tasks and show an optimal zone for each model type, developing a predictive model to aid in the selection of a modeling method based on dataset size and diversity.
- Scott H. Snyder
- Patricia A. Vignaux
- Sean Ekins