Table 6 Different researches classification method and results.
From: An open automation system for predatory journal detection
Research | Dataset | F1-score |
|---|---|---|
Al-Matham & Al-Khalifa (2017) | IsPredatory publishers' database which contains 1000 + entries of predatory publishers |  |
Bedmuth et al. (2020) | 6,268 articles from OMICS (as predatory) and 34,763 articles from BMC (as non-predatory) | Engineering Area Naïve Bayes:0.89 Random Forest: 0.83 Decision Tree: 0.71 Biomedical Area Naïve Bayes:0.96 Random Forest: 0.93 Decision: 0.90 |
Adnan et al. (2018) | 200 websites for training 200 websites for testing (Beall’s list as predatory, directory of open access journals lists as non-predatory) | Heuristic features Naïve Bayes:0.95 SVM: 0.98 KNN(k = 3): 0.94 NWF Naïve Bayes: 0.89 SVM:0.96 KNN(k = 3): 0.93 |
Our results | 1,636 websites for training 410 websites for testing (Predatory Journal Lists were collected from updated Beall’s and the Stop Predatory Journals lists. Legitimate journal list data were collected from the Berlin Institute of Health (BIH) Quest website) | NWF Gaussian Naïve Bayes: 0.89 SVM:0.952 Random Forest: 0.98 SGD:0.972 KNN(k = 4): 0.945 |