Table 1 Fitness of self-contained classifiers to address characteristics and issues in enterprise data.
Classifiers | Imbalance | Mixed features | Heterogeneity | Sparsity | Inconsistency | Dynamics | Data quality issues |
---|---|---|---|---|---|---|---|
KNN | \(\checkmark\) | ||||||
Naive Byes | \(\checkmark\) | \(\checkmark\) | |||||
SVM | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | ||||
Decision Tree | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | |||
Random Forest | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | |||
XGBoost | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | |||
DNN | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | ||||
Table2Vec | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) | \(\checkmark\) |