Table 9 The performance for K-mean algorithm for the first situation used the text files with Preprocessing 1.
From: Open source Arabic research paper dataset for natural language processing
# K | Silhouette coefficient (euclidean) | Silhouette coefficient (cosine) | ARI | Davies–Bouldin Index |
---|---|---|---|---|
2 | 0.023 | 0.040 | 0.031 | 3.626 |
3 | 0.042 | 0.076 | 0.073 | 3.659 |
4 | 0.033 | 0.060 | 0.078 | 3.370 |
5 | 0.047 | 0.087 | 0.245 | 5.595 |
6 | 0.046 | 0.084 | 0.266 | 5.906 |
7 | 0.053 | 0.097 | 0.371 | 5.175 |
8 | 0.029 | 0.054 | 0.306 | 4.550 |
9 | 0.037 | 0.068 | 0.419 | 5.123 |
10 | 0.013 | 0.019 | 0.070 | 1.866 |