Table 11 The performance for mini-batch K-means algorithm for the first situation used the text files with Preprocessing 1.
From: Open source Arabic research paper dataset for natural language processing
# K | Silhouette coefficient (euclidean) | Silhouette coefficient (cosine) | ARI | Davies–Bouldin Index |
---|---|---|---|---|
2 | 0.013 | 0.022 | 0.004 | 4.827 |
3 | 0.028 | 0.050 | 0.037 | 2.597 |
4 | 0.007 | 0.011 | 0.040 | 2.359 |
5 | 0.030 | 0.054 | 0.125 | 2.922 |
6 | 0.031 | 0.056 | 0.120 | 3.038 |
7 | 0.022 | 0.038 | 0.101 | 2.615 |
8 | −0.001 | -0.005 | 0.147 | 3.348 |
9 | 0.004 | 0.004 | 0.120 | 3.607 |
10 | −0.019 | -0.039 | 0.122 | 2.952 |