Table 3 The impact of the amount of data on the algorithms.
Algorithm | Data source | Data sizes/GB | PR/% | RE/% |
|---|---|---|---|---|
SimHash | Company1 | 2.048 | 36 | 81 |
20.48 | 35 | 83 | ||
Company2 | 20.48 | 32 | 85 | |
204.8 | 30 | 86 | ||
MR-ST | Company1 | 2.048 | 96 | 91 |
20.48 | 96 | 91 | ||
Company2 | 20.48 | 95 | 92 | |
204.8 | 95 | 92 |