Table 1 Summary of the dataset (E = English/C = Chinese).
| | Reception dataset | | Source dataset | |
|---|---|---|---|---|
| | Goodreads (E) | Douban (C) | Original (E) | Translation (C) |
| No. of characters | 269,565 | 87,735 | 87,194 | 30,587 |
| No. of tokens | 48,010 | 8,299 | 19,300 | 2,910 |
| No. of reviews | 300 | 300 | N/A | N/A |