Table 1 Summary of corpus.

From: Simplification in interpreting: the text classification of spoken and interpreted Chinese through ensemble learning techniques

Sub-corpora

Language

Year

Source

Text Count

Overall Size

Mean Text Length

Interpreted

Chinese

2014–2016

UN conference and international forum

235

252610

1075

Spoken

Chinese

2014–2016

235

251885

1072