Table 4 Variable Description.

From: South Korean Election Campaign Booklet and Party Statements Corpora

Variable Name

Measurements

date

date of the election

name

name of the candidate

region

metro-level region where the district is located

district

name of the district

office_id

office identifier (see Table 5)

office

office where the candidate is running in English

giho

candidate identifier per NEC

party

name of the party

party_eng

name of the party in English

result

election result

result_code

not elected = 0, elected = 1

sex

candidate’s sex

sex_code

female = 0, and male = 1

birthday

candidate’s birthday

age

candidate’s age

job_id

identifier for candidate’s job per NEC

job

candidate’s job

job_name

candidate’s job category per NEC

job_name_eng

English translation of job_name

job_code

standardized identifier for job_name (see Table 6)

edu_id

identifier for candidate’s education level per NEC

edu

candidate’s education

edu_name

candidate’s education category per NEC

edu_name_eng

English translation of edu_name

edu_code

standardized identifier for edu_name (see Table 7)

career1

candidate’s career

career2

candidate’s career

pages

number of pages in the booklet

code

booklet identifier per NEC

text

full text from the booklet

filtered_text

khaiii parsed and filtered text from the booklet