Table 2 Predictor variables from the Census 2018 dataset.

From: Using machine learning to explore the efficacy of administrative variables in prediction of subjective-wellbeing outcomes in New Zealand

Predictor variable

Type of variable

Number of categories

Description

1. Age (in years)

Continuous

NA

0–120

2. Gender

Categorical

2

Male or Female

3. Ethnicity

Categorical

5

European,

NZ Māori,

Pacific,

Asian,

Middle Eastern/Latin American/African and Other Ethnic groups

4. Region

Categorical

6

Auckland,

Wellington,

Northland group (Northland, Bay of Plenty, Gisborne),

Rest of North Island,

Canterbury,

Rest of South Island

5. Marital Status

Categorical

5

Married (not separated),

Separated,

Divorced or dissolved,

Widowed or surviving civil union partner,

Never married and never in a civil union

6. Birth Country

Categorical

2

New Zealand, Other

7. Highest Qualification

Categorical

8

No Qualification,

School Qualification,

Post-school Qualification,

Bachelor’s degree and Level 7 Qualification,

Post-graduate and Honours Degrees,

Master’s Degree,

Doctorate Degree,

Overseas Secondary School Qualification

8. Personal Income

Categorical

9

$0–$30,000

$30,001–$35,000

$35,001–$40,000

$40,001–$50,000

$50,001–$60,000

$60,001–$70,000

$70,001–$100,000

$100,001-$150,000,

$150,001 or More

9. Household Income

Categorical

9

Same as Personal Income

10. Number of income sources

Categorical

5

No source of income,

One source,

Two sources,

Three sources,

Four sources,

Five or more sources,

11. Workforce Status

Categorical

4

Employed Full-time,

Employed Part-time,

Unemployed,

Not in the Labour Force

12. Study Participation Code

Categorical

3

Full-time study,

Part-time study,

Not studying

13. Number of Languages spoken

Categorical

7

None,

One Language,

Two Languages,

Three Languages,

Four Languages,

Five Languages,

Six Languages

14. Home Ownership

Categorical

3

Hold in a family trust,

Own or partly own,

Do not own and do not hold in a family trust

15. Index of Socioeconomic Deprivation Score 201826

Continuous

Derived variable

823–1552

16. Index of Socioeconomic Deprivation 201826

Categorical

10

1-Least deprived

10-Most deprived

17. Dwelling dampness indicator

Categorical

4

Always damp,

Sometimes damp,

Not damp,

Don’t know

18. Dwelling mould indicator

Categorical

4

Mould over A4 size–always,

Mould over A4 size–sometimes,

No mould/mould smaller than A4 size,

Don’t know

19. Difficulty in Seeing

Categorical

4

No difficulty,

Some difficulty,

A lot of difficulty,

Cannot do at all

20. Difficulty in Hearing

Categorical

4

Same as above

21. Difficulty in Washing

Categorical

4

Same as above

22. Difficulty in Communication

Categorical

4

Same as above

23. Difficulty in Remembering

Categorical

4

Same as above

24. Difficulty in Walking

Categorical

4

Same as above

25. Disability indicator

Categorical

2

Not disabled,

Disabled

26. Crowding code-based on Canadian National Occupancy Standard

Categorical

5

2 + beds needed,

1 bed needed,

no beds needed,

1 bed spare,

2 + beds spare

27. Cigarette smoking behaviour

Categorical

3

Regular Smoker,

Ex-Smoker,

Never Smoked Regularly

28. Have you ever smoked?

Categorical

2

Yes or no

29. Do you smoke regularly?

Categorical

2

Yes or no