Table 1 Variables included within the model to predict a home’s risk of damp.

From: A predictive model for damp risk in English housing with explainable AI

Variable name

Description

hasDamp

The dependent variable for the model: a TRUE/FALSE indicator of whether a home has damp issues. This is the outcome that the model aims to predict.

Current Energy Efficiency

A numerical measure of the home’s current energy efficiency.

Property Type

A categorical variable describing the type of property (e.g., flat, house, bungalow).

Energy Consumption Current

A numerical variable indicating the EPC-predicted energy consumption of the home (kWh per year).

Heating Cost Current

A numerical variable representing the EPC-predicted annual heating cost (£ per year), calculated using modelled energy efficiency estimates under standardised conditions.

Total Floor Area

A numerical variable denoting the total floor area of the property (square meters).

Number of Habitable Rooms

A numerical variable indicating the number of habitable rooms in the home.

Number of Heated Rooms

A numerical variable specifying the number of rooms that are heated within the home.

Current Energy Rating

A categorical variable representing the current energy rating of the home (e.g., A, B, C, etc.).

Built Form

A categorical variable describing the building’s structure or form (e.g., detached, semi-detached, terraced).

Floor Description

A categorical variable providing details about the type of flooring or floor construction (e.g., solid, suspended).

Walls Description

A categorical variable detailing the type of walls or wall insulation (e.g., cavity wall, solid wall).

Walls Energy Efficiency

A categorical variable indicating the energy efficiency rating of the walls.

Construction Age Band

A categorical variable indicating the age band or period when the property was constructed (e.g., pre-1900, 1967–1975).
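
As an illustration only, the variable types listed in Table 1 could be recorded in a small schema so that numerical and categorical predictors can be handled separately before modelling. The sketch below is not taken from the paper: the names TARGET, VARIABLE_KINDS and split_predictors are hypothetical, and the kind labels simply restate the "numerical" or "categorical" wording of the descriptions above.

```python
# Hypothetical schema restating Table 1. The variable names come from the
# table; the kind labels restate each description ("numerical"/"categorical").
TARGET = "hasDamp"  # TRUE/FALSE outcome variable

VARIABLE_KINDS = {
    "hasDamp": "boolean",
    "Current Energy Efficiency": "numerical",
    "Property Type": "categorical",
    "Energy Consumption Current": "numerical",
    "Heating Cost Current": "numerical",
    "Total Floor Area": "numerical",
    "Number of Habitable Rooms": "numerical",
    "Number of Heated Rooms": "numerical",
    "Current Energy Rating": "categorical",
    "Built Form": "categorical",
    "Floor Description": "categorical",
    "Walls Description": "categorical",
    "Walls Energy Efficiency": "categorical",
    "Construction Age Band": "categorical",
}


def split_predictors(kinds, target):
    """Return (numerical, categorical) predictor names, excluding the target."""
    numerical = [name for name, kind in kinds.items()
                 if kind == "numerical" and name != target]
    categorical = [name for name, kind in kinds.items()
                   if kind == "categorical" and name != target]
    return numerical, categorical


numerical, categorical = split_predictors(VARIABLE_KINDS, TARGET)
print(len(numerical), "numerical predictors:", numerical)
print(len(categorical), "categorical predictors:", categorical)
```

Keeping the outcome variable out of both predictor lists guards against accidentally feeding hasDamp to the model as an input; how each group is then encoded or scaled is a separate choice that the table does not specify.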