Table 1 Energy-use datasets published in the residential sector.

From: High resolution synthetic residential energy use profiles for the United States

Authors/Dataset

Description

Klemanjak et al.26,75

A synthetic energy demand dataset was released for 21 appliances in Austria in 2020. Data collected from two households was used to train models and then appropriate noise was added for appliance start times and durations to mimic variations in actual consumption patterns.

Kolter et al.76,77

The Reference Energy Disaggregation Data Set (REDD) is published by MIT. The dataset contains high-frequency current/voltage waveform data of the power mains in households along with labeled circuits in the house.

Makonin et al.78

The Rainforest Automation Energy (RAE) dataset was published by Harvard in 2017. The dataset contains 1 Hz data (mains and sub-meters) from two residential houses.

Murray et al.79,80

Load measurements from 20 households of UK from a two year longitudinal study.

Pecan Street22,23

Labeled circuit data for households across major cities in the U.S. This is said to be the most comprehensive dis-aggregate energy data available for the U.S.

Rashid et al.81,82

The I-blend dataset has recorded minute-level consumption of all the buildings at an academic institute in India over a period of 52 months

Paige et al.83,84

The flEECe dataset provides energy data at a 1 Hz sampling rate for four circuits for six net-zero energy senior housing units in Virginia, USA for nine months

Shin et al.85,86

The first Korean dataset measuring appliance-level energy data was released in 2019 for 22 houses in Korea.

Kelly et al.20,87

Power demand is recorded from five houses UK houses at two levels – whole house and individual appliances. This dataset is referred to as the UK-Dale dataset. Two versions of this dataset have been released.

Anderson et al.88,89

Building-Level fUlly-labeled dataset for Electricity Disaggregation (BLUED) for one household in Pittsburg U.S. for one week. State transition of appliances are labeled and time-stamped, providing the necessary ground truth for the evaluation of NILM algorithms.

Barker et al.90,91

Electricity usage data is monitored every minute from nearly every plug load from 400 anonymous homes.

Beckel et al.92

Electricity consumption is monitored via smart plugs for six households in Switzerland over a period of 8 months.

Pereira et al.93,94,95

Power usage for 44 apartments and 6 homes in Portugal is collected for 264 days at 30 minute intervals. The advanced version of this dataset ‘SustDataED2’ dataset contains 96 days of aggregated and individual appliance consumption from one household in Portugal.

Monacchi et al.96,97

Common household devices are monitored for power consumption in Austria and Italy (GREEND dataset).

Pullinger et al.98,99

1-second electricity data is gathered over a period of 23 months from 255 UK homes (IDEAL household energy dataset).

Ruhnau et al.100,101

Synthetic national time series of heat demand that covers over 16 countries in the EU from 2008 to 2018.