Table 2 Summary of variables in the geography dataset.

From: A dataset for understanding self-reported patterns influencing residential energy decisions

Variable

Description

Comments

PermNum

Unique integer across all datasets.

 

geo_state_zip

Information about the quality of reported state and ZIP Code.

• State and ZIP Code Provided (9,546)

• No state or ZIP Code provided (8)

• No ZIP Code provided (352)

• Provided Problematic ZIP Code (13).

geo_review

Combined measure of located based data quality.

Created by pasting together results of state_chk, zcta_chk, and county_chk.

state_chk

Whether or not the reported state is in same location as IP address state.

Excluded responses that were not within the U.S. but accepted responses from a different reported state, based on assumption that some respondents were using VPNs.

zcta_chk

Whether or not reported state is in same state as assigned ZCTA.

ZIP Code Tabulation Areas (ZCTAs) are U.S. Census approximations of the geographic extent of U.S. ZIP Codes, which are postal routes and points43.

Each respondent assigned to a ZCTA, based on reported ZIP Code. If respondent did not report a valid ZIP Code, ZCTA assigned them one based on geography review of location variables in QGIS39.

county_chk

Whether or not county assigned based on ZCTA is same as IP address county.

 

region

Assigned each observation to one of five regions: West, Midwest, Northeast, Central Southwest, Southeast.

Assigned based on assigned state in R31.

assigned_state

Assigned state for each respondent.

Reported state is same as assigned state, unless respondent did not report state. In that case, assigned state is IP address state.

reported_state

State that respondent reported as their residence.

 

zcta_state

State in which assigned ZCTA occurs.

 

assigned_county_fip

5 digit code for county to join with other datasets.

 

assigned_county

Name of assigned county.

Â