Table 2 Summary of variables in the geography dataset.
From: A dataset for understanding self-reported patterns influencing residential energy decisions
Variable | Description | Comments |
---|---|---|
PermNum | Unique integer across all datasets. | Â |
geo_state_zip | Information about the quality of reported state and ZIP Code. | • State and ZIP Code Provided (9,546) • No state or ZIP Code provided (8) • No ZIP Code provided (352) • Provided Problematic ZIP Code (13). |
geo_review | Combined measure of located based data quality. | Created by pasting together results of state_chk, zcta_chk, and county_chk. |
state_chk | Whether or not the reported state is in same location as IP address state. | Excluded responses that were not within the U.S. but accepted responses from a different reported state, based on assumption that some respondents were using VPNs. |
zcta_chk | Whether or not reported state is in same state as assigned ZCTA. | ZIP Code Tabulation Areas (ZCTAs) are U.S. Census approximations of the geographic extent of U.S. ZIP Codes, which are postal routes and points43. Each respondent assigned to a ZCTA, based on reported ZIP Code. If respondent did not report a valid ZIP Code, ZCTA assigned them one based on geography review of location variables in QGIS39. |
county_chk | Whether or not county assigned based on ZCTA is same as IP address county. | Â |
region | Assigned each observation to one of five regions: West, Midwest, Northeast, Central Southwest, Southeast. | Assigned based on assigned state in R31. |
assigned_state | Assigned state for each respondent. | Reported state is same as assigned state, unless respondent did not report state. In that case, assigned state is IP address state. |
reported_state | State that respondent reported as their residence. | Â |
zcta_state | State in which assigned ZCTA occurs. | Â |
assigned_county_fip | 5 digit code for county to join with other datasets. | Â |
assigned_county | Name of assigned county. | Â |