Large national-level electronic health record datasets offer new opportunities for disentangling the roles of genes and environment in human diseases. Here, the authors propose a spatial mixed linear effect model (SMILE) to dissect genetic and environmental risk factors for diseases and assess the causality of air pollutants in an insurance claim dataset with 50 million individuals.
- Daniel McGuire
- Havell Markus
- Bibo Jiang