Extended Data Fig. 1: Standardized mean differences between COVID-19 non-hospitalized, hospitalized, and the control before and after weighting in analyses of death, hospitalization, and sequelae.

Plots standardized mean differences before weighting (left) and after weighting (right). Each row represents a sub-cohort used in analysis of the risks of death, hospitalization, and sequelae that was free of history of the respective outcome at baseline. Rows are ordered, from top to bottom, on the basis of the lowest to highest percent of SMD that were unbalanced (SMD > 0.1) among unweighted sub-cohorts. Each row consists of the distribution of SMD within the sub cohort. Each cell represents a percentile (from 0th to 100th) of the distribution of the SMD within the respective sub-cohort (x-axis). Includes SMD for non-hospitalized COVID-19 compared to the control, hospitalized COVID-19 compared to the control, and non-hospitalized compared to hospitalized COVID-19. SMD was estimated for differences in participant characteristics including pre-defined covariates, algorithmically selected high dimensional covariates, and non-selected high dimensional covariates. After weighting, no covariates had an SMD greater than 0.1 in any sub-cohorts.