Table 2 Performance of seven approaches to genetic association analyses for trait change in simulated and empirical longitudinal data

From: Analyzing longitudinal trait trajectories using GWAS identifies genetic variants for kidney function decline

	Simulated data			Empirical data
	UKB scenario for eGFR-trajectories			UKB eGFR-trajectories
Approaches	T1E [%] (CI)	Power [%] (CI)	Bias [%]	T1E [%] (CI)	“Power” in 9	“Bias” in 9	Identified in 595
Without singletons
Difference model	5.1 (4.7, 5.5)	12.0 (11.4, 12.7)	0.6	4.9 (4.5; 5.3)	4/9	0.0	2 (2)
Time model RI&RS	4.7 (4.2, 5.1)	25.6 (24.7, 26.4)	0.0	4.8 (4.4; 5.2)	7/9	8.2	7 (2)
Age model RI&RS	4.8 (4.4, 5.3)	31.4 (30.5, 32.4)	0.7	5.0 (4.5; 5.4)	8/9	Reference	9 (2)
Age model RI&RS uncorr.	7.5 (7.0, 8.1)	38.8 (37.8, 39.8)	0.7	8.8 (8.2; 9.3)	8/9	0.7	13 (3)
Age model RI-only	32.4 (31.4, 33.3)	59.0 (58.1, 60.0)	0.6	43.1 (42.2; 44.1)	9/9	16.9	98 (41)
BLUPs&LinReg	5.3 (4.9, 5.8)	44.8 (43.9, 45.8)	−37.7	5.0 (4.6; 5.5)	8/9	−38.5	13 (3)
Including singletons
Age model RI&RS	5.2 (4.8, 5.7)	44.1 (43.1, 45.1)	0.5	4.8 (4.3; 5.2)	8/9	0.2	12 (6)

We compared seven approaches (“Methods” section and Supplementary Table 2) regarding type I error, power, and bias: six approaches analyze individuals with ≥2 trait assessments over age/time (no singletons, i.e., individuals with =1 trait assessment), the 7th approach repeats age model RI&RS including singletons. Simulations were based on distributions of age, global/random trait effects, and random error as in UKB 350K for eGFR and simulated genotypes (EAF = 30%; 10,000 simulation runs; “Methods” section and Supplementary Table 3). This scenario covers a setting as in UKB (~50% singletons) for trajectories of a trait like eGFR with prounouced age effect on trait. We show estimates of type I error (T1E), power, and bias from 10,000 simulation runs. In empirical analyses using UKB 150K (no singletons) or 350K (including singletons), we show permutation-based type I error, proxies of power and bias (based on 9 SNPs known for eGFR-decline¹⁰), and number of SNPs identified with P_decline < 0.05/595 (P_decline< 5 × 10⁻⁸) among the 595 SNPs known for association with cross-sectional eGFR⁸.
Simulated data: T1E = proportion of SNPs with P_decline < 0.05 across 10,000 simulated SNPs given zero true effect on decline, β_decline = 0 (95% CI using SEs from exact binomial test); Power = proportion of SNPs with P_decline < 0.05 across 10,000 simulated SNPs given true effect on decline, β_decline = −0.025 (95% CIs derived from SEs using exact binomial test); Bias = relative bias of effect estimates given true effect on decline, β_decline = −0.025, derived as average (across 10,000 simulation runs) of (\({\hat{{{{\rm{\beta }}}}}}_{{{{\rm{decline}}}}}\)−β_decline)/β_decline; Empirical data: T1E = proportion of SNPs with P_decline < 0.05 among 10,000 permutation-based “null-SNPs” using eGFR-trajectories of UKB individuals (95% CIs using SEs from exact binomial test); “Power” in 9 = proportion of SNPs directionally consistent with P_decline < 0.05 in UKB 150K (for age model RI&RS: additionally in UKB 350K) among the 9 SNPs known for eGFR-decline¹⁰; “Bias” in 9 = relative deviation of effect estimate from reference among the 9 SNPs known for eGFR-decline¹⁰ derived as average across the 9 SNPs of (\({\hat{{{{\rm{\beta }}}}}}_{{{{\rm{decline}}}}}-{\hat{{{{\rm{\beta }}}}}}_{{{{\rm{decline}}}}({{{\rm{reference}}}})}\))/\(\,{\hat{{{{\rm{\beta }}}}}}_{{{{\rm{decline}}}}({{{\rm{reference}}}})}\); Identified in 595 = number of SNPs with P_decline < 0.05/595 (in parentheses: with P_decline < 5 × 10⁻⁸) among 595 SNPs tested.

Back to article page

Table 2 Performance of seven approaches to genetic association analyses for trait change in simulated and empirical longitudinal data

Search

Quick links