Genome wide association study and genomic risk prediction of age related macular degeneration in Israel

Grunin, Michelle; Triffon, Daria; Beykin, Gala; Rahmani, Elior; Schweiger, Regev; Tiosano, Liran; Khateb, Samer; Hagbi-Levi, Shira; Rinsky, Batya; Munitz, Refael; Winkler, Thomas W.; Heid, Iris M.; Halperin, Eran; Carmi, Shai; Chowers, Itay

doi:10.1038/s41598-024-63065-0

Download PDF

Article
Open access
Published: 06 June 2024

Genome wide association study and genomic risk prediction of age related macular degeneration in Israel

Michelle Grunin^1,2,
Daria Triffon¹,
Gala Beykin²,
Elior Rahmani³,
Regev Schweiger^4,8,
Liran Tiosano²,
Samer Khateb²,
Shira Hagbi-Levi²,
Batya Rinsky²,
Refael Munitz²,
Thomas W. Winkler⁷,
Iris M. Heid⁷,
Eran Halperin^4,5,6,
Shai Carmi¹^na1 &
…
Itay Chowers²^na1

Scientific Reports volume 14, Article number: 13034 (2024) Cite this article

4836 Accesses
9 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The risk of developing age-related macular degeneration (AMD) is influenced by genetic background. In 2016, the International AMD Genomics Consortium (IAMDGC) identified 52 risk variants in 34 loci, and a polygenic risk score (PRS) from these variants was associated with AMD. The Israeli population has a unique genetic composition: Ashkenazi Jewish (AJ), Jewish non-Ashkenazi, and Arab sub-populations. We aimed to perform a genome-wide association study (GWAS) for AMD in Israel, and to evaluate PRSs for AMD. Our discovery set recruited 403 AMD patients and 256 controls at Hadassah Medical Center. We genotyped individuals via custom exome chip. We imputed non-typed variants using cosmopolitan and AJ reference panels. We recruited additional 155 cases and 69 controls for validation. To evaluate predictive power of PRSs for AMD, we used IAMDGC summary-statistics excluding our study and developed PRSs via clumping/thresholding or LDpred2. In our discovery set, 31/34 loci reported by IAMDGC were AMD-associated (P < 0.05). Of those, all effects were directionally consistent with IAMDGC and 11 loci had a P-value under Bonferroni-corrected threshold (0.05/34 = 0.0015). At a 5 × 10⁻⁵ threshold, we discovered four suggestive associations in FAM189A1, IGDCC4, C7orf50, and CNTNAP4. Only the FAM189A1 variant was AMD-associated in the replication cohort after Bonferroni-correction. A prediction model including LDpred2-based PRS + covariates had an AUC of 0.82 (95% CI 0.79–0.85) and performed better than covariates-only model (P = 5.1 × 10⁻⁹). Therefore, previously reported AMD-associated loci were nominally associated with AMD in Israel. A PRS developed based on a large international study is predictive in Israeli populations.

Epistatic interactions of genetic loci associated with age-related macular degeneration

Article Open access 23 June 2021

Genome-wide association analyses identify distinct genetic architectures for age-related macular degeneration across ancestries

Article 02 December 2024

Integrating genetics, age and imaging to predict treatment outcomes in neovascular age-related macular degeneration: a proof-of-concept study

Article Open access 07 March 2026

Introduction

Age-related macular degeneration (AMD) is the leading cause of blindness in the elderly population. The risk for developing AMD is strongly associated with the genetic background of the individual^1,2. In 2005, AMD was the first disease for which genome-wide association studies (GWASs) have identified risk variants^3,4. Via a seminal paper published in 2016, the International Age-Related Macular Degeneration Genomics Consortium (IAMDGC) has reported the genotyping of more than 30,000 AMD patients and controls of European ancestry and the discovery of 52 risk variants across 34 loci².

Israel is home to a number of populations of distinct genetic ancestry, including Ashkenazi Jews, non-Ashkenazi Jews—predominantly North-African Jews and Middle-Eastern Jews, and Arabs—predominantly Palestinians, Bedouins, and Druze. These populations are genetically diverse, having genetic ancestry related to the Middle East, Africa, and Europe, with variable admixture proportions^5,6,7,8. Some of the populations have experienced recent population-specific genetic drift due to founder events and endogamy^7,9,10. The unique genetic background of the Israeli populations suggests that the genetic architecture of AMD might be different in these populations compared to Europeans. In addition, the Israeli populations that have experienced strong genetic drift may harbor deleterious risk variants at a considerable frequency. This will increase power for discovering novel risk variants¹¹ as previously observed for other retinal diseases^12,13,14.

Previous studies of the genetic basis of AMD in Israel found that the most prominent risk variants—the genes CFH¹⁵ and HTRA1/ARMS2¹⁶—were associated with AMD. However, the C2 locus, one of the top risk loci worldwide, was not associated with AMD in Israel¹⁷. The 2016 study of the IAMDGC included an Israeli cohort. However, it was analyzed jointly with the other studies, which was uninformative about Israeli-specific genetic architecture and risk variants. Searching for population-specific risk variants is important even beyond the population under study, as any discovered variants and biological pathways may provide insight into the pathogenesis of the disease.

Polygenic risk scores (PRSs) were recently developed for numerous diseases based on the results of large-scale GWASs¹⁸. Conceptually, a PRS is the count of risk alleles carried by an individual, where each allele is weighted by its effect size (usually the log odds-ratio), as estimated by GWAS. In practice, the list of single-nucleotide polymorphisms (SNPs) that are included in the score is optimized based on their strength of association (e.g., comparing multiple P-value cutoffs) and their correlation with other SNPs, in some methods accounting for prior assumptions on the genetic architecture of the disease¹⁹. While PRSs cannot unambiguously distinguish healthy and affected individuals (due to the small proportion of variance in disease liability they explain), individuals at the top PRS quantiles are at a particularly high risk^20,21. These individuals can then be subjected to personalized screening or prevention.

A number of recent papers have developed or examined PRSs for AMD, showing that the PRS has considerable power to predict disease status and disease progression^{2,22,23,24,25,26,27}. However, it is known that PRS accuracy can substantially decrease when evaluated in populations or ancestries other than the ones used for the original GWAS (usually European populations and ancestries)^28,29. So far, no study has examined the accuracy of an AMD PRS in any of the Israeli sub-populations, which forms a barrier to the implementation of DNA-based risk stratification.

In this paper, we used data on 558 AMD cases and 325 controls to investigate the genetic basis of AMD in the Israeli populations. Our study had three main goals. (1) To determine whether previously identified risk variants (from the IAMDGC 2016 GWAS) are associated with AMD in Israel, either across all Israeli sub-populations or in a population-specific manner. (2) To discover putative new AMD risk variants by running a GWAS in the Israeli study, anticipating that despite the small sample size, we may be able to identify risk variants that have drifted to high frequencies in the Israeli founder populations. (3) To evaluate the accuracy in the Israeli population of a PRS generated based on the IAMDGC GWAS. We show that the vast majority of previously discovered risk variants are also associated with AMD in Israel, Accordingly, a PRS based on previously discovered variants has high predictive power. While our study was too small for discovering new risk variants at a genome-wide significance level, our study suggested a number of putative associations at an attenuated significance threshold.

Results

Replication of known AMD loci

A previous large-scale AMD GWAS by the IAMDGC (n = 33,976²) has discovered 34 associated loci. We examined the association of these loci with AMD status in our Israeli discovery set (403 AMD cases and 256 controls). Using the SNP with the lowest P-value in each locus, we found that most loci (31/34) were associated with AMD at a nominal significance level of P < 0.05 with a direction of effect consistent with that of the IAMDGC (Supplementary Tables 2 and 3). The number of loci associated at the Bonferroni correction threshold (0.05/34 = 0.0015) was 11/34 (Supplementary Table 2). The top ranked loci were CFH (P = 1.6 × 10⁻⁹) and nearby loci on chr1, and ARMS2/HTRA1 (P = 3.4 × 10⁻⁹, 5.1 × 10⁻⁹, respectively). The next significant locus was near SYN3 (P = 5.7 × 10⁻⁵). Association statistics for the known AMD risk loci for Ashkenazi Jews (AJ) (242 cases and 136 controls) and Arabs (36 cases and 30 controls) are reported in Supplementary Tables 4 and 5. We note that replication was to some extent expected, given that the majority of the Israeli cohort was included in the IAMDGC. However, the same 34 loci were associated with AMD at genome-wide significance even when all Israeli samples were excluded from the IAMDGC GWAS.

Discovery GWAS

We next ran a GWAS in our discovery set (AMD cases: n = 403, controls: n = 256). No novel variant was associated at the genome-wide significance threshold of 5 × 10⁻⁸. Setting a more liberal threshold of 5 × 10⁻⁵, and excluding variants in known risk loci, we identified four suggestive associations in the genes C7orf50, IGDCC4, FAM189A1, and CNTNAP4 (Table 1; Fig. S2). None of these SNPs were associated with AMD in the IAMDGC data (P ≥ 0.04, Table 1). The variant rs116928937 in IGDCC4 is exonic. Its allele frequency in European Americans was 1.23% (in the Exome Variant Server (NHLBI GO Exome Sequencing Project (ESP), Seattle, WA (URL: http://evs.gs.washington.edu/EVS/))), compared to 2.66% here. It is a missense variant (c.3188G>T), and according to Polyphen³⁰ it is "probably-damaging”.

Table 1 SNP association with AMD in Israel Discovery Cohort.

Full size table

We attempted to replicate the association of these four loci in a replication set of n = 155 AMD cases and n = 69 controls (Supplementary Table 1). We applied a Bonferroni corrected threshold of 0.05/4 = 0.0125. Only the SNP in FAM189A1 (rs1195500, chr15:29687047) replicated in the Israeli population (P < 0.0001 in Fisher’s exact test in both genotype and allele testing). The SNP rs12701455 (C7orf50) attained a P-value of 0.029 in the genotype-based test (Supplementary Table 1).

Evaluating a polygenic risk score for AMD

We developed polygenic risk scores (PRSs) for AMD in Israel based on the results of the IAMDGC GWAS with the Israeli samples excluded (“leave-one-out”) and using two methods. The first method is clumping and thresholding (C + T), in which the most strongly associated SNP is retained from each LD block, as long as its P-value is under a threshold. The second method is LDpred2, which accounts for the influence of LD on effect sizes and incorporates a non-zero prior probability for having null effects. We generated nine C + T PRSs, corresponding to different P-value thresholds (exponentially increasing between 5 × 10⁻⁸ and 1), and four LDpred2 PRSs, corresponding to different values of the proportion of SNPs with non-zero effects and a sparsity parameter. For each PRS, we used logistic regression to predict AMD status based on age, sex, the first two principal components (a proxy of ancestry), and the PRS. We also fit a logistic regression model with covariates only. We used fivefold cross-validation to evaluate the accuracy of the various models, which we quantified using AUC (the area under the receiver operator characteristic curve (ROC curve)).

We compare the ROC curves of the most accurate (highest AUC) C + T model, the most accurate LDpred2 model, and the covariates-only model in Fig. 1. For C + T, the AUC was highest (0.79; 95% confidence interval (CI) 0.75–0.82) for the most stringent P-value threshold (5 × 10⁻⁸), for a PRS that included 360 variants. Interestingly, the AUC showed a trend (not statistically significant) of decreasing with increasing P-value thresholds (Fig. S3). The best LDpred2 model (parameters P = 0.056 and sparsity on) had a slightly higher AUC (0.82; 95% CI 0.79–0.85) than the best C + T model. The covariates-only model had a significantly lower AUC (0.72; 95% CI 0.69–0.76). This was also confirmed by DeLong's test for two correlated ROC curves (P = 5.1 × 10⁻⁹). Overall, our results suggest that including the PRS in the prediction model improves accuracy.

We show the distribution of the best LDpred2 PRS in cases and controls in Fig. 2A. The PRS distribution is different between cases and controls; however, considerable overlap exists. In Fig. 2B, we plot the proportion of cases in each quintile of the LDpred2 PRS, demonstrating that the proportion of cases steadily increases with increasing quintiles. Finally, we used Spearman's correlation test to assess the correlation between age at diagnosis (measured here as age at blood draw) and the PRS among AMD cases. In Fig. S4, we show a modest, yet significant, negative correlation between the variables (ρ = − 0.18, P-value = 0.0003, using the best LDpred2 PRS), suggesting that the PRS may be associated not only with disease status but also with age of onset.

Discussion

In this work, we studied the genetic basis of AMD in the Israeli populations. We confirmed that most of the known risk loci for AMD, as previously identified in a large international study, are also associated with AMD in Israel. This suggests that the genetic architecture of AMD is similar between the Israeli and other populations. We then performed a genome-wide association study in our cohort in an attempt to identify novel risk variants. As expected due to the small size of our cohort, no novel variants were detected at the genome-wide significance threshold. Setting a more relaxed threshold of 5 × 10⁻⁵, we identified four suggestive variants.

One of the suggestive variants (rs1195500, in FAM189A1) replicated, after Bonferroni correction, in a small second set of cases and controls. FAM189A1 is expressed in the brain (the Human Protein Atlas³¹ (https://www.proteinatlas.org/ENSG00000104059-FAM189A1/tissue) and GTEx³²) and in normal human retina (the Retinal Transcriptome data³³), but its function is yet unknown. The FAM189A1 gene is uncharacterized, and as such the function of its proteins is unknown. FAM189A1 was recently discovered to have rare pathogenic variants in neurodevelopmental disorders, and is present in neuronal cells of the brain³⁴. Additional studies will be required to replicate the association of this variant as well as validate the involvement of FAM189A1 in AMD.

C7orf50 encodes for a newly discovered hormone, cholesin, that is secreted from the intestine in response to cholesterol absorption. This very recent discovery indicated that this gene, C7orf50 is in charge of cholesterol synthesis in the liver, and is responsible for a reduction in circulating cholesterol levels³⁵. This may be related to other cholesterol genes found in AMD², as well as AMD’s connection to dietary cholesterol.

The two other suggestive genes, IDGCC4 and CNTNAP4, have been characterized. IGDCC4 belongs to the immunoglobulin superfamily. This protein might have a role in regulating the immune system against "para-inflammatory" diseases like AMD³⁶, but no studies clearly showed its immune system role. It was present in a study discussing methylation patterns in lung adenocarcinoma in smokers, and was methylated in advanced tumors (stage II–IV) as compared to early, stage I tumors³⁷. Methylation patterns in neonates in this gene were associated with higher birth weight, as well as with higher maternal BMI and glucose levels, indicating that it also plays a role in metabolism, both at birth and at later life³⁸. CNTNAP4 is part of the contactin associated protein family (otherwise known as Caspr4), and is involved in both the development of myelinated axons and in stability of neuronal connections^39,40. There is a known relationship between CNTNAP4 and neurodegenerative diseases, and a CNV variant in this gene was found to be inversely associated with healthy aging. The CNV variant was found to be protective against cognitive impairment, Alzheimer’s' and Parkinson’s, giving it a connection with normal neurological function⁴¹. Further investigation would be needed in order to ascertain their significance in the pathogenesis of AMD.

While our sample size is relatively small, it is, with 659 individuals in the base study and another 224 for replication, the largest AMD GWAS in Israel to date. While this was insufficient to discover any new associations at a genome-wide significance, our study was sufficiently powered to replicate most of the previously discovered associated loci. It was also sufficiently powered to detect an association between AMD status and a polygenic risk score, even when adjusting for standard covariates.

The evaluation of the AMD PRS in our Israeli cohort suggested multiple conclusions. First, the LDpred2 PRS had relatively high accuracy (AUC = 0.82), significantly better compared to not including the PRSs (Fig. 1). Second, with the simple clumping and thresholding approach, accuracy increased as more stringent P-value thresholds were used (Fig. S3). This could indicate that AMD is not as polygenic as other diseases. Third, as expected^42,43, LDpred2 performed better than the C + T approach (Fig. 2A). Finally, high AMD PRS in our study associated not only with disease risk, but also with a lower age of onset (Fig. S4), as seen for other diseases such as colorectal cancer or cardiovascular disesase^44,45. Prospective studies will be required to further validate this finding.

Our results for high predictive power of the AMD PRS are in line with previous studies^2,46 in individuals of European ancestry. The transferability of the PRS to the Israeli population is perhaps expected given that the majority of our subjects had Ashkenazi Jewish ancestry, and given that PRSs for other diseases and traits were shown to have high accuracy in Ashkenazi Jews^47,48,49,50. The transferability of PRSs into Ashkenazi Jews may be due to the high percentage of European ancestry in this population⁷. It is also consistent with our replication of the known risk loci. However, a limitation of our study is that it does not allow a direct head-to-head comparison of the strength of the PRS association with AMD between our Israeli cohort and cohorts of individuals of European ancestry. Our sample size was also too small to evaluate the PRS accuracy in other sub-populations, which could be the goal of future studies. Further improvement of the PRS may be achieved via denser genotyping or larger and more diverse imputation reference panels. Additionally, multiple methods can leverage even small samples from a target non-European population to improve a PRS constructed using large GWASs in Europeans^51,52. However, such efforts will require additional samples for evaluation of the resulting PRSs.

Materials and methods

Our discovery set consisted of 403 AMD cases and 256 controls (659 total) recruited at Hadassah Medical Center, as previously reported⁵³. Our cases included both atrophic and neovascular (a more advanced) AMD. The subjects’ mean age was 75.4 years (SD: 2.76, range: 60–97) and 44.6% were female. The criteria for inclusion of AMD patients were: age > 60, AMD diagnosis according to AREDS (Age-Related Eye Disease Study)⁵⁴, and choroidal neovascularization (CNV) and/or geographic atrophy. Diagnosis was also determined according to fluorescein angiogram and optical coherence tomography. Participants were included in all stages of AMD. We excluded individuals with other retinal diseases and individuals with other potential CNV causes such as myopia, trauma, or uveitis. Controls were over the age of 60 with a normal fundus examination and similar systemic exclusion criteria. The study was approved by the institutional ethics committee of Hadassah-Hebrew University Medical Center. All subjects signed informed consent forms that adhered to the tenets of the declaration of Helsinki.

We genotyped all subjects on the custom chip that was developed for the IAMDGC. Genotyping on this chip was performed either via the IAMDGC (at the Center for Inherited Disease Research (Johns Hopkins, USA)) or at the genomics core facility of the Technion (Israel), as previously described². The custom chip, which was previously described, contains ≈ 250,000 tagging markers for imputation and ≈ 250,000 custom markers for AMD². Samples genotyped at either center were merged into a single set and underwent joint imputation, quality control, and further downstream analysis.

We imputed the genomes of our subjects with the following reference panels: the 1000 Genomes Project (1 KG, n = 2504)⁵⁵ and the Ashkenazi Genome Consortium (n = 128)⁶. This strategy was shown to have the highest accuracy for imputing Ashkenazi genomes⁵⁶ and was applied here, given that 60% of our subjects have Ashkenazi ancestry. Unfortunately, a reference panel for non-Ashkenazi Jews or for the non-Jewish populations of Israel does not yet exist, and thus, all samples were imputed with both panels simultaneously. We phased our genomes using SHAPEIT^57,58 and performed imputation using a standard protocol^58,59. We describe next the post-imputation quality control (QC) pipeline, as we previously developed^53,60.

The chip was imputed to 37,126,112 variants. We performed QC according to standard protocols to remove low-quality variants and samples⁶¹. We excluded variants with imputation quality score R² < 0.6, variants with minor allele frequency < 0.01 (removing 403,979 variants), and variants in Hardy–Weinberg disequilibrium (PLINK 1.9^62,63). The sex of patients was confirmed using the sex-check option in PLINK. We excluded individuals who were related, having PIHAT > 0.3 in PLINK. We performed principal components analysis (PCA) in PLINK and GCTA⁶⁴ to account for population stratification; the first two principal components were used as covariates in the association analysis (Fig. S1). The final variant count after filtering was 5,353,842 variants in 403 AMD patients and 256 controls.

We performed the discovery GWAS on case–control status using logistic regression in PLINK. To account for population stratification, we used the first two principal components as covariates. The other covariates were age at blood draw and sex. We generated Manhattan and Q–Q plots with qqman⁶⁵. For genome-wide significance, we used a P-value threshold of 5 × 10⁻⁸. To detect suggestive associations, we used a threshold of 5 × 10⁻⁵. We computed the frequency of risk alleles (either in Europeans or in Ashkenazi Jews) using gnomAD (http://gnomad.broadinstitute.org/) and, if exonic, also in the Exome Variant server (http://evs.gs.washington.edu/EVS). Variants that were outside gene boundaries were reported to nearest gene. Variants contained within a gene were reported with that gene.

To determine whether previously discovered associations replicate in our study, we considered variants within the 34 known loci that were identified in the IAMDGC 2016 GWAS² (Table 5 in the IAMDGC GWAS paper). For each locus (LD block) we retained the variant with the lowest P-value. We considered a nominal significance level of P = 0.05 or a Bonferroni corrected level of P = 0.05/34 = 0.0015.

To test for population-specific replication, we separately studied Ashkenazi Jews (AJ; 242 cases, 136 controls) and Arabs (36 cases and 30 controls). All subjects self-reported their ancestry, and the self-reported ancestry was validated against the genetic ancestry, as determined by their clustering in PCA (Fig. S1). We identified Arab subjects based on self-report (36 cases and 30 controls). We considered all variants in linkage disequilibrium (LD; r² > 0.05 in AJ, using hg19 linkage blocks as per the original Fritsche et al.² paper) to belong to the same locus.

We note that 549/649 of our subjects were part of the original IAMDGC GWAS² (out of a total of 33,976 individuals). Therefore, some degree of replication is expected just by virtue of this sample overlap. However, given that the Israeli samples were less than 2% of the total IAMDGC sample, the effect of the overlap is expected to be small. To validate that, we excluded the Israeli samples from the IAMDGC dataset and re-ran the GWAS analysis (remaining n = 33,515; the “base” study). This created a new, “leave-one-out” GWAS, which did not include any Israeli samples. Indeed, all 34 loci that associated with AMD at genome-wide significance in the original GWAS were also associated with AMD at genome-wide significance in the leave-one-out GWAS.

To replicate putative discoveries from the present study, we recruited additional 155 AMD cases and 69 controls (total 224) according to the same criteria as in the original discovery set. We used this case/control sample to validate the suggestively associated variants from the discovery set. Four variants passed the 5 × 10⁻⁵ genome-wide threshold in the discovery set, after excluding variants in known AMD risk loci. We genotyped these four variants in our entire replication set using the KASP assay (LGC Group, Middlesex, UK) with custom primers. All heterozygotes were confirmed using Sanger sequencing (Macrogen, Seoul, Korea). We tested the association using EPACTS (https://genome.sph.umich.edu/wiki/EPACTS) and R using two tests. For each SNP, an allelic test compared the proportion of minor alleles between cases and controls. A genotypic test compared the proportion of homozygotes to the minor allele out of all homozygotes between cases and controls.

To generate a polygenic risk score for AMD, we first performed quality control according to standard protocols. We removed variants with strand-ambiguous variants from the base study’s summary statistics. Duplicated variants were removed from both studies, separately. Variants with mismatching alleles were also removed. This has left 4,070,992 overlapping variants between the two studies (directly genotyped or imputed). We then extracted effect sizes from the leave-one-out IAMDGC GWAS, where the Israeli samples were excluded. We used these effect sizes to compute the PRS for individuals in the Israeli study (n = 659; the “target” study).

We generated polygenic risk scores using two approaches for variant selection: clumping and thresholding (C + T) and LDpred2. Briefly, in C + T, index variants are sequentially selected based on having the lowest P-value, and nearby variants in LD with the index variants are removed. Index variants with P-value under a threshold are retained^66,67. We used PLINK to implement clumping and computed LD (r²) using the target study. We set the clumping parameters to r² > 0.5 and ± 500 kb and used nine P-value thresholds: 5 × 10⁻⁸, and 10⁻⁷, 10⁻⁶, 10⁻⁵, 10⁻⁴, 0.001, 0.01, 0.1, and 1. The minimum P-value cutoff was set to match the IAMDGC genome-wide significance thresholds.

LDpred2 is a Bayesian method for deriving polygenic scores based on summary statistics while explicitly accounting for LD. Briefly, causal effect sizes are assumed to be a mixture of a normal distribution and a point mass at zero. Posterior mean effects are computed using Gibbs sampling based on the LD matrix and an estimate of the heritability^42,43. We set the SNP heritability (h²) to 0.47 based on a previous IAMDGC estimate² and used five proportions of causal variants (p) spaced on a log scale: 10⁻⁵, 1.8 × 10⁻⁴, 0.0032, 0.056, and 1. We also included a third parameter of sparsity (true/false). The analysis was restricted to HapMap3 variants, as recommended by LDpred2 developers. A list of 1,054,330 HapMap3 variants were downloaded from the online repository provided⁴³ and 715,011 overlapping variants entered the analysis. The LD matrix was computed using the target study. We used the R package bigsnpr to compute the LD matrix and generate the grid of scores⁴³. To avoid confounding by ancestry, in both methods we regressed the scores on the first two principal components and used the residuals as the scores in subsequent analyses.

We used PLINK to calculate PRSs for each of the 659 subjects. Overall, we obtained nine C + T PRSs (nine P-value cutoffs) and ten LDpred2 PRSs, of which four were reported as valid by LDpred2 (P = 0.056, sparse = FALSE; P = 0.056, sparse = TRUE; P = 1, sparse = FALSE; P = 1, sparse = TRUE). To evaluate the accuracy of each score, we used logistic regression of the disease status on the PRS and the following covariates: age, sex, PC1, and PC2. We also included a logistic regression model based on covariates only. We measured the accuracy of each model using the area under the curve (AUC) of the receiver operator characteristic curves (ROC curves), computed using fivefold cross validation. We used the R package pROC (https://cran.r-project.org/web/packages/pROC/pROC.pdf) to generate and analyze ROC curves, AUCs, and AUC confidence intervals (ci.auc ()), and the R package caret (https://cran.r-project.org/web/packages/caret/vignettes/caret.html) for cross-validation. Individuals with missing age data were excluded from the analysis (four cases and five controls).

We visually inspected the discriminatory power of the PRS using plots of the density of the PRS in cases and controls (using kernel density estimation), and the proportion of AMD cases across quintiles (fifths) of the PRS distribution. Both plots were generated with the R package ggplot2 (https://cran.r-project.org/web/packages/ggplot2/index.html). Finally, we computed Spearman's rank correlation coefficient to examine the association between the PRS and age at blood draw (as a proxy of the age at diagnosis) among AMD cases. Plots were generated with R package ggpubr (https://cran.r-project.org/web/packages/ggpubr/index.html).

Ethical approval

The study was approved by the institutional ethics committee. All subjects signed informed consent forms that adhered to the tenets of the declaration of Helsinki.

Data availability

The full IAMDGC dataset values can be accessed at: http://amdgenetics.org/ including the entire IAMDGC and the Jerusalem dataset specifically. In addition, the GWAS summary statistics and code utilized in this manuscript can be found by contacting the corresponding author via reasonable request. The Related Manuscript Variant supplementary file contains all nomenclature for HGVS for all variants.

References

DeAngelis, M. M. et al. Genetics of age-related macular degeneration (AMD). Hum. Mol. Genet. 26, R45–R50 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fritsche, L. G. et al. A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants. Nat. Genet. 48, 134–143 (2016).
Article CAS PubMed Google Scholar
Haines, J. L. et al. Complement factor H variant increases the risk of age-related macular degeneration. Science (80-). 308, 419–421 (2005).
Article ADS CAS Google Scholar
Klein, R. J. et al. Complement factor H polymorphism in age-related macular degeneration. Science (80-). 308, 385–389 (2005).
Article ADS CAS PubMed Central Google Scholar
Behar, D. M. et al. The genome-wide structure of the Jewish people. Nature 466, 238–242 (2010).
Article ADS CAS PubMed Google Scholar
Carmi, S. et al. Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins. Nat. Commun. 5, 4835 (2014).
Article ADS CAS PubMed Google Scholar
Waldman, S. et al. Genome-wide data from medieval German Jews show that the Ashkenazi founder event pre-dated the 14(th) century. Cell 185, 4703-4716.e16 (2022).
Article CAS PubMed PubMed Central Google Scholar
Agranat-Tamir, L. et al. The genomic history of the Bronze Age Southern Levant. Cell 181, 1146-1157.e11 (2020).
Article CAS PubMed PubMed Central Google Scholar
Granot-Hershkovitz, E. et al. A study of Kibbutzim in Israel reveals risk factors for cardiometabolic traits and subtle population structure. Eur. J. Hum. Genet. 26, 1848–1858 (2018).
Article PubMed PubMed Central Google Scholar
Zidan, J. et al. Genotyping of geographically diverse Druze trios reveals substructure and a recent bottleneck. Eur. J. Hum. Genet. 8, 1093–1099 (2014).
Google Scholar
Zeggini, E. Using genetically isolated populations to understand the genomic basis of disease. Genome Med. 6, 83 (2014).
Article PubMed PubMed Central Google Scholar
Zelinger, L. et al. A missense mutation in DHDDS, encoding dehydrodolichyl diphosphate synthase, is associated with autosomal-recessive retinitis pigmentosa in ashkenazi jews. Am. J. Hum. Genet. 88, 207–215 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zlotogora, J. & Chemke, J. Medical genetics in Israel. Eur. J. Hum. Genet. 3, 147–154 (1995).
Article CAS PubMed Google Scholar
Beryozkin, A. et al. Whole exome sequencing reveals mutations in known retinal disease genes in 33 out of 68 Israeli families with inherited retinopathies. Sci. Rep. 5, 131187 (2015).
Article Google Scholar
Chowers, I. et al. Association of complement factor H Y402H polymorphism with phenotype of neovascular age related macular degeneration in Israel. Mol. Vis. 14, 1829–1834 (2008).
CAS PubMed PubMed Central Google Scholar
Chowers, I. et al. Sequence variants in HTRA1 and LOC387715/ARMS2 and phenotype and response to photodynamic therapy in neovascular age-related macular degeneration in populations from Israel. Mol. Vis. 14, 2263–2271 (2008).
CAS PubMed PubMed Central Google Scholar
Asleh, S. A. A. et al. Lack of association between the C2 allele of transferrin and age-related macular degeneration in the Israeli population. Ophthalmic Genet. 30, 161–164 (2009).
Article PubMed Google Scholar
Babb de Villiers, C., Kroese, M. & Moorthie, S. Understanding polygenic models, their development and the potential application of polygenic scores in healthcare. J. Med. Genet. 57, 725–732 (2020).
Article PubMed Google Scholar
Wang, Y., Tsuo, K., Kanai, M., Neale, B. M. & Martin, A. R. Challenges and opportunities for developing more generalizable polygenic risk scores. Annu. Rev. Biomed. Data Sci. 5, 293–320 (2022).
Article PubMed PubMed Central Google Scholar
Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. https://doi.org/10.1038/s41576-018-0018-x (2018).
Article PubMed Google Scholar
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. https://doi.org/10.1038/s41588-018-0183-z (2018).
Article PubMed PubMed Central Google Scholar
Heesterbeek, T. J. et al. Genetic risk score has added value over initial clinical grading stage in predicting disease progression in age-related macular degeneration. Sci. Rep. 9, 6611 (2019).
Article ADS PubMed PubMed Central Google Scholar
Colijn, J. M. et al. Genetic risk, lifestyle, and age-related macular degeneration in Europe: The EYE-RISK Consortium. Ophthalmology 128, 1039–1049 (2021).
Article PubMed Google Scholar
Yan, Q. et al. Genome-wide association studies-based machine learning for prediction of age-related macular degeneration risk. Transl. Vis. Sci. Technol. 10, 29 (2021).
Article PubMed PubMed Central Google Scholar
de Breuk, A. et al. Genetic risk in families with age-related macular degeneration. Ophthalmol. Sci. 1, 100087 (2021).
Article PubMed PubMed Central Google Scholar
Wąsowska, A. et al. Polygenic risk score impact on susceptibility to age-related macular degeneration in Polish patients. J. Clin. Med. 12, 295 (2022).
Article PubMed PubMed Central Google Scholar
Yu, C. et al. Predictive performance of an updated polygenic risk score for age-related macular degeneration. Ophthalmology. https://doi.org/10.1016/j.ophtha.2024.01.033 (2024).
Article PubMed PubMed Central Google Scholar
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019).
Article CAS PubMed PubMed Central Google Scholar
Duncan, L. et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Adzhubei, I., Jordan, D. M. & Sunyaev, S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. https://doi.org/10.1002/0471142905.hg0720s76 (2013).
Article PubMed PubMed Central Google Scholar
Uhlen, M. et al. Tissue-based map of the human proteome. Science (80-). 347, 1260419–1260419 (2015).
Article Google Scholar
Lonsdale, J. et al. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article CAS Google Scholar
Farkas, M. H. et al. Transcriptome analyses of the human retina identify unprecedented transcript diversity and 3.5 Mb of novel transcribed sequence via significant alternative splicing and novel genes. BMC Genom. 14, 486 (2013).
Article Google Scholar
Safizadeh Shabestari, S. A. et al. Overlapping pathogenic de novo CNVs in neurodevelopmental disorders and congenital anomalies impacting constraint genes regulating early development. Hum. Genet. 142, 1201–1213 (2023).
Article CAS PubMed Google Scholar
Hu, X. et al. A gut-derived hormone regulates cholesterol metabolism. Cell 187, 1685-1700.e18 (2024).
Article CAS PubMed Google Scholar
Parmeggiani, F. et al. Mechanism of inflammation in age-related macular degeneration. Mediat. Inflamm. 2012, 546786 (2012).
Article Google Scholar
Tessema, M. et al. Genome-wide unmasking of epigenetically silenced genes in lung adenocarcinoma from smokers and never smokers. Carcinogenesis 35, 1248–1257 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lin, X. et al. Developmental pathways to adiposity begin before birth and are influenced by genotype, prenatal environment and epigenome. BMC Med. 15, 50 (2017).
Article PubMed PubMed Central Google Scholar
Zou, Y. et al. Structure and function of the contactin-associated protein family in myelinated axons and their relationship with nerve diseases. Neural Regen. Res. 12, 1551–1558 (2017).
Article CAS PubMed PubMed Central Google Scholar
Spiegel, I., Salomon, D., Erne, B., Schaeren-Wiemers, N. & Peles, E. Caspr3 and caspr4, two novel members of the caspr family are expressed in the nervous system and interact with PDZ domains. Mol. Cell. Neurosci. 20, 283–297 (2002).
Article CAS PubMed Google Scholar
Iakoubov, L. et al. A common copy number variation (CNV) polymorphism in the CNTNAP4 gene: Association with aging in females. PLoS One 8, e79790 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Vilhjálmsson, B. J. et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576–592 (2015).
Article PubMed PubMed Central Google Scholar
Privé, F., Arbel, J. & Vilhjálmsson, B. J. LDpred2: Better, faster, stronger. Bioinformatics 36, 5424–5431 (2020).
Article PubMed Central Google Scholar
Thomas, M. et al. Genome-wide modeling of polygenic risk score in colorectal cancer risk. Am. J. Hum. Genet. 107, 432–444 (2020).
Article CAS PubMed PubMed Central Google Scholar
Vaura, F. et al. Polygenic risk scores predict hypertension onset and cardiovascular risk. Hypertens. (Dallas, Tex. 1979). 77, 1119–1127 (2021).
Article CAS Google Scholar
Qassim, A. et al. Risk stratification and clinical utility of polygenic risk scores in ophthalmology. Transl. Vis. Sci. Technol. 10, 14 (2021).
Article PubMed PubMed Central Google Scholar
Gettler, K. et al. Common and rare variant prediction and penetrance of IBD in a large, multi-ethnic, Health System-based Biobank Cohort. Gastroenterology 160, 1546–1557 (2021).
Article CAS PubMed Google Scholar
Belbin, G. M. et al. Toward a fine-scale population health monitoring system. Cell 184, 2068-2083.e11 (2021).
Article CAS PubMed Google Scholar
Fahed, A. C. et al. Transethnic transferability of a genome-wide polygenic score for coronary artery disease. Circulat. Genom. Precis. Med. 14, e003092 (2021).
Article Google Scholar
Privé, F. et al. Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort. Am. J. Hum. Genet. 109, 12–23 (2022).
Article PubMed PubMed Central Google Scholar
Cai, M. et al. A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits. Am. J. Hum. Genet. 108, 632–655 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kachuri, L. et al. Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. https://doi.org/10.1038/s41576-023-00637-2 (2023).
Article PubMed PubMed Central Google Scholar
Lorés-Motta, L. et al. Association of genetic variants with response to anti-vascular endothelial growth factor therapy in age-related macular degeneration. JAMA Ophthalmol. https://doi.org/10.1001/jamaophthalmol.2018.2019 (2018).
Article PubMed PubMed Central Google Scholar
Age-Related Eye Disease Study Research, G. The Age-Related Eye Disease Study (AREDS): Design implications. AREDS report no. 1. Control Clin. Trials. 20, 573–600 (1999).
Article Google Scholar
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Lencz, T. et al. High-depth whole genome sequencing of an Ashkenazi Jewish reference panel: Enhancing sensitivity, accuracy, and imputation. Hum. Genet. 137, 343–355 (2018).
Article CAS PubMed PubMed Central Google Scholar
Delaneau, O., Marchini, J. & Zagury, J.-F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2012).
Article CAS Google Scholar
Delaneau, O., Zagury, J.-F. & Marchini, J. Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods 10, 5–6 (2013).
Article CAS PubMed Google Scholar
van Leeuwen, E. M. et al. Population-specific genotype imputations using minimac or IMPUTE2. Nat. Protoc. 10, 1285–1296 (2015).
Article PubMed Google Scholar
Grunin, M. et al. Association of a variant in VWA3A with response to anti-vascular endothelial growth factor treatment in neovascular AMD. Investig. Ophthalmol. Vis. Sci. https://doi.org/10.1167/iovs.61.2.48 (2020).
Article Google Scholar
Anderson, C. A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: Rising to the challenge of larger and richer datasets. 1–22 (2014).
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Turner, S. D. qqman: An R package for visualizing GWAS results using Q-Q and manhattan plots. J. Open Source Softw. 3, 731 (2018).
Article ADS Google Scholar
Privé, F., Vilhjálmsson, B. J., Aschard, H. & Blum, M. G. B. Making the most of clumping and thresholding for polygenic scores. Am. J. Hum. Genet. 105, 1213–1221 (2019).
Article PubMed PubMed Central Google Scholar
Choi, S. W., Mak, T.S.-H. & O’Reilly, P. F. Tutorial: A guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020).
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

This work was supported by a Grants from the Israel Science Foundation: 3485/19 to I.C. and S.C. The contribution of the International AMD Genomics Consortium (IAMDGC) was supported by a Grant from NIH (R01 EY022310) and RES516564 to IMH. Genotyping was supported by a contract (HHSN2682012000081) to the Center for Inherited Disease Research. MG is supported by a grant from the Bright Focus Foundation (M2021006F). We would also like to acknowledge the Israeli Ministry of Science (3-17354 Grant Number).

Author information

These authors contributed equally: Shai Carmi and Itay Chowers.

Authors and Affiliations

Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, POB 12271, 9112102, Jerusalem, Israel
Michelle Grunin, Daria Triffon & Shai Carmi
Department of Ophthalmology, Hadassah-Hebrew University Medical Center, POB 12000, 91120, Jerusalem, Israel
Michelle Grunin, Gala Beykin, Liran Tiosano, Samer Khateb, Shira Hagbi-Levi, Batya Rinsky, Refael Munitz & Itay Chowers
Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
Elior Rahmani
Molecular Microbiology and Biotechnology, Tel Aviv University, Tel Aviv, Israel
Regev Schweiger & Eran Halperin
Department of Anesthesiology, University of California, Los Angeles, Los Angeles, CA, USA
Eran Halperin
Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
Eran Halperin
Department of Genetic Epidemiology, University of Regensburg, Regensburg, Germany
Thomas W. Winkler & Iris M. Heid
Department of Genetics, University of Cambridge, CB21TN, Cambridge, UK
Regev Schweiger

Authors

Michelle Grunin
View author publications
Search author on:PubMed Google Scholar
Daria Triffon
View author publications
Search author on:PubMed Google Scholar
Gala Beykin
View author publications
Search author on:PubMed Google Scholar
Elior Rahmani
View author publications
Search author on:PubMed Google Scholar
Regev Schweiger
View author publications
Search author on:PubMed Google Scholar
Liran Tiosano
View author publications
Search author on:PubMed Google Scholar
Samer Khateb
View author publications
Search author on:PubMed Google Scholar
Shira Hagbi-Levi
View author publications
Search author on:PubMed Google Scholar
Batya Rinsky
View author publications
Search author on:PubMed Google Scholar
Refael Munitz
View author publications
Search author on:PubMed Google Scholar
Thomas W. Winkler
View author publications
Search author on:PubMed Google Scholar
Iris M. Heid
View author publications
Search author on:PubMed Google Scholar
Eran Halperin
View author publications
Search author on:PubMed Google Scholar
Shai Carmi
View author publications
Search author on:PubMed Google Scholar
Itay Chowers
View author publications
Search author on:PubMed Google Scholar

Contributions

MG, DT, SC, IC, ER conceived and designed the work, MG, DT, ER, RS, RM designed and performed the experiments and data analysis, MG, DT, ER, RS, GB, LT, SK, SH-L, BR, RM, IC and SC provided data availability and data extraction and analysis, MG, DT, SC, IC wrote the manuscript, MG, DT, SC, IC, ER, RS, EH provided manuscript feedback, revision, and drafts.

Corresponding authors

Correspondence to Shai Carmi or Itay Chowers.

Ethics declarations

Competing interests

S.C. is a paid consultant to MyHeritage. All other authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information. (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Grunin, M., Triffon, D., Beykin, G. et al. Genome wide association study and genomic risk prediction of age related macular degeneration in Israel. Sci Rep 14, 13034 (2024). https://doi.org/10.1038/s41598-024-63065-0

Download citation

Received: 22 December 2023
Accepted: 24 May 2024
Published: 06 June 2024
Version of record: 06 June 2024
DOI: https://doi.org/10.1038/s41598-024-63065-0