Introduction

Polycystic ovary syndrome (PCOS) is the most common reproductive, endocrine and metabolic disorder in women of reproductive age1. Globally, the prevalence of PCOS ranges from 4 to 21%, as defined using NIH 1990 and Rotterdam 2003 criteria2. Epidemiological data from Pakistan are scarce, but the prevalence of PCOS has been reported to be particularly high among women of South Asian ethnic origin3,4.

PCOS is thought to arise as a result of the interplay between genetic and environmental factors5. A growing body of literature indicates that vitamin D deficiency may be a risk factor: meta-analyses of observational studies reveal independent associations between PCOS and low serum concentrations of 25-hydroxyvitamin D (25[OH]D, the major circulating vitamin D metabolite) as well as single nucleotide polymorphisms [SNPs] in the gene encoding the vitamin D receptor (VDR)6,7,8,9,10,11. However, data relating to vitamin D status among women with PCOS in Pakistan are lacking. Moreover, studies in the field have not yet tested for main effects of SNPs in the genes encoding the 1-alpha hydroxylase enzyme CYP27B1 (which converts 25[OH]D to its active metabolite 1,25-dihydroxy vitamin D) or the vitamin D 24-hydroxylase enzyme CYP24A1 (which converts 25[OH]D to its major inactive catabolite 24R,25-dihydroxy vitamin D). Neither have they tested for interactions between SNPs in the vitamin D pathway and vitamin D status in influencing the risk of PCOS.

We have previously demonstrated that genetic variation in the vitamin D pathway can modify the influence of vitamin D deficiency and supplementation on clinical phenotype in the context of tuberculosis12,13,14. We, therefore, conducted a case control study in Lahore, Pakistan, to determine whether vitamin D deficiency and SNPs in four vitamin D pathway genes might influence the risk of PCOS either as main effects or in interaction with each other.

Results

Participant characteristics

A total of 235 PCOS cases and 235 healthy controls were recruited to the study between April 2016 and April 2018. Characteristics of cases and controls are presented in Table 1. The mean age of cases vs. controls was 25.2 vs 29.7 years, respectively, and the majority (96.8%) of participants were of Punjabi ethnic origin. Married and single women were equally represented in both groups. Students were over-represented, and employed women were under-represented, in cases vs controls, but the proportion of participants at different educational levels did not differ between groups. As anticipated, acne, hair loss, hirsutism, menstrual cycle abnormalities, and pelvic ultrasound scan abnormalities were all more common among cases vs controls. The mean body mass index was higher among cases vs controls, but the mean waist-to-hip ratio was lower. Mean serum concentrations of fasting glucose and luteinizing hormone were higher in cases vs controls, but no difference was seen for mean serum concentration of the follicle-stimulating hormone. As anticipated LH/FSH ratio was higher in cases vs controls. Mean serum 25(OH)D concentration was lower in cases vs controls (17.4 vs 21.7 ng/ml, respectively; P < 0.001).

Table 1 Characteristics of cases and controls.

Association between phenotypic features and risk of PCOS

Table 2 presents the results of univariable and multivariable analyses testing for associations between participant characteristics captured by the study questionnaire or found at physical examination and susceptibility to PCOS. PCOS was independently associated with lower age (adjusted odds ratio [aOR] per additional year of age, 0.86; 95% CI 0.81 to 0.93), higher body mass index (aOR per additional kg/m2, 1.26; 95% CI 1.15 to 1.37), lower waist-hip ratio (aOR per additional unit, 0.88; 95% CI 0.82 to 0.95), vitamin D deficiency (serum 25(OH)D <25 ng/ml, aOR 24.81; 95% CI 6.16 to 99.84), lack of outdoor exercise (aOR 13.47; 95% CI 6.26 to 28.97), increased fasting glucose (aOR per 1 mg/dL increase, 1.12; 95% CI 1.09 to 1.17) and a family history of PCOS in at least one first degree relative (aOR 2.75; 95% CI 1.12 to 6.78). The following features were not found to associate with susceptibility to PCOS: monthly household income, or self-reported consumption of sugary/fast foods, chicken, eggs, milk or red meat.

Table 2 Determinants of PCOS risk from study questionnaire and physical examination.

Association between vitamin D pathway genotype and risk of polycystic ovarian syndrome

Table 3 presents results of univariable and multivariable analyses testing for associations between SNPs in the genes encoding the vitamin D receptor (VDR), the vitamin D 1-alpha hydroxylase enzyme (CYP27B1), the vitamin D 24-hydroxylase enzyme (CYP24A1) and the vitamin D binding protein (DBP) and risk of PCOS. No statistically significant association was observed between the genotype of any polymorphism investigated and risk of PCOS.

Table 3 Genotypic determinants of PCOS risk.

Association between vitamin D deficiency and host genotype, stratified by vitamin D status

We next proceeded to investigate whether genetic variation in VDR, CYP24A1, CYP27B1 and DBP modified the association between vitamin D deficiency and risk of PCOS reported above. P values for interaction for these sub-group analyses are presented in Table 3: these revealed no evidence to support the hypothesis that polymorphisms in the vitamin D pathway modify the effect of vitamin D deficiency on risk of PCOS.

Haplotype analysis

We also performed haplotype analysis to assess the cumulative impact of all SNPs on risk of PCOS. The Clark method15 was used to define haplotypes for SNPs showing significant linkage disequilibrium (R2 > 0.80) in the relevant 1000 genomes reference set ([PJL], Punjabi from Lahore, Pakistan). Two SNPs in VDR showed linkage disequilibrium with R2 > 0.80: rs11568820 and rs7976091. Univariate regression analysis for haplotypes TT, TC and CC showed no significant association with the odds of PCOS. Further analysis was performed by adjusting these haplotypes for the phenotypic risk factors for PCOS identified above (age, BMI, WHR, vitamin D status, outdoor physical exercise and fasting glucose). No statistically significant associations with risk of PCOS were identified (Table 4).

Table 4 Haplotype analysis.

Discussion

We report findings from one of the largest and most detailed studies conducted to date to investigate the influence of vitamin D deficiency and genetic variation in the vitamin D pathway on susceptibility to PCOS – and the first such study to be conducted in Pakistan. Uniquely, in addition to investigating SNPs in VDR and DBP, we also explored the influence of variants in CYP24A1 (the gene encoding the major enzymes of 25(OH)D catabolism) and CYP27B1 (the gene encoding the enzyme responsible for converting 25[OH]D to its active metabolite 1,25[OH]2D) on risk of disease. Our major findings were that lower vitamin D status was independently associated with susceptibility to PCOS, but that genetic variation in VDR, DBP, CYP24A1, and CYP27B1 was not. Moreover, we found no evidence of any interaction between serum 25(OH)D levels and genetic variation in the vitamin D pathway on disease risk.

Our finding that vitamin D deficiency associates independently with PCOS risk in women in Pakistan is consistent with reports from other settings including the Netherlands10, Iran16, Egypt17 and the USA18, as well as with findings from a meta-analysis including these and other studies19, showing that mean 25(OH)D level was lower in women with PCOS than controls. The high prevalence of vitamin D deficiency seen among participants in the current study is in keeping with our previous study in Lahore showing a high prevalence of vitamin D deficiency among women of reproductive age20. These and other studies reporting high rates of vitamin D deficiency in other groups21, highlight that vitamin D deficiency is a major public health problem in Pakistan22. The adjusted odds ratio for the association between vitamin D deficiency at the pre-specified 25(OH)D cut-off of 10 ng/ml was higher than has been reported elsewhere; this primarily reflects the very low prevalence of 25(OH)D values <10 ng/ml among controls in the current study. Exploratory analyses investigating 25(OH)D cut-offs of 15 ng/ml (aOR 3.41, 95% CI 1.77 to 6.57), 20 ng/ml (aOR 1.79, 95% CI 0.96 to 3.37), or analyzing 25(OH)D as a continuous variable (aOR 0.98, 95% CI 0.96 to 1.01) showed consistent inverse associations between 25(OH)D level and PCOS risk. Our findings with regard to other factors associated with PCOS, such as higher BMI, reduced physical exercise, higher fasting glucose levels and positive family history are also in keeping with the literature22,23,24,25.

In contrast to these positive findings, the lack of association seen between polymorphisms in VDR in our study contrasts with findings of a recent meta-analysis reporting that the VDR ApaI (rs7975232) polymorphism associated with susceptibility to PCOS in Asian populations (aOR for allelic model, C vs A, 1.19; 95% CI 1.07 to 1.34)6. The lack of association seen for SNPs in DBP is consistent with findings of both the other studies that have investigated polymorphisms in this gene26,27.

Our study has several strengths. Cases comprised a broad range of PCOS phenotypes, including obese and lean, fertile and infertile, and those with and without hyper-androgenic characteristics. We collected detailed information on potential confounders of the association between vitamin D status and PCOS risk, minimizing the potential for residual/unmeasured confounding to explain our findings. We also investigated SNPs in a wider range of vitamin D pathway genes than previously investigated, and applied a stringent correction for multiple testing. By deseasonalizing 25(OH)D data, we were able to calculate individuals average 25(OH)D level throughout the year, which represents their vitamin D status more effectively than a season-specific ‘snapshot’ provided by a single unadjusted reading28.

Our study also has some limitations. Due to the case-control design, reverse causality and/or confounding cannot be excluded as explanations for the associations observed. Although large by comparison with others in the field, our power to detect modest genetic effects, and gene-environment interactions, was limited. 25(OH)D was measured by ELISA rather than with Liquid Chromatography Tandem Mass Spectrometry, which is the gold standard methodology; however, this should not have introduced bias, since the same assay was used to measure vitamin D status in cases and controls. Moreover, the assay we used detects both 25(OH)D2 and 25(OH)D3.

Conclusion

In conclusion, this large case-control study—the first of its kind to be conducted in Pakistan—reports that low vitamin D status associates independently with increased susceptibility to PCOS. However, no statistically significant association between polymorphisms in vitamin D pathway genes and risk of PCOS was demonstrated.

Methods

Study design

We conducted a case control study. Cases were patients aged 14 to 49 years diagnosed with PCOS and recruited from out-patient clinics at the Jinnah Hospital and the Lady Willingdon Hospital, Lahore. A total of 235 women were selected according to Rotterdam criteria, 200329. Patients with thyroid and adrenal diseases and androgen-secreting tumours were excluded. 235 healthy controls were selected from the Citi lab and Research centre, Lahore, on the basis of having no history of infertility, no evidence of clinical hyperandrogenism and normal menstrual cycles. Informed consent was taken from all participants who fulfilled eligibility criteria. Written informed consent was taken from the parents or guardians of the participants who were under 18 years of age. Participants completed a detailed questionnaire including details of age, weight, height, sociodemographic status, dietary habits, physical exercise, sun exposure, androgenic features (acne, hirsutism and patchy hair loss), menstrual cycle history, family history of PCOS and fertility details for married participants. The study was approved by the ethical committee of the University of Punjab (Ref No: Bioethic 125Pu, 13.10.17) and by the ethical review board of Citi lab and Research Centre, Lahore (Ref # 26-17/ERB/CLRC/27th/Dated 28-07-2016). All the methods were performed following the relevant guidelines and regulations approved by the ethical committee.

5 mL of blood was drawn on the 2nd or 3rd day of menstrual cycle30, from a median cubital vein; 2 mL were transferred into vials containing EDTA and frozen at −20 °C for subsequent DNA extraction, and 3 mL was added to serum vials and sent to the laboratory within two hours of collection, where serum was isolated from clotted blood by centrifugation and stored at −20 °C for subsequent determination of 25(OH)D concentration.

Serum 25(OH)D assay

Serum 25(OH) D concentration was determined by ELISA (Calbiotech, EI Cajon U.S.A). Calibrators and controls for the assay were run in duplicates. Interassay CV for serum 25(OH)D for our samples was 14%. Season-adjusted (deseasonalized) values for 25(OH)D were calculated for each participant from their individual standardized 25(OH)D concentration and date of blood sample collection, using a sinusoidal model with values derived from standardized values for all participants as previously described28,31. Vitamin D deficiency was defined using a pre-specified 25(OH)D cut-off of 10 ng/ml; this cut-off was selected a priori on the basis that it is widely used by Public Health bodies32 and that deficiency at this level has been shown to associate most strongly with PCOS11 and other pathologies attributable to vitamin D deficiency33,34,35.

Genotyping

Genomic DNA was extracted from whole blood and quantified using a nanodrop spectrophotometer as previously described36. DNA TaqMan allelic probe assays (Applied Biosystems, Foster City, CA, USA) were used to genotype polymorphisms in genes encoding: the vitamin D receptor (rs4334089, rs10783219, rs4516035, rs11568820 [cdx2], rs7976091, rs731236 [TaqI], rs2228570 [Fok]I, rs1544410 [BsmI], rs7975232 [ApaI], rs7970314, rs2853559, rs2238136); the 1α-hydroxylase enzyme, CYP27B1 (rs4646536, rs4646537); the 24-hydroxylase enzyme, CYP2A41 (rs6013897, rs2762939, rs2248137, rs2762934), and the vitamin D binding protein, DBP (rs7041, rs4588, rs12512631, rs2070741, rs2298849, rs16846876). Taqman® SNP genotyping assays were purchased directly from Life Technologies. Assay IDs are detailed in Supplementary Table 1. One assay was a custom design and primer details are shown in Supplementary Table 2. For SNP Genotyping using the Fluidigm 192.24 Dynamic Array, genomic DNA concentration was normalised to 50 ng/µL and a sample pre-mix was prepared with 2 µL of GTXpress™ Master Mix (2×), 0.2 µL of 20X Fast GT Sample Loading Reagent, 0.2 µL of nuclease-free water and 1.6 µL of genomic DNA. A 10X SNP assay mix was prepared by combining 1.0 µL of Taqman® SNP Genotyping assay with 2.0 µL of 2X Assay Loading Reagent, 0.2uL of ROX™ (50×) and 1.3 µL of nuclease-free water. 3 µL of sample pre-mix and 3 µL of assay mix into each assay inlet of the 192.24 arrays. Pressure fluid was then pipetted into the appropriate wells. The array was then loaded to the Juno™ controller with the RX insert and the Fast PCR 192.24 script run to performs thermal cycling. Thermal conditions were as follows: Hot start 95 °C for 5 minutes, Touchdown (64 °C-61 °C dropping 1 °C per cycle) 4 cycles at 95 °C for 15 seconds, 64 °C 45 for seconds, 72 °C for15 seconds, 34 cycles were run at 95 °C for 15 seconds (denaturing) followed by 60 °C for 45 seconds (annealing) and 72 °C for 15 seconds (extension). Endpoint fluorescent data were collected using the Biomark® Real-Time PCR System, and data were subsequently analysed using the Fluidigm SNP Genotyping Analysis software.

Statistical analysis

Statistical analyses were done with Stata IC (version 15.1). Frequencies of alleles and genotypes were compared using chi-square tests; all were found to be in Hardy-Weinberg equilibrium. Baseline characteristics of cases vs controls were compared using unpaired Student’s t-tests and chi-square tests for continuous and categorical variables, respectively. Chi-square tests were used to test for associations between independent variables and risk of PCOS in univariate analysis. Binary logistic regression was used for multivariate analysis of phenotypic determinants of PCOS risk, with adjustment for factors found to associate with PCOS with P < 0.05 in the univariate analysis of phenotypic determinants. Binary logistic regression was also used to test for association between genotype and risk of PCOS, using an additive model and adjusting for phenotypic factors found to associate independently with PCOS risk (age, body mass index, waist-to-hip ratio, deseasonalized 25(OH)D <10 vs. ≥10 ng/ml, outdoor exercise and fasting glucose). Sub-group analyses were performed to determine whether genetic variation in the vitamin D pathway modified effects of vitamin D status on susceptibility to PCOS by repeating primary efficacy analysis with the inclusion of a term for the interaction between vitamin D status and genotype, using an additive model. Haplotype analysis was performed using the Clark method. Odds ratios are presented with 95% confidence intervals and P values. The Benjamini-Hochberg procedure for multiple testing correction37 was applied to genetic analyses to control the false discovery rate (FDR) at 5%.

Power and sample size

Assuming the risk of vitamin D deficiency (serum 25[OH]D <10 ng/ml) in the control arm to be 56%38, we calculated that 235 cases and 235 controls would need to be recruited in order to detect an odds ratio for the association between vitamin D deficiency and risk of PCOS of ≥1.86 with 80% power and an alpha of 5%.