Introduction

Pregnancy represents a crucial phase in human reproduction, impacting both maternal and offspring health. Particularly, normal maternal thyroid function and lack of autoimmunity during pregnancy, commonly assessed by measuring circulating thyroid-stimulating hormone (TSH), free thyroxine (FT4) levels, and presence and quantification of thyroid peroxidase antibodies (TPOAb), holds significant relevance for pregnancy outcomes and long-term health1,2. Thyroid dysfunctions during pregnancy are determined by these measurements and demonstrate varying prevalence, ranging from 0.5% for overt hypothyroidism (defined as elevated TSH levels and low free T4 concentrations) to 3.47% for subclinical hypothyroidism (SHO) (defined as elevated TSH levels with normal FT4 concentrations)3, and 18% for thyroid autoimmunity2. Observationally epidemiologic studies have consistently linked abnormal thyroid function tests during pregnancy to adverse maternal and infant outcomes, including preterm birth4, low birth weight5, miscarriage, and pre-eclampsia6. However, the underlying biological mechanisms controlling the population variability of TSH, FT4, TPOAb, and thyroid dysfunctions during pregnancy, and their potential causal relationships with pregnancy and long-term health outcomes remain unclear.

Previous research indicates that a significant portion of the inter-individual variability in thyroid hormone levels among general populations can be attributed to genetics. Twin and family studies estimate heritable contributions to ~64–70.8% for serum TSH, 39–80% for FT47,8,9, and 48.8% for TPOAb9. Large-scale meta-genome-wide association studies of thyroid hormones among general European populations have identified lead SNPs explaining 22.8% of TSH variation and 4% of FT4 variation, respectively10,11. These studies suggest that genetics may also influence thyroid function measurements during gestation, an aspect that has been under-studied. Notably, unlike thyroid functions outside the pregnancy period, changes in the hormonal environment during pregnancy lead to more complex alterations in a woman’s thyroid function. Maternal demand for thyroid rises during pregnancy due to fetal use of thyroid hormones and an increase in thyroxine-binding globulin (TBG), among other reasons12. In particular, human chorionic gonadotrophin (hCG), which is structurally similar to TSH, significantly increases in early pregnancy, leading to higher thyroid hormone production, eventually resulting in decreased TSH levels under negative feedback regulation eventually13. Therefore, genome-wide association study (GWAS) studies based on pregnant women, which would greatly enhance our understanding of the genetic basis of thyroid-related traits throughout life, may not be extrapolated from studies on general adults. However, very few GWAS on thyroid-related traits have included pregnant women. To date, only one candidate gene study that involved 974 healthy pregnant women, reported an association between the number of TSH-raising alleles and SHO in pregnancy14. In addition, the association between subclinical dysfunction and pregnancy outcomes, as well as the efficacy of treating mild abnormalities, remains uncertain and controversial owing to the lack of robust causal evidence15,16.

To elucidate the genetic factors influencing thyroid function and disorders during pregnancy, and their potential causal impact on pregnancy complications, birth outcomes, and later-age physical conditions, we conducted a genome-wide association and Mendelian randomization (MR) study among 85,421 Chinese pregnant participants with non-invasive prenatal test (NIPT) sequencing data, comprehensive pregnancy screening and medical records from two hospitals in China. We focused on the most commonly used thyroid biomarkers in pregnancy assessments, including TSH and free thyroxine (FT4), and thyroid autoimmunity measurements such as TPOAb levels and positivity. Additionally, we examined four clinical or subclinical thyroid statuses, including SHO, isolated hypothyroxinemia (ISH), overt hyperthyroidism (OHP), and subclinical hyperthyroidism (SHP).

Results

Study design and demographic characteristics

Our study design and analysis scheme is depicted in Fig. 1. Briefly, we enrolled pregnant women during routine pregnancy screening at two maternal and child healthcare hospitals—abbreviated as Longgang and Baoan hospitals, located in Shenzhen, a metropolitan city in southern China. We imputed genotypes and examined population structure from NIPT sequencing data following a standardized protocol developed by our laboratory17,18. We conducted GWAS analyses for 2 thyroid-related hormones (TSH and FT4) within reference ranges, 4 thyroid dysfunctions (SHO, ISH, SHP, and OHP), and 2 thyroid autoimmunity traits (TPOAb levels and TPOAb positivity) for each of the two hospitals, followed by a fixed-effect meta-analysis across 12.7 million SNPs (Methods). Subsequently, phenotypic association and MR analyses were used to investigate the relationships between these eight traits, gestational complications such as gestational diabetes mellitus (GDM), birth outcomes, and later-life physical conditions using summary statistics from GWAS. Since TSH and FT4 have been investigated in prior GWAS studies in the general population, we further conducted conditional analyses to identify potential novel association signals within the established loci for these two traits. We evaluated the GWAS results by examining the consistency of effect size estimates between the two independent hospitals, the direction of effect sizes with an external GWAS with a smaller sample size (NIPT PLUS cohort, \(n={\mathrm{4,688}}\)), and by comparing effect size estimates with publicly available GWAS data for the same phenotypes among general populations, if available.

Fig. 1: Schematic design of the study.
figure 1

Created with BioRender.com. TSH thyroid-stimulating hormone, FT4 free thyroxine, TPOAb thyroid peroxidase antibody. BBJ Biobank Japan study. MR Mendelian randomization. Figure 1 Created with BioRender.com released under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.

Following the screening process, our analysis included a total of over 85,000 subjects, amalgamating data from the two hospitals (Methods for inclusion and exclusion criteria). Among them, more than 67,000 participants had available TSH and FT4 levels data within the reference range during pregnancy. The sample sizes and demographic characteristics of the research subjects included in the GWAS analysis for each trait, such as TSH levels, FT4 levels, TPOAb levels, age, BMI, and gestational weeks at the time of thyroid function measurement, are detailed in Supplementary Data 1. The majority of subjects were in the age range of 29–30 years. As expected, maternal serum TSH concentrations decreased from 8 to 12 weeks and then increased in both cohorts, while the opposite changing pattern was observed for FT4 (Supplementary Fig. 1), corresponding to the increase of hCG during pregnancy19.

Genetic associations with normal thyroid function during pregnancy

We identified 38 genome-wide significant loci associated with serum TSH levels and 22 loci associated with serum FT4 levels (\(P\le 5\times {10}^{-8}\)) (Fig. 2 and Supplementary Data 2). The genomic inflation factor for the 12,699,587 SNPs was 1.068 for TSH and 1.069 for FT4, suggesting negligible inflation of test statistics in the genome-wide association analysis (Supplementary Fig. 2). Among the 60 loci, 4 TSH loci and 13 FT4 loci were identified as novel, with the lead variant situated more than 500 kb away and demonstrated LD R2 < 0.1 from any previously established loci (Supplementary Data 3; Supplementary Fig. 3).

Fig. 2: The Manhattan plots for GWAS meta-analyses of two thyroid-related hormones.
figure 2

A TSH levels, B FT4 levels. After performing GWAS analysis of Longgang Study and Baoan Study using plink2.0, meta-analysis was subsequently completed using METAL software. The P value threshold for determining whether SNP signals were significant was adopted from the Bonferroni-corrected threshold commonly used for GWAS analysis(P < 5 × 10−8). The x-coordinate of the SNPs in the plot represents their position on each chromosome when the y-coordinate implies their P value (−log10 scale) of the association test. The black dotted horizontal line indicates the significant threshold for the genome-wide association test (i.e., 5 × 10−8). Genomic loci containing phenotype-associated variants previously reported in the GWAS catalog are colored in black, while novel loci are colored in red.

Among all loci, 59/60(98.3%) loci demonstrated consistent directionality and had a nominally significant P value below 0.05 between the two hospitals, suggesting little heterogeneity and high fidelity of our genetic discoveries (Supplementary Data 2, Supplementary Fig. 4A, Supplementary Fig. 4C). The remaining SERPINA7 locus was a known locus associated with FT4 in a previous study and the difference in effect size estimates may be attributed to variations in fetal sex distribution between the two hospitals. In comparison with the GWAS statistics with the NIPT PLUS cohort, 86.7% (52/60) of lead SNPs showed consistent effect directions (Supplementary Fig. 5, Supplementary Data 2).

Given that available genetic studies on thyroid function among East Asians are limited to small sample sizes in the general population (e.g., 3618 Koreans20, 4581 Chinese21, and 437 Chinese22), with only a few suggestive significant sites identified for TSH, and the lack of publicly available GWAS summary statistics, we compared our effect size estimates with the two largest-scale GWAS studies of TSH and FT4 in European populations (NTSH = 249,715; NFT4 = 49,269)10,23. Among the loci, 49/60 (81.6%) loci exhibited P values below 0.05 with consistent directional effects compared with the European dataset (Supplementary Fig. 4B, Supplementary Fig. 4D, Supplementary Data 2). While the consistent effect size estimates suggest shared genetic determinants between the two populations (European and East Asian) and between pregnancy and non-pregnancy status, differences in the comparison could be attributed to different LD structures between Europeans and East Asians, as indicated by differing allele frequencies for the lead SNP (Supplementary Fig. 6) or potential effect modifications driven by pregnancy status. Further investigation is required to clarify these hypotheses.

Novel loci and association signals for TSH and FT4

The 4 newly identified loci associated with TSH levels comprise HNRNPA3, FGF10, IGF1, and MAGED2 (Fig. 2A, Supplementary Data 2, Supplementary Fig. 3). The 13 novel loci associated with FT4 levels include SNX7, RRP15, TRH, PDCL2, CAST, CGA, FIG4, UNCX, TRHR, ZNF462, GRK5, METTL15, and SGF29 (Fig. 2B, Supplementary Data 2, Supplementary Fig. 3). Potential eGenes, which expression in 49 issues in GTEx demonstrates colocalization signals with the 17 loci are presented in Supplementary Data 4. Among the 17 loci, HMGA2, IGF1, CGA, and TRHR contain SNPs that were previously reported to have genome-wide associations with several thyroid-related traits in prior GWAS analyses (Supplementary Data 3B). Some loci include genes directly involved in thyroid functions, such as TRH and TRHR, as well as genes involved in the regulation of protein signal transduction and cell proliferation (Supplementary Data 5).

Furthermore, through stepwise conditional analyses, we identified that 13 [10 TSH-associated loci: CAPZB, IGFBP5, PDE8B, VEGFA, GLIS3, XPA, MBIP, CEP128, DET1, and MAF; 3 FT4-associated loci: GLIS3, FOXE1, and DET1] out of the 60 loci contained more than two independent signals, contributing to a total of 33 independent signals (Supplementary Data 6, Supplementary Fig. 7). Particularly, we identified additional 10 novel significant signals within 10 loci known for their involvement in thyroid function traits (Supplementary Data 7, Methods).

Among the 33 independent signals, the genetic effects of all 33 (100%) SNPs were consistent across the two hospitals (Supplementary Data 6), and 26 out of 33 (78.8%) signals demonstrated consistent direction and significance at \(P < 0.05\) compared to the two European datasets mentioned earlier10,23 (Supplementary Data 6).

Collectively, we identified a total of 53 independent association signals across 38 loci for TSH levels and 27 signals across 22 loci for the FT4 levels, collectively explaining 6.6% and 2.8% of the phenotypic variance, respectively.

Genetic associations with normal thyroid function within 8–12 gestational weeks

Considering the strong stimulation of thyroid function by hCG, leading to a significant reduction in TSH and an increase in FT4 during early pregnancy (weeks 8 to 12, Supplementary Fig. 1), we conducted a more focused GWAS analysis within this specific timeframe (weeks 8 to 12) to investigate potential association loci related to hCG stimulation. Within the 8 to 12 weeks period, we identified 20 genome-wide significant loci associated with TSH levels and 11 loci associated with FT4 levels (Supplementary Fig. 8). However, we did not find any loci associated with TSH/FT4 within 8 to 12 weeks that were related to previously published hCG GWAS after Bonferroni correction (Supplementary Data 8)24. These findings suggest that the gestational TSH and FT4 levels do not share a significant genetic correlation with hCG.

Interestingly, when comparing the genetic effects of the 31 loci within the 8 to 12 weeks period with the effects estimated from the entire pregnancy period, we observed that the effect size for 4 out of the 20 TSH loci and 6 out of the 11 FT4 loci did not have overlapping 95% confidence intervals between the two periods (Supplementary Fig. 9 and Supplementary Data 9). Notably, the TSH receptor locus (TSHR, marked by the nearest gene CEP128) showed stronger associations with TSH and FT4, specifically during the 8–12 weeks timeframe of pregnancy. According to GTEx, we found that the lead SNP mutation rs17111346-G, associated with TSH levels during the 8–12 gestational weeks, was associated with a higher TSHR expression level in several tissues and organs (Supplementary Fig. 10). In our study, this variant was associated with increased TSH levels and reduced FT4 levels (Supplementary Data 9). These observations may provide a genetic explanation for the occurrence of hypothyroxinemia during this stage, despite hCG stimulation. Individuals carrying specific genetic variants unrelated to hCG stimulation but associated with lower FT4 and higher TSH during early pregnancy, such as the rs17111346-G variant, may have a higher risk of developing hypothyroxinemia despite hCG stimulation.

Identification of genetic associations with thyroid dysfunction

During early pregnancy, the fetus heavily relies on maternal thyroid hormones. Evidence underscores the correlation between untreated clinical thyroid dysfunction and adverse birth outcomes25,26. SHO, characterized by low TSH levels but normal FT4 level, and isolated hypothyroxinaemia, characterized by low free T4 levels and normal TSH levels—represent common mild thyroid dysfunctions during pregnancy associated with adverse pediatric outcomes, including preterm birth4. Nevertheless, the optimal management of subclinical thyroid dysfunction remains contentious, as some treatments for SHO or hypothyroxinemia have not significantly improved cognitive or birth outcomes26,27. Moreover, untreated OHP during pregnancy is linked to various maternal and fetal complications, such as pre-eclampsia and pregnancy loss, while SHP is not thought to be associated with adverse birth outcomes28.

To elucidate the genetic underpinnings of these thyroid dysfunctions, we conducted a GWAS analysis on four categorical traits: SHO, ISH, OHP, and SHP (cases). We utilized pregnant women with euthyroid function, defined as those maintaining TSH and FT4 levels within the pregnancy-specific reference range throughout pregnancy, as the control group. Notably, we also examined clinical hypothyroidism; however, the statistical power was limited due to the small number of cases (NLonggang = 464, NBaoan = 148), and robust genetic associations were not identified. In particular, we excluded individuals who tested positive for TPOAb from both the case and control populations in the analysis for ISH. After exclusions, the sample sizes of the included research subjects were as follows: (1) SHO [N = 73,050 (Longgang: 3666 cases versus 43,525 controls; Longang: 1244 cases versus 24,615 controls)]; (2) ISH: [N = 50,407 (Longgang: 2981 cases versus 37,152 controls; Longang: 613 cases versus 9961 controls)]; (3) SHP [N = 72,631(Longgang: 2598 cases versus 43,525 controls; Longang: 1893 cases versus 24,615 controls)]; (4) OHP [N = 69,788 (Longgang: 1229 cases versus 43,525 controls; Longang: 419 cases versus 24,615 controls)].

Our analysis led to the identification of 18, 8, 12, and 7 genome-wide association loci associated with maternal SHO, ISH, SHP, and OHP, respectively (Fig. 3, Supplementary Data 10, Supplementary Fig. 11). eGenes with colocalization signals with these 45 loci are presented in Supplementary Data 4. All lead SNPs in these 45 loci exhibited consistent directionality within two hospitals, with 42 out of 45 (93.3%) compared at a P value below 0.05 with consistent directionality (Supplementary Fig. 12A). When we replicated these 45 lead SNPs with the NIPT PLUS cohort, 77.8% (35/45) SNPs showed consistent directions of genetic effects (Supplementary Fig. 12B).

Fig. 3: The Manhattan plots for GWAS meta-analyses of thyroid dysfunction.
figure 3

A Subclinical hypothyroidism, B isolated hypothyroxinemia, C overt hyperthyroidism, and D subclinical hyperthyroidism. After performing GWAS analysis of Longgang Study and Baoan Study using plink2.0, meta-analysis was subsequently completed using METAL software. The P value threshold for determining whether SNP signals were significant was adopted from the Bonferroni-corrected threshold commonly used for GWAS analysis(P < 5 × 10−8). The x coordinate of the SNPs in the plot represents their position on each chromosome when the y-coordinate implies their P value (−log10 scale) of the association test. The black dotted horizontal line indicates the significant threshold for the genome-wide association test (i.e., 5 × 10−8). Genomic loci containing phenotype-associated variants previously reported in the GWAS catalog are colored in black, while novel loci are colored in red.

As no previous GWAS studies have been conducted on the four thyroid dysfunctions investigated in this study, we were unable to directly compare the outcome with external datasets. However, we noted that most of these loci have been reported to be associated with thyroid-related traits such as hypothyroidism, TSH, and FT4 (Supplementary Data 3B). After excluding loci previously associated with thyroid traits, we identified 2 novel low-frequency genome-wide thyroid traits loci including ISH-related SPRY1 (rs190595112-G, MAF = 0.012, OR [95%]: 4.22 [2.52–7.07], P = 4.77 × 10−8) and OHP-related GAPVD1 (rs78254323-T, MAF = 0.04, OR [95%]: 1.72 [1.42–2.09], P = 3.94 × 10−8). These two variants demonstrated consistent directionality and significance between the two hospital cohorts (Supplementary Data 10). The GAPVD1 locus demonstrated certain levels of evidence of associations with GAPVD1 expression in the thyroid, while we did not find eGene for SRY1 locus in all tissues (Supplementary Data 4).

Genetic associations with thyroid autoimmunity

Thyroid autoimmunity, characterized by the presence of antibodies targeting thyroperoxidase, thyroglobulin, and TSH receptor antibodies (TRAbs), is prevalent among pregnant women and is often diagnosed through the detection of thyroperoxidase antibodies (TPOAb) in clinical settings. This condition is associated with thyroid dysfunction and adverse obstetric outcomes29. In this study, we conducted GWAS focusing on TPOAb quantitative levels and positivity during pregnancy (Methods). The sample sizes of the included research subjects were as follows (1) quantitative TPOAb levels [N = 60,225 (Longgang: 46,023; Longang: 14,202)]; (2) TPOAb positivity [N = 69,061(Longgang: 7324 cases versus 45,542 controls; Longang: 2894 cases versus 13,301 controls)].

We identified 38 and 33 loci associated with TPOAb levels and TPOAb positivity, respectively (Fig. 4, Supplementary Fig. 13 and Supplementary Data 11), revealing 35 and 30 newly identified association loci, respectively, compared to previously reported signals in the GWAS catalog. Most of the loci for both traits were shared. The four most significant novel loci, including VANGL2, CTLA4, GPX6, and GPR174, were identified as associated with Graves’ disease (GD)/Hashimoto’s thyroiditis (HT) in the Japanese population30 (Supplementary Data 3B). We identified 5 missense variants in novel loci: rs7522061-C in FCRL3, rs3775291-T in TLR3, rs12793348-G in PANX1 rs229527-C in C1QTNF6, and rs2074158-C in ZNF385C. GWAS-eQTL colocalization analyses identified several TPOAb levels and positivity-related loci that regulate the expression of RPS26, RNASET2, C1QTNF6, FCRL1, FCRL3 genes in multiple tissues (Supplementary Data 4). Previous studies have reported that RNASET2 is expressed in CD4 + T-helper and CD8 + T cells31, while FCRL3 is expressed at high levels during the maturation of B-cells and is thought to regulate B-cell signaling in both positive and negative ways32. FCRL1 also plays a role in the assembly of the BCR signalosome, influences B-cell signaling, and enhances humoral responses33.

Fig. 4: The Manhattan plots for GWAS meta-analyses of thyroid autoimmunity.
figure 4

A TPOAb levels, B TPOAb positivity. After performing GWAS analysis of After performing GWAS analysis of Longgang Study and Baoan Study using plink2.0, meta analysis was subsequently completed using METAL software. The P value threshold for determining whether SNP signals were significant was adopted from the Bonferroni-corrected threshold commonly used for GWAS analysis (P < 5 × 10−8). The x-coordinate of the SNPs in the plot represents their position on each chromosome when the y coordinate implies their P value (−log10 scale) of the association test. The black dotted horizontal line indicates the significant threshold for the genome-wide association test (i.e., 5 × 10−8). Genomic loci containing phenotype-associated variants previously reported in the GWAS catalog are colored in black. Before performing the GWAS analysis, TPOAb levels were naturally log-transformed.

For consistency, 67 out of 71 (94.4%) loci demonstrated consistent direction and significance at P = 0.05 between the two hospitals (Supplementary Fig. 14A, Supplementary Data 11). When replicating GWAS for TPOAb levels and positivity in the NIPT PLUS cohort, 62 out of 71 (87.3%) loci showed consistent directions of genetic effects (Supplementary Fig. 14B, Supplementary Data 11). Comparing the results with external datasets, we initially compared our results with the largest previous GWAS study on TPOAb levels and positivity, conducted by Marco Medici et al. in the European population in 2014 (NTPOAb-levels = 16,528 and NTPOAb-positivity = 18,297)34. Since the study involved 2.14 million variants, only 58 of our lead variants (for their proxy SNPs) were available for analysis in this study. Among these SNPs, 50 loci (86.2%) exhibited a consistent direction of effect between Chinese pregnant women and the European population (Supplementary Fig. 15, Supplementary Data 12).

Given the unavailability of previous GWAS summary data for TPOAb in the East Asian population and the high correlation between TPOAb positivity and autoimmune thyroid diseases such as Graves’ disease and Hashimoto’s thyroiditis35, we also examined our associations for TPOAb positivity using the GWAS results for Graves’ disease and Hashimoto’s thyroiditis from the Biobank Japan (BBJ)30. Of these 33 loci, H2BC8 (rs1051365311) and its proxy SNP were not available in the BBJ dataset. According to the result, 15 loci (FCRL3, VANGL2, CTLA4, LPP, CLNK, TCF7, IRF4, GPX6, HBS1L, RNASET2, FAM76B, IFT81, MAF, C1QTNF6, and GPR174) in Grave’s disease (Supplementary Data 13, Supplementary Fig. 16A) and 2 loci (GPX6, VANGL2) in Hashimoto’s thyroiditis showed significant associations (Supplementary Data 13, Supplementary Fig. 16B). The high degree of concordance (32 out of 33) in the direction of locus effects suggests a shared genetic basis between TPOAb positivity during pregnancy and autoimmune thyroid diseases.

Pairwise genetic correlation among thyroid traits

To explore the pairwise genetic correlation among the thyroid traits investigated, we employed LD score regression36. The analysis revealed a significant genetic correlation between TSH and all other thyroid traits, including TPOAb levels and positivity. Conversely, FT4 exhibited correlation with thyroid dysfunctions but not with TPOAb levels and positivity (Supplementary Fig. 17, Supplementary Data 14). Additionally, we observed a correlation between TPOAb level/positivity and SHO. Nonetheless, this genetic correlation lost significance when the analysis was restricted to euthyroid individuals, suggesting that the observed correlation stemmed from shared genetics between TPOAb and TSH.

Consistent with the established inverse correlation in TSH and FT4 regulation through the Hypothalamus-Pituitary-Thyroid Axis (HPT)37, we observed a significant inverse genetic correlation between TSH and FT4 levels (the genetic correlation calculated by LDSC: rgTSH-FT4 = −0.16, se = 0.04, P = 8.6 × 10−5) (Supplementary Fig. 17, Supplementary Data 14), along with shared associations at 11 loci (TSH: CAPZB, TNS1, GLIS3, XPA, MBIP, DET1, MAF, and FT4: FIG4, GLIS3, FOXE1, DET1) (Bonferroni-corrected threshold P < 0.05/60) (Supplementary Fig. 18, Supplementary Data 15). At these 11 loci, alleles associated with higher TSH levels were consistently associated with lower FT4 except for three loci - FIG4, XPA, and FOXE1—where genetic effects were observed in the same direction for both thyroid-related hormones.

The influence of thyroid-related traits on gestational complications and fetal obstetric outcomes

Previous large meta-analyses of observational studies have demonstrated an association between subclinical thyroid dysfunction and maternal complications and adverse birth outcomes, including GDM38, hypertensive disorders39, miscarriage40, preterm birth, and low birth weight4,5. However, RCTs investigating treatments for mild thyroid function abnormalities have yielded inconclusive results27, resulting in uncertainty and controversy regarding the relationship between subclinical dysfunction and pregnancy outcomes, as well as the therapeutic value of addressing mild abnormalities41,42. High-quality MR studies in clinical thyroidology are deemed crucial for gaining valuable insights into the relationship between thyroid functions and birth outcomes43. However, to date, only one MR study has explored the causal effect of TSH and FT4 levels within the normal range on birth weight and did not identify statistically significant causal associations44.

Here, we applied joint phenotypic association and MR to investigate the impact of thyroid function during pregnancy on gestational complications such as hypertension, and gestational diabetes, as well as birth outcomes, including birth weight, birth length, and gestational duration (Supplementary Data 18). Adhering to the three preconditions for a robust MR analysis, we utilize strong instrumental variables (\(F > 10\)), accounted for heterogeneity and horizontal pleiotropy by applying an inverse weighted approach of random effect (IVWRE) as primary results, and conducted several sensitivity analyses, including six additional two-sample MR approaches and three scenarios including overlapping (Meta-Meta) and non-overlapping samples (Longgang-Baoan and Baoan-Longgang)45. In addition, for MR analysis of the relationship between thyroid traits and pregnancy complications, we accounted for the bi-directionality of the causal estimates (Method). Notably, due to the extreme imbalance in the number of cases and controls of gestational hypertension and pre-eclampsia, we employed quantitative systolic and diastolic blood pressure (DBP) as indicators for hypertension. Considering the substantial genetic correlation among the gestational thyroid traits (Supplementary Fig. 17), we employed Bonferroni testing criteria based on the total number of investigated gestational complications and fetal obstetric outcomes (N = 14). The statistical results of all methods are presented in Supplementary Data 18 and 20, while the instrumental variables used are listed in Supplementary Data 19.

Phenotypic association tests revealed numerous strong associations between TSH and FT4 within the normal range, thyroid dysfunction, and glucose and blood pressure traits, including significant associations between ISH and GDM (\(P < 0.004\)) (Fig. 5A, Supplementary Data 17). MR suggested consistent potential causal associations for a few of these correlations after Bonferroni correction, including a significant negative effect of TSH on fasting plasma glucose (FPG), oral glucose tolerance at 1 hour (OGTT1H), and systolic blood pressure (SBP), as well as a significant effect of FT4, SHO, SHP on SBP (Fig. 5B, C, Supplementary Data 18, \(P < 0.004\)). Additionally, we found a nominally significant protective effect of genetically increased thyroid hormone (FT4) on GDM, consistent between MR and observational studies (observational study: ORlonggang [95% CI]: 0.861 [0.833~0.89], Plonggang = 8.49 × 10−19; MR: OR [95% CI]: 0.844 [0.732~0.972], P = 1.88 × 10−2). Reverse MR did not identify significant potential causal associations between the pregnancy phenotypes and thyroid-related phenotypes after Bonferroni correction (Fig. 5B, Supplementary Data 20).

Fig. 5: The association between thyroid-related traits and maternal blood glucose and blood pressure traits during pregnancy.
figure 5

A Observational analysis. In each grid, the upper left triangle shows the regression coefficients for the phenotypic associations in the Longgang cohort, while the lower right triangle shows those for the Baoan cohort. To facilitate the comparison of effect estimates corresponding to different traits, we converted the absolute values of the estimates of phenotypic associations into Q-quantiles for plotting. We completed the analysis of phenotypic associations between gestational phenotypes and thyroid traits using generalized linear model (glm function in R 4.3.2). Black color of triangle indicates that the result is not available(due to insufficient sample size). B Bidirectional mendelian randomization, the upper left triangle of each grid indicates the results of forward MR (thyroid traits as exposure variables), while the lower right triangle indicates reverse MR (thyroid traits as outcome variables). A, B, if there is a significant difference (P < 0.05) between the effect estimates and 0, the middle of the triangle is labeled with a red asterisk (P < 0.05 and P > 0.05/14) or two red asterisks (P < 0.05/14). C The forest plot of MR results under 3 scenarios between TSH and 5 blood glucose and blood pressure trait. The MR results for three scenarios are shown: (1) Meta-Meta: the two-sample MR method is conducted in one-sample setting; (2) Baoan-Longgang: Baoan Study served as the source of exposed GWAS data, while the Longgang Study provided the outcome population. (3) Longgang-Baoan: Longgang Study served as the source of exposed GWAS data, while the Baoan Study provided the outcome population. Shown in column 4 of the figure are effect sizes (beta) of the MR results estimated using the inverse-variance weighted using multiplicative random effects approach (Method), and the error bars represent 95% confidence intervals. The squares in the middle of the lines are labeled red when there is a significant difference between the effect size and 0, and black otherwise. Abbreviation: TSH TSH levels within reference range during pregnancy, FT4 FT4 levels within reference range during pregnancy, SHO subclinical hypothyroidism during pregnancy, ISH isolated hypothyroxinemia during pregnancy, OHP overt hyperthyroidism during pregnancy, SHP subclinical hyperthyroidism during pregnancy, TPOAb TPOAb levels, Tb(+) TPOAb positivity during pregnancy and EUTb(+) TPOAb positivity with normal functioning thyroid (i.e., euthyroid). SBP systolic blood pressure, DBP diastolic blood pressure, HBA1c, glycated hemoglobin, FPG Fasting plasma glucose, OGTT0H oral glucose tolerance test 0 hour, OGTT1H oral glucose tolerance test 1 hour; OGTT2H oral glucose tolerance test 2 hour, GDM gestational diabetes mellitus.

As for birth outcomes, observational analysis in the Longgang and Baoan study consistently showed that maternal FT4 levels within the reference range were significantly associated with lower birth weight, shorter birth length, and a higher risk of low birth weight. Correspondingly, we also observed a consistently significant positive correlation between ISH during pregnancy and greater birth weight, as well as a significant association between TPOAb levels and decreasing birth weight (Fig. 6A, Supplementary Data 17). However, in MR analysis, we only discovered a significant potential causal effect of elevated TSH levels during pregnancy on greater gestational duration (days) after Bonferroni correction (βMR-IVWRE 95% CI = 0.07 [0.04–0.11], PMR-IVWRE = 7.80 × 10−5) (Fig. 6B, Supplementary Data 18). Similar results were observed across other methods of MR, except for simple mode (SM), which showed attenuation toward the null. We did not identify MR associations between thyroid dysfunction and birth outcomes (Fig. 6B, Supplementary Data 18).

Fig. 6: The association between thyroid-related traits during pregnancy and birth outcomes.
figure 6

A Prospective observational analysis. In each grid, the upper left triangle shows the regression coefficients for the phenotypic associations in the Longgang cohort, while the lower right triangle shows those for the Baoan cohort. We completed the analysis of phenotypic associations between gestational phenotypes and thyroid traits using generalized linear model (glm function in R 4.3.2). B Mendelian randomization. The upper left triangle of each grid indicates the regression coefficients of the phenotypic association results in the Longgang cohort, while the lower right triangle indicates the regression coefficients of the phenotypic association results in the Baoan cohort. A, B, if there is a significant difference (P < 0.05) between the effect estimates and 0, the middle of the triangle is labeled with a red asterisk (P < 0.05 and P > 0.05/14) or two red asterisks (P <0.05/14). C Mendelian randomization analysis for casual associations between TSH and thyroid cancer and between TSH and gestational duration. TSH and gestational duration are based on summary statistics from the meta analysis of the Baoan and Longgang Study (Sample size NTSH = 67,471, and Ngestational duration = 51,592). The results of the IVW, IVWRE, MR Egger, Simple mode, and Weighted mode methods are based on the analysis of the IVs excluded by MR PRESSO. The crosshairs on the plots represent the effect sizes(beta) and 95% confidence intervals for each SNP-TSH or SNP-outcome association. Abbreviation: IVW inverse-variance weighted, IVWRE inverse-variance weighted with multiplicative random effects. TSH TSH levels within reference range during pregnancy, FT4 FT4 levels within reference range during pregnancy, SHO subclinical hypothyroidism during pregnancy, ISH isolated hypothyroxinemia during pregnancy, OHP overt hyperthyroidism during pregnancy, SHP subclinical hyperthyroidism during pregnancy, TPOAb TPOAb levels, Tb (+) TPOAb positivity during pregnancy and EUTb(+): TPOAb positivity with normal functioning thyroid (i.e., euthyroid).

Bidirectional MR between gestational thyroid traits and 220 phenotypes in BBJ

We further investigated the potential relationship between thyroid-related traits during pregnancy and 220 health phenotypes and outcomes in BBJ, with an average age of 62, by conducting a bidirectional MR analysis (Methods). Consistent significant associations in both directions suggest genetic sharing rather than causal relation. The Bonferroni testing criteria are based on the total number of investigated BBJ phenotypes (N = 220), considering the substantial genetic correlation among the gestational thyroid traits (Supplementary Fig. 17). Results surpass the Bonferroni and MR Egger pleiotropy testing criteria, which mainly pertain to BBJ thyroid and cardiac disorders, are discussed herein. The instrumental variables used in the forward MR analysis are presented in Supplementary Data 22.

Regarding thyroid autoimmunity, forward MR analysis suggested that genetically determined elevated susceptibility to maternal TPOAb positivity is associated with an increased risk for all 7 thyroid disorders, with 6 of these associations remaining significant after Bonferroni correction: Graves’ disease (OR [95% CI]: 1.55 [1.41~1.704], P = 9.68 × 10−20), thyroid preparations usage (OR [95% CI]: 1.364 [1.259~1.478], P = 2.66 × 10−14), hypothyroidism (OR [95% CI]: 1.441 [1.297~1.6], P = 9.10 × 10−12), hyperthyroidism(OR [95% CI]: 1.478 [1.318~1.658], P = 2.37 × 10−11), Hashimoto thyroiditis (OR [95% CI]: 1.454 [1.27~1.666], P = 6.61 × 10−08), and goiter (OR [95% CI]: 1.163 [1.077~1.256], P = 1.12 × 10−04) (Fig. 7, Supplementary Fig. 19A–F, Supplementary Data 21). Similar observations apply to maternal TPOAb levels and to euthyroid TPOAb positivity with normal TSH and FT4 levels (Supplementary Fig. 19G~N). However, in the reverse MR, statistics are unavailable for thyroid cancer, hypothyroidism, Hashimoto thyroiditis, and goiter due to the unavailability of instrumental variables. Significant bi-directionality was observed for Graves’ disease and thyroid preparations, and the same effect direction was observed for hyperthyroidism (Fig. 7). In addition to thyroid-related diseases, forward MR also revealed that genetic TPOAb positivity is associated with a reduced risk of depression (OR [95% CI]: OR [95% CI]: 0.848 [0.793~0.907], P = 1.60 × 10−6) and an increased risk of Cardiac valvular disease (OR [95% CI]: OR [95% CI]: 1.109 [1.052~1.169], P = 1.24 × 10−4) (Supplementary Fig. 19O~P). Reverse MR is unavailable for both disorders due to a lack of IVs (Fig. 7, Supplementary Data 23).

Fig. 7: The results of bidirectional Mendelian randomization analysis between thyroid traits during pregnancy and thyroid-related diseases in middle-aged and elderly people.
figure 7

The upper left triangle of each grid indicates the results of forward MR (thyroid traits as exposure variables), while the lower right triangle indicates the results of reverse MR (thyroid traits as outcome variables). The black color of the triangle indicates that the result is not available (due to an insufficient number of IVs). Abbreviation: TSH TSH levels within reference range during pregnancy, FT4 FT4 levels within reference range during pregnancy, SHO subclinical hypothyroidism during pregnancy, ISH isolated hypothyroxinemia during pregnancy, OHP overt hyperthyroidism during pregnancy, SHP subclinical hyperthyroidism during pregnancy, TPOAb TPOAb levels, Tb(+) TPOAb positivity during pregnancy and EUTb(+) TPOAb positivity with normal functioning thyroids (i.e., euthyroid).

In the forward MR analysis of both normal TSH, FT4, and thyroid dysfunction, we observed that genetically elevated TSH levels and a high risk of SHO were linked to an increased usage of thyroid preparations, while OHP was linked to a decreased usage of thyroid preparations (P < 1.72 × 10−4) (Fig. 7, Supplementary Fig. 20A–C and Supplementary Data 21). Except for OHP, TSH and SHO suggest bi-directionality of the MR effect. In addition, genetically elevated TSH (OR [95% CI]: 0.66 [0.55–0.79], P = 1.12 × 10−5) and SHO (OR [95% CI]: 0.81 [0.74–0.89], P = 8.54 × 10−6) were associated with a lower risk of goiter (Supplementary Fig. 20D, E). However, reverse MR is unavailable for both disorders due to a lack of IVs (Fig. 7, Supplementary Data 23). Moreover, we found genetically elevated TSH levels but not thyroid dysfunction are associated with a reduced risk of atrial fibrillation (OR [95% CI]: 0.868 [0.81–0.931], P = 7.91 × 10−5) (Supplementary Fig. 20F), which did not demonstrate evidence of causality in the reverse MR, suggesting potential causal relation.

Discussion

Utilizing hospital electronic medical records and NIPT data from 85,421 Chinese pregnancies, we conducted the first and most comprehensive GWAS studies to date of thyroid traits during pregnancy, encompassing thyroid-related hormones, dysfunction, and autoimmunity. Simultaneously, our GWAS facilitates the exploration of thyroid function’s impact on gestational complications as well as on birth outcomes, utilizing the methods of MR.

In summary, we identified 176 genome-wide loci, among which 125 genome-wide associations had not been previously reported for the 8 thyroid-related traits. This significantly enhances our understanding of the genetic basis of thyroid traits during pregnancy. The high consistency of genetic effects observed across two independent hospitals and in comparison with external datasets demonstrates the robustness of our GWAS results. Particularly, while the TRH and TRHR genes are well-established in the thyroid axis37, our study is the first to identify them as genome-wide association loci for thyroid function (FT4), signifying a significant power gain in our analysis. Moreover, we identified 33 independent association signals in 13 loci (TSH: CAPZB, IGFBP5, PDE8B, VEGFA, GLIS3, XPA, MBIP, CEP128, DET1, and MAF; FT4: GLIS3, FOXE1, and DET1) by the stepwise conditional analysis, suggesting allelic heterogeneity at the thyroid-associated loci. Additionally, comparing the genetic associations with normal TSH and FT4 within 8–12 gestations suggests that individuals carrying specific genetic variants have a higher risk of hypothyroxinemia, despite hCG stimulation.

In the first GWAS of gestational thyroid dysfunction, we identified 18, 8, 12, and 7 genome-wide loci associated with SHO, ISH, SHP, and OHP during pregnancy, respectively. Additionally, our study has doubled the number of genome-wide loci associated with TPOAb levels and positivity. These loci also exhibited connections with other thyroid-related traits and conditions, including hypothyroidism, TSH levels, and the use of thyroid preparations. Further functional and mechanistic studies may be warranted to explore potential new therapeutic targets.

We completed the first joint epidemiological and MR analysis of gestational thyroid phenotypes and complications. Statistics suggest that genetically predicted lower TSH and higher FT4 correlate with elevated blood glucose levels. However, we did not identify a significant MR association between thyroid dysfunction and GDM, implying a continuum of TSH and FT4 action on blood glucose and blood pressure regulation. Previous studies have yielded conflicting conclusions regarding the association between SHO and GDM46,47,48, and our results suggest that the impact of abnormal thyroid function on GDM varies depending on the criteria used for TSH and blood glucose assessment. Moreover, our results suggest the involvement of TSH and FT4 in blood pressure regulation. While a previous MR analysis suggested that genetically predicted TSH elevation leads to reduced SBP, this finding lacked significance after Bonferroni correction49. Our study provides more robust evidence to support this conclusion.

In the investigation of relationship of gestational thyroid phenotypes and birth outcomes, retrospective observational analyses revealed that ISH was associated with higher birth weight, while FT4 levels within the reference range were inversely associated with birth weight, aligning with findings from a previous large meta analysis5. However, we did not find sufficient evidence to suggest that abnormal TSH levels during pregnancy (OHP, SHP, SHO) and TPOAb positivity increase the risk of adverse outcomes such as preterm delivery, macrosomia, and low birth weights. Nonetheless, we cannot exclude the possibility that thyroid dysfunction and autoimmunity may contribute to other adverse outcomes. Consistent with a previous MR study44, we also did not identify an MR association between maternal TSH levels within the reference range and birth weight. Our MR analyses provide evidence suggesting that maternal TSH levels during pregnancy may potentially influence gestational duration, although the effect may be nonlinear. Notably, genetically predicted higher maternal TSH is nominally associated with a heightened risk of preterm birth, and previous research has demonstrated a U-shaped association between FT4 levels and preterm birth50. Further validation through intervention studies is warranted to ascertain the genuine effects.

TPOAb is commonly acknowledged as a pathogenic antibody associated with Hashimoto’s thyroiditis (HT), yet its role in Graves’ disease (GD) remains uncertain, despite the fact that 50–70% of individuals with GD also exhibit TPOAb positivity51,52. And it has been proposed as a marker of impaired thyroid function rather than a direct cause of thyroid damage2. Our bidirectional MR analyses with BBJ phenotypes suggest a genetic link between TPOAb positivity and several thyroid disorders in older individuals, regardless of their thyroid function status. However, given that significant bidirectional MR associations between maternal TPOAb and thyroid autoimmune disease in older age may suffer from instrumental invalidity - a common issue in bidirectional analyses- future research into the biological role of genetic variants associated with TPOAb is necessary to resolve this issue.

Additionally, we observed that genetically predicted maternal TPOAb positivity was associated with an increased risk of cardiac valvular disease, consistent with previous observations of an association between thyroid autoimmunity and rheumatic heart valve disease53. In addition, in line with the findings of previous MR studies conducted in populations of European ancestry23,54, identified-protective effects of lower thyroid function during pregnancy against atrial fibrillation and goiter in East Asians. The consistently observed genetic correlation or MR associations of gestational thyroid phenotypes and later-age thyroid and cardiac diseases suggest the potential for early screening of thyroid disorders in early adulthood and may warrant further clinical studies.

Despite the positive findings, there are limitations in our study. Firstly, while we conducted the first and most comprehensive GWAS study of thyroid-related traits in pregnancy, focusing on a critical period of human reproduction, the study population consisted solely of Chinese women, potentially limiting the generalizability of our findings to other ancestries. However, we would like to note that NIPT data are widely adopted in clinical settings due to their high sensitivity and specificity55. The methods used in this study17,18 will facilitate further investigations into thyroid-related traits in pregnancy across different populations, enhancing our understanding of the trans-ancestral genetic basis. In addition, the shared genetic correlation between the gestational thyroid traits in this study and the thyroid phenotypes in the European general population suggests the existence of a strong correlation between thyroid traits in different life stages.

Secondly, in our MR analysis examining the relationship between gestational thyroid traits and complications as well as birth outcomes, we utilized a strategy involving two-sample MR analytical methods within a single cohort setting, which has been empirically proved not to be prone to sample-overlapping bias if the IVs were sufficiently strong45. Nonetheless, to enhance the robustness of our findings, replication in a two-sample MR without sample overlap would be ideal. We attempted to address this by conducting analyses in multiple scenarios, including using exposures from one hospital and outcomes from the other hospitals, to support our MR results. These strategies consistently suggested that genetically determined elevated TSH levels during pregnancy were associated with lower glycemic levels, reduced blood pressure, and longer gestational duration.

Furthermore, in our MR study of gestational thyroid traits and birth outcomes, we did not differentiate between fetal genetic and intrauterine effects of exposure on birth outcomes. This aspect could benefit from the expansion of sample size in family genomes from birth cohorts in Chinese ancestry56. Additionally, when analyzing the impact of thyroid traits on other outcomes, we focused solely on additive linear effects, without considering nonlinear effects. Finally, in the bidirectional MR study between gestational thyroid traits and phenotypes in BBJ, we were unable to distinguish genetic correlation from potential causal effect of the exposures of thyroid-related traits during pregnancy on a later-age health result. This purpose can be addressed when GWAS studies of the same thyroid traits investigated in this study among general East Asian population become available, enabling a multi-variable MR analysis integrating GWAS statistics of thyroid traits in pregnancy and among the general populations57.

Despite some limitations in the genetic research, our study suggests two clinical implications. First, our findings support the hypothesis that serum levels of TSH and FT4 exhibit substantial inter-individual variation among healthy individuals, with much smaller intraindividual variation58. This indicates that each person has a unique setpoint for TSH and FT459. Hence, when initiating RCTs or treatments for thyroid disorders in pregnant women, it may be beneficial to consider the genomic profile of the patients, as revealed in our study. This approach can help prevent overtreatment and may explain varying responses of the pregnant women to medications such as levothyroxine, which is commonly prescribed for gestational thyroid disorders27. Secondly, as pregnancy screening represents the most comprehensive physical examination during early adulthood for most women, it is valuable to explore early diagnosis and preventive strategies for thyroid disorders later in life, starting from this critical early period.

Methods

Longgang and Baoan Study

The Longgang Study enrolled 70,608 pregnant women who received NIPT results during the first or second trimester at Longgang District Maternity and Child Healthcare Hospital of Shenzhen City, located in the eastern region of Shenzhen, China, between 2017 and 2022. Of these, 58,351 participants had available measurements for TSH, FT4, and TPOAb levels.

The Baoan Study included 50,948 pregnant women who visited the Shenzhen Baoan Women’s and Children’s Hospital in the western region of Shenzhen, China, and completed an NIPT test during the first or second trimester between 2017 and 2022. Of these, 33,781 participants had available TSH measurements, 33,516 had FT4 measurements, and 16,393 had TPOAb measurements.

This study was reviewed and approved by the Ethics Committee of School of Public Health (Shenzhen), Sun Yat-Sen University (2021. No. 8), as well as the Institutional Board of Shenzhen Baoan Women’s and Children’s Hospital (LLSC2021-04-01-10-KS) and Longgang District Maternity and Child Healthcare Hospital of Shenzhen City (LGFYYXLLL-2022-024). The study strictly adhered to regulations governing ethical considerations and personal data protection. Data collection was approved by the Human Genetic Resources Administration of China (HGRAC). Written informed consent of all participants were obtained.

Biochemical measurements and definition of thyroid-related traits

Because TSH and FT4 concentrations during pregnancy are related to gestational week, pregnancy-specific reference ranges are recommended1. In this project, the diagnosis of thyroid dysfunction was based on a method-specific and pregnancy-specific reference interval, as recommended by the Chinese Medical Association Guideline on the diagnosis and management of thyroid diseases during pregnancy and postpartum (2nd edition)60. We also used the cutoff value provided by the kit company to determine whether the subject was in a TPOAb positivity state.

In the Longgang study, thyroid function tests measured concentrations of thyroid-stimulating hormone (TSH), free thyroxine (FT4), and anti-TPOAb. These measurements were taken during a physical examination using the UniCel DxI 800 Immunoassay System (Beckman Coulter Inc., Brea, CA, USA) along with its respective system accessory kit. The reference ranges for TSH are 0.05–3.55 mIU/L for the first trimester, 0.21–3.31 mIU/L for the second trimester, and 0.43–3.71 mIU/L for the third trimester. The reference ranges for FT4 are 9.01–15.89 pmol/L for the first trimester, 6.62–13.51 pmol/L for the second trimester, and 6.42–10.75 pmol/L for the third trimester60. TPOAb positivity was defined according to the cutoff (TPOAb > 9 IU/mL).

In Baoan study, serum levels of TSH, FT4, and TPOAb were measured by Abott I2000 Immunoassay System. The reference ranges of TSH are 0.07–3.38 mIU/L, 0.34–3.51 mIU/L and 0.34–4.32 mIU/L for the first trimester, second trimester and third trimester, respectively. And the reference ranges of FT4 are 11.30–17.80 pmol/L, 9.30–15.20 pmol/L and 7.90–14.10 pmol/L for the first trimester, second trimester, and third trimester, respectively60. The cutoff value we used for TPOAb positivity is 5.6 IU/mL.

In each study, a pregnant woman with both TSH and FT4 levels within the pregnancy-specific reference range was considered a subject with normal thyroid function (ie, euthyroidism). The serum TSH and FT4 levels were analyzed as continuous variables after inverse normal transformation, based on the euthyroid subjects. Additionally, for the GWAS of continuous TPOAb levels, pregnant women exhibiting TPOAb levels below the minimum detection limits of the assay (0.25 IU/mL for the Longgang Study and 0.159l IU/mL for the Baoan Study) were excluded. After that, TPOAb levels were natural log-transformed before performing the GWAS analysis.

The thyroid dysfunction was defined following the 2017 medical guidelines of the American Thyroid Association for the Diagnosis and Management of Thyroid Disease During Pregnancy and the Postpartum1. Specifically, OHP was defined as a TSH concentration below the lower limit of the reference range, accompanied by an FT4 concentration exceeding the upper limit of the reference range. SHP was defined as a TSH concentration below the lower limit of the reference range while maintaining a normal FT4 concentration. SHO was defined by as a TSH concentration upper the limit of the reference range with normal FT4 concentration. ISH is typically defined as having an FT4 concentration below the lower limit of the reference range while maintaining a normal maternal TSH concentration.

To comprehend the genetic underpinnings of these thyroid dysfunctions, we conducted a GWAS analysis on four categorical traits: SHO, ISH, OHP, and SHP (case group). We used pregnant women with euthyroid function, defined as those maintaining TSH and FT4 levels within the pregnancy-specific reference range throughout pregnancy, as the control group. Specifically, we excluded individuals who tested positive for TPOAb from both the case and control populations in the analysis for ISH (Supplementary Data 1). It is worth noting that we also examined overt hypothyroidism; however, the statistical power was limited due to the small number of cases (NLonggang = 464, NBaoan = 148) and we did not identify robust genetic associations.

Statistical analysis

For all participants, baseline characteristics were described using mean ± standard deviation, and results of thyroid function tests (TSH, FT4, and TPOAb) for different pregnancies were presented in median [interquartile range]. In the phenotype association analysis of thyroid-related traits with gestational complications and fetal obstetric outcomes, quantitative traits were analyzed using multiple linear regression, and categorical traits were analyzed using logistic regression. All statistical analyses were conducted using R version 4.3.2.

Genotyping, imputation, and variant annotation

NIPT, which employs low-pass massively parallel sequencing of cell-free DNA fragments from peripheral blood of a pregnant woman, has gained prominence as an unparalleled screening test for fetal aneuploidy due to its high sensitivity and specificity61. Our prior research and recent study have illustrated that when NIPT data (0.06×–0.3×) are integrated with a reference panel, it can achieve high accuracy in genotype imputation, matching or even exceeding the performance of an array17,18.

In summary, we first collected data from NIPT(BGI SEQ-500 were used for NIPT test of peripheral blood) for 121,578 pregnant women in Baoan and Longgang Hospitals in Shenzhen city, Guangdong Province. Each of these participants was whole-genome sequenced to 9.9–21.9 million cleaned reads, representing a sequencing depth of around 0.11×–0.25×. Next, we applied BWA62 to align the cleaned reads to the Genome Reference Consortium Human Reference38 (GRCh38) and used the rmdup option in samtools63 to remove potential PCR duplicates. The GATK realignment and base quality recalibration method was utilized to align the reads and adjust base quality scores64. After that, the alignment files were stored as bam files.

Subsequently, we employed the Glimpse software (version 1.1.1)65, which implements an algorithm for genotype imputation, in conjunction with a deep whole-genome sequencing reference panel of 10 K Chinese66, to perform individual genotype imputation for the NIPT data. Finally, variant annotation was carried out using Ensembl Variant Effect Predictor67 (version 101), utilizing indexed GRCh38 cache files (version 109). All the data used for annotation were obtained in advance from the Ensembl FTP server (https://ftp.ensembl.org/pub/). Considering that a variant can span across multiple transcripts, we employed the “--pick” option to assign a single consequence block to each variant based on a pre-defined set of default criteria in VEP. In addition, the “--nearest” option was applied to identify the nearest gene with a protein-coding transcription start site (TSS) for variants located in the intergenic region.

Genome-wide association analysis in individual Studies

The association of the SNPs was analyzed by GWAS in PLINK 2.0, using linear regression for three continuous traits (levels of TSH, FT4, and TPOAb) and logistic regression for five binary phenotypes (SHO, ISH, SHP, OHP, and TPOAb positivity), separately in two cohorts (Baoan and Longgang). The genotype-phenotype association was conducted using an additive genetic model on SNP dosages. Moreover, analyses were adjusted for the gestational week of the thyroid test, maternal age, and the top ten principal components to account for population stratification.

Meta-analysis

We conducted fixed-effect GWAS meta-analyses based on an inverse-variance weighting of Baoan and Longgang using METAL software68. To visualize the meta analysis results, we generated a Manhattan plot and a quantile–quantile (QQ) plot with a R script.

In terms of interpreting the results of the meta analysis, we defined the lead SNP as the SNP with the most significant P value in the single-SNP association test within a 1-Mbp window. A locus was considered to have reached the significance threshold if the lead SNP within that locus had a P value ≤ 5 × 10−8. A locus is considered novel locus for the trait if its leading SNP is in linkage equilibrium (LD R2 < 0.2) with any known variants (listed in the GWAS Catalog as of December 31, 2023) on the same chromosome. Specifically, using the GWAS catalog database, we obtained all the trait-related known variants located on the same chromosome as our novel lead variants (Triat ID in the Experimental Factor Ontology: TSH: EFO_0004748, FT4: EFO_0005130, TPOAb measurement: EFO_0005666, and hyperthyroidism: EFO_0009189)69. And then, for all of our novel SNPs, we use East Asians (EAS) and Europeans (EUR) populations in the 1000 G genome to calculate the linkage disequilibrium R-square between variants we identified and all known SNPs on the same chromosome using LDpair (https://ldlink.nih.gov/?tab=ldpair). The results of linkage disequilibrium R-square between novel variants and known variants are shown in Supplementary Data 3A.

Conditional analysis

Stepwise conditional analyses were performed for the GWAS-meta results to identify independent signals that affect TSH and FT4 levels, conditioning on the most significant variants (known and novel) identified in our GWAS. After conditioning on the lead genome-wide significant variant, variants identified within a 1 MB region of the variant with a P value < 5.0 × 10−8 were considered independent significant signals. These conditional analyses were repeated, adding in the conditional lead variants until no variant had a conditional P value less than the genome-wide significance (P < 5.0 × 10−8). To determine whether the identified independent significant signals at known loci were independent novel signals, we analyzed East Asian (EAS) and European (EUR) populations from the 1000 Genomes Project. We used LDpair (https://ldlink.nih.gov/?tab=ldpair) to calculate the linkage disequilibrium R-square () between the variants we identified and those previously associated with trait-linked traits. Subsequently, we defined an SNP as an independent significant signal at a locus if the R² was less than 0.2 in both EAS and EUR populations.

Consistency and replication

To enhance the reproducibility of the findings, we checked internal and external consistency across different datasets. Consistency between the two hospitals involved examining the consistency of the effect direction of the lead SNPs and assessing the significance of the GWAS P values in the Baoan and Longgang Studies. On the other hand, consistency of our meta-GWAS results and external datasets involved examining the consistency in an independent dataset from three separate cohorts (ThyroidOmics Consortium, the GWAS meta analysis of Alexander T Williams et al., the GWAS meta analysis of M. Medici et al.)10,11,34.

Beyond the participants previously referenced, our research included 4688 individuals(NIPT PLUS cohort) who attended Shenzhen Baoan Women’s and Children’s Hospital for maternity check-ups during their 40-week gestation. Throughout 2020 and 2021, these participants visited the Hospital in the first or second trimester. These samples notably underwent more in-depth sequencing than standard NIPT, achieving an average depth of 0.3x. The serum levels of TSH, FT4, and TPOAb in NIPT PLUS cohort in were measured by Abott I2000 Immunoassay System with its respective system accessory kit. Using a process similar to Baoan Study, we completed GWAS for thyroid-related traits in the NIPT PLUS cohort.

eQTL colocalization analysis

Colocalization analyses were performed for novel lead variants using coloc R package70. Gene expression data from 49 tissues in European ancestry samples included in the GTEx Project version 8 release were used (https://gtexportal.org/)71. GWAS data from a region spanning ±1 Mb around each novel SNPs were compiled, integrated with the corresponding GTEx eQTL data for each specific tissue, and subsequently used as the input for related analyses. Regions that show evidence of colocalization between eQTL and GWAS signals were identified using pre-defined thresholds: PP4 ≥ 0.75 and PP4/PP3 ≥ 370. Here, PP4 denotes the posterior probability of Hypothesis 4, which assesses the likelihood that both the assumed model are correct and that the GWAS and expression data are simultaneously associated with a single causal variant. PP3 denotes the posterior probability that two distinct causal variants exist for GWAS and expression data, respectively. Given the genetic structural differences between our East Asian cohort and the European populations, our findings should be considered an exploratory endeavor.

Genetic correlation estimate

The meta-GWAS summary statistics were used to estimate SNP-based heritability and genetic correlation (rg) for all phenotypes using LD Score Regression36,72. GCTA was used to calculate LD score files using a deep whole-genome sequencing reference panel of 10k Chinese36,66,73.

Mendelian randomization

MR is a method for inferring causal associations between exposures and outcomes through the use of genetic variants as instrumental variables74. To avoid bias caused by linkage disequilibrium, we selected the IV (instrumental variable) SNPs that reached genome-wide significance in GWAS meta-analysis (P < 5 × 10–8) and achieved independence in linkage disequilibrium (LD) R2 = 0.2 by GCTA-COJO analysis75. The SNPs with a minor allele frequency (MAF) of <0.01 were excluded76.

Since GWAS summary data for maternal and fetal pregnancy outcomes in East Asian ancestry is currently unavailable, we conducted MR analyses between thyroid-related traits and maternal and fetal pregnancy outcomes using the two-sample method within a single cohort45,77, utilizing the TwoSampleMR package after completing our GWAS meta analysis of the Baoan Study and Longgang Study78. We implemented MR analyses under 3 scenarios:

  1. (1)

    Meta-Meta: the two-sample MR method is conducted in one-sample setting;

  2. (2)

    Baon-Longgang: Baoan Study served as the source of exposed GWAS data, while the Longgang Study provided the outcome population.

  3. (3)

    Longgang-Baoan: Longgang Study served as the source of exposed GWAS data, while the Baoan Study provided the outcome population.

In order to achieve sufficient power as much as possible, we regard meta-meta as the main MR result, and the MR results of the remaining two situations serve as support for the meta-meta result.

The variance of continuous traits explained by variants () was estimated using the formula:

$${R}^{2}=\frac{{2\times ({effect\; size})}^{2}\times {{\rm{MAF}}}\times (1-{{\rm{MAF}}})}{{2\times ({effect\; size})}^{2}\times {{\rm{MAF}}}\times \left(1-{{\rm{MAF}}}\right)+{2\times ({SE})}^{2}\times {{\rm{MAF}}}\times (1-{{\rm{MAF}}})}$$

where the effect size is measured in units of the continuous traits after inverse normal transformation. We calculated the F-statistic to evaluate the instrument strength, with values greater than 10 indicating sufficient strength. We applied inverse variance weighted (IVW), IVWRE, MR Egger, weighted median (WM) methods, weighted mode, and outlier methods using MR PRESSO79. If MR PRESSO successfully excludes outlier IVs, we re-execute the MR analysis using the new set of IV SNPs. To avoid the effects of the presence of heterogeneity in IVs, the IVWRE method is regarded as the most accurate estimation.

Maternal and fetal pregnancy outcomes

We made full use of relevant data on gestational diabetes and gestational hypertension in our cohort. Because the prevalence of gestational hypertension and pre-eclampsia was less than 2% in our cohort (gestational hypertension: 0.95% for Longgang and 0.47% for Baoan; pre-eclampsia: 0.42% for Longgang and 1.39% for Baoan), we did not include these two pregnancy complications in our analysis.

Finally, we included five blood glucose traits (FPG: Fasting Plasma Glucose; HBA1c: Glycated Hemoglobin; OGTT0H: Oral Glucose Tolerance Test at 0 hours; OGTT1H: Oral Glucose Tolerance Test at 1 hour; OGTT2H: Oral Glucose Tolerance Test at 2 hours) and two blood pressure traits (SBP; DBP) in our analysis. We completed the meta-GWAS analyses for these seven quantitative traits and GDM.

Furthermore, we conducted GWAS and meta-GWAS analyses for six birth outcomes, including gestational duration (birth gestational days), preterm birth (deliveries occurring at less than 37 weeks gestational age), birth weight, birth length, low birth weight (defined as birth weight less than 2500 g), and macrosomia (defined as birth weight >4000 g)5,80. For the birth outcomes of birth gestational days and preterm birth, we limited the study subjects to mothers who delivered live-born offspring through vaginal delivery.

Given that some studies have shown that TPOAb positivity with euthyroid is also a risk factor for various pregnancy complications1, we included euthyroid TPOAb positivity as an exposure to our analysis. Therefore, after deciding on the list of IV SNPs, we conducted MR analyses of associations between 9 thyroid-related traits (TSH levels within reference range during pregnancy, FT4 levels within reference range during pregnancy, SHO during pregnancy, ISH during pregnancy, OHP during pregnancy, SHP during pregnancy, TPOAb levels during pregnancy, TPOAb positivity with euthyroid during pregnancy, and TPOAb positivity during pregnancy) and 14 maternal and neonatal outcomes (GDM, FPG, HBA1c, OGTT0H, OGTT1H, OGTT2H, SBP, DBP, gestational duration, preterm birth, birth weight, birth length, low birth weight, macrosomia).

We also used phenotypic associations to explore associations between 9 thyroid traits and 14 maternal and neonatal outcomes. We collected birth outcome data for 92,132 pregnant women with thyroid function test results from two hospitals (Supplementary Data 16). Before performing phenotypic association analyses, we excluded pregnant women with multiple pregnancies, vitro fertilization, and aborted stillbirth outcomes. In the regression analysis of birth length, birth weight, low birth weight, and Macrosomia, we adjusted for fetal sex, BMI, age, and birth gestational days. For the analysis of gestational duration and preterm birth, we adjusted only for fetal sex, maternal BMI, and maternal age.

Because the 9 thyroid trait exposures are highly correlated (Supplementary Fig. 17, Supplementary Data 14), we computed the Bonferroni correction mainly based on the number of outcomes. A Bonferroni correction for the 14 traits (P value < 0.05/14 = 0.0036) was applied on the IVWRE P value to define significant results of phenotypic associations and MR.

BBJ study

To investigate the potential causal association between thyroid traits during pregnancy and the physical conditions and diseases in older age, we conducted a bidirectional two-sample MR study using our GWAS results and datasets from the BBJ Study. The BBJ Study conducted genome-wide association studies for 220 deep phenotypes (including diseases, biomarkers, and medication usage) in a cohort of 179,000 Japanese individuals30. Approximately 47% of these subjects were women, and the average age of all subjects was over 62 years81.

A Bonferroni correction for the 220 traits (P value < 0.05/220 = 0.00023) was applied to the IVWRE P value to define significant MR results.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.