Introduction

Early infancy represents a window of developmental plasticity during which environmental exposures can modulate chronic disease risk. Accelerated postnatal weight gain trajectories are associated with increased risk of diabetes, obesity and cardiovascular disease both in humans1, 2, 3 and mammalian models.4, 5 These data have implications for public health, as the postnatal period offers a window during which optimizing nutrition and/or growth rates could reap lifelong benefits. Thus, identifying early markers of rapid infancy weight gain, a potentially modifiable risk factor for obesity and type 2 diabetes, may eventually permit interventions targeting children at high risk for metabolic disease.

Metabolomic analysis allows comprehensive quantification of hundreds of nutrients, metabolic intermediates and small molecules from biological samples, and has proven to be a powerful tool for biomarker discovery. Recent metabolomic analyses have linked higher plasma levels of branched-chain and aromatic amino acids to subsequent development of type 2 diabetes in adolescents and adults.6, 7 We are not aware of any studies linking cord blood metabolomics to obesity or its early life risk factors. However, the cord blood metabolome does appear to be related to important clinical outcomes. For example, distinct cord blood signatures characterize infants with intrauterine growth restriction, notably higher phenylalanine and citrulline, and lower glutamine, choline and glucose.8 Reductions in cord blood phospholipids have also been reported to predict early childhood type 1 diabetes risk.9

We hypothesized that the cord blood metabolome would differ in newborns with subsequent development of early childhood obesity. To identify early biomarkers of rapid weight gain during infancy and childhood, we analyzed plasma metabolomic patterns in venous umbilical cord blood collected at delivery from infants who subsequently experienced or did not experience accelerated postnatal weight gain and childhood obesity.

Subjects and methods

Study population and measurements

We analyzed a subset of participants in Project Viva, a longitudinal cohort study of mothers and children. Detailed experimental protocols for Project Viva have been previously published.10 We recruited pregnant women during the first trimester from eight obstetric offices of Harvard Vanguard Medical Associates in Boston, MA, USA, between 1999 and 2002. Eligibility criteria included singleton pregnancy, ability to answer questions in English and initial prenatal visit before 22 weeks gestation. In-person study visits occurred at recruitment (median 9.9 weeks gestation), mid-pregnancy (median 28.1 weeks gestation), just after delivery, during infancy (median 6.3 months), early childhood (median 3.3 years) and mid-childhood (median 7.7 years); detailed survey data were collected annually at intervening time points. At each visit, research assistants measured weight with a digital scale and length (newborn and infancy) or height (early childhood and mid-childhood) were measured using a calibrated stadiometer. We used 2000 CDC reference data (http://www.cdc.gov/growthcharts/) to calculate weight-for-age Z-scores and body mass index (BMI) percentiles. We assessed body composition in mid-childhood by dual-energy X-ray absorptiometry (Discovery A, Hologic, Bedford, MA, USA). The dual-energy X-ray absorptiometry scanner was monitored daily for quality control, by scanning a standard synthetic spine. We used Hologic software QDR version 12.6 for analysis. A single trained investigator checked all scans for positioning, movement and artifacts, and defined body regions for analysis. Intrarater reliability on duplicate measurements was high (r=0.99).

We obtained written informed consent from the mother at each encounter and verbal assent from children at the mid-childhood visit. The Harvard Pilgrim Health Care Institutional Review Board approved all procedures.

Identification of cases and controls

We identified 26 cases based on accelerated early postnatal weight gain, defined as the top quartile of change in weight for age from 0 to 6 months in Project Viva and overweight status (BMI >85th percentile according to CDC 2000) in mid-childhood. Controls were sex matched and selected based on normal early postnatal weight gain, defined as the 2nd or 3rd quartile of change in weight for age from 0 to 6 months in Project Viva and normal weight (BMI between 25th and 75th percentile, CDC 2000) in mid-childhood.

Cord blood collection

Using a standard protocol, obstetricians and midwives collected cord blood from the umbilical vein at delivery. Whole blood was collected in EDTA tubes, refrigerated for <24 h and centrifuged at 2000 r.p.m. at 4 °C for 10 min. Plasma aliquots were stored in liquid nitrogen until shipment on dry ice to Metabolon, Inc. (Durham, NC, USA) for metabolomic analysis. Aliquots of cord blood were also used for analysis of adiponectin and leptin concentrations, using commercial assay kits (RIA, Linco Research Inc., St. Charles, MO, USA).

Metabolomic analysis

We performed untargeted metabolomic profiling using multi-platform mass spectroscopy (Metabolon, Inc.). Detailed descriptions of sample preparation, mass spectroscopy and automated metabolite identification procedures have been previously published.11, 12 We obtained semi-quantitative concentrations (expressed in arbitrary units) for 415 known biochemicals. We replaced undetectable values with 0.5 × minimum value for each metabolite. We then log2 transformed the metabolite concentrations to minimize the influence of outliers. We excluded from the analysis 27 metabolites with undetectable values in >50% of samples.

Statistical analysis

To determine which cord blood metabolites differed in cases versus controls, we calculated fold changes for each biochemical (calculated by exponentiating the mean log2 metabolite level in cases divided by the exponentiated mean log2 metabolite level in controls) and performed two-sided unpaired Student’s t-tests using JMP Pro v10 software (SAS, Cary, NC, USA). We calculated false discovery rate Q-values to account for multiple comparisons (http://genomics.princeton.edu/storeylab/qvalue/);13 Q-value <0.25 was considered significant. We next performed metabolite set enrichment analysis to identify pathways enriched among the most differentially regulated metabolites between cases and controls (MetaboAnalyst 2.0, www.metaboanalyst.ca).14

To assess the overall variability within our data set, we performed principal components analysis using R (www.r-project.org), including only imputed, log-transformed metabolite concentrations and case/control status. Next, to determine to what extent the metabolome distinguished cases from controls, we performed unsupervised hierarchical clustering (MetaboAnalyst 2.0).14 To assess which metabolites contributed most strongly to distinguishing cases from controls, we next used partial least-squares discriminant analysis using R.

We adjusted for potential confounders through multivariate linear and logistic regression analysis (JMP Pro v 10, SAS). After determining which clinical covariates were associated with case versus control status, using bivariate logistic regression, we next built multivariable models including those covariates with the strongest association with case–control status, namely, maternal pre-pregnancy BMI, maternal age, breastfeeding duration and paternal BMI. As paternal BMI did not contribute to the predictive power (whole-model R2) of the multivariable models, we omitted it from subsequent models. We used linear regression to calculate metabolite concentrations adjusted for maternal BMI, maternal age and breastfeeding duration. To test whether cord blood metabolite levels were associated with odds of case versus control status after adjusting for clinical covariates, we then built models incorporating individual metabolites in addition to maternal BMI, maternal age and breastfeeding duration.

Results

Demographic, pregnancy and hormonal characteristics in infants with accelerated postnatal weight gain

Mothers of cases were younger than controls (29.9±5.4 vs 34.9±4.4 years) and had higher pre-pregnancy BMI (27.9±6.7 vs 23.6±3.8 kg m−2). Paternal BMI was also higher among cases (27.6±3.7 vs 25.3±3.5 kg m−2). Thirteen (50%) of the cases were non-white versus seven (27%) of the controls.

Case infants tended to have lower birth weight (3275±458 vs 3487±349 g) and a trend for lower birth weight Z-scores adjusted for gestational age and sex (−0.19±0.79 vs 0.11±0.67). As expected based on our selection strategy, cases had significantly higher weight for age Z-score in infancy (1.25±0.86 vs 0.26±0.59, P<0.0001) and higher BMI percentile in mid-childhood (95.5±4.7 vs 51.5±15.5, P<0.0001). In mid-childhood, both total fat mass and lean mass were higher in cases versus controls. Cases had a shorter duration of breastfeeding (4.0±3.1 vs 8.2±4.0 months, P<0.001). Demographic, pregnancy and infancy characteristics are summarized in Table 1.

Table 1 Maternal, pregnancy and early childhood characteristics

Unsupervised analysis: cord metabolites do not clearly discriminate cases and controls

We detected a total of 415 biochemicals, including 99 amino acids, 33 carbohydrates, 18 cofactors and vitamins, 42 xenobiotics and 153 lipids in cord blood (full list, Supplementary Table 1). To better understand the structure of the cord blood metabolome in cases versus controls, we used unsupervised principal components analysis to identify metabolites contributing the most to observed differences in the data set. Principal components analysis did not clearly separate the two groups (Supplementary Figure 1A). Similarly, unsupervised hierarchical clustering did not fully separate cases from controls, although visual inspection suggests that subsets of cases and controls did cluster together (Supplementary Figure 1B). We next used partial least-squares discriminant analysis to identify metabolites predictive of case versus control status. Two distinct clusters were observed (Supplementary Figure 1C), but the model was not statistically significant, with permuted P-value=0.11.

Individual cord blood metabolites associated with case versus control status

Mean levels of 16 biochemicals were significantly different between cases and controls (nominal P<0.05); none reached a false discovery rate threshold of <0.25. Two metabolites, threonine and homostachydrine, were significantly higher in cases, while 14 metabolites were lower (Figures 1a–d and Table 2A and B). The metabolite with the greatest fold reduction (50% lower, P=0.01) in cases was 4-vinylphenol sulfate. Two of the most significantly reduced metabolites in cases, dimethylglycine (18% lower, P=0.01) and N-acetylmethionine (16% lower, P=0.01) are methyl donors involved in one-carbon metabolism (Figure 1b). Three of the metabolites with lower levels in cases versus controls are related to tryptophan or its metabolism: tryptophan betaine (48% lower, P=0.03), tryptophyl leucine (26% lower, P=0.03) and serotonin (46% lower, P=0.02, Figure 1c). Three of the top-ranking metabolites were xenobiotics found in plants, including homostachydrine (abundant in alfalfa and citrus; 54% higher in cases, P=0.007), N-methyl proline (citrus; 33% higher in cases, P=0.04) and tryptophan betaine (legumes).

Figure 1
figure 1

Selected top-ranking metabolites in categorical analysis of cases versus controls. (a) Upregulated metabolites. N=26 per group, P<0.05. (b) Methyl donors. N=26 per group, P<0.05. (c) Tryptophan derivatives. N=26 per group, P<0.05. (d) Plant components. N=26 per group, P<0.05. (e) Glutamine to glutamate ratio. N=25 controls and 26 cases, P<0.05. Open circles denote controls; filled circles denote cases.

Table 2A Log2-transformed mean metabolite levels in cord blood, top-ranking in unadjusted analysis
Table 2B Log2-transformed mean metabolite levels in cord blood, top-ranking in adjusted analysis

Given that several clinical characteristics linked to childhood obesity risk differed between cases and controls (that is, maternal BMI, maternal age and breastfeeding duration), we next used linear regression to calculate metabolite concentrations adjusted for these potential confounders. A total of 17 metabolites were significantly altered in cases versus controls at adjusted P<0.05 (Table 2B). Several of the metabolites identified in our raw comparison of cases versus controls remained significant after adjustment, including homostachydrine, 4-vinylphenol sulfate, N-acetylmethionine, serotonin, tryptophyl leucine and 1-stearoyl-GPE (adjusted P<0.05). Of note, 5 of the 17 top-ranking metabolites in the adjusted analysis were related to tryptophan, including serotonin, tryptophan, tryptophyl leucine, N-acetyltryptophan and indolelactate. We also observed lower levels of metabolites related to branched-chain amino acids, including leucine, α-hydroxyisovalerate and α-hydroxyisocaproate, in cases.

Pathway analysis of cord metabolites associated with case versus control status

To test whether cord blood metabolites that differed in cases with accelerated postnatal weight gain versus controls belonged to known biological pathways, we performed metabolite set enrichment analysis. Several pathways were enriched (false discovery rate <0.25). The most significantly enriched pathways were ‘Tryptophan Metabolism’ (P=0.02) and ‘Excitatory neural signaling through 5HTR4/6/7 and serotonin’ (P=0.02) (Supplementary Table 2). Also highly enriched were pathways associated with metabolism of other amino acids, including aspartate and β-alanine (P=0.02 and P=0.03, respectively). Metabolites involved in metabolism of betaine, an organic osmolyte and methyl donor, were also significantly enriched (P=0.02).

Given that several pathways related to amino acid metabolism were altered, we examined the ratio of glutamine to glutamate, which reflects glutamine synthase activity and has been linked to risk of obesity, insulin resistance and diabetes in humans.15 Plasma from cases tended to have a lower glutamine:glutamate ratio (P=0.08), a difference that became significant (33% lower, P=0.046) after excluding an outlier in the control group, with glutamine levels >6 s.d. below the mean (Figure 1e).

Cord blood metabolites predicted case versus control status above and beyond clinical risk factors

As infants with accelerated postnatal weight gain and overweight status in mid-childhood (‘cases’) differed from controls with regard to maternal pre-pregnancy BMI, maternal age, breastfeeding duration and other covariates that could influence obesity risk, we next investigated whether cord blood metabolites were associated with the odds of being a case independent of these potential confounders. We therefore used logistic regression to identify clinical covariates predictive of case versus control status and then tested whether cord blood metabolite levels independently contributed to the risk of being a case above and beyond the clinical risk factors. Using bivariate logistic regression, we found that breastfeeding duration (odds ratio: 0.74, 95% confidence interval: 0.62–0.89, P=0.001) and maternal age (odds ratio: 0.81, 95% confidence interval: 0.71–0.93, P=0.003) were associated with reduced odds of being a case, whereas pre-pregnancy maternal BMI (odds ratio: 1.19, 95% confidence interval: 1.04–1.36, P=0.01) and paternal BMI (odds ratio: 1.21, 95% confidence interval: 1.01–1.45, P=0.04) were the most significant positive predictors of case versus control status (Table 3A). We next built stepwise multivariate logistic regression models and found that breastfeeding duration, maternal age and pre-pregnancy BMI were independently associated with case versus control status, whereas the addition of covariates such as newborn weight, race, sex and paternal BMI contributed little to the strength of the models. We sequentially added each of the top-ranking metabolites identified in our adjusted and unadjusted analyses (Table 2A and B) to these logistic regression models, to determine whether cord blood metabolites improved the ability to predict case versus control status.

Table 3A Associations of maternal and infant characteristics with odds of case versus control status

Many of the top-ranking metabolites substantially improved the strength of the logistic regression models. For example, adding cord blood levels of N-acetylmethionine to a model including breastfeeding duration, maternal age and pre-pregnancy BMI improved the ability to discriminate cases from controls (R2 for model without metabolite: 0.39, P<0.0001; R2 for model with N-acetylmethionine: 0.54, P<0.0001; Table 3B). Similarly, adding metabolites related to tryptophan (for example, tryptophyl leucine, tryptophan and indolelactate) also improved prediction ability, as did metabolites related to branched-chain amino acid metabolism (for example, α-hydroxyisocaproate, α-hydroxyisovalerate and leucine). Moreover, the associations between such metabolites and the odds of case versus control status were independent of the clinical covariates (Table 3B). Only the metabolites that improved the strength of the logistic regression models and contributed independently to the odds of case/control status are included in Table 3B.

Table 3B Associations of maternal and infant characteristics with odds of case versus control status

Discussion

Among cases with rapid infancy weight gain and overweight in mid-childhood versus controls without rapid weight gain and no childhood overweight, we found that mean levels of 16 cord blood metabolites were significantly different. Pathway analysis revealed that biochemicals related to tryptophan and one-carbon metabolism were enriched among the differentially regulated metabolites. Moreover, the cord blood ratio of glutamine:glutamate was lower in cases versus controls. Our analysis is limited by the fact that several clinical characteristics associated with childhood BMI, including maternal age, maternal BMI and breastfeeding duration, differed between cases and controls. Although many of the differentially regulated metabolites survived adjustment for identified covariates, we cannot exclude the possibility that additional factors related to maternal metabolism or breastfeeding that we did not measure or adjust for might have contributed to the observed differences. Moreover, no individual metabolites in our categorical analysis had significant false discovery rates. Logistic regression demonstrated that maternal age, pre-pregnancy BMI and breastfeeding duration were associated with increased odds of being a case versus a control; adding cord blood metabolites related to one-carbon and tryptophan metabolism to these prediction models improved the ability to differentiate cases with early childhood obesity risk from controls. Our results therefore suggest that cord-blood metabolomic signatures may be associated with early childhood obesity trajectories. However, the cross-sectional nature of our analysis and the potential influence of maternal or dietary factors on the cord metabolome preclude assessment of whether these metabolic pathways are causally related to infancy weight gain.

Although many groups have used metabolomic profiling to identify disease biomarkers in adults at risk for insulin resistance,15 diabetes7 and cardiovascular disease,16 few studies have examined plasma metabolomic patterns in infancy or childhood.6, 17, 18 A recent study found that cord blood phospholipids were predictive of early-onset type 1 diabetes.9 Cord blood metabolomic markers are also linked to neonatal hypoxia, prematurity and intrauterine growth retardation.8, 19, 20 To our knowledge, our study is the first to characterize the cord blood metabolome of infants at risk for childhood obesity, before obesity onset.

Our data suggest that differences in maternal diet are associated with postnatal weight gain trajectories in infancy. Indeed, several of the top-ranking metabolites altered between cases and controls were food constituents—mostly plant products. For example, homostachydrine, higher in cases, is linked to dietary intake of citrus21 and dairy products;22 N-methyl proline, reduced in cases, is linked to citrus consumption;22 4-vinylphenol sulfate, lower in cases, is a styrene metabolite found in wine, cider, berries and peanuts;22 and tryptophan betaine, reduced in cases, is abundant in legumes. Moreover, pyridoxate, a vitamin B6 metabolite reduced in cases, is linked to multivitamin consumption, and maltose, reduced in cases, is a disaccharide abundant in many foods. Another metabolite with lower levels in cases, 3-carboxy-4-methyl-5-propyl-2-furanpropanoate (CMPF), is a uremic toxin, but also correlates with dietary intake of green vegetables, garlic, onions and fish.22 CMPF has been described to compete with penicillin for binding to the renal tubule transporter rOat3; it is unclear whether a trend for higher penicillin G levels in cases versus controls (fold change: 2.83, P=0.22, not shown) might have contributed to observed reductions in CMPF in cases.23 Interestingly, elevations in CMPF have recently been described in the setting of impaired glucose tolerance, gestational diabetes, and type 2 diabetes; in mice, CMPF directly inhibits β-cell insulin secretion.24 Such data contrast with our observation of decreased CMPF levels in cord blood from cases, despite a similar prevalence of gestational diabetes in both groups. Future studies will be required to assess transplacental passage and metabolism of this potentially important metabolite. Taken together, the numerous food and plant components among the most differentially regulated metabolites between cases and controls are largely consistent with the literature linking maternal dietary patterns (both during and after pregnancy) with offspring obesity.25

A unique feature of our data is that it implicates specific nutrients and metabolites as potential contributors to early childhood obesity risk. To what degree such signatures might stem from differences in maternal BMI, which differed in cases and controls, or from differences in feeding/fasting in the hours leading up to delivery, versus long-term dietary patterns, is not clear.

Our results point to an association between cord blood levels of metabolites related to tryptophan and one-carbon metabolism, and accelerated postnatal weight gain. Three of the 16 significantly altered metabolites in cases versus controls are structurally related to or metabolized from tryptophan. Moreover, 2 of the 16 metabolites that differed in cases and controls were methyl donors and metabolite set enrichment analysis revealed ‘Betaine Metabolism’ as one of the top-ranking pathways altered between cases versus controls (betaine is diet-derived methyl donor). Associations of biochemicals related to tryptophan and obesity trajectories in our analysis are intriguing in light of the multiple lines of evidence linking serotonin to appetite regulation, β-cell development and brown adipose tissue thermogenesis, as well as links between diet-derived tryptophan betaine and sleep regulation.26, 27, 28, 29, 30 Metabolites involved in one-carbon metabolism (for example, dimethylglycine, methionine and betaine) have also recently been implicated in the pathogenesis of obesity, insulin resistance and fatty liver15,31 in addition to an established role in the regulation of epigenetic marks (for example, DNA and histone methylation). However, it would be premature to conclude, from our analysis of cord blood, that these metabolite patterns are causally linked to obesity pathogenesis. Rather, these relationships may result from differences in dietary patterns among the mothers of cases, or from other differences between cases and controls that were not accounted for in our adjusted analyses. It will be important for future studies to further explore whether tryptophan or one-carbon metabolism are linked to childhood obesity risk.

As our categorical and pathway analyses highlighted differences in the levels of several amino acids and their derivatives, we examined whether the glutamine:glutamate ratio—a marker for the activity of glutamine synthetase—differed in cases versus controls. Cases had a reduced glutamine:glutamate ratio. Other groups have described reductions in cord-blood glutamine levels and in the glutamine:glutamate ratio in low birth weight infants,8, 32 but this has not, to our knowledge, previously been assessed in the setting of early childhood obesity risk. Plasma glutamine levels fluctuate in response to stress and are reduced during acute infection or trauma;33 the reduction in cord glutamine:glutamate ratio we observed in cases versus controls might therefore reflect greater stress around the time of delivery. These associations are intriguing in light of an emerging literature linking reductions in plasma glutamine and in the glutamine:glutamate ratio, to increased risk of diabetes, insulin resistance and hypertension,15 as well as a recent study linking genetic polymorphisms in glutamate metabolism to cardiovascular risk in individuals with type 2 diabetes.34 As we did not measure cord blood insulin or c-peptide levels, we could not directly examine associations between infant hyperinsulinemia or cord homeostatic model assessment of insulin resistance (HOMA-IR) and glutamine:glutamate ratio, or other metabolites. This will be an important area for future analyses.

Our study has several limitations. First, the observational nature of the analysis precludes causal inferences. Second, relatively few metabolites were significantly altered by case versus control status—only 16 out of 415 detected metabolites were altered at P<0.05 in our unadjusted analysis and only 17 were significantly altered (P<0.05) after adjustment for covariates. The small sample size likely limited our power to detect differences. Moreover, clinical heterogeneity between groups, such as differences in maternal BMI, may have contributed to differences in the cord blood metabolome. Future studies using propensity-matched cases and controls could address this possibility. We recognize the possibility that the cord blood metabolome may have limited ‘predictive’ significance, instead providing a snapshot of the immediate developmental and metabolic status of the infant. This possibility is supported by the observation that relatively few metabolites (n=13) were significantly correlated with the change in weight from 0 to 6 months, yet large numbers of cord blood metabolites were significantly associated with newborn adiposity (n=42) and newborn T4 (n=34; not shown). It is also unclear to what extent the cord blood metabolome is influenced by maternal diet immediately before delivery as opposed to maternal, fetal or placental metabolism. Despite these limitations, pathway analysis allowed us to identify concordant regulation of both methyl donor and tryptophan-related metabolites. Some of the strengths include the detailed assessments of mothers during pregnancy and the extensive information captured about clinical and environmental factors that may influence childhood obesity risk.

In summary, we examined the cord blood metabolome in infants with and without accelerated early childhood weight gain, to better understand the nutritional and metabolic determinants of early-onset obesity risk. Although we found few overall differences in the cord metabolome of cases versus controls, several metabolites linked to tryptophan and one-carbon metabolism were significantly reduced in cord blood from cases with accelerated postnatal weight gain. Moreover, the ratio of glutamine to glutamate, which has recently been linked to insulin resistance and cardiometabolic risk, was lower in cases versus controls. Our data raise the possibility that perturbed metabolite patterns may already be present in cord blood of infants at risk for rapid postnatal weight gain, before obesity onset. Elucidating the mechanistic basis for these metabolite alterations and their potential role in obesity pathogenesis will require future study.