Introduction

Childhood overweight and obesity is a complex, multifactorial disease that is associated with numerous adverse health outcomes. These include an increased risk of elevated blood pressure, lipid or glucose metabolism disorders, type 2 diabetes, cardiovascular disorders, and cancer later in life.1 In addition, children with overweight and obesity suffer a notable decline in quality of life2 and are more likely to face bullying.3

The causes of overweight in childhood are diverse, with early life exposures playing a key role, particularly during prenatal and postnatal developmental stages. For instance, altered intrauterine conditions, influenced by maternal overweight, obesity and excessive gestational weight gain (GWG)4,5 can lead to epigenetic changes in fetal metabolism. These changes have been shown to elevate the risk of high birth weight, large for gestational age, rapid infant weight gain, and overweight and obesity later in life.6,7 Additionally, maternal overweight and obesity may impair offspring neuroendocrine regulation, neuronal pathways, and brain structure due to intrauterine exposure to an excess of circulating nutrients as well as elevated levels of certain hormones and inflammatory cytokines. It is posited that these factors may contribute to later neurobehavioral problems.8,9 On the other hand, adhering to a healthy lifestyle during pregnancy, along with a healthy pre-pregnancy body mass index (BMI)10 and breastfeeding,11 has been linked to lower childhood obesity risk. Further, optimizing nutrition during fetal and early postnatal stages can enhance child neurodevelopment and brain function.12

Recent research has focused on interventions aimed at disrupting the transmission of intergenerational obesity and promoting healthy child development.13,14 In the GeliS (“healthy living in pregnancy”) trial, pregnant women in the intervention group (IG) received lifestyle counseling compared to standard care in the control group (CG).15 While the IG women improved some health behaviors,16,17,18 it did not affect the proportion of women with excessive GWG.19 Follow-up analyses on the children of GeliS participants found no intervention effects on anthropometry and neurodevelopment until three years of age.20,21 In this analysis, we examine whether the GeliS intervention conferred any long-term effects on children’s somatic growth and neurodevelopment at 4 and 5 years of age, hypothesizing that the IG will show lower prevalence of overweight and obesity and better neurodevelopmental outcomes.

Subjects and methods

Design and setting of the GeliS study

The GeliS study is a prospective, cluster-randomized, controlled, open intervention trial that integrated lifestyle counseling alongside standard German antenatal care. The trial was carried out across ten regions within Bavaria, Germany, that were randomly assigned as either intervention or control areas. Detailed information on the study design and cluster-randomization can be found in the published study protocol.15 The primary aim was to decrease the proportion of women who gained excessive gestational weight according to the thresholds established by the National Academy of Medicine22 and to decrease the incidence of gestational diabetes mellitus. Results on primary and secondary outcomes are published elsewhere.16,17,18,19,20,21,23 The study adhered to local regulatory standards and followed the principles outlined in the Declaration of Helsinki. The study protocol was approved by the Technical University of Munich Ethics Committee (project number 5653/13), and was registered in the ClinicalTrials.gov Protocol Registration System (NCT01958307).

Participants and lifestyle intervention program

A total of 71 gynecological and midwifery practices actively participated in the recruitment of eligible pregnant women from 2013 to 2015. Inclusion criteria comprised the following: having a pre-pregnancy BMI ranging from ≥ 18.5 kg/m² to ≤ 40.0 kg/m², aged between 18 and 43 years, gestational age less than the 12th week, singleton pregnancy, proficiency in German, and provision of informed consent. Women experiencing severe complications during pregnancy were excluded from the study.15 During the follow-up, women were characterized as drop-outs if they failed to provide contact details, became unreachable, or withdrew from participation.23

Participants allocated to the IG underwent an antenatal lifestyle intervention program, which consisted of four face-to-face counseling sessions provided in the participating practices. These sessions were integrated into routine antenatal care visits and were delivered by medical personnel, midwives, or gynecologists who received prior training on study materials by the research staff. Three counseling sessions were scheduled during pregnancy (12th–16th, 16th–20th, 30th–34th week of gestation), and were 30 to 45 min in duration. Additionally, a counseling session was conducted at 6–8 weeks postpartum.

The counseling addressed various topics, including adherence to appropriate GWG according to the National Academy of Medicine recommendations,22 guidance on maintaining a healthy diet and engaging in physical activity based on national and international recommendations,24,25 the importance of a healthy antenatal and postnatal lifestyle, the benefits of breastfeeding over formula, and supportive information on introducing complementary feeding. A comprehensive overview of the lifestyle intervention program and counseling content is available in the study protocol.15 Women in the CG received routine antenatal care with the addition of an informational flyer containing general guidance on maintaining a healthy lifestyle during pregnancy and lactation. Both CG and IG participants were enrolled in a 5-year follow-up observation program, during which GeliS researchers collected data on mother-child pairs through phone interviews and mailed questionnaires at 1, 3, and 5 years after childbirth.

Collection and processing of data

Maternal sociodemographic data was obtained through a screening questionnaire, administered at recruitment before the 12th week of gestation. Pre-pregnancy BMI was calculated using self-reported pre-pregnancy weight and height. Maternal weight during pregnancy was derived from maternity records. GWG was determined by calculating the difference between the last measured weight before delivery and the initial measured weight at recruitment. Information on the current smoking status of mothers was gathered from a questionnaire completed by participants five years after giving birth.

Birth date and anthropometric measurements were obtained from birth records. The child’s age was calculated by subtracting the date of birth from the respective data collection time point, and single imputation was applied in cases where questionnaire completion dates were missing.

Well-child health visits are planned between 46–48 months and 60–64 months of life as part of routine pediatric health examinations in Germany. Anthropometric data collected during these visits is recorded in a check-up booklet. The data were provided by participants during phone interviews around the child’s 5th birthday and were used to calculate age- and sex-specific percentiles and z-scores based on a German reference group.26 BMI-for-age percentiles below 10.0 classified children as having underweight, percentiles above 90.0 as having overweight, and percentiles above 97.0 as having obesity.26

Neurodevelopmental data for 5 year-old children was collected with the German 60-month version of the validated Ages and Stages Questionnaire (ASQ-3TM), which was mailed to participants for them to complete independently.27 The ASQ consists of five developmental domains: communication, gross motor skills, fine motor skills, problem-solving and personal-social development. Each domain comprises six questions focusing on age-appropriate development. Answers are given numerical values according to the following: “yes” responses ( = 10 points) indicate that the child has mastered the task, “sometimes” ( = 5 points) means that the task is not frequently mastered, or “not yet” ( = 0 points) means that the child has not mastered the task. For domains with two or fewer missing responses, the mean value of the remaining responses was computed. If three or more responses were missing, no score was given for that domain and was excluded from analysis. Each of the domains was scored separately and scores were tallied and compared to unique cut-off scores between 0 and 60. Scores above the cut-off values indicated that the child’s development was on target for their age, whereas scores below cut-off values suggest a potential developmental delay in the corresponding domain. In accordance with the user’s guide,27 the following cutoff scores are assigned for each domain: 33.19 for communication, 31.28 for gross motor skills, 26.54 for fine motor skills, 29.99 for problem-solving, and 39.07 for personal-social development.27

Statistical analysis

The power calculation for the GeliS study was based on the primary endpoint of excessive GWG.15 This study describes findings from intention-to-treat secondary analyses and included all participants who provided child anthropometric and/or neurodevelopment data. Between-group differences in child anthropometric values were computed using likelihood-based mixed models for repeated measures, as described by Bell et al.28 Data from all well-child health visits, conducted annually between the ages of 1 and 5 years, was included. Customized hypotheses were applied to investigate group differences at 4 and 5 years old. Point estimates and 95% confidence intervals (CI) for the mean differences between the IG and CG at 4 and 5 years of age were calculated in models that included visit number, group assignments, and their interaction terms. These models were adjusted for study region as well as various modifiable and non-modifiable factors based on previously observed group differences,20,21,29 including maternal pre-pregnancy age, maternal pre-pregnancy BMI category, parity and child sex. Additionally, child age in days at the respective visit was an adjustment covariate to account for deviations from the suggested visit time point. Weight categories (i.e. underweight, normal weight, overweight and obesity) were determined by BMI-for-age percentiles at 4 and 5 years of age. Between-group differences were investigated using proportional odds ordinal logistic regression models. These models were constructed using generalized estimating equations, as per the approach outlined by Donner et al.30 The analysis was adjusted for maternal pre-pregnancy age, pre-pregnancy BMI category, and parity. Analyses of age- and sex-specific percentiles and z-scores were not adjusted for the sex and age of the child.

Differences in ASQ scores were examined through linear regression models, employing generalized estimating equations. These models were also adjusted for maternal pre-pregnancy age, maternal pre-pregnancy BMI category, parity and child sex. Adjustments for child age in months at completion were incorporated to accommodate deviations from the intended age range for the 60-month version of the ASQ. Additionally, maternal educational level was included as an adjustment covariate for consistency with both the previous GeliS publication on neurodevelopment21 and other randomized controlled trials (RCTs) in the research field.31 Binary logistic regression models utilizing generalized estimating equations were applied to explore group differences regarding the proportion of infants with ASQ scores falling below cut-off values. These models were unadjusted due to the small number of observations.

All analyses were performed using SPSS software (IBM SPSS Statistics for Windows, version 29.0, IBM Corp, Armonk, NY), and statistical significance was defined as two-tailed p values < 0.05. Given the exploratory character of the analyses, no corrections for multiple testing were implemented.

Results

The flow of GeliS participants

From the initially recruited 2 286 women for the GeliS study, 1647 women were included in the 5-year follow-up. For 124 women from the IG and 116 women from the CG, loss to follow-up occurred for various reasons. In five participants, information was missing, resulting in the 1403 mother-child pairs eligible for analyses of anthropometric and neurodevelopment data of the children (Fig. 1).

Fig. 1: Flow of GeliS participants.
Fig. 1: Flow of GeliS participants.
Full size image

GeliS Gesund leben in der Schwangerschaft/healthy living in pregnancy, FU Follow-up.

Baseline characteristics of mother-child pairs

Table 1 presents the characteristics of the mother-child pairs considered in the present analysis. In total, 67.0% of women began their pregnancy with normal weight, 22.3% with overweight, and 10.7% with obesity. Maternal pre-pregnancy age, weight, BMI, and GWG were similar between the IG and CG women. The percentage of primipara women was higher in the IG compared to the CG (65.0% vs. 55.6%, p = 0.004), which was the only statistically significant difference in maternal baseline characteristics (Table S1).

Table 1 Characteristics of mother-child pairs.

Adverse pregnancy outcomes, such as preterm birth, small for gestational age, large for gestational age, and high birth weight, were largely comparable between the groups. The proportions of mother-infant pairs lost to follow-up were also similar (Table S2). However, characteristics of women lost to follow-up differed slightly from those who remained, such as pre-pregnancy weight, GWG, education, country of birth, and parity. Nonetheless, infant birth outcomes were comparable between those groups (Table S2).

Child anthropometrics

Table 2 presents the child anthropometric data at 4 and 5 years of age. At 4 years old, a statistically significant group difference was observed in weight percentiles (adj. effect size 2.99, 95% CI 0.18 to 5.80, p = 0.037) between IG and CG, but not in mean weight (adj. effect size 0.19, 95% CI −0.03 to 0.40, p = 0.091) or weight z-scores (adj. effect size 0.09, 95% CI 0.00 to 0.19, p = 0.053). At 5 years of age, there were significant group differences in mean weight (adj. effect size 0.33, 95% CI 0.04 to 0.61, p = 0.023), weight z-scores (adj. effect size 0.12, 95% CI 0.02 to 0.21, p = 0.021), and weight percentiles (adj. effect size 3.49, 95% CI 0.62 to 6.36, p = 0.017). Mean height, respective sex- and age-specific outcomes, and mean head circumference at both ages were similar between groups. At 5 years old, there were significant between-group differences in mean BMI (adj. effect size 0.19, 95% CI 0.04 to 0.34, p = 0.013), BMI z-scores (adj. effect size 0.12, 95% CI 0.02 to 0.21, p = 0.015), and BMI percentiles (adj. effect size 3.39, 95% CI 0.57 to 6.21, p = 0.018).

Table 2 Child anthropometrics at 4 and 5 years of age.

The prevalence rates for underweight, normal weight, overweight and obesity among 4-year-old children were 9.4%, 84.3%, 5.4% and 1.1%, respectively. At 5 years of age, these rates were 8.5%, 85.2%, 4.3% and 2.2%. The odds for being in a higher weight category were significantly different for children in the IG compared to the CG at 4 years old (adj. OR 1.51, 95% CI 1.08 to 2.12, p = 0.017), however, this difference was no longer significant at 5 years old (adj. OR 1.25, 95% CI 0.98 to 1.59, p = 0.075) (Table 2).

Child neurodevelopment

Table 3 shows the ASQ scores for the five different domains for 5-year-old children. Communication (p = 0.143), gross motor skills (p = 0.627), fine motor skills (p = 0.523), problem-solving (p = 0.628), and personal-social development (p = 0.383) scores were comparable in both groups with no statistical group differences.

Table 3 Child neurodevelopment assessed by ASQ scores in the five domains at 5 years of age.

The percentage of children in both groups achieving scores below the cut-off values in communication (IG 1.8% vs. CG 1.6%, p = 0.912), gross motor skills (IG 0.7% vs. CG 1.1%, p = 0.531), fine motor skills (IG 2.9% vs. CG 3.1%, p = 0.767), problem-solving (IG 0.4% vs. CG 0.2%, p = 0.873) and personal-social development (IG 0.5% vs. CG 0.4%, p = 0.699) were similar, with no evidence of intervention effects in unadjusted models (Table 4).

Table 4 Proportion of children with ASQ scores below cut-off at 5 years of age.

Discussion

The aim of this study was to assess whether somatic growth and neurodevelopment differed between children whose mothers participated in an antenatal lifestyle intervention compared to those who received standard antenatal care. Our results showed that children in the IG had significantly higher odds of being in a higher weight category at 4 years old compared to the CG. This finding was also observed at 3 years old, as reported by Spies et al.21 At 5 years old, IG children continued to show higher somatic growth, with statistically significant higher weight and BMI outcomes. In fact, these children showed consistently lower rates of underweight and higher rates of overweight and obesity compared to the CG. In this analysis we observed no appreciable differences in domain-specific ASQ scores between IG and CG children at 4 and 5 years old. These results contrast with earlier findings, which showed minor between-group differences at 3 years old.21 In both groups, the proportion of children scoring below the cut-off in any domain, indicating a risk for neurodevelopmental delays, was very low and ranged from 0.2–3.1%. To summarize, earlier GeliS follow-up assessments of children from birth to 3 years showed no intervention effect on the prevalence of overweight or obesity, nor on neurodevelopment outcomes.20,21 This current analysis further supports the lack of meaningful clinical impact on somatic growth and neurodevelopment in preschool-aged children. Our hypothesis that the gestational period represents an opportunity for interventions to reduce the risk of childhood obesity and positively influence neurodevelopment could not be supported by our findings.

One possible reason for the GeliS lifestyle program’s lack of effectiveness on long-term childhood outcomes is its failure to reduce the proportion of women who gained excessive gestational weight.19 Despite following the GeliS program curriculum, it is very likely that the healthcare practitioners responsible for counseling women used different approaches and varied in the information they provided. While the method of counseling was inherently part of the public health aspect of the GeliS study, these inconsistencies may have reduced the program’s overall effectiveness.19 Although we observed moderate improvements in maternal dietary habits and physical activity,16,17,18 these did not result in significant improvements on long-term somatic growth and neurodevelopment of the offspring. Since an elevated risk of childhood adiposity increases with higher maternal GWG,5 more effective interventions aimed at reducing GWG might have the potential to positively impact childhood growth and obesity risk.

Previous systematic reviews have reported that antenatal lifestyle interventions are associated with reduced GWG,32,33 but observed group differences are likely too modest to have a clinically significant impact on outcomes for both women and children. Our findings are consistent with recent systematic reviews that examined antenatal intervention trials which were conducted in various countries and utilized different intervention modalities and intensities. For instance, Raab et al.14 found no sustained effects of antenatal lifestyle interventions on offspring anthropometry from 1 month to 7 years of age. This lack of positive impact was consistent in subgroup analyses examining intervention content and duration. Similarly, an individual participant data meta-analysis on antenatal lifestyle interventions for high-risk populations of women with overweight and obesity found no effects on infant anthropometry nor neurodevelopment outcomes at 3 to 5 years of age.13

Although most GeliS participants began their pregnancies in the normal weight range, similar negative findings have also been reported in RCTs focusing on women with overweight and/or obesity. For example, the Australian LIMIT trial showed no effect on child growth, adiposity, or neurodevelopment, at 6 months,34 18 months35 and 3 to 5 years of age.36 Long term follow up of this cohort continued to show no effect on childhood obesity risk or secondary developmental measures at 8 to 10 years of age.37 Similar results were reported from the UPBEAT trial38 which demonstrated no lasting impact on child growth or adiposity, despite the relatively intense prenatal intervention method. The overall lack of positive effects suggests that postnatal environmental factors may have a stronger influence on weight and adipose tissue development, thereby overriding early prenatal influences.39,40

Two smaller trials have further explored the impact of lifestyle interventions on child neurodevelopment, with mixed results. Menting et al.41 found that two RCTs conducted in women during the preconception and prenatal periods resulted in no significant improvements on neurodevelopment in a subsample of children aged 3 to 6 years. In contrast, Braeken and Bogaerts31 reported that children whose mothers participated in a brochure-based lifestyle intervention had improved temperament at 3 to 7 years, compared to those whose mothers received routine antenatal care along with additional lifestyle intervention sessions. Overall, the comparability of results is complicated by the fact that studies recruit diverse study populations, take place in various settings, and use a variety of measurement tools, which limits the ability to draw conclusions. Moreover, the lack of consistency in how researchers measure obesity is problematic and has recently been addressed in a systematic review advocating for the agreement of a core outcome set.42 In summary, it appears unlikely that current lifestyle interventions in pregnancy have a long-lasting effect on children’s somatic growth or neurodevelopment. Further research is needed to explore whether more holistic and personalized approaches in healthcare settings can produce lasting effects on child development and obesity risk. This effort should most likely begin earlier and focus on interventions that seek to optimize women’s health and weight before conception, due to the strong link between maternal pre-pregnancy BMI and offspring adiposity. Another approach is exploring how technology can enhance the efficiency and impact of interventions for child health.43 For instance, mobile application-supported adaptive interventions, including tele-coaching, may offer a promising way to facilitate changes in maternal lifestyle behaviors. Additionally, tailored interventions based on rigorous early pregnancy risk assessment,44 and involving both parents or the entire family45,46 show potential for improving outcomes. Finally, robust public health policies are crucial, acknowledging the role that environmental factors have on the health of individuals, families, and communities.

The follow-up analysis has several strengths worth mentioning. The GeliS trial is one of the largest RCTs worldwide evaluating a comprehensive lifestyle intervention conducted within the framework of routine antenatal care. Including pregnant women from different BMI categories broadens the applicability of findings. The long-term follow-up for mother-child pairs at 1, 3 and 5 years gave us access to data on weight, length and head circumference throughout early childhood. Thus, this paper extends our understanding by addressing data up to preschool age, a period critical for adiposity rebound which can predict later obesity.47,48 A particular strength of the study is that more than two thirds of participants from the original GeliS cohort provided full datasets of their children at 5 years of age. This signifies an exceptionally high retention rate during the follow-up phase in comparison to other lifestyle intervention trials.38

Despite the findings presented, it is important to consider several limitations. First, we observed differences in some characteristics between mothers who declined participation during the five-year follow-up and those who continued (Table S2). While statistical models adjusted for some of these characteristics, we cannot completely rule out the possibility of confounding. Additionally, child height, weight and head circumference were measured by clinic staff during routine visits, and lack of standardization may reduce comparability. A further limitation was that part of the follow-up period coincided with the COVID-19 pandemic, which led to delays in well-child health visits for 52% of GeliS children, impacting the timing of anthropometric measurements at 5 years old. The ASQ was mailed to participants and completed independently, resulting in delays as well as variations in children’s ages when the ASQ was filled out. To account for these timing discrepancies, we calculated the actual age of the children at each data point collection time and included them as covariates in the analyses. Lastly, we used German-based reference values for z-scores and weight category prevalence estimation, following the recommendation to use national data when available.49 However, this may limit the generalizability of findings to non-German populations.

Conclusion

In summary, our analysis does not demonstrate evidence that an antenatal lifestyle intervention conferred long-term clinical effects on somatic growth or neurodevelopment in children up to the age of five. Future research should focus on whether tailored approaches that target at-risk women lead to long-term health benefits for mothers and offspring.