Introduction

Being born small for gestational age (SGA; birth weight < 10th percentile for gestational age) is associated with multiple adverse health outcomes later in life. Adults born SGA at term have a higher risk of obesity, type 2 diabetes and cardiovascular disease1, poorer mental health2,3, and psychiatric disorders such as anxiety, attention deficit hyperactivity disorder (ADHD) and mood disorders4,5. Multiple studies have also demonstrated structural underdevelopment in the brains of both newborns and adults who were born both preterm and at term with low birth weight6,7,8. In terms of cognitive functions, some studies suggest that individuals born SGA experience more cognitive difficulties and have a lower intelligence quotient (IQ) than their non-SGA peers9,10. However, other studies have not identified differences in cognitive development or functioning between SGA and non-SGA individuals11,12.

Several studies have examined executive functions in individuals born with low birth weight. However, most of this research has been performed in individuals born preterm and/or with very low birth weight (VLBW). These studies report impaired functions across various executive domains among these individuals compared with term-born controls13,14,15,16. Executive dysfunctions, especially in neurocognitive tests that measure performance speed, have also been found in young adults born SGA at term17,18,19. In contrast, other studies show that individuals born SGA late preterm or at term perform similarly as their non-SGA term-born peers in most executive functions20,21.

Most of the research on executive functions relies solely on performance-based measures, with few studies exploring the association between performance-based and self-reported executive functions. However, as self-reports are used to evaluate executive functions in a range of clinical settings22, the relationship between the two measures is highly relevant. Only one Norwegian study by Sølsnes and colleagues23 and one Finnish study by Heinonen and colleagues24 have investigated this objective in low-birth-weight populations. Both studies found that adults born with VLBW, especially those born SGA, scored lower on performance-based tests assessing executive functions compared with the non-SGA control group. However, neither study found differences in self-reported executive functions between the groups, indicating that the individuals born with VLBW did not perceive executive dysfunctions themselves. Furthermore, associations between the performance-based test scores and the self-reports were weaker in the VLBW group compared with the control group.

No studies have examined executive functions in individuals born SGA at term beyond the third decade of life. There is evidence to suggest that the prefrontal cortex, and consequently executive functions, continues to develop throughout the twenties before stabilizing in the thirties25. We aimed to examine if executive function was poorer in adults born SGA at term compared with non-SGA term-born controls and investigate the association between performance-based and self-reported measures of executive function. Based on the previous findings in adult populations born preterm, we hypothesized that adults born SGA would exhibit poorer executive function on performance-based tests, but that they would have similar levels of executive functioning as controls by self-report. Therefore, we expected that the correlation between performance-based and self-reported measures of executive functions would be weaker in the SGA group compared with the control group.

Methods

Study design

This study is part of the NTNU Low Birth Weight in a Lifetime Perspective (NTNU LBW Life) study, where two groups of adults born at term (gestational age ≥ 37 weeks) between 1986 and 1988 have been followed prospectively from birth. One group was born SGA (birth weight < 10th percentile), while the other group was born non-SGA (birth weight ≥ 10th percentile). The mothers of the participants were initially recruited from a multicenter study of pregnant women aimed to assess causes and consequences of intrauterine growth restriction26,27. They were all carrying a singleton and had given birth at least once before. All participants were born to women residing in the same geographical region.

The participants have been examined at several time points from early childhood to adulthood. In the present study, data from the 32-year follow-up were used. Data were collected from September 2019 to January 2021. Examiners were blinded to group status. In addition to examination of executive functions, the participants underwent assessments of motor and visual function, physical fitness28, physical activity29 and health-related quality of life30.

Study population

A total of 1249 women in Trondheim, Norway, consented to participate in the initial multicenter study (Fig. 1). The women were split into three groups. First, a random 10% sample of women were selected for follow-up during pregnancy (n = 132), regardless of risk factors. Next, a sample of women at high risk of delivering an SGA infant were selected for follow-up (n = 390). These women exhibited at least one of five predetermined risk factors: a previous child with low birth weight or perinatal death, a pre-pregnancy weight < 50 kg, chronic maternal disease (i.e. hypertension, heart or renal disease) or smoking at the time of conception. The remaining pregnant women included in the multicenter study from the total eligible population did not receive additional follow-up throughout pregnancy (n = 727). All infants with a birth weight < 10th percentile at term born to women in the random sample, the high-risk sample or the sample not followed in pregnancy were included in the SGA group (n = 104) and infants born to women in the random sample with a birth weight ≥ 10th percentile formed the control group (n = 120).

Fig. 1
figure 1

Flow of participants at 32 years of age. *The high-risk group consisted of women with one or more of the following risk factors: previous low birth weight child, low pre-pregnancy weight (< 50 kg), previous perinatal death, chronic maternal disease and smoking. SGA small for gestational age

SGA group

The SGA group included 104 individuals with a birth weight < 10th percentile adjusted for gestational age, sex and parity, according to a reference standard presented by the Norwegian Medical Birth Registry31. Out of 104 SGA participants, three were excluded due to congenital syndromes, multimorbidity or death before follow-up. At the 32-year follow-up, 15 individuals were living abroad, could not be reached or had previously refused to participate. The remaining 86 individuals were invited to participate, but 30 of these did not consent, leaving 56 participants. In total, 56 (31 women, 25 men) adults born SGA (65% of invited and 55% of eligible) participated and were included in the current study.

Control group

The control group included 120 individuals born at term with a birth weight ≥ 10th percentile. Two of these were excluded due to congenital syndromes or multimorbidity. Out of the 118 eligible controls at the 32-year follow-up, 14 were living abroad, could not be reached or had previously refused to participate. Altogether, 104 participants were invited, but 36 did not consent, which left 68 participants in the control group. In total, 68 (39 women, 29 men) controls (65% of invited and 58% of eligible) participated and were included in the current study.

Non-participants

There were no differences in background characteristics between participants and those who did not consent to participate at the age of 32 years in either group (Additional file; Table S1).

Background characteristics

At birth, sex, gestational age, birth weight, birth length, head circumference and maternal age were reported for all infants. Data on parental socioeconomic status (SES) was collected during follow-up assessments when the participants were 14 years and supplemented at 19 years. We used the Hollingshead Two-Factor Index of Social Position, which is based on a combination of education and occupation, ranging from 1 at the lowest to 5 at the highest32. When the participants were 32 years of age, their education was reported and categorized into three groups based on the International Standard Classification of Education (ISCED) levels: lower secondary or lower, intermediate and lower tertiary or higher.

Outcome measures

Trail Making Test

The Trail Making Test (TMT) from the Delis-Kaplan Executive Function System (D-KEFS)33 provides an objective measure of executive functions. The conditions being tested are visual scanning (TMT 1), number sequencing (TMT 2), letter sequencing (TMT 3), number-letter switching (TMT 4) and motor speed (TMT 5). TMT 1–3 assess basic attention and processing speed, TMT 4 additionally requires set shifting, inhibition and working memory, and TMT 5 measures motor speed and eye-hand coordination. The participants were asked to connect circles, either empty or containing a certain value (number or letter), in a particular order. Longer time spent on the tests indicate poorer executive function. To isolate the effect of cognitive flexibility, a variable subtracting the time spent to complete the TMT 2 from the time spent to complete the TMT 4 (TMT 4-2) was calculated. We present both raw scores, i.e. time in seconds used to complete the tasks, and calculated scaled scores according to the participants’ age based on the normative sample in the manual33 with a mean of 10 (standard deviation [SD] 3). The TMT assessment was carried out by examiners blinded to group adherence.

Behavior Rating Inventory of Executive Function – Adult Version

The Behavior Rating Inventory of Executive Function – Adult Version (BRIEF-A)34 is a standardized self-report questionnaire for adults assessing the participants’ subjective perception of executive functioning in everyday activities over the last month. It is composed of 75 items comprising nine scales. Participants are asked how often each of the listed behaviors has been a problem during the past month and must place them on a 3-point Likert scale, with the options never, sometimes and often. The answers are summarized into three composite scores: the overall Global Executive Composite and the two index scores Behavioral Regulation Index and Metacognition Index. The Behavioral Regulation Index measures the individual’s ability to appropriately regulate emotions and behavior, and is composed of the clinical scales Inhibit, Shift, Emotional Control and Self-Monitor. The Metacognition Index measures the individual’s ability to solve problems across different contexts, and is composed of the clinical scales Initiate, Working Memory, Plan/Organize, Task Monitor and Organization of Materials. Higher scores indicate poorer executive functioning.

Ethics

This study was conducted in accordance with the Helsinki Declaration and approved by the Regional Committee for Medical and Health Research Ethics (REK) in Central Norway (23879). The adult participants gave written informed consent. The clinical assessments were non-invasive and did not entail risk for injury or adverse events. Participants received medically relevant feedback and were referred to health services as appropriate. Data was handled securely in Services for Sensitive Data (TSD).

Statistical analyses

Data was analyzed using IBM SPSS Statistics 29.0. Statistical significance was set at two-sided p-value below 0.05 and a confidence interval (CI) of 95% were reported when relevant. Normality was assessed by visual inspection of histograms and Q-Q plots of the residuals. Group differences in background characteristics were analyzed using the independent Student’s t-test for continuous and normally distributed data, Pearson’s Chi square for categorical variables and Mann-Whitney U Test for ordinal data. Unadjusted mean (SD) and Cohen’s d were given for the TMT conditions and BRIEF-A clinical scales and composite scores. Effect sizes were interpreted as small (0.2), medium (0.5) or large (0.8)35. Group differences were analyzed using linear regression with TMT and BRIEF-A scores as the dependent variable, group (SGA vs. control) as a fixed factor, and sex as a covariate. In additional analyses, parental SES was also included as a covariate. Associations between TMT and BRIEF-A scores were examined using partial correlations, adjusted for sex.

Results

Background characteristics

Background characteristics of the participants are presented in Table 1. The SGA participants were smaller at birth with lower birth weight, birth length, ponderal index and head circumference compared with the control participants. Maternal age at delivery was on average 2.5 years lower in the SGA group compared with the non-SGA control group. Gestational age, parental SES, age at follow-up and participants’ educational levels were similar in the two groups.

Table 1 Background characteristics of participants in the SGA and non-SGA control groups.

Performance-based executive function

Table 2 presents TMT raw scores (seconds) and scaled scores in the two groups. Unadjusted effect sizes ranged from 0.16 for the difference between number-letter and number sequencing (TMT 4-2) to 0.59 for letter sequencing (TMT 3). Adjusted for sex, the SGA group used 1.6 (95% CI 0.1–3.1) seconds more at visual scanning (TMT 1) and 3.9 (95% CI 1.2–6.5) seconds more at TMT 3. These differences were also reflected by lower scaled scores for these conditions.

Table 2 Performance-based executive function measured by the Trail Making Test (TMT) in the SGA and non-SGA control groups at 32 years of age.

When we adjusted for sex among participants with available data on parental SES, the difference in time spent at number sequencing (TMT 2) also became significant. Additional adjustment for parental SES did not change the results.

Self-reported executive function

Clinical scales and composite scores of the BRIEF-A in the two groups are presented in Table 3. Unadjusted effect sizes ranged from 0.06 for the clinical scale Organization of Materials to 0.41 for Emotional Control. Adjusted for sex, the SGA group had 1.5 (95% CI 0.2–2.8) points higher score than the control group on Emotional Control. The difference was 1.7 (95% CI 0.2–3.1) points among participants with available data on parental SES and did not change much after adjustment for parental education. There were no other group differences in BRIEF-A clinical scales or composite scores.

Table 3 Self-reported executive function measured by the Behavior Rating Inventory of Executive Function – Adult Version in the SGA and non-SGA control groups at 32 years of age.

Correlation between self-reported and performance-based executive functions

The correlations between BRIEF-A clinical scales and composite scores and TMT raw scores are presented in Table 4 for the SGA group and Table 5 for the control group. In the SGA group, higher scores on Emotional Control correlated with longer time spent on both TMT 4 (r = 0.332) and higher raw scores on TMT 4-2 (r = 0.345). In the control group, higher scores on the Behavioral Regulation Index correlated with longer time spent on TMT 4 (r = 0.273) and higher raw scores on TMT 4-2 (r = 0.314). Furthermore, higher scores on the clinical scales Shift, Emotional Control and Plan/Organize correlated with higher raw scores on TMT 4-2 in the control group, with correlation coefficients ranging from 0.318 to 0.276.

Table 4 Partial correlation between BRIEF-A clinical scales and composite scores and TMT raw scores in the SGA group at 32 years of age, adjusted for sex.
Table 5 Partial correlation between BRIEF-A clinical scales and composite scores and TMT raw scores in the non-SGA control group at 32 years of age, adjusted for sex.

The correlations with TMT scaled scores were of similar magnitude (Additional files; Table S2 and S3), but significant also for the clinical scale Task Monitor and TMT 3 in the SGA group.

Discussion

To our knowledge, this study is the first to explore both performance-based and self-reported measures of executive functions in adults born SGA at term. Overall, our results suggest that adults born SGA have poorer executive functioning in some performance-based tasks than their non-SGA peers. However, the same results were not observed for self-reported executive functions. Except for emotional control, adults born SGA did not report executive dysfunctions in everyday life. Moreover, the correlations between performance-based and self-reported executive functions were weak in both groups. These findings align with our a priori hypotheses.

The strengths of this study include the prospective cohort design, use of standardized validated methods and blinded assessment. However, a potential limitation is the use of the 10th percentile definition of SGA, which does not differentiate between genetically small infants and infants who have experienced intrauterine growth restriction (IUGR). Moreover, loss to follow-up is an issue in any long-term study. The relatively small sample size may reduce the ability to detect differences between the groups and associations within the groups. Non-significant differences and associations should therefore be interpreted with caution. A small sample size may further limit the generalizability of the study. However, there were no differences in available background characteristics of participants and non-participants, indicating that our sample was representative of the initial study population.

Both the TMT and the BRIEF-A are widely used for clinical assessment and research to assess various executive processes. The TMT is sensitive to executive dysfunction, intuitive and easy to understand, and has a short administration36. However, it does not discriminate well between the multiple functions needed to carry it out, such as motor and visual functions, attention, and executive functions. Moreover, the TMT places significant demands on performance speed. Also, test-retest reliability indices are only moderate for ages 20–49 years, ranging from 0.36 at the lowest for TMT 3 to 0.55 for TMT 133. As performance-based measures of executive functions may not accurately reflect the application of executive function skills in an everyday environment, we also used the BRIEF-A, which may offer a better portrayal of the individuals’ self-perceived executive strengths and weaknesses37. However, self-report measures are vulnerable to response biases, where the results can be affected by social desirability and/or extreme responses, knowledge of the individual and by cultural factors38. Nevertheless, the test-retest reliability of BRIEF-A is high, ranging from 0.82 at lowest for the clinical scale Plan/Organize to 0.94 at highest for the Global Executive Composite in adults aged 18–90 years in the original standardization sample37.

As hypothesized, the SGA group performed poorer on some of the performance-based measures. Significant group differences were of moderate effect sizes ranging from 0.30 to 0.59. Our findings that the SGA group used longer time than the control group on two conditions of the TMT aligns with previous findings in the same SGA population. Østgård and colleagues18 administered a comprehensive battery of neuropsychological tests to assess the executive functions when the participants in the NTNU LBW Life study were 19–20 years old. They found that the SGA group performed significantly poorer than the control group on the TMT 1, TMT 2 and TMT 3 with effect sizes ranging from -0.40 to -0.61. However, the SGA group performed as well as or better than the control group on other tests of executive functions, such as the Wisconsin Card Sorting Test and Tower test, where performance speed is less important. In a study by Weider and colleagues19, examining this population at 26 years of age, the SGA group performed slower than the controls on all TMT conditions. Moreover, our results are supported by a study by Suffren and colleagues17 reporting lower performances on tests assessing attention and executive functions in adults born SGA at 20 years of age.

Despite of the SGA group performing poorer than the controls on some of the TMT conditions, they reported similar levels of executive functioning on the BRIEF-A, except for the clinical scale Emotional Control. As such, the correlation between the performance-based TMT and the self-reported BRIEF-A was weak. Similar findings were reported by both Sølsnes and colleagues23 and Heinonen and colleagues24, even though they examined preterm populations. They found that adults born with VLBW exhibited more executive dysfunctions in performance-based tests compared with controls. However, they rated their executive functioning comparably to the control group on the BRIEF-A. Both studies reported weak correlations between performance-based measures and self-reported measures of executive functions. Results from our study are consistent with these findings.

Although the TMT results in both groups were close to the population norm, the poorer performance in the SGA group on TMT conditions 1 and 3 suggests some difficulties with visual scanning and letter sequencing. Visual functions were examined by Lindqvist and colleagues in the same SGA cohort in adolescence39,40, indicating no visual impairments that can explain the results. Alternative explanations may be related to brain development in areas and networks needed for higher-order cognitive processes. Several studies, performed both postmortem and in vivo, show that being born SGA with and without documented IUGR leads to a reduction in total brain volume6. An in vivo study on preterm infants born with IUGR using quantitative volumetric magnetic resonance imaging, showed pronounced reductions in cerebral cortical gray matter7. The degree of volume reduction correlated with both head circumference and functional outcome at term, most prominently attention. In our cohort, Eikenes and colleagues8 showed that being born SGA leads to microstructural changes in white matter in adulthood. The authors speculated that these changes are caused by a reduced number of axons and myelination present from birth, causing differences in the development of white matter. The slower performance on some of the TMT conditions in the SGA compared with the control group may be related to subtle changes in the white matter.

Poorer emotional control in the SGA group could be related to functional problems analogous to those seen in ADHD and mood disorders. The Emotional Control scale measures the impact of executive functions within the emotional domain and the ability to regulate emotional responses34. In our cohort, Lund and colleagues4 found higher psychiatric morbidity in the SGA group compared with the control group at 19 years of age, the most prevalent diagnoses being anxiety and ADHD. Additionally, the SGA group reported more inattention and hyperactivity. Similar results were found in the SGA group at 26 years of age with higher overall prevalence of psychiatric disorders, especially anxiety and mood disorders5. If these disorders are still present in our SGA group, it may explain the lower self-reported emotional control.

Prior research has highlighted a generally weak correlation between performance-based and self-reported measures of the same construct41. Whereas performance-based measures provide an indication of processing efficiency, the self-report measures may provide an indication of individual goal pursuit42. A meta-analysis found very small to modest associations between performance-based and self-report measures, indicating that the two measures of executive function assess different aspects42. Consequently, this could explain the weak association between performance-based and self-report measures in our study. Thus, it is important to be aware that the two methods cannot be used interchangeably in research or in clinical settings. It is also noteworthy that the adults born SGA with poorer performance on tests measuring executive functions did not experience this as a hinder in their everyday life.

Our results suggest that some of the difficulties in executive functions identified in adolescence and young adulthood in the same study population are still present in mid-adulthood. With SGA contributing to a sizable proportion of the population (10% per definition), the long-term effects of low birth weight even if born at term are important and could have a potentially large influence on public health and societal welfare. Executive dysfunctions have been associated with poorer academic performance16 and an increased prevalence of psychiatric disorders43,44. Weaker school performances and the need for special education can lead to lower levels of education, limited professional attainment and lower income. Additionally, other factors such as marital status, mental health and satisfaction with life might be adversely affected. Our findings indicate that term-born SGA adults may have some challenges in performance-based executive functioning compared with their peers, even though they did not report poorer executive functioning in everyday life. Further research on the long-term effects of low birth weight even if born at term is crucial to enhance our understanding of this issue and ensure the well-being of this population.

Conclusion

In this prospective cohort study, we found that individuals born SGA at term exhibited poorer executive functioning than term-born non-SGA controls on performance-based measures in mid-adulthood. However, they had similar perceptions of their executive functioning as measured by self-report, except for emotional control. Moreover, the correlation between the performance-based TMT and the self-report BRIEF-A was weak in the SGA group.