Introduction

In addition to the high risk of early mortality, preterm infants (PT) are vulnerable to a range of short- and long-term complications. These complications often include a wide spectrum of neurodevelopmental impairments, which manifest throughout infancy and childhood. The most severe outcome is cerebral palsy, which remains a leading motor outcome of very preterm birth1. However, motor impairments represent only part of the spectrum. A significant proportion of preterm survivors also exhibit cognitive, behavioral, and sensory deficits, including executive dysfunction, attention difficulties, impaired visuospatial processing, and language delays. These outcomes may occur even in the absence of obvious brain injury2,3.

The neurodevelopmental sequelae of prematurity are closely linked to structural and microstructural alterations of the developing brain. Diffuse white matter injury is recognized as a predominant pattern in very preterm infants. This injury disrupts axonal growth, myelination, and oligodendrocyte maturation during a critical window of vulnerability4,5. In addition to white matter pathology, prematurity has been associated with abnormal development of key gray matter structures, including the thalamus, basal ganglia, cerebellum, and cerebral cortex6.

Preterm birth disrupts typical cortical folding trajectories due to the premature transition from the intrauterine to the extrauterine environment, leading to alterations in gyral and sulcal morphology. These alterations, notably reduced cortical surface area and decreased gyrification index, are especially evident in regions involved in sensorimotor integration, such as the central sulcus (CS) and the adjacent pre- and postcentral gyri7,8. Consistent findings from neuroimaging studies reveal that preterm infants exhibit simplified cortical folding and reduced sulcal complexity compared to term-born peers, likely resulting from both direct brain injury and secondary effects of inflammation and altered sensory input. The central sulcus, in particular, is of interest due to its early development and its fundamental role in motor function, making it a relevant marker for studying the impact of prematurity on brain structure and subsequent neurodevelopmental outcomes9,10.

The CS undergoes critical morphological development during the late second and early third trimesters. In preterm infants, the sulcus becomes visible and begins to exhibit interindividual morphological variability as early as 30 weeks postmenstrual age (PMA). Recent findings suggest that many of its defining morphological features are already established by this time, underscoring the susceptibility of sulcal development to the extrauterine environment in the context of prematurity7. The early maturation of the CS is particularly important, because it may have long-term implications for functional outcomes, such as fine motor abilities and manual dexterity. Several studies have used magnetic resonance imaging (MRI) as the main tool to describe the morphology of CS in newborns, particularly in the preterm population. Through quantitative and longitudinal analyses, parameters such as sulcus depth, surface area, and curvature have been characterized, as well as its hemispheric variability and its predictive value on long-term motor and cognitive functions7,8,11,12. However, despite the extensive use of MRI in this field, our study introduces a novel approach by employing 3D ultrasound (US) to characterize CS.

When applied to the CS evaluation, 3D US combines the advantages of traditional 2D US with the additional benefits of multiplanar reconstruction, volume rendering, and anatomical segmentation13. These features enable a comprehensive visualization of the sulcal morphology, including volume, tortuosity, surface, length, depth, and sulcal opening. If 3D US is to play a meaningful role in the early assessment of cortical folding and neuromotor prognosis, robust and efficient segmentation tools must be used. This pilot study aimed to verify the feasibility of transfontanellar 3D US for semi-automatic quantification of the CS in PT infants. Two secondary objectives were also established: to quantify its reproducibility and to describe the maturation trajectory, including hemisphere analysis. In this pilot study, our aim was to assess bedside feasibility and to obtain preliminary evidence that the measurements are reproducible and sensitive to developmental trends. Given the bedside-oriented nature of the protocol, we designed this work as a single-center pilot feasibility study. Accordingly, it was not powered for subgroup analyses or for linking imaging metrics to neurodevelopmental outcomes, and all inferential analyses should be interpreted as exploratory.

Results

Study population

Patients and 3D US

We included 13 preterm infants born at less than 32 weeks of gestational age (GA). The mean GA was 28.6 weeks (± 2.97), and the mean birth weight was 1023.1 g (± 363.4). One infant (7.7%) was classified as small for gestational age (SGA). Eight patients were male (61.5%) and ten (77%) were born by cesarean Sect.  (76.9%). Most mothers received a complete course of antenatal steroids (83.3%). The mean maternal age was 29.1 years (SD 4.76), with a predominance of intermediate educational level (75%). According to the GMH-IVH classification, 5 patients (45.5%) showed no hemorrhage, 3 (27.3%) had grade I GMH, 2 (18.2%) had grade III GMH, and 1 patient (9.1%) presented with intraparenchymal hemorrhage (IPH). A more detailed description of the perinatal variables is shown in Table 1. The 13 patients had a median of 8 3D US, with a minimum of 4 and a maximum of 10. The total number of 3D US included in the study was 109.

Reliability

We assessed reproducibility across three independent 3D ultrasound acquisitions during the same day, which enabled us to quantify reproducibility. Using a two-way random effects intraclass correlation coefficient (ICC), single acquisitions did not yield robust reproducibility for any parameter (none reached good reliability). Averaging the three same-day acquisitions improved reliability to 0.53, 0.49, 0.38, 0.34, 0.36, 0.45, 0.39, 0.55, 0.78, 0.88, 0.64, and 0.81, respectively. These findings indicate that most variability stems from the acquisition step, particularly for volumetric and depth measures, whereas combining the three sweeps yields moderate repeatability for most CS parameters.

Bland-Altman analysis of mean sulcal opening

Bland-Altman plots for the three pair-wise combinations of same-day acquisitions (Fig. 1) showed minimal bias (< 0.35 mm) and symmetric 95% limits of agreement of approximately ± 3 mm. In each plot, more than 95% of points lay within these limits, and no trend toward greater dispersion at larger opening values was observed. Together with the ICC results, these findings confirm that averaging three sweeps greatly reduces acquisition noise and delivers an absolute error for mean sulcal opening that is acceptable for clinical use Table 2.

Table 1 Perinatal and clinical characteristics of the study population (n = 13). Mean ± standard deviations (± SD), total number, and percentage (%), or median and interquartile range [IQR].
Table 2 Reliability of repeated same-day 3D ultrasound acquisitions from the same infant, as measured by the intraclass correlation coefficient (ICC).
Fig. 1
Fig. 1
Full size image

Pair-wise Bland–Altman plots comparing the three same-day acquisitions for mean sulcal opening: Acquisition 1 vs. Acquisition 2 (A), Acquisition 1 vs. Acquisition 3 (B), and Acquisition 2 vs. Acquisition 3 (C). The solid line indicates the bias; dashed lines represent the 95% limits of agreement.

Central sulcus analysis according to postmenstrual age, sex, and hemisphere

We quantitatively analyzed the segmented central sulcus (CS) in 3D ultrasound studies using twelve morphological parameters: volume, tortuosity, surface area, length, mean and maximum depth, sulcal opening, and fold opening (Fig. 2). Sulci were categorized by postmenstrual age (PMA) at the time of the scan into three groups: <28, 28–32, and 32–36 weeks.

A significant increase in sulcal length was observed with advancing PMA, being significantly greater in the 32–36-week group.

(34.35 ± 8.94 mm) compared to the < 28-week group (25.86 ± 9.77 mm; p = 0.006). Similarly, maximum lineal depth was higher in the 32–36-week group (9.30 ± 2.41 mm) than in the < 28-week group (8.87 ± 2.23 mm; p = 0.014). Notably, mean sulcal opening also increased significantly with PMA, reaching 5.70 ± 1.92 mm in the 32–36-week group vs. 3.98 ± 1.76 mm in the < 28-week group (p = 0.023). (Table 3).

Linear mixed-effects models showed significant positive associations with PMA for most central sulcus metrics, including volume, tortuosity, surface, length, and depth-related measures (Table 4). Variance components and σ²between/σ²within ratios are reported in Table 4.

Analysis by hemisphere revealed consistently greater opening values for the right CS vs. the left CS, with significant differences in mean sulcal opening (5.41 ± 1.27 mm vs. 3.23 ± 1.47 mm; p < 0.0001), maximum opening (8.93 ± 2.14 mm vs. 5.99 ± 2.74 mm; p < 0.0001), and maximum fold opening (9.17 ± 2.19 mm vs. 6.36 ± 2.80 mm; p < 0.0001). These findings suggest rightward asymmetry in sulcal development during early maturation (Table A1).

Fig. 2
Fig. 2
Full size image

3D reconstructions of the central sulcus obtained using 3D ultrasound in a preterm infant born at 28 weeks of gestational age. The images illustrate the longitudinal development of the CS at different postmenstrual ages: at birth (A), 30 weeks (B), 32 weeks (C), and 38 weeks (D) postmenstrual age.

Discussion

This study demonstrates the feasibility of semi-automated quantification of CS morphology using 3D US in very preterm infants. Importantly, the reproducibility analysis showed that single acquisitions yielded low intraclass correlation coefficients (ICC ≤ 0.40) for most metrics, indicating limited reliability for individual clinical prognostic assessment. However, averaging the three acquisitions substantially improved reliability, particularly for the sulcal opening parameters (ICC ≥ 0.75), suggesting potential for clinical application in certain morphometric domains. Other measures, such as volume and depth, showed moderate reliability (ICC 0.45–0.55), which may still support use in longitudinal or research settings. These findings reinforce the value of 3D-US for characterising early alterations in CS morphology.

Recent MRI work that applied manifold-learning techniques to detailed shape reconstructions of the central sulcus has shown that several geometric descriptors are already well defined by 30 weeks PMA and that a right-greater-than-left asymmetry emerges early and persists into later childhood, potentially related to laterality development, including handedness7. Our method complements these high-resolution but episodic MRI assessments by enabling high-frequency monitoring during this critical developmental window because 3D US can be repeated at the bedside without sedation.

Neonatal intensive care unit (NICU) typical Clinical exposures(low birth-weight, multiple gestation and prolonged mechanical ventilation) selectively blunt early growth of the CS and other primordial folds; these alterations are associated with less favorable fine-motor and overall cognitive scores at two years9. MRI studies in very PT cohorts have reported associations between early cortical folding features and later neurodevelopmental performance at toddler age, including motor and broader cognitive domains. Because mean opening was the only parameter that reaches good reproducibility after averaging three sweeps (ICC 0.88), it could constitute a robust ultrasound signature of motor vulnerability that could be combined with MRI-based cortical metrics to refine early prognostic models while reducing the need for multiple transportations to the MRI scan. Serial 3D US could extend this developmental trajectory back into the early life exposure after preterm birth, yielding a continuous view of how early extrauterine factors sculpt the morphology of the CS. Changes in the depth of the CS motor representations continue over the first three years of life and run in parallel with the achievement of major gross- and fine-motor milestones14.

Advanced MRI imaging studies have shown that extrauterine stressors as nutritional deficits, inflammation, and white-matter injury, can leave lasting imprints on cortical folding and motor circuitry15. Because 3D US is innocuous, quick, and easily repeatable, it represents a practical tool for real-time monitoring of brain growth and development with the potential to evaluate how neuroprotective strategies influence CS development during the NICU stay.

This study has several methodological strengths that enhance the reliability of its findings. The cohort was prospectively examined with a standardized 3D US protocol that acquired three independent sweeps per session, enabling a true within-subject.

Table 3 Central sulcus morphological parameters by postmenstrual age (PMA) group at the time of 3D ultrasound.
Table 4 Linear mixed-effects model results for central sulcus metrics by postmenstrual age.

assessment of reproducibility under real NICU conditions. The workflow captured the full 3D contour of the CS rather than relying on single planar slices, yielding a comprehensive set of quantitative metrics for each examination. Each infant underwent several serial scans (median = 8 per patient), resulting in a dense longitudinal dataset that could be analyzed to account for individual growth trajectories.

This preliminary single-center study has some limitations. First, the number of infants is small and all participants were recruited from a single NICU. Therefore, confidence intervals are expected to be wide, subgroup analyses are underpowered, and external validity is limited. These results should be considered hypothesis-generating and require confirmation in larger cohorts. Second, reproducibility was assessed only intra-day with one operator and scanner; inter-observer, cross-platform, and longer-term reliability still need to be established. Future reliability studies should include inter-observer segmentation variability, and inter-day acquisitions. Third, the absence of concurrent MRI data precluded direct cross-modal validation of our US metrics. Because MRI was not acquired concurrently, we cannot quantify systematic bias or directly assess measurement accuracy against an established cortical morphometry reference. A planned next step is to acquire paired 3D-US and MRI within a short time window to enable cross-modal agreement analyses and calibration of US-derived opening metrics.

Conclusions

3D ultrasound enables bedside quantification of CS morphology in very preterm infants. Averaging three sweeps yields highly reproducible sulcal opening measurements (ICC ≥ 0.75). The portability and repeatability of this method offer advantages over MRI for longitudinal monitoring of cortical development. Integration of 3D US morphometry into neonatal care may improve understanding of CS maturation and support early neuroprotective interventions.

Methods

Patients

Eligible participants were very-low-birth-weight infants (VLBWI) admitted to the NICU of Hospital Puerta del Mar, defined as a birthweight  1,500 and/or a gestational age ≥ 32 weeks. Infants with congenital or chromosomal anomalies, metabolic disorders, or central nervous system infections were excluded. The Research and Ethics Committee approved the study, and.

written informed consent was obtained from all parents or guardians.

We reviewed the 109 3D US scans serially performed at different time points, with a minimum of 1 day apart. This was a feasibility pilot study; therefore, the sample size was determined by convenience and operational constraints (available eligible infants and scanning windows), with no formal a priori power calculation. All inferential analyses, including sex-stratified comparisons, were considered exploratory. Transfontanellar 3D US images of the brain were acquired using the 4D option in the 3D/4D Voluson i portable ultrasound system (GE Healthcare, Milwaukee, WI, USA). First, the transducer (SVNA5-8B, 5–8 MHz), using a center frequency of 6.5 MHz, was situated in the third coronal plane, and the scan angle was settled to 90º. With the transducer fixed in that position, the beam moves from them anterior to posterior planes with a maximum total scan time of 40–50 s.

Ethics approval and consent to participate

The study protocol was approved by the Research and Ethics Committee of Hospital Universitario Puerta del Mar (Cádiz, Spain). All procedures were performed in accordance with relevant institutional and national guidelines and regulations and with the Declaration of Helsinki. Written informed consent was obtained from the parents or legal guardians of all participants prior to inclusion.

Image analysis

Image segmentation and post-processing were conducted using a semi-automated pipeline. Manual slice-by-slice delineation of the CS was performed by trained raters using multiplanar 3D US reconstructions. Morphometric parameters were extracted using custom-developed software. Each segmentation was subjected to visual inspection and manual correction to ensure anatomical fidelity before final metric extraction.

Assisted segmentation of cerebral sulci in 3D ultrasound using a graphical interface

An interactive tool was developed using MATLAB (R2024a Update 5) for the semi-automatic segmentation of cerebral sulci from 3D ultrasound volumes. The volume is represented as a three-dimensional matrix, and users can freely explore axial, coronal, and sagittal planes through a graphical user interface (GUI). (Fig. 3)

Fig. 3
Fig. 3
Full size image

Screenshot of the custom MATLAB-based graphical user interface (GUI) for semi-automatic sulcal segmentation.

The left panel displays the 3D ultrasound volume as navigable orthogonal planes, with the central sulcus manually refined (yellow trace). The right panel shows the corresponding 3D reconstruction of the segmented sulcus. The interface provides interactive tools for slice navigation, labeling, and visualization, and automatically outputs quantitative morphometric metrics.

Sulcus segmentation is performed on selected 2D slices, starting from a manually marked point of interest. From this point, Frangi filters16are applied to enhance sulcus-like structures, and closed level-set contours are identified around the region. The initial mask is then refined using an active contour model17,18to better capture the sulcus geometry. The segmentation can be progressively propagated to adjacent slices. The quality of the segmentation depends on the resolution and contrast of the ultrasound images, as well as on the technique employed. The developed software supports a flexible approach, offering different segmentation modes ranging from manual tracing to local region-adaptive active contours. The system allows labeling of anatomical sulci, visualization and manual editing of their contours, and the calculation of morphometric features such as volume, surface area, depth, width, and tortuosity. Results can also be exported in Excel format. This tool is particularly suited for clinical and research settings that require personalized, guided segmentation of cortical relief in neonatal ultrasound. Once segmentation is completed, various morphometric measurements are computed directly from the segmented volume to characterize the geometry and spatial configuration of the central sulcus. These include:

Volume. The volume is calculated as the total number of segmented voxels multiplied by the unit voxel volume defined by the acquisition spacing. (Fig. 4)

Fig. 4
Fig. 4
Full size image

Definition of the volume metric (mm3) for the central sulcus.

Surface Area. The surface area is estimated by extracting the 3D contour of the segmented sulcus and generating a surface mesh. The total area is obtained as the sum of the triangle areas in this mesh. (Fig. 5)

Fig. 5
Fig. 5
Full size image

Definition of the surface area metric (mm²) for the central sulcus (grid = measured surface).

Sulcal opening. The sulcal opening is estimated from equally spaced cross-sections along the sulcus’s main axis. In each slice, the 2D binary section is reconstructed and the number of active voxels per row is computed. This results in a local width matrix representing the sulcus opening along its length. A localized fold opening measure is also defined as the maximum width among the uppermost active rows, providing a clinically relevant index of cortical compression or expansion near the.

surface. (Fig. 6)

Fig. 6
Fig. 6
Full size image

Definition of opening measures (mm) for the central sulcus. Arrows indicate orthogonal distances: opening and fold opening.

Tortuosity. From the 3D skeleton of the segmented volume, a spatial graph is constructed that represents the sulcus trajectory. Tortuosity is defined as the ratio between the actual path length and the straight-line Euclidean distance between its endpoints, quantifying the deviation from an ideal linear path. (Fig. 7)

Fig. 7
Fig. 7
Full size image

Tortuosity of the central sulcus (unitless): defined as Lcurved/LstraightLcurved/Lstraight between endpoints A and B.

Depth. Sulcal depth is estimated from the segmented volume following a conceptually similar approach to that proposed by Coulon et al. (2006)19, adapted to volumetric data. Instead of solving the heat equation for surface parameterization, this method uses equally spaced transverse slices along the main sulcal axis, previously aligned by PCA-based rotation. In each slice, equidistant points are interpolated along the sulcal contour, and a 3D graph is built connecting consecutive points between adjacent slices. Depth is calculated as the sum of shortest-path distances from each central point to the outer sulcal edges, following the actual surface geometry. A complementary linear depth is also computed as the direct Euclidean distance between those endpoints. (Fig. 8)

Fig. 8
Fig. 8
Full size image

Depth metrics of the central sulcus (mm): semiperimeter depth and linear depth.

Statistical analysis

We performed a descriptive analysis, and the results were described as frequency and percentage if they were categorical, mean and standard deviation (sd), or median and interquartile range [IQR] according to their distribution if they were quantitative. Bivariate analysis was performed using Pearson’s chi-square test or Fisher’s exact test for categorical data and Student’s t-test or the Mann–Whitney U test for two-group comparisons, and one-way ANOVA across the three PMA groups. Differences in morphometric parameters across the three PMA groups (< 28, 28–32, and 32–36 weeks) were assessed using one-way ANOVA. Reliability was assessed with two-way random-effects ICCs. Bland–Altman plots assessed agreement. Longitudinal associations between PMA and each CS metric were assessed using linear mixed-effects models with PMA as a continuous fixed effect and a subject-specific random interceptto account for repeated measurements. For each model, we report the PMA slope (β, units/week) with 95% confidence interval and variance components (σ² between-subject and σ² within-subject residual variance), including their ratio (σ²between/σ²within).Statistical analysis was performed using Stata 16.0 (Stata Statistical Software: Release 16. College Station, TX: StataCorpLP). A result was considered statistically significant with a p < 0.05. All the methods were carried out in accordance with relevant guidelines and regulations.