Background

The treatment of congenital deformities of the nasolabial region has far-reaching consequences for the future lives of our young patients. It should therefore be the goal of every cleft surgeon to provide the best possible treatment in order to create both function and aesthetics. Today, long after Millard’s work, the field of cleft surgery continues to change and improve, or as Millard himself once stated: ‘Semper investigans, nunquam perficiens. Always searching, never achieving (perfection)’1,2.

In order to progress towards the optimal outcome, it is essential to visualise and define it. To date, many studies have assessed the ‘where have we got to?’ by evaluating the outcomes of cleft lip surgery using various aesthetic scoring systems or symmetry assessments3,4,5,6,7. So far, Loo et al. are one of the very few to assess the ‘where exactly do we want to get to’ by describing the anthropometic morphology of the ‘normal Cupid’s bow’ in healthy young children8.

In today’s era of three-dimensional (3D) analysis techniques, it is essential that we have reference databases of these ‘normal’ anthropometics combined with appropriate and precise evaluation methods. These tools are indispensable in order to identify the optimal, or in other words the most unobtrusive, result that we are looking for, and to analyse the aspects in which we have not yet achieved it. This will provide the basis for further improving surgeons’ understanding and skills, and therefore the surgical outcomes. By establishing an objective and valid basis for comparison, it will also allow us to evaluate the surgical outcomes in detail, and to improve the surgical techniques in a very targeted way. It is important to note that this is only possible if the databases’ range corresponds to the age at which primary surgical intervention takes place. Available literature suggests that this procedure takes place between three and nine months after birth9,10. The present work therefore focuses on this timeframe.

A recent shift towards 3D assessment in cleft surgery has led to the use of different data acquisition tools, ranging from 3D photography to the extraoral use of intraoral scanners11,12,13,14,15,16. As part of the validation of a suitable measurement tool, our group has recently demonstrated the superiority of the extraoral application of a commercially available intraoral scanner (3Shape, Trios4 and 5) in terms of accuracy and applicability in mapping the nasolabial region12. The use of this high-precision instrument, which is now well established, provides the opportunity to apply a novel measurement method using previously unidentified measurement parameters that differ from those used in conventional anthropometry.

The EUROCLEFT and SCANDCLEFT studies, with their respective scoring scales, have already established global standards for the assessment of outcomes following surgical correction of cleft lip and palate (CLP) defects10,17. Although both studies highlight a glaring lack of evidence in orofacial cleft surgery, they do not yet make use of new 3D anthropometry within the scope of the respective possibilities. The recent works of Bugaighis et al. and Xu et al. uses 3D instruments for image acquisition. However, they are limited to the conventional measurement parameters established in two-dimensional anthropometry, such as direct distances and angles15,18.

As we stand on the verge of transitioning from 2D to 3D assessment in cleft surgery, the aim of this study is to establish a baseline and reference group for future evaluations. Additionally, an accurate and reproducible methodology is to be established for the 3D assessment of the nasolabial region in children aged 3 to 9 months, taking into account surface reflections, surface curves and perpendicular lengths. As part of the study of healthy children, a classification of alar base types will be presented.

Methods

Patient acquisition

An intraoral scanner, previously established and validated by our working group12was used to acquire the nasolabial region of 47 healthy children aged 3 to 9 months, who presented to the Division of Phoniatrics and Paediatric Audiology at the Department of Otorhinolaryngology at Erlangen University Hospital. Three members of the working group performed the scans, and the scanner was calibrated before each process. Scans that were distorted or otherwise corrupted by the childrens’s movements were excluded, leaving 25 patients who met the requirements for 3D anthropometric measurement for further analysis. The average time taken was 3 min per child, and all children with nasolabial or generalised cranial deformities were excluded. Of the 25 children included, 13 were male and 12 were female. The mean age was 4 months and 18 days with a minimum age of 3 months and a maximum age of 8 months and 30 days. Birth weight was recorded as another demographic parameter.

All parents were informed in detail about the study and gave their informed written and verbal consent. All children’s legal guardians gave informed consent for the publication of identifying information/images in an online open access publication. The study was approved by the Ethics Committee of the Friedrich-Alexander University of Erlangen-Nürnberg on May 6, 2022 under the application number 22-100-B and all methods were performed in accordance with the relevant guidelines and regulations.

Measuring methods

The measurement methodology was derived from the seminal work of Leslie G. Farkas, namely ‘Anthropometry of the head and face’, as well as from the contributions from Chong et al., Bughaigis et al. and Kesting et al.1,18,19,20. Subsequently, our working group modified this methodology to take advantage of the new possibilities offered by modern 3D anthropometry. This involved focusing on light reflections and incorporating surface curves and perpendicular lengths. The basis of all our 3D anthropometric measurements were 33 landmarks (Fig. 1a; Table 1), i.e. points on a surface that can be defined as constructed elements without a vector. The step-by-step guide, which is presented in the results section, was developed with the aim of defining their precise location and facilitating their placement for future work. Our final reference database consists of 32 distances (Fig. 1b), 33 surface curves (Fig. 1c), 2 angles (Fig. 1d) and 18 indices (Table 2).

Table 1 Overview and definition of the 33 3D anthropometric landmarks of the nasolabial region. An additional table enhanced with images can be found in the supplementary material section (Supplementary Material 1).
Table 2 Overview and definition of the 3D anthropometric measurements consisting of of 32 distances (Fig. 1b), 33 surface curves (Fig. 1c), 2 angles (Fig. 1d) and the resulting 18 indices.
Fig. 1
figure 1

(a) Overview of the 3D anthropometric landmarks of the nasolabial region. The 33 landmarks are placed on the virtual surface, taking into account anatomical structures, colour/texture and surface reflection. Detailed instructions on placing the landmarks can be found in Supplementaray Material 1 and 4. Corresponding number assignments (e.g. L1 - Nasion) can be found in the table in Supplementary Material 1. (b) Overwiew of the 3D anthropometric distances of the nasolabial region The 32 distances are measured as a direct length between two landmarks or as a perpendicular length between a previous measurement and a new landmark, as in the example of effective nasal length, dimple depth or right and left vermillion convexity. Corresponding number assignments (e.g. D1 - Nasal height) can be found in the table in Supplementary Material 1. (c) Overwiew of the 3D anthropometric surface curves of the nasolabial region. The 32 surface curves are positioned directly on the mesh. They can be set by raw placement using landmarks only, or by using intersection points. Detailed instructions on placing the surface curves and corresponding number assignments (e.g. S1 - Volumetric upper nasal height) can be found in Supplementaray Material 1. (d) Overwiew ot the 3D anthropometric angles of the nasolabial region The 2 angles consist of a 3-point angle and a bipartite angle. The vermillion angle is constructed as a 3-point angle, the collumellar angle as a 2-directional angle. Corresponding number assignments (e.g. A1 - Columellar angle) can be found in the table in Supplementary Material 1.

A distance is defined as the direct length between two points on a mesh or between a point on the mesh and a point on a previous measurement. A distance between a previous measurement and a new landmark can be constructed as a perpendicular length, as in the example of effective nasal length, dimple depth or right and left vermillion convexity (Fig. 1b).

Unlike normal curves, surface curves are positioned directly on the mesh (Fig. 1c). They are used to generate curvature curves that reflect the subject’s natural anatomy, while remaining independent of mesh/face rotation in space. This was confirmed by the authors through repetitive measurements using surface curves while manipulating the respective meshes in space, as well as by consulting the software developers. As a fundamental advance in the transition from conventional to 3D anthropometry, surface curves provide the ability to make new and more comprehensive statements about the facial morphology. They include the previously overlooked anatomical configurations between their respective start and end points. Taking into account the specific characteristics of each parameter and its prospective integration in future studies of cleft patients, we have determined for each surface curve, in particular, whether the intersection points are valid. If no intersection points are valid, only the start and end points of the surface curve should be set manually. Detailed instructions for each parameter are given in the Supplementary Material section.

3D anthropometric angles can be constructed in two ways: as three-point angles or as bipartite angles (Fig. 1d). Three-point angles are defined by three landmarks, one of which acts as the angle point. Bipartite angles can be constructed using elements containing a direction, a normal or a line (e.g. planes and lines) without them having to touch. In the context of our 3D anthropometric methodology, the vermillion angle is constructed as a 3-point angle between right and left cheilion and labrale superius, with the latter acting as the angle point. As a 2-directional angle, the columellar angle is formed by the line/distance between columellar median high point and subnasale, and the line/distance between right and left cheilion.

The indices function as constructed variables, facilitating the formation of new assertions regarding the interrelationship between the parameters and allowing a rapid assessment of the patient’s nasolabial conditions.

The data set was analysed according to 3D anthropometric criteria by metrically accurate measurements of distances, surface curves, angles and indices using 3D inspection software (GOM Inspect 2019 Hotfix 6, Rev. 125216, Build 2020-02-27, Co. Zeiss, Jena, Germany).

Statistical analysis

Reliability tests were conducted using a sub-collective as a proxy for the entire collective to ensure reliable measurements. With an interval of two weeks intra-rater reliability was assessed for five children. Intra-rater reliability was calculated using the time points as different assessment occasions. The two-way intraclass correlation coefficient (ICC) for consistency and Pearson’s correlation coefficient (r) were calculated as measures of intra-rater reliability. Inter-rater reliability is evaluated across all children for five parameters between three raters using the intraclass correlation coefficient (ICC) for two-way models employing an absolute agreement definition21,22. To ensure a high degree of reliability in the test, the working group mixed presumably ’easy’ and ‘hard’ parameters, as well as parameters relating to the nose and lip. Distances and surface curves of each kind were included. An ICC of < 0.40 is as considered poor agreement, values between 0.40 and 0.75 as substantial- and an ICC exceeding 0.75 as excellent23.

To improve the accuracy of the data, values were imputed based on clinically relevant information, such as the subject’s sex, age, and birth weight. This led to the predicted values presented in the Results section. Rationale for the imputation process: Providing the algorithm with high quality data from a limited number of samples, which have been measured with precision by an expert in the field, is a time-consuming process, but is preferable to measurements made by an expert under time pressure or by a non-expert for a large number of samples. In the present study, valid and precise data are employed to predict nonobserved values, rather than providing additional clinical data within the same time frame. This approach aims to minimize the impact of human error. All variables for which quantitative data were available were used to predict explicit missing values using the R package Multivariate Imputation by Chained Equations (mice 3.15.0) with predictive mean matching. Predictive mean matching was chosen because it has been shown to be effective in dealing with nonlinearity and outliers. An example of the observed and predicted parameters for 9 direct lengths and 7 surface curves is given in Table 3.:

Table 3 Example for observed and predicted parameters for 9 direct lengths and 7 surface curves. All values are given in mm.

The dataset used to impute missing values consisted of 25 children, with each subject having 85 distinct variables. The age range used for imputation was 3 to 8 months and 30 days, respectively. For the purposes of this analysis, each month was assumed to consist of 30 days. For the age range of 5 months (5/0–5/30 months/days) only one measurement point was available (5/22 months/days). The child was included in the dataset, but the age range of five months was not predicted. Similarly, for the age of seven months, there was no data available for a child and therefore the range was not predicted.

Parameters that are likely to be highly correlated, such as length measurements, were approximated together and grouped by individual. The rationale is once more straightforward:

It can be assumed that the “left lateral columella length” corresponds to the “right lateral columella length” of a healthy individual. Yet, for the purposes of this approximation, it is not necessary for this axiom to be met. However, if the aforementioned axiom is proven to be true, the information is utilised. It is important to note that the data are inherently interdependent, as some indices are calculated from the raw values of other variables. In addition, the values are naturally dependent on age, sex and birth weight. Consequently, if a raw value is missing, the index cannot be calculated, resulting in a dependent pattern of missing data. It should be noted that subsequently Little’s MCR Test was not computed.

Statistical analysis led to the presentation of the results in the form of growth curves and formulae according to Brons et al., Matthews et al. and Liang et al.24,25,26.

Results

Our final reference database consists of 32 distances (Fig. 1b), 32 surface curves (Fig. 1c), two angles (Fig. 1d) and 18 indices; representing the optimal morphological outcome as a control group for future research in cleft lip and palate surgery. Descriptive plots of all 84 datasets can be found in Supplementary Material 2, with observed values on the left and predicted values on the right. Age is given in months on the x-axis, and the specific value for each parameter is given on the y-axis in mm for distances and surface curves, in degrees for angles, and in numbers without dimension for indices. The red lines represent growth curves for female individuals and the blue lines for male individuals. To facilitate the application in both clinical and research settings, the comprehensive mathematical equation for each parameter is provided in Supplementary Material 3, with the equations presented for both male and female patients. In order to determine the optimal morphological result for a particular parameter, it is necessary to enter the current clinical value as age in months in the equation, or for simplification purposes in the column provided for the respective sex. We recommend using the predicted value equations.

For each child and each parameter, the intra-rater reliability was good/high or excellent/very high with a minimum value of ICC = 0.89 and r = .89. Stratified by type of parameter intra-rater reliability is lowest for angles and highest for surfaces (ICC = 0.99) with distances ranging in between (ICC = 0.90). For a tolerance of 1 mm, percentage agreement across all indices and children was 81.4%. Across variables assessed ICC for inter-rater reliability was 0.99 (p < .001) indicating nearly perfect absolute agreement between raters. The lowest ICC was found for the right vermillion length with an ICC of 0.73 (p < .001) indicating substantial agreement among raters. For a tolerance of 1 mm percentage agreement between raters was 43.9%. A possible cause of the lower ICC for some parameters, including right vermillion length, could be the challenging placement of the cheilion points.

Accurate positioning of the updated 3D anthropometric landmarks is required to ensure reproducible measurements. Therefore, in addition to a step-by-step tabular guide (Table 1 and Supplementary Material 1), we present a short video guide for 3D anthropometric landmark positioning (Supplementary Material 4).

As part of the establishment of our 3D anthropometric measurements and conclusive reference database, the new methodology allowed for a dedicated classification of alar and nasal shape. To facilitate future studies and to improve the individualised surgical approach to cleft lip and other nasolabial surgical procedures in general, we have identified the following three types of alar base:

  • Type I: mabR/mabL is more lateral than cR/cL.

  • Type II: mabR/mabL is more medial than cR/cL.

  • Type III: Direct transition from mabR/mabL to cbR/cbL.

Alar base type 1, the most common in our study population, is characterised by a wide and shallow labial nasal entrance and a continuously narrow columella. Alar base type 2 is characterised by a narrow and shallow labial nasal entrance and a columella that extends far cranially/anteriorly. In alar base type 3, a continuous ridge shields the labial nasal entrance from the rest of the upper lip. As shown in Fig. 2, alar base types not only influence the appearance of the alar, but also have a major impact on the overall appearance of the patient’s nose and can therefore be used to classify the nose in general.

Fig. 2
figure 2

Alar base types. Type I: mabR/mabL is more lateral than cR/cL. Type II: mabR/mabL is more medial than cR/cL. Type III: Direct transition from mabR/mabL to cbR/cbL.

In addition to the previous validation of metric accuracy of the intraoral scanner12we can now confirm its clinical applicability based on the long-term use in awake and sleeping children aged 3 to 9 months.

Discussion

To facilitate future studies in cleft surgery, we have identified the optimal morphological endpoint in the nasolabial region of healthy children aged 3 to 9 months. Our reference database will allow clinicians and research groups to compare the status quo or the morphological baseline of cleft patients before and after cleft lip repair with a healthy control group. Similar to the description of bone growth in healthy children by Liang et al., we present growth curves that now allow graphical and mathematical prediction of the expected value at a given age and sex25. In addition to previously established anthropometric parameters of the nasolabial region20our database takes into account recent advances in 3D anthropometry and includes surface curves, perpendicular lengths and the resulting comprehensive indices. To our knowledge, this is the first 3D anthropometric database of the nasolabial region in healthy children aged 3 to 9 months.

As mentioned above, 3D anthropometry is a highly accurate and advanced tool for evaluating the nasolabial region and has been shown to be superior to two-dimensional methods27,28. In line with this statement, recent literature indicates a paradigm shift in the evaluation of CLP surgery towards 3D evaluation13,15,18,29,30,31,32. To date, the majority of recent studies lack validation of the 3D (facial) scanner used13,15,18,30,31 and it is worth noting that some of the scanners used do not achieve the accuracy for craniofacial assessment touted by the industry12. As a 3D scanner for the assessment of cleft lip and palate must not only be highly accurate but also mobile in order to be used in clinical applications, a suitable device has recently been validated by our working group using a previously validated industrial scanner as a baseline12.

The use of this new data acquisition tool has opened up a host of new possibilities for 3D data processing and measurement. The mere ability to rotate the meshes in 3D space, together with a constant virtual light source, enables the accurate identification of previously difficult or indistinguishable anatomical structures through light reflection and shading. As light reflection and shading occur naturally at the edges of 3D structures in space and have been shown to be suitable for the perception and detection of 3D structures33,34,35,36the reflections appear to be well suited for the detection and separation of anatomical structures in the nasolabial region, such as the nose from the facial and labial tissues, the alae from the alar base, and the vermillion from the upper lip (see video and tables in the supplementary material). Furthermore, both our own experience and the current literature indicate that 3D anthropometry on coloured 3D surfaces seems to benefit 3D shape perception36. As most of the current articles on 3D anthropometry of the nasolabial region include measurements on monochrome surfaces32,37,38,39we recommend their implementation for further research in our field. It is worth mentioning, as a limiting factor, that although general data reproducibility was high to very high, some landmarks were slightly less reproducible than others with regard to inter-rater reliability. For instance, this was the case for parameters including the left and right cheilion as landmarks. These limitations could be overcome through future research and technological advances.

The novel approach of combining precise measuring instruments with the use of light reflections on coloured 3D surfaces now paves the way for a new and accurate method of measuring the outcome of cleft lip plasty.

Self-critical examination of one’s own accuracy and, above all, the reproducibility of data collection and processing is an important part of science and one that has been increasingly emphasised in recent years40,41. As we are now on the threshold of a paradigm shift towards 3D evaluation of cleft lip surgery with the implementation of all its technical and methodological possibilities, a unified and standardised approach within the discipline and between the working groups is of great importance. To this end, in addition to the reference database, we present step-by-step instructions for the consistent and reproducible placement of the updated anthropometric landmarks of the nasolabial region in text and video. Once the 33 landmarks have been created, the children (i.e. their scans) are measured using distances, surface curves, angles and indices. Once again, the use of 3D anthropometric methods offers new and far-reaching possibilities in all four areas of application mentioned, which go beyond their pure superiority in terms of precision and applicability.

The concept of distance has expanded beyond its traditional definition. It now encompasses more than just the linear measurement between two points (A and B). In modern spatial analysis, distance can also be quantified as the perpendicular length from a reference line to a point on a mesh or virtual surface. This technique, which to our knowledge has never been used in 3D nasolabial anthropometry, was used by our working group to determine the parameters of effective nasal length, dimple depth and right and left vermillion convexity.

When measuring the distance between point (A) and point (B), we disregard all information about the morphology in the form of natural convexities and concavities of the nasolabial region between the two points. Therefore, with the new capabilities of 3D anthropometry, we introduce the entity of surface curves, which follow the patient’s natural anatomy on the surface of the mesh, allowing it to be included in future evaluations and research.

Vermillionplasty is essential for adequate closure of the lip in unilateral cleft lip. Various techniques, with Noordhoff’s technique as a prominent forerunner, are currently being discussed with regard to their effect on vermillion convexity and, in the case of a suboptimal result, the loss of vermillion convexity1,42,43. Similarly, different techniques for bilateral cleft lip closure and their respective effects on the dimple morphology, which strongly influences the perception of a person’s attractiveness, have been the subject of much debate1,44,45,46,47. By quantifying the aforementioned distances, in particular dimple depth and left and right vermillion convexity, and adding surface curves, in this case the volumetric philtrum and vermillion length, as a new entity, we allow for profound and far-reaching conclusions to be drawn in future studies on the impact of different techniques in cleft lip surgery.

The pursuit of reproducible data collection and processing should go beyond the implementation of accurate and precise measuring methods. It is also essential to use a reliable statistical approach. Inclusion of missing data is used in several research areas where resources such as expertise, cost or readily available samples are scarce48,49. As long as the prediction model does not become too complex (at present), it has been shown that clinical data can be imputed reliably. For example, using data from 13 common laboratory tests and deliberately removing known information, 12 international teams were able to correctly estimate the missing information using probabilistic algorithms50. Clinical research needs to revalue missing values not as a genuine problem of a dataset, but as a challenge for mathematical modelling given the probabilistic and Bayesian algorithms available. Simply treating missing information as not present can lead to more severe artefacts in statistical analysis (e.g. assuming non-normality where data are actually normally distributed, up-weighting of actually random occurrences, which enlarges the confidence interval) than treating them51.

In our case, the benefit of imputation can be demonstrated by examining, for example the length of the right lateral columella (Fig. 3). The observed information leads the equation to assume a decrease in males, based on the large confidence interval between four and six months. However, the multiple chained regression equations for the predicted values indicates growth, thereby reducing the size of the confidence interval and providing information on a more fine-grained y-axis.

Fig. 3
figure 3

Observed and predicted values based on age, sex and weight for right lateral columnella length. All values are given in months and mm. The observed information leads the equation to assume a decrease in males, based on the large confidence interval between four and six months. However, the multiple chained regression equations for the predicted values indicates growth, thereby reducing the size of the confidence interval and providing information on a more fine-grained y-axis. Y-axis shows distance (mm). X-axis shows age (months).

The morphology of the nose, in particular the nostrils and their attachment point to the upper lip and the aforementioned columella, has a major influence on the overall appearance of an individual. Leslie G. Farkas has previously classified alar base configurations based on visually assessed criteria such as ‘short curved’ and ‘full curved’20. Despite the new 3D anthropometric capabilities, modern classifications are limited to pathologies and different treatment methods without describing the optimal morphological endpoint52,53,54,55. We therefore propose a simple classification of alar base types based on the 3D anthropometric landmarks cR/cL, cbR/cbL and mabR/mabL and their spatial relationship (Fig. 2). The potential for a detailed and consistent characterisation of alar base types now allows them to be incorporated into surgical procedures for the treatment of cleft lip. This can be achieved by considering the healthy side of the nose in unilateral clefts, or by considering the alar base types of close relatives. It should be noted, however, that further work on the inheritance of alar base types is needed to make accurate predictions based on close relatives.

Despite the high drop-out rate as a limiting factor of the study, it seems important to lay an early foundation in the form of reference databases on the nasolabial region of healthy children aged 3–9 months, thus anticipating technical advances in clinical 3D anthropometry. By minimising the time required per scan, intraoral scanners may become more feasible for awake and active children in the future. The use of smartphone-based scans also offers promising approaches, despite losses in metric accuracy56,57,58.

Good coverage rates and confidence interval widths can be achieved with small samples when suitable donor cases are available for imputation. The suitability of donor cases is inevitably linked to the (clinical) population of interest, addressing the underrepresentation of different ethnic groups. Although children of Asian (n = 2), Arab (n = 4) and African (n = 1) ethnicity were included to represent the average Central European patient population, there is no specific ethnic analysis of the optimal morphological endpoint. Given the small sample size and high exclusion rate despite precise measurement methodology, this should be the subject of future studies. The generation of ethnicity-specific databases, and the resulting potential to improve the treatment of nasolabial defects for individual patients, represents a challenging yet promising area of research.

Conclusions

By defining the optimal morphological endpoint, we have successfully established a basis for future 3D anthropometric measurements in children with cleft lip and palate. The establishment of a reference database and a step-by-step tutorial will allow reproducibility of measurements in future studies and facilitate the comparison of the clinical status quo with the optimal morphological endpoint. In addition, alar base types based on 3D anthropometric measurements may allow for a more individualised surgical approach to cleft lip and palate treatment.