SinusC-Net for automatic classification of surgical plans for maxillary sinus augmentation using a 3D distance-guided network

Hwang, In-Kyung; Kang, Se-Ryong; Yang, Su; Kim, Jun-Min; Kim, Jo-Eun; Huh, Kyung-Hoe; Lee, Sam-Sun; Heo, Min-Suk; Yi, Won-Jin; Kim, Tae-Il

doi:10.1038/s41598-023-38273-9

Download PDF

Article
Open access
Published: 19 July 2023

SinusC-Net for automatic classification of surgical plans for maxillary sinus augmentation using a 3D distance-guided network

In-Kyung Hwang¹^na1,
Se-Ryong Kang²^na1,
Su Yang³,
Jun-Min Kim⁴,
Jo-Eun Kim⁵,
Kyung-Hoe Huh⁵,
Sam-Sun Lee⁵,
Min-Suk Heo⁵,
Won-Jin Yi^3,5 &
…
Tae-Il Kim¹

Scientific Reports volume 13, Article number: 11653 (2023) Cite this article

2864 Accesses
10 Citations
Metrics details

Subjects

Abstract

The objective of this study was to automatically classify surgical plans for maxillary sinus floor augmentation in implant placement at the maxillary posterior edentulous region using a 3D distance-guided network on CBCT images. We applied a modified ABC classification method consisting of five surgical approaches for the deep learning model. The proposed deep learning model (SinusC-Net) consisted of two stages of detection and classification according to the modified classification method. In detection, five landmarks on CBCT images were automatically detected using a volumetric regression network; in classification, the CBCT images were automatically classified as to the five surgical approaches using a 3D distance-guided network. The mean MRE for landmark detection was 0.87 mm, and SDR for 2 mm or lower, 95.47%. The mean accuracy, sensitivity, specificity, and AUC for classification by the SinusC-Net were 0.97, 0.92, 0.98, and 0.95, respectively. The deep learning model using 3D distance-guidance demonstrated accurate detection of 3D anatomical landmarks, and automatic and accurate classification of surgical approaches for sinus floor augmentation in implant placement at the maxillary posterior edentulous region.

Deep learning-based fully automatic segmentation of the maxillary sinus on cone-beam computed tomographic images

Article Open access 17 August 2022

Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images

Article Open access 07 May 2022

Automated classification of midpalatal suture maturation stages from CBCTs using an end-to-end deep learning framework

Article Open access 29 May 2025

Introduction

When rehabilitating the posterior maxilla with implants, clinicians are frequently confronted with the problem of limited alveolar bone width and height. Sinus floor elevation via the lateral or transcrestal approach is the most predictable and commonly used method to resolve residual bone deficiency^1,2. Several anatomic factors should be considered in selecting the surgical approach: residual bone height (RBH)³, sinus floor morphology^4,5, presence of septa^6,7, thickness of the lateral wall⁸, residual bone quality⁹, vascular anatomy¹⁰, and the number of teeth to be replaced. Among those factors, RBH is the most crucial factor in the decision-making process, and several classification systems or criteria were devised to select proper sinus elevation techniques based on RBH^3,11,12,13.

Before placing implants at the posterior maxilla, clinicians find the ideal implant position and designate the landmarks in cone-beam computed tomography (CBCT) scans that are needed to measure RBH, horizontal bone width, and size of the vertical defect. Based on the preoperative evaluation, they determine the appropriate surgical approach. Fugazzotto proposed a protocol for selecting a surgical method that uses a mathematical formula for the RBH after reviewing treatment outcomes from the lateral or transcrestal approach with and without simultaneous implant placement¹². Chiapasco et al. categorized maxillary alveolar bone defects into nine classes based on the width and height of the residual ridge and the relationship between the arches³. Unfortunately, those classifications are rarely used in treatment planning because they are excessively complex and detailed, and evaluation results are inconsistent among clinicians. The ABC sinus augmentation classification proposed by Hom–Lay Wang is a widely used, straightforward approach to treatment planning in the posterior maxilla¹³. The method broadly categorizes the edentulous posterior maxilla into three classes, A, B, and C, based on the location of the sinus floor relative to the alveolar bone crest. Then, it subdivides the classes based on amount of horizontal or vertical ridge resorption and suggests appropriate treatment options¹³.

Deep learning methods are widely used for detection^14,15,16, classification^17,18,19, segmentation^20,21,22, and enhancement^23,24 of medical and dental images. Deep learning has proved its worth in periodontology, including studies to identify periodontally compromised teeth²⁵, detect alveolar bone loss^26,27, and evaluate the severity of periodontitis^17,28. In dental implant surgery, deep learning can assist clinicians by localizing critical anatomic structures such as the mandibular canals²⁹ or measuring the ridge width and height from CBCT scans³⁰. A recent study used deep learning to automatically detect missing teeth and to label the exact tooth number on CBCT images³¹. These achievements can save clinicians time and effort and increase the accuracy of implant surgery planning. In addition, deep learning can serve as validation for treatment outcomes by predicting implant stability or failure^32,33.

To date, most deep learning research on treatment planning for implant placement has been limited to specific basic tasks, such as segmenting anatomic structures²⁹ or identifying edentulous sites³¹. As far as we know, no previous studies have attempted to apply deep learning to surgical planning for maxillary sinus floor augmentation in implant surgery. In this study, we developed an automatic end-to-end method that used deep learning to replace the time- and labor-consuming process of determining the surgical approach for implant placement at the posterior maxilla.

We hypothesized that a deep learning model would be able to automatically determine the appropriate treatment plan of the maxillary posterior edentulous region in CBCT images according to the ABC sinus augmentation classification in implant placement¹³. Therefore, our objective in this study was to automatically classify surgical plans for maxillary sinus augmentation in implant placement at the maxillary posterior edentulous region using a 3D distance-guided network (SinusC-Net) that consisted of two stages: a volumetric regression network for anatomical landmark detection and a 3D distance-guided network for classifying the treatment plan. Our main contributions are as follows: (1) we applied convolutional long short-term memory (convLSTM)^34,35, multi-scale inputs (MSI)³⁶, and deep supervision (DS)^37,38 in the volumetric regression network to accurately predict the locations of anatomical landmarks, and (2) we used the distance relationships among the predicted anatomical landmarks in 3D CBCT images as feature information that the 3D distance-guided network could use to determine the treatment plan.

Materials and methods

Data acquisition and preparation

This study was approved by the Institutional Review Board of Seoul National University Dental Hospital (No. ERI18001), and all data collection and experiments were conducted in compliance with relevant guidelines and regulations. The CBCT images were obtained from 133 patients (61 females and 72 males; mean age 62.65 ± 10.64 years) who visited Seoul National University Dental Hospital for implant treatment between November 2018 and March 2022. We included the patients’ CBCT images where the single tooth or two and three adjacent teeth were missing in the posterior maxilla, and those with the maxillary posterior edentulous area of which extraction sockets showed completed healing. We excluded the images with impacted third molars in the maxilla, anatomical abnormalities, or a history of surgery in the maxillary sinus, and those with invasive sinus pathology that required referral to an otolaryngologist, such as those with a likelihood of malignancy, opacifications exceeding $50{\%}$ of the sinus volume, air-fluid levels, or air bubbles³⁹. Images of poor-quality due to severe artifacts from implants or metal restorations, and of a radiographic stent in place were also excluded.

We acquired patient data from three different CBCT systems. The CBCT images of 45, 45, and 43 patients were obtained using a Dinnova3 (HDXWILL, Seoul, Korea), CS 9600 (Carestream Health, Inc., Rochester, NY, USA), and CS 9300 (Carestream Health, Inc.), respectively. The CBCT images from the Dinnova3 were obtained under 100 kVp and 9 mA with a voxel size of 0.3 × 0.3 × 0.3 mm³, dimensions of 670 × 670 pixels, and 16-bit depth; those from the CS 9600 used conditions of 90 kVp and 8 mA with a voxel size of 0.15 × 0.15 × 0.15 mm³, dimensions of 1067 × 1067 pixels, and 16-bit depth; those from the CS 9300 used 80 kVp and 8 mA with a voxel size of 0.25 × 0.25 × 0.25 mm³, dimensions of 669 × 669 pixels, and 16-bit depth. We used cropped images of 256 × 256 × 250 pixels that were centered at the maxillary and mandibular regions. For deep learning, we prepared 72 volumes for the training dataset, 8 for the validation dataset, and 53 for the test dataset.

Sinus augmentation classification and anatomical landmarks for classification

The ABC sinus augmentation classification method broadly divides the maxillary sinus into three classes (A, B, and C) based on residual bone height for deciding the surgical approach of maxillary sinus augmentation in implant placement at the posterior maxilla¹³. Class A represents abundant bone with a height greater than 10 mm below the sinus floor that allows proper implant placement. Class B indicates barely sufficient bone, with 6–9 mm of bone height beneath the sinus floor. Class C indicates compromised bone, with bone height of 5 mm or less below the sinus floor¹³. Cases that require additional grafting techniques along with sinus elevation are divided into sub-classifications: division h for horizontal defect, division v for vertical defect, and division c for combined defect¹³.

We modified the original ABC classification method into the simplified one consisting of five surgical approaches based on the study-specific criteria (Table 1) to effectively train the deep learning model, considering that all the sub-classifications required the guided bone regeneration (GBR). Three dentists with 2, 3, and 6 years of clinical experience evaluated and classified the 133 CBCT images according to our simplified method at the Department of Periodontology at Seoul National University Dental Hospital. The ground-truth class for each image was determined using a majority vote. After we determined the ground-truth classes, we calculated the Fleiss kappa value to evaluate inter-rater agreement and reliability⁴⁰.

Table 1 The modified ABC classification for five surgical approaches to maxillary sinus augmentation.

Full size table

The anatomical landmarks used in automatically classifying the maxillary sinus for the surgical approaches by deep learning consisted of the alveolar bone crest (AC), maxillary sinus floor (SF), medial point of the horizontal bone width (MH), lateral point of the horizontal bone width (LH), and adjacent cementoenamel junction (CEJ) (Fig. 1). One periodontist with 6 years of clinical experience annotated the landmarks on the CBCT volumes using a software (3D Slicer for Windows 10, Version 4.10.2; MIT, Massachusetts, USA)⁴¹.

Overall framework for the deep learning model (SinusC-Net)

The proposed framework for the deep learning model (SinusC-Net) consisted of two stages of detection and classification according to the modified ABC classification on CBCT images (Fig. 2). An overview of the SinusC-Net framework is presented in Fig. 2. In the first stage of anatomical landmark detection, the five landmarks were automatically detected in CBCT images using a cascaded volumetric regression network (D-Net) that relied on the 3D heatmap approach. The D-Net produced 3D heatmaps of the landmarks^42,43, which were used as inputs alongside with the original CBCT images in the next stage. In the second stage of classification, a 3D distance-guided network (C-Net) uses the simplified ABC classes to automatically classify the CBCT images into the five surgical approaches.

To improve the landmark detection performance, we used 3D Gaussian heatmaps centered at the annotated landmark as labels (Fig. 1). Heatmap regression methods are widely used in recent advances in deep learning models for landmark localization tasks^42,43. We used the Gaussian function to encode the distance matrix to obtain a normalized distance matrix as the 3D heatmap $H=\mathrm{exp}\left(-\frac{{D}^{2}}{2{\sigma }^{2}}\right)$, where $\sigma$ was a hyper-parameter for the width of the heatmap. In practice, since the 3D heatmap represented the probability of each landmark at each location, it could be considered a soft segmentation label, different from the labels used in general segmentation tasks. The soft segmentation allowed the network to robustly learn local geometric features from the neighboring points around each landmark⁴⁴.

Anatomical landmark detection network in SinusC-Net

We designed a cascaded network (D-Net) for accurate landmark detection using a coarse-to-fine learning strategy (Fig. 2). The main structure of the D-Net consists of 3D convolution blocks, 3D Max-pooling, 3D transposed convolution blocks, and skip connections. The 3D convolutional block includes 3 × 3 × 3 convolution, batch normalization, and rectified linear unit activation layers. Max-pooling and a 3D transposed convolution block were used for down-sampling and up-sampling, respectively, with a stride of two. Skip connections were used three times between an encoder and a decoder. The number of feature channels increased from 8 to 64 at each level of the layer. The sub-modules of D-Net consist of MSIs, convLSTM, and DS. To mitigate the loss of spatiotemporal information from the pooling layers, the MSI downsizes from the input volume using 2 × 2 × 2 average pooling, and it is concatenated at each level of the encoder²⁹. The convLSTM is used to capture spatiotemporal information for learning the 3D anatomical structures of the input data²⁹. The DS provides direct feedback to the hidden layers instead of only the final output layer by merging the feature maps from the different decoder layers⁴⁵.

Using the coarse-to-fine learning strategy, the coarse D-Net was trained to detect coarse locations of the landmarks from CBCT volumes of 256 × 256 × 250 pixels. After training, the coarse D-Net produced coarse 3D heatmaps of the landmarks from the CBCT volumes. The volume for each landmark was then cropped into a volume of interest (VOI) patch of 128 × 128 × 128 pixels centered on the coarsely predicted location. For more accurate landmark detection, we trained the fine D-Net to regress the fine locations of the landmarks from the VOI patches. The coarse and fine D-Net were trained using cross-entropy (CE) loss with the DS approach. The CE was calculated as

$$CE\left(H,P\right)=-\frac{1}{n}\sum_{i}^{n} \left({H}_{i}\mathrm{log}\left({P}_{i}\right)+\left(1-{H}_{i}\right)\mathrm{log}\left(1-{P}_{i}\right)\right)$$

where $n$ is the number of pixels, $H$ is a 3D heatmap of the ground truth, and $P$ is a predicted 3D heatmap. To improve training stability and detection performance, the DS was used at each level of the decoder. In that way, the final loss function $(FL)$ based on the DS approach was determined as

$$FL\left(H,P\right)=C{E}_{1}\left(H,P\right)+C{E}_{2}\left(H,P\right)+C{E}_{3}\left(H,P\right)$$

where $C{E}_{l}$ is the loss calculated from each side output at the level of layer $l$ in the decoder. After training, the D-Net produced 3D heatmaps of the landmarks that were then used with the CBCT images as inputs for the C-Net.

Sinus augmentation classification network of SinusC-Net

To determine the final classification, we designed the classification network (C-Net) using two-channel inputs, 3D convolution blocks, 3D Max-pooling, and distance priors (Fig. 2). The two-channel inputs of the original CBCT image and the corresponding 3D heatmap predicted by the D-Net were used to simultaneously learn both the anatomical structure and the geometric relationships between landmarks. The 3D convolutional block has 3 × 3 × 3 convolution, batch normalization, and rectified linear unit activation layers. 3D Max-pooling was used to down-sample the feature maps. To improve the distance awareness ability of the C-Net, we used three distance priors, which were the absolute distances between AC and SF, MH and LH, and AC and CEJ on the landmarks predicted by the D-Net. The three distance priors were concatenated with the flattened feature maps after 3D global average pooling. The C-Net was trained using the CE loss function defined above.

The networks were trained by the RMSprop optimizer for 300 epochs with an initial learning rate of 10⁴, which decreased by a factor of $0.5$ when the validation stopped decreasing for 25 epochs. We used a batch size of 8 and a single GPU with 24 Gb RAM. All networks were implemented in Python3 using a Keras framework with a Tensorflow backend. The data augmentation was performed with random rotation (in the range of − 25$^\circ$ to 25$^\circ$), re-scaling (in the range of – 10 to 20%), and modification of the intensity by randomly adjusting the brightness, contrast, saturation, and hue.

Performance evaluation for detection and classification

We evaluated the predictive performance of the landmark localization by measuring the mean radial error (MRE) and the successful detection rate (SDR)⁴⁶. The MRE was calculated as

$$\mathrm{MRE}={\sum }_{i}^{n}({R}_{i}/n)$$

where n is the number of data points, and R is the Euclidean distance between the center of ground truth and the predictive result. The SDR was calculated as the ratio at which the distance between the predicted and ground truth landmark was within a given distance (2.0 mm, 2.5 mm, 3.0 mm, and 3.5 mm).

To evaluate the classification performance, we calculated the accuracy ($ACC=\frac{TP+TN}{TP+TN+FP+FN}$), sensitivity ($Sens=\frac{TP}{TP+FN}$), specificity ($Spec=\frac{TN}{FP+TN}$), and area under the receiver operating characteristic (ROC) curve (AUC)⁴⁷, where TP, TN, FP, and FN denote the true positive, true negative, false positive, and false negative, respectively.

Ablation studies were also conducted to evaluate impacts on the detection and classification performances by use of different modules (MSI, convLSTM, and DS) in detection network (D-Net) of SinusC-Net, and to evaluate impacts on the classification performance by use of distance priors in classification network (C-Net) of the SinusC-Net.

Statistical analysis

To evaluate the inter-observer agreement in determining the ground truth of the ABC sinus augmentation classification from 133 CBCT scans, we applied Fleiss' kappa statistics using SPSS version 26 (SPSS Inc., IBM Corp.; Armonk, NY, USA). The statistical significance level was set to 0.05. The final classification results of the SinusC-Net were represented as a confusion matrix, from which we computed accuracy, sensitivity, and specificity. In addition, we calculated the AUC from the ROC curves, a graph that presents the true-positive rate in relation to false-positive rate by varying the discrimination threshold. We utilized Python (Python Software Foundation, Version 3.6.1; Wilmington, DE, USA) for the visualization and computation of the classification performance.

Results

After we generated the ground truth classes according to the modified ABC classification, Fleiss kappa values were calculated to evaluate the consistency and reliability between observers (Table 2). The kappa value indicates the level of agreement: poor (0–0.2), fair (0.21–0.4), moderate (0.41–0.6), good (0.61–0.8), and almost perfect (0.81–1.0)⁴⁸. The overall kappa value was 0.72 (P < 0.00), indicating that the three raters had good general consistency in determining the ground truth.

Table 2 The Fleiss kappa values for evaluating consistency and reliability between observers when determining the ground truth according to the modified ABC classification.

Full size table

The anatomical landmark detection performance by the volumetric regression network (D-Net) of SinusC-Net was evaluated using the 53 CBCT volumes of the test dataset. The predicted and ground truth landmarks are represented with green and red dots, respectively, in Fig. 3. As can be seen in the figure, the D-Net of SinusC-Net performed well for landmark localization (Fig. 3). The SDR and MRE of the detection performance for the five anatomical landmarks in the test dataset are shown in Table 3. The mean MRE was 0.87 mm, and the SDR for 2 mm or lower was 95.47% for overall landmark detection. Among the anatomical landmarks, the lateral point of the horizontal bone width had the highest level of accuracy, with an MRE of 0.82 mm. On the other hand, the adjacent CEJ showed the lowest level of accuracy, with an MRE of 0.92 mm. The cumulative distribution curves of MRE show that the D-Net had high performance for all landmarks (Fig. 4). Therefore, the volumetric regression network of SinusC-Net achieved high performances for anatomical landmark detection, generally.

Table 3 The detection performance of five anatomical landmarks in terms of SDR and MRE by the detection network of SinusC-Net.

Full size table

The classification performance by SinusC-Net was evaluated using the same 53 CBCT volumes of the test dataset. The confusion matrix presents the overall classification results for the five surgical approaches predicted by SinusC-Net (Fig. 5). The SinusC-Net achieved the performance of mean accuracy (ACC) of 0.97, sensitivity (Sens) of 0.92, specificity (Spec) of 0.98, and AUC of 0.95 (Table 4). Among the classes, Class A had the highest accuracy with an ACC of 1.00, Sens of 1.00, Spec of 1.00, and AUC of 1.00; and Class C had the lowest accuracy, with an ACC of 0.94, Sens of 0.92, Spec of 0.98, and AUC of 0.94. Thus, SinusC-Net showed the highest classification accuracy for Class A and the lowest for Class C, and its overall performance in classifying surgical approaches was high.

Table 4 The classification performance in terms of accuracy, sensitivity, specificity, and AUC by SinusC-Net.

Full size table

The ablation study testing the use of different modules in the D-Net of SinusC-Net revealed that the network using both MSI and convLSTM was more accurate than that using only convLSTM, and the network using all three modules (convLSTM, MSI, and DS), which is the complete SinusC-Net, showed the highest performance in both detection and classification (Table 5). Furthermore, the ROC curve of the network using all three modules showed the highest AUC in classification (Fig. 6). Those results demonstrate the effectiveness of using convLSTM, MSI, and DS together in the detection and classification network of SinusC-Net. Additionally, the classification performance of the network using the distance priors was more accurate than that of the network not using them (Table 6), which demonstrates the effectiveness of applying distance priors for classification in SinusC-Net.

Table 5 Results of the ablation study using different modules in the detection network (D-Net) of SinusC-Net.

Full size table

Table 6 Results of the ablation study on the use of distance priors in the classification network (C-Net) of SinusC-Net.

Full size table

Discussion

In this study, we proposed a deep learning network with 3D distance-guidance (SinusC-Net) that can automatically classify surgical plans for sinus augmentation at the maxillary posterior edentulous region in CBCT images. SinusC-Net consists of a volumetric regression network for anatomical landmark detection (D-Net) and a 3D distance-guided network for classifying treatment plans (C-Net). The cascaded D-Net uses a coarse-to-fine learning strategy with MSI, convLSTM, and DS modules and demonstrated high accuracy in detecting the 3D anatomical landmarks used to determine treatment plans. For overall landmark detection by the D-Net, the MRE was 0.87 mm, and the SDR for 2 mm or lower was 95.47% of the time. The C-Net that used distance information between the landmarks as priors for classification demonstrated high classification accuracy that was consistent with the ground-truth results determined by clinicians. The mean accuracy, sensitivity, specificity, and AUC were 0.97, 0.92, 0.98, and 0.95, respectively, for the overall classification performance by SinusC-Net. We demonstrated the ability of the proposed deep learning network (SinusC-Net) to accurately and automatically classify surgical approaches for sinus floor augmentation in implant placement at the maxillary posterior edentulous region.

To date, many studies in dentistry have used deep learning models for automatic landmark detection to assist in orthodontic treatment or orthognathic surgery, mainly focusing on detecting landmarks on 2D lateral cephalometric images^49,50,51. The MREs for landmark detection on 2D images by deep learning mostly ranged from 0.9 to 1.53 mm, and an SDR for 2 mm or lower from 77.01 to 82.43% of cases^49,50,51. The accuracies of landmark detection on 3D images have been much lower than those on 2D images, with MREs of 3.63 to 5.785 mm^52,53, because a large amount of volume data and high-performance hardware were required for the algorithms to learn 3D anatomical structures^52,53. In the present study, to increase the landmark detection accuracy on 3D images, we used a cascaded volumetric regression network with a coarse-to-fine learning strategy. The first network produced coarse locations of the landmarks from whole CBCT volumes, and then smaller cropped VOI patches centered on the coarsely predicted landmarks were used for fine detection of the locations in the following network. Furthermore, our approach stands out from other methods because it incorporates convLSTM, MSI, and DS in constructing the detection network architecture. The use of those techniques in SinusC-Net enabled more accurate detection of 3D landmarks by learning the 3D anatomical structures of the input data through the convLSTM, and enhancing supervision of scaled features through MSI and DS.

The classification network used multi-channel inputs of the original CBCT image and the corresponding 3D heatmap predicted by the previous detection network to simultaneously learn both the anatomical structure and the geometric relationships between landmarks. Furthermore, to improve the distance awareness in the classification, three distance priors representing the absolute distances between the predicted landmarks were concatenated with the flattened feature maps after 3D feature encoding. Our results show the effectiveness of applying the distance priors to classification; SinusC-Net demonstrated high classification accuracy despite the small amount of data available. The distance-guided 3D network of the SinusC-Net had advantages in classification of surgical approaches according to the modified ABC classification, which were simultaneous learning of semantic relationships between anatomical structure and landmark distribution by using multi-channel inputs of the original CBCT image and the 3D heatmap of landmarks, and improving the distance awareness ability of the network using distance priors of the absolute distances between landmarks.

Although the deep learning model showed high classification accuracy for predicting surgical plans, the confusion matrix indicated that some of the predictions differed from the ground truth. The deep learning model showed perfect agreement between the predicted and actual classes in Class A cases, but disagreement occurred in other classes. However, that confusion was consistent with the clinicians' tendency in classifying the maxillary sinus when they were determining the ground truth. The individual kappa values indicated that the three clinicians showed high agreement for Class A and lower agreement for the other classes. In other words, the deep learning model shared the same difficulty with clinicians in classifying the maxillary sinus. Considering the deep learning model would also confuse with classes that would be ambiguous for clinicians, we modified the original ABC sinus augmentation classification by combining the sub-classifications when training the model. From a clinician’s point of view, it is often difficult to distinguish between horizontal, vertical, and combined bone deficiencies. Although we simplified the original classification, the recommended treatment approaches were the same because all the sub-classifications required GBR.

This study has some limitations. First, the deep learning model predicted classes consistent with the ground truth in cases in which the residual bone was clearly abundant, barely sufficient, or compromised, but it predicted incorrect classes in borderline cases. In cases of the residual bone on the borderline of the classes, clinicians choose an appropriate surgical approach by considering other factors, such as the anatomy of the maxillary sinus, patients' systemic conditions, and their own preference for less invasive techniques^54,55. A more advanced deep learning model based on multi-modal inputs needs to be developed to incorporate data other than the residual bone volume in CBCT images so that it can provide more suitable treatment plans for borderline cases. Second, although we demonstrated that the deep learning model provided excellent outcomes in classifying surgical plans for maxillary sinus augmentation, the performance of our model needs to be validated for generalization and robustness. We used a relatively small sample of 133 CBCT images from three CBCT systems for our study due to our strict criteria for selecting images and the large size of data. Although we collected data from multiple CBCT machines to ensure model generalizability, we need to train and test the model using larger datasets from various organizations to improve its generalization.

Applying deep learning in treatment planning can help dental practitioners by enabling faster and more accurate treatment planning from precise measurements and automatic analyses of CBCT images. Deep learning techniques are particularly beneficial for treatment planning in digital dentistry because they can facilitate computer-aided diagnosis and design procedures with little to no manual intervention, producing significant advances in this field. Integrating deep learning research on surgical planning in the posterior maxilla with investigations on dental implant placement could automate the fabrication of surgical guides and enable optimal implant positioning in complicated cases.

Conclusion

In this study, we proposed a deep learning network with 3D distance-guidance (SinusC-Net) to automatically classify surgical plans for maxillary sinus floor augmentation at the maxillary posterior edentulous region in CBCT images. SinusC-Net demonstrated accurate detection of 3D anatomical landmarks and automatic and accurate classification of surgical approaches for sinus floor augmentation in the maxillary posterior edentulous region during implant planning. These deep learning techniques can help dental practitioners by enabling faster and more accurate treatment planning with precise measurements and automatic analysis of CBCT images in digital dentistry.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available due to restriction by the Institutional Review Board of Seoul National University Dental Hospital to protect patient privacy, but they are available from the corresponding author upon reasonable request. Please contact the corresponding author for any commercial implementation of our research.

References

Esposito, M., Felice, P. & Worthington, H. V. Interventions for replacing missing teeth: Augmentation procedures of the maxillary sinus. Cochrane Database Syst. Rev. 5, CD008379. https://doi.org/10.1002/14651858.CD008397.pub2 (2014).
Article Google Scholar
Lundgren, S. et al. Sinus floor elevation procedures to enable implant placement and integration: Techniques, biological aspects and clinical outcomes. Periodontol. 2000(73), 103–120 (2017).
Article Google Scholar
Chiapasco, M., Zaniboni, M. & Rimondini, L. Dental implants placed in grafted maxillary sinuses: A retrospective analysis of clinical outcome according to the initial clinical situation and a proposal of defect classification. Clin. Oral Implants Res. 19, 416–428 (2008).
Article PubMed Google Scholar
Chan, H.-L., Monje, A., Suarez, F., Benavides, E. & Wang, H.-L. Palatonasal recess on medial wall of the maxillary sinus and clinical implications for sinus augmentation via lateral window approach. J. Periodontol. 84, 1087–1093 (2013).
Article PubMed Google Scholar
de Souza Nunes, L. S., Bornstein, M. M., Sendi, P. & Buser, D. Anatomical characteristics and dimensions of edentulous sites in the posterior maxillae of patients referred for implant therapy. Int. J. Periodont. Restor. Dent. 33, 337–345 (2013).
Article Google Scholar
Kim, M.-J. et al. Maxillary sinus septa: Prevalence, height, location, and morphology. A reformatted computed tomography scan analysis. J. Periodontol. 77, 903–908 (2006).
Article PubMed Google Scholar
Wen, S.-C., Chan, H.-L. & Wang, H.-L. Classification and management of antral septa for maxillary sinus augmentation. Int. J. Periodont. Restor. Dent. 33, 508–517 (2013).
Article Google Scholar
Kang, S.-J. et al. Anatomical structures in the maxillary sinus related to lateral sinus elevation: A cone beam computed tomographic analysis. Clin. Oral Implants Res. 24, 75–81 (2013).
Article MathSciNet PubMed Google Scholar
Misch, C. Bone classification, training keys to implant success. Dent. Today 8, 39–44 (1989).
CAS PubMed Google Scholar
Rosano, G., Taschieri, S., Gaudy, J.-F., Weinstein, T. & Del Fabbro, M. Maxillary sinus vascular anatomy and its relation to sinus lift surgery. Clin. Oral Implants Res. 22, 711–715 (2011).
Article PubMed Google Scholar
Jensen, O. T., Shulman, L. B., Block, M. S. & Iacono, V. Report of the sinus consensus conference of 1996. Int. J. Oral Maxillofac. Implants 13, 11–45 (1998).
PubMed Google Scholar
Fugazzotto, P. A. Augmentation of the posterior maxilla: A proposed hierarchy of treatment selection. J. Periodontol. 74, 1682–1691 (2003).
Article PubMed Google Scholar
Wang, H.-L. & Katranji, A. ABC sinus augmentation classification. Int. J. Periodont. Restor. Dent. 28, 382–389 (2008).
Google Scholar
Lee, J.-H., Kim, D.-H., Jeong, S.-N. & Choi, S.-H. Diagnosis and prediction of periodontally compromised teeth using a deep learning-based convolutional neural network algorithm. J. Periodontal Implant Sci. 48, 114–123 (2018).
Article PubMed PubMed Central Google Scholar
Ahn, J. M. et al. A deep learning model for the detection of both advanced and early glaucoma using fundus photography. PLoS One 13, e0207982. https://doi.org/10.1371/journal.pone.0211579 (2018).
Article PubMed PubMed Central Google Scholar
Phan, S., Satoh, S., Yoda, Y., Kashiwagi, K. & Oshika, T. Evaluation of deep convolutional neural networks for glaucoma detection. Jpn. J. Ophthalmol 63, 276–283. https://doi.org/10.1007/s10384-019-00659-6 (2019).
Article PubMed Google Scholar
Chang, H.-J. et al. Deep learning hybrid method to automatically diagnose periodontal bone loss and stage periodontitis. Sci. Rep. 10, 7531. https://doi.org/10.1038/s41598-020-64509-z (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Shen, W. et al. Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification. Pattern Recognit. 61, 663–673. https://doi.org/10.1016/j.patcog.2016.05.029 (2017).
Article ADS Google Scholar
Kumar, A., Kim, J., Lyndon, D., Fulham, M. & Feng, D. An ensemble of fine-tuned convolutional neural networks for medical image classification. IEEE J. Biomed. Health Inform. 21, 31–40. https://doi.org/10.1109/JBHI.2016.2635663 (2016).
Article PubMed Google Scholar
Yu, Y. et al. Deep transfer learning for modality classification of medical images. Information 8, 91. https://doi.org/10.3390/info8030091 (2017).
Article Google Scholar
Cheng, J. Z. et al. Computer-aided diagnosis with deep learning architecture: Applications to breast lesions in US images and pulmonary nodules in CT scans. Sci. Rep. 6, 24454. https://doi.org/10.1038/srep24454 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Christ, P. F. et al. Automatic liver and tumor segmentation of CT and MRI volumes using cascaded fully convolutional neural networks. https://doi.org/10.48550/arXiv.1702.05970 (arXiv preprint) (2017).
Morgan, N. et al. Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images. Sci. Rep. 12, 7523. https://doi.org/10.1038/s41598-022-11483-3 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Yong, T.-H. et al. QCBCT-NET for direct measurement of bone mineral density from quantitative cone-beam CT: A human skull phantom study. Sci. Rep. 11, 15083. https://doi.org/10.1038/s41598-021-94359-2 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Heo, M.-S. et al. Artificial intelligence in oral and maxillofacial radiology: What is currently possible?. Dentomaxillofac. Radiol. 50, 20200375. https://doi.org/10.1259/dmfr.20200375 (2021).
Article PubMed Google Scholar
Kim, J., Lee, H.-S., Song, I.-S. & Jung, K.-H. DeNTNet: Deep neural transfer network for the detection of periodontal bone loss using panoramic dental radiographs. Sci. Rep. 9, 17615. https://doi.org/10.1038/s41598-019-53758-2 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Krois, J. et al. Deep learning for the radiographic detection of periodontal bone loss. Sci. Rep. 9, 8495. https://doi.org/10.1038/s41598-019-44839-3 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, C.-T. et al. Use of the deep learning approach to measure alveolar bone level. J. Clin. Periodontol. 49, 260–269 (2022).
Article CAS PubMed Google Scholar
Jeoun, B.-S. et al. Canal-Net for automatic and robust 3D segmentation of mandibular canals in CBCT images using a continuity-aware contextual network. Sci. Rep. 12, 13460. https://doi.org/10.21203/rs.3.rs-1537019/v1 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Kurt Bayrakdar, S. et al. A deep learning approach for dental implant planning in cone-beam computed tomography images. BMC Med. Imaging 21, 86. https://doi.org/10.1186/s12880-021-00618-z (2021).
Article PubMed Google Scholar
do Nascimento Gerhardt, M. et al. Automated detection and labelling of teeth and small edentulous regions on cone-beam computed tomography using convolutional neural networks. J. Dent. 122, 104139 (2022).
Article Google Scholar
Huang, Z. et al. The construction and evaluation of a multi-task convolutional neural network for a cone-beam computed-tomography-based assessment of implant stability. Diagnostics (Basel) 12, 2673. https://doi.org/10.3390/diagnostics12112673 (2022).
Article MathSciNet PubMed Google Scholar
Huang, N. et al. Predicting the risk of dental implant loss using deep learning. J. Clin. Periodontol. 49, 872–883 (2022).
Article CAS PubMed Google Scholar
Yin, P., Yuan, R., Cheng, Y. & Wu, Q. Deep guidance network for biomedical image segmentation. IEEE Access 8, 116106–116116. https://doi.org/10.1109/ACCESS.2020.3002835 (2020).
Article Google Scholar
Shi, X. et al. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Adv. Neural Inf. Process. Syst. https://doi.org/10.48550/arXiv.1506.04214 (2015).
Article Google Scholar
Fu, H. et al. Joint optic disc and cup segmentation based on multi-label deep network and polar transformation. IEEE Trans. Med. Imaging 37, 1597–1605. https://doi.org/10.1109/TMI.2018.2791488 (2018).
Article ADS PubMed Google Scholar
Lou, J. et al. Automatic fetal brain extraction using multi-stage U-net with deep supervision. Mach. Learn. Med. Imaging 11861, 592–600 (2019).
Article Google Scholar
Zeng, G. et al. 3D U-net with multi-level deep supervision: Fully automatic segmentation of proximal femur in 3D MR images. Mach. Learn. Med. Imaging 10541, 274–282 (2017).
Article Google Scholar
Janner, S. F. et al. Sinus floor elevation or referral for further diagnosis and therapy: A comparison of maxillary sinus assessment by ENT specialists and dentists using cone beam computed tomography. Clin. Oral Implants Res. 31, 463–475 (2020).
Article PubMed Google Scholar
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychol. Bull. 76, 378–381 (1971).
Article Google Scholar
Fedorov, A. et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn. Reason. Imaging 30, 1323–1341 (2012).
Article Google Scholar
Tang, Z., Peng, X., Li, K. & Metaxas, D. N. Towards efficient U-nets: A coupled and quantized approach. IEEE Trans. Pattern Anal. Mach. Intell. 42, 2038–2050. https://doi.org/10.1109/TPAMI.2019.2907634 (2019).
Article PubMed Google Scholar
Huang, X., Deng, W., Shen, H., Zhang, X. & Ye, J. PropagationNet: Propagate points to curve to learn structure information. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7265–7274. https://doi.org/10.48550/arXiv.2006.14308 (2020).
Wang, Y., Cao, M., Fan, Z. & Peng, S. Learning to detect 3D facial landmarks via heatmap regression with graph convolutional network. Proc. AAAI Conf. Artif. Intell. 36, 2595–2603. https://doi.org/10.1609/aaai.v36i3.20161 (2022).
Article Google Scholar
Sahayam, S., Nenavath, R., Jayaraman, U. & Prakash, S. Brain tumor segmentation using a hybrid multi resolution U-Net with residual dual attention and deep supervision on MR images. Biomed. Signal Process. Control 78, 103939 (2022).
Article Google Scholar
Kim, M.-J. et al. Automatic cephalometric landmark identification system based on the multi-stage convolutional neural networks with CBCT combination images. Sensors (Basels) 21, 505. https://doi.org/10.3390/s21020505 (2021).
Article ADS Google Scholar
Yang, X. et al. Multi-modality relation attention network for breast tumor classification. Comput. Biol. Med. 150, 106210. https://doi.org/10.1016/j.compbiomed.2022.106210 (2022).
Article MathSciNet Google Scholar
Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977).
Article CAS PubMed MATH Google Scholar
Bulatova, G. et al. Assessment of automatic cephalometric landmark identification using artificial intelligence. Orthod. Craniofac. Res. 24, 37–42 (2021).
Article PubMed Google Scholar
Lee, J.-H., Yu, H.-J., Kim, M.-J., Kim, J.-W. & Choi, J. Automated cephalometric landmark detection with confidence regions using Bayesian convolutional neural networks. BMC Oral Health 20, 270. https://doi.org/10.1186/s12903-020-01256-7 (2020).
Article CAS PubMed PubMed Central Google Scholar
Schwendicke, F. et al. Deep learning for cephalometric landmark detection: Systematic review and meta-analysis. Clin. Oral Investig. 25, 4299–4309. https://doi.org/10.1007/s00784-021-03990-w (2021).
Article PubMed PubMed Central Google Scholar
Yun, H. S., Jang, T. J., Lee, S. M., Lee, S.-H. & Seo, J. K. Learning-based local-to-global landmark annotation for automatic 3D cephalometry. Phys. Med. Biol. 65, 085018. https://doi.org/10.1088/1361-6560/ab7a71 (2020).
Article PubMed Google Scholar
Ma, Q. et al. Automatic 3D landmarking model using patch-based deep neural networks for CT image of oral and maxillofacial surgery. Int. J. Med. Robot. Comput. Assist. Surg. 16, 2093. https://doi.org/10.1002/rcs.2093 (2020).
Article Google Scholar
Van Den Bergh, J. P., Ten Bruggenkate, C. M., Disch, F. J. & Tuinzing, D. B. Anatomical aspects of sinus floor elevations. Clin. Oral Implants Res. 11, 256–265 (2000).
Article PubMed Google Scholar
Block, M. S. Improvements in the crestal osteotome approach have decreased the need for the lateral window approach to augment the maxilla. J. Oral Maxillofac. Surg. 74, 2169–2181 (2016).
Article PubMed Google Scholar

Download references

Acknowledgements

This work was supported by a National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MSIT) (no. 2023R1A2C200532611). This work also supported by a Korea Medical Device Development Fund Grant by the Korean government (Ministry of Science and ICT; Ministry of Trade, Industry, and Energy; Ministry of Health and Welfare; Ministry of Food and Drug Safety) (Project Number: 1711194231, KMDF_PR_20200901_0011/1711174552, KMDF_PR_20200901_0147).

Author information

These authors contributed equally: In-Kyung Hwang and Se-Ryong Kang.

Authors and Affiliations

Department of Periodontology, School of Dentistry and Dental Research Institute, Seoul National University, Seoul, 03080, Republic of Korea
In-Kyung Hwang & Tae-Il Kim
Department of Biomedical Radiation Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, 08826, Republic of Korea
Se-Ryong Kang
Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, 08826, Republic of Korea
Su Yang & Won-Jin Yi
Department of Electronics and Information Engineering, Hansung University, Seoul, 02876, Republic of Korea
Jun-Min Kim
Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, Republic of Korea
Jo-Eun Kim, Kyung-Hoe Huh, Sam-Sun Lee, Min-Suk Heo & Won-Jin Yi

Authors

In-Kyung Hwang
View author publications
Search author on:PubMed Google Scholar
Se-Ryong Kang
View author publications
Search author on:PubMed Google Scholar
Su Yang
View author publications
Search author on:PubMed Google Scholar
Jun-Min Kim
View author publications
Search author on:PubMed Google Scholar
Jo-Eun Kim
View author publications
Search author on:PubMed Google Scholar
Kyung-Hoe Huh
View author publications
Search author on:PubMed Google Scholar
Sam-Sun Lee
View author publications
Search author on:PubMed Google Scholar
Min-Suk Heo
View author publications
Search author on:PubMed Google Scholar
Won-Jin Yi
View author publications
Search author on:PubMed Google Scholar
Tae-Il Kim
View author publications
Search author on:PubMed Google Scholar

Contributions

I.-K.H.: contributed to the conception and design; data acquisition, analysis, and interpretation; and drafted and critically revised the manuscript. S.-R.K.: contributed to the conception and design; data acquisition, analysis, and interpretation; and drafted and critically revised the manuscript. S.Y.: contributed to the conception and design and drafted and critically revised the manuscript. J.-M.K.: contributed to conception and design, data interpretation, and drafted the manuscript. J.-E.K.: contributed to the conception and design, data interpretation, and drafted the manuscript. K.-H.H.: contributed to the conception and design, data interpretation, and drafted the manuscript. S.-S.L.: contributed to the conception and design, data interpretation, and drafted the manuscript. M.-S.H.: contributed to the conception and design, data interpretation, and drafted the manuscript. W.-J.Y.: contributed to the conception and design; data acquisition, analysis, and interpretation; and drafted and critically revised the manuscript. T.-I.K.: contributed to the conception and design; data acquisition, analysis, and interpretation; and drafted and critically revised the manuscript. “All authors gave their final approval and agreed to be accountable for all aspects of the work”.

Corresponding authors

Correspondence to Won-Jin Yi or Tae-Il Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hwang, IK., Kang, SR., Yang, S. et al. SinusC-Net for automatic classification of surgical plans for maxillary sinus augmentation using a 3D distance-guided network. Sci Rep 13, 11653 (2023). https://doi.org/10.1038/s41598-023-38273-9

Download citation

Received: 03 May 2023
Accepted: 06 July 2023
Published: 19 July 2023
DOI: https://doi.org/10.1038/s41598-023-38273-9