Abstract
Dental infections may result in severe health conditions when not diagnosed and responded to immediately. However, it is a difficult process that can take time and expertise to diagnose oral infections based on X-ray images. In this paper, a new method of dental caries classification based on the panoramic radiographic images is proposed, which is aimed at overcoming the class imbalance and weak anatomical differences. During the preprocessing stage, the clustering technique was used to form similar grouped data to balance the distribution of data, and the Sobel-Feldman edge technique was applied to emphasize critical features. MobileNetV2 and ShuffleNet models were also trained on the preprocessed set of data separately, but the classification ability was poor. A hybrid architecture was designed based on the combination of the strengths of the two models, so the level of precision increased. In a further effort to improve the performance of the model, Ant Colony Optimization (ACO) algorithm was incorporated to the hybrid framework. Addition of ACO made the classification highly accurate since it could perform an efficient global search and parameter tuning. The suggested ACO-enhanced hybrid approach showed better results with 92.67% accuracy than standalone networks which implies that the proposed model can be used on reliable and automated dental diagnosis.
Similar content being viewed by others
Introduction
Bacteria interacting with sugar creates tooth damage that leads to cavity formation. Tooth cavities spread widely throughout the population because more than 90% of individuals age 20 or older develop at least one cavity during their lifetimes1. During early detection, dentists use physical assessment techniques to examine patients. Medical personnel determine the extent of tissue damage through manual testing via periodontal probes. Each tooth receives manual probing which reveals periodontal disease severity while determining its progression by focusing on the most affected area. People experience pain along with discomfort during this complex diagnostic procedure2. Dental cavities become visible as distinct features when radiographic images are analyzed. Therefore, most dental professionals use X-ray images, such as intraoral and panoramic radiographs, as their primary diagnostic tool. Recent advancement in the field of artificial intelligence (AI) now serves as an efficient system for both dental caries classification and segmentation3. Deep learning (DL) applications now utilize new pattern recognition capabilities within medical imaging data. Recent studies focus on pattern recognition since dental cavity detection represents a fundamental pattern recognition task. Artificial neural networks (ANN), together with convolutional neural networks (CNN), enable fast and accurate dental problem evaluation4. Hybrid neural networks combine ANN and Deep neural networks (DNN) structures to extract hidden data attributes through stacked sparse autoencoders and refine these features using logistic regression classifiers for output enhancement. Using conventional DL methods can be very difficult since they require both strong computing power and a lot of memory, limiting how they are used on small devices like smartphones, embedded systems, and IoT gadgets. Due to this issue, people are searching for efficient neural network structures that give good accuracy and low computational cost. Therefore, models are required to meet both large processing needs and requirements for small sizes. One way to address this problem is with EfficientNets, which uses a compound scaling approach to create its models. Traditional models change depth, width, or resolution, while EfficientNet uses a series of fixed scaling coefficients to change these features properly. This method supports smoother running of applications in different computing environments because it improves how resources are managed5. Yet, traditional neural networks encounter various problems in optimization and adaptation, mainly in situations that involve specific domains. In order to solve these problems, the current study introduces a hybrid model that joins several efficient neural network designs with a nature-inspired optimization method. The suggested technique unites MobileNetV2 and ShuffleNet, recognized for belonging to the top lightweight CNNs, which are easy to use and economical. Also, the ACO algorithm is then applied to these models, since it is based on how ants forage and does particularly well in solving both combinatorial and parameter optimization tasks6. Merging the effectiveness of top classifiers with the ability of ACO to search over many instances, the new model seeks to improve the accuracy of classification while still being efficient. Since this hybrid framework works well on low-powered devices, it can be used in real cases like medical imaging, object detection and smart surveillance systems.
This work introduces two significant contributions:
-
Enhancing focus on crucial image features by utilizing MobileNetv2 and ShuffleNet for dental caries classification.
-
Demonstrating superior classification performance with a hybrid network using Ant colony optimizer.
The structure of the article is as follows: A detailed examination of research on caries classification and detection is given in “Literature”. “Methodology” section outlines the research design and techniques used in this research. The results of the evaluation using the suggested technique are shown in “Result analysis”, which also shows that models reached a noteworthy level of accuracy. The study’s major findings are finally summarized in “Conclusion”, along with their implications for further research in this area.
Literature
Machine learning (ML) is being used early on for the classification and identification of dental diseases. Especially in caries detection, Random Forest, Decision tree and Logistic Regression are commonly used ML algorithms7. For tasks related to image recognition, the supervised learning processes are applied. Adding large datasets of dental issues can better train ML8. Over the years, there is shift from ML to DL in medicine and healthcare since traditional ML algorithms (SVM and decision trees) are manual and may fail to detect important patterns due to the need for expertise. DL networks such as Generative adversarial networks (GANs), Recurrent neural networks (RNNs), Deep neural networks are being used for the recent studies9.
A U-Net model and three parallel ResNet50, ResNeXt101 and VGG19 encoders are used to allow the model to find patterns and details at many possible scales. This integration of various encoder networks means the system correctly and effectively identifies carious lesions, helping doctors to discover and address these problems at a very early stage10. A hybrid classification network is utilized for the detection and division of periodontal disease both at individual tooth-level and whole-patient-level. The new framework makes use of two models–H-Net and HC-Net–to label and sort healthy and caries effected teeth hierarchically. The Regions of Interest for periodontal issues are made clearer by using Class Activation Mapping (CAM) when visualizing X-rays11. DL models basically works by learning features and this learning success is affected by hyper parameters which include the learning rate, batch size and the number of layers14. In recent years, applying CNN-based algorithms in DL to image analysis has achieved effective results, making these algorithms widely used in finding caries15. After a major breakthrough in this technology, hybrid deep learning approach is introduced to enhance the image recognition capabilities by combining efficient neural networks to the basic DL networks16. A hybrid model that uses test-time augmentation and describes deep transfer learning was developed to identify prone-to-cavitation teeth from non-standardized photographs, with acceptable clinical results17. To enhance performance, optimizers are also included to hybrid models. DL models, as opposed to traditional gradient-based methods, employ optimizers to increase the rate of convergence, avoid local minima, and more effectively modify hyper parameters. They use strategies seen in nature to cover the whole search area and advance the model’s functionality. The three critical steps in the effective development of dental caries diagnosis and treatment models include classification, identification (detection), and segmentation. Classification is used to assess the nature and extent of caries, detection is used to confirm their existence in radiographic images, segmentation is used to outline areas of interest in a targeted clinical intervention18. This work is concerned with the classification and identification of dental caries presence or absence. Table 1 represents the summary of recent works done in the field of dentistry for then dental infection classification.
Table 2 represents the summary of recent works done in the field of dentistry for the dental infection detection. YOLOv7 is applied to diagnose and classify these conditions, providing outstanding and precise recognition for many dental threats. It is intended to accurately distinguish and assign six main dental problems: healthy teeth, dental caries, impacted teeth, infections, fractured teeth and broken-down crowns or roots (BDC/BDR)19. A unique way to segment and classify tooth types from 3D models made from panoramic dental X-rays. Before further analysis, a random number-based filter on images is used to improve the image quality and prepare the data. The segmentation of teeth in X-rays is carried out using a custom Mask-CNN. An AEG model is employed which can accurately recognize different tooth types in both simple and complicated dental structures20. A new method that uses both optimization and deep learning is introduced to divide the caries-involved areas with a Bat Whale Optimization (BWO) model. Preprocessing is done with a Wiener filter in the suggested framework to decrease image noise and improve quality.
The Gray Level Co-occurrence Matrix (GLCM) is then used to identify key texture patterns. Lastly, a CNN is used for precise classification and the combined BWO-CNN structure results in improved overall diagnostics12. A LeNet (DGLeNet) classifier is used to identify caries lesions in digital bitewing radiographs. In preprocessing step, techniques such as Gaussian filtering, grayscale transformation and thresholding, are used to better shape and distinguish the main features in an image. Kernel-based nonlinear Fisher Analysis is applied to find features that help distinguish one class from another. DGLeNet is then applied to identify each refined feature category21. The approach introduces hybrid dental caries segmentation to increase accuracy and precision in detecting caries in periodic dental X-ray images. The enhancement of the image begins with a CLAHE and ends with applying a bilateral filter. To segment images, a new method using FOC-KKC and HSLnSSO is utilized to find the best results. A HSLnSSO-enhanced version of ResNeXt-RNN is used for detection22. To improve accuracy in finding and pointing carious areas, a hybrid neural network architecture integrating both ANNs and DNNs is implemented. At the preprocessing step, a selective median filter is used to lower the noise amount while not harming significant elements of the image13. Based on the recent developments, this work proposes feature extraction using a hybrid model and optimizer-guided classification to improve caries classification accuracy.
Methodology
This work represents a unified pathway of automating the process of classification of caries via X-ray images based on the traditional approaches of image processing and more modern methods of DL and optimization, as shown in the Fig. 1.
Proposed work flow for dental caries classification.
First, the clustering-based approach is used to balance input dental radiographs to overcome the problem of class imbalance and then preprocessing is performed using Sobel-Feldman operator so as to improve edge features. The images so enhanced are then fed to a hybrid model of deep learning (MobileNetV2 and ShuffleNet) working in parallel so as to have extractors of rich and diverse feature representations. The next stage involves optimizing/filtering the most discriminative features and this is done through the ACO using its bio-inspired search procedure. The features that underwent the optimization process are then labeled caries or normal and the performance of the same model is assessed according to standard measures to determine its robustness and the ability to produce a reliable diagnosis.
Dataset information
The initial dataset consists of 13000 images, which includes 3069 images with caries. A clustering-based selection23 method was applied to select 3069 non-caries images out of the total remaining 9,931 images to ensure a balanced dataset. K-means algorithm clustering is a way of dividing the minority class into groups based on the distribution regulation within it and the similar samples, forming groups for each of them, and creating new in these samples24. First, all the non-caries images are fed through a pre-trained ResNet-18 to get 512-dimensional feature vectors at the ’pool5’ layer. Then, each feature vector goes through K-means which creates up to 100 clusters for similar images. From each group, an amount of images is chosen randomly so that the whole selection has a wide range of differences. This prevents random sampling bias and makes sure that variance is found within the same category. Images with caries are kept in full, whereas only a limited number of non-caries images remain after the selection. Consequently, there are equal or almost equal numbers of caries and non-caries images which is perfect for using deep learning to solve binary classification problems.
Data preprocessing
Before analyzing an image, preprocessing turns it into a form that makes it easier for the model to understand25. Common operations in image processing are resizing, converting to grayscale, reducing background noise, adjusting maximum pixel values to be the same throughout the images and finding the edges to help show important features and make sure all images are related26.
Sobel edge detection algorithm.
In this work Sobel-Feldman operation is used. Digital images have intensity functions defined only at specific points, so the direct calculation of derivatives depends on assuming a smooth continuous and differentiable intensity function hidden behind the sample points. Under these assumptions and extra rules, the derivative of the continuous function can be estimated using the discrete data from the image. In most cases, these methods use the average of intensity in a wide region, but in practice, calculations over small local areas are more common. Some examples of such methods include the Sobel-Feldman operator which gives a coarse estimate of the image gradient27. It determines the gradient only by considering the intensities of the pixel and eight surrounding pixels and using simple numbers to assign importance to each intensity. Its low precision does not matter much because it is computationally light and still sufficient for a wide range of real-world image processing applications28. Sobel edge detection highlights the edges of teeth, caries regions, and structures in X-rays, helping to aid in segmentation and classification. Fig. 2 represents the algorithmic steps applied on the image using Sobel edge operator.
MobileNetV2
The patterns found in X-ray dental images tend to be complex (e.g. structures of teeth, bone density, decay, periodontal diseases). These features are effective to retrieve through the depthwise separable convolutions in MobileNetV2. Its depthwise component deeply filters channels in an element wise manner and may recognize gross patterns such as edges and textures in the X-ray. Such filtered features are then multiplied together point by point to construct more intricate representations that are meaningful to dental pathologies using the subsequent pointwise convolution. Such efficacy is important in handling possibly thousands of X-ray images within a short time.
In contrast to the traditional residual block that links the layers having many channels, the inverted residual block linked by MobileNetV2 connects the thin bottleneck layers29. These low dimensional bottleneck layers contain shortcut connections between them. The design itself is memory-efficient and enhanced the gradient propagation between several layers. The network accepts low dimensional, compressed representation30. First, this input is extended in dimension. The features that are produced are then filtered with a lightweight depthwise convolution. Most importantly, these characteristics are then mapped back to a low-dimensional space by a linear convolution (i.e., no non-linear activation function). These narrow bottleneck layers are important in having non-linearities (such as ReLU) removed. It has been demonstrated experimentally that non-linearity in bottlenecks can cause a poor performance and it is reasoned that non-linearity is capable of destroying information in the low dimensional spaces. The design process is useful in retaining the representational power. MobileNetV2 makes very heavy use of depthwise separable convolutions, originally provided in MobileNetV1. Such factorization decomposes an ordinary convolution into two layers:
-
Depthwise convolution: a single convolutional filter can be applied to each input channel, and does lightweight filtering.
-
Pointwise convolution (1x1 convolution): New features are constructed as the linear combinations of the input channels.
This factorization greatly makes an inference on reducing the computation cost of a regular convolution operation. The intermediate layer in the inverted residual block increases the input to a higher level. This expansion layer uses depthwise convolutions that are lightweight and helps to filter features and offers non-linearity31.
ShuffleNet
ShuffleNet architecture draws its inspiration from group convolution, skip connections, and depthwise separable convolutions. ShuffleNet suggests a group of convolutions, pointwise convolutions with residual path-based shortcuts. It proposes two important operations to perform large computation reduction with high accuracy: pointwise group convolution and channel-shuffle32.
Representation of how channel shuffling takes place in the shuffleNet architecture(Gconv- Group convolution)32.
Group convolution involves subdividing the channels of the input into groups, and doing convolutions between each of the groups independently. This enables ShuffleNet to consume a broader amount of feature map channels within the same computational cost, which is essential to enable small networks to encode more representation information. One disadvantage of successive group convolutions is that it can be used to block information flow between channel groups. All the input channels would be used in providing a small subset of output channels belonging to a particular group. As a revocation to this, ShuffleNet introduces the channel shuffle operation. The channels between the groups are re-arranged such that there is a possibility of flow of information through all the channel groups33. The Fig. 3 represents how the channel shuffling takes place.
Feature extraction using hybrid neural network
This work introduces a hybrid network using MobileNetV2 and ShuffleNet as shown in Fig. 4. Starting with this point, the X-ray image is transmitted to two different feature extraction backbones at the same time: ShuffleNet and MobileNetV2.
Architecture of the proposed hybrid model.
The ShuffleNet stream (upper path) employs grouped convolutions, channel shuffling, batch normalization and ReLU activation in order to keep the computational complexity low and maintain the representational power. Main factors like depth concatenation layers and addition layers have been implemented to make the feature diffusion and gradient flow better. To guarantee the cross-group information exchange, necessary in the grouped convolutions, the channel shuffling layer is introduced.
MobileNetV2 pathway (lower branch) uses depthwise separable convolutions and linear bottlenecks. MobileNetV2 employs inverted residual, that is, run through pointwise convolutional layer to expand a narrow input to a higher-dimensional feature representation, through depthwise convolutional layer connecting to a compression pointwise projection. This block architecture is effective in edge devices and it facilitates computation. Addition layers and the batch normalization helps stable learning and improved feature refinement. The ShuffleNet and MobileNetV2 branches outputs are concatenated through depth concatenation layer to combine different features of the two branches of the model. The combination of lightweight and high efficiency characteristics of MobileNetV2 and those of small and perfectly mixed feature maps of ShuffleNet enhance the representational capability of this convergence. The fused feature map is then fed into fully connected layers along with linear layer and classification layer, after which the result gives the final predictions.
Ant colony optimizer for hyperparameter tuning
Inspired by the behavior of an ant colony to seek the shortest path of food, ACO algorithm has become one of the most well-liked swarm-based algorithms34. By raising the size of the problem and shape of the search space, the complexity of the search space is raised. Multi-objective functions are the most realistic in terms of optimization problems required to be attained. It is referred to as a multi-objective optimization problems (MOPs)35 and this kind of problem has been encountered in various fields, e.g. bioinformatics, medical image analysis, and so on. These optimizers help to fine-tune the hyperparameters to give the best outcomes. In this work, the diagnostic process for caries conditions is influenced by the optimization of hyperparameters of MobileNetV2 and ShuffleNet. ACO deploys artificial ants to traverse along a solution space with the guidance of pheromone traces and heuristic data. The Algorithm 1 works in the framework of a probabilistic decision-making process in which the probability \(P_{ij}\) of hopping between node \(i\) and node \(j\) is subject to the quantity of pheromone \(\tau _{ij}\) and heuristic desirability \(\eta _{ij}\)36.
In this case, the relative weight of pheromone against heuristic values is controlled by \(\alpha\) and \(\beta\), and \(N_{i}\) represents the neighborhood of node \(i\). Ants build solutions through iterations and jointly consult the pheromone trails to support good routes. Evaporation of pheromone is utilized so that converging too soon is discouraged, and exploration is promoted.
The pheromone update rule is generally expressed as36:
Ant Colony Optimization (ACO)
where \(\rho \in (0,1)\) is the evaporation rate, \(m\) is the number of ants, and \(\Delta \tau _{ij}^k\) is the pheromone deposited by the \(k{\text {th}}\) ant. The algorithm aims to converge to the optimal solution \(x^*\) such that36:
ACO was applied to the hybrid model, to optimize the hyper parameters, and increase the model convergence. Ten ants and 10 iterations are used in the implementation of ACO to balance computational cost and optimization performance. ACO saves on computation cost by probabilistically directing the search using pheromone trails and heuristic data. In contrast to exhaustive grid search which considers all the possibilities, ACO reduces the search space effectively. ACO is more computationally efficient than random search because compared to random search, better solutions paths are reinforced more quickly using fewer evaluations37. In particular, it iteratively optimized combinations of learning rate, momentum, and batch size, using simulated ants, following pheromone to optimize the combination of learning rate, momentum, and batch size. The optimization goal was that of maximizing the validation accuracy and minimizing over-fitting.
Result analysis
This section analyzes the performance of hybrid model with and without optimizer in classifying dental caries in Panoramic X-ray images. It includes comparison between the pre trained models and hybrid model in terms of evaluation metrics. The data set is divided into 7:2:1, and here one part of the whole dataset is used for testing.
Performance analysis of MobileNetV2
Evaluation metrics like accuracy, precision, recall, F1 score, sensitivity, Area Under the Curve (AUC) are calculated. The Fig. 5 , Fig. 6 represents the confusion matrix and ROC curve of the pretrained MobileNetV2 architecture.
The confusion matrix showed that 275 caries and 213 non-caries were classified correctly and 32 caries and 94 non-caries were incorrectly classified. These findings reveal that the model exhibits high sensitivity, but its false positive classification rate is comparatively large when it comes to the non-caries category. This has been supported by the Receiver Operating Characteristic (ROC) curve which has an AUC of 0.886 indicating the high rate of discriminative capability. This steep takeoff at the beginning of the corner of the curve indicates that the model has a robust sensitivity at low false positive rates.
Confusion matrix for MobileNetV2 model.
ROC curve for MobileNetV2 model.
Performance analysis of ShuffleNet
The model based on ShuffleNet showed good performance in classification, which is shown by the analysis of the confusion matrix in Fig. 7 and the ROC curve in Fig. 8. It absolutely classified 232 caries and 277 non-caries, was false positive in 30 cases (non-caries falsely designated as caries), and false negative in 75 cases (caries falsely designated as non-caries). This indicates that the model has a strong probability of identifying non-caries samples but has an average tendency of identifying all caries samples.
Confusion matrix for ShuffleNet model.
ROC curve for ShuffleNet model.
The Table 3 shown represents comparison between the evaluation metrics of the two pretrained models.
Performance analysis of proposed hybrid model
In order to improve the ability of the features retrieved by MobileNetV2, and ShuffleNet, a fusion network was used. The classification results achieved with the use of the improved feature set are depicted with the help of the confusion matrix shown in Fig. 9. The model demonstrates better balancing between the expected and observed labels of caries and healthy individuals, along with a significant decrease in the number of misclassifications in contrast to the initial model. The model is able to discriminate between the two classes, as also revealed in the ROC curve provided in Fig. 10. The hybrid model had an AUC of 0.909 which has high separability and high consistency of classification.
Confusion matrix for proposed hybrid model.
ROC curve for proposed hybrid model.
The hybrid model demonstrated moderate result in the classification of dental X-ray images with a total accuracy of 83.55 percent as shown in Table 4. Based on the confusion matrix the model was able to correctly predict 260 caries and 253 non-caries samples but failure to predict 54 non-caries and 47 caries cases. The F1-score was observed to be 83.36%, this is achieved through recording precision of 84.33% and recall of 82.41%, which indicates an optimum trade-off between false and negatives as well as false positives. Besides, the sensitivity of the model was very high at 98.24%, which means that the skill at identifying the true positive cases (caries) is also high. The ROC curve, as an AUC of 0.90938 proves the high discriminative capabilities of the model, showing robustness of the results in line with different threshold conditions and establishes the hybrid model as a light but powerful architecture in detecting dental disease. 5-fold cross-validation was also used to assess the viability of the proposed model, a summary of which is available in Table 5. The model reached the accuracy above 86% in all the folds, which proves its stability.
Performance analysis of proposed hybrid model with ACO
In applying the ACO to the hybrid model, there was excellent performance compared to the other models because of its characteristic of searching for the ideal hyperparameters to further assist in the effective and accurate classification. Hybrid model that has been trained on a binary-classified prediction of dental caries on enhanced X-ray images was tuned through hyperparameter adjustment with the help of the ACO technique. The configuration which performed the best consisted of learning rate momentum, mini-batch size as shown in Table 6. Figures 11 and 12 represent the confusion matrix and ROC curve for the proposed hybrid model with ACO .
Confusion matrix for proposed hybrid model with ACO.
The overall performance of the suggested system satisfies the good results obtained on the last evaluation on test collection, as it can be demonstrated in Table 7. The model exhibits significant improvements with an accuracy of 92.67%, a precision of 93.67% with a recall of 91.53% and AUC of 96.79%.
Cervical burnout is a radiographic artifact that would present as a radiolucent band at the cervical area, frequently resembling dental caries and resulting in false positives in the case of automated classification38. Explainable AI approaches, such as Grad-CAM (Gradient-weighted Class Activation Mapping), provide visual heatmaps that show which elements of an image a model depends on the most to produce a forecast. Grad-Cam computes the gradient of the desired result (e.g., the presence of caries) flowing into the final convolutional layer of a neural network and applies it to weight feature map importance39. The process generates localization maps that indicate where the model gazes during decision making in order to confirm that the focus is placed in an anatomically relevant region as opposed to such artifacts as cervical burnout.
ROC curve for proposed hybrid model with ACO.
The Fig.13 demonstrates Grad-CAM heatmaps overlaid on dental radiographs, which gives a visual representation of where the hybrid network concentrates to identify dental caries. The heat maps apply a color gradient with red and orange coloration indicating areas with most model activity meaning that they are more relevant and have more likely carious lesions present. Colors like blue and green indicate areas with least influence on the classification decision of the model. This type of visualization makes the interpretations more understandable, as the spatial areas contributing to the greatest output of the network can be seen, and clinicians can ensure that the AI model focuses on the anatomically and diagnostically relevant areas, thereby fostering trust and transparency in AI-assisted dental diagnosis.
Grad-Cam heatmap images highlighting caries effected regions in panoramic radiographic images.
Comparative study
Table 8 provides an overview of a comparative analysis of our proposed model with respect to the previous works that focused on classification and detection of dental caries with the use of various kinds of radiographs. It can be observed that our proposed model by the integration of a hybrid model and the ACO, has competitive performance in caries detection as compared to other existing strategies.
Conclusion
This paper presents an elaborate method for categorizing dental caries using panoramic radiographic images. First clustering-based method was used to resolve the issue of class imbalance in the dataset, and then edge detection was used with Sobel-Feldman to increase the significance of specific features through preprocessing. Although MobileNetV2 and ShuffleNet as individual deep learning models were used, they did not perform well when used alone. To make the best use of the characteristics of both architectures, a hybrid model was built, uniting the advantages of the architectures. This was done by further optimization using the ACO algorithm within the hybrid framework that produced quite high performance values. The ACO-strengthened hybrid model showed better classification accuracy, due to its strong global search strategy as well as optimal parameter settings, hence confirming its promise in automated dental diagnosis systems.The accuracy scores of the individual models were moderate, where MobileNetV2 and ShuffleNet scored 76% and 82% respectively. Conversely, the proposed hybrid model has shown a better accuracy rate of 83%, and by combining the model with ACO, the level of accuracy was further improved to 92% indicating the efficiency of the hybrid model. The present work is restricted to classifying and detecting caries in X-ray images without assessing severity levels or specific tooth positions. Future works should explore hybrid approaches that analyze deeper dentin and pulp regions for improved diagnostic precision. Expanding to multi-class classification, segmentation, and validation on external datasets using different image modalities can strengthen the model’s reliability and clinical relevance.
Data availability
The data is taken from a publicly available dataset, which can be accessed through https://www.kaggle.com/ datasets/lokisilvres/dental-disease-panoramic-detection-dataset
References
Lawal, F. B. et al. Oral health impact among children: A systematic review update in 2024. J. Evid.-Based Dent. Pract. 25, 102082. https://doi.org/10.1016/j.jebdp.2024.102082 (2025).
Tonetti, M. S., Greenwell, H. & Kornman, K. S. Staging and grading of periodontitis: Framework and proposal of a new classification and case definition. J. Clin. Periodontol. 45, S149–S161. https://doi.org/10.1111/jcpe.12945 (2018).
Forouzeshfar, P. et al. Dental caries diagnosis using neural networks and deep learning: A systematic review. Multimed. Tools Appl. 83, 30423–30466. https://doi.org/10.1007/s11042-023-16599-w (2023).
Pedersen, S. et al. Pano-gan: A deep generative model for panoramic dental radiographs. J. Imaging 11, 41. https://doi.org/10.3390/jimaging11020041 (2025).
Thapliyal, N., Aeri, M., Kukreja, V. & Sharma, R. Navigating landscapes through AI: A comparative study of efficientnet and mobilenetv2 in image classification. In 2024 International Conference on Emerging Technologies in Computer Science for Interdisciplinary Applications (ICETCS). 1–4. https://doi.org/10.1109/ICETCS61022.2024.10543922 (IEEE, 2024).
Marco, Dorigo Mauro, Birattari Thomas, Stutzle. Ant colony optimization. IEEE Computational Intelligence Magazine.1(4), 28–39. https://doi.org/10.1109/MCI.2006.329691 (2006).
Kumar, C. & Singh, B. A comparative study of machine learning regression approach on dental caries detection. Proc. Comput. Sci. 215, 519–528. https://doi.org/10.1016/j.procs.2022.12.054 (2022).
Radha, R. C., Raghavendra, B. S., Subhash, B. V., Rajan, J. & Narasimhadhan, A. V. Machine learning techniques for periodontitis and dental caries detection: A narrative review. Int. J. Med. Inform. 178, 105170. https://doi.org/10.1016/j.ijmedinf.2023.105170 (2023).
Chakraborty, C., Bhattacharya, M., Pal, S. & Lee, S.-S. From machine learning to deep learning: Advances of the recent data-driven paradigm shift in medicine and healthcare. Curr. Res. Biotechnol. 7, 100164. https://doi.org/10.1016/j.crbiot.2023.100164 (2024).
Ahmed, W. M. et al. Artificial intelligence in the detection and classification of dental caries. J. Prosth. Dent. 133, 1326–1332. https://doi.org/10.1016/j.prosdent.2023.07.013 (2025).
Mei, L. et al. Clinical knowledge-guided hybrid classification network for automatic periodontal disease diagnosis in x-ray image. Med. Image Anal. 99, 103376. https://doi.org/10.1016/j.media.2024.103376 (2025).
Chawla, R. et al. A hybrid optimization approach with deep learning technique for the classification of dental caries. Int. J. Ad. Comput. Sci. Appl. https://doi.org/10.14569/IJACSA.2022.0131241 (2022).
Megalan Leo, L. & Kalapalatha Reddy, T. Learning compact and discriminative hybrid neural network for dental caries classification. Microprocess. Microsyst. 82, 103836. https://doi.org/10.1016/j.micpro.2021.103836 (2021).
Mohammad-Rahimi, H., Rokhshad, R., Bencharit, S., Krois, J. & Schwendicke, F. Deep learning: A primer for dentists and dental researchers. J. Dent. 130, 104430. https://doi.org/10.1016/j.jdent.2023.104430 (2023).
Bayrakdar, I. S. et al. Deep-learning approach for caries detection and segmentation on dental bitewing radiographs. Oral Radiol. 38, 468–479. https://doi.org/10.1007/s11282-021-00577-9 (2022).
Xiangdong, T. Image recognition algorithm based on hybrid deep learning. Int. J. Syst. Assur. Eng. Manag. https://doi.org/10.1007/s13198-023-02134-5 (2023).
Tareq, A. et al. Visual diagnostics of dental caries through deep learning of non-standardised photographs using a hybrid yolo ensemble and transfer learning model. Int. J. Environ. Res. Public Health 20, 5351. https://doi.org/10.3390/ijerph20075351 (2023).
Zanini, L. G. K., Rubira-Bullen, I. R. F. & Nunes, F. D. L. D. S. A systematic review on caries detection, classification, and segmentation from x-ray images: Methods, datasets, evaluation, and open opportunities. J. Digit. Imaging 37, 1824–1845. https://doi.org/10.1007/s10278-024-01054-5 (2024).
Rahman, R. B. et al. A comprehensive dental dataset of six classes for deep learning based object detection study. Data Brief 57, 110970. https://doi.org/10.1016/j.dib.2024.110970 (2024).
Hameed, T. & AmalaShanthi, S. A novel teeth segmentation on three-dimensional dental model using adaptive enhanced googlenet classifier. Multimed. Tools Appl. 83, 68547–68568. https://doi.org/10.1007/s11042-024-19154-3 (2024).
Vimalarani, G. & Ramachandraiah, U. Automatic diagnosis and detection of dental caries in bitewing radiographs using pervasive deep gradient based lenet classifier model. Microprocess. Microsyst. 94, 104654. https://doi.org/10.1016/j.micpro.2022.104654 (2022).
Ramana Kumari, A., Nagaraja Rao, S. & Ramana Reddy, P. Design of hybrid dental caries segmentation and caries detection with meta-heuristic-based resnext-rnn. Biomed. Signal Process. Control 78, 103961. https://doi.org/10.1016/j.bspc.2022.103961 (2022).
Shao, Y. et al. Balanced data sampling for language model training with clustering. arXiv https://doi.org/10.48550/arXiv.2402.14526 (2024).
Yong, Y. The research of imbalanced data set of sample sampling method based on k-means cluster and genetic algorithm. Energy Proc. 17, 164–170. https://doi.org/10.1016/j.egypro.2012.02.078 (2012).
Maharana, K., Mondal, S. & Nemade, B. A review: Data pre-processing and data augmentation techniques. Glob. Transit. Proc. 3, 91–99. https://doi.org/10.1016/j.gltp.2022.04.020 (2022).
Tawakuli, A., Havers, B., Gulisano, V., Kaiser, D. & Engel, T. Survey: Time-series data preprocessing: A survey and an empirical analysis. J. Eng. Res. 13, 674–711. https://doi.org/10.1016/j.jer.2024.02.018 (2025).
Mathur, N., Mathur, S. & Mathur, D. A novel approach to improve Sobel edge detector. Proc. Comput. Sci. 93, 431–438. https://doi.org/10.1016/j.procs.2016.07.230 (2016).
Ranjan, R. & Avasthi, V. Edge detection using guided Sobel image filtering. Wirel. Pers. Commun. 132, 651–677. https://doi.org/10.1007/s11277-023-10628-5 (2023).
Howard, A. G. et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv https://doi.org/10.48550/ARXIV.1704.04861 (2017).
Alsakar, Y. M. et al. Multi-label dental disorder diagnosis based on mobilenetv2 and swin transformer using bagging ensemble classifier. Sci. Rep. 14, 25193. https://doi.org/10.1038/s41598-024-73297-9 (2024).
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L.-C. Mobilenetv2: Inverted residuals and linear bottlenecks. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4510–4520. https://doi.org/10.1109/CVPR.2018.00474 (IEEE, 2018).
Zhang, X., Zhou, X., Lin, M. & Sun, J. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv https://doi.org/10.48550/ARXIV.1707.01083 (2017).
Ma, N., Zhang, X., Zheng, H.-T. & Sun, J. Shufflenet v2: Practical guidelines for efficient cnn architecture design. arXiv https://doi.org/10.48550/ARXIV.1807.11164 (2018).
Awadallah, M. A. et al. Multi-objective ant colony optimization: Review. Arch. Comput. Methods Eng. 32, 995–1037. https://doi.org/10.1007/s11831-024-10178-4 (2025).
Handl, J., Kell, D. B. & Knowles, J. Multiobjective optimization in bioinformatics and computational biology. IEEE/ACM Trans. Comput. Biol. Bioinform. 4, 279–292. https://doi.org/10.1109/TCBB.2007.070203 (2007).
Duygu, Yilmaz Eroglu Umut, Akcan. An Adapted Ant Colony Optimization for Feature Selection. Applied Artificial Intelligence. 38(1). https://doi.org/10.1080/08839514.2024.2335098 (2024).
Lohvithee, M., Sun, W., Chretien, S. & Soleimani, M. Ant colony-based hyperparameter optimisation in total variation reconstruction in x-ray computed tomography. Sensors 21, 591. https://doi.org/10.3390/s21020591 (2021).
Berry, H. M. Cervical burnout and Mach band: Two shadows of doubt in radiologic interpretation of carious lesions. J. Am. Dent. Assoc. 106, 622–625. https://doi.org/10.14219/jada.archive.1983.0109 (1983).
Kim, D., Kim, J. & Choi, S. G. Cnn-based remote dental diagnosis model for caries detection with grad-cam. Sci. Rep. 15, 26555. https://doi.org/10.1038/s41598-025-11447-3 (2025).
Askar, H. et al. Detecting white spot lesions on dental photography using deep learning: A pilot study. J. Dent. 107, 103615. https://doi.org/10.1016/j.jdent.2021.103615 (2021).
Lee, S. et al. Deep learning for early dental caries detection in bitewing radiographs. Sci. Rep. https://doi.org/10.1038/s41598-021-96368-7 (2021).
Kühnisch, J., Meyer, O., Hesenius, M., Hickel, R. & Gruhn, V. Caries detection on intraoral images using artificial intelligence. J. Dent. Res. 101, 158–165. https://doi.org/10.1177/00220345211032524 (2022).
Zhang, X. et al. Development and evaluation of deep learning for screening dental caries from oral photographs. Oral Dis. 28, 173–181. https://doi.org/10.1111/odi.13735 (2022).
Liao, D. & Hargreaves, C. A. Classification of dental teeth X-ray images using a deep learning cnn model. Preprints https://doi.org/10.20944/preprints202305.0513.v1 (2023).
Guo, Y. et al. Rapid detection of non-normal teeth on dental X-ray images using improved mask r-cnn with attention mechanism. Int. J. Comput. Assist. Radiol. Surg. 19, 779–790. https://doi.org/10.1007/s11548-023-03047-1 (2024).
Liu, Y., Xia, K., Cen, Y., Ying, S. & Zhao, Z. Artificial intelligence for caries detection: A novel diagnostic tool using deep learning algorithms. Oral Radiol. 40, 375–384. https://doi.org/10.1007/s11282-024-00741-x (2024).
Chaves, E. T. et al. Detection of caries around restorations on bitewings using deep learning. J. Dent. 143, 104886. https://doi.org/10.1016/j.jdent.2024.104886 (2024).
Jones, B. et al. Dental caries detection in children using intraoral scans and deep learning. J. Dent. 160, 105906. https://doi.org/10.1016/j.jdent.2025.105906 (2025).
Funding
Open access funding provided by Manipal Academy of Higher Education, Manipal
Author information
Authors and Affiliations
Contributions
Kaveri. K: Conceptualization, Formal analysis, Investigation, Methodology, Resources, Writing-original draft. Venkata Ratna Prabha. K: Conceptualization, Formal analysis, Investigation, Methodology, Writing-original draft, Writing-review & editing, Supervision. Pradeep Reddy. G: Funding acquisition, Investigation, Methodology, Writing-original draft, Writing-review & editing, Supervision. Lakshmi Ganesh. P: Conceptualization, Formal analysis, Investigation, Methodology, Resources, Writing-original draft. Radha. K: Investigation, Methodology, Writing-original draft, Writing-review & editing. Ramesh. P: Investigation, Methodology, Writing-original draft, Writing-review & editing. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kaveri, K., K., V.R.P., Reddy, G.P. et al. ACO-optimized MobileNetV2-ShuffleNet hybrid model for automated dental caries classification. Sci Rep 15, 40615 (2025). https://doi.org/10.1038/s41598-025-24375-z
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-24375-z
















