Automated detection and classification of cervical and anal squamous cancer precursors using deep learning and multidevice colposcopy

Mascarenhas, Miguel; Martins, Miguel; Barroso, Luís; Spindler, Lucas; Fathallah, Nadia; Manzione, Thiago; Alencoão, Inês; Carinhas, Maria João; Ribeiro, Tiago; Mendes, Francisco; Cardoso, Pedro; Almeida, Maria João; Mota, Joana; Fernandes, Joana; Ferreira, João; Mascarenhas, Teresa; Nadal, Sidney; Zulmira, Rosa; Macedo, Guilherme; de Parades, Vincent

doi:10.1038/s41598-025-14514-x

Article
Open access
Published: 26 September 2025

Miguel Mascarenhas^1,2,3,10,
Miguel Martins^1,2,
Luís Barroso^4,
Lucas Spindler^5,
Nadia Fathallah^5,
Thiago Manzione^6,
Inês Alencoão^7,
Maria João Carinhas^7,
Tiago Ribeiro^1,2,3,
Francisco Mendes^1,2,
Pedro Cardoso^1,2,3,
Maria João Almeida^1,2,
Joana Mota^1,2,
Joana Fernandes^8,
João Ferreira^8,
Teresa Mascarenhas^3,9,
Sidney Nadal^6,
Rosa Zulmira^7,
Guilherme Macedo^1,2,3 &
Vincent de Parades^5

Scientific Reports volume 15, Article number: 33068 (2025)

Abstract

Human papillomavirus (HPV) infection carries neoplastic risk in both the cervix and the anus. High-resolution colposcopy/anoscopy is crucial for assessing these regions but has suboptimal accuracy. This study aimed to develop a convolutional neural network (CNN) to identify and differentiate low-grade (LSIL) and high-grade (HSIL) squamous intraepithelial lesions in the cervix and anus. A retrospective multicenter study was conducted to develop a CNN using 320 colposcopy and anoscopy examinations from 3 device types. The dataset included 88,073 frames, categorized as LSIL or HSIL based on pathological analysis. The data were split into training/validation (90%, n = 79,265, including a threefold cross-validation) and test sets (10%, n = 8808). Diagnostic metrics including sensitivity, specificity, accuracy, positive and negative predictive values (PPV and NPV, respectively), and the areas under the receiver operating characteristic and precision-recall curves (AUC-ROC and AUC-PR) were calculated. During the training/validation phase, the model achieved an average sensitivity for HSIL of 98.1% (95% CI 97.6–98.5%), specificity of 97.4% (95% CI 96.0–98.8%), PPV of 97.2% (95% CI 95.8–98.7%), NPV of 98.2% (95% CI 97.7–98.6%), and accuracy of 97.7% (95% CI 97.2–98.6%). The mean AUC-ROC and AUC-PR were both 0.98 ± 0.01. In the testing phase, performance metrics for HSIL were: sensitivity 99.0%, specificity 97.8%, PPV 97.6%, NPV 99.0%, and accuracy 98.3%. HPV infection impacts both the cervical and anal regions. This study developed a pioneering CNN to differentiate HSIL and LSIL in HPV-related dysplastic lesions during cervical and anal examinations. The model achieved promising results, suggesting its potential to improve detection accuracy and cost-effectiveness in clinical practice.

Introduction

Human papillomavirus (HPV) infection is a highly prevalent sexually transmitted infection, affecting over 80% of sexually active individuals at some point in their lives¹. It is frequently transient and mostly asymptomatic. However, in certain cases, chronic persistent infection can develop, increasing the risk of neoplastic transformation across diverse anatomical locations². One possible mechanism contributing to carcinogenesis is integration of viral DNA into the host cell genome, which triggers uncontrolled proliferation and impairs DNA repair mechanisms². A key manifestation of active HPV infection is the development of low-grade squamous intraepithelial lesions (LSIL), which resolve spontaneously in the majority of cases. In some cases, however, LSIL can progress to high-grade squamous intraepithelial lesions (HSIL). HSIL is associated with a significantly higher risk of progression to invasive squamous carcinoma and is therefore considered a precancerous lesion³.

Understanding this carcinogenic process is essential, since promptly treating HSIL represents a pivotal opportunity to reduce the burden of HPV-associated squamous cancers³. Cervical cancer is the model for HPV-driven oncogenesis and is the most important HPV-related neoplasia². A number of established initiatives already target both primary prevention (through immunization) and secondary prevention (through treatment of precursor dysplastic lesions)⁴. HPV infection is, however, ubiquitous, and its impact extends beyond the cervical region: its carcinogenic effect can manifest in various pelvic areas, including the vagina or vulva in women, the penis in men, and the perianal area and anus^5,6.

Given the need to inspect fine anatomical details, there has been growing interest in using high-resolution colposcopes to assess not only the female genital tract but also the anal region. Performing colposcopy for both the cervical and anal regions enables detailed, high-resolution assessment of these areas and precise targeting of biopsies and treatment procedures through direct visualization^7,8. The prevailing recommendation is to opt for colposcopic assessment following an abnormal cytological exam and/or detection of a high-risk HPV type in the cervical/anal area^8,9. Although this procedure provides the highest diagnostic and therapeutic yield, it is accompanied by a significant learning curve. Limited expertise can lead to a shortage of physicians who are technically proficient at raising suspicion and providing accurate diagnoses, especially in early stages¹⁰. In the particular case of high-resolution anoscopy (using colposcopes or anoscopes), the insufficient number of trained proctologists may result in gynecologists performing both cervical and anal assessment, given their greater familiarity with HPV-related dysplastic lesions.

In contexts with suboptimal diagnostic accuracy and high interobserver variability, artificial intelligence (AI) models could enhance the cost-effectiveness of procedures¹¹. The abundance of colposcopy images further supports the use of AI tools for image analysis, particularly convolutional neural networks (CNNs), whose design for pattern analysis is inspired by the human visual cortex. Researchers are currently leveraging this technology in the perineal region to improve the accuracy of diagnosing HPV-induced lesions using colposcopy/anoscopy^12,13,14. The models published so far focus only on detecting and differentiating lesions in one specific region, either the cervix or the anal canal^13,14,15,16,17,18,19,20,21. However, achieving high performance metrics in one area may not necessarily translate to similar effectiveness in the other, and AI-enhanced ubiquitous diagnostic tools (trained on data from both regions) are currently lacking.

The aim of this study is to develop and validate a CNN for automatic differentiation of cervical and anal squamous cancer precursors during high-resolution colposcopy/anoscopy.

Methods

Study design and categorization of the lesions

We included high-resolution colposcopies performed at Centro Materno Infantil do Norte [CMIN] (Porto, Portugal) [n = 70] using a Zeiss FC 150 colposcope, and high-resolution anoscopies performed at Groupe Hospitalier Paris Saint-Joseph [GHPSJ] (Paris, France) [n = 177], Instituto de Infecciologia Emílio Ribas [IFER] (São Paulo, Brazil) [n = 54] and Wake Forest University [WFU] (North Carolina, USA) [n = 13] using a THD® Proctostation HRA Module videoproctoscope (THD SpA, Correggio, Italy), a Kolplast colposcope (Kolplast CIA, São Paulo, Brazil) and a Zeiss FC 150 colposcope (Carl Zeiss Meditec AG, Jena, Germany), respectively. The included procedures were conducted and recorded between 2020 and 2023. The collected videos were then segmented into still frames using VLC Media Player.

The dataset consisted of a total of 88,073 frames of HPV-induced dysplastic lesions, with 45,726 labelled as LSIL and 42,347 labelled as HSIL. This binary classification was determined based on the corresponding histopathology reports from biopsied or treated lesions during colposcopy or anoscopy procedures. Cytological samples were not used to establish the ground truth.

The number of biopsies/treated lesions varied for each procedure. In cases involving multiple biopsies, the biopsy sites were documented, and the histological findings were matched with the corresponding video frames. Any cases with uncertainty about the correlation between the biopsy site and the image were excluded from the analysis to ensure rigor and prevent misclassification.

We divided the total data into two parts, a training/validation set and a testing set, with 79,265 (90%) and 8808 (10%) frames, respectively. We used the testing set to assess the global performance of the model. The dataset methodology is summarized in Fig. 1.

Due to the retrospective nature of the data collection, this study follows a non-interventional approach. Additionally, no modifications to therapeutic practices were made as a result of the study. Approval was obtained before the study began, with permissions granted by the ethics committees of Groupe Hospitalier Paris Saint-Joseph, Instituto de Infecciologia Emílio Ribas, and Hospital Universitário Santo António (IRB 00012157, SPTC 81/2023, IRB 2023.157(131-DEFI/123-CE), respectively). The study was carried out in accordance with the principles of the Declaration of Helsinki.

Colposcopy and anoscopy protocol

In each center, colposcopy and anoscopy procedures were performed by expert medical doctors, according to current best practices. Each procedure can be divided into at most four stages: examination without stain, examination after application of 3% acetic acid, optional examination with Lugol’s iodine, and finally therapeutic manipulation (e.g. laser ablation, plasma coagulation or surgical ablation). The dataset included frames from these four categories, with each procedure potentially encompassing any combination of them.

Development of DL model and performance analysis

A ResNet10 model pre-trained on ImageNet-1K (a large collection of labelled images used to train object-recognition models) was used to build this CNN²². The early layers of the model were kept in order to reuse the features they had already learned, but the final fully connected layers were removed and replaced with new fully connected layers that adapt the model to LSIL vs HSIL classification. The new head consists of two main blocks, each comprising a fully connected layer followed by a dropout layer to mitigate the risk of overfitting. Following these blocks, a dense layer was added, sized according to the number of categories (2). Hyperparameters such as the learning rate (0.0001), batch size (32) and number of epochs (5) were tuned by trial and error to achieve the best performance. Libraries such as FFmpeg, Pandas and Pillow were used for data preparation. We implemented the model in PyTorch 2.2.2 on a system equipped with dual 2.1 GHz Intel Xeon Gold 6130 processors (Intel, Santa Clara, CA, USA) and dual NVIDIA Quadro RTX A6000 graphics cards (NVIDIA Corporation, Santa Clara, CA, USA). A probability of being LSIL or HSIL was calculated for each frame, and the CNN’s final classification for each frame was the category with the highest probability. The model’s classification was compared with the current gold standard, the corresponding histopathological classification (Fig. 2).
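The per-frame decision rule described above (one probability per category, with the most probable category taken as the prediction) can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the two network outputs are converted to probabilities with a softmax, which the text does not state explicitly, and the class order in `CLASSES`, the function names and the example logits are all hypothetical.

```python
import math

CLASSES = ("LSIL", "HSIL")  # assumed output order; not stated in the paper


def softmax(logits):
    """Convert raw network outputs into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_frame(logits):
    """Return (label, probability) for the most probable category."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return CLASSES[best], probs[best]


# Made-up logits for a single frame, for illustration only.
label, prob = classify_frame([0.3, 2.1])
print(label, round(prob, 3))
```

With only two categories, picking the highest probability is equivalent to thresholding the HSIL probability at 0.5; a different operating point would trade sensitivity against specificity.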

Statistics and reproducibility

The model was assessed during the training/validation phase (to assess robustness) and during the test phase (to assess overall performance). During the training/validation phase, 90% of the data were divided into three folds of equal size using a StratifiedKFold split. Three iterations were executed; in each, the model was trained on two folds and validated on the remaining one, with a different fold held out for validation each time. During the test phase, the remaining 10% of the data were used to independently assess the performance of the CNN. Computational performance was also evaluated by measuring the algorithm’s processing time for all frames in the test set.
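The threefold scheme can be illustrated with a small standard-library sketch. It is a simplified stand-in for scikit-learn's StratifiedKFold rather than the study's pipeline: the helper names, the seed and the toy labels are hypothetical, and the round-robin dealing is just one simple way to keep the LSIL/HSIL ratio similar across folds.

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_splits=3, seed=0):
    """Assign each sample index to one of n_splits folds, class by class."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)
    folds = [[] for _ in range(n_splits)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the indices of each class round-robin across the folds.
        for position, index in enumerate(indices):
            folds[position % n_splits].append(index)
    return folds


def cross_validation_splits(labels, n_splits=3, seed=0):
    """Yield (train, validation) index lists; each fold validates exactly once."""
    folds = stratified_folds(labels, n_splits, seed)
    for held_out in range(n_splits):
        train = [i for k, fold in enumerate(folds) if k != held_out for i in fold]
        yield sorted(train), sorted(folds[held_out])


# Toy labels only (not study data): 9 LSIL frames and 6 HSIL frames.
toy_labels = ["LSIL"] * 9 + ["HSIL"] * 6
for train, val in cross_validation_splits(toy_labels):
    hsil_in_val = sum(toy_labels[i] == "HSIL" for i in val)
    print(len(train), len(val), hsil_in_val)  # prints "10 5 2" for each fold
```

Note that this sketch, like the split described above, operates on individual frames; it does not group frames by procedure.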

We performed statistical analysis using scikit-learn v0.22.2 (https://scikit-learn.org/0.22/)²³. We also generated heatmaps to assess which image features contributed most to the CNN’s predictions. Examples for a cervical and an anal frame are shown in Fig. 3.
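For reference, the diagnostic metrics reported in this study follow directly from a 2 × 2 confusion matrix with HSIL as the positive class. The sketch below uses the standard definitions; the function name is hypothetical and the counts are purely illustrative, not taken from the study's confusion matrices.

```python
def binary_metrics(tp, fn, tn, fp):
    """Diagnostic metrics from confusion-matrix counts (HSIL = positive class)."""
    return {
        "sensitivity": tp / (tp + fn),  # HSIL frames correctly called HSIL
        "specificity": tn / (tn + fp),  # LSIL frames correctly called LSIL
        "ppv": tp / (tp + fp),          # predicted HSIL that are truly HSIL
        "npv": tn / (tn + fn),          # predicted LSIL that are truly LSIL
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Illustrative counts only; not the study's data.
metrics = binary_metrics(tp=90, fn=10, tn=180, fp=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, PPV and NPV depend on the proportion of HSIL frames in the evaluated set, so they are not directly comparable between datasets with a different class balance.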

Results

We included a total of 88,073 high-resolution colposcopy and anoscopy still frames from 3 different devices.

From the total dataset, 79,265 frames were used to train the model (GHPSJ = 31,086, IFER = 22,738, CMIN = 20,393, WFU = 5048), whereas the remaining 8808 frames were used to independently test the model (GHPSJ = 3393, IFER = 2587, CMIN = 2300, WFU = 528 frames).

The total dataset comprised 45,726 LSIL-labeled frames (41,153 in the training/validation set, 4573 in the testing set) and 42,347 HSIL-labeled frames (38,112 in the training/validation set, 4235 in the testing set).

In terms of procedures, the training/validation set included frames from 165 exams, while the testing set included frames from 155.

1. Training/Validation set

Table 1 displays the number of frames, patients, devices, regions and lesion (LSIL and HSIL) numbers for each fold, during cross-validation (training/validation phase).

Table 1 Number of frames, patients and device types per group, divided into a training/validation group (90% of patients, including a threefold cross-validation) and a test group (the remaining 10%).

Regarding performance metrics for HSIL differentiation (Table 2), the average sensitivity was 98.1% (95% CI 97.6–98.5%) and the average specificity was 97.4% (95% CI 96.0–98.8%). The average PPV was 97.2% (95% CI 95.8–98.7%) and the average NPV was 98.2% (95% CI 97.7–98.6%). The average overall accuracy was 97.7% (95% CI 97.2–98.6%). The mean AUC-ROC and AUC-PR were both 0.98 ± 0.01. Table 3 details the performance metrics for each fold. Figure 4 shows the discriminatory capacity of the model during threefold cross-validation, as given by the ROC and precision-recall curves.
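The text does not state how the 95% intervals around the fold averages were derived. One common option, shown here only as an assumption, is a Student's t interval over the three per-fold values. The per-fold sensitivities below are hypothetical placeholders, not the values in Table 3, and the function name is illustrative.

```python
import math
import statistics

# Two-sided 95% critical values of Student's t, keyed by degrees of freedom.
T_CRIT_95 = {2: 4.303, 3: 3.182, 4: 2.776}


def mean_with_ci(values):
    """Return (mean, lower, upper) of a 95% t interval for a few fold scores."""
    n = len(values)
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(n)  # standard error of the mean
    half_width = T_CRIT_95[n - 1] * sem
    return mean, mean - half_width, mean + half_width


# Hypothetical per-fold sensitivities (assumption; not the study's data).
fold_sensitivity = [0.979, 0.981, 0.983]
mean, low, high = mean_with_ci(fold_sensitivity)
print(f"{mean:.3f} (95% CI {low:.3f} to {high:.3f})")
```

With only three folds the t critical value is large (4.303), so intervals computed this way are wide relative to the spread between folds; other approaches, such as a binomial interval on pooled frame counts, would give different bounds.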

Table 2 Confusion matrices for each cross-validation run (training-validation phase) and in test phase.

Table 3 Data was divided in training/validation and test groups. During training/validation phase, three iterations were conducted, each with unique frame distribution.

2. Testing set

In the testing phase, performance metrics for HSIL differentiation were as follows: sensitivity of 99.0%, specificity of 97.8%, and PPV and NPV of 97.6% and 99.0%, respectively. The overall accuracy was 98.3%.

Discussion

This study introduces, to our knowledge, the first deep learning model worldwide that can detect and differentiate HPV-related dysplastic lesions in two distinct areas: the cervix and the anal canal. The model’s predictions were highly accurate and hold potential for practical use in live clinical scenarios. This cross-zone interoperable model represents a novel advancement in computer-aided detection (CADe) and diagnosis (CADx) systems by enabling effective analysis across anatomically distinct regions. This approach offers an original solution to improve the accuracy and efficiency of endoscopic and magnified evaluation of these regions, providing a more versatile diagnostic tool for practitioners.

One of the primary strengths of this AI model is its development using frames of histologically confirmed lesions from two different anatomical zones. This ensures the model is trained on both regions, unlike existing models, which focus on detecting either cervical or anal lesions. Another key strength of the model is its multicentricity and interoperability, having been trained on data provided by four centers and three distinct devices used for endoscopic evaluation of the cervical and anal regions. This approach generated a more heterogeneous dataset, incorporating different populations from Europe and America, which probably brings it closer to a real-life scenario and narrows the gap between development and clinical practice.

Adhering to FAIR criteria is essential in the current development of AI software to enhance clinical practice²⁴. Compatibility of a CNN with multiple devices is essential to facilitate validation and to extend clinical practice and research across multiple clinical settings. The interoperability of this model is therefore a significant advantage, elevating it to a higher level of technological readiness. Moreover, our group has been developing AI models for HPV-related dysplastic lesions, initially focusing on the anal canal, then the cervix, and now a single ubiquitous model capable of calculating predictions for both regions. This approach adheres to the principle of reusability and may facilitate the development of more robust and efficient AI models. The principles of findability and accessibility were also respected through the reproducible and consistent collection of data.

Comparing the models published so far is challenging, as performance metrics alone may not provide an accurate assessment (Table 4). The methodologies used in each study vary significantly, making direct comparisons difficult. For the cervix, several CNNs have been published for differentiating HSIL. Miyagi et al. reported 80% sensitivity and 88% specificity (fivefold cross-validation), using a dataset of non-stained LSIL and HSIL frames (two categories)¹⁷. This study involved a low number of patients (330) and used only one frame per colposcopy. Yuan et al. achieved 85% sensitivity and 85% specificity (train-test-validation 80-10-10%; these metrics relate to distinguishing HSIL from other categories), using a dataset of stained normal, LSIL and HSIL frames (three categories)¹⁹. This study included a large number of patients (11,198) and used frames stained with acetic acid and with Lugol’s iodine. Xue et al. reported 66% sensitivity and 90% specificity (train-test-validation 70-10-20%; these metrics relate to distinguishing HSIL from other categories), using a dataset of non-stained normal, LSIL, HSIL and cancer frames (four categories)¹⁸. This study involved a larger number of patients (19,435) but relied on frame annotation. Chen et al. achieved 88% sensitivity and 94% specificity (train-test-validation 60-20-20%), using both stained and non-stained LSIL and HSIL frames, with multiple frames per exam, from a total of 6002 patients¹⁵. Fang et al. reported 82% sensitivity for detecting HSIL in a dataset of non-stained normal, LSIL, HSIL and cervical cancer frames from 1189 patients¹⁶. Lastly, Mascarenhas et al. reported 99.7% sensitivity and 98.6% specificity (train-test 90-10%), using a dataset of LSIL and HSIL (two categories) non-stained, stained and post-manipulation frames¹⁴. That dataset comprised a higher number of frames containing dysplastic lesions (22,693), from 70 patients.

Regarding the anal canal, to our knowledge only our group has published evidence on the development of AI models for this anatomical region. Our studies have shown significant progress: from a pilot study reporting 91.4% sensitivity and 89.7% specificity (train-test 90-10%), using a dataset of 5026 LSIL and HSIL frames (two categories)²⁰; to a subsequent study achieving 96.5% sensitivity and 94.3% specificity (fivefold cross-validation), using a dataset of 27,770 frames and maintaining high performance metrics across categories²¹; and finally, our latest study, which used frames from high-resolution colposcopes and anoscopes¹³. For detection of HSIL, that model reported 93.6% sensitivity and 95.7% specificity (train-test 80-20%), from a total of 57,882 frames across 151 exams.

Table 4 Summary of published deep learning models developed for differentiation of HPV-related dysplastic lesions in the cervical and anal regions.

From the perspective of data science methodology and analysis, some key points of the study should be mentioned. We strictly included only frames of lesions that were later confirmed through histological analysis (ground truth). These included frames from the entire endoscopic examination, encompassing non-stained, stained and post-manipulation frames, which can be particularly useful for physicians, as the model’s diagnostic performance may not be compromised by the presence of stains, blood or burnt tissue. Moreover, we included a diverse dataset with images from various angles, implemented a proper train-test split, and avoided data annotation. This model represents a pioneering approach, as it was trained simultaneously on still frames showing HPV-related dysplastic lesions (LSIL or HSIL) from two different anatomical regions: the cervix and the anus. It demonstrated high performance metrics in the test set, achieving 99.0% sensitivity and 97.8% specificity. We also performed a cross-validation during the training/validation phase. The average metrics were similarly high (mean sensitivity 98.1% and mean specificity 97.4%), indicating robustness across different frame distributions.

This study has limitations that should be acknowledged and that may contribute to a risk of overfitting: its retrospective nature, the lack of a procedure-level split between the training/validation and testing sets, and the potential demographic bias associated with the absence of patient-level data (due to GDPR restrictions). Consequently, the findings cannot be broadly generalized or directly applied to the clinical setting, and further prospective, multicentric studies are needed to determine whether these AI models can significantly improve the diagnosis and treatment of HPV-related dysplastic lesions. Since our dataset preparation did not involve manual data annotation, and due to the inherent black-box nature of these models, we excluded more complex cases with simultaneous presence of LSIL and HSIL in the same frame, which is also a limitation that can compromise the external validity of the study’s results. For similar reasons, we excluded frames where both lesions and instruments (e.g. forceps used for traction to better expose a lesion) were present simultaneously. We acknowledge that ensuring the model can function without interference from such instruments is important for its real-world applicability. Additionally, from a gynecological/proctological perspective, the ideal CADe/CADx system would be capable of detecting and differentiating LSIL, HSIL, and non-dysplastic lesions. Due to limitations in our current dataset, implementing a three-class model at this stage was not feasible.

In conclusion, the development of efficient, interoperable AI models, built with minimal selection bias and with diverse datasets, is essential for implementing this technology in real clinical scenarios. Using a single ubiquitous model in both anatomical regions can be more versatile and efficient for clinical practice than employing two separate models. Future research will prioritize the validation of CADe/CADx models in prospective, multicentric and real-time clinical contexts, including tandem comparison of AI-enhanced performance with the diagnostic performance of conventional high-resolution anoscopy and/or colposcopy. Building on this need, this multicentric study represents a necessary intermediate step and introduces an innovative model for detection and differentiation of HPV-related dysplastic lesions in the two main anatomical areas assessed with colposcopes and anoscopes. This development may improve clinical outcomes and the cost-effectiveness of these procedures, potentially making them accessible to a larger portion of the population.

Due to the retrospective nature of the study, the ethics committees of Groupe Hospitalier Paris Saint-Joseph, Instituto de Infecciologia Emílio Ribas and Hospital Universitário Santo António (IRB 00012157, SPTC 81/2023, IRB 2023.157(131-DEFI/123-CE), respectively) waived the need to obtain informed consent.

Data availability

Raw data were generated at the Faculty of Medicine of the University of Porto, PT. Derived data supporting the findings of this study are available from the corresponding author upon request.

References

1. Scott-Wittenborn, N. & Fakhry, C. Epidemiology of HPV related malignancies. Semin. Radiat. Oncol. 31(4), 286–296 (2021).
2. Burd, E. M. Human papillomavirus and cervical cancer. Clin. Microbiol. Rev. 16(1), 1–17 (2003).
3. Palefsky, J. M. et al. Treatment of anal high-grade squamous intraepithelial lesions to prevent anal cancer. N. Engl. J. Med. 386(24), 2273–2282 (2022).
4. Davies-Oliveira, J. C. et al. Eliminating cervical cancer: Progress and challenges for high-income countries. Clin. Oncol. (R Coll. Radiol.) 33(9), 550–559 (2021).
5. Ayala, M. & Fatehi, M. Vulvar intraepithelial neoplasia. In: StatPearls, StatPearls Publishing (2024).
6. Maugin, F. et al. Early detection of anal high-grade squamous intraepithelial lesion: Do we have an impact on progression to invasive anal carcinoma?. J. Low Genit. Tract. Dis. 24(1), 82–86 (2020).
7. Khan, M. J. et al. ASCCP colposcopy standards: Role of colposcopy, benefits, potential harms, and terminology for colposcopic practice. J. Low Genit. Tract. Dis. 21(4), 223–229 (2017).
8. Stier, E. A. et al. International Anal Neoplasia Society’s consensus guidelines for anal cancer screening. Int. J. Cancer 154(10), 1694–1702 (2024).
9. Espinosa, K. ASCCP management guidelines for abnormal cervical cancer screening. Am. Fam. Physician 109(3), 275–276 (2024).
10. Bai, A. et al. Assessing colposcopic accuracy for high-grade squamous intraepithelial lesion detection: A retrospective, cohort study. BMC Womens Health 22(1), 9 (2022).
11. Ayturan, K. et al. SPHERE: Benchmarking YOLO vs. CNN on a novel dataset for high-accuracy solar panel defect detection in renewable energy systems. Appl. Sci. 15(9), 4880 (2025).
12. Brandão, M. et al. Revolutionizing women’s health: A comprehensive review of artificial intelligence advancements in gynecology. J. Clin. Med. 13(4), 1061 (2024).
13. Saraiva, M. M. et al. Deep learning and high-resolution anoscopy: Development of an interoperable algorithm for the detection and differentiation of anal squamous cell carcinoma precursors-a multicentric study. Cancers (Basel) 16(10), 1909 (2024).
14. Mascarenhas, M. et al. Artificial intelligence and colposcopy: Automatic identification of cervical squamous cell carcinoma precursors. J. Clin. Med. 13(10), 3003 (2024).
15. Chen, X. et al. Application of EfficientNet-B0 and GRU-based deep learning on classifying the colposcopy diagnosis of precancerous cervical lesions. Cancer Med. 12(7), 8690–8699 (2023).
16. Fang, S. et al. An improved image classification method for cervical precancerous lesions based on ShuffleNet. Comput. Intell. Neurosci. 2022, 9675628 (2022).
17. Miyagi, Y., Takehara, K. & Miyake, T. Application of deep learning to the classification of uterine cervical squamous epithelial lesion from colposcopy images. Mol. Clin. Oncol. 11(6), 583–589 (2019).
18. Xue, P. et al. Development and validation of an artificial intelligence system for grading colposcopic impressions and guiding biopsies. BMC Med. 18(1), 406 (2020).
19. Yuan, C. et al. The application of deep learning based diagnostic system to cervical squamous intraepithelial lesions recognition in colposcopy images. Sci. Rep. 10(1), 11639 (2020).
20. Saraiva, M. M. et al. Artificial intelligence and high-resolution anoscopy: Automatic identification of anal squamous cell carcinoma precursors using a convolutional neural network. Tech. Coloproctol. 26(11), 893–900 (2022).
21. Saraiva, M. M. et al. Deep learning in high-resolution anoscopy: Assessing the impact of staining and therapeutic manipulation on automated detection of anal cancer precursors. Clin. Transl. Gastroenterol. (2024).
22. He, K. et al. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016).
23. Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

Author information

Miguel Mascarenhas and Miguel Martins contributed equally.

Authors and Affiliations

Department of Gastroenterology, São João University Hospital, Porto, Portugal
Miguel Mascarenhas, Miguel Martins, Tiago Ribeiro, Francisco Mendes, Pedro Cardoso, Maria João Almeida, Joana Mota & Guilherme Macedo
WGO Gastroenterology and Hepatology Training Center, Porto, Portugal
Miguel Mascarenhas, Miguel Martins, Tiago Ribeiro, Francisco Mendes, Pedro Cardoso, Maria João Almeida, Joana Mota & Guilherme Macedo
Faculty of Medicine of the University of Porto, Porto, Portugal
Miguel Mascarenhas, Tiago Ribeiro, Pedro Cardoso, Teresa Mascarenhas & Guilherme Macedo
Wake Forest University Health Sciences, Winston-Salem, NC, USA
Luís Barroso
Department of Proctology, GH Paris Saint-Joseph, Paris, France
Lucas Spindler, Nadia Fathallah & Vincent de Parades
Department of Surgery, Instituto de Infectologia Emílio Ribas, São Paulo, Brazil
Thiago Manzione & Sidney Nadal
Department of Gynecology, Centro Materno-Infantil do Norte Dr. Albino Aroso (CMIN), Santo António University Hospital, Porto, Portugal
Inês Alencoão, Maria João Carinhas & Rosa Zulmira
Department of Mechanical Engineering, Faculty of Engineering of the University of Porto, Porto, Portugal
Joana Fernandes & João Ferreira
Department of Gynecology, São João University Hospital, Porto, Portugal
Teresa Mascarenhas
Gastroenterology, Centro Hospitalar Universitário de São João, Rua Oliveira Martins 104, 4200-427, Porto, Portugal
Miguel Mascarenhas

Contributions

M.M.S and M.M: equal contribution in study design, image extraction, drafting of the manuscript, and critical revision of the manuscript. L.B, L.S, N.F, T.M, I.A, M.J.C: study design, data acquisition, critical revision of the manuscript. T.R, F.M, P.C, M.J.A, J.M: bibliographic review, image extraction, critical revision of the manuscript. J.F and J.F.R: construction and development of the DL model, statistical analysis, critical revision of the manuscript. T.M, S.N, R.Z, G.M, V.P: study design, critical revision of the manuscript. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to Miguel Mascarenhas.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Given the retrospective nature of the data collection, this study was non-interventional and did not result in any changes to therapeutic conduct. Approval was obtained before the study began from the ethics committees of Groupe Hospitalier Paris Saint-Joseph, Instituto de Infectologia Emílio Ribas and Hospital Universitário Santo António (IRB 00012157, SPTC 81/2023, IRB 2023.157(131-DEFI/123-CE), respectively), and the study was conducted in accordance with the principles of the Declaration of Helsinki.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Mascarenhas, M., Martins, M., Barroso, L. et al. Automated detection and classification of cervical and anal squamous cancer precursors using deep learning and multidevice colposcopy. Sci Rep 15, 33068 (2025). https://doi.org/10.1038/s41598-025-14514-x

Received: 24 July 2024
Accepted: 31 July 2025
Published: 26 September 2025
DOI: https://doi.org/10.1038/s41598-025-14514-x