Introduction

Human papillomavirus (HPV) is the most frequent viral sexually transmitted infection worldwide. Almost 80% of sexually active people who have not been vaccinated acquire the virus at some point in their lives. However, most HPV infections do not cause disease, and about 90% resolve spontaneously within 12–24 months post-infection, the remaining can persist and progress to cervical precancerous lesions or cancer1,2,3.

Cervical cancer is the fourth most common cancer among women in the world as 662,301 new cases and 348,874 deaths have been reported in 2022. It is well-known that HPV types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, and 68 are responsible for the development of almost all invasive cervical cancer (ICC) cases; among which two types 16 and 18 accounts for 70% of ICC and seven types 31, 33, 45, 52, and 58 detects in 20% of cases4,5,6. The eight most common HPV types in Iranian women with cervical cancer are as follows: HPV 16 (53%), 18 (15%), 31 (4%), 33 (3%), 45 (3%), 58 (3%), 35 (2%), and 52 (2%)7.

The classification of papillomaviruses is based on the DNA sequence of the L1 gene8. HPVs are clustered among five genera including alpha, beta, gamma, mu, and nu according to less than 60% identity. Within a given genus, the species and type are designated for those viruses that share 60–70% and 71–89% identity, respectively9,10. In each type, 1–10% and 0.5–1% identity through the genome are called lineage and sublineage, respectively. HPV 33 is clustered to alpha genus, species of 9 (alpha-9), and has three lineages of A, B, and C; among which A lineage consists of three distinctive sublineages A1, A2, and A3. HPV 35 is divided into alpha genus, species of 9 (alpha-9), and has two sublineages of A1 and A211.

Knowing the geographical spread of lineages and sublineages of each HPV type, is beneficial for the epidemiological and evolutionary studies of this virus. Several studies in Iran identified the (sub)lineages of HPV 16, 18, 31, 39, 45, 51, 52, 56, 58, and 59. However, no studies have analyzed the variants of HPV 33 and 35 in Iranian women. To our knowledge, the sequence variability of the whole E6 gene of these two types was investigated to identify their circulating (sub)lineages in Tehran, Iran.

Material and methods

Study population

To investigate the sequence variations of HPV 33 and 35 E6 gene, a cross-sectional study was designed from 2021 to 2023 in Tehran. In total, 42 HPV 33 and 92 HPV 35-infected samples were analyzed to know their lineage/sublineages in this study. These 134 samples were obtained from the previous screening of one thousand liquid-based cervical samples (BD SurePathTM, Ireland) and four-hundred and sixty-five formalin-fixed paraffin-embedded (FFPE) specimens. The liquid-based cervical samples were collected from several laboratories in Tehran that were infected with high-risk HPV types. FFPE samples were obtained from the archives of three hospitals in Tehran. Our research was conducted ethically by the World Medical Association Declaration of Helsinki. This study was approved by the local ethical committee of the Tehran University of Medical Sciences (IR.TUMS.SPH.REC.1402.336).

The sequence variations of HPV 33 and 35 E6 gene

Isolation of DNA from liquid-based cervical samples was done using High Pure Viral Nucleic Acid Kit (Roche Diagnostics GmbH, Roche Applied Science) according to the manufacturer’s instructions. DNA extraction from FFPE samples was carried out by phenol‐chloroform assay12.

The entire E6 gene of HPV 33 (nt 109–558) and HPV 35 (nt 110–559) was investigated using nested PCR with primer pairs shown in Table 1. The PCR reaction was done in a 50 μl reaction mixture, including a 2X PCR master mix (AMPLIQON, Ampliqon A/s, Denmark), 10 pmol of each primer, and 100–200 ng of DNA template. The PCR thermal cycling programs were included: an initial denaturation for 5 min at 95 °C, 35 cycles of 95 °C for 30 s, 56 °C for 50 s, and 72 °C for 50 s (first round) and 35 cycles of 95 °C for 25 s, 55 °C for 40 s, and 72 °C for 40 s (second round). A reaction mixture lacking template DNA was used as a negative control in every set of PCR runs. All PCR amplicons were sequenced bidirectional by the Sanger sequencing method with BigDye® Terminator v3.1 Cycle Sequencing Kit and a 3130 Genetic Analyzer Automated Sequencer as specified by Applied Biosystems manuals (Foster City, CA). The sequences were deposited in GenBank and are available with accession numbers PQ417275-PQ417408.

Table 1 The sequence of primer pairs that used in this study.

To characterize lineages and sublineages, all obtained sequences were aligned to reference sequences of HPV 33 and HPV 35. For HPV 33 the reference sequences were M12732 (A1), HQ537698 (A2), EU918766 (A3), HQ537705 (B), and KF436865 (C). The following reference sequences were HQ537720 (A1) and HQ537727 (A2) for HPV 3511. For both types, more sequences were obtained from GeneBank and used to draw a phylogenetic tree. The phylogenetic tree was made by Mega software version 11 using the maximum likelihood method and Kimura 2-parameter model. The reliability of the tree was calculated by measuring the bootstrap with 1000 replicates.

Statistical analysis

Statistical calculation was done using the Fisher exact test (Epi Info 7, Statistical Analysis System Software). A P-value < 0.05 was considered statistically significant.

Results

In total 134 uterine cervical samples (42 and 92 positive samples of HPV 33 and 35, respectively) were investigated in this study.

The full E6 sequence (nt 109–558) of 42 HPV 33-positive samples was aligned as opposed to the HPV 33 reference sequence (GeneBank accession number M12732). As shown in Fig. 1, all samples were classified to A lineage. Sublineage analysis showed that most of the studied samples belonged to A2 sublineages (85.8%) and the remaining were classified to A1 sublineage.

Fig. 1
figure 1

Phylogenetic analysis of the HPV 33 E6 gene was carried out by MEGA11 software using the Maximum Likelihood method based on the Kimura 2-parameter model. The numbers above the branches represent the bootstrap values. The reference sequences are indicated by a black circle.

Nucleotide changes were found in 36 samples (85.8%), while no mutations were detected in 14.2% (Table 2). Four mutations at positions of A132C, A213C, A364C, and A480T were detected in 66.7%, 85.7%. 85.7%, and 19.1% of our samples, respectively. Mutations A213C and A364C led to amino acid changes corresponding to positions K35N and N86H of amino acid, respectively. Amino acid changes occurred in 36 samples (85.8%). In the studied groups, amino acid changes were detected in 90.9 and 80% of normal and premalignant/malignant groups, respectively. In this study, three nucleotide substitution patterns were found as follows: no changes (14.2%); A213C/A364C/A480T (19.1%); and A132C/A213C/A364C (66.7%). Two amino acid substitution patterns including no changes and K35N/N86H were detected in 14.2 and 85.8% of samples, respectively. No statistically significant differences were found between sublineages of HPV 33 and histology/cytology status (P = 0.57; OR (95% CI) 2.5 (0.4–15.4)) (Table 3).

Table 2 HPV 33 sublineages identified based on the E6 (109–558) gene in normal, premalignant, and invasive cervical cancer samples of Iranian women.
Table 3 The frequency of HPV 33 lineages stratified by histology/cytology status in cervical samples of Iranian women.

As shown in Table 4, ninety-two complete E6 sequences (nt 110–559) of HPV 35-infected samples were compared to the HPV 35 reference sequence (GeneBank accession number HQ537720). Two sublineages, A1 and A2, with a prevalence of 84.8% and 15.2%, respectively, were found in studied samples (Fig. 2). In the entire E6 gene, seven nucleotide changes were observed at positions of C127G/T, A130C, A131C, C136T, G249T, A326G, and C341T. Three of these mutations, including A130C, G249T, and C341T, resulted in amino acid changes at positions of E7D, C47F, and R78W (Table 3). The frequency of amino acid changes was 9.8% for R78W, 4.4% for E7D, and 3.3% for C47F in our samples. In other words, no amino acid substitutions occurred in most of the studied samples (83.6%).

Table 4 HPV 35 sublineages identified based on the E6 (110–559) gene in normal, premalignant, and invasive cervical cancer samples of Iranian.
Fig. 2
figure 2

Phylogenetic analysis of the HPV 35 E6 gene was carried out by MEGA11 software using the Maximum Likelihood method based on the Kimura 2-parameter model. The numbers above the branches represent the bootstrap values. The reference sequences are indicated by a black circle.

As shown in Table 4, ten nucleotide substitution patterns including no changes; C127G; A130C/A131C; A131C; G249T; C341T; C127T/C136T; C127T/C136T/C341T; C127T/A130C/C136T/C341T; and C127T/C136T/A326G/C341T were observed. Five patterns of amino acid changes occurred as follows: no changes (83.6%); E7D (3.3%); C47F (3.3%); R78W (8.7%); and R78W/E7D (1.1%).

Although the prevalence of sublineage A1 was higher in premalignant/ICC (91.7%) than normal (83.7%) samples, no statistically significant differences were observed in this regard (P = 0.84; OR (95% CI): 0.47 (0.05–3.9)) (Table 5).

Table 5 The frequency of HPV 35 lineages stratified by histology/cytology status in cervical samples of Iranian women.

Discussion

This study revealed that all HPV 33-infected samples belonged to the A lineage, the A2 sublineage was dominant (85.8%), and the remaining were classified to the A1 sublineage (14.2%). The A2 sublineage is reported to be more prevalent in Europe and Asia than in other parts of the world. This sublineage is found among 47.2 and 51.8% of European and Asian women, respectively13. In consistence with our findings, the A2 sublineage (60%) was more prevalent than the A1 sublineage (40%) in Hungarian women14. The data of ICC samples from India and South America showed that although A2 sublineage was more common (85.7%) than A1 (14.3%) in India, A1 sublineage was more frequent (81.8%) than A2 (18.2%) in South America15. The results of one study from the USA showed that A1 sublineage is dominant (69.7%), followed by B (19.7%) and A2 (10.6%) (sub)lineages16. In Japan, the frequency of (sub)lineages was as follows: A1 (70%), A2 (15%), A3 (5%), and B (10%)17. In China, two sublineages A1 and A3 were detected in 92.1% and 7.9% of cervical samples, respectively18. In Africa, B lineage is dominant (63%) followed by A1 (32.6%) and C (4.4%) (sub)lineages13.

Most of our samples detected two amino acid changes at positions K35N and N86H. Both alterations were not found at important positions linked to known E6 biological functions. However, a previous study revealed that amino acid changes at positions of I73L, V83L, K93N, and A138V of E6 had a higher ability to degrade P53 and MAGI-3 than the prototype13.

In the present study, no statistical differences were found between distinct sublineages of HPV 33 and histology/cytology status. Accordance to our data no associations were found in this regard on samples from Canada and Brazil which sublineage A1 was found to be more common in both normal (70.3%) and high-grade intraepithelial lesions (66.7%) than (sub)lineages A2 or B19. A study from China showed that the A1 sublineage was more frequent than the A3 sublineage among both normal and CIN I-III groups20. However, several studies reported that sublineage A1 has increased oncogenic potential than sublineage A213,21. Indeed, it is indicated that HPV 33 LCR variants of sublineage A2 have reduced transcriptional activities compared to sublineage A114. Our finding that not observed statistically significant differences can be due to the low sample size in both groups as inferred from Table 2, sublineage A1 was more common in the CIN I-III/ICC group than the normal group.

It is indicated that HPV 33 (sub)lineages A2, A3, B, and C than A1 displayed a reduced sensitivity to neutralization (2–fourfold) by nonavalent HPV vaccine (Gardasil®9) antibodies. This finding suggests that the natural variation in L1 alters capsid antigenicity and consequently affects neutralization by a range of antibody specificities22,23.

Two sublineages A1 and A2 of HPV 35 were detected with a frequency of 84.8% and 15.2% in the studied samples, respectively. Compatible with our results, studies have shown that sublineage A1 is dominant in America as the A1 and A2 sublineages were detected in 85.8 and 14.2% of samples, respectively21. In Brazil, sublineage A1 is also reported in 83.3% of cervical samples and the remaining belonged to sublineage A2 (16.7%)24. The whole-genome analysis results in Zimbabwe revealed that sublineages A1 and A2 were found in 75% and 25% of samples, respectively25. However, sublineage A2 (69.1%) is more common than A1 (30.9%) in Costa Rica26. A comprehensive study from different parts of the world showed that sublineage A1 was more common than A2 in non-African world districts for premalignant and malignant cervical samples. On the contrary, sublineage A2 was more frequent than A1 in African women. The B lineage was also detected in Bhutan and India27.

While the frequency of sublineage A1 was higher in premalignant/ICC than in normal groups, no statistically significant differences were observed. Previous studies have shown discrepancies in this regard. It is reported that sublineage A1 than A2 had long persistency and increased risk of CIN III in women from Costa Rica28. However, a study from America has shown that the chance of progression toward CIN II-III for sublineages A1 and A2 was not different21.

It is suggested that the genetic differences in host, particularly HLA molecules, can play a significant role in the population-based distribution of HPVs variants29. Previous studied determining lineage/sublineages of several high-risk HPVs in Iran were shown that distinctive lineage/sublineages are common as follow: lineage D of HPV 16, sublineage A4 of HPV 18, lineage A of HPV 31, sublineage A1 of HPV 39, lineage B of HPV 45, lineage B of HPV 5630,31,32,33,34.

The main limitation of this study was low sample size in our studied groups. Also, all samples were collected from Tehran and no other provinces of Iran were included. It is highly recommended that in future studies samples obtain from different parts of Iran.

Conclusion

Our results indicated that sublineages A2 of HPV 33 and A1 of HPV 35 were dominant in Tehran, Iran. However, more studies with larger sample sizes from different histopathological stages of cervix in various geographical regions of Iran are necessary to evaluate the pathogenicity risk of HPV 33 and 35 (sub)lineages in Iranian women with cervical cancer in the future. Also, the investigation of HLA molecules in Iranian women with cervical cancer highly recommended to be considered in the context of HPV 33 and 35 variants in future studies.