Introduction

Dengue virus is classified within the Flaviviridae family1. The genome of the dengue virus comprises 10.7 kb single-stranded, positive sense RNA, which encodes a single polyprotein, that further cleaves into three structural; Capsid (C), Membrane (M), and Envelop (E), and seven non-structural proteins; NS1, NS2A, NS2B, NS3, NS4A, NS4B, and2. Structural proteins are responsible for the formation of viral particles and the non-structural proteins participate in the replication of viral RNA, assembly of viruses, and modulation of host cell responses3,4.

Based on the differences in antigenic determinants, dengue virus is classified into four serotypes, DENV-1, DENV-2, DENV-3, and DENV-4. These serotypes are further divided into genotypes based on envelop gene sequences, DENV1 (1I-V), six in DENV2 (2I-VI), and four in DENV3 (3I, 3II, 3III, 3 V) and DENV4 (4I-IV)5,6,7. Although the genotyping is primarily based on envelop gene, the C-prM region has also been used in several studies for the classification of genotypes8,9,10. A fifth serotype was found in the Sarawak state of Malaysia in October 2013. DENV serotypes 1–4 follow the human cycle, while DENV-5 follows the sylvatic cycle11. The symptoms caused by these serotypes might be similar but phylogenetically they are distinct12. The genotypes within each serotype are distinctly distributed across different geographical locations. Although all serotypes are competent to cause severe infection, some serotypes are more likely to lead to severe cases than others. Several studies have found that the Asian genotype of DENV-2 can cause more severe infection in comparison to the American genotype as the latter efficiently replicates in human cells as well as can efficiently cause infection in new mosquitoes13,14.

In Pakistan, dengue fever is endemic, and it has become a major public health issue. Pakistan faces sporadic dengue outbreaks each year during the monsoon season. The prevalence of dengue infection has greatly increased in Pakistan over the years, burdening the already compromised economic situation of the country15. Pakistan reported its first dengue case in the year 1994 and then the annual outbreaks started in Karachi in November16. Pakistan features several factors that provide optimal conditions for the growth of Aedes mosquito which include rapid urbanization, lack of public health resources, climate, rainfall, insufficient water storage and waste management facilities17,18,19.

In Pakistan, DENV-2 is the most prevalent serotype, followed by DENV-1 and DENV-320. Several factors are responsible for expanding the reach of dengue vectors, such as globalization, international trade, travel, urbanization, changes in climate, and the lack of efficient vector control programs21. Increasing cases of dengue infections are strongly associated with climate change22. According to23, high temperatures increase the risk of dengue infections.

Due to the presence of various alterations in the dengue virus genome, which have been identified as contributing to its enhanced pathogenicity and transmission, preventing and treating dengue infection is becoming challenging. Therefore, the main objective of this study is to evaluate the genetic variations in the genome of all four dengue virus serotypes. The study also aims at finding the most frequent dengue virus serotypes and genotypes present in the population of Karachi.

Methodology

Ethical approval

The samples used in the study were taken after the approval of the ethical committees from the respective laboratories. This study was approved by the Institutional Ethical Committee (IEC) of the International Center for Chemical and Biological Sciences (ICCBS), University of Karachi (Study # ICCBS/IEC/102-HB/2024/Protocol/1.0). Written informed consent was taken from the patients.

Sample and data collection

One hundred and fifty-two (152) dengue NS1 positive serum samples were collected from different hospitals in Karachi between April to August 2022, along with clinical and demographic data with the consent of the infected patients.

RNA extraction and cDNA synthesis

RNA extraction from serum samples was performed using the QIAamp Viral RNA Mini Kit (Qiagen, Germany), according to the manufacturer’s protocol. RNA quantification was carried out using a nanodrop spectrophotometer, followed by first-strand cDNA synthesis with random hexamer primer, using RevertAid First Strand cDNA Synthesis Kit (ThermoFisher Scientific, USA). The concentration of cDNA was quantified using a Qubit dsDNA High Sensitivity kit. The synthesized cDNA was then stored at -80°C for subsequent experimentation24,25.

Polymerase chain reaction (PCR), agarose gel electrophoresis and PCR product purification

The synthesized cDNA was amplified using two consensus primers, D1-forward primer (5’-TCAATATGCTGAAACGCGCGAGAAACCG-3’) and D2-reverse primer (5’-TTGCACCAACAGTCAATGTCTTCAGGTTC-3’), designed by26.

The PCR reaction was run using Absolute Master Mix (MOLEQULE-ON, New Zealand), which contains buffer, dNTPs mix, Taq polymerase, and loading dyes. Initial denaturation was carried out at 94°C for 3 min, followed by 35 cycles of denaturation at 94°C for 30 s, annealing at 55°C for 1 min, extension at 72°C for 2 min, and final extension at 72°C for 5 min. The amplified products were run on 2% agarose gel.

The amplified PCR products were purified using Agencourt Ampure XP beads (Beckman Coulter, USA). Each product was combined with 1.8 × volume of magnetic beads, followed by incubation at room temperature for 3–4 min. The tubes were then placed in the magnetic rack for 2 min. Once the beads were attached to the wall of the tube, the supernatant was discarded. The beads were washed with 90 μL of 80% ethanol in the first round. The ethanol was discarded, and the beads were washed with 100 μL of 80% ethanol. The tubes were dried, and the DNA was eluted using an elution buffer27.

Sanger Sequencing

The purified PCR products were quantified using a Qubit fluorometer (ThermoFisher Scientific, USA). The Sanger sequencing of amplified DNA was carried out using Big Dye Terminator v3.1 Cycle Sequencing Kit (ThermoFisher Scientific, UK) on a conventional thermal cycler. The products were sequenced using D1 (forward primer). A total of 5 μL reaction was prepared, consisting of 0.3 μL BDT, 1 μL sequencing buffer, 1 μL forward primer, cDNA (up to 20ng), and water up to 5 μL. The PCR conditions were set as: Initial denaturation at 95°C for 1 min, followed by 35 cycles of denaturation at 95°C for 12 s, annealing temperature at 50°C for 10 s, and extension at 60°C for 4 min and a final extension at 60°C for 4 min. The sequencing PCR products were purified by using the ethanol precipitation method. The ethanol precipitation method was carried out by mixing the sequencing PCR product with a 5 μL volume of 125 mM ethylenediaminetetraacetic acid (EDTA) and 60 μL absolute ethanol. The tubes were centrifuged at 2200 RCF for 15 min. The supernatant was discarded and 100 μL of freshly prepared 80% ethanol was added and the tubes were centrifuged for 5 min at 2200 RCF. The supernatant was discarded, and the tubes were air dried overnight. The dried pellet was mixed with 10 μL formamide and was mixed well, followed by heat shock at 95°C for 5 min, and finally loaded onto SeqStudio Genetic Analyzer (ThermoFisher Scientific, USA)28.

Sequencing data analysis

The sequenced data was filtered using BioEdit. Low-quality and ambiguous sequences were filtered out to ensure accuracy. The resultant cleaned sequences were then analyzed through BLAST (Basic Local Alignment Search Tool) for serotyping by comparing each sequence with the reference sequences available in the NCBI database. For genotyping, the sequences were subjected to Flavivirus Typing Tool.

Phylogenetic analysis

Reference sequences from different countries of all the genotypes belonging to each serotype were obtained from the NCBI Basic Local Alignment Search Tool (BLAST) to align with the sequenced data. A total of twenty DENV-1 sequences, twenty-nine DENV-2 sequences, and twenty-five DENV-3 sequences were retrieved for the phylogenetic tree construction. The selection of sequences was based on the following criteria: Sequences from dengue endemic and epidemic countries were selected to provide comprehensive understanding of global genotype distribution, for cases with identical mutational patterns from the same year or different years, only one representative sequence per country was selected, and prevalent genotype of each serotype in a given country was prioritized. Phylogenetic trees were constructed using the Maximum Likelihood (ML) method for each serotype of dengue virus based on genotypes, using the General Time Reversible (GTR) model, incorporating both Gamma distribution and a portion of invariant sites (G + I), and the reliability of the corresponding branches was evaluated with 1000 bootstraps replicates. Mega11 software was used for the construction of a phylogenetic tree29, and the tree was visualized by the Interactive Tree of Life (iTOL) website. Mutations in the protein sequences were also evaluated by Mega11 software. The sequences from the current study were compared with the reference genomes. These reference genomes were selected from Pakistan, isolated in different years and the available reference genome at the database ViPR (Virus Pathogen Resource).The sequenced data was then submitted to NCBI GenBank.

Statistical analysis

Data entry, analysis, and processing were performed using IBM SPSS Statistics 21 and Excel. Descriptive analysis including the mean and median age of the studied patients was calculated using Excel.

Results

Gender-wise distribution of samples

This study involved the collection of 152 dengue-positive serum samples, out of which 57.9% (88/152) patients were male, and 42.1% (64/152) patients were female (Fig. 1). The age of the infected patients ranged between 7 months to 75 years (mean age = 23.91 ± 13.05, median age = 23). The age group with the highest rate of dengue prevalence was found to be 21 to 30 years. The mean age found in the female participants was 24.40 ± 13.21 and the median age in females was 24, while in male participants, the mean age was 23.56 ± 12.99 and the median age was 22.

Fig. 1
figure 1

Image of the PCR products subjected to 2% agarose gel electrophoresis, run at 70 V for 70 min, stained with ethidium bromide, and visualized under UV transilluminator. Each resultant PCR product is 511 bp long. Lanes are numbered from left to right; the First lane contains Marker; 100 bp plus ladder used. Lanes 2–19 show PCR products that are 511 bp in size and Lane 20 shows negative control.

PCR amplification using universal primers

The cDNA was amplified using universal primers, D1 (forward) and D2 (reverse). Out of the 152 samples, 100 samples were found to be PCR positive for the C-prM region of the dengue virus as shown in figure-1. The product size was found to be 511 bp.

Distribution of DENV serotypes by sanger sequencing

Samples with high DNA concentration were subjected to Sanger sequencing, revealing a pre-dominance of DENV-1 (78.33%), followed by DENV-2 (15%), and DENV-3.

(6.66%)The visual representation is provided in Fig. 2.

Fig. 2
figure 2

Serotype distribution based on sequencing results among the study population. The graphical chart represents the number of samples identified for different dengue serotypes.

Phylogenetic analysis

DENV-1 analysis

DENV-1 sequences depicted in Fig. 3 show clear branching patterns, with each genotype forming distinct clades. The phylogenetic analysis of the DENV-1 isolates, collected during the 2022 dengue outbreak, revealed that all the isolates clustered within Genotype III. None of the isolates clustered with the reference genomes of Genotypes I (red), IV (orange), and V (purple). The sequences of Genotypes I, IV and V formed distinct clades, indicating clear divergence from the isolates of the present study and highlights genetic diversity of DENV-1 across different geographic locations. The strain OP898557 isolated from Rawalpindi in the year 2022 showed relatedness with the strains (OQ947238, PP800878, OQ947240, OQ947258) from the present study collected from Karachi in the year 2022 (Supp-Table-S1, S2). The 2022 isolates from the current study formed a distinct clade, suggesting either sustained transmission within the region or recent common ancestry and indicating low genetic diversity among circulating strains.

Fig. 3
figure 3

Phylogenetic tree of DENV-1 genotypes, based on partial C-prM region, constructed using the Maximum Likelihood (ML) method with 1000 bootstrap replicates. All the sequences of DENV-1 (47 isolates) from the current study belonged to Genotype III as indicated by green color.

DENV-2 analysis

The comprehensive phylogenetic analysis conducted for DENV-2 isolates, collected during the 2022 dengue outbreak, revealed that all the local strains (green colored) clustered within the Cosmopolitan genotype (blue colored). This monophyletic clade, alongside the other Pakistani strains, suggests continued circulation and evolution of this genotype within the region. The isolates from the current study clustered tightly, showing minimal divergence, indicating strong genetic relatedness and potential local transmission. Furthermore, the earlier isolates from Pakistan within the same cluster support the endemic persistence of the cosmopolitan genotype in the region over the past decade. Alternatively, the reference sequences belonging to Asian I (red), Asian II (orange), American (black), and Asian/American (purple) formed distinct clades, indicating a lack of external introductions (Fig. 4).

Fig. 4
figure 4

Phylogenetic tree of DENV-2 sequences based on C-prM region, constructed using the Maximum Likelihood method, using 1000 bootstrap replicates. The tree shows clear separation of DENV-2 genotypes and highlights persistence of the Cosmopolitan genotype in Pakistan.

DENV-3 analysis

For the construction of DENV-3 phylogenetic tree based on C-prM region, sequences from different geographical regions were included, represented by different color codes as shown in Fig. 5. The analysis revealed that all the 2022 DENV-3 isolates (green colored) clustered with the genotype III reference sequences, but formed their own subgroup, indicating some genetic differences that could be due to recent mutations or local adaptation. The sequences from this study formed a single clade and showed close relatedness with the strains from Myanmar (MW788888, MW788911) (Supp-Table-S1, S2). In contrast, the other genotypes formed distinct clades, representing diverse strains, including those from China, India, Thailand, Myanmar, and South America, suggesting the global spread of DENV-3.

Fig. 5
figure 5

The tree was generated using the Maximum Likelihood (ML) method with 1000 bootstrap replicates, based on the partial C-prM region. The genotypes from the current study are represented in green color.

Mutational analysis

The sequenced samples from the present study showed no insertions or deletions. The amino acid changes observed in the C-prM region of all serotypes are shown in (Supp-Table-3 (A, B, and C)). and the mutations identified and the number of sequenced samples positive for the mutations are depicted in Table-1. In DENV-1, a total of 29 substitutions were observed while 8 substitutions were found in DENV-2 and 7 in DENV-3.

In the genome of DENV-1, multiple mutations were observed, but two unique mutations were found in all 47 sequenced samples: substitution of aspartic acid with glutamic acid at position 176 (D176E) and substitution of valine with isoleucine at position 178 (V178I). The genome of DENV-2 remained largely conserved and displaying only a few mutations, as shown in Table 1. The genome of DENV-3 had five unique mutations found exclusively in the sequenced samples from the current study: the substitution of alanine with valine at position 30, lysine with glutamine at position 31, phenylalanine with valine at position 72, and isoleucine with tyrosine at position 145.

Table 1 Mutations identified in the number of sequenced samples of each serotype.

Discussion

Dengue virus has caused numerous outbreaks in Pakistan over the years, infecting thousands of people and contributing significantly to morbidity and mortality. Globally, the incidence of dengue infection has escalated rapidly, placing nearly half of the world’s population at risk30.

The aim of the study was to identify the predominant serotype and genotype of dengue virus in Karachi, circulating in the population during the 2022 outbreak.

Among the studied participants, a higher proportion of dengue infection was observed in male patients (58%) compared to female patients (42%). This disparity may be attributed to outdoor exposure, leading to a greater risk of mosquito bites by the Aedes vector. However, the finding of this study is in contrast to a cross-sectional study conducted in Karachi in the 2021 by31, which reported a significantly higher prevalence of dengue infection in female patients (n = 172, 91%) than in male patients (n = 17, 9%).

Among various age groups, the age group 21 to 30 exhibited the highest incidence of dengue infection. This trend aligns with previous studies32,33,34,35,36, suggesting that this age group may be more susceptible due to environmental and behavioral factors. However, other factors like region, sample size, or study design may also influence this pattern. Further population-based studies are required to confirm this trend on a broader scale.

Serotyping revealed that DENV-1 was the most prevalent serotype in 2022 (78.33%), followed by DENV-2 (15%) and DENV-3 (6.66%). These findings differ from the prior findings in Karachi, where DENV-2 was reported as the dominant serotype15,37,38,39. The recent dominance of DENV-1 may reflect a shift in herd immunity dynamics, where prior widespread immunity to DENV-2 has curtailed its transmission, making the population more susceptible to infection by other dengue serotypes.

The phylogenetic analysis revealed genotype III of DENV-1, cosmopolitan genotype of DENV-2, and genotype III of DENV-3 as the major circulating genotypes in the 2022 outbreak. These observations align with the previous studies, including38, suggesting sustained circulation patterns in the region.

Among the evaluated dengue serotypes, DENV-3 displayed the highest genetic diversity, indicating continuous viral genetic change. This study identified multiple mutations, particularly in DENV-3 and DENV-1, while DENV-2 exhibited relatively fewer changes in the C-prM region. Such mutations may lead to the emergence of new genotypes, possibly modifying transmission efficiency, disease severity, as well as vaccine effectiveness40,41.

The shift from DENV-2 to DENV-1 as the dominant serotype highlights the dynamic nature of the dengue virus and underscores the necessity for continuous genomic surveillance. The identification of these mutations may indicate enhanced pathogenicity and transmission, highlighting the need for robust public health strategies.

Karachi, a densely populated sub-tropical coastal city, presents a conductive environment for dengue transmission due to rapid urbanization combined with several other factors such as hot and humid climate, inadequate sanitation system, and persistent water stagnation, all of which provide ideal breeding grounds for the vector Aedes mosquito. These conditions are escalating public health burden of dengue in Pakistan42.

In conclusion, understanding the genetic variations of circulating DENV is crucial for tracking viral evolution, enhancing preparedness for future outbreaks. The reinforcement of molecular surveillance and vector control programs are key measures to control dengue-related mortalities and morbidities.

Conclusion

This study provides important findings on the distribution of dengue virus serotypes and genotypes during the 2022 outbreak in Karachi, Pakistan. Young adults, particularly those aged between 21 to 30, were the most affected, likely due to increased outdoor exposure and subsequent contact with the Aedes vector. Notably, a higher infection rate was observed in male patients, reflecting gender-based differences in exposure patterns.

Our findings revealed DENV-1 as the dominant serotype (78.33%), highlighting a major shift in serotype prevalence from the historically dominant DENV-2. DENV-2 (15%) and DENV-3 (6.66%) were also detected, indicating the co-circulation of multiple serotypes. Furthermore, the phylogenetic analysis confirmed the presence of Genotype III (DENV-1), Cosmopolitan genotype (DENV-2), and Genotype III (DENV-3), suggesting persistent circulation of established genotypes in Pakistan.

The Sanger sequencing of the C-prM region revealed high conservation across most isolates, however, several mutations were observed, especially in the genome of DENV-3 genome, which could have potential implications on dengue virus pathogenicity and may contribute to emergence of any other genotype in future outbreaks.

These findings underscore the need for continuous genomic surveillance to monitor serotype shift, detect emerging genotypes, and identify mutations with potential clinical relevance. Further studies utilizing whole genome sequencing (WGS) are recommended to gain deeper insights into dengue virus evolution and inform targeted control strategies.