Introduction

Malaria remains a major global health concern, with an estimated 249 million cases and over 669,000 deaths reported in 2022, 94 percent of which occurred in Africa1. The invasion of Anopheles stephensi, a malaria vector found in South Asia and the Middle East, into Africa over the last decade threatens to substantially increase the malaria disease burden in the region. The detection of An. stephensi in Djibouti in 2012 was associated with an exponential increase in malaria cases2,3,4. Similarly, an increase in malaria cases has been observed in Ethiopia since the detection of the invasive malaria vector in 20161,5. A recent An. stephensi-mediated malaria outbreak was reported in Dire Dawa, an urban hub in eastern Ethiopia, during the dry season generating evidence linking urban malaria increases directly to invasive An. stephensi6. Anopheles stephensi is capable of transmitting both Plasmodium falciparum and P. vivax and thrives in urban, peri-urban, and rural environments due to its ability to use artificial water storage containers at the larval stage, characteristics that are contrary to the native African malaria vectors7,8.

Additionally, An. stephensi in the Horn of Africa (HoA) has been reported to be resistant to all classes of insecticides used against adult malaria vectors in the region9,10. This rapid range expansion of An. stephensi in Africa and its associated malaria cases and insecticide resistance led the WHO to launch an initiative in 2022 and updated vector alert in 2023 calling for heightened surveillance and actions to stop the spread of this invasive malaria vector11.

In Kenya, An. stephensi was first confirmed in the north in Marsabit and Turkana counties in 202212. This initial detection was concerning and prompted an urgent need for country-wide surveillance given the vector’s ability to thrive in human-altered landscapes and its potential to increase malaria transmission in areas previously considered low-risk13 as observed in Djibouti3,4 and Ethiopia6. Thus, the invasive An. stephensi could pose a significant public health threat, particularly in urban and peri-urban areas of Kenya, where the probability of occurrence is higher13.

Understanding the distribution, introduction history, and potential sources of An. stephensi in Kenya is critical for assessing its threat to malaria control and adopting effective measures to halt its spread. This study employed an interdisciplinary approach, combining entomological surveys, molecular genetics, and spatial modeling to comprehensively understand the introduction and spread of An. stephensi in Kenya. Entomological surveys determine the geographical distribution, larval habitats, and adult resting sites of An. stephensi, informing vector control strategies as recommended by WHO for urban and peri-urban settings. Genetic analysis establishes diversity and population structure, determining demographic history, invasion routes, and connectedness to other invasive populations in the Horn of Africa. Species distribution modeling provides insights into suitable habitats and predictive range expansion for targeted surveillance. This integrated approach will enhance our understanding of An. stephensi distribution and spread in Kenya, crucial for targeted vector control interventions and predicting future spread. Ultimately, these efforts will contribute to more cost-effective malaria control in the region, aligning with the WHO initiative to stop the spread of this invasive malaria vector in Africa.

Methods

Surveillance site selection

Following the detection of An. stephensi in Kenya in December 202212, the Kenya National Malaria Control Program (NMCP) and partners conducted additional sampling targeted at areas bordering locations where An. stephensi was detected, urban or peri-urban areas neighboring confirmed An. stephensi presence, sites with high habitat suitability based on environmental conditions and population density, points of entry from neighboring countries in the North of Kenya, towns along major transportation routes with considerable movement of people, goods, and animals, and counties reporting unexplained increases in malaria cases, especially outside usual seasonal patterns and areas reporting increased cases of both P. falciparum and P. vivax malaria. Within each of the selected counties, sampling teams prioritized areas with high animal ownership, and the presence of water reservoirs. In rural areas: oases, major water reservoirs such as small dams, streams, and rivers along known cattle grazing routes and sites of urban development and construction.

Sampling for Anopheles stephensi

Sampling for An. stephensi targeted both larval and adult stages between January 2023 and June 2024 across Kenya, with a focus on both indoor and outdoor collections under the coordination of the NMCP. In each of the selected counties, three sub-counties and two towns/villages per sub-county were selected, except in Mandera, where mosquito surveys were carried out in one sub-county, because of insecurity. The team conducted adult and larval surveys for 8 days per county, simultaneously. In areas where An. stephensi was already established, either monthly or quarterly sampling was conducted if resources allowed, though most data reported here was from one time sampling efforts.

Larval Sampling focused on potential larval habitats including man-made water containers, freshwater pools and pans, stream margins, discarded tires and plastic containers, irrigation ditches, water storage containers (metal and plastic tanks, concrete cisterns, barrels, clay pots), construction sites and areas near animal shelters. Collected larvae were preserved in 70% ethanol for species identification and teams were instructed not to carry any live material outside of the areas of collection to prevent accidental introductions to new areas.

Quarterly cross-sectional surveys led by the NMCP involved sampling of adult mosquitoes using indoor CDC light traps, outdoor UV light traps, and mechanical aspiration indoor and outdoor using Prokopacks.

Evaluation of adult trapping methods in Marsabit County

A host of adult collection methods were evaluated in Marsabit county where the vector was thought to have been established. Methods that were evaluated indoor and outdoor included: UV light traps, human landing catches (HLC), CDC light traps (CDCLT), and mechanical aspiration using Prokopacks. Host decoy traps (HDT), double bed net traps (DBT), BG—Sentinel and BG Pro traps (Biogents AG, Regensburg, Germany) with BG lure (synthetic human odor attractant) were only evaluated outdoors. The methods were evaluated for the density of An. stephensi trapped.

Sample processing and identification

Collected samples were morphologically identified using established keys, focusing on specific features such as palp speckling, wing vein patterns, and thoracic characteristics14. All Anopheles larvae were included in the molecular analysis.

Molecular analyses

Molecular identification was conducted through an initial Colorimetric Loop-Mediated Isothermal Amplification Assay (CLASS) for preliminary detection as previously described15. Briefly, Single mosquito legs from whole mosquito samples and DNA extracted from larvae samples were analyzed for the presence of target DNA indicated by a visible color change in the reaction mixture, typically from pink to yellow. This rapid detection method was followed by a confirmatory species-specific PCR to validate the results. To confirm the species and conduct genetic analyses, a portion of the mitochondrial cytochrome oxidase subunit I (COI) locus from morphologically identified An. stephensi, DNA was PCR amplified and sequenced using previously published protocol5. Briefly, to amplify the partial COI locus, we used LCO1490F (5'-GGTCAACAAATCATAAAGATATTGG-3ʹ) and HCO2198R (5ʹ-TAAACTTCAGGGTGACCAAAAAATCA-3ʹ) primers16 with the following thermal cycling conditions: 94 °C for 5 min, followed by 35 cycles of 94 °C for 30 s, 56 °C for 45 s, 72 °C for 1 min, and a final extension of 72 °C for 10 min.

Amplicons were visualized on 2% agarose gel to confirm the correct locus was amplified, then cleaned using ExoSap (Cytiva, Marlborough, MA) and sequenced using Sanger technology with BigDye chemistry (EdgeBio, San Jose, CA) and run on an ABI 3730 Genetic Analyzer (Thermo Fisher, Santa Clara, CA). Sequences were cleaned using CodonCode version 11.0.2 (CodonCode Corporation, Centerville, MA, USA) and submitted as queries to the National Center for Biotechnology Information’s (NCBI) Basic Local Alignment Search Tool (BLAST) (Altschul et al., 1990) against the nucleotide database in GenBank under default parameters for highly similar hits (98–100%) to confirm the species.

Genetic diversity and population structure

To determine the genetic diversity, structure and evolutionary relationships of the Kenyan An. stephensi COI sequences generated in this study, population genetic and phylogenetic analyses were performed.

Multiple sequence alignment was performed using MAFFT version 7 [1] and uneven ends were trimmed using BioEdit version 5.0.9. Population genetic statistics were generated using COI sequences for each collection site in DnaSP version 617. The statistics generated included the number of polymorphic (segregating) sites (s), number of haplotypes (h), haplotype diversity (Hd), and nucleotide diversity (π). For population structure characterization and further comparative analysis, we performed a phylogenetic analysis with the Kenyan An. stephensi COI sequences and An. stephensi COI sequences from both the native and invasive range retrieved from NCBI Genbank.

The global An. stephensi COI sequences included eight from India (Genbank accession number: KT89988818, KX467337, MH538704, MK726121, MN329060, LR736015, LR736014, and LR736013), four from Pakistan (Genbank accession number: KF406694, KF406693, KF406701, and KF406680)18,19, one from United Arab Emirates (Genbank accession number: MK170098)20, seven from Sri Lanka (Genbank accession number: MF975729, MF975728, MF124611, MF124610, MF124609, MF124608, and MG970565)21, two from Yemen (Genbank accession number: OM865140, and PP387838)22,23, one from Djibouti (Genbank accession number: KF933378)2, three from Sudan (Genbank accession number: MW197100, MW197099, and MW197101)24, two from southern Ethiopia (Genbank accession number: OQ865406, and OQ86540725, nine from eastern Ethiopia (Genbank accession number: OK663480, OM801691, OM801703, OM801693, OM801697, MH651000, OK663481, OK663479, and OK6634825,26,27, two from Somaliland (Genbank accession number: ON421572, and ON421574)28, and two from Kenya (Genbank accession number: OR607950, and OR607949)12. Anopheles maculatus was designated as an outgroup to be consistent with previous An. stephensi phylogenetic analysis29,30. Phylogenetic relationships were inferred using Mr Bayes version 3.2.731 which is based on Bayesian inferencing and relies on calculating the posterior probability distribution of phylogenies. The general time reversible (GTR) nucleotide substitution model32 with GAMMA rates of heterogeneity was used to determine how nucleotides evolve. The trees were visualized in FigTree version 1.4.433 and the percentage posterior probabilities included. We also mapped COI haplotype proportions across the study sites. For consistency and comparison, haplotype numbering was kept the same as in Carter et al.25 and color coded as in Samake et al.34.

Predicting the probability of occurrence across Kenya

Extensive entomological monitoring can be expensive, particularly when the species in question is rare, and difficult to collect with existing monitoring tools. A country-specific species distribution modeling approach was used to identify hotspots for the next phase of extensive entomological monitoring to inform control efforts for An. stephensi spread in Kenya. An ensemble species distribution model using the package ‘biomod2’ version 4.2.5.2 in R35 version 4.1.2 was employed, borrowing from the first prediction of An. stephensi invasion13. Climatic variable selection was based on a recent habitat suitability modeling approach used to identify entomological surveillance points that led to the first detection of An. stephensi in Ghana36.

The environmental and climatic data were downloaded at a resolution of 1 km square (km2) for explanatory variable pre-processing in R version 4.1.2. There were 19 bioclimatic raster layers, and an elevation raster layer from WorldClim platform37,38, and normalized difference vegetation index (NDVI) from 2020, 2021, 2022, 202338, alongside population density based on the 2019 population census in Kenya39. In total, there were 25 variables. Pre-processing involved cropping to the boundary extent of Kenya, raster resampling to match their resolutions, and encoding to American Standard Code for Information Interchange (ASCII) format.

The 19 bioclimatic raster layers and elevation were assessed for correlation using the ENMTools package in R to reduce multicollinearity. A correlation coefficient cutoff of ≥ 0.7 was used to identify high associations13,36. Where two rasters had a high correlation, only one was chosen for inclusion in the model. To minimize bias related to arbitrary variable selection and ensure the selection of the most unique variables, a process that used a count function to generate the correlation frequency for each variable across other variables was devised. This identified bioclimatic variables with a correlation coefficient ≥ 0.7 across more than 50% of the 19 bioclimatic variables and elevation (frequency ≥ 10/20 of variables). These were removed, followed by an additional stepwise elimination of other highly correlated variables.

Both An. stephensi presence and absence data were included as response variables. A binary coding was used to ensure readability by the models, with presences coded as “1” and absences as “0”. These were based on entomological monitoring conducted between December 2022 and June 2024. To reduce the overlap between presence and absence, absence records were not included in sites where a presence was recorded in subsequent sampling rounds. Actual absence records were selected in all counties where An. stephensi was not detected throughout the entomological monitoring efforts, to June 2024. In total, there were 4,128 unique geographic coordinates with absence, and 32 unique geographic coordinates with presence across the counties.

The ensemble modeling included four models, Random Forest, Generalized Additive Model (GAM), Gradient Boosted Machines (GBM), and Extreme Gradient Boosting (XGBoost), with the “Bigboss” calibration option for each model. The “Bigboss” options are a set of model calibrations optimized for species distribution modeling40. An additional step using K-fold cross-validation, with 3 partitions and 10 draws of cross-validation data (a total of 30 cross-validation runs for each model) reduce overfitting by each model. Additionally, each model was specified for an 80:20 split for test and training across the 30 cross validation runs. Model evaluation was based on the area under the receiver operating curve (ROC) and True Skill Statistic (TSS). The performance of each model was evaluated for inclusion in the ensemble using TSS.

Results

Larval sampling

One hundred and fourteen confirmed An. stephensi larvae were collected (Additional file 1) between December 2022 and August 2024. Sampling was conducted in 18 of the 47 counties with An. stephensi being detected in seven counties across northern Kenya. In the northwest, the vector was found in Turkana County. In the north-central region, An. stephensi was present in Marsabit County, with further spread southward into Samburu and Isiolo counties. In the northeast, the vector was detected in Mandera County and had spread southward to Wajir County. Additionally, An. stephensi was found in Elgeyo Marakwet County, located south of Turkana. Anopheles stephensi tended to occur along the major north–south road networks which was biased by the logistical challenge of accessing the interior of many of these remote locations within the two-year period (Fig. 1). The larval habitats consisted of dumped car tires, water storage tanks and other plastic containers, spill over from community water points, and water pits dug at mining sites (Fig. 2). Fifty nine percent of these sites were in urban and peri-urban areas with the remaining 41% occurring in rural sites. An. stephensi larvae were found to co-occur with An. arabiensis and culicine mosquitoes in the same larval habitats in 20% and 55% of all the habitats sampled, respectively. In the remaining 25% of habitats, An. stephensi was the only species present (Additional file 1). The relative abundance of An. stephensi compared to other species varied across sites, with An. stephensi comprising approximately 30% of larvae in mixed habitats.

Fig. 1
figure 1

Map of Kenya showing the road network (blue dashed lines), areas sampled and sites where An. stephensi has been detected as of August 2024. The red dots represent areas where Anopheles stephensi has been confirmed while those with black dots are areas where An. stephensi has not been detected. The figure was generated using QGIS Version 3.34.11 “Prizren” 49.

Fig. 2
figure 2

The range of Anopheles larval collection sites: (A) used car tire, (B) discarded plastic jerrican, (C) runoff from a community tank, (D) shallow pit dug in a gold mining site, (E) cut out water tank, (F) plastic water storage tanks.

Adult sampling

Over the 2 years of collection 52 samples were identified morphologically as An. stephensi of which 33 adult mosquitoes were confirmed to be An. stephensi by PCR or sequencing. Cross sectional surveys yielded 6 adults collected indoors using aspiration in Samburu County, while in Mandera County, 11 adults were collected indoor using CDC light traps, 3 by UV light traps outdoors and 3 by aspiration indoors.

The evaluation of adult collection methods in Marsabit County yielded 8 adults collected using UV light traps outdoors, 1 using CDC-light traps indoors and 1 using BG-Sentinel traps outdoors, other collection methods did not yield An. stephensi (Fig. 3).

Fig. 3
figure 3

Pictures showing the range of adult collection methods evaluated in Marsabit county: (A) Prockopack aspiration in hidden areas outside the house, (B) indoor UV light trap, (C) outdoor UV light trap, (D) Prockopack aspiration Inside a half-filed water storage tank, (E) set up of a BG Sentinel trap outdoors and (F) a volunteer conducting human landing catches outdoors.

Longitudinal sampling

Longitudinal sampling in Marsabit County revealed temporal variations in mosquito abundance. During the five collection time points spanning two years, we detected An. stephensi in December 2022 (initial detection), February 2023, May 2023, and February 2024. Notably, despite extensive sampling effort during October 2023, which coincided with the peak rainy season when mosquito populations typically increase, no An. stephensi specimens were collected. This unexpected absence during favorable climatic conditions suggests possible seasonal dynamics that differ from patterns observed in other invaded regions and in native malaria species. (Fig. 4).

Fig. 4
figure 4

Multiple sampling efforts in Marsabit shows fluctuations in An. stephensi densities in Marsabit county.

Molecular species confirmation and genetic diversity

One hundred and twenty Anopheles larval and adult specimens morphologically characterized as An. stephensi and molecularly analyzed in this study were confirmed as An. stephensi using cytochrome c oxidase subunit 141 (Additional file 2). The genetic diversity statistics based on COI sequences revealed two segregating sites leading to three haplotypes (Table 1). The haplotype diversity ranged from 0 for Mandera to 0.667 for Isiolo. Nucleotide diversity ranged from 0 for Mandera to 0.00421 for Isiolo (Table 1). Overall, the highest genetic diversity was observed in Isiolo with only three samples (h = 2; Hd = 0.667; π = 0.00421), and the lowest genetic diversity was observed in Mandera (h = 1; Hd = 0; π = 0) (Table 1).

Table 1 Population genetic diversity based on mtDNA COI loci of Anopheles stephensi from Kenya.

Phylogenetic analysis

Phylogenetic analysis indicated that all the samples of An. stephensi clustered within a single clade (Fig. 5). However, a more detailed examination of the Kenyan samples revealed the presence of three distinct clades, previously described in the literature [2]. These clades correspond to Haplotype 1, Haplotype 2, and Haplotype 3. Notably, Haplotype 2 was found to be the most prevalent among the samples, followed by Haplotype 3, while Haplotype 1 was identified in only two samples from Kenya. Geographical distribution of these haplotypes showed that Marsabit had Hap 2 and Hap 3, Wajir and Samburu had mostly Hap 2 with a single occurrence of Hap 3 each, Mandera only had Hap 2 while Isiolo had one occurrence of Hap 2 and two of Hap 3. Turkana was the only region with presence of Hap 1 in addition to Hap 2.

Fig. 5
figure 5

Phylogenetic analysis of the invasive An. stephensi COI sequenced data. Haplotype numbering was based on haplotypes list from Carter et al. 2021. Posterior probabilities > 70 are shown.

Population structure

Based on the phylogenetic analysis, all An. stephensi clustered on one clade with significant percentage posterior probabilities (100%) (Fig. 5). The Kenyan An. stephensi further clustered with three previously reported An. stephensi COI haplotypes26 (Fig. 5). The three clusters correspond to haplotypes 1,2, and 3 with haplotype 2 (Hap 2) as the most prevalent COI haplotype found in Kenya.

When mapped across the studied sites, An. stephensi from Marsabit, Samburu, Isiolo, and Wajir had similar genetic compositions with haplotypes 2, 3 (Hap 2/Hap 3) (Fig. 6A). Mandera had haplotype 2 (Hap 2) only and Turkana had haplotypes 1, 2 (Hap 1/ Hap 2) (Fig. 6A). Further comparative analysis revealed that two of the identified An. stephensi COI haplotypes (Hap 2 and Hap 3) in Kenya are two of the most common COI haplotypes of the invasive An. stephensi in the Horn of Africa (Fig. 6B). Specifically, Hap 2 and/or Hap 3 are found in Djibouti City (Djibouti), three sites in Somalia (Lawyacado, Berbera, Hargesia), and ten sites in Ethiopia (Semera, Bati, Gewane, Erer Gota, Dire Dawa, Jigjiga, Degehabur, Kebridehar, Godey, Hawassa) (Fig. 6). However, Hap 1 in Turkana is found only in Djibouti City (Djibouti) and three sites in northeast Ethiopia (Semera, Bati, Jigjiga) (Fig. 6).

Fig. 6
figure 6

(A) Kenyan An. stephensi COI haplotype map. (B) Kenya An. stephensi COI haplotype compared to Horn of Africa An. stephensi COI haplotypes.

Probability of occurrence

Seven environmental covariates were included in the model for predicting the probability of An. stephensi occurrence in Kenya. This was after a stepwise elimination process on 25 environmental variables, to minimize multicollinearity in the final covariates. Temperature seasonality (Bio4), temperature annual range (Bio7), mean temperature of the driest quarter (Bio9), precipitation of the wettest month (Bio13), precipitation seasonality (Bio15), mean NDVI from the year 2022 (NDVI2022res), and population density from the 2019 Kenya census (PopDen2020res).

Model evaluation was on area under the receiver operating curve (ROC) and True Skill Statistic (TSS). Values between 0.7 and 0.8 are considered acceptable while values above 0.8 are considered excellent predictive power. All individual models had good predictions, between a minimum TSS and ROC of 0.9 and maximum TSS of 0.98 and ROC of 1 (additional file 3). The final ensemble output had a TSS of 0.998 and an ROC of 1. Considering cross-validation was applied to reduce overfitting (30 cross-validation runs for each model), the model offers excellent prediction. Precipitation seasonality (Bio15), precipitation of the wettest month (Bio13), and Temperature seasonality (Bio4) were predicted as the top 3 with the most influence on the probability of occurrence of An. stephensi (Fig. 7a). The mean response curves show higher precipitation seasonality was associated with a higher probability of An. stephensi occurrence. Higher temperature seasonality and higher precipitation in the wettest month was associated with a decrease in the probability of An. stephensi occurrence (Fig. 7b).

Fig. 7
figure 7

Predictor importance by (a) Percent contribution of individual predictors to the ensemble output by mean (EMmean) and by committee averaging (EMca). Outputs were similar, EMmean was selected for the final model (b) Ensemble model TSS from each predictor.

The mean spatial output from the model ensemble has values adjusted to span from 1 to 0, with 1 representing high probability of occurrence and 0 representing a low probability of occurrence (Fig. 8). The model predicts a greater extent of An. stephensi than the current positive sites in Marsabit, Samburu, Turkana, Mandera, Wajir, and Isiolo counties with overlaps in neighboring counties including Tana River, Garissa, Kitui, Machakos, Makueni, Kajiado, Taita Taveta, Tharaka Nithi, Embu, Meru, Baringo, and West Pokot as highlighted in Fig. 8. These regions are currently classified as having low to seasonal malaria transmission, with the lowest prevalence in children aged 6–14 months (additional file 4)42.

Fig. 8
figure 8

Probability of An. stephensi occurrence from the mean model ensemble. The number on the map matches the county number in the table for identification. The figure was generated using R version 4.1.235.

Discussion

This study provides the current (as of June 2024) geographical distribution of An. stephensi in Kenya. Entomological surveys identified An. stephensi in seven counties across Kenya. In addition to the initial detections in Marsabit and Turkana counties, the invasive vector was subsequently found in Mandera, Wajir, Isiolo, Samburu, and Elgeyo Marakwet counties, indicating significant geographic spread throughout northern Kenya12. The spatial distribution of An. stephensi in Kenya shows a pattern of spread along major transportation routes, particularly in northern counties. This distribution pattern aligns with modelling studies, where human-mediated dispersal along trade routes has been associated with the rapid spread of An. stephensi13,43. The concentration of An. stephensi in urban and peri-urban areas, as well as its detection in diverse larval habitats, underscores its adaptability to human-altered environments—a characteristic that has contributed to its successful establishment and spread in the Horn of Africa44,45 where it has been associated with significant increases in malaria transmission, particularly in Djibouti4,46.

The co-occurrence of An. stephensi larvae with native Anopheles species and Culicine mosquitoes in shared habitats could have implications for integrated vector management (IVM). The relative abundance of An. stephensi in these mixed habitats, comprising approximately 30% of larvae, suggests that this invasive species utilizes overlapping larval habitats with native vectors. It is not clear whether similar observations have been made in other countries invaded by An. stephensi such as Ethiopia and Djibouti but this observation may necessitate that larval control programs, where and when implemented, are tailored to target both An. stephensi as well as co-breeding vector populations.

Despite expectations of higher adult An. stephensi numbers based on larval abundance and experiences in other countries, only 33 adults were collected over two years despite intensive efforts. This lower-than-expected adult capture rate may be due to the species’ behavior in this new environment or limitations in current sampling methods, highlighting the need for refined sampling techniques or potential out-competition with sympatric native species. This difficulty in adult collection has been noted in other areas with An. stephensi and may be due to the species’ behavioral plasticity or our limited understanding of its resting and feeding preferences in these new environments9. The success of UV light traps and BG-Sentinel traps in capturing adult An. stephensi, albeit in low numbers, provides valuable information for future surveillance efforts. Future studies are required to optimize methods or combination of methods and lures for trapping of An. stephensi. The temporal variation in An. stephensi detection in Marsabit county, with no samples collected during the October 2023 rainy season, differs from observations in some parts of Ethiopia, where An. stephensi populations peak during rainy seasons45. This discrepancy underscores the need for longitudinal surveillance to fully understand the species’ population dynamics in Kenya.

Our analyses of the genetic diversity and population structure of An. stephensi in Kenya reveals three distinct genetic compositions with various levels of genetic diversity, suggesting multiple introductions into the country. Based on the phylogenetic analysis, we observed that the Kenyan An. stephensi share mitochondrial DNA (mtDNA) COI haplotypes with mosquitoes from other countries in the Horn of Africa, indicating genetic connectedness between An. stephensi populations in Kenya and these countries (Fig. 6). Specifically, the population structure of the Kenyan An. stephensi is constituted of haplotype 1 (Hap 1), haplotype 2 (Hap 2), and haplotype 3 (Hap 3) with Hap 2 as the predominant COI haplotype in Kenya which was also reported as the most prevalent haplotype in the Horn of Africa region, suggesting a common origin or similar selective pressures across these invaded areas24,26,28 (Fig. 6). This COI population structure revealed three distinct An. stephensi genetic compositions in Kenya, which include Hap 1/Hap 2 in Turkana, Hap 2/Hap 3 in Marsabit, Samburu, Isiolo, and Wajir, and Hap 2 in Mandera, potentially reflecting three distinct An. stephensi introductions into the country (Fig. 6A). Also, as the least genetically diverse, the Mandera An. stephensi population is most likely a recent introduction compared to the other An. stephensi populations in Kenya (Table 1).

Further comparative population structure analysis showed that the Kenyan An. stephensi population structure, particularly, Mandera, Wajir, Isiolo, Samburu and Marsabit is similar to the population structure of southern Ethiopian An. stephensi populations (Hawassa, Godey) where both Hap 2 and Hap 3 are found (Fig. 6B). As these two regions share borders and are connected by major roads (e.g., Hawassa–Marsabit), the finding suggests two scenarios: (1) southern Ethiopia is a potential source population of the Kenyan An. stephensi in Mandera and Marsabit, or (2) southern Ethiopia and Kenya share a common founding population. While it is unclear which populations were established first, further comparative genetic analysis based on simultaneous sampling of both regions and multi-locus markers can help elucidate the temporal relationship between the two An. stephensi populations. However, Hap 1 found in Turkana, the most northwestern region of Kenya, has only been reported in Djibouti and north and central eastern Ethiopia in the Horn of Africa (Fig. 6B)26,34. Thus, continued sampling in southern Ethiopia and other regions bordering Turkana can help elucidate the potential source population of An. stephensi in Turkana.

Additionally, although the population structure revealed potential routes of introduction, the mechanism by which the species is being introduced is still unknown. Several potential mechanisms of An. stephensi invasion and dispersal in Africa have been hypothesized. Maritime and land trade routes were proposed to potentially play a role in the incursion and movements of this invasive species in the Horn of Africa34,43. The influence of wind as a mechanism of An. stephensi invasion into Africa was also suggested47. Thus, additional information on key trade routes and goods as well as wind patterns through these sites where An. stephensi has become established may be needed to reveal the mechanism of introduction and spread in Kenya.

Our species distribution model predicts a greater extent of An. stephensi occurrence in Kenya than currently observed, particularly in northern counties with historically low malaria transmission. This aligns with previous predictive models13 and raises concerns about potential increases in malaria cases in these regions, similar to observations in Djibouti4 and Ethiopia6 following An. stephensi detection. The model’s identification of low precipitation and minimal seasonal temperature variations as key factors influencing An. stephensi distribution provides valuable insights for targeted surveillance and control efforts. Interestingly, our model suggests a lower importance of population density compared to previous studies13,36, which aligns with the species’ detection in some of Kenya’s least densely populated counties39. This finding highlights the need for context-specific modeling approaches that account for local environmental and demographic factors.

Our findings have implications for An. stephensi vector control and surveillance in Kenya. The genetic diversity, population structure and potential routes of introduction could inform strategies to manage An. stephensi where it has been detected and prevent its further spread within Kenya. With the presence of different levels of genetic diversity, targeted vector control strategies could be implemented to effectively control An. stephensi in the invaded regions of Kenya. As the less diverse An. stephensi population in Kenya, larval source management could be implemented against An. stephensi in Mandera as highlighted in the WHO vector alert11. Larval source management (LSM) was found to be an effective approach against An. stephensi in its endemic range48 and in Ethiopia49 and even though LSM is not presently implemented in Kenya, the genetic connectivity between Kenyan An. stephensi and populations in neighboring countries emphasizes the need for coordinated, cross-border surveillance and control efforts. Such collaboration is crucial to prevent re-invasion and to manage this vector effectively on a regional scale49.

Conclusion

This continued surveillance by the National Malaria Control Program (NMCP) and partners in Kenya provides critical insights into the dynamics of An. stephensi movement and spread within Kenya, revealing distinct genetic compositions across the invaded areas that suggest multiple introductions. The genetic and spatial data presented here can inform targeted surveillance and control efforts, which are urgently needed to mitigate the impact of this invasive vector on malaria transmission in Kenya and the broader East African region. Future research should focus on elucidating the bionomics of An. stephensi in these new environments, refining sampling techniques for adult mosquitoes, and developing integrated vector management strategies that can effectively control this adaptable and resilient invasive species.