Introduction

As an inland river in Central Asia, the Ili River originates in the Tian Shan Mountains of northwestern China and flows through Kazakhstan before eventually draining into Lake Balkhash1. The Ili River is the largest river in the Xinjiang Uygur Autonomous Region and one of the three international rivers between China and the Republic of Kazakhstan. It has a total length of 1236 km and a drainage area of 151,000 square kilometers, 442 km and 56,000 square kilometers of which are located within China2. The Ili River basin topography is complicated. The River’s main stream and its three tributaries-Teksi River, Kunes River and Kashi River, flow through valleys. These tributaries have low but stable annual runoff, and the river has a steep gradient2. The unique terrain makes the Ili River Basin rich in water resources, fertile land, extensive grasslands, and dense forests, making it one of the most biodiverse regions in the Xinjiang Uygur Autonomous Region2,3. The ecosystem of the Ili River Basin contains abundant water resources and a healthy ecological environment, supporting diverse fish species3.

The Kashi River, a tributary of the Ili River, originates from the southern slopes of the Tianshan Mountains in Xinjiang and flows through a mountainous and ecologically diverse region. Although smaller than the main stem of the Ili River, the Kashi River plays an important ecological role in the local freshwater network, supporting aquatic biodiversity and serving as a vital water source for agriculture and livestock in surrounding communities3. In recent years, aquaculture and freshwater fisheries in southern Xinjiang have expanded steadily, and the fishery output and output value have increased year by year. Characteristic aquaculture has become a key driver of local economic development and has helped boost the income of farmers and herdsmen1. However, rivers in the region, including the Kashi River, face multiple environmental pressures, such as water extraction, habitat degradation, and climate variability, which threaten aquatic biodiversity and ecosystem stability2. Resource monitoring and sustainable management practices are essential to protect the health of rivers and the livelihoods they support4.

However, due to the Kashi River’s remote location and the complex topography of the Ili River basin, there is still a lack of relevant monitoring to grasp the overall status of its biological resources5. Fish represent a crucial ecological group within river ecosystems. Traditional methods for fish biodiversity monitoring—such as netting, electrofishing, or visual surveys—are often labor-intensive, spatially limited, and potentially harmful to aquatic habitats6. These limitations are particularly challenging in remote and rugged systems like the Kashi River, where seasonal access constraints hinder long-term ecological monitoring. In contrast, eDNA sampling can be conducted at regular intervals, offering a cost-effective means to monitor changes in species composition, abundance, and distribution throughout the year7. Despite the ecological importance of rivers in southern Xinjiang—such as the Kashi River—there have been few studies on fish biodiversity in these systems, likely due to their remoteness and limited accessibility2.

Environmental DNA (eDNA) has emerged as a powerful, non-invasive tool for assessing species diversity in freshwater ecosystems. By extracting trace DNA from water samples and applying eDNA metabarcoding, researchers can detect a wide range of fish species with high sensitivity and minimal habitat disturbance8,9,10. Metabarcoding of environmental DNA allows for the identification of species present in environmental samples, offering an effective tool for assessing fish biodiversity across different ecosystems. This approach can also aid in tracking species distribution, community structure, and ecological changes over time11,12,13,14,15,16,17. Previous studies have shown that eDNA has a high evaluation efficiency in freshwater fish community monitoring18,19,20,21,22,23. This method is particularly well-suited for challenging environments like the Kashi River Basin, where traditional surveys are limited by terrain and seasonal accessibility.

This study represents the first application of eDNA metabarcoding to assess fish diversity in the Kashi River. The main contents were as follows: (1) to survey the fish community structure in different sections of the river using eDNA, (2) to characterize spatial and seasonal patterns in species composition, and (3) to provide baseline data to support long-term monitoring and conservation strategies in this understudied region.

Methods

Sample collection

Environmental DNA (eDNA) samples were collected from Kashi River (81.81°E-84.14°E, 43.64°N-43.85’N), a tributary of the Ili River in the Xinjiang Uygur Autonomous Region. A total of 21 sampling sites were established across the Kashi River basin to comprehensively assess fish community composition (Fig. 1). Sampling sites were selected to cover a range of ecological conditions, including upstream, midstream, and downstream locations, ensuring a comprehensive representation of fish diversity across the river. Sampling was conducted in two-season, April and September, to capture seasonal variations. A total of 37 samples (16 in April and 21 in September) were collected and analyzed in this seasonal snapshot monitoring. Subsequent analysis divided the survey stations into different groups based on the longitudinal gradient and flow direction of the Kashi River (Fig. 1).

Fig. 1
figure 1

eDNA sampling station along Kashi River in Xinjing Uygur Autonomous Region. In the top panel is a satellite image of the Kashi River basin, with the eDNA sampling sites represented by yellow points. At the bottom is a map of the Ili River showing the overall distribution area and range of sampling points in Xinjing Uygur Autonomous Region. The map was generated using ArcGIS 10.8 (Esri, 2020) available at https://www.esri.com/en-us/arcgis.

In April, samples were collected from the upstream to the downstream sections of the Kashi River, while in September, sampling began from the downstream section and moved upstream. Although the sample codes differ in directional order between the two seasons, each code corresponds to a consistent geographic location along the river. Due to ice cover and restricted field accessibility in early spring, only 16 samples were collected in April. To improve spatial coverage and enhance the dataset, five additional sampling points were added in September, which were highlighted in red in Fig. 1. The sampling process involved traveling along the river by vehicle, collecting water samples sequentially from upstream to downstream locations. To ensure the temporal consistency and reliability of the eDNA samples, the entire sampling cycle was completed within approximately one week. This timeline minimized potential degradation of eDNA and ensured the validity of the samples for subsequent analysis24.

For eDNA water collection, 2L freshwater samples were collected from surface water using bleach sterilized sample bucket and stored in brown jars sterilized to reduce DNA degradation caused by sun exposure23,25. During the sampling process, each new station sampling process will disinfect the sampling equipment to prevent cross-contamination. Then the water samples were immediately taken to a laboratory to be filtered for DNA collection, and filtration within two hours using a TM-Giltration device equipped with a Sterivex TM-GP filter unit (pore size, 0.22 μm; EMD Millipore Corp, Billerica, MA, USA)22. Additionally, a negative filtration control with ultrapure water was filtered. Filter membranes samples were immediately frozen and stored at −20 °C prior and post-transportation until DNA extraction26.

eDNA extraction, PCR amplification and sequencing

The eDNA extractions from the filter units were performed using the TIANamp Soil DNA Kits (Tiangen Biotech Co., Ltd, Beijing, China) following the manufacturer’s protocol, and stored at −20 °C prior to further analysis. Then the quantity of extracted DNA was measured using a NanoDrop NC2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA), while the quality was assessed through agarose gel electrophoresis22. To ensure no cross-contamination during the eDNA extraction process and introduced from the laboratory environment, negative controls of filtration controls and extraction blanks were extracted and processed alongside all samples. After confirming the quality of DNA extraction by 1.2% agarose gel electrophoresis, DNA was amplified using PCR assays to largely fishes23.

PCR amplification of the conserved region of 12S rRNA region was performed using the forward primer Teleo2 F (5'-AAACTCGTGCCAGCCACC-3′) and the reverse primer Teleo2 R (5'-GGGTATCTAATCCCAGTTTG-3′)11. Three PCR replicates were performed for each sample, and the reaction solution volume for PCR amplification totaled 25 µL and comprised the following components: The Polymerase Chain Reaction (PCR) amplification components consisted of 5 μL 5 × Reaction Buffer, 5 μL 5 × GC Buffer, 2 μL dNTP (2.5 mM), 1 μL Forward Primer (10 μM), 1 μL Reverse Primer (10 μM), 2 μL DNA Template, 8.75 μL ddH2O, and 0.25 μL Q5 DNA Polymerase. The thermal cycling profile was as follows: initial denaturation at 95 °C for 10 min, followed by 45 cycles of denaturation at 95 °C for 30 s, annealing at 63 °C for 45 s, extension at 72 °C for 60 s, and a final extension at 72 °C for 10 min. PCR amplicons were purified with Vazyme VAHTSTM DNA Clean Beads (Vazyme, Nanjing, China) and quantified using the Quant-iT PicoGreen dsDNA Assay Kit (Invitrogen, Carlsbad, CA, USA)27. After the individual quantification step, amplicons were pooled in equal amounts, and pair-end 2 × 300 bp sequencing was performed using the Illlumina MiSeq platform with MiSeq Reagent Kit v3 at Shanghai Personal Biotechnology Co., Ltd (Shanghai, China)22.

Sequencing library preparation was constructed by Illumina’s TruSeq Nano DNA LT Library Prep Kit. Libraries were sequenced on 500 cycles (for paired-end sequencing of the 12S amplicons) MiSeq V2 Standard Flow Cell on an Illumina MiSeq platform (Illumina, San Diego, CA, USA)23,28. The libraries were dissolved to 2 nM in a gradient before being proportionally blended to the required amount of data. Before on-machine sequencing, the library was inspected by Agilent High Sensitivity DNA Kit on Agilent Bioanalyzer28. The NovaSeq sequencer was used for double-ended sequencing with the NovaSeq 6000 SP Reagent Kit23. Then, the original high-throughput sequencing data was screened based on sequence quality, and problematic samples were further sequenced or re-sequenced as needed22,23,28.

Bioinformatics analysis and taxonomic assignment

The original MiSeq sequencing data were performed using the FastQC program and stored in FASTQ format23. The library and samples were divided according to index and barcode information from the original sequence of quality preliminary screening, and barcode sequence was removed. The sequences were further subjected to quality control using the Vsearch (v.2.13.4) pipeline29, the detail as follow: raw sequence data were demultiplexed using the demux plugin following by primers cutting with cutadapt plugin30. Sequences were then merged, quality filtered and dereplicated using functions of fastq_mergepairs, fastq_filter and derep_fullength in Vsearch plugen. All the unique sequences were then clustered at 98% (via cluster_size) followed by chimera removing (via uchime_denovo)23. Non-singleton amplicon sequence variants (OTUs) were aligned with mafft31.

Taxonomic information for each representative OTU sequence was assigned by querying against the NCBI GenBank nucleotide database using BLASTn, with comparisons made to both the NCBI Organelle Genome Resources (nt) database and the MitoFish database23,32. Taxonomic assignments were made based on the best BLAST hit with a sequence identity ≥ 97% and alignment coverage ≥ 80% for species-level identification. After that, the specific composition of each sample at different species taxonomic level was displayed to understand the overall situation. To enhance classification accuracy, all initially matched species were further screened using FishBase (https://www.fishbase.org/) and Fish Chronicles33, and potential misassigned or contaminant taxa were manually removed. Sequence data analyses were mainly performed using QIIME2 and R software (v4.2.3)34.

Statistical analyses

Through statistical analysis of the feature table after the removal of singleton, R script was used to calculate the clustering results of each sample and each taxon, and the species composition of the seasonal snapshot monitoring was presented in the form of bar graphs. Furthermore, the overlapping in fish OTUs between each station site in the two-season were illustrated by the “UpSetR” package to compare the uniqueness of the communities35. To identify key species contributing to seasonal differentiation, we employed a Random Forest classification algorithm using QIIME2. The input data consisted of a species-level abundance matrix generated from eDNA OTU assignments. The model was trained to classify samples by season (April vs. September), and species importance was assessed using permutation importance. The 20 species with the highest importance scores were selected for visualization. A corresponding heatmap was generated to show the relative abundance of these species across all sampling sites.

Alpha diversity indices, such as Chao1 richness estimator, Shannon diversity index, Simpson index, Faith’s PD, Pielou’s evenness and Good’s coverage were calculated using the OTU table in QIIME2, and visualized as box plots23. OTU-level ranked abundance curves were generated to compare the richness and evenness of OTUs among samples. To investigate the diversity of community composition across different sections of the Kashi River, sampling sites were divided into four groups corresponding to the upper, upper-middle, lower-middle, and lower reaches of the river, based on their longitudinal positions. Specifically, Group A is S1-S4 stations in the upper reaches of Kashi River; Group B is S5-S9 stations in the upper and middle reaches; Group C is S10-S13 stations in the middle and lower reaches; Group D is the downstream S14-S21 station (Fig. 1).

Beta diversity analysis was performed to assess differences in fish community composition across different regions and between the two seasons. Non-metric multidimensional scaling (NMDS) based on Bray–Curtis dissimilarity was used to visualize patterns in community structure36. To further examine clustering patterns, hierarchical cluster analysis was conducted on the same dissimilarity matrix. NMDS, which preserves the rank order of dissimilarities rather than absolute distances, is well-suited for complex ecological data and provides a stable ordination37. All analyses were performed using the “vegan” package in R.

Results

Sequence statistics and fish composition

In this study, a total of 4,594,782 high-quality sequences (2,457,586 in April and 2,137,196 in September) were obtained after quality filtering and removal of chimeric reads, with an average of 124,183 sequences per sample. After bioinformatics filtering analysis, 166 fish species were detected in the eDNA survey conducted in the main stream of Kashi River and distributed across 111 genera, 44 families and 21 orders in two-season. Statistical analysis of eDNA data collected from 16 sampling stations in April identified 77 fish species. In September, eDNA analysis from 21 stations revealed a significantly greater taxonomic diversity, identifying 137 fish species. The results of seasonal snapshot monitoring showed noticeable differences in fish communities, with only 48 fish species consistently detected across both seasons. The survey situation of fish in each season was listed in Supplementary Table S1. We analyzed the monitoring results of two-season separately.

77 fish species belonging to 55 genera, 23 families, and 11 orders were detected in April, among the order levels, Cypriniformes, Gobiiformes, Siluriformes had the largest amount of eDNA monitoring; Among the family levels, Cyprinidae, Nemacheilidae, Xenocyprididae, Gobiidae, Acheilognathidae had the largest amount. Among the genera levels, Triplophysa, Rhinogobius, Acheilognathus had the largest amount (Table S1).

Figure 2 illustrated the species-level taxonomic composition and relative abundance of fish detected in different samples. Species are ordered by abundance, highlighting community structure variations among stations (Fig. 2). Variations in species composition among sites reflect spatial heterogeneity, and some samples show limited species annotation due to low eDNA concentrations or detection limitations. The fish species with high relative abundance detected in the April survey were Triplophysa aliensis, Triplophysa strauchii, Pseudorasbora parva, Micropercops swinhonis, Perca fluviatilis, Oncorhynchus mykiss, Gymnodiptychus dybowskii, Rhinogobius nagoyae, Carassius gibelio, Rasbora atridorsalis. (Fig. 2).

Fig. 2
figure 2

Species-level composition of fish communities based on the April eDNA survey. The horizontal axis represents the sample names. The vertical axis represents the relative abundance (%) at the species level. The right side illustrates the species-level taxonomic composition detected by eDNA monitoring in April, represented in the bar chart with different colors.

Then 137 species belonging to 101 genera, 43 families, and 21 orders were detected in September. Among the order levels, Cypriniformes, Gobiiformes, Perciformes had the largest amount of eDNA monitoring. Among the family levels, Leuciscidae, Xenocyprididae, Cyprinidae, Gobionidae, Nemacheilidae had the largest amount. Among the genera levels, Acheilognathus, Gobio, Phoxinus had the largest amount (Table S1). The fish species with high relative abundance detected in the September survey were Phoxinus ujmonensis, Carassius gibelio, Triplophysa aliensis, Gobio acutipinnatus, Triplophysa strauchii, Hypophthalmichthys nobilis, Gymnodiptychus dybowskii, Oncorhynchus mykiss, Siniperca scherzeri, Triplophysa labiate (Fig. 3).

Fig. 3
figure 3

Species-level composition of fish communities based on the April eDNA survey. The horizontal axis represents the September samples. The vertical axis represents the relative abundance (%) at the species level. The right side illustrates the species-level taxonomic composition detected by eDNA monitoring in September, represented in the bar chart with different colors.

Then, the freshwater fish statistics of the two-season were shown in Table 1. In general, the fish diversity of the Kashi River basin is very rich, with results showing a total of 153 freshwater fish species monitored over the two-season, and the fish composition of the two-season is significantly different. 73 freshwater fish were detected in April and 124 in September, among which 44 species of freshwater fish were detected in the two-season (Table 1).

Table 1 Lists of freshwater fish species based on eDNA monitoring.

“ + ” indicates species detected in the corresponding season; “” indicates species detected in both seasons; blank cells indicate absence. Species are listed in alphabetical order based on their scientific names.

Fish distribution in Kashi River

Figure 4 presents UpSet plots showing the overlap and distribution of fish species across sampling stations in April (a) and September (b), based on OTUs detected through eDNA analysis. As shown in the Fig. 4a, only 7.79% (6/77) of the fish species overlapped in all combinations of the April surveys as indicated by the red bars, with no significant difference fish numbers observed among sampling locations. Additionally, 14.29% (11/77) of fish species were detected at more than 50% of the stations. Station SE12 recorded the highest number of fish species, while station SE01 had the lowest. Notably, 22.08% (17/77) of the fish species identified in April were unique to that survey. The majority of unique OTUs were detected at stations SE10 and SE12, followed by SE16, SE14, SE09, SE08, and SE13 (Fig. 4a).

Fig. 4
figure 4

UpSet plots showing the overlap of fish species across stations in (a) April and (b) September. Red bars represent species shared across all stations; yellow bars indicate species unique to a single station; green bars show species detected at more than half of the stations (> 50%). The horizontal bars display the total number of species observed at each station.

In September, the OTU profiles sampling locations were highly variable, with only 1.46% (2/137) of fish species common to all sampling combinations (Fig. 4b). A larger proportion of species (6.57%, 9/137) were shared across more than 50% of stations, while 23.36% (32/137) were unique to only one station. Most unique OTUs in September were found at station UE21, followed by stations UE19, UE09, and UE02. Station UE02 recorded the highest fish species richness, while station UE06 detected the fewest species (Fig. 4b). These results highlighted differences in community composition and spatial distribution patterns.

The 20 most abundant fish species detected across the two-season were presented in Fig. 5. Species are ranked from top to bottom in descending order of significance, which from 6.00 to −0.63. A few species showed negative permutation importance values, indicating that their presence may introduce noise or have little relevance to the classification task (Fig. 5). These top-ranking species can be considered as key indicators of group differences between the two-season, highlighting their pivotal role in shaping the seasonal fish community composition of the Kashi River in the Ili River. The 20 most abundant species identified across the two surveys included Pseudorasbora parva, Micropercops swinhonis, Diptychus maculatus, Siniperca scherzeri, Triplophysa leptosoma, Elopichthys bambusa, Gymnodiptychus dybowskii, Rhinogobius nagoyaePerca fluviatilis, Triplophysa strauchii, Plecoglossus altivelis, Triplophysa aliensis, Triplophysa brevicauda, Triplophysa dorsalis, Carassius gibelio, Rasbora atridorsalis, Cyprinus acutidorsalis, Coreoperca whiteheadi, Perca schrenkii, Oncorhynchus mykiss (Fig. 5).

Fig. 5
figure 5

Random forest analysis of fish in the seasonal snapshot monitoring. The bar chart shows species importance scores for the classifier model, with the horizontal axis representing scores and the vertical axis listing species. The heatmap displays species abundance across samples.

Fish diversity and seasonal variation

Ultimately, samples from two-season were grouped into four groups (A to B) for alpha diversity analysis, based on their longitudinal positions along the river from upstream to downstream (Fig. 1). Although the specific sample IDs within each group differed between seasons due to variations in sampling order, the groups were consistently presented in upstream-to-downstream order to ensure comparability. Figure 6 presented boxplots of alpha diversity indices (Observed species, Chao1, Shannon, and Simpson) across different sample groups. In each panel, the horizontal coordinate is the group label, and the vertical coordinate is the value of the corresponding alpha diversity index. The numbers under the diversity index label are the P-values of the Kruskal–Wallis test. Overall, no significant differences were observed among the groups in terms of species richness and diversity, indicating comparable levels of α diversity across regions.

Fig. 6
figure 6

Alpha diversity analysis detected in the Kashi River from eDNA surveys. (a) April fish diversity analysis results, (b) September fish diversity analysis results.

In April, Group B exhibited notably higher Observed species and Chao1 indices compared to the other groups, suggesting a potentially higher species richness in this section. Some of the differences among groups were statistically significant (Kruskal–Wallis test, p < 0.05), indicating that spatial variation along the river may have a significant influence on fish diversity patterns (Fig. 6a). Additionally, the results showed that no significant difference in Shannon diversity was detected in September between the upstream and downstream regions (P > 0.05), suggesting that overall community evenness remained stable across stations (Fig. 6b).

The difference of fish composition among samples was demonstrated by NMDS dimensionality reduction method, and visualized by scatterplot, different colors and symbols are used to represent the groups to show the differences between samples more clearly (Fig. 7). The stress value of the NMDS analysis was below 0.2, confirming the reliability of the ordination results. The closer the distance between the samples, the smaller the difference between the fish communities in the two samples. In April, the fish composition of station SE02 and station SE14 were similar, station SE05, SE06 and SE07 were also similar. Fish composition at station SE16 was different from that at other stations (Fig. 7a).

Fig. 7
figure 7

NMDS Ordination Based on Bray–Curtis Dissimilarity of eDNA Composition (a) April survey sample. (b) September survey sample. (c) Comparison of samples in two-season. Each point in the figure represents a sample, with different colored points indicating different samples (groups). The proximity (or distance) between two points reflects the degree of similarity (or dissimilarity) in microbial community composition between the corresponding samples.

In September surveys, there was little difference in fish composition from station UE01 to station UE06, while they were different from other stations (Fig. 7b). Overall, the NMDS analysis of fish composition retrieved in different samples revealed marked spatial variation in community composition. In addition, we compared fish community composition between two-season, April and September, the results revealed clear clustering of samples by season, indicating significant differences in fish composition between the two periods (Fig. 7c).

Discussion

eDNA effectively detects fish diversity in the Xinjiang Uygur Autonomous Region

Previous studies have used eDNA to investigate fish diversity in other important freshwater river areas in China22,23,38,39. The above studies indicated that eDNA technology provided a valuable tool for conservation and management efforts in this ecologically significant yet understudied region. However, comprehensive investigations remain limited, leaving critical knowledge gaps regarding the biodiversity and conservation needs of the aquatic ecosystem40. The study is the first, to our knowledge, to report continuous multi-season monitoring of the Kashi River of the Ili River in Xinjiang, China, based on eDNA technology. Studies focusing on the ichthyofauna of the Kashi River have aimed to understand the composition, distribution, and seasonal variation of fish species in this region.

Despite the limited genetic information available for many endemic fish species in Xinjiang, we still monitored a total of 166 fish species across two-season, with 77 species detected in April and 137 species in September. A previous study conducted from 2012 to 2014 investigated fish resources in the upper, middle, and lower reaches of the Ili River in Xinjiang, China, and collected 27 fish species belonging to 6 orders, 9 families, and 23 genera2. In the present study, the dominant species detected through eDNA monitoring showed a high degree of consistency with those captured using traditional trawling methods, such as Diptychus maculatus, Gymnodiptychus dybowskii and Schizothorax eurystomus, indicating the reliability of eDNA for assessing fish community composition in this river system3.

The predominance of Cyprinidae, along with the high representation of Gobionidae and Xenocyprididae, reflected the typical fish community structure expected in temperate freshwater ecosystems such as the Ili River2. These findings were in line with the ecological characteristics of the region, where common cyprinid species are known to thrive in flowing, nutrient-rich waters2. The ability of eDNA to capture these dominant taxa further supports its reliability as a tool for assessing fish biodiversity in large river systems. Moreover, the taxonomic resolution achieved in this study underscores the potential of eDNA for long-term ecological monitoring and conservation planning in similar river ecosystems.

Remote habitats support a wide range of fish species, including numerous endemics and commercially important taxa. Research on fish diversity in the Xinjiang Uygur Autonomous Region has gained attention in recent decades due to concerns about the development of cascade hydropower plants, overfishing, habitat degradation, invasive species, and the impacts of climate change2,41. Unlike traditional survey methods, eDNA enables the detection of species without the need for direct observation or capture, which minimizes environmental impact and reduces sampling bias42,43. This method is particularly advantageous in detecting rare, elusive, or invasive species that might otherwise go unnoticed using conventional approaches14.

Moreover, eDNA analysis offers a cost-effective and time-efficient alternative to labor-intensive fieldwork, as it allows for the simultaneous detection of multiple species through metabarcoding techniques11,44. The non-invasive nature of eDNA sampling also makes it suitable for monitoring protected areas or species under conservation concern. However, challenges such as DNA degradation, false positives due to contamination, and varying detectability depending on environmental conditions highlighted the need for methodological standardization and validation across different habitats45,46,47. Therefore, it is necessary to strengthen the combination of traditional and molecular measurement methods19. Overall, the integration of eDNA tools into the Xinjiang Uygur Autonomous Region fish biodiversity assessments provides valuable insights into ecosystem health and helps inform conservation and management strategies in a rapidly changing environment.

eDNA could be applied to monitor seasonal variations in fish communities

eDNA has demonstrated significant potential for monitoring the seasonal variation of fish communities, providing a non-invasive and efficient approach to capture temporal biodiversity dynamics48. Our research revealed significant differences in fish composition between two-season. The findings align with previous studies that emphasize the importance of multi-season monitoring for capturing biodiversity patterns in freshwater ecosystems14,39. Furthermore, eDNA can provide insights into migratory behaviors and the presence of transient or cryptic species that are challenging to capture using conventional methods49.

Due to the lack of environmental data at some stations, this study was unable to analyze fish movement in conjunction with environmental factors. Nonetheless, survey results indicated similarities in fish composition among neighboring stations, such as SE05, SE06, and SE07 in April. Similarly, in September, the fish composition at stations UE1 to UE05 were comparable (Fig. 7). These observations suggested minimal differences in fish composition within similar environmental contexts. Seasonal environmental factors, such as water temperature, flow rate, and biological activity, can influence eDNA detection rates, making it a powerful tool for understanding the interplay between fish community dynamics and environmental variables50. Building on these findings, future research should further integrate eDNA technology with environmental monitoring to explore the dynamic changes in fish communities under varying ecological conditions.

Moreover, habitat complexity significantly influences fish distribution and abundance. Areas characterized by diverse structural elements, such as submerged vegetation and rocky substrates, tend to support higher fish diversity by offering essential shelter and foraging opportunities51. We observed clear differences in fish community composition between upstream and downstream sites (SE01 and SE16 in April; UE01 and UE21 in September), as indicated by NMDS ordination patterns. Understanding these relationships is vital for effective fishery management and conservation efforts, particularly in light of changing environmental conditions due to climate change52. Incorporating environmental data into fish monitoring programs can enhance our understanding of fish populations and help inform strategies for their sustainable management53.

Despite the promising potential of eDNA technology for assessing fish diversity in the Ili River, several challenges such as DNA degradation rates, sampling frequency, and methodological standardization must be addressed to enhance the accuracy and reliability of eDNA data in capturing seasonal trends. The high flow rates in rivers can result in eDNA transport over long distances, complicating the interpretation of local biodiversity14. eDNA monitoring results in this study identified some marine fish species, such as Yellowfin Seabream Acanthopagrus latus, Large Yellow Croaker Larimichthys crocea, Japanese Grenadier Anchovy Coilia nasus, and Spotted Scat Scatophagus argus. In addition, we also detected a considerable number of non-native freshwater species, such as Rasbora atridorsalis, Channa marulius, Monopterus albus, and Plecoglossus altivelis.

One common explanation is taxonomic misassignment, which may occur when closely related freshwater and marine taxa share highly conserved gene regions, leading to incorrect matches during sequence alignment54,55. Anthropogenic contamination is another possible cause. Marine DNA can be introduced into freshwater systems through wastewater discharge, aquaculture runoff, or seafood processing facilities56. Additionally, the transport of eDNA across environments via water flow or atmospheric deposition has also been proposed as a potential mechanism, though this remains relatively rare and often difficult to confirm14. These findings emphasize the need for cautious interpretation of eDNA results and the importance of verifying unexpected taxa through ecological plausibility and database curation. Our study complemented the regional reference database and provided important information for future studies.

Spatial distribution of fish communities in Kashi River

Fish surveys in remote areas, have certain limitations and potential challenges, so the use of eDNA technology provides new methods and ideas for monitoring biodiversity in these areas20. We compared the results of other studies to show the composition of indigenous fish. It is noteworthy that two of the sampling sites in this study—Kashi River at Nileke (S14) and Yamaduh River section (S21)—correspond to locations (C and E, respectively) surveyed in the previous trawling-based study2. In the previous survey, only 9 species were recorded at each of these sites. In contrast, our eDNA-based monitoring detected 31 species in April and 19 species in September at the Nileke site, and 22 species in April and 37 species in September at the Yamaduh River section. These results suggest that eDNA metabarcoding can capture a substantially higher level of fish diversity compared to traditional trawling methods, especially in complex or under-sampled habitats2.

Challenges remain in using eDNA, particularly in accounting for water flow dynamics, which can influence eDNA transport and lead to false positives or negatives23. However, advancements in metabarcoding techniques and predictive modeling have enhanced the reliability of eDNA for studying spatial patterns in fish communities, offering valuable insights for managing and conserving remote ecosystems8,14,57. This method has proven effective in identifying both endemic species and invasive threats, aiding in the conservation of unique ecosystems9,15,17,58. Spatial resolution can be improved by sampling across multiple sites, allowing researchers to map species distributions and pinpoint biodiversity hotspots59.

We identified a strong spatial distribution pattern of fish eDNA within the different river basins (Figs. 4, 7). Multiple possible reasons may cause differences in the distribution of fish, such as eDNA transport14,23 and the effects of human activities60. In the lower reaches of the Kashi River, sampling sites with high levels of human activity, such as the Tuohai Power Station, exhibited a reduced diversity of fish species. This may be due to human activities, including agricultural expansion, urbanization, overfishing and water pollution, which severely disrupted fish communities and led to a significant decline in fish diversity54,58,61. This study showed eDNA technology offers significant potential for advancing freshwater fish monitoring in the Kashi River in the Xinjiang Uygur Autonomous Region. Despite current challenges, ongoing methodological and technological improvements, coupled with targeted research efforts, will enhance its applicability and reliability.

Conclusion

Our study successfully revealed the fish diversity and community composition of the Kashi River in the Xinjiang Uygur Autonomous Region through two-season of systematic spatial sampling based on eDNA technology. A total of 166 fish species (77 in April and 137 in September) were detected in the two-season, of which 153 were freshwater fish (73 in April and 124 in September). In summary, our study was the first to continuously monitor fish diversity in the remote watershed of the Kashi River using eDNA technology. Our case study demonstrated that eDNA technology can be an effective tool for rapidly monitoring the composition and distribution of fish diversity in the Kashi River, a tributary of the Ili River in Xinjiang, China. Moreover, our findings highlight the need for continued monitoring of fish stocks in the region, which will help assess its biodiversity conservation and restoration measures.