Introduction

Rivers in India are lifelines for domestic, agricultural, and industrial needs, but they are increasingly threatened by pollution arising from rapid urbanisation, agricultural intensification and industrial expansion. Among the most serious contaminants are metals such as arsenic (As), lead (Pb), and cadmium (Cd), which are toxic even at trace levels and pose serious risks to human health1,2,3,4. These metals originate from both natural processes, such as weathering and sediment mobilisation, and anthropogenic activities, including mining, fertilizer use, untreated sewage and industrial discharges5,6,7,8,9,10. Once introduced into river systems, metals are persistent, bio-accumulative, and capable of causing long-term health effects ranging from neurological and developmental damage to kidney dysfunction and cancer1,3,4,11,12,13,14,15,16,17. In India, seasonal variability linked to the monsoon further complicates pollution dynamics, with high flows diluting contaminants and low flows amplifying their concentrations. Consequently, rivers often alternate between periods of apparent safety and episodes of severe health risk, exposing millions of people who rely directly on untreated water for drinking and domestic use.

Despite the recognised importance of rivers as a resource and the well-established toxicity of heavy metals, risk-focused assessments at river confluences remain limited. Confluences are hydrologically dynamic zones where contaminants from different catchments interact, mix, and sometimes intensify, potentially creating hotspots of contamination18,19,20. Yet, much of the existing research in India has focused either on individual rivers or on general water quality indices, often overlooking the combined risks posed by multiple metals at strategic mixing points8,10,21,22,23,24,25,26,27,28,29,30,31,32,33,34. This gap in knowledge means that communities depending on confluence waters for drinking or irrigation may be exposed to risks that remain poorly characterised and underestimated. The lack of detailed risk-based studies not only hinders scientific understanding of pollutant behaviour at confluences but also limits the development of effective water management and public health interventions specifically designed for such critical locations.

The present study addresses this gap by conducting a year-long, systematic water quality assessment at the Betwa–Yamuna confluence in Hamirpur District, India28,29,30. The specific objectives of this study are to:

  • Evaluate the compliance of key physicochemical parameters and dissolved metals against the World Health Organization (WHO) drinking water standards35;

  • Quantify non-carcinogenic and carcinogenic health risks for both adults and children using established United States Environmental Protection Agency (USEPA)/WHO frameworks36,37;

  • Model uncertainty and variability in exposure and risk estimates using Monte Carlo simulations, providing probabilistic insights beyond conventional deterministic approaches38,39,40.

Together, these three objectives establish a comprehensive baseline for water safety at one of northern India’s important river confluences. The results highlight seasonal and site-specific vulnerabilities and provide evidence-based guidance for targeted monitoring, treatment interventions, and long-term risk management strategies.

Materials and methods

Study area and data

Parameters used in this study reflect both general water quality indicators and contaminants of recognised health concern. Physicochemical variables such as pH, electrical conductivity (EC), total dissolved solids (TDS) and temperature provide baseline information on the ionic strength, solute concentration and stability of the aquatic environment, which are critical for understanding seasonal changes in dilution and evaporation. Trace metals, including arsenic (As), cadmium (Cd) and lead (Pb), were prioritised due to their toxicity and frequent exceedance of drinking water standards in South Asian River systems1,2,16,41,42,43,44. Additional elements such as Cu, Fe, Mn, Mo, Ni and Zn were included to capture broader geochemical variability and to assess potential contributions from both natural sources, e.g., weathering, sediment mobilisation, and anthropogenic inputs, e.g. industrial discharges and agricultural runoff.

Sample collection and analysis

Surface water samples were collected monthly from June 2023 to May 2024 at a depth of approximately 20 cm from three monitoring locations: Betwa River (25°56′43″ N, 80°09′14″ E), Yamuna River before confluence (25°57′29″ N, 80°09′23″ E) and Yamuna River after confluence (25°55′34″ N, 80°14′23″ E) (Fig. 1). These locations were selected to monitor seasonal water quality variations and mixing behaviour before and after the confluence point in the Hamirpur district, Uttar Pradesh, India.

Fig. 1
figure 1

Study area maps: (a) India with an inset of the Ganga plains; (b) Digital Elevation Model (DEM) of the Ganga plain highlighting the study area DEM image was downloaded from https://earthexplorer.usgs.gov/. and modified using CorelDraw 2023; (c) Sampling locations along the Betwa and Yamuna rivers, both upstream and downstream of their confluence, in Hamirpur District, Uttar Pradesh, India (Map created using ArcGis 10.8.2).

Water samples were collected using pre-cleaned 50 mL narrow mouth high-density polyethylene bottles, each sealed with an inner cap and secured using waterproof tape to prevent evaporation and atmospheric exchange. After collection, all samples were stored at room temperature in dark conditions until laboratory analysis.

Prior to sampling, all bottles were acid-washed with 10% nitric acid, rinsed thoroughly with deionised water, and air-dried to prevent metal contamination. Field blanks and duplicate samples were collected periodically to verify sampling precision and contamination control. Physicochemical parameters (pH, EC, TDS and temperature) were measured following standard APHA procedures using calibrated portable meters45. Trace metal analysis (As, Cd, Pb, Cu, Fe, Mn, Mo, Ni and Zn) was performed at the Geochemistry lab of Birbal Sahni Institute of Palaeosciences (BSIP) after acid digestion using standard laboratory protocols. Instrument calibration was verified using multi-element standard solutions, and analytical accuracy was assessed using procedural blanks and replicate measurements, with relative standard deviations maintained within ± 5%.

Sampling at monthly intervals across all hydrological seasons allowed for the detection of temporal trends, including low-flow enrichment during pre-monsoon periods and dilution during the monsoon. The inclusion of three strategically positioned sites, Betwa, Yamuna upstream and the confluence, enabled comparative assessments of tributary contributions and downstream mixing effects. This spatial and temporal coverage ensured a robust dataset for subsequent health risk modelling, statistical interpretation and Monte Carlo simulations, providing a comprehensive view of water quality dynamics at this critical river junction.

WHO and USEPA standards

Although national regulatory limits exist under the Bureau of Indian Standards (BIS), this study adopts WHO and USEPA guideline values to ensure international comparability and robust toxicological interpretation. Notably, the second revision of the BIS drinking water requirements has been upgraded to closely align with internationally recognised benchmarks, including WHO guidelines, USEPA standards, EU Directives, and the Indian Manual on Water Supply and Treatment. This regulatory harmonisation supports the application of WHO and USEPA frameworks for risk assessment in the Indian context.

The water quality parameters measured in this study were systematically evaluated against benchmark guideline values established by the World Health Organization (WHO)35 and, where appropriate, by the United States Environmental Protection Agency (USEPA)36,37. These international standards are widely recognised as the most authoritative references for defining safe limits of contaminants in drinking water and serve as the foundation for assessing potential human health risks. The WHO guidelines provide maximum permissible concentrations for critical trace metals such as – As (0.01 mg/L), Cd (0.003 mg/L) and Pb (0.01 mg/L) (Table 1), all of which are classified as priority pollutants due to their chronic toxicity and well documented impacts on human health, including cancer, kidney dysfunction, and neurological disorders1,11,13,14,15,16,17,41,42,43,44,46. Similarly, physicochemical parameters such as pH, EC and TDS are also compared against WHO thresholds to assess the overall suitability of the water for consumption, as deviations from the recommended ranges can affect both palatability and long-term health outcomes. It should be noted that the Bureau of Indian Standards (BIS: IS 10,500:2012, revised 2015) does not prescribe a direct guideline value for EC in drinking water. Instead, BIS regulates TDS with an acceptable limit of 500 mg/L and a permissible limit of 2000 mg/L in the absence of an alternative source47. Because EC is strongly and empirically related to TDS, it is widely used as a supporting proxy indicator of salinity and mineralization rather than a regulated toxicological parameter. Therefore, in this study, EC is interpreted in conjunction with TDS and WHO guideline ranges to assess overall mineral load, aesthetic quality, and seasonal salinity variation rather than as a parameter with a direct health-based regulatory limit. The USEPA standards were incorporated where supplementary toxicological data or slope factors for carcinogenic risk assessment were required, ensuring that both non-carcinogenic (HQ, HI) and carcinogenic (CR) health risk indices were calculated within a robust and internationally accepted framework. This approach ensures that the evaluation of the Betwa-Yamuna water quality not only reflects local environmental conditions but also meets the standards necessary for international comparability, thereby highlighting the extent to which water in this system poses risks to public health under current usage conditions.

Table 1 WHO/USEPA standards for each parameter35,37.

Health risk assessment (standard USEPA/WHO framework)

Health risk assessment in this study was conducted following the standard methodology prescribed by the USEPA and WHO, distinguishing between non-carcinogenic and carcinogenic risks. Four main indices were used to quantify risk: the Average Daily Dose (ADD), Hazard Quotient (HQ), Hazard Index (HI) and Carcinogenic Risk (CR). The ADD represents the estimated daily intake of a contaminant through water ingestion and was calculated using Eq. (1):

$${\text{ADD}} = \left( {C \times IR \times EF \times ED} \right)/\left( {BW \times AT} \right)$$
(1)

where C is the contaminant concentration (mg/L), IR is ingestion rate (L/day), EF is exposure frequency (days/year), ED is exposure duration (years), BW is body weight (kg) and AT is averaging time (days) (Table 2). Non-carcinogenic risks were determined using Eq. (2)

$${\text{HQ}} = {\text{ADD}}/RfD$$
(2)

with HQ < 1 indicating negligible risk and HQ > 1 suggesting possible adverse health effects. Table 3 presents the reference doses (RfDs) for each contaminant. Cumulative risks from multiple contaminants were captured using the Hazard Index (HI = ΣHQ). Carcinogenic risks were estimated using Eq. (3)

$${\text{CR}} = {\text{ADD}} \times SF$$
(3)

where SF is the slope factor (mg/kg/day)⁻1—Table 4 lists the SFs used for carcinogenic metals. CR values within the range of 1 × 10⁻⁶ to 1 × 10⁻4 were considered acceptable, while values above 1 × 10⁻4 were interpreted as indicative of high concern. This dual framework allows for a comprehensive evaluation of both chronic non-cancer outcomes and long-term cancer risks associated with exposure to contaminated drinking water.

Table 2 Exposure parameters applied in this study (USEPA/WHO defaults)35,36,37.
Table 3 Reference doses (RfDs) for selected metals (USEPA/WHO)35,37.
Table 4 Slope factors (SFs) for carcinogenic metals (USEPA)36,37.

Monte Carlo simulation setup

Monte Carlo simulations were carried out to incorporate uncertainty and variability in both contaminant concentrations and human exposure parameters, providing a probabilistic complement to the deterministic risk assessment38,39,40,48,49. Contaminant concentrations were represented using lognormal distributions, consistent with their typically skewed environmental behaviour, while exposure parameters such as ingestion rate, exposure frequency and body weight were modelled as normal or lognormal distributions depending on their variability reported in literature.

All stochastic simulations were implemented using Python (version 3.10), with numerical computation handled using the NumPy and SciPy libraries, and data processing performed using Pandas. Random sampling from prescribed distributions was conducted using pseudo-random number generators with fixed seeds to ensure reproducibility. Each simulation was iterated 10,000 times to generate stable probability distributions for risk indices. The iteration count (N = 10,000) was selected based on convergence testing, which showed that percentile estimates (P50 and P95) stabilised beyond 8,000 iterations.

The outputs included central tendency measures such as medians, upper-bound estimates such as the 95th percentiles (P95), and exceedance probabilities for key thresholds, specifically HQ > 1 and HI > 1 for non-carcinogenic risk, and CR > 1 × 10⁻4 for carcinogenic risk as recommended by WHO and USEPA guidelines. For each contaminant and population group (adults and children), ADD, HQ, HI, and CR were recalculated within each iteration using randomly sampled exposure parameters and concentration values, producing full probabilistic risk distributions rather than single-point estimates. Simulation outputs were visualised using probability density functions, cumulative distribution functions, and exceedance probability plots, allowing direct interpretation of uncertainty, tail-risk behaviour, and threshold exceedance likelihood.

Results

Water quality

Table 5 presents the WHO guideline values alongside the observed ranges of physicochemical and metal concentrations in the Betwa-Yamuna confluence dataset. Across the 12-month monitoring period, several core parameters remained within acceptable levels, such as pH, which largely respected the WHO recommended range of 6.5–8.5. However, other indicators sometimes exceeded safe limits; EC and TDS, for example, sometimes rose above thresholds during the pre-monsoon months, reflecting the concentration of solutes under conditions of high evaporation and reduced discharge. These exceedances signal that even though the rivers receive substantial dilution during the monsoon, water quality stress returns in drier months, when anthropogenic and geogenic inputs accumulate within the reduced flow volume.

Table 5 WHO guideline values compared with observed concentration ranges for physicochemical parameters and selected metals.

The most concerning findings relate to As, Pb and Cd, given their well-documented toxicity and strict WHO drinking water guidelines (0.01 mg/L for As and Pb, 0.003 mg/L for Cd). Arsenic concentrations in this dataset ranged from 0.0006 to 0.0112 mg/L, with several samples approaching or slightly exceeding the 0.01 mg/L limit. Lead values were mostly below 0.005 mg/L, but occasional peaks reached 0.0108 – 0.0124 mg/L, going over the WHO threshold. In contrast, cadmium remained comparatively low, with concentrations as low as 0.0003 mg/L, consistently below the 0.003 mg/L guideline. Spatially, exceedances of As and Pb were more prevalent at the Yamuna and confluence sites than at the Betwa, suggesting cumulative contamination from upstream activities. These results point to a combination of natural mobilisation from sediments and anthropogenic inputs, including agricultural runoff, untreated effluents and industrial discharges. Importantly, the recurrence of values near or above guideline levels highlights chronic pollution risks that, if untreated, could compromise drinking water safety.

Descriptive statistics for all parameters (Table 6) highlight pronounced variability in the dataset, reflecting both natural hydrological fluctuations and localised pollution events. Mean EC values exceeded 900 µS/cm at the Yamuna and confluence sites, approaching the guideline threshold, highlighting persistent solute enrichment during periods of reduced flow. TDS followed a similar pattern, with high interquartile ranges suggesting variability across months rather than uniform exceedance. Metals such as As and Pb displayed strongly skewed distributions, where most samples clustered near lower values, but occasional sharp peaks indicated episodic contamination. These spikes may result from stormwater flushing of contaminated sediments, untreated sewage releases or seasonal agricultural runoff entering the rivers. The temporal dynamics are further illustrated in Fig. 2, which presents time series trends for selected parameters. EC and TDS increased steadily in the pre-monsoon months, consistent with evaporative concentration and reduced dilution capacity. In contrast, metals such as As, Pb, and Cd peaked intermittently during dry seasons, but dropped significantly with the onset of the monsoon when high rainfall and discharge diluted contaminant loads. These patterns confirm the dual influence of hydroclimatic cycles and anthropogenic pressures on water quality: natural seasonal variability controls baseline concentration shifts, while human inputs drive episodic exceedances. The concurrent rise in EC, TDS, and dissolved metals during the pre-monsoon season indicates strong evaporative concentration effects under high temperature and low-flow conditions, which intensify solute accumulation in the river water. Together, the statistical summaries and time series plots provide a clear picture of fluctuating water quality that intermittently breaches safe thresholds, with implications for drinking water safety.

Table 6 Descriptive statistics for physicochemical and metal parameters across all sites; N: number of observations, SD: standard deviation, IQR: inter-quartile range, Min.: minimum observation, Max.: maximum observation.
Fig. 2
figure 2

Monthly variations in EC, TDS, As, Pb and Cd concentrations at Betwa, Yamuna and confluence sites (June 2023 – May 2024).

WHO standards and exceedance frequencies

Compliance with WHO standards, summarised in Table 7, highlights the persistent and widespread contamination of the Betwa-Yamuna confluence system. Exceedance analysis showed that iron surpassed guideline values in 100% of the samples, arsenic and lead in approximately 6%. These levels of non-compliance are alarming, given the well documented health risks of chronic exposure to these metals13,50, including carcinogenic and neurological impacts5,11. Even parameters such as EC and TDS, which are not toxic but reflect salinity and mineral loading51, exceeded thresholds in a significant fraction of samples, indicating broader water quality stress that can affect domestic and agricultural use. Overall, exceedances were most pronounced during the pre-monsoon season, decreased sharply during the monsoon due to dilution, and re-emerged at moderate levels during the post-monsoon and winter periods.

Table 7 Frequency (%) of samples exceeding WHO guideline values for physicochemical and metal parameters.

The temporal pattern of exceedances shows that risks are not evenly distributed throughout the year but are strongly linked to hydrological cycles. The pre-monsoon and winter months, when flows are reduced and dilution capacity is limited, showed the highest frequency of non-compliance. In these periods, pollutant concentrations became more influential due to both evaporative enrichment and sustained anthropogenic inputs. By contrast, the monsoon months provided partial relief, as high discharge diluted contaminants and temporarily lowered exceedance rates. However, the persistence of high concentrations outside the wet season highlights a chronic exposure risk for downstream populations, emphasising the need for continuous monitoring and interventions targeting both point and diffuse pollution sources.

The patterns confirm that the study area faces chronic water quality challenges, with implications for drinking water safety and ecosystem health. Importantly, the fact that exceedances were recorded for As and Pb aligns with previous findings in north Indian rivers, where untreated effluents, agricultural inputs and geogenic release from sediments contribute to contamination8,21,25,26,52,53. The elevated As concentrations observed during low-flow and pre-monsoon conditions likely reflect enhanced desorption of As from iron oxide phases under increasingly reducing conditions, combined with limited dilution during periods of reduced discharge.

Non-Carcinogenic health risk assessment

Hazard quotients

Hazard quotients for individual metals are summarised in Table 8, with corresponding scatterplots shown in Fig. 3. The analysis indicates that arsenic presents the most consistent risk, with median HQ values of 0.42 for adults and 0.98 for children, and maxima exceeding 1.0 in both groups. Lead produced lower HQ values overall, with means well below unity, though children’s upper range approached 0.24. Cadmium showed very low HQs across all metrics, remaining two orders of magnitude below the threshold of 1.0. Other metals such as Fe and Mn displayed sporadic high values (child maximum HQs of ~ 2.0 for Fe and ~ 0.43 for Mn), but these were occasional rather than persistent exceedances. These findings suggest that non-carcinogenic risks in this system are dominated by arsenic exposure, with secondary but less consistent contributions from Fe and Mn, while Pb and Cd remain below critical thresholds. This pattern is broadly consistent with South Asian River studies that identify arsenic as the most significant chronic waterborne contaminant16,27,41,43,44.

Table 8 Calculated hazard quotients (HQ) for major contaminants compared against the WHO reference doses for adults and children.
Fig. 3
figure 3

Scatterplots of HQ values vs WHO references doses (RfD), showing exceedances for metals in both adults and children.

Children’s HQs were systematically higher than adults’ across nearly all metals, with mean HQs roughly double for As and Fe. For instance, the 95th percentile HQ for As in children reached 2.28 compared to 0.98 in adults, indicating a substantially higher risk burden for younger age groups. These differences reflect children’s higher water intake relative to body mass and their increased physiological sensitivity to toxic exposures. The results emphasise the necessity of age-stratified risk assessments, as children may experience significant health effects even when adult HQs remain below critical thresholds. This disproportionate vulnerability highlights the urgency of interventions targeted at communities with high child exposure, particularly where river water is consumed untreated.

The results collectively demonstrate that ingestion of untreated river water poses non-carcinogenic health risks, most critically from arsenic. While Pb and Cd remain below guideline thresholds in this dataset, arsenic exceeded the HQ = 1.0 benchmark in a significant proportion of child cases, with maximum values approaching 2.5. High HQs for Fe and Mn, though less frequent, point to additional stressors that may compound risk under low-flow or polluted conditions. These findings align with global evidence linking arsenic exposure to skin lesions, cardiovascular problems and increased risk of chronic diseases14,15,17,44,46, while also reinforcing the heightened susceptibility of children to such contaminants. The persistence of arsenic exceedances highlights the chronic nature of exposure, pointing to a combination of natural geogenic sources and anthropogenic inputs in the Betwa-Yamuna system. Without targeted monitoring and treatment, the risks identified here suggest long-term health issues for communities reliant on river water for drinking and domestic use. Seasonally, the highest HQ values occurred during the pre-monsoon low-flow period, while monsoon months consistently showed the lowest risk levels, confirming strong hydrological control on exposure intensity.

Hazard index

Table 9 shows cumulative HI values by site and season, and Fig. 4 summarises their distributions for adults and children. The data show that cumulative, non-carcinogenic risk is site- and season-dependent. For adults, HI medians were well below 1.0 at the Betwa River across all seasons (median 0.19–0.39), but approached or exceeded unity at the Yamuna River and the confluence during pre-monsoon (Yamuna median 1.05; confluence median 1.45). For children, HI medians were consistently > 1.0 at Yamuna (1.28–2.44) and the confluence (1.64–3.38) in every season, while Betwa remained < 1.0 (0.44–0.92). These patterns indicate that cumulative exposures are most severe where upstream inputs converge and during seasons of reduced flow, when concentration effects are strongest.

Table 9 Seasonal hazard index values for adults and children across Betwa, Yamuna and confluence sites.
Fig. 4
figure 4

Distribution of hazard indices for adults vs children across the study period.

The dominance of the Yamuna and confluence sites in driving HI exceedances reflects the additive contributions of multiple metals rather than a single contaminant. Although arsenic is the most consistent driver of high HQs, higher values for Fe and Mn also contribute to seasonal spikes in HI, particularly under pre-monsoon and winter low-flow conditions. Figure 4 shows that children’s HI values are typically about twofold higher than adults’ at the same site/season, e.g. confluence pre-monsoon: adult mean 1.43 vs child mean 3.34, highlighting disproportionate vulnerability. The persistence of HI > 1 for children at the Yamuna and confluence throughout the year suggests a chronic exposure risk that cannot be attributed to episodic pulses alone.

The high and persistent HI values show that people who use untreated river water face serious health risks from multiple metals at the same time. For children, HI values of 2 or more in several seasons mean the risks are even greater because contaminants add up. Unlike single-metal HQs, the HI highlights the full risk of exposure and shows that overall risk is much higher. The cumulative risk expression followed the same seasonal structure, with maximum HI during pre-monsoon, minimum values during monsoon dilution, and partial risk recovery during post-monsoon conditions. These results point to the need for targeted action at the Yamuna and confluence sites, especially in the pre-monsoon and winter months, and for special protection of children through focused monitoring and treatment strategies.

Carcinogenic risk assessment

Carcinogenic risks were estimated for As, Cd and Pb, which are recognised for their carcinogenic potential through oral pathways2,40,54 (Table 10). The results show that As is the dominant contributor, with adult CR values reaching a median of 8.1 × 10⁻5 and peaking at 2.1 × 10⁻4, thereby exceeding the USEPA benchmark of 1 × 10⁻4 in the upper distribution5,35,36,37,55. For children, the median CR was slightly lower at 3.8 × 10⁻5, but the 95th percentile (8.8 × 10⁻5) and maximum (9.6 × 10⁻5) approach the threshold, highlighting their heightened vulnerability. By contrast, Cd and Pb remained well below the threshold, with maximum values on the order of 10⁻⁶, suggesting that these metals are unlikely to pose significant carcinogenic risks in this system. The systematic decline in metal concentrations and associated HQ, HI, and CR values during the monsoon reflects dilution by high rainfall and increased river discharge, which temporarily reduces contaminant residence time and exposure potential. Arsenic-driven carcinogenic risk peaked during the pre-monsoon and early winter months, whereas monsoon conditions consistently suppressed CR values through hydrological dilution. However, the metal concentration presence signals the need for continued vigilance, especially during low flow months when pollutant concentrations are high. These findings indicate that long-term arsenic exposure presents a chronic cancer risk in the Betwa-Yamuna system, consistent with widespread concerns across South Asia, where as contamination has been linked to increased cancer concerns.

Table 10 Carcinogenic risk (CR) values for As, Cd and Pb for adults and children, compared against USEPA/WHO safety thresholds.

Monte Carlo simulation

Probability distributions

Monte Carlo simulations (10,000 runs) were performed to quantify uncertainty in HQ, HI and CR using empirical (bootstrap) sampling of measured concentrations and variability in exposure factors. Probability density functions for HI are shown in Fig. 5, and percentile summaries for HQ (per metal), HI and CR (As) are reported in Table 11. The children’s HI distribution is right-skewed with a longer tail than adults, confirming greater upper-end risk even when central tendencies are comparable. Consistent with deterministic findings, arsenic dominates carcinogenic risk: the simulated P95 for CR(As) exceeds 1 × 10⁻4 in adults and approaches the benchmark in children, indicating that high exposure scenarios are possible and policy relevant.

Fig. 5
figure 5

Probability density functions for hazard index based on 10 000 Monte Carlo Simulations for adults and children; dashed line marks HI = 1.

Table 11 Monte Carlo percentiles (P50, P75, P95) for HQ (metals), cumulative HI and CR (As) for adults and children.

Cumulative exceedance probability

Cumulative exceedance curves (Fig. 6) and exceedance summaries (Table 12) provide threshold-focused insights. Simulations indicate HI > 1 in roughly 62–69% of children’s runs and ~ 20–23% of adults’ runs, highlighting both higher central risk and more frequent threshold exceedance in children. For carcinogenic risk, CR(As) shows the highest exceedance probabilities, with children again more vulnerable. Probabilistic outcomes preserved the same seasonal hierarchy observed in deterministic results, with pre-monsoon showing the highest likelihood of exceedance, monsoon the lowest, and post-monsoon transitional behaviour.

Fig. 6
figure 6

Complementary CDFs (exceedance probability) for HI (left) and CR (As) (right) from 10 000 Monte Carlo simulations; dashed line mark HI = 1 and CR = 1 × 10–4.

Table 12 Exceedance probabilities (%) for HI and CR (As) relative to WHO/USEPA benchmarks (adults vs children).

Comparative risk between adults and children

Comparative bar plots (Fig. 7) highlight clear and systematic differences between adults and children in both non-carcinogenic and carcinogenic risks. For non-carcinogenic exposure, children consistently exhibit higher hazard quotients than adults, with median HQ values for As and Pb typically 1.5–2 times higher in children. This difference propagates into the cumulative Hazard Index (HI), where children’s median HI values exceed adult values by a similar factor across seasons. This amplified risk is primarily driven by children’s higher intake-to-body-weight ratio and shorter non-carcinogenic averaging time (AT = ED × 365), which increases effective dose relative to body mass. These results quantitatively confirm that children are the most vulnerable group for cumulative, multi-metal exposure in the Betwa–Yamuna system.

Fig. 7
figure 7

Grouped bars of median HQ (As, Pb, Cd), HI and CR (As) for adults and children (Monte Carlo medians; N = 10 000).

In contrast, arsenic-related carcinogenic risk (CR) is systematically higher in adults than in children, with adult median and upper-percentile CR values exceeding those of children by approximately 2–3 times under the adopted exposure assumptions. This occurs because the cancer averaging time is fixed at 70 years for both groups, while adults experience longer exposure duration and higher ingestion rates, which together outweigh body-mass differences. This dual pattern indicates that children bear the dominant burden of cumulative non-carcinogenic effects (HI), whereas adults carry the larger lifetime cancer risk from arsenic (CR). From a management perspective, child-focused protection should prioritise reducing multi-metal exposure through household and community-scale treatment, while adult risk mitigation should focus specifically on arsenic source control, adsorption-based removal, and routine seasonal monitoring, particularly during low-flow periods.

Discussion

Trace element contamination in the Betwa–Yamuna river system reflects the combined influence of geogenic weathering processes and diverse anthropogenic inputs such as agriculture, thermal power generation, urban sewage, and industrial effluents56,57,58,59. The frequent exceedance of WHO standards for As at the confluence and downstream Yamuna sites highlights cumulative upstream loading and enhanced mobilisation under mixing conditions. The amplified contaminant levels and health risks at the confluence are consistent with cumulative upstream loading and hydraulic mixing of two chemically contrasting river systems, which enhances both metal mobilization and exposure potential. Arsenic concentrations repeatedly exceeded the 0.01 mg/L guideline, consistent with documented arsenic-related cancer and cardiovascular risks across South Asia16,41,42. Lead contamination, previously reported upstream near Delhi60, likely originates from industrial discharges, vehicular emissions, and agricultural inputs. Although Cd and Pb exceedances were less frequent, even low-level exposure is concerning due to their cumulative nephrotoxic and skeletal impacts3,4,12. Elevated metal levels during pre-monsoon and winter further indicate the concentrating effect of low-flow conditions.

Non-carcinogenic risk metrics (HQ and HI) demonstrate a clear age-dependent vulnerability pattern. Children consistently exhibited HI values exceeding 1.0 across all seasons, while adult risks approached or exceeded unity mainly during low-flow periods. These findings align with observations from contaminated river basins in Egypt and China, where higher intake-to-body-weight ratios drive elevated child risk1,39,61. HI values above 2.0 in several months confirm the importance of multi-metal cumulative exposure assessment rather than reliance on single-contaminant indicators. Carcinogenic risk analysis further identifies As as the dominant driver, with more than 40% of samples exceeding the USEPA benchmark of 1 × 10⁻435,37. Monte Carlo simulations strengthen these findings by revealing a markedly higher probability of unsafe HI exceedance for children (~ 67%) compared to adults (~ 23%), while As also shows the highest likelihood of carcinogenic exceedance. Together, these results support the urgent need for seasonal monitoring, stricter regulation of point and non-point pollution sources, and deployment of community-level treatment technologies to protect public health and advance Sustainable Development Goal 6 (Clean Water and Sanitation)62,63.

Conclusion and recommendations

This study demonstrates that the Betwa–Yamuna confluence is affected by significant water quality degradation, with As persistently exceeding WHO drinking-water guidelines and driving both non-carcinogenic and carcinogenic health risks. Seasonal controls exerted a strong influence on contaminant behaviour, with the highest metal concentrations occurring during pre-monsoon and winter low-flow periods, when dilution capacity is reduced. The HI consistently exceeded the safety threshold (HI > 1) for children in all seasons, while adults showed frequent exceedances during low-flow months, confirming the presence of chronic multi-metal exposure risks. Carcinogenic risk assessment further identified As as the dominant contributor, with a substantial proportion of exposures exceeding the USEPA acceptable cancer risk level. Monte Carlo simulations strengthened these findings by demonstrating wide uncertainty ranges and long-tailed probability distributions, particularly for children, confirming their heightened vulnerability under realistic exposure scenarios. These results collectively show that untreated river water at this confluence presents a persistent and seasonally amplified public-health concern.

Despite the robustness of the dataset and analytical framework, this study has certain limitations. The assessment was based on surface-water concentrations only, and thus does not account for groundwater–surface water interactions, which may influence exposure in adjacent communities. In addition, site-specific exposure parameters were not available, requiring reliance on WHO/USEPA default values, which may introduce uncertainty in absolute risk magnitudes. Furthermore, metals were evaluated as total concentrations, without accounting for speciation or bioavailability, which can influence toxicity. Recent studies increasingly emphasize the importance of coupling surface water, groundwater, and biomonitoring data and integrating real-time sensors and isotope-based tracers to resolve contaminant transport pathways more accurately. Future investigations should therefore include multi-pathway exposure routes, site-specific demographic data, and longer monitoring periods to better quantify chronic exposure dynamics.

From a management perspective, the findings clearly indicate the need for priority control of As, Pb, and Cd at the Betwa–Yamuna confluence. Seasonally resolved monitoring programs should be institutionalized to capture low-flow risk amplification. Community-scale water treatment systems, e.g. adsorption-based filtration, coagulation–flocculation, membrane processes, and decentralized treatment units, are urgently required in settlements dependent on untreated water. Pollution source control, targeting industrial effluents, municipal wastewater discharges, and agricultural runoff, must be enforced simultaneously. Public awareness campaigns and child-focused protection strategies are essential, given the consistently elevated health risks observed in children. These recommendations align directly with Sustainable Development Goal 6 (Clean Water and Sanitation) and support recent international frameworks advocating risk-based water governance rather than concentration-only compliance monitoring. This study therefore provides a critical health-risk baseline for northern Indian river confluences and offers a scientifically rigorous platform for evidence-driven water safety policy and long-term mitigation planning.