Introduction

The Indus River was central1,2 to the development of the ancient Indus Valley Civilization (IVC), providing a stable water source for agriculture1,3,4,5,6, trade, and communication3,7. The civilization flourished around the Indus River and its tributaries around ~5000 years before present (BP; relative to 1950 CE) and evolved over time8. During the Mature Harappan stage (4500-3900 years BP), the IVC featured well-planned cities, advanced water management systems, and a sophisticated writing system1,8. After ~3900 years BP, however, the Harappan civilization began to decline and eventually collapsed3,8,9,10,11. The causes of this decline remain debated3,12. Proposed factors include climate change, seawater retreat, droughts, floods, and shifting river dynamics, interacting with social and political changes3,6,10,13,14,15,16,17,18. Understanding ancient hydroclimatic events and their societal impacts provides critical insight into the vulnerability of complex societies to environmental stress10,19,20,21.

Hydroclimatic variability across the Harappan domains was influenced by both the Indian summer monsoon (ISM) and Indian winter monsoon (IWM). These systems are thought to have been coupled during the early-to-mid Holocene, but diverged during the mid-to-late Holocene, likely due to low-latitude insolation changes22. Instrumental and paleoclimate evidence demonstrates that diverse regional forcings shaped hydroclimatic variability during different periods12,20,23,24. Whereas instrumental records help to investigate the hydroclimatic variations of the last 100–150 years, reliable, long-term records are rare beyond 185023,25. Accordingly, paleoclimate proxies may be used to investigate past perturbations to the hydrological cycle.

Notable efforts have been made to capture the hydroclimatic variability and past drought events associated with the ISM using lacustrine20,26,27,28,29,30, speleothem9,31,32,33, paleo-botanical4,34, marine11,35, and other geological archives21,36,37. For instance, Dutt et al.3,38 used 4500-year-old lacustrine sediments from Tso Moriri Lake, northwest Himalaya, and found regional variations in the ISM during the initiation of the Meghalayan period9,14,31. They3 also suggested a prolonged dry phase between 4350- and 3450-years BP, which likely contributed to the water scarcity in the IVC region. Kathayat et al.31,39 used high-resolution speleothem proxies from the Sahiya and Mawmluh cave to assess the putative 4200-year global aridification event, which bounds the prolonged drought over the IVC region between 4200- to 3900-years BP. Giosan et al.11 used fluvial evidence to determine past environmental conditions and their impacts on habitats in the Indo-Gangetic Plain, identifying centuries-long weak monsoon events during the Late Harappan period. Similarly, Staubwasser et al.13 used oxygen isotope data from the Indus Delta to invoke drought initiation, which bounds the putative aridification event within 4400- and 3400-years BP. Dixit20 attempted to reconstruct the variability of the ISM using lacustrine records of Rajasthan. Lakes on the margin of the Thar Desert and the eastern region of Rajasthan showed distinct patterns of evolution over time, highlighting the influence of regional climate trends20,26,29,40,41,42.

Paleoclimate proxies provide indirect and geographically restricted estimates of past climate change and variability. Accordingly, output from climate model simulations in tandem with proxies can help contextualize spatial variability and help explain mechanisms of hydrological shifts25,43,44. Recent paleoclimate modeling suggests substantial spatiotemporal variability of ISM intensity from the mid-Holocene to the present25,45,46,47. Hydroclimatic changes during the mid–late Holocene, particularly the abrupt ~4200 BP event, have drawn extensive attention and debate3,9,13,31,37,40,48,49,50,51. Yet, reconstructions remain hampered by limitations, including site-specific biases, low resolution, chronological uncertainties, and non-unique interpretations23,52,53. Catchment-scale processes further complicate relationships between proxy signals and hydrological variables such as precipitation and discharge54, making quantitative assessments of climate–society linkages difficult.

To address these challenges, we combine climate model simulations with paleoclimate proxy records to evaluate the paleohydrological variability of the IVC region during the late Holocene (5000–3000 years BP). We use meteorological forcings from three transient climate simulations (TraCE-21ka; 22000 years BP to present, MPI; 7950 years BP to 100 years BP, and TR6AV; 6000 years BP to present; See Table S1) as input parameters to the Variable Infiltration Capacity (VIC) hydrological model. This approach enables simulations of river discharge variability near major Harappan settlements. By integrating past climate simulations with hydrological modeling, we aim to contextualize existing proxy reconstructions and assess how hydroclimatic trends helped shape historical trajectories of human settlements in the IVC55.

Results

Identification of past drought events

We focused on a designated IVC region (Fig. 1) to examine past hydroclimatic variations using transient climate simulations (Table S1) and paleoclimate records (Table S2). The Harappan period (5000–3000 years BP) was divided into three phases10,31: (1) Pre-Harappan (5000–4500 years BP; PreH), (2) Mature Harappan (4500–3900 years BP; MatH), and (3) Late Harappan (3900–3000 years BP; LatH) based on archaeological evidence of settlement evolution and socio-environmental changes2,11,39,56. These divisions enable evaluation of hydroclimatic trends alongside cultural transitions, with particular focus on settlement patterns and potential climate-driven changes at the close of each phase.

Fig. 1: Rainfall variability over Indus Valley Civilization area from 6000 years BP to present (1850 CE).
figure 1

a Inset tile showing location of the study area (shaded in blue) with the major river basins of Indian subcontinent. b Study Area indicating discharge stations considered in this study near Major Harappan sites (Yellow circles) with major river basins of Indian subcontinent. Red triangles indicate location of lake proxies, while yellow start denotes the location of Sahiya Cave. Shaded background shows elevation data (in m) from USGS GTOPO30. c Simulated ensemble rainfall with associated uncertainty, calculated as ± 1 standard deviation derived from three transient climate simulations (TraCE-21ka, MPI, and TR6AV) spanning from 6000 years BP to present (1850 CE). d Similar to (c) but for Standardized precipitation Index (SPI) computed with respect to the Pre Harappan period. e Simulated ensemble rainfall over the grid encompassing Sahiya cave, alongside its validation (R2 = 0.46) with speleothem data. Three bands from 5000 to 3000 years BP show the distinct periods of the Indus Valley Civilization: Pre-Harappan (PreH), Mature Harappan (MatH), and Late Harappan (LatH).

The ensemble mean of all three transient climate simulations (TraCE-21ka; hereafter TraCE, MPI, and TR6AV) indicates a persistent drying trend in the 100-year moving rainfall averages over the past 6000 years (Fig. 1c), with a reduction of ~120 mm and an inter-model (1σ) uncertainty of 20 to 40 mm. All three models show this drying trend (Fig. S1). TraCE and TR6AV simulate gradual declines in rainfall, with TraCE producing consistently higher rainfall than TR6AV. Conversely, MPI simulates a sharper decline and greater variability across the three models (Fig. S1). The abrupt reduction in MPI is linked to anomalous warming in the West African and Indian monsoon zones57, which triggered a positive feedback reducing rainfall through diminished evaporative cooling and cloud cover57. The 12-month Standardized Precipitation Index (SPI) also shows a long-term decline, with a marked reduction after the PreH interval (Fig. 1d).

To test model fidelity, we compared ensemble-mean rainfall with speleothem δ¹⁸O data from Sahiya cave, where more negative values are interpreted as higher rainfall or more intense convection39. Ensemble-mean rainfall anomalies from 4500 to 2500 BP indicate a moderate correlation with the δ¹⁸O record (r = 0.46; Fig. 1e), supporting the modeled drying trend. Comparisons with other cave archives (e.g., Bitto, r = 0.50; Wahshikhar, r = 0.39) also show broad consistency (Fig. S2). While discrepancies exist across sites, these results suggest the transient simulations capture key large-scale processes. We also note the existence of discrepancies between simulations and existing records as well (Fig. S2), and stress that our intent to utilize transient simulations is to explore plausible large-scale processes and feedbacks44,57,58. Grid-to-grid comparisons with proxy records may not be appropriate due to the limited resolution of simulations and inadequate representation of local topography, land-atmosphere interactions, and isotope meteorology59. Nevertheless, broad agreement with speleothem evidence (Figs. 1e and S2) provides confidence in using the simulations to explore drought extent and severity.

The transient climate simulations also reveal a ~0.5 °C rise in IVC’s annual mean temperature from the PreH to the LatH, with a significant warming trend of 0.024 °C per century (p < 0.01; Fig. S3a). Furthermore, we found warming trends with more than 95% confidence level within both latter periods as well—with trends of 0.02 °C and 0.016 °C per century for the MatH and LatH periods, respectively. Spatially, significant (99% confidence level) warming over the central IVC region and particularly over the foothill of the Himalayas was observed during the transition from the PreH to LatH period (Figs. S3b and S3c). In addition, century-scale rainfall averages (a 100-year moving average of rainfall) decreased significantly (~51 mm reduction in this metric from the PreH to LatH; Fig. 2b), which is notable considering the long period (100 years) for which the average rainfall was estimated. Furthermore, the simulations indicate declines of ~13–15% in annual, summer, and winter rainfall with a significant decreasing trend of ~0.7 % per century at 99% confidence level (Fig. S3d, S3g, and S3j, respectively). Annual rainfall showed a noteworthy reduction of over 20% in the central region of the IVC and a decline ranging from 15 to 20% in peripheral areas during the LatH period (Fig. S3f). In contrast, a more modest reduction of approximately 10% was observed across the entire IVC during the MatH period (Fig. S3e). All transient simulations indicated moderate annual rainfall amounts during the PreH compared to deteriorating conditions across the PreH to LatH transition (Fig. S1 and S4). We further note that the TraCE simulation showed a major reduction in annual rainfall over western IVC areas compared to the central region, whereas MPI showed a major reduction over the central IVC area in contrast (Fig. S4). Spatial patterns similar to those observed in annual rainfall changes were also evident in summer rainfall anomalies, with a strong reduction of over 20% during the LatH (Fig. S3i). We found minimal changes in winter rainfall across the IVC during the MatH period (Fig. S3k), which agrees with paleoclimate records showing steady winter rainfall during the MatH period56. Conversely, a reduction of 10–15% in winter rainfall was found during the Late Holocene (coinciding with the LatH period) (Fig. S3l). The warming trend propagated from east to west, whereas the rainfall deficit propagated from west to east from the PreH to the LatH period, in agreement with earlier hypotheses20.

Fig. 2: Identification of major drought events during Harappan timeframe and corresponding changes in paleoclimate variables.
figure 2

a Annual air temperature over IVC derived from the TraCE-21ka simulations; b Simulated ensemble rainfall from three transient climate simulations (TraCE-21ka, MPI, and TR6AV) with associated uncertainty, calculated as ± 1 standard deviation; c Similar to b but for Standardized precipitation Index (SPI) computed with respect to the Pre Harappan period; d Detrended SPI and identification of major flood and drought events using the threshold of 0.5 and −0.5, respectively; e Variation in detrended DJF mean SST anomalies over the Niño 3.4 and AMO region. All the variables in (a–e) are 100 year moving mean estimates. Light orange stripes show major drought events. f Spatial variability in SPI derived from the precipitation datasets of all transient climate simulations and presented for the identified major drought events. Black dots indicate Harappan sites during the period of drought, and blue polygon shows extent of Harappan Civilization. g Event-wise spatial variability in DJF mean SST anomalies with highlighted Niño 3.4 and AMO regions.

We identify four severe droughts (>85 years each) between MatH and LatH, based on the detrended 100-year moving mean SPI (Fig. 2d and Table S3). These major droughts occurred at 4445–4358 (D1), 4122–4021 (D2), 3826–3663 (D3), and 3531–3418 (D4) years BP (Fig. 2d). Three of these droughts affected more than 85% of the IVC region (range 65–91%). Peak intensities reached SPI values of –1.5, –2.1, –1.65, and –1.72, respectively. Using a drought severity score60, which incorporates the intensity, duration, and areal extent, D3, peaking at 3757 years BP, ranks as the most severe— lasting ~164 years, reducing annual rainfall by ~13%, and affecting more than 91% of the region (Figs. 2f and 3g). Unlike the other events, D3 featured consistent declines in summer and winter rainfall, disproportionately impacting the central IVC region, whereas Saurashtra remained relatively unaffected.

The first drought event, D1 (Fig. 2f), occurred at the beginning of the MatH period (~4.4–4.3 ka), with 65% of the IVC region under drought with ~5% reduction in annual rainfall compared to PreH (Fig. 3a). This event lasted about 88 years and was two to three times less severe compared to the later droughts. D2 coincided with the global 4200-year event, with substantially lower severity than D3. The central Indus Basin experienced more substantial impacts than the Lower and Upper Indus Basins during D2, whereas the Ganweriwala and Kalibangan regions were less severely affected (Figs. 2f and 3f). The D4 event (Fig. 2f) began ~3531 years BP, lasting ~114 years with moderate intensity and ~13% reduction in annual rainfall. MPI and TraCE captured all the major drought events, whereas TR6AV showed a lagged response (Fig. S1), highlighting the role of internal variability47. MPI consistently simulated the most severe droughts, while the models differ in regional rainfall declines (Fig. S5). All models showed drier conditions during the D3 and D4 events, with model-based differences in spatial patterns. Whereas TraCE indicates more of a decline in the central region during these events, MPI and TR6AV show a stronger decline in the north-east part of the IVC region (Fig. S5).

Fig. 3: Paleoclimate variables across the Indus Valley Civilization Region during major drought events.
figure 3

a–d Annual Temperature anomalies (°C) and ensemble-mean rainfall anomalies of e–h annual rainfall i–l summer rainfall (JJASON) and m–p winter (DJFMAM) rainfall derived from three transient climate simulations (TraCE-21ka, MPI, and TR6AV). Anomalies (%) are relative to the long-term mean of 5000–4500 years BP (PreH). Black dots show the location of Harappan sites during the given period, whereas the red line bounds the IVC region.

We observed a gradual reduction in annual and summer rainfall from event D1 to D4 (Fig. 3), whereas winter rainfall decreases till the D3 event only and then revives during D4 (Fig. 3). Further, we also observed the increase in annual mean temperature from D1 to D4 (Fig. 3a–d), which might have increased the atmospheric water demand and thus evapotranspiration, which makes these droughts more severe60. Similarly identified droughts are also evident in speleothem records from Sahiya cave, whereas they are either absent or show a large, lagged response in the Mawmluh cave record. This may result from the hydroclimate signal at Mawmluh Cave having lowered sensitivity to rainfall in the IVC region24,52. The drying over the IVC area from 4000 to 3000 years BP was more dominant61, and mainly expressed in the central IVC region for the last three droughts (Fig. S5), which may additionally explain the shift in settlements toward the Ganga plains and Saurashtra region during this period12,39,56.

Finally, we examined simulated variability in sea surface temperatures (SST) in the Pacific and North Atlantic—regions with well-established teleconnections to ISM variability62,63. Typically, El Niño-like conditions in the tropical Pacific, with a warmer central and eastern equatorial Pacific, and relatively cooler SSTs in the northern Atlantic Ocean (Fig. S6), have been shown to induce drought conditions during the ISM season in observations62 and last millennium simulations64. Detrended SST anomalies (Fig. 2) in the Niño3.4 box—a region used to monitor ENSO activity—were higher during D1 than the previous ~400 years, signifying strong El Niño conditions across this drought. The drought at 4.1–4.0 ka BP (D2) occurred under high SSTs in the tropical Pacific and cooler SSTs in the northern Atlantic Ocean (Fig. 2). Accordingly, we suggest that D1 and D2 were associated with warmer central and eastern equatorial Pacific SSTs and cooler North Atlantic SSTs—an inference consistent with reconstructions from reduced space and Bayesian methods65,66. In contrast, drought events D3 and D4 occurred with the warming SST in the eastern Pacific relative to background conditions and neutral SST anomalies in the Northern Atlantic.

Paleohydrological reconstruction

We found that the higher settlement density in the western region of the IVC and upper Ganga plains during the PreH period (Fig. S4) coincided with enhanced water availability indicated by the simulated riverflow (Fig. S7). Most sites were situated away from the river complex during the PreH period, suggesting higher rainfall conditions and access to freshwater across the IVC region10,37, with reduced dependence on river resources (Fig. S4). Wetter conditions over northwest India during PreH and subsequent drying during LatH were also observed in the reconstructions of Gill et al.67. In this work, they67 reconstructed changes in rainfall over India with respect to the present day and found a 40–60% increase in rainfall relative to the present, which sustained lakes, including Lake Sambhar68. Subsequently, we simulated river discharge using the VIC hydrological model at sites proximal to major Harappan cities (referred to as stations hereafter), with nine stations situated within the Indus basin, three stations within the Ganga basin, and five stations within the Sabarmati basin (Fig. 1). We estimated the discharge anomalies with respect to the PreH period (Fig. S7). During the MatH period, compared to the PreH period, urban settlements shifted towards the Indus River from various parts of the IVC region. Relatively low discharge anomalies were found in stations STI01, STI03, STI05, and STI09 compared to the other stations in the upper Indus, Ganga basin, and Saurashtra region during D1 (Fig. 4a1, 4a2). Moreover, towards the end of MatH period, the central IVC region was affected by the drought event D2 (Fig. 2f), exacerbating fluvial drying at STI01, STI03, STI05, and STI09. However, stations in Saurashtra (STS01-STS05) indicated strengthening riverflow conditions, whereas those in the upper Indus (STI06-STI08) and Ganga basins (STG01-STG03) remained relatively stable, thus attracting urban settlements at the end of the MatH period (Fig. 4b1, 4b2). Subsequently, riverflow deteriorated during D3 and D4, with some of the sites experiencing discharge anomalies of more than 12%. According to our hydrological simulations, lower and upper Indus regions showed a maximum reduction (more than 9%) in riverflow during the D4 (Fig. 4d1). Sites such as Kot Diji (STI02) and Ganweriwala (STI04) experienced a ~12% reduction in riverflow, suggesting water scarcity in the Indus River during the LatH period. Additionally, we observed a downstream decrease in discharge anomalies, signifying the dominance of westerly-driven winter rainfall and snowmelt in the upper Indus as potential drivers. Conversely, sites in Saurashtra exhibited a smaller reduction in discharge compared to other stations, indicating favorable conditions for settlements in that region during MatH to LatH transition (Fig. S7).

Fig. 4: Paleodischarge anomalies derived from VIC model during major drought events.
figure 4

Spatial variation in discharge anomalies (with respect to the long-term mean of the Pre-Harappan period) across the IVC region for (a1) D1, (b1) D2, (c1) D3, and (d1) D4 drought events. The panel on the right (a2, b2, c2, and d2) displays station-wise discharge anomalies with associated uncertainty during the respective drought event. Red, orange, and yellow bars show stations in Indus, Ganga, and Sabarmati, respectively.

We compared our analysis with reconstructed lake levels from IVC-proximal lacustrine records. Reconstructed lake levels of Nal Sarovar Lake show declines from the PreH to MatH, specifically during D1 (Fig. S8a). In contrast, other lakes exhibit increasing or minor fluctuations in water levels during D1 (Fig. S8), which may be explained by regional differences in precipitation patterns20, evaporation/precipitation (E/P) ratios, and variable runoff resulting from a smaller catchment area30,69. Nevertheless, we find a notable disparity between paleolake levels and drought events during the MatH, specifically during the 4200-year BP D2 event. All other lake records, except Pariyaj (Fig. S8), experienced an initial drying phase corresponding to D3 and D4 events (Fig. S8). The Sahiya cave record showed drying rainfall trends across D1 to D3 (Fig. S8), but diverges from the simulations during D4 (Fig. S8).

We identified four distinct spatial clusters based on the simulated riverflow patterns during major drought events (Figs. 4 and S7). These include the Upper Indus (Harappa-STI05), Middle Indus (Kot Diji-STI02, Ganweriwala-STI04, Kalibangan-STI06 and Banawali-STI07), Lower Indus (Mohenjo-Daro-STI01 and Chanhudaro-STI03), and Saurashtra (Dholavira-STS01, Surkotada-STS02 and Lothal-STS03) regions. Clustering based on discharge anomalies coincides with the region of homogeneous agricultural patterns identified by Petrie and Bates4, demonstrating the potential of hydrological simulations to provide perspectives on early farming practices using simulated paleodischarge data. Petrie and Bates4 showed the prevalence of multi-cropping for summer crops and mono-cropping for winter cereals over the upper Indus region, which may be linked to the large and notable shifting phase of discharge in this region during the PreH. Archaeobotanical records from the foothill of the Himalayas showed positive rainfall anomalies during both summer and winter, allowing for the continuous availability of soil moisture in this region during the mature phase70. Such precipitation anomalies may have helped spur innovation in crop strategies and selections. For example, Rabi and Kharif crops were grown, and wheat and barley were shifted to millet during the MatH4,5,34.

Discussion

The onset, evolution, and decline of the Harappan civilization, which developed along alluvial plains supported by natural drainage and floodwater irrigation, has been a major focus of research24. Based on our climate simulations and paleoclimate evidence, we suggest that drought initiation in the IVC region began around 4440 years BP, coinciding with shifts in settlement patterns and cultural reorganization. Transient climate simulations and hydrological modeling reveal a sequence of severe droughts exceeding 85 years in duration during this interval, with progressively intensifying conditions (Fig. 2f). Giesche et al.33 proposed recurring summer and winter droughts between 4200–3970 BP from Dharamjali Cave speleothem records, including multi-decadal events around 4190, 4110, and 4020 BP separated by ~20–30 year recovery phases. These align closely with our second drought event. Similarly, our findings are consistent with the “double drought” hypothesis of Scroxton et al.51, which suggests that aridification began around 4260 years BP and persisted for 300 years. The extended summer monsoon drought that we identified during 3531–3418 BP coincides with widespread deurbanization and cultural abandonment of major Harappan centers12,39,56.

The study of Giosan et al.56 shows coherence between our drought events and their reconstruction of weakened summer monsoon strength and enhanced winter monsoon rainfall from ~4500 to 3000 BP, followed by a winter monsoon decline after 3300 BP. This reconstruction56 of winter monsoon variability, based on marine sediment core and ancient DNA analysis, does not directly resolve individual drought events of the summer monsoon at high temporal resolution. Nevertheless, we find coherence between the simulated drought events and trends uncovered in their work (Fig. S9). In particular, the first two droughts we identify were partially relieved by stronger winter monsoon rainfall, especially across the Ghaggar–Hakra interfluve. However, once winter monsoon precipitation also declined after ~3300 BP, Harappan settlements fragmented into smaller units (Fig. S9).

To investigate drivers of the multidecadal droughts, we analyzed SST variability in all three transient simulations. Prior work has implicated Northern Hemisphere (NH) insolation, North Atlantic SSTs44,56, ENSO variability in the Pacific71, and Indian Ocean conditions56 as potential contributors. The physical mechanisms underlying these teleconnections involve large-scale atmospheric circulation shifts44,56,71. Mechanistically, warm tropical Pacific SSTs during El Niño events enhance upper-tropospheric divergence and shift the ascending branch of the Walker Circulation eastward72,73 (Fig. S6), inducing subsidence over the western Pacific and Indian Ocean. This suppresses convection and rainfall over South Asia by weakening the monsoon Hadley cell72. The warmer SSTs over the tropical Pacific during El Niño events enhance upper-tropospheric divergence and shift the ascending branch of the Walker Circulation eastward72. Simultaneously, El Niño–driven warming of the Indian Ocean74,75 (Fig. 2) reduces the land–sea thermal gradient and weakens the cross-equatorial pressure gradient, limiting moisture transport into the subcontinent76,77,78. The weakened thermal gradient diminishes the low-level wind inflow from the ocean to the continent, reducing the moisture transport and ultimately leading to a decline in monsoon rainfall78. Conversely, cooler North Atlantic SSTs (negative AMO phase; Fig. S6) reduce the inter-hemispheric temperature gradient44, which is associated with weakened subtropical high-pressure systems and results in a southward displacement of the Intertropical Convergence Zone (ITCZ). This further reduces moisture influx over South Asia79. Moreover, negative AMO-like phases may have weakened the Atlantic Meridional Overturning Circulation state, which could have shifted teleconnections and weakened the Indian monsoon80. Additionally, TraCE-derived Sea level pressure and SST patterns suggest positive North Atlantic Oscillation (NAO) like conditions during 4200 years BP, which led to cooling over the NH73. Overall, Synchronous El Niño-driven warming in the Pacific and Indian Ocean, along with the negative AMO-driven cooling in the North Atlantic, weakens the Walker circulation, reduces the land–sea thermal gradient, and displaces the ITCZ southward, respectively, collectively suppressing ISM moisture transport and rainfall over the IVC region.

Single-forcing experiments of TraCE simulations44 performed using meltwater forcing, ice-sheet dynamics, and greenhouse gases neither show the signature of a global 4200 years BP event nor the reduction in rainfall during the LatH, which points towards internal variability or additional forcings as a major contributor to the drying81. Giosan et al.56 reported that changes in tropical albedo and regional dust emissions during aridification events may have caused land-use and land-cover changes, which might have caused a southward shift of the ITCZ. Furthermore, transient climate simulations have their own limitations due to initial conditions, parameterization, forcing assumptions, and local-scale feedback processes58. While they are useful for analyzing broad-scale and long-term trends45,47, their limitations in resolution and forcing representation pose challenges for assessing decadal and sub-centennial variability and warrant extensive validation with geological proxies. Thus, high-resolution paleoclimate proxies are required to further examine the role of regional forcings and internal climate variability on ITCZ shifts in the Indian Ocean, and its subsequent impact on the Indian monsoons. Additionally, increasing model resolution can help to better capture regional processes. For instance, the improved horizontal resolution in TR6AV over TR5AS allows more realistic monsoon penetration along the Himalayan foothills47.

Our results suggest that climate shifts coincided with the eastward and southward migration of Harappan populations into the Ganga Plains and Saurashtra (Fig. 1b). Stronger discharge anomalies near Kot Diji, Ganweriwala, and Dholavira likely played a major role, while declining Indus River discharge may have encouraged relocation. This supports a “push–pull” migration framework as outlined by Giosan et al.56. Enhanced orographic precipitation with dense river networks in the foothills of the Himalayas likely facilitated a more persistent moisture regime throughout the year, providing a more stable agricultural system in the Ganga Plains11. Adaptation strategies, including a transition from wheat and barley to drought-tolerant millets4,5,34, likely sustained agricultural production in Saurashtra. Maritime trade with Mesopotamia82 likely provided another buffer, allowing coastal and Indus societies to supplement local shortfalls through imported staples. The decline of the IVC was therefore not the product of a single abrupt collapse, but rather a protracted transformation shaped by both climatic and non-climatic processes2,5,6,14,16,70,82. Whereas prolonged aridity contributed, additional factors such as the selection of drought-tolerant plants4,5,34, long-distance agricultural trade82, and shifts in regional artifacts2 suggest that the decline was not a sudden collapse but a complex process of fragmentation and cultural transformation, indicating a societal adaptation rather than a complete disappearance83. Discrepancies between model simulations and proxy reconstructions highlight uncertainties in both model–data comparison and transient climate modeling84,85. Nevertheless, our findings of simulated and reconstructed river drought over the IVC reign enhance our understanding of the history of the Harappan civilization and its intricate relationship with climate change.

Methods

The study region encompasses parts of India and Pakistan and is primarily characterized by summer monsoonal rainfall (~400 mm/year), which accounts for 75–80% of the annual rainfall20,36. Importantly, the upper Indus area also receives winter rainfall from western disturbances38. We selected seven terrestrial proxy records covering Harappan period (Table S2) from the surrounding area of IVC to investigate regional climate variability. Amongst these, five are records of lacustrine deposits (Fig. 1b) from Gujarat28,30 and Rajasthan26,29,86, and two records derived from speleothems of Sahiya39 and Mawmluh caves9. The selected lakes are strongly influenced by the ISM; in contrast, the speleothem records offer high-resolution, continuous records of past climate variability, providing valuable insights into regional climate, but are affected by notable rainfall variations outside ISM variability. Rainfall over Sahiya cave39, which is in Uttarakhand, is influenced by the southwestern summer monsoon as well as the winter monsoon, driven by westerlies. Thus, we hypothesize that Sahiya Cave likely provides insights into winter precipitation over the region. Conversely, Mawmluh cave9 in Meghalaya experiences rainfall variability associated with the northeast monsoon winds (during October-December) as well as the ISM. Further, we have also used other three speleothem data from Bitto cave87, Wah Shikar cave63, and Jhumar-Dandak cave63 to validate rainfall trend obtained from the transient climate simulations.

We considered three transient climate simulations (Table S1) obtained using General Circulation Models (GCMs). Two simulations (MPI and TR6AV) were conducted using mid-Holocene boundary conditions as the initial conditions for the transient experiment45,57,58, while the third one (TraCE) was conducted using the Last Glacial Maximum (LGM) boundary conditions81. We selected TraCE, MPI, and TR6AV over other transient simulations such as EC-Earth388,89 and iTraCE90, as they offer continuous transient output of precipitation and sea surface temperatures for the entire Harappan period (5000-3000 years BP), and their data are publicly available. Additionally, we obtained the instrumental long-term (1813–2006) monthly rainfall datasets91 from the Indian Institute of Tropical Meteorology (IITM) (https://www.tropmet.res.in/, Accessed on January 2022). We used these datasets to evaluate the performance of the three transient climate simulations over different seasons (Fig. S10a) and annual rainfall time series (Fig. S10b). TraCE transient climate simulations performed relatively well by showing less percentage bias (~−19%) for annual average rainfall compared to other simulations. Moreover, all the models underestimated annual average rainfall, with underestimation of rainfall during March-May (MAM) and June-September (JJAS), and overestimation in January-February and October-December (OND). We used rainfall from all the transient simulations and then validated the ensemble rainfall with speleothem data of five different caves (Table S2). Geological validation shows ability of transient climate simulations in capturing rainfall trend and major rainfall patterns observed in the past 5000 years (Fig. S2). While, minimum and maximum temperature were taken from the TraCE-21ka as these variables were not available publicly in other climate simulation. However, we noted that the major droughts in the monsoonal climate are largely driven by rainfall variability instead of temperature92. He81 generated the TraCE-21ka simulation using the CCSM3 climate model, which is available from 22000 years BP to present. We obtained TraCE-21ka data from the National Center for Atmospheric Research (NCAR) data portal (https://ncar.ucar.edu/, Accessed on January 2022). CCSM3 is a coupled model of the land, atmosphere, ocean, and sea-ice components without flux adjustment81. The land module of CCSM3 contains a dynamic vegetation layer that represents mixed land units for each grid cell81. Additionally, the CCSM3 model includes a layer to create dynamic sea-ice topography and integrates the extent and thickness of ice sheets every 500 years using paleoclimate data to constrain the model evolution81. MPI simulations were generated using MPI-ESM1.2 with dynamic vegetation component and no variation in ice sheets topography57. Similarly, TR6AV simulations were generated using IPSLCM5A-MR with key attributes like predictive snow cover, soil hydrology with 11 layers, and dynamic vegetation component in its land surface component45. Furthermore, we used these transient climate simulations as the input forcing for running Variable Infiltration Capacity (VIC) model.

The Variable Infiltration Capacity (VIC) model

The VIC model is a semi-distributed model that solves water and energy balance at each grid93. We used 0.25° grid resolution with a daily temporal resolution to conduct hydrological simulations using the VIC model. As the observed discharge data is unavailable for the paleoclimate period for the model calibration, we used soil, vegetation, and other calibrated model parameters from Shah to Mishra94 and Kushwaha et al.95. They94,95 calibrated soil parameters (depths of second and third soil layers, binf, Ws, Ds, and Dsmax) of the VIC model using observed streamflow data from 18 stations in major river basins of India for the current climate. We specifically focused on the Indus, Ganga, Mahi, and Narmada River basins, as they fall within the IVC region and have good availability of observed streamflow data. The VIC model was calibrated against the observed monthly streamflow at Baramula station for Indus, Farakka station for Ganga, Chakaliya station for Mahi, and Sandia station for Narmada. The VIC model was calibrated and validated at a monthly scale in each basin with different time periods due to the limited availability of the discharge data. Moreover, details about the station location, calibration, and evaluation period can be found in Supplementary Table S4. All basins showed a monthly NSE of more than 0.6 during the calibration and more than 0.7 during the evaluation period (Fig. S11). Subsequently, we provided daily 0.25° gridded rainfall with maximum and minimum temperatures bootstrapped from monthly transient climate simulations as input to the VIC model to simulate paleohydrology. While applying modern calibrated parameters to simulate past hydrological conditions, we considered static vegetation cover, soil properties, and land use with no change in the basin morphology, which might introduce uncertainties in the analysis, as paleoclimate conditions likely differed from the present. Moreover, we considered the first 200 years for the model spin-up to reduce the influence of the model’s initial conditions and to stabilize state variables. We then used the VIC model-generated runoff to reconstruct past hydrology. We employed a standalone routing model developed by Lohmann et al.96 for routing runoff from the VIC model by assuming the horizontal routing processes within the rivers as linear and time invariant.

Experimental design

We obtained monthly rainfall datasets of the last 6000 years from the all-transient climate simulations, while temperature dataset from TraCE-21 ka simulation (Table S1). Next, we used bilinear interpolation to convert the spatial resolution of different transient climate simulations to 0.25° spatial resolution. Subsequently, we generated annual average rainfall time series for India, focusing on the period that coincided with the IITM observed rainfall datasets91. We examined the spatial variability of rainfall (%) and temperature (°C) anomalies with respect to the Pre-Harappan period over three time spans (Pre-Harappan-PreH, Mature Harappan-MatH, and Late Harappan-LatH). We estimated percentage anomalies for annual and seasonal rainfall over the IVC area. Furthermore, we converted monthly climatological data of 0.25° grid resolution into daily time steps using bootstrapping algorithm97. Initially, we collected daily climate data from the MPI-ESM and subsequently resampled it for each month. To ensure the fidelity of the generated data, we performed random sampling from MPI-ESM data based on the number of days in each month. We then applied a bias correction method to maintain the monthly statistics as per the transient climate simulation. Specifically, we adjusted the daily rainfall data to preserve the sum and the daily temperature data to preserve the mean, matching the respective monthly values for rainfall and temperature. By employing bootstrapping algorithm, we successfully generated synthetic daily data that closely resemble the patterns and variability observed in the monthly data of climate simulations.

The 4200-year event is believed to have marked the onset of the collapse of the IVC40. To substantiate this claim, we identified major drought events using a standardized precipitation index (SPI)98. We selected SPI over other drought indices, such as the Standardized Precipitation Evapotranspiration Index (SPEI), due to the constrained availability of climate data. We generated the 12-month detrended SPI time series from the 100-year moving rainfall data to avoid interdecadal variability, a continuously drying trend, and other high-frequency variations after following44. McKee et al.98 used an SPI threshold of −1 to identify drought events. In contrast, we used a threshold of −0.5 SPI to identify drought periods. The reduction in drought threshold accounts for the reduced overall SPI variabilities and intensities resulting from the application of a 100-year moving mean filter to the rainfall datasets (Fig. 2d). Furthermore, recent studies99,100,101 over the ISM region also used a threshold of −0.5 for the creation of a drought atlas and operational drought monitoring. We estimated drought characteristics (Table S3) such as duration (the period during which the SPI value remained below -0.5), mean intensity (the mean SPI value within the drought period), percentage area under drought (the area with SPI value below −0.5), and severity (a combined measure of duration, intensity, and percentage area under drought), for each major drought event. Drought characteristics allowed us to identify the periods during which potential meteorological droughts occurred due to prolonged rainfall deficiency.

We conducted hydrological simulations using the VIC model, aiming to examine the impacts of identified droughts on river discharge. VIC model enabled us to trace the progression of drought conditions from their meteorological origins to their impact on the hydrological system. For that, we calculated discharge anomalies at 17 major Harappan stations (Fig. 1) for the major drought events, including nine stations from the Indus basin (three from Upper Indus, three from Middle Indus, and three from Lower Indus), three stations from the Ganga basin, and five stations from the Sabarmati basin.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.