Introduction

Flooding has impacted human society for millennia1. Notable examples include the Judeo-Christian story of Noah’s Flood2 and the Chinese legend of Emperor Yu’s Great Flood3, both of which underscore the profound societal impacts of such events. Floods still cause massive losses annually, e.g., the catastrophic 2022 Pakistan floods4, even though advancements in real-time satellite monitoring and hydrological models enable precise flood tracking, forecasting, and hindcasting5,6. Many assessments suggest that floods will become more frequent and severe with global warming. However, empirical evidence to substantiate this conclusion remains surprisingly limited7,8.

Floods are hydrological extreme events that often result from complex interactions between climate, hydrology, and human activities within a drainage basin9. Climate change has induced pronounced non-stationarity in hydro-climatic systems10, altering precipitation and runoff patterns globally, as evidenced by instrumental records spanning the past century8,11. For instance, Europe experienced an unprecedented flood-rich period (1990–2016) that stands out even within a 500-year context12. Paleo-flood reconstructions extending back two millennia further demonstrate that both flood-prone and quiescent periods have occurred across Europe and North America in either warmer or cooler climate intervals13,14,15. Besides climate, human land use activities such as artificial channelization and land clearing were also qualitatively linked to increased flood frequency, e.g., in the Yangtze River since 1420 CE16. Land use change roughly accounted for 75% of the increased 100-year flood occurrences in the Mississippi River during the 20th century17. However, current studies lack a robust methodology to effectively disentangle the influences of anthropogenic and climate-driven factors on major floods, thereby sustaining ongoing scientific debates5,7. For example, several studies suggest that deforestation and land cover change primarily affect small to medium floods, while their influence diminishes for extreme events in large basins5,18,19,20,21.

Furthermore, the precipitation-flood paradox—where observed and projected increases in precipitation extremes under climate change contrast with the mixed, and often non-increasing, trends in observed floods5,7,11—highlights fundamental uncertainties in flood prediction arising from critical data and attribution gaps. Specifically, instrumental records (typically <100 years in most basins) are often too short to capture long-term flood variability, particularly for streamflow and precipitation extremes17. While historical documents could extend flood chronologies by millennia7, they remain underutilized due to concerns over data completeness, precision, and potential biases22. Similarly, paleo-flood reconstructions, though valuable, are inherently limited by the rarity of extreme events preserved in sedimentary archives, making it difficult to establish robust connections with climate proxies17. In flood attribution, historical records rarely document specific climate-flood linkages. Contemporary analyses of individual floods often focus on precipitation extremes but overlook the impacts of landscape change5,23. Even when incorporated into numerical models, landscapes are typically parameterized empirically, with human activities treated as exogenous inputs, thereby obscuring their co-evolving nature and altering the catchment trajectory. Most critically, the pathways through which human activities modulate floods remain poorly understood and are rarely accounted for in flood risk projections. Particularly, land use changes may lead to potentially contradictory impacts on flood risk5,16,17,23. Unfortunately, instrumental observations (<50 years) are often inadequate for evaluating and confirming hypothetical mechanisms behind infrequent events that occur over decades in slowly evolving landscapes19,21. Collectively, these critical gaps significantly constrain our ability to reliably predict flood extremes, particularly for the most severe events.

To address these shortcomings, this study examined the Yangtze River Basin, where exceptional historical documentation provides flood history spanning over a millennium (950 CE to present) Box 1. This heavily populated basin represents both a historically flood-prone area with immense flood volumes24 and a projected climate change hotspot for future flood risk5,7. Moving beyond conventional event-centric analyses, we delineate emergent flood patterns through systematic integration of annually resolved historical records (950–2000 CE), encompassing flood chronologies, climatic variables (precipitation and temperature), and anthropogenic factors (population dynamics, forest cover changes, and polder development) (see Methods). Our novel cascading attribution methodology advances flood causality analysis by decoupling dynamic and thermodynamic precipitation drivers, and then evaluates anthropogenic modifications to fundamental climate-flood relationships across this vast basin. These millennial-scale insights should deepen our understanding of flood variability and enhance our capacity for forecasting, mitigation, and integrated watershed management in a changing world.

Results

Millennial flood regimes in the Yangtze River Basin

The Yangtze River Basin, home to over 400 million people, has undergone distinct flood regime shifts during the past millennium. Our analysis of 1220 historically documented flood events across 11 sub-basins is classified into three severity categories: mild (Group 1), severe (Group 2), and extreme (Group 3) floods. These events delineate three flood regimes corresponding to major climatic epochs: the Medieval Warm Period (MWP; 950–1400 CE), the Little Ice Age (LIA; 1400–1850 CE), and the Current Warm Period (CWP; 1850 CE–present), yet the transitions between flood-poor and flood-rich periods are less pronounced (Fig. 1) than the nine regularly recurring flood-rich periods identified in European records12. The basin shows clear nonstationary behavior in flood regime (Table 1), with an exponential increase in both annual flood frequency and number of affected sub-basins (~3.5‰ annual growth rate), alongside a 56% intensification in mean flood magnitude during the past two centuries, despite potential underreporting of early mild and severe events (Fig. S1). Spatially, the affected sub-basins were initially confined to downstream areas, but over time, the impacts expanded to upstream sub-basins as well, with two distinct flood hotspots emerging in terms of flood density (events per region per period) (Fig. 1). One is the persistently active middle reaches (Han River and Poyang Lake basins). Another is the upper reaches (Mintuo River and Jialing River basins), where floods intensified in both warm climatic periods. These patterns indicate a general trend toward more frequent and severe floods, underscoring the inadequacy of assuming flood stationarity for engineering design and watershed management planning. The observed intensification warrants a comprehensive investigation of its climatic and anthropogenic origins in this river system, which is globally significant due to its large size, high population dependence, and socio-economic importance.

Fig. 1: Flood occurrences across the 11 sub-basins of the Yangtze River Basin, from the upper to the lower reaches, span the Medieval Warm Period (MWP), the Little Ice Age (LIA), and the Current Warm Period (CWP) from 1000 to 2000 CE.
Fig. 1: Flood occurrences across the 11 sub-basins of the Yangtze River Basin, from the upper to the lower reaches, span the Medieval Warm Period (MWP), the Little Ice Age (LIA), and the Current Warm Period (CWP) from 1000 to 2000 CE.
Full size image

The upper plot illustrates the variation in annual global temperature alongside the total annual flood occurrences. The lower panel presents flood regimes, with each point representing a flood event. Quantile density contours at 5% intervals are shaded from pink (5%) to red (95%), illustrating the gradient from poor to rich regimes. The right plot displays a histogram categorizing floods into mild, severe, and extreme for each sub-basin over the entire period.

Table 1 Stationarity test of time series (xt) using the Augmented Dickey-Fuller (ADF) test, including decomposed approximate coefficients for four flood indices, global mean temperature, China’s precipitation regime, and population, forest cover, and polder area in the Yangtze River Basin from 950 to 2000 CE

Climatic origin of flood regimes

Floods originate primarily from basin precipitation. In the Yangtze River Basin, located within the Asian monsoon-dominated region, most floods occur during the summer rainy season25. The river basin experiences dry or wet climatic conditions, classified into six distinct precipitation modes for May through August, based on comprehensive documentation and climate data in China spanning from 950 to 2000 CE26,27,28 (Fig. S2). Climate change can impact regional precipitation, and thus floods, through both thermodynamic (primarily through temperature-driven increases in moisture capacity) and dynamic mechanisms (via alterations in atmospheric circulation patterns)29,30,31.

As a result of dynamic shifts in large-scale atmospheric circulation32, particularly changes in the East Asian monsoon and mid-latitude westerlies, wet precipitation regimes became most common during the LIA (Fig. 2a). Under colder global conditions, the subtropical westerly jet stream weakened and shifted equatorward, enhancing meridional flow and moisture transport into East Asia32,33. This southward displacement fostered persistent rain belts and increased flood frequency through a “flood-rich” mechanism, as blocking patterns slowed the progression of weather systems. Additionally, the western North Pacific subtropical high may have intensified and extended westward, further reinforcing monsoonal moisture convergence over the region34. Comparable mechanisms have been associated with contemporaneous flood-rich periods in Europe, where polar jet stream shifts and amplified meridional flow sustained quasi-stationary weather systems and extreme precipitation12,35. Despite some hemispheric coherence, these interactions underscore strong regional controls on precipitation regimes, which vary substantially across climate zones and limit global uniformity of flood signals. Notably, LIA precipitation exhibited a reduced frequency of multi-decadal oscillations (Fig. 3a), indicating suppressed atmospheric variability under colder climates36. Such suppression diminishes the intensity and frequency of oscillatory phenomena, such as monsoon fluctuations and ENSO-related anomalies, thereby promoting more persistent and stable atmospheric states37,38. These stabilized conditions likely contributed to more uniform precipitation regimes, shaping the observed multi-decadal flood variability and leading to a higher frequency of mild floods relative to severe events during this period (Fig. 2b).

Fig. 2: Climatic impacts on regional precipitation regimes through dynamic shifts and thermodynamic scaling.
Fig. 2: Climatic impacts on regional precipitation regimes through dynamic shifts and thermodynamic scaling.
Full size image

a Histogram of precipitation regimes in China, ranging from extreme dry (PI) to extreme wet (PVI), during the Medieval Warm Period (MWP), the Little Ice Age (LIA), and the Current Warm Period (CWP). Wet regimes were more prevalent during the LIA, while dry regimes dominated in the CWP, indicating a dynamic shift in precipitation patterns. b Box-whisker plots of precipitation intensity across individual regimes in the Yangtze River Basin. Boxes represent the interquartile range, diamond markers indicate means, whiskers show variability, and individual data points are plotted. The MWP is excluded due to missing data prior to 1470. The difference between the regression lines for the LIA (blue) and the Post-LIA/CWP (red) indicates a thermodynamically driven increase in precipitation intensity, as confirmed by a two-tailed T test (p = 0.0236).

Fig. 3: Climate impacts on Yangtze floods via dynamic and thermodynamic mechanisms in 950-2000 CE.
Fig. 3: Climate impacts on Yangtze floods via dynamic and thermodynamic mechanisms in 950-2000 CE.
Full size image

a The continuous wavelet power spectrum of precipitation shows lower frequency precipitation patterns dominating during the Little Ice Age (LIA). b The categorical ratio of annual floods, smoothed with a 30-year moving window, reflects flood frequency patterns similar to precipitation during the LIA. c The categorical ratio of annual floods correlates with rising global mean temperatures, showing a significant decrease in mild floods but an increase in severe and extreme floods, highlighting the thermodynamic differentiation in flood magnitude.

Unlike dynamic circulation shifts, thermodynamic changes in precipitation exert distinct influences on flood regimes through temperature-mediated moisture availability. While cold climates favor wet precipitation regimes in the region, warming tends to increase precipitation, as predicted by Clausius-Clapeyron scaling39,40. Historical records (1470–2000 CE) support this thermodynamic effect through differential trends: wet regimes exhibit stronger precipitation increases (7.6% per century for regime V) than dry regimes (3.4% for regime I) (Fig. 3b), comparable to reported precipitation–temperature scaling rates of approximately 7% per degree41. We refer to this warming-driven enhancement as a ‘flood-strong’ mechanism, meaning that rising temperatures amplify precipitation within existing regimes, thereby increasing flood severity without altering the underlying circulation state (Fig. 3c). In reality, dynamic and thermodynamic processes often interact, jointly shaping flood risk through both regime shifts and intensity amplification. Consequently, warming has reduced the frequency of mild floods while increasing severe and extreme events, suggesting a shift towards fewer mild but more intense floods. A similar thermodynamic amplification may also explain the recent surge in large floods across Europe12. These consistent hemispheric patterns underscore thermodynamic scaling as a key contributor to contemporary flood intensification.

Overall, the dynamic and thermodynamic mechanisms of precipitation exert different impacts on flood frequency and magnitude. Together, they explain the known inconsistency between cold/warm climates and flood-rich/-poor periods12, which can help better predict flood regimes in the context of climate change. However, these statistically stationary precipitation regimes account for <20% of observed millennial flood variability and fail to explain the pronounced exponential uptrends. Most strikingly, precipitation-normalized flood intensity increased by 23.5% during the Current Warm Period, significantly exceeding the levels of the Little Ice Age (Fig. 4), suggesting that factors beyond climate change have influenced floods in the region.

Fig. 4: Fluctuations in precipitation intensity (green) and flood intensity (blue) in the Yangtze River Basin from 1470 to 2000 CE.
Fig. 4: Fluctuations in precipitation intensity (green) and flood intensity (blue) in the Yangtze River Basin from 1470 to 2000 CE.
Full size image

The normalized flood intensity (orange), adjusted for precipitation, shows a pronounced upward trend (a 23.5% increase during the CWP compared to the LIA), suggesting that factors beyond precipitation change have significantly contributed to flood generation.

Increasing dominance of anthropogenic influences on flood regimes

Land use changes alter the natural hydrological properties of catchments, but their long-term impacts on floods are often speculative and not explicitly considered in current flood predictions7,8,42. Our millennial-scale analysis, combining signal processing and cointegration regression, shows that human land use activities have surpassed climatic influences in exacerbating floods (Table 2, Figs. S3 and S4). Specifically, dynamically and thermodynamically driven precipitation variations contributed mainly to flood fluctuations. In contrast, growing anthropogenic influences were responsible for the observed uptrend in both flood frequency and intensity over the past 1000 years.

Table 2 Cointegration relationships between the time series of decomposed approximate coefficients for four flood indices and five forcing variables from 950 to 2000 CE

Traditional unit hydrograph theory assumes flood intensity scales proportionally with precipitation in a basin43,44. However, our analysis of independent precipitation and flood datasets (1470–2000 CE) reveals that this fundamental assumption fails to hold (R2 = 0.03) (Fig. 5a). By incorporating anthropogenic indicators (population density, forest cover, polder area) into the precipitation-flood relationship, we demonstrate that human activities fundamentally reconfigure flood generation mechanisms. Notably, normalization by these indicators restores statistical significance, with combined anthropogenic adjustments yielding an R2 of 0.82 (p < 0.0001) (Fig. 5e). Forest cover emerges as the dominant mediator, accounting for 71% of variability when considered independently. The basin’s dramatic deforestation from 60% to 10% coverage over the millennium provides an exceptional trajectory to track these effects. Obviously, forests consistently reduce both flood frequency and intensity under equivalent precipitation - particularly for heavy floods (i.e., intensity ≥ 2) (Fig. 5f). The clear inverse relationship between forest cover and flood severity proves forests buffer floods across all event magnitudes even during extreme rainfall, thereby resolving the long-standing ‘forest-flood debate’ through irrefutable, observation-based evidence19,21. Beyond this theoretical resolution, the restored precipitation-flood relationship under anthropogenic normalization (R2 = 0.82) demands the urgent incorporation of land-use legacies into global flood risk assessments, particularly for those nations where deforestation continues unabated and parallels the Yangtze’s historical trajectory.

Fig. 5: Relationships between precipitation and flood intensities in the Yangtze River Basin.
Fig. 5: Relationships between precipitation and flood intensities in the Yangtze River Basin.
Full size image

a Statistical relationship between overall precipitation and flood intensity. be Conditional relationships between precipitation and flood intensity, normalized by population (b), forest cover (c), polder area (d), and their combined effects (e). Normalization accounts for 82.19% of the total impact, with forest cover having the highest influence (71.06%), followed by population (63.59%), and polder area (11.09%). The regression intercepts suggest that forestation has a positive impact, while population growth and polder construction have negative impacts on flood intensity. f Relationship between flood intensity and forest cover, normalized by precipitation, shows that increased forest cover significantly reduces heavy floods under the same precipitation levels. Bubble sizes represent flood intensity.

Apart from the deforestation trajectory, humans have a long history of cultivating agricultural fields, developing urban areas, and building hydraulic infrastructure to improve living conditions in flood-prone and low-lying areas45. Major hydraulic solutions implemented in the Yangtze River Basin include polder systems, which entail constructing levees and drainage systems to prevent flooding within the area, as well as building banks and dams to regulate floodways. However, these hydraulic engineering measures may yield unintended consequences, such as raising riverbed levels, a phenomenon known as the levee paradox17. Our cointegration regression analysis demonstrates that afforestation tends to reduce overall floods, while population growth and polder expansion tend to increase floods (Figs. 6b, 7d). The impact of population growth is largely associated with urbanization, which increases built-up and paved areas, reducing soil infiltration capacity, increasing rapid runoff, and thereby elevating flood risk. Over the past millennium, the basin’s population has increased tenfold, the polder area has expanded fivefold, and formerly separate polders have coalesced into large, interconnected hydraulic communities in an evolving way46 (Fig. S3a). The drainage outflows from these systems elevate downstream water levels, pushing flood impacts further downstream19. This phenomenon became particularly evident during the early 1600 s, when the peak of polder construction coincided with an increase in flood occurrences—occurring more often than expected in 9 out of 10 years47. The problem has worsened with the shift from traditional human-powered water-wheels to modern high-capacity electric pumping systems, particularly in recent decades48 (Fig. S3b). The current network of 2629 stations, with a combined drainage capacity of 19,400 m3/s (~25% of peak flood flows)49, has substantially increased flood vulnerabilities in downstream regions containing major urban agglomerations such as Nanjing and Shanghai. This technological shift explains the disproportionate severity of recent catastrophic floods (1954, 1998, 2020) relative to their precipitation inputs (Fig. 6). Most strikingly, while the 1954 and 1998 floods occurred under comparable rainfall conditions, the 2020 event achieved record flood peaks with significantly less precipitation–clear evidence that artificial drainage systems have become the dominant flood driver in the contemporary period.

Fig. 6: Climatic and anthropogenic attribution of three major floods in the Yangtze River Basin.
Fig. 6: Climatic and anthropogenic attribution of three major floods in the Yangtze River Basin.
Full size image

Phase spaces for precipitation and discharge accumulated during the three worst flood events (1954, 1998, 2020) at three representative stations: Yichang (a), Hankou (b), and Datong (c). Accumulation minimizes the variability in flood impact due to antecedent soil moisture conditions. Unlike the 1954 and 1998 events, the 2020 flood occurred with relatively low levels of accumulated precipitation and discharge, underscoring the significant intensification of downstream flooding due to artificial drainage from 2629 pumping stations in the mid-reaches. Despite the Three Gorges Dam, the world’s largest in terms of power, reducing the flood peak at Yichang (upstream) from 75,000 m3/s to 51,000 m3/s in 2020 (a), peaks still occurred at Hankou (midstream) and Datong (downstream), with discharges second only to the 1954 event. The size of the bubbles represents the daily river discharge (m3/s), and the solid bubbles indicate the maximum discharge (m3/s) for each flood event.

In summary, human activities have inadvertently but progressively altered landscapes while deliberately developing water infrastructure, jointly producing emergent flood dynamics (Fig. 7). This anthropogenic emergence requires a conceptual shift from natural to human-dominated flood risk prediction. Given the feedback nature between human interventions and flood systems, exemplified by the forest-flood debate, polder effects, and levee paradox, anthropogenic factors must be incorporated as endogenous variables rather than external drivers. The conceptual shift should foster the development of structurally novel models that enhance flood predictability. This is especially the case for “dragon-king” occurrences, which represent extreme outliers arising from distinct generating processes rather than ordinary hydro-climatological variability and therefore deviate from standard hydrological expectations50.

Fig. 7: Impacts of climate change on regional precipitation patterns and their interactions with human land use in shaping Yangtze River floods.
Fig. 7: Impacts of climate change on regional precipitation patterns and their interactions with human land use in shaping Yangtze River floods.
Full size image

Global climate change influences regional precipitation through dynamic and thermodynamic mechanisms. Concurrently, human activities, including afforestation/deforestation, agricultural practices, irrigation, polder expansion and drainage, urbanization, and dam construction and operation, alter flood patterns in the basin.

Future floods in the anthropocene

Climate projections from the IPCC AR6 indicate that global near-surface and air temperatures will continue to rise until at least mid-century, relative to 1850–1900 levels8,51. According to projections from the Coupled Model Intercomparison Project Phase 6 (CMIP6), precipitation in the Yangtze River Basin is expected to increase in both annual totals and daily maximums under all emissions scenarios throughout this century51. The basin is expected to enter a wetter phase of its dominant precipitation cycle within the next 30–40 years. Coupled with thermodynamic intensification of precipitation, Yangtze floods are very likely to become more frequent and intense in the coming years. Superimposed on existing anthropogenic flood amplifiers, these hydrological changes pose substantial risks to this economically critical region, which sustains one-third of China’s population and generates over 40% of the national GDP.

Discussion

Human-caused landscape changes, as well as climate change, must be explicitly considered in any effective flood-risk mitigation and resilience building. Reliable flood predictions depend on accurate attribution of dynamic (atmospheric circulation) and thermodynamic (temperature-mediated) precipitation changes that are currently lacking in structuring flood attribution. Equally critical is quantifying anthropogenic influences as endogenous variables in midterm flood risk prediction, which are currently treated as static in most models, despite their significant role in shaping flood regimes. Historical land-use baselines (e.g., the Yangtze’s decline from 60% to 10% forest cover) can inform nature-based solutions, such as reforestation, while modern engineering impacts (e.g., a 19,400 m3/s pumping capacity) necessitate whole-basin management frameworks. Such frameworks should conceptualize the Yangtze as a coupled human-water system, where upstream interventions, such as deforestation, and downstream infrastructure, like polders, interact across scales. These findings can help improve flood prediction models by incorporating land-use legacies, develop next-generation early-warning systems that consider both climate and human factors, and implement adaptive policies that recognize the basin’s evolving flood vulnerability. The findings of our study should have implications for global flood-prone regions under similar climates to the Yangtze.

Methods

Methodology

Unlike instrumental observations, historical records of heavy precipitation and flood events are often qualitative, with more quantitative descriptions appearing in later periods. Since the 1980s, historical events have been extensively compiled by Chinese national institutions using common criteria to classify events by magnitude26,52. We accordingly assigned numerical values to the classes, ranging from 1 to 3 for documented flood magnitude (mild, severe, and extreme) and 1–5 for station-based summer precipitation (dry to wet). We also categorized precipitation into six regimes (I–VI) from dry to wet for the millennium in China. For floods, we calculated the number of annual events, the number of flooded sub-basins, and the flood frequency for the entire Yangtze River Basin. We used an exponential fit to the annual flood frequency and affected sub-basin trends to estimate their annual growth rates. In the absence of instrumental data for mainstream discharge and water levels, we defined the overall flood intensity of the entire basin as the mean magnitude of annual events. Relative to the previous eight centuries, flood intensification over the past two centuries was quantified using the mean flood magnitude. We also defined flood density as the total number of occurrences in a region during a specific time period to portray the spatiotemporal regime. To categorize flood severity, we calculated the ratio of annual floods within each severity class (mild, severe, and extreme floods) to the total number of floods for that year in the basin. Despite the well-established local chronicles system for event documentation dating back to the Sung Dynasty (960–1279 CE)52,53, we scrutinized historical floods using an information-entropy-based test to ensure data robustness for statistical analysis (see Fig. S1 for details). This test evaluates the randomness and informational completeness of the event sequence by calculating entropy across time segments. Low entropy values suggest potential data gaps or reporting biases, whereas higher entropy indicates a more uniform and diverse distribution of events, reflecting greater completeness and reliability. For precipitation, we defined the overall precipitation intensity of the entire basin as the annual mean of the numerated summer precipitation for 32 stations available within the basin for 1470–2000 CE (Fig. S5). The overall intensity also serves as a basis for assessing the agreement between basin precipitation and continental dry-wet regimes (Fig. 2b).

To disentangle the impacts of climate change on floods using documented data, we decomposed the complex cascade from temperature to precipitation, to specific landscapes, and to individual flood events in three dimensions (cascading attribution). First, we analyzed precipitation variability from both dynamic and thermodynamic perspectives. Dynamically, we used the national annual dryness/wetness time series (950–2000 CE) to characterize regional precipitation regimes shaped by the Asian Monsoon and other large-scale atmospheric circulation systems (Fig. S226,27,28). Thermodynamically, we divided the dryness/wetness series into the Medieval Warm Period (950–1400 CE), the Little Ice Age (1400–1850 CE), and the Current Warm Period (1850–2000 CE). We then used station-based annual summer precipitation data for the Yangtze River basin (1470 to 2000 CE) to compare precipitation regimes and their intensities between colder (LIA) and warmer (CWP) climates (Fig. 2). Second, we introduced the concept of flood-strong into the flood-generating framework, complementing the widely discussed flood-rich concept in recent studies12. Flood-rich refers to periods of increased flood frequency, whereas flood-strong denotes episodes of heightened flood intensity. These dimensions represent different aspects of flood regimes and do not necessarily coincide. Accordingly, we depicted both flood-rich (frequency) and flood-strong (intensity) episodes, providing new insights into the structured variability of floods in relation to precipitation regimes and thermodynamic influences. Third, since floods originate from precipitation and vary with underlying landscapes that are subject to human modification, we explicitly considered anthropogenic influences in the analysis of millennium floods in addition to climate change. Given the diversity of landscapes, land use, and land cover changes may yield different impacts, still largely unknown. In combination with compiled demographic data, we isolated and evaluated the anthropogenic influences in terms of basin-wide and downstream effects. The outcomes provide a foundation for building global resilience to alleviate flood risks across the entire basin.

We used unit root tests to distinguish stationarity and non-stationarity in time series data54. Specifically, the augmented Dickey–Fuller (ADF) test was applied to examine stationarity (and trends) at a significant level of p = 0.05 and determine the order of integration for the time series55. For non-stationary time series with periodicity, we used the Morlet wavelet approach, which is optimal among many wavelet approaches in determining periodicity56. Furthermore, we decomposed the annual flood sum, flooded sub-basin, and overall flood intensity into low- and high-pass filter components—one determined trend and four high-frequency components—for the period 950–2000 CE using the Daubechies-4 algorithms of wavelet analysis57 (Fig. S4). The approximate coefficients for the trend were then cointegrated with climate and anthropogenic indicators, including global temperature, precipitation regime, population, forest cover, and polder area. The Johansen test was applied to confirm the multicointegration between the time series58. Additionally, we normalized overall precipitation and flood intensities with population, forest cover, polder area, or their combination. The normalization process assigns the precipitation-flood relationship to the same conditions, minimizing differences due to individual or combined influences, thereby allowing for the evaluation of individual influences on the distorted precipitation-flood relationship. Regarding the top three worst flood events (1954, 1998, and 2020), we proposed analyzing the phase space of precipitation and discharge in annual accumulation to minimize the effects of antecedent soil moisture ahead of flood events at three representative stations (Yichang, Hankou, and Datong). Furthermore, the Three Gorges Dam (TGD) outflow was replaced with the TGD inflow to restore the discharge at Yichang for the year 2020, without the TGD.

Data

Hydrological data exists in two forms. First, daily river discharge and water level data are available from the Hydrological Bureau of the Changjiang Water Resources Commission, dating back to 1877 for Yichang, 1865 for Hankou, and 1922 for Datong (with some years missing due to wars). The daily discharge data for the TGD inflow and outflow were obtained from the China Three Gorges Corporation for the period from 1 June 2002 to 31 December 2020. Second, flood events, numerated at three levels (mild, severe, and extreme) for 11 sub-basins, are charted in the Yangtze Chorography for the period from 185 BCE to 1999 CE52. To maintain consistency with the precipitation series, we excluded flood events before 950 CE from the analysis.

The precipitation data used for the basin consist of three types: station-based daily, station-based annual summer, and national annual dryness/wetness. First, daily data is available for 179 national meteorological stations from the China Meteorological Data Sharing Service System for the period from 1 January 1951 to 31 December 2020 (Box Fig. 1). Second, station-based annual summer (May–September) precipitation is available from the system for 32 stations within the Yangtze River basin from 1470 to 2000 CE. Precipitation is numerated in five grades from dryness to wetness for each station, according to well-established numeration rules based on extensive compilations of documents routinely recorded in the local chronicles system59,60,61. Third, the national annual dryness/wetness time series, classified into six modes, was constructed with orthogonal analysis based on 118 stations for the period 1470–197726 (Fig. S2), extended to the period 950–199127 and then updated to 200528.

We extracted global mean surface temperature for the period 950–2000 CE from a 2000-year-long time series of temperature anomalies, using a state-of-the-art dataset reconstructed with a global collection of paleoclimate records62. The anomaly is referenced to 13.97°C, the mean for 1961–1990 CE. Annual population data were linearly interpolated from population charts for the basin in each dynasty from 2 CE (Han Dynasty) to 1936 (Republic of China), as documented in historical records63. Annual forest cover was similarly interpolated from forest estimates based on extensive historical documents from the 2nd to the 20th century64,65. Polders, mainly constructed in the mid- and lower reaches of the Yangtze River over the last millennium, were documented, and the annual polder area was linearly interpolated from historical records for the period 950–2000 CE66,67.

Annual reservoir storage and irrigated area data for 1950–2020 CE were collected from published refs. 68,69,70 and the yearly bulletins for the Yangtze River and the Southwest Rivers71 (http://cjw.gov.cn/zwzc/bmgb/szygb/). Annual population and GDP data for each province are available from the China Statistical Yearbook for the period 1950–2020 CE (http://www.stats.gov.cn/tjsj/ndsj/), with the totals for the entire basin calculated subsequently. Electronic drainage power data was estimated based on quantitative descriptions for individual provinces and charts for waterlogging in the basin72,73.