Socioeconomic predictors of vulnerability to flood-induced displacement

Mester, Benedikt; Frieler, Katja; Korup, Oliver; Desai, Bina; Schewe, Jacob

doi:10.1038/s41467-025-64015-8

Download PDF

Article
Open access
Published: 16 September 2025

Socioeconomic predictors of vulnerability to flood-induced displacement

Nature Communications volume 16, Article number: 8296 (2025) Cite this article

5555 Accesses
20 Altmetric
Metrics details

Subjects

Abstract

Floods displace an average of 12 million people every year, and are responsible for 54% of all disaster-induced displacements. Displacement risk scales with the vulnerability of exposed populations, but this vulnerability is poorly understood at a global scale. Here we show that measures of human development and rural areas explain more of the variance of displacement vulnerability than income levels measured by gross domestic product. We combine global flood and displacement data to estimate vulnerability, as the ratio of displacement to exposure, for over 300 historical flood events. We find that this vulnerability varies by several orders of magnitude both between and within countries. A random forest regression shows that infant mortality rate and population density are among the most important predictors of displacement vulnerability at national level and within countries, respectively, highlighting the vulnerability of low-income and marginalized populations and of rural communities. Our results indicate that, rather than relying on overall economic development alone, targeted investments are needed to improve living conditions and coping capacities for the most vulnerable groups, particularly outside of large cities, and to prepare for increasing flood hazards due to climate change.

Limited progress in global reduction of vulnerability to flood impacts over the past two decades

Article Open access 08 May 2024

Integrating social vulnerability into high-resolution global flood risk mapping

Article Open access 11 April 2024

The wider the gap between rich and poor the higher the flood mortality

Article Open access 17 April 2023

Introduction

More than 195 million people worldwide have been displaced by floods since 2008. This is more people than by any other type of disaster, and more than by conflicts and violence¹ (https://www.internal-displacement.org/database/). Disaster displacement refers to situations in which people are “forced or obliged to flee or to leave their homes or places of habitual residence, in particular as a result of or in order to avoid the effects of […] natural or human-made disasters”². Displacement avoids fatalities, but disrupts livelihoods, undermines well-being, and incurs substantial costs on communities and countries³. Displacement risk is a product of the physical properties of flooding (hazard), the exposure of people, their assets and livelihoods to flooding, and vulnerability, i.e., the susceptibility and lack of resilience to being displaced^4,5. Flood hazard has been changing over the past five decades^6,7,8,9 and, in particular with respect to rare, large events, is expected to increase in many regions under continued climate change^10,11,12. More than 20% of the world’s population are currently exposed to high flood risk¹³, and population growth and urbanization are set to raise this exposure further, particularly in lower-income countries^14,15. At the same time, progress in reducing vulnerability has not been sufficient to reduce overall disaster risk^16,17. Against this backdrop, it is important to understand and quantify flood displacement vulnerability. Knowing what determines this vulnerability is important for understanding past trends in displacement risk; anticipating future changes; and identifying entry points to improving the resilience of affected communities to reduce displacement risk.

However, it is unclear how displacement vulnerability varies between flood events, and which factors, beyond differences in hazard and exposure, might explain this variation. Only few studies have explored flood-induced displacement at the global scale^18,19,20,21. Displacement is mostly low in countries with gross national income above $13k (2020 international $) per capita, while both low and high rates of displacement (per country population) are observed across lower-income countries²¹. It remains unresolved how much of the variation in displacement can be attributed to national income levels. Similarly, little is known about the role of non-economic or local factors, such as urban development and infrastructure access, demographics, or social disparities, which are important drivers of social vulnerability to flooding in many case studies^22,23,24 and large-scale assessments²⁵ but have rarely been considered in the context of displacement.

Here we combine reported displacement data with remote-sensing data of flood extents and gridded population estimates, to estimate vulnerability, as the ratio between displacement and flood exposure, for over 300 large fluvial and coastal flood events that occurred around the world between 2008 and 2018. We examine which predictors, measured at sub-national resolution, explain most of the observed variation in displacement vulnerability between individual events, using a mixed-effects random forest²⁶ to account for unobserved country-specific factors (i.e., average vulnerability might be lower in one country than in another). To gain insight into these potential country-specific factors, we apply random forest regression to predict the median vulnerability per country using predictors measured at the national level. While vulnerability is ultimately relevant at the local level, it is impossible to directly measure all its possible determinants across many countries. Factors such as the presence of disaster early warning systems, physical protection measures, the availability of emergency and recovery assistance, or public awareness to flood hazards are hardly documented at global resolution. Such elements may, however, be reflected by national-level characteristics such as public assets or forms of governance. Hence, country-level indicators might explain some of the variation in vulnerability across countries as opposed to indicators only available at the local level.

Our combination of the best available global flood observation data with the most complete and detailed global displacement estimates is unique compared to previous global studies of flood vulnerability^21,27,28. Our main methodological choices are motivated as follows. First, we use remote-sensing, rather than modeled, flood hazard data to warrant consistency and avoid model uncertainty²⁹, providing more accurate exposure estimates for each flood. Second, we use geocoded displacement information on level-1 or level-2 subnational administrative units (e.g., provinces or districts) for a finer resolved analysis than at national level²¹. Thus, we can identify the local context of displacement events, and address variations in displacement vulnerability not only across, but also within, countries. Third, as opposed to many previous studies that have focused on single predictors of flood impacts, such as national income or population size, we choose a multivariate analysis. Drawing from a larger set of plausible predictors, and using random forest regression, our analysis can also account for non-linear effects of, and interactions between, these predictors. Finally, using the vulnerability ratio as the target variable controls for the expected close association between exposure and displacements prior to the regressions. This narrows the distribution of the target variable (Supplementary Fig. S1) and makes sure we estimate predictor effects on vulnerability rather than on exposure. The third and fourth aspects especially differ from a recent study that estimated displacements from a smaller number of local-level independent variables in a linear regression framework¹⁸.

Results

Global displacement vulnerability

Vulnerability to flood displacement varies between countries by several orders of magnitude (Fig. 1, and Supplementary Fig. S2). It is high (>0.1, meaning one or more displacements for every ten people exposed) in many South American, Sub-Sahara African, and Asian countries. Countries with high vulnerability include Ecuador, Ethiopia, Zimbabwe, Nigeria, Afghanistan, Nepal, and China (Supplementary Fig. S2); some countries have only one or two data points (Supplementary Fig. S3). We estimate median vulnerability of >1 across multiple events) in several countries including Afghanistan, Ecuador, and Ethiopia, while vulnerability to individual events was >1 in two thirds of countries with at least three reported flood events (Supplementary Fig. S2). Formally, vulnerability expresses a fraction of loss and thus cannot exceed 1. However, our estimate of vulnerability may exceed 1 if our estimate of displacements exceeds our estimate of exposed population. This can be due to preemptive evacuations³⁰, which cannot be separated from post-disaster displacement in the data; or due to social dynamics that displace people even when they are not personally exposed. This may happen if people follow their kin, or if their places of work or other source of income or important infrastructure, such as schools or childcare, suffer damage^31,32. Vulnerability estimates > 1 may also reflect an over-reported displacement, or an underestimated exposure. An underestimated exposure could in turn arise either from incomplete space-borne flood extent observations³³ (for instance, small but important features such as flooded streets in urban areas may not be captured) or low-quality population data³⁴. There is no significant trend in global median vulnerability over the period 2008-2018 (Supplementary Fig. S4). Given that event-specific vulnerabilities vary by orders of magnitude even within countries, we use log₁₀-transformed values³⁵; thus, our analysis concerns the magnitude of vulnerability. This approach also acknowledges the low, if not unknown, accuracy of displacement statistics³⁶.

**Fig. 1: Country-level median vulnerability to flood-induced displacement.**

Predictive Models

To understand the possible determinants of flood-displacement vulnerability, we first select potential predictors based on a review of the literature on flood-related social vulnerability, in addition to physical characteristics of the floods and inundated areas (Methods). A set of up to three such candidate predictors feeds into a random forest regression, excluding combinations of closely related and mutually correlated predictors. We test many different models (predictor combinations) in a leave-one-out cross-validation setup. Our five preferred models (highest R²) in the event-level analysis have R²-values of 0.27-0.31, and all include population density and elevation as predictors (Table 1). Ranking models by the Akaike information criterion (AIC) or Bayesian information criterion (BIC), which penalize models with more predictors, yields nearly identical results as ranking them by R² (Supplementary Table S1). In the country-level analysis, our five preferred random forest models have R² values of 0.28 − 0.34, and all include the level of urbanization (share of urban population in total population) and education index as a predictor. Again, ranking models by AIC or BIC yields very similar results (Supplementary Table S2). The modest R² values, while not surprising given the complexity of the issue, mean our models only partially explain vulnerability. The predictor importance ranking discussed below must be viewed in this context of low explained variance; nevertheless, they provide meaningful insights into the relative roles of different socioeconomic factors.

Table 1 Explanatory power (R²) of the five preferred predictor combinations

Full size table

Important predictors at the event-level

To assess the importance of an individual predictor across different models, we test all models containing the predictor of interest before and after randomly permuting its measurements, and compare the resulting R² values. The change in R² due to randomization is a measure of the predictor’s contribution to the model skill, also termed feature importance³⁷. We rank predictors by the median decrease in R² after randomization (Fig. 2). Alternatively, predictors can also be ranked by the median R² across all models containing the relevant predictor (Supplementary Fig. S5). While this measure does not treat individual predictors entirely independently, it results in a similar ranking of the most important predictors as that by decrease in R². Results are also very similar when we rank predictors by the median increase in AIC (Supplementary Fig. S6) or BIC (Supplementary Fig. S7) after randomization, compared to ranking by decrease in R².

**Fig. 2: Feature importance in the event-level analysis (top; n = 303) and the country-level analysis (bottom; n = 72), measured by the median decrease in R² after randomizing.**

The event-specific analysis indicates population density and elevation as the most important predictors (Fig. 2, top). This result is consistent with the widespread presence of population density and elevation in the models with the highest R² (Table 1). The ranking also shows that these two predictors are more important than GDP per capita. This finding is crucial, because GDP per capita or some related measure of income levels is often used as the single indicator of socio-economic vulnerability, and assumed to be a reasonable proxy of measures of social status, economic deprivation, etc^21,38. Our results show that the variance in flood-displacement vulnerability is better explained by factors other than aggregate income levels (as measured by GDP per capita) alone (Fig. 2, top). These results are robust when using the ratio of deaths to exposure as an alternative target variable (Supplementary Fig. S8), suggesting they may represent general aspects of flood vulnerability.

We show the marginal effect of a given predictor on flood-induced displacement vulnerability in partial dependence plots. We find that population density has a negative marginal effect (Fig. 3). All else being equal, places with low population density are associated with high flood-displacement vulnerability. Our findings thus indicate that sparsely populated, rural areas tend to be highly vulnerable on average. This observation is consistent with theories and individual empirical studies on rural and urban flood vulnerability^22,24,39,40 that point out high vulnerability in rural areas. Our results support this notion systematically in relation to displacement, for a global context with a large number of observations. We recall that the extent of urban floods may be underestimated e.g., when short-lived or small features such as flash floods or flooded streets are missed by satellite imagery³³; however, such a bias would imply that we overestimate vulnerability in urban areas, and thus our finding of higher vulnerability to displacement from fluvial and coastal flooding in rural areas compared to urban areas remains robust.

**Fig. 3: Partial dependence plots (PDPs) for the most important event-level predictors.**

Vulnerability to floods and other disasters can be larger in rural areas than in cities for physical but also social and economic reasons²². In physical terms, small rural communities may have a much larger share of their population or assets exposed to a given hazard than large cities. For fluvial and coastal floods, in an urban context, much of the population living in the area for which exposure and vulnerability are assessed (e.g., some administrative unit or a grid cell) may be less exposed to hazardous or damaging water levels e.g., because of variations in elevation across the city, and multi-story residential buildings or other infrastructure may provide refuge and prevent displacement. In contrast, a small village may get completely flooded quickly, offering little for its inhabitants to take refuge, and making it much more likely that most or all of its population may be displaced. These physical aspects concern fine-scale variations in exposure, which our data cannot distinguish, and which are thus subsumed in our vulnerability metric. In terms of social and economic reasons, rural areas tend to be relatively poorer, with lower structural resilience of buildings, and to be neglected or treated subordinately by centralized government, resulting in higher vulnerability against floods and other disasters. For example, levees that protect larger settlements may lead to even higher flood levels for neighboring or downstream, smaller settlements. Rural areas may also lack economies of scale, as cities can afford much larger emergency response capacities such as professional fire brigades^22,39. Resilience against floods and other disasters differs markedly between rural and urban counties in the USA³⁹; such differences are likely to be more pronounced in less wealthy countries.

The marginal effect of the second most important predictor, elevation, is largely positive, such that vulnerability increases with elevation. Floods in mountainous regions tend to have different properties than floods in low-lying areas, for example mainly higher velocity of flow, or potentially damaging debris carried by the water^41,42. At the same time, mountain regions often have a different socioeconomic structure than lowlands: infrastructure and economic development are heavily modulated by topography; and a common demographic pattern is that young people move to cities while older people remain in mountain villages and towns^43,44. The increased vulnerability at higher elevations may thus partly reflect differences in age, educational, and economic characteristics influencing vulnerability and adaptive capacity. The partial dependence plot also indicates increased vulnerability at very low elevations, although this observation is based only on few samples and may be less reliable (Fig. 3). These areas below ~10 m above sea level are mainly coastal areas which globally are often densely populated and susceptible to coastal flooding; with their flat terrain, they may be associated with longer flood duration on average than more rugged areas.

The effect of (sub-national) GDP per capita, which is ranked third most important predictor by decrease in R², is negative in the range of about $ 6k to 10k (2017 PPP), while there seems to be little effect at either lower or higher income levels. While the cited range corresponds to the highest data density, many data points are available between about $ 1.3k and 20k, supporting a non-linear marginal effect. This means that high-income places tend to be less vulnerable than low-income places, but there is a lot of variation in vulnerability in both the low-income and the high-income range unexplained by income levels as measured by GDP per capita. Critical infrastructure has a negative marginal effect on displacement vulnerability, consistent with studies showing high flood vulnerabilities in undersupplied, informal settlements²⁴. The remaining predictors show mostly small or indeterminate marginal effects (Fig. 3 and Supplementary Fig. S9), which is consistent with their low feature importance ranking. This includes a measure of flood protection standards (FLOPROS), which in our context does not represent the effectiveness of flood prevention (we only study floods which were not prevented) but the possibility that higher flood protection standards may also be associated with stronger flood emergency response capacities. However, according to our analysis, this measure is of low importance in explaining displacement vulnerability; which may also be related to the high uncertainty of protection standard estimates in many parts of the world⁴⁵.

Important predictors at the country-level

At the country-level, urbanization level and infant mortality rate are the most important predictors, ranked by decrease in R²; followed by the share of elderly population (65 years and older) and GDP per capita (Fig. 2, bottom). When instead using the increase in AIC or BIC, the relative ranking of these predictors changes slightly, with education level and share of elderly population ranked more important than infant mortality. In any case, urbanization level, infant mortality rate, and share of elderly population are ranked more important than GDP per capita. While these non-economic, human development-related factors are linearly correlated with GDP per capita (r = 0.81 for urban population, and r = 0.91 for education; Supplementary Fig. S10), the finding that they are more weighty predictors suggests they contain important information related to the causes of vulnerability. For instance, most causes of infant death are preventable with low-cost measures⁴⁶, thus infant mortality rate is a measure of human development that is sensitive to deficient healthcare (and by extension, generally inadequate living conditions) even in a small fraction of a country’s population; whereas in country-level GDP per capita, income differences within the country get averaged out. The result that GDP per capita is not the most important predictor at sub-national level either suggests that, for similar reasons, aggregate income levels–at least as measured with available data products–may inappropriately capture vulnerability even when averaged over smaller areas.

The age structure variables (population aged 14 years and below, and 65 years and above, respectively) were also included in the event-level analysis, but were of relatively low importance there, while they are more important at the country-level. The same holds for urbanization, expressed by urban area at the event-level, and by the share of urban population at the country-level. These variables may thus be more indicative of the overall level of development and vulnerability in a given country or region, while other factors play a more important role in explaining the local variation between different events within a country. In particular, while the share of urban area at subnational level is only moderately correlated with population density (Supplementary Fig. S10), national-level urbanization is indicative of the overall fraction of population living in rural settings (corresponding to low population density at a subnational level), and thus being potentially more vulnerable, along the lines described in the previous section.

Partial dependence plots for the most important predictors in the country-level models show that urbanization and the share of population aged 65 years and older both have a negative marginal effect on vulnerability. In other words, vulnerability tends to be high in less urbanized countries and countries with a small proportion of elderly population. The first aspect may be related to the observation that in countries with low urbanization levels, relatively many people (compared to highly urbanized countries) live in rural areas which tend to be more vulnerable⁴⁰ - linking back to the importance of population density in the subnational models. A high share of elderly population is related to a high life expectancy, which in turn is an indicator of human development and, more specifically, well developed health care systems and other social services^47,48,49. In contrast, a high share of young population is often related to poverty and low levels of access to social services^24,50,51. Regarding infant mortality, predicted vulnerability is low only for very low infant mortality rates, but consistently high for infant mortality rates from around 1%−8% (Fig. 4). This suggests the capacity to prevent infant deaths is a strong indicator of broader societal development. The nonlinear shape of the PDP also confirms the importance of using flexible methods, such as random forest, that do not impose a linear relationship.

**Fig. 4: Partial dependence plots for the most important country-level predictors.**

The level of education (as measured by mean current and expected future years of schooling) has a clear negative marginal effect on vulnerability, while the population growth rate has a positive effect (Fig. 4). GDP per capita, which on the country-level is the fourth most important out of ten predictors, shows only a slight negative marginal effect on vulnerability at low and intermediate GDP values, while at higher values, vulnerability is predicted to be more significantly lower (Supplementary Fig. S9). This is in agreement with the observation that vulnerability is generally low in most high-income countries, whereas both low and high vulnerabilities are observed in lower-income countries²¹. The relative weakness of this predictor compared to urbanization and infant mortality rate shows that such additional, non-economic factors might be important in explaining the variance in vulnerability across most low- and middle-income countries.

Discussion

Vulnerability to flood-induced displacement is poorly understood in comparison to mortality or economic damages induced by flooding. In light of 10 million flood-induced displacements worldwide in 2023 alone¹, a better understanding of vulnerability is important to leverage risk reduction strategies and adaptation planning^4,52. We estimated event-specific vulnerability values for 303 recent flood events in 72 countries, and found that they vary by orders of magnitude both within and across countries. Particularly high vulnerabilities were estimated in some African countries, such as Ethiopia, Nigeria, and Zimbabwe; but also in China, Nepal, Afghanistan, and Ecuador. High vulnerability is thus widespread among, but not limited to, the lowest-income countries.

We investigated the importance of a range of social, economic, political, and physical factors in explaining variations in the magnitude of vulnerability across events and across countries. Within countries, using mixed-effects random forest models (R² ≤ 0.31), population density emerges as the most important factor, followed by elevation. At the same time, GDP per capita, a commonly used predictor often associated with disaster risk and vulnerability, is relatively unimportant in our models. We have used a recent global reconstruction of subnational GDP per capita⁵³ which utilizes data on urbanization levels, travel time to the closest city, and national-level income inequality to estimate subnational GDP per capita in countries with no reported data, which concerns many African and some Asian countries.

While sparsely populated, rural areas have been shown to be especially vulnerable to flood-related impacts in empirical and qualitative studies based on data from the USA³⁹ and across the world^22,24,40, this finding is new in the context of displacement. This result underlines the importance of clearly separating the effect of population density on vulnerability from its role in determining exposure: in densely populated areas, more people will be exposed to a given flood event. Studies trying to predict displacement using population density as an explanatory variable may conflate the two effects and produce biased results. In our study, we predict (the log of) displacement vulnerability as the ratio between displacement and exposure, thus accounting for the effects of population density on exposure. The remaining effect of population density on vulnerability, highlighted by our models, indicates residents of rural areas on average suffer higher displacement risk than their urban counterparts, for a given hazard. Our models estimate a two orders of magnitude difference in vulnerability between sparsely and densely populated areas, all else equal, which would imply a large potential for risk reduction from better protecting rural populations.

Across countries, the most important factors explaining differences in the average order of magnitude of vulnerability, according to random forest models (R² ≤ 0.34), are urbanization and infant mortality rate, followed by the share of elderly population and GDP per capita. When model quality is measured by AIC or BIC instead of R², education is similarly ranked very highly. These observations are in line with general theories stating that a higher level of human development leads to a decrease in vulnerability^5,54. Age structure, health, infrastructure, and education indicators are often found indicative of vulnerability to climate extremes for communities and individuals⁵⁵. Infant mortality rate may serve as a proxy for public service delivery in the context of climate extremes, such as emergency and recovery assistance, and has been found to explain parts of the variation in disaster-related deaths also within countries⁵⁶. High infant mortality rates are also related to the marginalization of affected communities⁵⁷, which is associated with higher flood risk⁵⁸, caused by, for example, a lack of early warning systems, physical protection measures (which are not well measured by FLOPROS in many Global South countries), or official emergency and recovery support. More directly, infant mortality can contribute to displacement during and after floods by acting as both a health crisis and a socio-emotional tipping point for affected households, compelling them to flee high-risk zones. In regions where floodwaters damage healthcare infrastructure or impede access to essential services, the elevated risk of infant death due to preventable causes—such as waterborne diseases or lack of neonatal care—can push families to migrate in search of safer environments with better medical access.

Again, we find that GDP per capita is relatively unimportant in the country-level models. GDP per capita, though widely used to characterize socioeconomic status, may thus be a poor measure of vulnerability to flood-induced displacement. One reason may be that communities are very heterogeneous, and the lowest-income and most vulnerable parts of the population may not show up in average income levels as measured by GDP per capita, whereas factors like infant mortality are sensitive to the presence of marginalized, impoverished, or undersupplied subpopulations. The marginal effect of the infant mortality indicator on vulnerability is highly nonlinear, highlighting the usefulness of models like random forest that do not impose a-priori linear relationships. For the real world, our results suggest reducing poverty and improving living conditions for the most vulnerable parts of the population might, among countless other benefits, effectively reduce displacement risk.

While random forests are well-suited for small samples⁵⁹, the limited amount of data available for our analysis introduces considerable variance in our results, which prevents us from drawing more detailed conclusions. Similar to other studies, a lot of residual variation in vulnerability remains that we cannot explain with our models, with R² values rarely exceeding 0.3. This variance may be partly related to data quality issues regarding the measurement of displacement. While IDMC to our knowledge is the highest-quality global source of displacement data, no uncertainty estimates are available for their figures, and IDMC figures and those from another global data provider (the Dartmouth Flood Observatory, DFO) can differ by several orders of magnitude (Supplementary Fig. S11). That said, some of our main findings–the prime importance of population density in the event-level analysis; the role of infant mortality rate, urban population, and share of elderly population among the most important predictors in the country-level analysis; and the relatively lower importance of GDP per capita–are robust even when DFO displacement data are used instead of IDMC data (Supplementary Fig. S12 and S13; note R² values are lower with DFO data than with IDMC data, Supplementary Fig. S14–S16).

On the other hand, the unexplained variance reflects to some degree the limited understanding of the many processes that shape vulnerability at the community and household level. Consequently, our models provide useful insights into some of the factors explaining variations in displacement vulnerability; they are less skilled for reliable prediction of displacement from any given flood event. Nevertheless, our results go beyond those of recent studies employing random forest¹⁹ or negative binomial¹⁸ regression to predict displacement (rather than displacement vulnerability). Ronco et al.¹⁹ attributed displacement mainly to a combination of intense precipitation and poor household conditions, albeit without a direct measure of flooding. Our study instead controls for the magnitude of flooding induced by precipitation and other drivers, and the importance of the infant mortality rate in our models is consistent with the importance of related measures of poor household conditions. However, we reveal the increased vulnerability of rural communities measured through the population density indicator. Vestby et al.¹⁸ found national income, level of democracy, and conflict as important factors explaining some of the variation in displacement magnitude. In comparison, we find that other factors are more important than national income, and these factors do not include the level of democracy which ranks very low in our study. Moreover, Vestby et al.¹⁸ found extreme displacement positively associated with nighttime luminosity, which they interpreted at least partly as population exposure. Our study explicitly controls for population exposure to flooding in the definition of the target variable, and therefore our findings can more readily be interpreted as relating to the importance of a given predictor in terms of vulnerability.

A negative relation between the economic status and vulnerability in terms of fatalities and economic losses, was found in previous works for a set of hazards, including flooding⁶⁰. Mortality rates are relatively high among countries with less than US$ 10k per capita, while a vulnerability threshold for economic loss rates varies between US$ 10k and US$ 15k^27,28. Globally, gross national income per capita is negatively associated with flood-induced displacements per 1000 people, with a breakpoint at US$ 13k per capita²¹. We find a nonlinear negative relationship between GDP per capita and displacement vulnerability at the event-level, with the strongest decline in vulnerability between ~$ 6k and 10k (2017 int. $ PPP), as well as at the country-level, with a drop at approximately US$ 28k (2022 US$ PPP). Thus, on the one hand, our study confirms a negative and non-linear association between aggregate income levels and vulnerability also with respect to displacement. On the other hand, it shows that other, not directly economic, factors are more important in explaining observed variations in the magnitude of displacement vulnerability.

Major challenges in understanding displacement vulnerability are the limited quality and granularity of the displacement data (Buhaug, 2023). Conceptual uncertainties, practical difficulties involved in collecting displacement data on the ground, and potential misreporting by media sources, mean that reported numbers are unlikely to be accurate. While the uncertainty is hard to quantify, the order or magnitude of displacements, which we have addressed in our analysis, may be a more reliable indication than the absolute number of displacements. Moreover, the available data fail to distinguish post-disaster displacement from preemptive evacuations. While the latter can save lives, both forms of displacement are associated with costs and burden³⁰. Our results do not appear to be primarily driven by the presence of preemptive evacuations in the data: sparsely populated areas, identified as particularly vulnerable in our study, are less likely to receive early warnings and to be targeted for planned evacuation than urban centers; and countries with high infant mortality rates are less likely to have the financial and institutional resources required for organizing effective evacuation campaigns. Disaggregated estimates by the Internal Displacement Monitoring Center (IDMC)⁶¹ for 2023 show that out of 3180 flood-induced displacement events in that year, only 46 were associated with preemptive evacuations, while 2171 were associated with post-disaster displacement (963 events had no related information). This record indicates that preemptive evacuations may constitute a minor fraction of all displacement events. Nevertheless, the missing distinction between post-disaster displacement and preemptive evacuations is a major caveat of our study, and may put important limits on the overall explanatory power of our models.

Many global studies used intensity-damage functions to represent vulnerability, for example, depth-damage functions are often used for flood risk assessments⁶². Previous studies have shown that flood depth explains a substantial part, but not all, of the variations in flood damages; simple models with only flood depth do not perform well⁶³. While flood-depth information is unavailable for the observed flood events studied here, we have included indicators such as elevation, slope, and flood duration in order to characterize some of the physical aspects of the hazard. Our finding that vulnerability tends to be higher in high-elevation and very low-elevation areas may be related to physical characteristics of mountain floods, but also socioeconomic and demographic differences between mountain regions and lowlands. Besides this, our study for the first time sheds light on the role of several social, economic, and political factors, for the variability in flood-induced displacement outcomes between as well as within countries. It thus adds to the literature on the determinants of vulnerability to climate events²⁵. To advance flood-displacement risk assessment, future studies may consider additional aspects of physical vulnerability alongside those of social vulnerability⁶⁴.

Our findings suggest that relying on economic growth on its own to reduce vulnerability to flood-induced displacement is delusive. Rather, targeted investments are needed in particular to improve coping capacities in rural communities, promote infrastructure access and safe housing for the urban poor, and combat social marginalization and extreme poverty so that forced displacement, as well as involuntarily immobility, can be reduced to a minimum and safe mobility can be a successful response and risk reduction strategy.

Methods

Vulnerability to flood-induced displacement

We estimate flood-displacement vulnerability as the ratio of displacement to exposure for individual events. First, we geocode the IDMC’s Global Internal Displacement Database (GIDD) to assign sub-national administrative units (provinces and districts) to each reported flood-displacement event, where possible. We extract textual location information provided in the GIDD and match it to the names of administrative units in GADM⁶⁵. We are able to identify the location of 1702 out of 3083 recorded displacement events between 2008 and 2018 at a subnational level; the remaining events are assigned the respective country as location. Note that this varying accuracy across the sample may affect the accuracy of the matching between displacement and flood events: Displacement events with available subnational location may be matched more accurately to flood events than displacement events for which only the country is given. Consequently, matching errors may be more likely in earlier records (2008–2012) that have no subnational location information in IDMC. However, we find no systematic difference between vulnerability factors calculated for the full record and for the period of 2013–2018 (Supplementary Fig. S2).

Next, we use the location and timestamp of each displacement event to find associated flood events from the Global Flood Database³³ (GFD), accounting for the possibility of multiple displacement events associated with a single flood event and vice versa⁶⁶. This yields a dataset (FLODIS⁶⁶) of flood extent and the associated number of displacements for 303 events that occurred in 72 countries between 2008 and 2018. This is fewer than the 461 flood events contained in the GFD for that period because for some flood events no corresponding displacement event could be identified. Finally, we multiply flood extent with gridded population data from the Global Human Settlement Layer⁶⁷ to estimate exposure, and calculate vulnerability (Supplementary Fig. S17). We also compute for each of the 72 countries the national median vulnerability across all events.

Many previous studies of vulnerability related to damages, mortality, and displacement, particularly at a global scale, have used model-based flood extent estimates^21,27,28,68, which allows studying a greater number of events. However, the accuracy of global flood models remains limited due to uncertainties concerning flood frequency analyses⁶⁹, the representation of flood defenses⁷⁰, or the river channel geometries⁷¹, and it is unclear whether such models represent individual flood events, and specifically those that have triggered displacement, truthfully enough to be useful for estimating displacement vulnerability. We use remote sensing-based flood extent estimates which represent actual flood events²⁹ and are likely to yield more realistic estimates of exposure and vulnerability than model-based products. We note that the GFD, and therefore the scope of our study, is limited to fluvial and coastal floods of sufficient size and duration that can be captured by satellite instruments. In particular, these may miss flash floods, as well as small-scale features such as flooded streets in urban areas, or flooding below dense tree cover³³.

Predictors of vulnerability

In the next step, we identify possible drivers of the vulnerability to flood-induced displacement, drawing on literature related to flood vulnerability and disaster vulnerability more generally^{22,23,24,40,51,72,73}. We selected indicators for which empirical evidence or theoretical arguments suggest a plausible association with vulnerability, and for which data were available for the relevant time period at national or subnational level (Tables 2 and 3). For instance, studying socio-economic covariates of flood-induced fatalities, Reimann et al.⁷³ found education levels the most significant variable, with the share of elderly population, income inequality, healthcare infrastructure, and rural settlements also influential. On the national level, Toya and Skidmore⁷⁴ found economic losses from natural disasters are lower in countries with higher income and higher educational attainment.

Table 2 Event-level predictors

Full size table

Table 3 Country-level predictors

Full size table

We log₁₀-transform indicators whose values are very unevenly distributed, such that both our target variable – log₁₀(vulnerability) - as well as our predictor variables are either approximately normally or uniformly distributed. Due to the log-transformation, six observations containing zeroes (1 in population density, 3 in urban area, 2 in elevation) were dropped in the event-level analysis, so that the sample there includes only 297 events instead of 303. In the country-level analysis, all 311 events enter the calculation of the national median vulnerability values. We also introduce a dummy variable as a benchmark to measure the significance of a given predictor’s importance. The dummy variable is constructed by assigning a random value between 0 and 1 to this predictor for each entry. Descriptive statistics for all indicators can be found in Supplementary Table S3.

Regression models

Random forests are a tree-based type of supervised machine learning method for classification and regression tasks³⁷. Random forest models require no prior assumptions about the functional form of the relationship between predictor and target variable, nor about the existence or absence of interactions between different predictors; and they can handle relatively small datasets⁵⁹. We use the random forest regressor of the Python module “Scikit-learn”⁷⁵ version 0.24.1 with the default settings of 100 trees and without sample bootstrapping; no hyperparameter tuning was performed. In the event-level analysis, we account for unobserved country-specific characteristics (Supplementary Fig. S2) by applying a mixed-effects random forest²⁶ (MERF), with the ISO3 country code as random effect covariate, using the “MERF” Python package version 1.0 (https://github.com/manifoldai/merf).

We construct random forest models using up to three predictors. We test all possible predictor combinations, except for a number of predictors in the country-level analysis which are mutually related and highly correlated (Supplementary Fig. S10) and which we therefore do not combine with one another (Table 3). That is, we do not combine any two of the indicators infant mortality rate, population ≤14 years, population ≥65 years, and population growth, as these are all related to the age structure of the population and have Pearson correlation coefficients of 0.67 or higher between them; and we do not combine population growth and urban population growth, as the urban population growth rate may closely follow the total population growth rate and the Pearson correlation coefficient between the two variables is 0.9.

Predictor ranking

As we are interested in which predictors explain vulnerability best, we assess their contribution to explanatory skill across all models (predictor combinations), separately for the event-level and country-level analysis. We first split the data into training sets and test sets that are used to fit the model and predict the vulnerability, respectively. The number of observations is limited, so we choose a leave-one-out cross-validation (LOOCV)⁷⁶, splitting the data n times into a single-item test set and a training set of size n−1, where n is the total number of observations. To test the capability of explaining variations in vulnerability using the predictor variables, we collect the n vulnerability predictions and compute a single R² (the coefficient of determination) from the n prediction-observation pairs.

Predictor importance in multivariate models may be obscured by that of other predictors present in a given model. Hence, ranking the importance of variables simply based on R² values may be misleading. Therefore, we assess predictor importance by measuring the decrease in a model’s R² due to randomly permuting the values of the predictor of interest; based on the hypothesis that if a predictor is of zero importance, randomly permuting its values does not alter the accuracy of the prediction³⁷. For each predictor, we calculate the decrease in R² due to randomization, for all models that include that predictor and have an absolute R² ≥ 0.01 (to exclude models which have a negative R² to start with). We then rank predictors by the median decrease in R² across all relevant models.

Alternatively, we also rank predictors by the median increase, due to predictor randomization, in the value of the Akaike (AIC) or Bayesian Information Criterion (BIC). AIC and BIC account for different numbers of predictors between models, and lower values correspond to higher-quality models. While the LOOCV also penalizes overfitting (and shows asymptotic equivalence to these prediction error estimators^77,78), it does so differently than AIC and BIC, therefore these measures provide complementary ways of model selection or ranking.

Partial dependence plots

We use partial dependence plots (PDPs)⁷⁹ to visualize and analyze the marginal effect of each predictor on the predicted vulnerability. The PDPs are limited to a maximum of two variables, but a collection of PDPs can serve to show the partial dependence of each predictor⁸⁰. We train a random forest model (a MERF model in the event-level analysis) on all data, including the predictors of the model that, in the predictor importance analysis described above, exhibits the highest decrease in R² after randomizing the predictor of interest. Next, we duplicate the training set 100 times and assign for all values of the predictor of interest one of 100 increments between its minimum and maximum value (across space and time), as similarly done by Vogel et al.⁸¹ Then, for every increment, the trained models predict 100 times an estimated vulnerability value, resulting in a distribution of predictions at each increment (the conditional response distribution; represented in the figures by its median as well as 5th/95th percentile and 33th/66th percentile). The change of this distribution across all increments indicates the marginal effect on vulnerability of the predictor in question, all else being equal. This last assumption is not necessarily realistic, as the joint distribution of predictor values is likely not fully random or stretched out evenly across the range sampled. Also, since the model used to derive the PDPs does not employ any cross-validation, it is prone to overfitting to small-scale features; and the model depends on data density, such that spare intervals with no data for the relevant predictor end up with nearly-horizontal lines. Thus, we do not assign any meaning to these artificial features, but only interpret broad trends over large ranges with continuous data. Given this caveat, the PDPs can nevertheless indicate the direction and approximate functional form of the predictor’s effect.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Flood data is available from the Global Flood Database at (https://global-flood-database.cloudtostreet.ai). Displacement data is available from IDMC’s Global Internal Displacement Database at (https://www.internal-displacement.org/database/), and from the Dartmouth Flood Observatory at (https://floodobservatory.colorado.edu/Archives/index.html).

Code availability

The Python code used to produce the numbers, tables and figures in this paper is available at (https://doi.org/10.5281/zenodo.16620061).

References

IDMC. Global Report on Internal Displacement. https://www.internal-displacement.org/global-report/grid2024/ (2024).
UN OCHA. Guiding Principles on Internal Displacement. https://www.internal-displacement.org/publications/ocha-guiding-principles-on-internal-displacement (2004).
Desai, B. et al. Addressing the human cost in a changing climate. Science 372, 1284–1287 (2021).
Article ADS PubMed CAS Google Scholar
Cardona, O.-D. et al. Determinants of Risk: Exposure and Vulnerability. in Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation (eds. Field, C. B., Barros, V., Stocker, T. F. & Dahe, Q.) 65–108 (Cambridge University Press, 2012).
Oppenheimer, M. et al. Emergent risks and key vulnerabilities. in Climate Change 2014 Impacts, Adaptation and Vulnerability: Part A: Global and Sectoral Aspects 1039–1100 (Cambridge University Press, 2015).
Berghuijs, W. R., Aalbers, E. E., Larsen, J. R., Trancoso, R. & Woods, R. A. Recent changes in extreme floods across multiple continents. Environ. Res. Lett. 12, 114035 (2017).
Article ADS Google Scholar
Blöschl, G. et al. Changing climate both increases and decreases European river floods. Nature 573, 108–111 (2019).
Fowler, H. J. et al. Anthropogenic intensification of short-duration rainfall extremes. Nat. Rev. Earth \ Environ. 2, 107–122 (2021).
Article ADS Google Scholar
Gudmundsson, L. et al. Globally observed trends in mean and extreme river flow attributed to climate change. Science 371, 1159–1162 (2021).
Article ADS PubMed CAS Google Scholar
Hirabayashi, Y. et al. Global flood risk under climate change. Nat. Clim. Change 3, 816–821 (2013).
Article ADS Google Scholar
Lange, S. et al. Projecting exposure to extreme climate impact events across six event categories and three spatial scales. Earth’s Futur. 8, e2020EF001616 (2020).
Milly, P. C. D., Wetherald, R. T., Dunne, K. A. & Delworth, T. L. Increasing risk of great floods in a changing climate. Nature 415, 514–517 (2002).
Article ADS PubMed CAS Google Scholar
Rentschler, J., Salhab, M. & Jafino, B. A. Flood exposure and poverty in 188 countries. Nat. Commun. 13, 3527 (2022).
Article ADS PubMed PubMed Central CAS Google Scholar
Jongman, B., Ward, P. J. & Aerts, J. C. J. H. Global exposure to river and coastal flooding: long term trends and changes. Glob. Environ. Change 22, 823–835 (2012).
Article Google Scholar
Winsemius, H. C. et al. Global drivers of future river flood risk. Nat. Clim. Change 6, 381–385 (2016).
Article ADS Google Scholar
Sauer, I. J., Mester, B., Frieler, K., Zimmermann, S. & Schewe, J. Limited progress in global reduction of vulnerability to fl ood impacts over the past two decades. 5, 239 (2024).
Feldmeyer, D., Birkmann, J. & Welle, T. Development of human vulnerability 2012–2017. J. Extrem. Events 04, 1850005 (2017).
Article Google Scholar
Vestby, J., Schutte, S., Tollefsen, A. F. & Buhaug, H. Societal determinants of flood-induced displacement. Proc. Natl Acad. Sci. 121, e2206188120 (2024).
Article PubMed PubMed Central CAS Google Scholar
Ronco, M. et al. Exploring interactions between socioeconomic context and natural hazards on human population displacement. Nat. Commun. 14, 8004 (2023).
Article ADS PubMed PubMed Central CAS Google Scholar
Kam, P. M. et al. Global warming and population change both heighten future risk of human displacement due to river floods. Environ. Res. Lett. 16, 044026 (2021).
Article ADS Google Scholar
Kakinuma, K. et al. Flood-induced population displacements in the world. Environ. Res. Lett. 15, 124029 (2020).
Article ADS CAS Google Scholar
Cross, J. A. Megacities and small towns: different perspectives on hazard vulnerability. Environ. Hazards 3, 63–80 (2001).
Google Scholar
Fatemi, F., Ardalan, A., Aguirre, B., Mansouri, N. & Mohammadfam, I. Social vulnerability indicators in disasters: Findings from a systematic review. Int. J. Disaster Risk Reduct. 22, 219–227 (2017).
Article Google Scholar
Rufat, S., Tate, E., Burton, C. G. & Maroof, A. S. Social vulnerability to floods: Review of case studies and implications for measurement. Int. J. Disaster Risk Reduct. 14, 470–486 (2015).
Article Google Scholar
Brooks, N., Neil Adger, W. & Mick Kelly, P. The determinants of vulnerability and adaptive capacity at the national level and the implications for adaptation. Glob. Environ. Change 15, 151–163 (2005).
Article Google Scholar
Hajjem, A., Bellavance, F. & Larocque, D. Mixed-effects random forest for clustered data. J. Stat. Comput. Simul. 84, 1313–1328 (2014).
Article MathSciNet Google Scholar
Jongman, B. et al. Declining vulnerability to river floods and the global benefits of adaptation. Proc. Natl Acad. Sci. 112, E2271–E2280 (2015).
Article PubMed PubMed Central CAS Google Scholar
Tanoue, M., Hirabayashi, Y. & Ikeuchi, H. Global-scale river flood vulnerability in the last 50 years. Sci. Rep. 6, 36021 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Alfieri, L. et al. A global network for operational flood risk reduction. Environ. Sci. \ Policy 84, 149–158 (2018).
Article Google Scholar
McAdam, J. Evacuations: a form of disaster displacement? forced migration. Review 56, 57 (2022).
Google Scholar
Paul, N., Galasso, C. & Baker, J. Household displacement and return in disasters: a review. Nat. Hazards Rev. 25, 03123006 (2024).
Article Google Scholar
Rossi, L. et al. A new methodology for probabilistic flood displacement risk assessment: the case of Fiji and Vanuatu. Front. Clim. 6, 1345258 (2024).
Article Google Scholar
Tellman, B. et al. Satellite imaging reveals increased proportion of population exposed to floods. Nature 596, 80–86 (2021).
Article ADS PubMed CAS Google Scholar
Leyk, S. et al. The spatial allocation of population: a review of large-scale gridded population data products and their fitness for use. Earth Syst. Sci. Data 11, 1385–1409 (2019).
Article ADS Google Scholar
Carozza, D. A. & Boudreault, M. A global flood risk modeling framework built with climate models and machine learning. J. Adv. Modeling Earth Syst. 13, 1–21 (2021).
Google Scholar
Buhaug, H. What is in a number? Some reflections on disaster displacement modelling. Int. Migr. 61, 353–357 (2023).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Fox, S., Agyemang, F., Hawker, L. & Neal, J. Integrating social vulnerability into high-resolution global flood risk mapping. Nat. Commun. 15, 3155 (2024).
Article ADS PubMed PubMed Central CAS Google Scholar
Cutter, S. L., Ash, K. D. & Emrich, C. T. Urban–rural differences in disaster resilience. Ann. Am. Assoc. Geographers 106, 1236–1252 (2016).
Google Scholar
Jamshed, A., Birkmann, J., Feldmeyer, D. & Rana, I. A. A conceptual framework to understand the dynamics of rural–urban linkages for rural flood vulnerability. Sustainability 12, 2894 (2020).
Article ADS Google Scholar
Fuchs, S. et al. Short communication: a model to predict flood loss in mountain areas. Environ. Model. Softw. 117, 176–180 (2019).
Article Google Scholar
Jakob, M. et al. Debris-flood hazard assessments in steep streams. Water Resour. Res. 58, e2021WR030907 (2022).
Sung, C.-H. & Liaw, S.-C. Using spatial pattern analysis to explore the relationship between vulnerability and resilience to natural hazards. Int. J. Environ. Res. Public Health 18, 5634 (2021).
Article PubMed PubMed Central Google Scholar
Frigerio, I. et al. A GIS-based approach to identify the spatial variability of social vulnerability to seismic hazard in Italy. Appl. Geogr. 74, 12–22 (2016).
Article Google Scholar
Scussolini, P. et al. FLOPROS: an evolving global database of flood protection standards. Nat. Hazards Earth Syst. Sci. 16, 1049–1061 (2016).
Article ADS Google Scholar
Andrews, K. M., Brouillette, D. B. & Brouillette, R. T. Mortality, Infant. in Encyclopedia of Infant and Early Childhood Development 343–359 (Elsevier, 2008).
Ahmed, S. A., Cruz, M., Quillin, B. & Schellekens, P. Demographic Change and Development: A Global Typology. World Bank Policy Research Working Paper No. 7893, (2016).
Birdsall, N., Kelley, A. C. & Sinding, S. Population Matters: Demographic Change, Economic Growth, and Poverty in the Developing World. (OUP Oxford, 2001).
Desai, M. Human development. Eur. Economic Rev. 35, 350–357 (1991).
Article Google Scholar
Das Gupta, M., Bongaarts, J. & Cleland, J. Population, poverty, and sustainable development: A review of the evidence. World Bank Policy Research Working Paper (2011).
Cutter, S. L., Boruff, B. J. & Shirley, W. L. Social vulnerability to environmental hazards *. Soc. Sci. Q. 84, 242–261 (2003).
Article Google Scholar
Jurgilevich, A., Räsänen, A., Groundstroem, F. & Juhola, S. A systematic review of dynamics in climate risk and vulnerability assessments. Environ. Res. Lett. 12, 13002 (2017).
Article ADS Google Scholar
Kummu, M., Kosonen, M. & Masoumzadeh Sayyar, S. Downscaled gridded global dataset for gross domestic product (GDP) per capita PPP over 1990–2022. Sci. Data 12, 178 (2025).
Article PubMed PubMed Central Google Scholar
The Nansen Initiative. Agenda for the Protection of Cross-Border Displaced Persons in the Context of Disaster and Climate Change. 1,{Platform} on Disaster Displacement. (2015).
Cissé, G. et al. 2022: Health, Wellbeing, and the Changing Structure of Communities. In: Climate Change 2022: Impacts, Adaptation and Vulnerability. Contribution of Working Group II to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Climate Change 2022–Impacts, Adaptation and Vulnerability (2022).
Rubin, O. Social vulnerability to climate-induced natural disasters: {Cross}-provincial evidence from {Vietnam}. Asia Pac. Viewp. 55, 67–80 (2014).
Article Google Scholar
Bishop-Royse, J., Lange-Maia, B., Murray, L., Shah, R. C. & DeMaio, F. Structural racism, socio-economic marginalization, and infant mortality. Public Health 190, 55–61 (2021).
Article PubMed CAS Google Scholar
Sanders, B. F. et al. Large and inequitable flood risks in {Los} {Angeles}, {California}. Nat. Sustain 6, 47–57 (2023).
Article Google Scholar
Scornet, E., Biau, G. & Vert, J.-P. Consistency of random forests. Ann. Stat. 43, 1716–1741 (2015).
Article MathSciNet Google Scholar
Formetta, G. & Feyen, L. Empirical evidence of declining global vulnerability to climate-related hazards. Glob. Environ. Change 57, 101920 (2019).
Article PubMed PubMed Central Google Scholar
Global Internal Displacement Database. IDMC - Internal Displacement Monitoring Centre https://www.internal-displacement.org/database/displacement-data.
Ward, P. J. et al. Review article: Natural hazard risk assessments at the global scale. Nat. Hazards Earth Syst. Sci. 20, 1069–1096 (2020).
Article ADS Google Scholar
Wagenaar, D., de Jong, J. & Bouwer, L. M. Multi-variable flood damage modelling with limited data using supervised learning approaches. Nat. Hazards Earth Syst. Sci. 17, 1683–1696 (2017).
Article ADS Google Scholar
Flanagan, B., Gregory, E., Hallisey, E., Heitgerd, J. & Lewis, B. A social vulnerability index for disaster management. J. Homeland Security and Emergency Management 8, 0000102202154773551792 (2011).
GADM. Database of Global Administrative Areas. https://gadm.org/data.html (2018). Accessed 2025-09-12.
Mester, B., Frieler, K. & Schewe, J. Human displacements, fatalities, and economic damages linked to remotely observed floods. Sci. Data 10, 482 (2023).
Article PubMed PubMed Central Google Scholar
Schiavina, M., Freire, S. & MacManus, K. GHS population grid multitemporal (1975-1990-2000-2015), R2019A. European Commission, Joint Research Centre (JRC) 10.2905/0C6B9751-A71F-4062-830B-43C9F432370F (2019).
Sauer, I. J. et al. Climate signals in river flood damages emerge under sound regional disaggregation. Nat. Commun. 12, 2128 (2021).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhou, X., Ma, W., Echizenya, W. & Yamazaki, D. The uncertainty of flood frequency analyses in hydrodynamic model simulations. Nat. Hazards Earth Syst. Sci. 21, 1071–1085 (2021).
Article ADS Google Scholar
Bates, P. D. et al. Combined Modeling of US fluvial, pluvial, and coastal flood hazard under current and future climates. Water Resour. Res. 57, e2020WR028673 (2021).
Article ADS Google Scholar
Ward, P. J. et al. Usefulness and limitations of global flood risk models. Nat. Clim. Change 5, 712–715 (2015).
Article ADS Google Scholar
Tate, E., Rahman, M. A., Emrich, C. T. & Sampson, C. C. Flood exposure and social vulnerability in the United States. Nat. Hazards 106, 435–457 (2021).
Article Google Scholar
Reimann, L., Koks, E., de Moel, H., Ton, M. J. & Aerts, J. C. J. H. An empirical social vulnerability map for flood risk assessment at global scale (“GlobE-SoVI”). Earth’s Future 12, e2023EF003895 (2024).
Toya, H. & Skidmore, M. Economic development and the impacts of natural disasters. Econ. Lett. 94, 20–25 (2007).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet Google Scholar
Davis, B. M. Uses and abuses of cross-validation in geostatistics. Math. Geol. 19, 241–248 (1987).
Article Google Scholar
Fang, Y. Asymptotic equivalence between cross-validations and akaike information criteria in mixed-effects models. J. Data Sci. 9, 15–21 (2022).
MathSciNet Google Scholar
Stone, M. An asymptotic equivalence of choice of model by cross-validation and Akaike’s criterion. J. R. Stat. Soc. Ser. B (Methodol.) 39, 44–47 (1977).
Article MathSciNet Google Scholar
Friedman, J. H. Greedy function approximation: {A} gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).
Article MathSciNet Google Scholar
Hastie, T., Tibshirani, R., Friedman, J. H. & Friedman, J. H. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2 (Springer, 2009).
Vogel, E. et al. The effects of climate extremes on global agricultural yields. Environ. Res. Lett. 14, 054010 (2019).
Article ADS Google Scholar
Center For International Earth Science Information Network-CIESIN-Columbia University. Gridded Population of the World, Version 4 (GPWv4): Basic Demographic Characteristics, Revision 11. Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC) https://doi.org/10.7927/H46M34XX (2018).
Klein Goldewijk, C. G. M. A historical land use data set for the Holocene; HYDE 3.2 (replaced). DANS Data Station Archaeol. https://doi.org/10.17026/DANS-ZNK-CFY3 (2016).
Article Google Scholar
Nirandjan, S., Koks, E. E., Ward, P. J. & Aerts, J. C. J. H. A spatially-explicit harmonized global dataset of critical infrastructure. Sci. Data 9, 150 (2022).
Article PubMed PubMed Central Google Scholar
Amatulli, G. et al. A suite of global, cross-scale topographic variables for environmental and biodiversity modeling. Sci. Data 5, 180040 (2018).
Article PubMed PubMed Central Google Scholar
WDI. World Development Indicators 2022 (World Bank). https://databank.worldbank.org/source/world-developmentindicators/preview/on (2022). Accessed 2022-02-22.
Smits, J. & Permanyer, I. The subnational human development database. Sci. Data 6, 190038 (2019).
Article PubMed PubMed Central Google Scholar
Coppedge, M. V-Dem Dataset 2021. Varieties Democracy (V.-Dem) Proj. https://doi.org/10.23696/VDEMDS21 (2021).
Article Google Scholar
Ahrens, J. & Rudolph, P. M. The importance of governance in risk reduction and disaster management. J. Contingencies Crisis Manag. 14, 207–220 (2006).
Article Google Scholar
Peduzzi, P., Dao, H., Herold, C. & Mouton, F. Assessing global exposure and vulnerability towards natural hazards: the Disaster Risk Index. Nat. Hazards Earth Syst. Sci. 9, 1149–1159 (2009).
Article ADS Google Scholar

Download references

Acknowledgements

This work received funding from the EU Horizon 2020 program, project number 869395 (HABITABLE), and the Federal Ministry of Education and Research (BMBF), funding code 01LS2001A.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Benedikt Mester
Present address: Swiss Re, Arabellastraße 30, Munich, Germany

Authors and Affiliations

Potsdam Institute for Climate Impact Research (PIK), Member of the Leibniz Association, Potsdam, Germany
Benedikt Mester, Katja Frieler & Jacob Schewe
Institute of Environmental Science and Geography, University of Potsdam, Potsdam, Germany
Katja Frieler & Oliver Korup
Institute of Geosciences, University of Potsdam, Potsdam, Germany
Oliver Korup
CGIAR Climate Security, Alliance Biodiversity International and CIAT, Rome, Italy
Bina Desai

Authors

Benedikt Mester
View author publications
Search author on:PubMed Google Scholar
Katja Frieler
View author publications
Search author on:PubMed Google Scholar
Oliver Korup
View author publications
Search author on:PubMed Google Scholar
Bina Desai
View author publications
Search author on:PubMed Google Scholar
Jacob Schewe
View author publications
Search author on:PubMed Google Scholar

Contributions

J.S. and K.F. conceived the study. J.S., B.M., and O.K. designed the study. B.M. processed and analyzed data and performed simulations. J.S. and B.M. wrote the paper. All authors including B.D. contributed to the interpretation of the results.

Corresponding author

Correspondence to Jacob Schewe.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Halvard Buhaug and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mester, B., Frieler, K., Korup, O. et al. Socioeconomic predictors of vulnerability to flood-induced displacement. Nat Commun 16, 8296 (2025). https://doi.org/10.1038/s41467-025-64015-8

Download citation

Received: 08 October 2024
Accepted: 04 September 2025
Published: 16 September 2025
DOI: https://doi.org/10.1038/s41467-025-64015-8

Subjects

Abstract

Similar content being viewed by others

Limited progress in global reduction of vulnerability to flood impacts over the past two decades

Integrating social vulnerability into high-resolution global flood risk mapping

The wider the gap between rich and poor the higher the flood mortality

Introduction

Results

Global displacement vulnerability

Predictive Models

Important predictors at the event-level

Important predictors at the country-level

Discussion

Methods

Vulnerability to flood-induced displacement

Predictors of vulnerability

Regression models

Predictor ranking

Partial dependence plots

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links