National-scale landslide susceptibility and risk mapping of India using a hybrid data-driven approach

Khan, Imran; Bahuguna, Harish; Kainthola, Ashutosh; Ahmed, Rayees; Asgher, Md. Sarfaraz

doi:10.1038/s41598-025-33446-0

Download PDF

Article
Open access
Published: 31 December 2025

National-scale landslide susceptibility and risk mapping of India using a hybrid data-driven approach

Scientific Reports volume 16, Article number: 3519 (2026) Cite this article

1548 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Landslides pose a persistent and deadly hazard across India’s diverse terrain, demanding robust, high-resolution national-scale assessments to guide effective risk mitigation and policy decisions. This study addresses a critical research gap in landslide susceptibility and risk assessment across India at a high spatial resolution of 90 m × 90 m on a national scale. Leveraging a robust landslide inventory comprising 109,504 documented events and incorporating causative factors selected through expert judgment, three models Analytic Hierarchy Process (AHP), Frequency Ratio (FR), and Yule’s Coefficient (Yc) were employed to reduce model bias and identify the most reliable susceptibility and risk mapping approach through comprehensive field validation and map interpretation. The results indicate, very high landslide susceptibility zones cover 4.0% (AHP), 4.2% (Yc), and 4.5% (FR) of India’s land area, while combined high and very high susceptibility classes account for 11.0% (AHP), 10.4% (Yc), and 10.5% (FR). Model evaluation demonstrates robust predictive performance, with accuracies of 89.1% (AHP), 88.9% (Yc), and 90.5% (FR) according to the ROC curve, and 86.7%, 85.6%, and 87.2% respectively via the precision-recall curve. Minor variations in susceptibility extent and accuracy among models underscore the resilience of the input data, however the Yc model provides a a more ground representative output based on factor weighting. Utilizing the Yc model and exposure data, approximately 8,606 km² (0.26% of India) are classified as having a high to extremely high landslide risk at a qualitative scale. Overall, the susceptibility analysis indicates that approximately 0.34 million km² is categorized as having high and very high susceptibility under Yc model. The analysis identifies Nagaland (55%), Mizoram (53.1%), Arunachal Pradesh (52.1%), Sikkim (45.0%), Uttarakhand (40.0%), Manipur (30.6%), Himachal Pradesh (28.9%), Meghalaya (22.0%), Jammu & Kashmir (20.2%), and Tripura (8.0%) as the ten most landslide-susceptible States and Union Territories , listed in descending order of susceptibility. These findings provide critical insights for disaster risk reduction strategies and policymaking, enhancing landslide prediction, mitigation, and resilience planning across susceptible regions at mapped scale.

A combined analysis of frequency ratio and analytical hierarchy process for landslide susceptibility assessment in Tenta, South Wollo, Ethiopia

Article Open access 23 May 2025

Evaluation of landslide susceptibility of mountain highway based on RF and SVM models

Article Open access 10 July 2025

Enhancing landslide disaster prediction by evaluating non landslide area sampling in machine learning models for Spiti Valley India

Article Open access 10 April 2025

Introduction

Mountain regions globally cover more than 27% of the Earth’s land surface and are home to over 1 billion people, representing approximately 14% of the global population¹. These regions are often characterized by steep and unstable slopes, which create significant challenges for their inhabitants. Between 2005 and 2014, countries with mountainous terrains accounted for over 70% of disaster-related fatalities^2,3. However, these statistics likely underrepresent the broader impact of recurrent and localized mass movements, which are considerable barriers to sustainable livelihoods and regional development^4,5,6,7,8. Landslides, which are among the deadliest natural hazards, account for approximately 17% of all natural disasters^9,10. This trend is expected to rise in the future, exacerbated by increasing urbanization, deforestation, and the impacts of climate change^11,12. Extreme precipitation, often amplified by global warming, is a key driver for landslide occurrences, with the potential to trigger nonlinear increases in landslide activity¹³. The annual global economic losses due to landslides are estimated to reach approximately 18 billion Euros, with significant social and environmental repercussions¹⁴. From 1980 to 2017, data from the Emergency Events Database (EM-DAT) documented 631 landslide disaster events, resulting in 44,541 fatalities¹⁵. Further, the global catastrophic landslide database, covering 2004–2010, reveals an average of 374 catastrophic landslides annually, leading to an average of 4,617 fatalities each year⁹.

Numerous studies have also highlighted the temporal and spatial distribution of landslides, underlining their status as a major global hazard with substantial human and economic consequences every year^{16,17,18,19,20}. Countries in regions such as Colombia, Peru, Brazil, Nicaragua, El Salvador, Italy, Nepal, India, China, and New Zealand are particularly prone to landslides²¹. A recent study by²² projects a significant increase in landslide-related population risk across different timeframes (1971–2000, 2031–2060, and 2066–2095), with countries like China, India, Turkey, the Philippines, and Nepal expected to face heightened risks. India, in particular, ranks second globally, with projected increases in population risk from 360 to 760 (111%) between 2031 and 2060, and a sustained high level of 690 (92%) from 2066 to 2095. This highlights the growing landslide risks in Asia, especially in the coming decades. Asia is the continent with the highest landslide risk, particularly along the Himalayan arc^10,23. High Mountain Asia has long been recognized as a hotspot for landslide risks²⁴, and studies suggest that landslide hazards in this region are expected to increase further in the future¹⁷. According to EM-DAT, Asia accounted for 58% of all global landslides and 41% of all disaster-related incidents from 1982 to 2022²⁵. Southern Asia, home to countries like India, Nepal, and Pakistan, is particularly susceptible, representing 34% of Asia’s landslide occurrences²⁵. The Himalayan Mountain range alone contributes to 89% of landslides in Southern Asia¹².

The risk of casualties from landslides is expected to rise markedly in the forthcoming decades, especially in Asia, which comprises 60% of the ten countries with the highest risk²². India is projected to incur an average of 760 fatalities annually from 2031 to 2060, ranking after China (1,670) and before Afghanistan, the Philippines, and Indonesia²². Relative to the 1971–2000 baseline, these nations are anticipated to experience annual increases over 200 casualties²². This escalation is propelled by increased extreme precipitation and anticipated population development in mountainous, landslide-prone regions²².

India, being one of the most landslide-prone countries, faces considerable challenges in mitigating these risks. With an estimated population of 1.46 billion in 2025, India represents approximately 17.78% of the global population²⁶. As per Geological Survey of India (GSI), around 0.42 million square kilometers, or 12.6% of India’s land area is prone to landslides²⁷. This includes the NWH, NEH and Ghats regions of India. Landslides in India are primarily triggered by precipitation, with the Himalayan region also susceptible to earthquake-triggered landslides due to its location in the highest seismic zones (Zones IV and V) of India²⁸. The region has experienced numerous devastating earthquakes, such as the Shillong earthquake (8.1 magnitude) in 1897, which triggered significant landslides²⁹.

Landslide susceptibility (LS), hazard, and risk zoning are three interconnected approaches that play a critical role in land-use planning, offering valuable insights into the potential impacts of landslides on communities and infrastructure^30,31,32,33. Susceptibility mapping, in particular, serves as the initial step in assessing the likelihood of landslides in a given area by analyzing the influence of various geo-environmental factors, excluding the temporal element of landslide frequency^34,35,36. High-resolution landslide susceptibility zonation (LSZ) studies at the national or global scale have been limited due to the unavailability of comprehensive landslide inventories and the high computational demands required for analysis^37,38. Presently, the majority of extensive LSZ maps depend on heuristic methodologies and are often generated at low spatial resolutions. Recent studies have investigated the application of machine learning (ML) and deep learning (DL) techniques; however, these endeavors frequently utilize restricted input datasets. Table 1 encapsulates prior studies on LSZ, spanning global to national levels. It underscores the diversity of approaches, databases, and spatial resolutions employed in global, continental, and national research. However, with advancements of technology and availability of open-source data, it is now possible to develop more detailed susceptibility maps.

A variety of strategies are employed in landslide susceptibility zoning, and the selection of method substantially influences predicted efficacy. These methods are typically divided into two broad categories: qualitative (knowledge-driven) approaches, which rely on the expert judgment of researchers^32,36,39,40 and quantitative (data-driven) approaches, which apply mathematical or statistical models to correlate landslides with their controlling factors^41,42. Recent studies indicate that methods such as AHP and statistical models like Yc and FR are among the most commonly used for both large and small scale LSZ^{32,43,44,45,46,47}. This study utilizes a combination of these models to mitigate the potential bias from any single method, with the results being validated through ROC analysis.

This research enhances National Landslide Susceptibility Zonation (NLSZ) at high resolution by overcoming significant limitations in prior evaluations. Although⁴⁸ produced a landslide susceptibility map for the entire country of India, their research was limited by a small number of causative factors, without considering important predictors like lithology, geomorphology, land use/cover, and earthquake density, all of which are important factors that influence the likelihood of landslides. This research incorporates these essential predictors with additional geo-environmental predictors, markedly improving the predicted accuracy of susceptibility mapping. The identification of key predictors relies on comprehensive expert insight, an exhaustive literature review, and meticulous multicollinearity assessment, ensuring a resilient model. This work utilizes a spatial resolution of 90 m, an enhancement above the 100 m resolution⁴⁸, hence offering enhanced spatial information. To further enhance susceptibility evaluations, the analysis is confined to landslide prone States and Union Territories (SUTs), omitting areas with gentler slopes to reduce categorization errors. In addition, this study includes a heuristic to data data-driven approach to eliminate dependency on a single model and enhance the results through comparative assessment of the models. The susceptibility models are validated by receiver operating characteristic (ROC) curve analysis, utilizing extensive landslide datasets (including all landslides, training landslides, and testing landslides) to robotic reliability. Upon validation, the best susceptibility zonation map is utilized to evaluate landslide risk, providing essential insights for disaster mitigation.

The establishment of a novel, high-resolution methodological framework for the purpose of landslide susceptibility zonation and risk assessment on a national scale is one of the most important contributions that this study has made. The approach improves the precision, reliability, and usefulness of susceptibility models by systematically integrating major causal elements at a spatial resolution of ninety meters across the entire area. The results of this research have significant value not only for the scientific community but also for those who make decisions regarding climate and environmental policy, international funding agencies like the World Bank, governmental bodies, and industries that are involved in infrastructure planning, land-use management, and disaster risk reduction.

Table 1 Overview of previous studies on LSM from global to national scale.

Full size table

Study area

The chosen eighteen SUTs exemplify India’s most landslide-prone areas, featuring varied physiographic zones and hazard characteristics. The NWH and NEH include steep gradients, elevated seismic activity, and substantial monsoonal precipitation, resulting in recurrent landslides. The Peninsular region include states like Kerala, Goa, Maharashtra, Tamil Nadu, and Karnataka, which feature the Western Ghats, where orographic rainfall and mountainous topography increase the risk of landslides. SUTs with gentler slopes and low susceptibilty are being eliminated to preserve the geo-environmental relevance necessary for precise LSZ. The chosen study area encompasses the complete geographical and hazard diversity of India’s landslide-prone regions, rendering it essential for formulating effective national strategy. This extensive coverage facilitates the formulation of region-specific mitigation plans within a unified national framework.

The study area (Fig. 1), covering approximately 1,365,826 square kilometers, includes regions in India highly susceptible to landslides, such as the NWH and NEH, as well as the Western Ghats and Konkan hills (https://ndma.gov.in). Landslides are most prevalent in the NWH, followed by the NEH and the Western Ghats, as per the records of GSI²⁷. The river systems in these regions primarily flow from north to south, driven by steep gradients and the presence of glaciers in the northern and northeastern parts of India. Mawsynram in Meghalaya holds the record for the highest annual rainfall in India and is recognized as the wettest place on Earth, with an average annual rainfall of 11,872 millimeters⁵⁹. In 1985, Mawsynram set a world record by receiving an astounding 26,000 millimeters of rainfall, according to the Guinness Book of World Records. The Himalayan belt, being a young and tectonically active geological region, is predominantly composed of meta-sedimentary rocks that are highly prone to denudation and erosion¹². The combination of steep slopes and rapidly flowing rivers in this region leads to significant toe erosion, resulting in unstable slopes. Besides geological factors, anthropogenic activities and natural triggers, such as the intensity and duration of rainfall, play a critical role in the spatial and temporal occurrence of landslides. The majority of landslides in these areas occur on the windward sides of the southwest slopes of the Western Ghats and Himalayas, where rainfall is heaviest. Human activities, particularly the construction and widening of roads, further increase the risk of landslides in these susceptible regions.

The research area is mostly situated in seismically active regions²⁸, notably Seismic Zones IV and V, which indicate a significant potential for earthquake-induced hazards, including landslides. Elevation varies from 12 to 8,546 m above mean sea level, indicating significant topographic diversity. The region has significant geological, physiographic, and geomorphic diversity, featuring lithological strata from the Holocene to the Paleoproterozoic era²⁷.

Materials and methods

The GSI serves as the nodal agency for landslide-related activities across the country and has made significant contributions to the field through extensive work in landslide inventory and susceptibility mapping in various regions of India²⁷. To achieve comprehensive national coverage, GSI initiated the National Landslide Susceptibility Mapping (NLSM), conducted on a 1:50,000 scale, between 2013 and 2020. Recognizing the reliability and authenticity of GSI’s database, the inventory, along with lithology, structural lineament, and other relevant datasets, has been sourced from the GSI website (www.gsi.gov.in) for use in this research work. The specific details of additional datasets utilized in this research, which were obtained from their respective sources, are outlined in Table 2.

Table 2 Overview of datasets used in the study, including data type, scale, and source.

Full size table

The primary objective of this research is to develop a LSZ for landslide-prone SUTs in India through a comparative analysis of various methods^34,46. The selection of causative factors in statistical and data-driven methods is typically determined by the specific characteristics of the study area and the availability of data^36,60,61,62. In light of the lack of universal criteria for selecting landslide conditioning factors, this study considers ten factors based on the terrain condition and extensive literature review and data availability^{60,63,64,65,66,67,68,69,70}. These factors encompass topographical, hydro-geological, geomorphological, geological, anthropogenic influences, and triggering factors. The inclusion of a diverse range of causative factors enables a comprehensive analysis of the interrelationships between each factor and landslide distribution, ultimately aiding in the identification of the most suitable LSZ. In this study, widely used data driven models, such as the FR and Yc, and AHP have been used to calculate the weights of various conditioning factors, which are then applied to generate the NLSZ. The comparative analysis is instrumental in determining the most appropriate model for the Indian terrain. The detailed methodology adopted in the study is presented in Fig. 2.

Multicollinearity

In LSZ, evaluating multicollinearity among factors is very important to make sure that the model is reliable and correct⁷¹. Multicollinearity happens when predictor variables are strongly linked, which makes coefficient values unstable and makes it harder to understand the model⁷². The Variance Inflation Factor (VIF) and Tolerance (TOL) are two common ways to find multicollinearity⁷³. A high TOL value, approaching 1, indicates minimal multicollinearity, while a low TOL value, nearing 0, signifies significant multicollinearity, which may pose challenges⁷⁴. A VIF exceeding 10 is typically regarded as a benchmark for significant multicollinearity. These metrics can be computed using RStudio software (RStudio-2025.09.2–418.exe https://posit.co/download/rstudio-desktop/) and are theoretically represented as:

$$\:TOL\:=\frac{1}{VIF}=1-{Ri}^{2}$$

$$\:VIF\:=\frac{1}{{TOL}_{i}}=\frac{1}{1-{Ri}^{2}}$$

In this case, Ri² denotes the coefficient of determination derived from regressing the i-th predictor against all other predictors. TOL represents the fraction of variance in a predictor that remains unexplained by other independent variables. The VIF quantifies the degree to which the variance of a regression coefficient is augmented as a result of collinearity.

Analytic hierarchy process (AHP)

The AHP is a popular multi criteria decision-making (MCDM) method for systematically figuring out how likely a landslide is by giving factors different weights based on expert opinion⁴⁰. AHP works well for combining many geo-environmental factors, and it has been used successfully in landslide risk assessments³². There are several steps to the method used to implement AHP in this work. The first step is to choose the right landslide influencing factors based on how important they are to the landslide happening. Then, a Pairwise Comparison Matrix (PCM) is made to see how important each factor is compared to the others. Using 9-point scale, a 1 means that the two things are equally important, a 3 means that they are moderately important, a 5 means that they are strongly important, a 7 means that they are very strongly important, and a 9 means that they are extremely important⁷⁵. To make the comparisons more accurate, numbers in the middle (2, 4, 6, and 8) are used. According^65,76, experts give scores based on what they know about the features of landslide-prone terrain. This makes sure that the ranking shows how each factor really affects the likelihood of a landslide. The eigenvector method is used to get adjusted weights for each factor after the PCM has been built⁶⁵. The subsequent approach is employed to compute the adjusted weight (Wi) for each predictor:

$$\:{W}_{i\:}=\:\frac{{\sum}_{j=1}^{n}{A}_{ij}}{n}$$

Where Aij represents the value of the ith row and jth column in the PCM, and n is the total number of factors. This step ensures that all conditioning factors are assigned appropriate weights based on their relative significance in landslide occurrence.

The final weights derived from the AHP analysis are used to compute the Landslide Susceptibility Index (LSI) through the Weighted Linear Combination (WLC) method^43,77. The LSI for AHP is calculated using the following equation:

$$\begin{aligned}\:{LSI}_{AHP}=&\:{(Aspect\:x\:W}_{AHP})+\:{(Convexity\:x\:W}_{AHP})+\:{(Drainage\:x\:W}_{AHP})+\:{(Earthquake\:x\:W}_{AHP})\\&+\:{(Fault\:x\:W}_{AHP})+\:{(Geomorphon\:x\:W}_{AHP})+\:{(LULC\:x\:W}_{AHP})+\:{(Rainfall\:x\:W}_{AHP})\\&+\:{(Slope\:x\:W}_{AHP})+\:{(Lithology\:x\:W}_{AHP})\end{aligned}$$

W_AHP represents the weight assigned to each conditioning factor of the landslide.

Frequency ratio (FR)

The FR method is a prevalent bivariate statistical technique employed to evaluate landslide susceptibility by examining the correlation between historical landslides and their causal elements⁷⁸. This method quantifies the likelihood of landslide development concerning each contributing component category, offering an objective assessment of susceptibility^79,80. The initial step entails the preparation of a detailed landslide inventory dataset, assembled from field surveys, remote sensing, and historical documentation. Numerous conditioning elements affecting slope stability are chosen based on previous research and their significance in the study area⁸¹. All thematic layers are processed utilizing GIS methodologies, guaranteeing consistent spatial resolution for analysis. The FR value for each factor class is thereafter calculated utilizing the algorithm⁶⁸:

$$\:FR\:=\frac{\%\:of\:landslide\:pixels\:in\:Predictor\:class}{\%\:of\:area\:covered\:by\:Predictor\:class}$$

The numerator denotes the quantity of landslide pixels within a designated factor class, while the denominator indicates the fraction of that class in the research region. The FR value indicates the relative probability of landslides occurring within a specific category of a conditioning factor. An FR greater than 1 signifies a robust association between the component class and landslides, whereas an FR less than 1 indicates diminished susceptibility⁶⁹.

The LSI_FR is determined by aggregating the FR values of all chosen parameters^81,82.

$$\begin{aligned}\:{LSI}_{FR}=&\sum\:\left(FRi\right){\text{ = F}}{{\text{R}}_{{\text{Aspect}}}}{\text{\; + F}}{{\text{R}}_{{\text{Convexity}}}}{\text{ + F}}{{\text{R}}_{{\text{Drainage}}}}{\text{\; + F}}{{\text{R}}_{{\text{Earthquake}}}}{\text{ + F}}{{\text{R}}_{{\text{Fault}}}}\\&{\text{\; + F}}{{\text{R}}_{{\text{Geomorphon}}}}{\text{ + F}}{{\text{R}}_{{\text{LULC}}}}{\text{\; + F}}{{\text{R}}_{{\text{Rainfall}}}}{\text{ + F}}{{\text{R}}_{{\text{Slope}}}}{\text{\; + F}}{{\text{R}}_{{\text{Lithology}}}}\end{aligned}$$

FR stands for the frequency ratio of each factor class. The final LSI map is put into different susceptibility zones.

Yules coefficient (Yc)

The Y_C method, together with the Landslide Occurrence Frequency Score (LOFS), offers a statistical framework for assessing the correlation between landslide events and conditioning factors^83,84,85. This approach improves LSZ by integrating categorical factor correlations and the frequency of landslide events within designated factor classes⁸⁶. The initial phase entails the compilation of a landslide inventory and thematic layers that illustrate essential conditioning factors⁸⁷. The landslide inventory is utilized to delineate landslide-affected and unaffected regions for statistical analysis. A contingency excel is created for each class of conditioning factors to quantify the relationship between landslide occurrences and these parameters. Yule’s Coefficient (Yc) is computed as described⁸³.

$$\:Yc\:=\frac{\sqrt{\left(\frac{Fab}{F{a}^{{\prime\:}}b}\right)}\:-\sqrt{\left(\frac{Fab{\prime\:}}{F{a}^{{\prime\:}}b{\prime\:}}\right)}}{\sqrt{\left(\frac{Fab}{F{a}^{{\prime\:}}b}\right)}+\sqrt{\left(\frac{Fab{\prime\:}}{F{a}^{{\prime\:}}b{\prime\:}}\right)}}$$

F_ab denotes a positive match, where both landslide and factor class are present. F_a’b and F_ab’ represent areas of mismatch, with X_a’b indicating the absence of a specific factor class despite the presence of landslides, while Fab’ signifies the presence of a factor class in the absence of landslides. F_a’b’ indicates a negative match, where both landslide and factor class are absent. The Pearson correlation coefficient and Y_C values range from − 1 to + 1, with positive values signifying a greater spatial association and negative values indicating the opposite. The Landslide LOFS is calculated utilizing the Y_C values from all predictors maps .

$$\:LOFS\:=\left\{\begin{array}{c}0,\:\:\:\:\:\:for\:\:\:Yc\le\:0\\\:\frac{Yc}{{Yc}_{max}},\:for\:\:\:Yc>0\end{array}\right.$$

LOFS represents the degree of influence of each factor class on susceptibility to failure, ranging from zero to one, whereas YC_max reflects the maximum Yc among all classes within a spatial predictor.

Various spatial elements may be associated with landslides in unique manners. However, because to the association of landslides with multiple interacting variables, an investigation of inter-predictor weights is essential for predictive susceptibility modeling. A comprehensive grasp of the predictors of landslides may enhance the analysis; yet, expert knowledge is subjective and may assign arbitrary weights to various variables. Utilizing Eq. 3, a prediction weight (Wi) was calculated for all geo-factors, predicated on the extent of geographical association with the landslides.

$$\:{W}_{i}=\:\raisebox{1ex}{$({Yc}_{max}-\:{Yc}_{min})$}\!\left/\:\!\raisebox{-1ex}{${\left.({Yc}_{max}-\:{Yc}_{min})\right|}_{min}$}\right.$$

To determine the predictor weights for each geo-factor, the absolute difference between the maximum and minimum Yc values was divided by the minimum Yc value.

The LSI for Yc was computed by integrating the LOFS values for each geo-factor class (Eq. 2) and the Wi values for each geo-factor map (Eq. 3) through the weighted multi-class index overlay method in the GIS platform.

$$\begin{aligned}\:{LSI}_{Yc}=&\frac{{\sum\:}_{i}^{n}({LOFS}_{ij}\:x\:{W}_{i\:})}{{\sum\:}_{i}^{n}{W}_{i}}{\text{ = [(LOF}}{{\text{S}}_{{\text{Aspect}}}}{\text{ x }}{{\text{W}}_{{\text{Aspect}}}}{\text{) + }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Convexity}}}}\:{\text{x }}{{\text{W}}_{{\text{Convexity}}}}} \right)\\&{\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Drainage}}}}\:{\text{x }}{{\text{W}}_{{\text{Drainage}}}}} \right){\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Earthquake}}}}{\text{x }}{{\text{W}}_{{\text{Earthquake}}}}} \right)\\&{\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Fault}}}}{\text{x }}{{\text{W}}_{{\text{Fault}}}}} \right){\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Geomorphon}}}}{\text{x }}{{\text{W}}_{{\text{Geomorphon}}}}} \right)\\&{\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{LULC}}}}{\text{x }}{{\text{W}}_{{\text{LULC}}}}} \right){\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Rainfall}}}}\:{\text{x }}{{\text{W}}_{{\text{Rainfall}}}}} \right)\\&{\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Slope}}}}\:{\text{x }}{{\text{W}}_{{\text{Slope}}}}} \right){\text{ }} + {\text{ }}\left( {{\text{LOF}}{{\text{S}}_{{\text{Lithology}}}}\:{\text{x }}{{\text{W}}_{{\text{Lithology}}}}} \right)]{\text{ }}/{\text{Cumulative weight}}\end{aligned}$$

where i is the quantity of predictor maps, and j denotes the classes of predictors within the maps. The generated map (LSI) illustrates a probabilistic model of landslide susceptibility, offering insights into the probability of slope failures in the examined region⁶⁰.

Landslide inventory

A landslide inventory systematically documents landslide occurrences together with their locations and characteristics, serving as the basis for susceptibility mapping, risk evaluation, and early warning systems^88,89. The premise is that the future landslides will transpire under situations analogous to previous occurrences, therefore necessitating a comprehensive inventory for dependable models⁹⁰. Landslides can be delineated by field surveys, aerial images, and remote sensing, contingent upon the scope of the study and the availability of data⁹¹. This study utilized landslide data from the GSI²⁷, which maintains a comprehensive national inventory that integrates polygon and point datasets representing regional landslide attributes. Polygon and point data from the Himalayas and Ghats regions were standardized by converting polygons into points and subsequently cleaned to eliminate duplicates, yielding an inventory of 109,504 landslides at a resolution of 90 m. Figure 1 illustrates the spatial distribution of landslides in different landslide susceptible regions of SUTs of India. Rainfall-induced landslides predominate in the Himalayas and Ghats, but earthquake-triggered incidents are concentrated in tectonically active regions of active tectonic belt of Himalayas.

Landslide predictor maps

The selection of suitable predictor variables is essential for producing dependable LSZ^34,92. This work compiles an extensive array of predictors for both Himalayan and non-Himalayan terrains, ensuring spatial and geomorphological diversity^53,93,94,95. The predictors are classified into geological (lithology, fault density), topographic (slope, aspect, convexity), geomorphological (geomorphons), hydrological (drainage density), climatic (annual rainfall), anthropogenic (land use/land cover), and seismological (earthquake magnitude) factors, each of which influences slope instability^{64,66,69,96,97,98,99,100,101}. The raw predictor datasets, which are available in diverse resolutions and formats (vector and raster), are standardized by converting all layers to raster format and resampling them to align with the 90 m resolution of the Digital Elevation Model (DEM). To alleviate collinearity among predictors, an issue that might misrepresent statistical relationships, this research utilize TOL and VIF¹⁰². Variables in the top decile with VIF values beyond five are progressively eliminated until all remaining variables demonstrate VIF values beneath this threshold, hence providing statistical robustness¹⁰³. After eliminating collinear variables, the predictor datasets are synchronized with the training data for LSZ modeling. Altitude was omitted from the final model due to its disproportionately high impact on LSZ outputs, perhaps resulting from the considerable altitudinal variance within the study area. The array of predictors employed in the analysis and their corresponding significance are encapsulated below:

Aspect

The orientation of slopes affects microclimatic variables, including sun radiation, evapotranspiration, and weathering. In the Indian subcontinent, south-facing slopes frequently undergo accelerated weathering due to extended solar exposure, whilst other orientations may maintain elevated soil moisture levels due to reduced evaporation, hence heightening the risk of saturation-induced failures²¹. Aspect classification is illustrated in Fig. 3a.

Convexity

Terrain convexity (Fig. 3b) significantly influences material transit and accumulation. Concave zones generally serve as depositional sites, accumulating loose material, while convex zones are more susceptible to erosion and operate as detachment locations, hence heightening the probability of landslide initiation¹².

Draiange

Proximity to drainage networks is a pivotal feature affecting landslide susceptibility, as regions adjacent to streams and rivers frequently endure elevated erosion, soil saturation, and slope undercutting. Proximity to drainage channels generally signifies areas with heightened surface runoff and diminished infiltration capacity, especially in places experiencing heavy monsoonal precipitation, thereby increasing the likelihood of slope failure^48,59. This study utilized GIS-based Euclidean distance analysis to determine the distance to drainage, which was integrated as a crucial conditioning factor in the susceptibility model (Fig. 3c).

Earthquake

Earthquakes induce landslides by ground shaking, liquefaction, and the reduction of shear strength in unconsolidated materials^46,47. A seismic density map was created using nearly a century of magnitude-based seismic recordings, resampled to a 90-meter resolution for spatial modelling (Fig. 3d).

Fault

The proximity to active faults and thrust zones significantly influences landslide susceptibility, as these regions are frequently structurally weakened by tectonic fracturing, seismic activity, and cumulative ground deformation^66,67. This study assessed the distance to faults using GIS-based Euclidean distance analysis, which was included as a vital conditioning factor in the susceptibility modeling framework (Fig. 3e).

Geomorphon

Geomorphon⁴⁶ based classification identifies landform features including ridges, valleys, and hollows, offering a structured approach to describe terrain morphology (Fig. 3f). This classification improves the recognition of geomorphic zones more susceptible to mass movements^68,69. Geomorphons were produced using SAGA GIS and incorporated into the modeling process (Jasiewicz & Stepinski, 2013).

LULC

Anthropogenic induced land use alterations, including deforestation, agriculture, and urbanization, disrupt natural slope equilibrium and heighten susceptibility, while vegetated regions bolster slope stability through root reinforcement and hydrological management^53,63. The worldwide LULC dataset from ESRI was employed and resampled to a resolution of 90 m (Fig. 3g).

Rainfall

Prolonged and intense precipitation is a principal catalyst for both shallow and deep-seated landslides, as it increases pore-water pressure, diminishes effective stress, and decreases the shear strength of slope materials⁴⁷. This work used gridded precipitation data from the Indian Meteorological Department (IMD, 2023) inside a GIS context and resampled it to a spatial resolution of 90 m for consistency with other model inputs (Fig. 3h).

Slope

The slope angle is a critical topographic element affecting landslide initiation, as steeper inclines amplify gravity forces and diminish stability. The probability of failure is especially elevated when steep slopes coincide with weak lithology and areas of significant precipitation³⁰. Standard slope categorization methodologies were utilized during the modeling process (Fig. 3i).

Lithology

Lithologies differ in their extent of weathering, permeability, and mechanical strength. Highly fractured, jointed, or weathered lithologies have a markedly greater susceptibility to failure compared to large, competent rock blocks^33,41. Lithological units were categorized by geological age to improve model interpretability and performance, hence managing dataset complexity (Fig. 4).

Results

The VIF values reached a maximum of 2.55, while TOL values remained at or above 0.998, indicating no multicollinearity among the predictor variables (see Figs. 5a, b). These values are significantly below the standard thresholds of concern (e.g., VIF > 10, TOL ≈ 0), thereby confirming the reliability of the model inputs. The topographic predictors, guided by expert knowledge and existing literature, demonstrate both statistical robustness and representativeness, thereby reinforcing the credibility of the LSZ maps.

Factors vs landslide distribution

The examination of landslide frequency among several predictors indicates a distinct disparity in susceptibility (Fig. 6). Slopes oriented between 157.5° and 202.5° exhibited the highest landslide frequency at 20.1%, followed by those between 112.5° and 157.5° at 19.2%, and slopes between 202.5° and 247.5° at 17% (Fig. 6a). Conversely, the intervals between 327.5°–360° and 292.5°–327.5° had the lowest frequencies at 2.8% and 6.3%, respectively. No landslides transpired on level terrain (0°), underscoring the impact of slope gradient and orientation. Regarding convexity, regions with low values (0–21.7) had no landslide occurrences, signifying enhanced stability (Fig. 6b). The incidence of landslides escalated significantly with increased convexity, reaching a maximum of 56% within the 47.9–80.4 range and 38.2% within the 42.9–47.9 range. The data indicate that highly convex slopes are more susceptible to failure, possibly due to diminished support, elevated shear stress, and water infiltration.

Landslide susceptibility varies significantly with distance from drainage networks (Fig. 6c). The highest frequency (34.66%) occurs at 2000–5000 m, suggesting mid-range distances are more prone to slope failure, likely due to factors such as soil saturation, erosion, and steep topography. In contrast, areas close to drainage (0–100 m: 3.8%) or far from it (10,000–50,000 m: 3.31%) show minimal landslide activity. Frequency increases steadily from 3.8% at 100 m to 16.7% at 2000 m, before declining beyond 5000 m (17.73% at 5000–10,000 m; 3.31% at 10,000–50,000 m), indicating reduced susceptibility at greater distances from water channels. Similarly, landslide distribution relative to tectonic faults shows a peak frequency (25.39%) at 2000–5000 m (Fig. 6d), possibly due to the interplay of structural weakening and adverse geomorphic conditions. A marked increase is also observed at 1000–2000 m (14.21%), with a gradual decline beyond 5000 m (17.92% at 5000–10,000 m; 17.97% at 10,000–50,000 m). Regions within 0–100 m of mapped faults have the lowest landslide frequency (2.66%), suggesting a national impact. In tectonically active places like the Himalayas, fault proximity directly increases landslide frequency, suggesting regional variances in fault-related susceptibility.

Earthquakes of intermediate magnitude (4.57–4.92 Mw) correlate with the highest frequency of landslides (34.7%), succeeded by those in the 4.25–4.57 Mw range (29.3%), demonstrating that low to moderate seismic events substantially influence slope collapses (Fig. 6e). These magnitudes are likely to induce landslides on pre-weakened or degraded inclines. Conversely, lower-magnitude earthquakes (1.8–4.25 Mw) constitute merely 11.4% of occurrences, presumably owing to inadequate seismic energy. High-magnitude earthquakes (5.39–8.5 Mw) account for just 7.1%, likely due to their infrequency, occurrence in geological formations, or propensity to induce fewer yet more devastating failures.

Slope-related landforms within geomorphon classes serve as a principal determinant of landslide occurrence, illustrating the intrinsic instability of steep terrains influenced by gravitational forces, severe weathering, and mass-wasting processes (Fig. 6f). Hollows (19.7%) and spurs (16.7%) demonstrate the highest frequency of landslides, suggesting that both concave and convex shapes contribute to instability because to their tendency for water collection, uneven weathering, and structural degradation. Valleys exhibit significant activity (14.6%), perhaps influenced by toe-cutting and erosional processes related to surface drainage networks. Conversely, landforms generally linked to stability, such as flat regions (0%), shoulders (0.1%), and footslopes (0.1%), have less landslide activity due to their mild gradients, which diminish gravitational stress and restrict erosional pressures. Peaks (1.2%) and pits (3.6%) exhibit minimal activity, possibly associated with localized erosion or discrete collapses. Rridges exhibit a comparatively low landslide incidence (9.1%), probably because to lithological resilience, vegetative cover, and minimal human interference.

The results of the LULC analysis indicate that the majority of landslides occur in forested areas with tree heights of 15 feet or more, accounting for 61.9% of all landslides (Fig. 6g). The elevated frequency of landslides in forest regions can be attributed to the lack of classification based on forest density. Classifying forest regions into sparse, moderate, and dense density levels may reveal significant patterns or correlations. Vegetation is generally associated with slope stability, likely due to the prevalence of steep topography in densely forested mountainous regions. Under some situations, intense precipitation, prolonged soil saturation, and pre-existing geological weaknesses can overwhelm the stabilizing effect of root systems, resulting in both shallow and deep-seated landslides. Rangelands account for 28.7% of landslide incidents, underscoring their susceptibility due to little vegetation and exposed soils that facilitate erosion and hydrological instability. Conversely, built-up areas (4.1%) and bare terrain (3.2%) have reduced frequency, possibly attributable to constructed slope protections or inherently stable circumstances. Croplands demonstrate negligible landslide activity (0.7%), perhaps due to their level topography and cultivated environments.

In regions with minimal annual precipitation (52–500 mm), landslides are rare, comprising approximately 3.6% of incidents (Fig. 6h), suggesting restricted hydrological triggering potential in dry to semi-arid environments. The frequency of landslides markedly escalates with rainfall, reaching its zenith in the 1000–1500 mm range. A minor decrease is shown in the 1500–2000 mm category (24.3%), however vulnerability persists, indicating the destabilizing impact of prolonged rainfall on slope materials. Above 2000 mm, the frequency of landslides progressively diminishes—18.2% (2000–2500 mm), 9% (2500–3000 mm), and 10% (> 3000 mm)—potentially attributable to geological resilience, saturation limits, or vegetation cover. These trends underscore the significance of moderate to high precipitation in the incidence of landslides, particularly when coupled with additional factors such as slope and lithology.

The distribution of landslides in relation to slope gradient indicates that the majority of occurrences transpire on slopes ranging from 30° to 45° (37.9%) and from 20° to 30° (31.3%), where gravitational forces surpass slope resistance under specific triggering conditions (Fig. 6i). Moderately inclined terrains (10°–20°) constitute 18.8% of landslides, suggesting that such slopes may destabilize under particular geological or hydrological conditions. The most inclined slopes (45°–87°) have the lowest frequency (4.9%), presumably attributable to restricted soil accumulation, exposed bedrock, and inherent stabilizing elements such surface roughness and vegetation.

Significance of factors

In AHP, predictor weights were obtained using Saaty’s Pairwise Comparison Scale (1–9), yielding a consistency ratio of 0.096 (< 0.1), hence affirming the trustworthiness of expert assessments (Tables 3 and 4). The AHP results indicated that slope (22.8%), land use/land cover (19.8%), and lithology (14.8%) are the most significant factors, highlighting the impact of terrain steepness and human activities on landslide incidence (Fig. 7). The FR model, utilizing frequency ratios, identified convexity (24.6%), land use/land cover (12.5%), and slope (11.4%) as predominant variables, underscoring the significant impact of terrain curvature. Conversely, the Yc model allocated about uniform weights (~ 10.7%) to slope, lithology, geomorphons, aspect, and fault proximity, indicating a more equitable contribution among the components. The effects of earthquakes was negligible in all models, with the lowest weights attributed in AHP (2.7%) and Yc (3.6%), signifying a restricted effect on susceptibility throughout the research area.

Table 3 AHP pairwise comparison matrix for factor weights.

Full size table

Table 4 Consistency index (CI), random index (RI), and consistency ratio (CR) Calculation.

Full size table

National landslide susceptibility zonation

The spatial distribution of landslide susceptibility (LS) exhibits little yet significant variance among the AHP, FR, and Yc models (Figs. 8, 9 and 10). The FR model identifies the most extensive region of very high landslide susceptibility (VHLS) at 10.9%, succeeded by Yc at 10.1% and AHP at 9.7% (Fig. 11), demonstrating no major variation. In the high susceptibility (HLS) category, AHP accounts for 17%, whilst Yc represents 14.9% and FR constitutes 13.7%. All models provide approximately comparable coverage for moderate susceptibility (MLS), with 19.4% (AHP), 19.6% (Yc), and 19.5% (FR), indicating continuous concordance on transitional susceptibility zones. The low susceptibility (LLS) zone is more prevalent in Yc (24.7%) and FR (24%) than in AHP (19.4%), however AHP accounts for the highest proportion of very low susceptibility (VLLS) at 34.6%, succeeded by FR (31.9%) and Yc (30.6%), reflecting a more conservative classification methodology employed by AHP.

At the national level (total area ≈ 3,287,263 km²), VHLS zones encompass 4.5% (FR), 4.2% (Yc), and 4.0% (AHP) of India’s landmass, consistent with previous findings (4.7%) with ML techniques⁴⁸. The combined susceptible area (HLS + VHLS) comprises 11.0% (AHP), 10.4% (Yc), and 10.2% (FR), aligning with earlier estimates (12.6% by GSI; 13.17% by Sharma et al.⁴⁸), hence confirming models accuracy. AHP identifies Sikkim as having the highest VHLS proportion at 54.9%, followed by Uttarakhand at 44.1% and Himachal Pradesh at 40.5%. Arunachal Pradesh encompasses the most extensive VHLS area, approximately 32,942 km² (Table 5). The Yc model ranks Nagaland (55.0%), Mizoram (53.1%), and Arunachal Pradesh (52.1%) as the highest, with Arunachal Pradesh encompassing around 44,392 km² (Table 5). Likewise, FR designates Nagaland (60.8%), Mizoram (57.9%), and Uttarakhand (51.2%) as having the largest proportions, whereas Arunachal Pradesh encompasses around 43,860 km² classified as VHLS (Table 5).

Notwithstanding model-based variances, the uniformity in pinpointing highly sensitive regions particularly the Himalayas and Western Ghats, highlights the geodynamic fragility of these areas. The results underscore the significance of multi-model methodologies for effective LSZ mapping and its relevance in focused land-use planning, risk reduction, and disaster preparedness initiatives.

Table 5 This table presents a comparison of different landslide susceptibility models, highlighting the approach, VHLS percentage focus, VHLS area, strengths, and limitations of each model.

Full size table

Identification of the top 10 susceptible SUTs of India

Sikkim demonstrated the highest VHLS in AHP model projecting 54.9%, the FR model 52.1%, and the Yc model 45% (Fig. 12). Arunachal Pradesh (AP) and Mizoram demonstrated significant VHLS, with AP exhibiting values from 38.7% in the AHP model to 52.1% in Yc and 51.5% in FR, whilst Mizoram ranged from 31.5% in AHP to 53.1% in Yc and 57.9% in FR. The VHLS of Nagaland exhibited significant variation, with AHP at 25%, and Yc and FR at 55% and 60.8%, respectively. Manipur demonstrated reduced susceptibility, with VHLS scores of 14.7% (AHP), 30.6% (Yc), and 25.2% (FR). Uttarakhand (UK) and Himachal Pradesh (HP), both susceptible to frequent landslides, demonstrated moderate to high VHLS, with UK registering 44.1% in AHP, 40% in Yc, and 51.2% in FR, whilst HP reported 40.5% in AHP, 28.9% in Yc, and 37.6% in FR. Jammu & Kashmir (J&K) and Meghalaya demonstrated reduced VHLS, with J&K recording 27.4% (AHP), 20.2% (Yc), and 25.1% (FR), whereas Meghalaya reported 10.9% (AHP), 22% (Yc), and 18.7% (FR). Tripura exhibited VHLS ≤ 8% across all models.

The elevated VHLS values in the Yc and FR models presumably indicate their data-driven responsiveness to variables such as deforestation and rainfall intensity. The results underscore the considerable influence of steep gradients, intense precipitation, and unstable geological formations, but the minor differences across models indicate variances in sensitivity and methodological approach.

Validation

The Receiver Operating Characteristic (ROC) curve is an effective tool in LSZ, as it offers evaluation of the model’s predictive capability for landslides⁸¹. This curve facilitates the identification of the ideal equilibrium between precise landslide prediction and minimizing false alerts. The ROC curve enables researchers to evaluate the reliability of susceptibility models, optimize parameter values, and enhance overall predictive accuracy¹⁰⁴. The susceptibility maps were validated using independent testing datasets comprising 30% of both landslide and non-landslide occurrences. The validation results yielded area under curve (AUC) values of 0.889 for the Yc model, 0.891 for the AHP model, and 0.905 for the FR model (Fig. 13), indicating strong predictive capability. Additionally, cross-validation using the precision-recall method produced AUC values of 0.856 for Yc, 0.867 for AHP, and 0.872 for FR (Fig. 14). Collectively, these evaluations confirm the high performance and reliability of all three models.

Validation of maps through landslide distribution

The maps are also validated by examining landslide spatial distribution across susceptibility classes. The VHLS zones includes majority of landslides, with 60.6%, 58.0%, and 66.8% for AHP, Yc, and FR models, respectively (Fig. 15). This remarkable relation between VHLS zones and landslides verifies the models ability to identify susceptible areas. The HLS zones consist of landslide concentration of around 28.2% in AHP, 30.7% in Yc, and 25.4% in FR models, confirming the models’ prediction dependability. The MLS covers less landslides, from 6.7% (FR) to 9.4% (Yc), indicating a transitional zone between stable and unstable terrain. LLS and VLLS zones cover low landslide percentage, not exceeding 1.9%, indicating that these places are more stable. The VHLS and HLS include large landslide concentrations, demonstrating that the models are good in predicting locations susceptible to landslides.

Ground validation

Field observations indicate that the majority of recent landslides, including historically devastating events, are concentrated within zones classified as HLS and VHLS (Fig. 16). This strong spatial correlation between observed landslide occurrences and the predicted high-risk areas provides robust empirical validation for the assessment model. Such consistency not only confirms the accuracy of the susceptibility mapping but also enhances confidence in its use for disaster risk management, land-use planning, and early warning systems.

Best fit model

The LSZ map produced by the Yc model was deemed the most appropriate for further analysis, based on various evaluation criteria, including the significance of predictor weights (Fig. 7), ROC values (Figs. 13 and 14), and spatial alignment with documented landslide occurrences (Fig. 15). As a result, the Yc-derived LSZ map was chosen for further landslide risk mapping. Figure 17 displays the LSZ map generated by the Yc model, superimposed with the relevant STUs boundaries to improve geographical analysis and visualization.

National landslide risk map (NLRM) at qualitative scale

The earlier study¹⁰⁵ undertaken in Gopeshwar Township, Chamoli District, Uttarakhand, India, concentrated on a risk assessment based on building types within a delineated area of 8.39 km². However, our study employs a generalized methodology that offers a comprehensive qualitative risk evaluation at the national level. The qualitative risk map was created by combining built-up area and road network layers with HLS VHLS obtained from the Yc model (Fig. 18). The analysis indicates that around 8,606.13 km² of study area is situated within high to very high-risk zones, representing about 0.26% of the nation’s total land area. Approximately 2,398.66 km² is designated as very high risk, whilst 6,207.47 km² is categorized as high risk under study area.

Discussion

This research presents a detailed framework for the creation of a NLSZ and a NLRM, based on expert consultation, thorough literature review, and stringent multicollinearity analysis. All three models (FR, Yc and AHP) exhibited consistent outcomes, with VHLS zones encompassing 10.9% (FR), 10.1% (Yc), and 9.7% (AHP) of the study area, reflecting negligible variation in pinpointing the most susceptible zones. The AHP model identified the HLS zones around 17%, in contrast to 14.9% for Yc and 13.7% for FR, while moderate susceptibility zones had similar proportions across models, indicating a consistent classification of transitional terrain. Zones of low and very low susceptibility exhibited minimal fluctuation, with FR encompassing the largest area (55.9%), closely followed by Yc (55.4%) and AHP (53.9%). The obtained very high susceptibility coverage in India, varying from 4.0% (AHP) to 4.5% (FR), roughly aligns with previous results⁴⁸ (e.g., 4.7% derived using machine learning models). The amalgamation of high susceptibility and VHLS zones (10.2%–11.0%) aligns with the estimates from GSI and Sharma et al. 2024⁴⁸ (12.6% and 13.17%, respectively), hence affirming the trustworthiness and robustness of the models for national-scale landslide susceptibility assessment.

The state-wise research indicates that the AHP model identifies the highest percentage of VHLS in Sikkim (54.9%), followed by Uttarakhand (44.1%) and Himachal Pradesh (40.5%). Arunachal Pradesh, meanwhile, possesses the most extensive VHLS area, measuring 32,942 km². The Yc and FR models identify Nagaland, Mizoram, and Arunachal Pradesh as the most vulnerable, underscoring regional variability influenced by geological, climatic, and anthropogenic causes¹⁰⁶. Minor inconsistencies among models indicate variations in methodological sensitivity, with data-driven models (Yc, FR) exhibiting more responsiveness to recent environmental alterations, including deforestation and infrastructural development¹⁰⁷. Model validation produced AUC values between 0.874 and 0.905, signifying robust predictive efficacy. A significant proportion of mapped landslides (60.6%–66.8%) resides within high and VHLS, corroborating the spatial correlation between anticipated risk and recorded occurrences¹⁰⁸. The VLLS has the most extensive geographic area across all models, indicating that a significant chunk of India faces minimal landslide danger.

Among the models, Yc has exceptional performance in terms of classification accuracy and spatial sensitivity. The highest VHLS is identified in Nagaland (55.0%), Mizoram (53.1%), and Andhra Pradesh (52.1%), followed by Sikkim (45.0%), Uttarakhand (40.0%), Manipur (30.6%), Himachal Pradesh (28.9%), Jammu & Kashmir (20.2%), Meghalaya (22.0%), Tripura (8.0%), West Bengal (1.9%), Assam (2.9%), Goa (3.7%), Kerala (2.9%), Ladakh (0.7%), Karnataka (0.3%), Tamil Nadu (0.3%), and Maharashtra (0.2%). These findings highlight the model’s sensitivity to essential variables including topography, rainfall, and land-use intensity. The choice of model profoundly affects risk perception and policy results. Incorporating multi-model methodologies with GIS and real-time monitoring can enhance landslide mapping and facilitate more efficient mitigation techniques¹⁰⁹.

Comparison with previous studies at global and National scale

This work enhances prior national and worldwide landslide susceptibility evaluations by using high-resolution thematic datasets, several causative elements, and an extensive inventory of 109,504 landslide occurrences, leading to improved susceptibility zonation accuracy. Globally, findings correspond with previous studies¹¹⁰ indicating significant vulnerability in the Himalayas and Western Ghats (Fig. 19a), although enhance accuracy through improved data and broader factor incorporation. The results partially corroborate previous global maps¹⁹, particularly concerning elevated susceptibility in the Himalayas (Fig. 19b), while rectifying overgeneralizations of moderate susceptibility in non-prone areas. Likewise, the global map by¹¹¹ delineates susceptible areas in the Himalayan and Western Ghats regions (Fig. 19c). Nevertheless, this study avoids such erros by using focused datasets, which results in an overestimation of susceptibility in stable areas. Unlike the previous study⁴⁸, which created a NLSM, this work concentrates on the most vulnerable SUTs (Fig. 19d). This study incorporates more relevant parameters with greater clarity than earlier studies⁴⁸, many of which were overlooked. The latest findings also uncover new high-to-very high susceptibility zones, particularly in Ladakh and select J&K regions, that were not previously identified.

Significance, limitations, uncertainty, and future research directions

This work enhances landslide susceptibility and risk mapping in India, highlighting the necessity for periodic updates to the GSI landslide inventory to augment precision. Future studies must include high-resolution geoenvironmental data and revised inventories that account for dynamic vulnerability shaped by environmental and anthropogenic influences.

Current maps provide general overviews, use LULC-based built-up and road data for risk modeling, which constrains accuracy. Variations stem from model selection, inventory classification, and data granularity. Incorporating more detailed statistics, supplementary vulnerable components (such as agriculture and infrastructure), and landslide inventories directly into risk models would improve regional precision and risk evaluation.

Conclusion

This study offers a comprehensive national-scale assessment of landslide susceptibility and risk in India, utilizing a hybrid framework that combines heuristic (AHP) and data-driven (FR, Yc) models. The incorporation of ten meticulously chosen predictor variables from geological, topographic, hydrological, climatic, anthropogenic, and seismological domains, in conjunction with a comprehensive landslide inventory, improves model thoroughness and dependability. The comparative assessment reveals that the Yc model exhibits enhanced prediction performance (AUC reaching 0.905) and greater geographic concordance with actual landslide distributions. Lithology and land use/land cover were identified as the primary factors influencing susceptibility, succeeded by slope, geomorphons, and proximity to faults. Although rainfall and drainage proximity significantly contribute, seismic variables exhibited minimal impact inside the Yc framework.

The FR model determined the majority area of VHLS at 10.9%, succeeded by Yc at 10.1% and AHP at 9.7%. In HLS, AHP approximated 17%, just above Yc (14.9%) and FR (13.7%). MLS exhibited a comparable distribution across models, signifying topographic uniformity. Areas of LLS and VLLS were most prevalent in FR (55.9%), followed by Yc (55.4%) and AHP (53.9%).

At the national level, VHLS zones encompass approximately 4.0%, 4.2%, and 4.5% of India’s land area according to the AHP, Yc, and FR models, respectively, consistent with prior research. The zones of combined HLS and VHLS account for approximately 11.0% in AHP, 10.4% in Yc, and 10.5% in FR models.Validation demonstrated robust prediction accuracy, with AUC values ranging from 0.874 to 0.905. Risk mapping with the Yc model reveals that around 0.26% of India (~ 8,606 km²) is situated within high to very high landslide risk zones.

These findings offer essential geospatial insights for disaster risk mitigation, highlighting the necessity for focused slope monitoring, public awareness, and the incorporation of landslide risk into land-use planning at mapped scale.

Data availability

All data used in this study are either publicly available from open-source repositories or were generated during the current research. Processed datasets and supporting materials can be made available by the corresponding author (s) upon reasonable request.

Abbreviations

AHP:: Analytic Hierarchy Process
AP:: Arunachal Pradesh
AUC:: Area Under Curve
CR:: Consistency Ratio
DEM:: Digital Elevation Model
DRR:: Disaster Risk Reduction
DL:: Deep Learning
EM-DAT:: Emergency Events Database
FR:: Frequency Ratio
GIS:: Geographic Information System
GSI:: Geological Survey of India
HLS:: High Landslide Susceptibility
HMR:: Himalyan Mountain Range
HP:: Himachal Pradesh
J&K:: Jammu and Kashmir
LOFS:: Landslide Occurrence Frequency Score
LLS:: Low Landslide Susceptibility
LS:: Landslide susceptibility
LSI:: Landslide Susceptibility Index
LULC:: Land Use Land Cover
MCDM:: Multi Criteria Decision Making
ML:: Machine Learning
NEH:: Northestern Himalayas
NLRM:: National Landslide Risk Map
NLSM:: National Landslide Susceptibility Mapping
NLSZ:: National Landslide Susceptibility Zonation
NWH:: Northwestern Himalayas
PCM:: Pairwise Comparison Matrix
ROC:: Receiver Operating Characteristic
STUs:: States and Union Territories
TOL:: Tolerance
UK:: Uttarakhand
VIF:: Variance Inflation Factor
VLLS:: Very Low Landslide Susceptibility
WLC:: Weighted Linear Combination
Yc:: Yule’s Coefficient

References

Ehrlich, D., Melchiorri, M. & Capitani, C. Population trends and urbanisation in mountain ranges of the world. Land 10, 255. https://doi.org/10.3390/land10030255 (2021).
Article Google Scholar
Klein, E., Kappes, M. S. & Glade, T. Advances and applications in modeling, assessment, and mitigation of landslide risk. Front. Earth Sci. 8, 148. https://doi.org/10.3389/feart.2020.00148 (2019).
Article Google Scholar
United Nations Office for Disaster Risk Reduction (UNISDR). Sendai Framework for Disaster Risk Reduction 2015–2030. (United Nations, 2015). https://www.unisdr.org/files/43291_sendaiframeworkfordrren.pdf
Froude, M. J. & Petley, D. N. Global fatal landslide occurrence from 2004 to 2016. Nat. Hazards Earth Syst. Sci. 18, 2161–2181. https://doi.org/10.5194/nhess-18-2161-2018 (2018a).
Article ADS Google Scholar
Arouri, K., Bouamoud, M. & Rachid, M. Landslide hazard and risk assessment: case study of the Nabeul region, Tunisia. Environ. Earth Sci. 74, 1027–1040. https://doi.org/10.1007/s12665-015-4462-2 (2015).
Article Google Scholar
Gerrard, A. & Gardner, T. The Landslide Handbook: A Guide for Practical Application (CRC, 2002).
Tobin, G. A., Montz, B. E. & Schaefer, D. A. Hazardous Materials and Risks: A Framework for Evaluating the Role of Human Activities in Landslide-Prone Areas (Springer, 2011). https://doi.org/10.1007/978-94-007-0305-1
von Wymann, S., Nüsser, M. & Schneider, A. Vulnerability and resilience in landslide-prone areas: A case study from the Swiss alps. J. Environ. Manage. 212, 33–45. https://doi.org/10.1016/j.jenvman.2017.01.016 (2017).
Article Google Scholar
Petley, D. N. Global patterns of loss of life from landslides. Geology 40, 927–930. https://doi.org/10.1130/G33217.1 (2012a).
Article ADS Google Scholar
Petley, D. N. The size and frequency of landslides in the Himalayas. Landslides 9, 215–221. https://doi.org/10.1007/s10346-011-0270-3 (2012b).
Article Google Scholar
Ahmed, R., Sam, L. & Bhardwaj, A. It is time to build GLOF-resilient communities globally before another GLOF disaster strikes. Npj Nat. Hazards. 2 (1), 41. https://doi.org/10.1038/s44304-025-00097-0 (2025).
Article Google Scholar
Shrestha, A., Sharma, S. & Shrestha, S. Landslides in the himalayas: A comprehensive review of hazards, impacts, and adaptive strategies. Environ. Sci. Policy. 55, 1–14. https://doi.org/10.1016/j.envsci.2025.01.001 (2025).
Article CAS Google Scholar
Nadim, F., Jaedicke, C., Smebye, H. & Kalsnes, B. Assessment of global landslide hazard hotspots. In Landslides: Global Risk Preparedness (eds Sassa, K. et al.) 59–71 (Springer, 2013). https://doi.org/10.1007/978-3-642-22087-6_4.
Chapter Google Scholar
Abedi Gheshlaghi, H. & Feizizadeh, B. GIS-based ensemble modelling of fuzzy system and bivariate statistics as a tool to improve the accuracy of landslide susceptibility mapping. Nat. Hazards. 107, 1981–2014 (2021).
Article Google Scholar
Centre for Research on the Epidemiology of Disasters (CRED). EM-DAT: The International Disaster Database. (2019). https://www.emdat.be
Kirschbaum, D., Stanley, T. & Zhou, Y. Spatial and Temporal analysis of a global landslide catalog. Geomorphology 249, 4–15. https://doi.org/10.1016/j.geomorph.2015.03.016 (2015).
Article ADS Google Scholar
Kirschbaum, D. B. & Jibson, R. W. Landslide hazard is projected to increase across high mountain Asia. Sci. Adv. 10, eaav6750. https://doi.org/10.1126/sciadv.aav6750 (2024).
Article Google Scholar
Haque, U. et al. Fatal landslides in Europe. Landslides 13, 1545–1554. https://doi.org/10.1007/s10346-016-0689-3 (2016).
Article Google Scholar
Stanley, T. & Kirschbaum, D. B. A heuristic approach to global landslide susceptibility mapping. Nat. Hazards. 87, 145–164. https://doi.org/10.1007/s11069-017-2757-y (2017).
Article Google Scholar
Gómez, D., García, E. F. & Aristizábal, E. Spatial and Temporal landslide distributions using global and open landslide databases. Nat. Hazards. 117, 25–55. https://doi.org/10.1007/s11069-023-05848-8 (2023a).
Article Google Scholar
Gómez, M., Rios, D. & Román, S. Landslide hazard mapping in the andes: evaluating landslide-prone areas with machine learning. Geomorphology 398, 108321. https://doi.org/10.1016/j.geomorph.2023.108321 (2023b).
Article Google Scholar
Wang, X., Wang, Y., Lin, Q., Yang, X. & Kirschbaum, D. B. Assessing global landslide casualty risk under moderate climate change based on multiple GCM projections. Nat. Hazards. 111, 1209–1233. https://doi.org/10.1007/s11069-023-05848-8 (2023).
Article Google Scholar
Froude, M. J. & Petley, D. N. Global landslide fatalities and the role of landslide hazard assessment. Science 360, 1223–1226. https://doi.org/10.1126/science.aat0316 (2018b).
Article Google Scholar
Nadim, F., Kjekstad, O., Peduzzi, P., Herold, C. & Jaedicke, C. Global landslide and avalanche hotspots. Landslides 3, 159–173. https://doi.org/10.1007/s10346-006-0036-1 (2006).
Article Google Scholar
Centre for Research on the Epidemiology of Disasters (CRED). EM-DAT: The International Disaster Database. (2023). https://www.emdat.be
Worldometer Real-time world Statistics. Worldometer. Dadax Ltd., (2025). https://www.worldometers.info
Geological Survey of India. Official website of the Geological Survey of India. GSI. https://www.gsi.gov.in
Bureau of Indian Standards (BIS). IS 1893–2002: Criteria for earthquake-resistant Design of structures – Part 1: General Provisions and Buildings 5th edn (Bureau of Indian Standards, 2002).
Pettenati, M., D’Odorico, P. & de Blasio, F. Landslide susceptibility assessment and modeling: a review of approaches and recent trends. Environ. Earth Sci. 76, 1–17. https://doi.org/10.1007/s12665-017-6631-4 (2017).
Article Google Scholar
Goetz, J. N., Guthrie, R. H. & Brenning, A. Integrating physical and empirical landslide susceptibility models using generalized additive models. Geomorphology 129, 376–386 (2025).
Article ADS Google Scholar
Dutta, K., Wanjari, N. & Misra, A. K. Landslide susceptibility assessment in Sikkim himalaya with RS & GIS, augmented by improved statistical methods. Arab. J. Geosci. 17, Article138 (2024).
Article Google Scholar
Kshetrimayum, A., H, R. & Goyal, A. Exploring different approaches for landslide susceptibility zonation mapping in Manipur: a comparative study of AHP, FR, machine learning, and deep learning models. Journal of Spatial Science https://doi.org/10.1080/14498596.2024.2368156 (2024).
Siddique, T. et al. Application of Slope Mass Rating and Kinematic Analysis along road cut slopes in the Himalayan terrain. In: Verma, A.K. (eds) Proceedings of Geotechnical Challenges in Mining, Tunneling and Underground Infrastructures. ICGMTU 2021. Lecture Notes in Civil Engineering, 228. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-9770-8_47
Bui, D. T., Pradhan, B., Lofman, O., Revhaug, I. & Dick, O. B. Landslide susceptibility mapping at Hoa Binh Province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput. Geosci. 81, 46–56 (2025).
Google Scholar
Singh, K., Bhardwaj, V., Sharma, A. & Thakur, S. A comprehensive review on landslide susceptibility zonation techniques. Quaest Geogr. 43, 79–91 (2024a).
Google Scholar
Ayalew, L., Yamagishi, H. & Ugawa, N. Landslide susceptibility mapping using GIS-based weighted linear combination: the case in Tsugawa area of Agano River, Niigata Prefecture, Japan. Landslides 1, 73–81 (2004).
Article Google Scholar
Balteanu, D., Sima, M. & Micu, M. Landslide hazard assessment in Romania. Geogr. Tech. 5, 1–10 (2010).
Google Scholar
Okalp, K. & Akgün, H. Landslide susceptibility mapping using GIS and multicriteria decision analysis: a case study from Turkey. Bull. Eng. Geol. Environ. 75, 1–15 (2016).
Google Scholar
Axing, Y., Hong, H. & Fei, Y. GIS-based landslide susceptibility mapping using support vector machine: a case study in long County, China. Environ. Earth Sci. 60, 1237–1246 (2010).
Google Scholar
Saaty, T. L. Multicriteria Decision Making: the Analytic Hierarchy Process (RWS, 1990).
Pareek, T. et al. Analyzing the posterior predictive capability and usability of landslide susceptibility maps: a case of Kerala, India. Landslides 22, 655–670. https://doi.org/10.1007/s10346-024-02389-4 (2025).
Article Google Scholar
Chung, C. F. & Fabbri, A. G. Validation of Spatial prediction models for landslide hazard mapping. Nat. Hazards. 30, 451–472 (2003).
Article Google Scholar
Fenta, A. A. & Asfaw, D. H. Landslide susceptibility mapping using combined geospatial, FR, and AHP models: a case study from ethiopia’s highlands. Discov Sustain. 5, Article474 (2024).
Article Google Scholar
Arunkumar, K. S., Ankoop, K. G. & Thomas, J. Landslide susceptibility mapping using the analytical hierarchy process and GIS for Idukki District, Kerala, India. Acta Geogr. Debr Landsc. Environ. Ser. 17, 11–32 (2023).
Google Scholar
Subedi, S., Bhandari, K. P., Sherchan, B. & Neupane, N. Landslide susceptibility mapping using analytical hierarchy process in Gandaki Province, Nepal. J. Eng. Sci. 2, 1–8 (2023).
Article Google Scholar
Khan, H., Sharma, P. & Kumar, A. Advanced bivariate geostatistical modeling for high-resolution landslide susceptibility zonation for effective risk management in the northwestern Himalaya, India. J Earth Syst. Sci 133, 5 (2024).
Article ADS Google Scholar
Kumar, A. & Ghosh, S. Ensemble of fuzzy-analytical hierarchy process in landslide susceptibility modeling from a humid tropical region of Western Ghats, Southern India. Environ. Sci. Pollut Res. 31, 41370–41387 (2024).
Google Scholar
Sharma, N., Saharia, M. & Ramana, G. V. High resolution landslide susceptibility mapping using ensemble machine learning and Geospatial big data. Catena 235, 107653. https://doi.org/10.1016/j.catena.2023.107653 (2024).
Article Google Scholar
Hong, Y., Adler, R. F. & Huffman, G. Use of satellite remote sensing data in the mapping of global landslide susceptibility. Nat. Hazards. 43, 245–256. https://doi.org/10.1007/s11069-006-9104-z (2007).
Article Google Scholar
Liu, C. et al. Susceptibility evaluation and mapping of china’s landslides based on multi-source data. Nat. Hazards. 69, 1477–1495. https://doi.org/10.1007/s11069-013-0759-y (2013).
Article Google Scholar
Wang, D. et al. Assessment of landslide susceptibility and risk factors in China. Nat. Hazards. https://doi.org/10.1007/s11069-021-04812-8 (2021).
Article Google Scholar
Zhang, G. et al. Production and analysis of a landslide susceptibility map covering entire China. Remote Sens. 17, 1615. https://doi.org/10.3390/rs17091615 (2025).
Article ADS Google Scholar
Günther, A., Van Den Eeckhaut, M., Malet, J. P., Reichenbach, P. & Hervás, J. Climate-physiographically differentiated Pan-European landslide susceptibility assessment using Spatial multi-criteria evaluation and transnational landslide information. Geomorphology 224, 69–85. https://doi.org/10.1016/j.geomorph.2014.07.011 (2014).
Article ADS Google Scholar
Gaprindashvili, G. & Van Westen, C. J. Generation of a National landslide hazard and risk map for the country of Georgia. Nat. Hazards. 80, 69–101. https://doi.org/10.1007/s11069-015-1958-5 (2016).
Article Google Scholar
Ngo, P. T. T. et al. Evaluation of deep learning algorithms for National scale landslide susceptibility mapping of Iran. Geosci. Front. 12, 505–519. https://doi.org/10.1016/j.gsf.2020.06.013 (2021).
Article ADS Google Scholar
Broeckx, J., Vanmaercke, M., Duchateau, R. & Poesen, J. A data-based landslide susceptibility map of Africa. Earth-Sci. Rev. 185, 102–121. https://doi.org/10.1016/j.earscirev.2018.05.002 (2018).
Article ADS Google Scholar
Lee, S. M. & Lee, S. J. Landslide susceptibility assessment of South Korea using stacking ensemble machine learning. Geoenviron Disasters. 11, 7. https://doi.org/10.1186/s40677-024-00271-y (2024).
Article Google Scholar
Lee, S., Roh, M., Jo, H. W., Kim, J. & Lee, W. K. Machine learning-based rainfall-induced landslide susceptibility model and short-term early warning assessment in South Korea. Landslides 22, 2809–2827. https://doi.org/10.1007/s10346-025-02513-y (2025).
Article Google Scholar
Kumar, S., Kant, S. & Ahmed, R. Assessment of an extreme heavy rainfall over Meghalaya, India on 16th & 17th June 2022: A case study using meteorological and remote sensing observations. Trop. Cyclone Res. Rev. 14, 60–70. https://doi.org/10.1016/j.tcrr.2025.02.007 (2025).
Article Google Scholar
Ghosh, S., Carranza, E. J. M., van Westen, C. J., Jetten, V. G. & Bhattacharya, D. N. Selecting and weighting Spatial predictors for empirical modeling of landslide susceptibility in the Darjeeling Himalayas (India). Geomorphology 131 (1–2), 35–56. https://doi.org/10.1016/j.geomorph.2011.04.019 (2011).
Article ADS Google Scholar
Yalcin, A. GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): comparisons of results and confirmations. Catena 72, 1–12. https://doi.org/10.1016/j.catena.2007.01.003 (2008).
Article Google Scholar
Magliulo, P., Di Lisio, A., Russo, F. & Zelano, A. Geomorphology and landslide susceptibility assessment using GIS and bivariate statistics: a case study in Southern Italy. Nat. Hazards. 47, 411–435. https://doi.org/10.1007/s11069-008-9230-x (2008).
Article Google Scholar
Ajin, R. S., Segoni, S. & Fanti, R. Optimization of SVR and catboost models using metaheuristic algorithms to assess landslide susceptibility. Sci. Rep. 14, 24851. https://doi.org/10.1038/s41598-024-72663-x (2024).
Article CAS PubMed PubMed Central ADS Google Scholar
Dhakal, D. et al. Enhancing landslide disaster prediction by evaluating Non landslide area sampling in machine learning models for Spiti Valley India. Sci. Rep. 15, 12242. https://doi.org/10.1038/s41598-025-95087-7 (2025).
Article CAS PubMed PubMed Central ADS Google Scholar
Liu, L. et al. Landslide data sample augmentation and landslide susceptibility analysis in Nyingchi City based on the MCMC model. Sci. Rep. 15, 25624. https://doi.org/10.1038/s41598-025-10651-5 (2025).
Article CAS PubMed PubMed Central ADS Google Scholar
Yadav, J., Dash, R. K. & Kanungo, D. P. Spatial prediction of landslides in Pithoragarh district, Kumaon Himalaya, India. J. Earth Syst. Sci. 134, 176. https://doi.org/10.1007/s12040-025-02625-y (2025).
Article ADS Google Scholar
Chauhan, V., Gupta, L. & Dixit, J. Landslide susceptibility assessment for Uttarakhand, a Himalayan state of India, using multi-criteria decision making, bivariate, and machine learning models. Geoenvironmental Disasters. 12, 2. https://doi.org/10.1186/s40677-024-00307-3 (2025).
Article Google Scholar
Sharma, P. et al. Assessing landslide susceptibility in the upper Ravi river catchment, Himachal Pradesh, india: a comprehensive analysis using the logistic regression model. Geoenviron Disasters. 12, 26. https://doi.org/10.1186/s40677-025-00327-7 (2025).
Article Google Scholar
Khan, I., Bahuguna, H. & Kainthola, A. Regional landslide susceptibility zonation utilizing bivariate statistical techniques in the Northwestern Himalayas, Jammu and Kashmir, India. J. Earth Syst. Sci. 133, 157. https://doi.org/10.1007/s12040-024-02367-3 (2024b).
Article ADS Google Scholar
Jasiewicz, J. & Stepinski, T. F. Geomorphons—a pattern recognition approach to classification and mapping of landforms. Geomorphology 182, 147–156. https://doi.org/10.1016/j.geomorph.2012.11.005 (2013).
Article ADS Google Scholar
Salmerón, R., García, C. B. & García, J. Variance inflation factor and condition number in multiple linear regression. J. Stat. Comput. Simul. 88, 2365–2384. https://doi.org/10.1080/00949655.2018.1463376 (2018).
Article MathSciNet Google Scholar
Salmerón, R., García, C. & García, J. Overcoming the inconsistencies of the variance inflation factor: a redefined VIF and a test to detect statistical troubling multicollinearity. ArXiv Preprint arXiv:2005 02245. https://doi.org/10.48550/arXiv.2005.02245 (2020).
Article Google Scholar
Ekiz, O. U. An improved robust variance inflation factor: reducing the negative effects of good leverage points. Kuwait J. Sci. 50, 1–12 (2023). https://journalskuwait.org/kjs/index.php/KJS/article/view/15533
MathSciNet Google Scholar
Jacob, J. & Varadharajan, R. Robust variance inflation factor: a promising approach for collinearity diagnostics in the presence of outliers. Sankhya B. 86, 845–871. https://doi.org/10.1007/s13571-024-00342-y (2024).
Article MathSciNet Google Scholar
Saaty, T. L. The Analytical Hierarchy Process: Planning, Priority Setting, Resources Allocation (McGraw-Hill, 1980).
Cheng, Q. & Wei, X. Application of the analytic hierarchy process (AHP) method in landslide susceptibility mapping: a case study of the three Gorges reservoir Area, China. Environ. Earth Sci. 79, 305. https://doi.org/10.1007/s12665-020-08996-4 (2020).
Article ADS Google Scholar
Malczewski, J. GIS and Multicriteria Decision Analysis (Wiley, 1999).
Lee, S. & Pradhan, B. Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models. Landslides 4, 33–41 (2007).
Article Google Scholar
Mamdouh, M. E. & Tarek, M. R. Application of the frequency ratio model for landslide susceptibility mapping in the nile delta region, Egypt. Environ. Earth Sci. 82, 278. https://doi.org/10.1007/s12665-023-10758-3 (2023).
Article Google Scholar
Yilmaz, I. Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks, and their comparison: A case study from Kat landslides (Tokat—Turkey). Comput. Geosci. 35, 1125–1138 (2009).
Article ADS Google Scholar
Keshri, D., Sarkar, K. & Chattoraj, S. L. Landslide susceptibility mapping in parts of aglar watershed, lesser himalaya based on frequency ratio method in GIS environment. J. Earth Syst. Sci. 133, 1. https://doi.org/10.1007/s12040-023-02204-z (2024).
Article ADS Google Scholar
Lee, C., Choi, S. & Lee, H. Application of the frequency ratio model for landslide susceptibility mapping in the Southern Korean Peninsula. Geomorphology 66, 51–65. https://doi.org/10.1016/j.geomorph.2004.08.003 (2004).
Article Google Scholar
Yule, G. U. On the method of correlation for contingency tables. Philos. Trans. R Soc. Lond. A. 212, 1–32. https://doi.org/10.1098/rsta.1912.0012 (1912).
Article Google Scholar
Gupta, V., Ram, P., Tandon, R. S. & Vishwakarma, N. Efficacy of landslide susceptibility maps prepared using different bivariate methods: case study from mussoorie Township, Garhwal himalaya. J. Geol. Soc. India. 99, 370–376. https://doi.org/10.1007/s12594-023-2319-8 (2023).
Article Google Scholar
Laltanpuia, Z. D., Martha, T. R., Rao, K. S. & Khanna, K. Bivariate statistical models for landslide susceptibility mapping at local scale in the Aizawl municipal area, Mizoram, India. Himal. Geol. 45, 1–15. https://doi.org/10.1007/s13550-024-00123-4 (2024).
Article Google Scholar
Zhang, Z., Xu, C. & Li, X. Landslide susceptibility mapping using yule’s coefficient method in a hilly region of China. Geomorphology 250, 89–101. https://doi.org/10.1016/j.geomorph.2015.09.003 (2015).
Article Google Scholar
Barman, J. & Das, J. Comprehensive landslide prediction mapping using bivariate statistical models of Mizoram state of Northeast India. J. Spat. Sci. 69, 157–179. https://doi.org/10.1080/14498596.2024.2001234 (2024).
Article Google Scholar
Milledge, D. G., Robinson, R. & Petley, D. N. Review of landslide inventories for Nepal between 2010 and 2021 reveals data gaps in global landslide hotspot. Nat. Hazards. 106, 2 (2024).
Google Scholar
Zhang, B., Tang, J., Huan, Y., Song, L., Shah, S.Y.A. & Wang, L. Multi-scale convolutional neural networks (CNNs) for landslide inventory mapping from remote sensing imagery and landslide susceptibility mapping (LSM). Geomatics, Natural Hazards and Risk https://doi.org/10.1080/19475705.2024.2383309 (2024).
Du, J., Li, Z. & Li, X. A hybrid model to overcome landslide inventory incompleteness issue for landslide susceptibility prediction. Environ. Geol. 83, 3 (2024).
Google Scholar
Mirus, B. B. et al. Landslides across the USA: occurrence, susceptibility, and data limitations. Landslides 17, 2271–2285. https://doi.org/10.1007/s10346-020-01424-4 (2020).
Article Google Scholar
Abbas, N. et al. Stress-deformation and stability challenges in Himalayan tunnels: impact of geological discontinuities. Discov Mater. 4, 72. https://doi.org/10.1007/s43939-024-00144-z (2024).
Article ADS Google Scholar
Halder, K. et al. Improving landslide susceptibility prediction through ensemble recursive feature elimination and meta-learning framework. Sci. Rep. 15, 5170. https://doi.org/10.1038/s41598-025-87587-3 (2025).
Article CAS PubMed PubMed Central ADS Google Scholar
Singh, A. et al. Ensembled transfer learning approach for error reduction in landslide susceptibility mapping of the data scarce region. Sci. Rep. 14, 29060. https://doi.org/10.1038/s41598-024-76541-4 (2024b).
Article CAS PubMed PubMed Central ADS Google Scholar
Khan, I., Kainthola, A., Bahuguna, H., Ahmed, R. & Abioui, M. Unravelling the impact of landslide inventory on landslide susceptibility in the Indian himalaya. Phys. Chem. Earth A/B/C. 139, 103930. https://doi.org/10.1016/j.pce.2025.103930 (2025).
Article Google Scholar
Zighmi, K. et al. AHP multi criteria analysis for landslide susceptibility mapping in the Tellian atlas chain. Sci. Rep. 15, 25747. https://doi.org/10.1038/s41598-025-10819-z (2025).
Article CAS PubMed PubMed Central ADS Google Scholar
Gupta, K. et al. Landslide susceptibility along National Highway-7 in the Himalayas using random forest-based machine learning tool. J. Earth Syst. Sci. 134, 74. https://doi.org/10.1007/s12040-025-02533-1 (2025).
Article ADS Google Scholar
Sazid, M. Analysis of rockfall hazards along NH-15: a case study of Al-Hada road. Geo-Engineering 10, 1. https://doi.org/10.1186/s40703-019-0097-3 (2019).
Article Google Scholar
Sarkar, R., Das, B. & Goswami, A. Landslide susceptibility mapping using AHP and GIS techniques: A case study from Darjeeling himalaya. J. Earth Syst. Sci. 128, 20. https://doi.org/10.1007/s12040-019-1086-7 (2019).
Article ADS Google Scholar
Sonker, I., Tripathi, J. N. & Swarnim Landslide susceptibility mapping using morphological and hydrological parameters in Sikkim himalaya: frequency ratio model and Geospatial technologies. Nat. Hazards. 120, 6797–6832. https://doi.org/10.1007/s11069-024-06491-7 (2024).
Article Google Scholar
Zhao, B., Zhang, L. & Zhang, J. Comprehensive landslide susceptibility map of central Asia. Nat. Hazards Earth Syst. Sci. 23, 2229–2245. https://doi.org/10.5194/nhess-23-2229-2023 (2023).
Article Google Scholar
Pyakurel, A., Dahal, B. K. & K.C., D. & Enhancing co-seismic landslide susceptibility, Building exposure, and risk analysis through machine learning. Sci. Rep. 14, 5902. https://doi.org/10.1038/s41598-024-54898-w (2024).
Article CAS PubMed PubMed Central ADS Google Scholar
Kainthura, P. & Sharma, N. Hybrid machine learning approach for landslide prediction, Uttarakhand, India. Sci. Rep. 12, 20101. https://doi.org/10.1038/s41598-022-22814-9 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Zhang, L., Li, X. & Zhang, Z. Comparative study on landslide susceptibility mapping based on machine learning models: A case study of the Longhai City, China. Sci. Rep. 13 https://doi.org/10.1038/s41598-023-33186-z (2023).
Singh, A., Pal, S. & Kanungo, D. P. An integrated approach for landslide susceptibility–vulnerability–risk assessment of Building infrastructures in hilly regions of India. Environ. Dev. Sustain. 23, 5058–5095. https://doi.org/10.1007/s10668-020-00804-z (2021).
Article Google Scholar
Kumar, P., Singh, B. & Gupta, R. GIS-based comparative analysis of landslide susceptibility models in the Indian Himalayas. Geosci. J. 26, 453–470. https://doi.org/10.1007/s12303-022-00458-3 (2022).
Article Google Scholar
Dahal, R. K. & Hasegawa, S. Rainfall-induced landslides and their hazard assessment in Nepal. Landslides 18, 587–603. https://doi.org/10.1007/s10346-021-01534-2 (2021).
Article Google Scholar
Chauhan, S., Sharma, M. & Jaitly, A. Machine learning-based landslide susceptibility mapping: a case study from the Western Himalayas. Nat. Hazards. 102, 1127–1145. https://doi.org/10.1007/s11069-020-03920-1 (2020).
Article Google Scholar
Pradhan, B. & Lee, S. Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and logistic regression modelling. Environ. Model. Softw. 25, 747–759. https://doi.org/10.1016/j.envsoft.2009.10.016 (2010).
Article Google Scholar
Lin, L., Lin, Q. & Wang, Y. Landslide susceptibility mapping on a global scale using the method of logistic regression. Nat. Hazards Earth Syst. Sci. 17, 1411–1424. https://doi.org/10.5194/nhess-17-1411-2017 (2017).
Article ADS Google Scholar
Titti, G. et al. Landslide susceptibility in the belt and road countries: continental step of a multi-scale approach. Environ. Earth Sci. 80, 630. https://doi.org/10.1007/s12665-021-09910-1 (2021).
Article ADS Google Scholar

Download references

Acknowledgements

We express my heartfelt gratitude to the GSI for providing publicly accessible landslide inventory and geological data, which was pivotal to our research. We express profound gratitude to the Head of the Department of Geology and the Research Advisory Committee of Banaras Hindu University, Varanasi, for their important direction, astute input, and unwavering support during this work.

Author information

Authors and Affiliations

Geoengineering and Computing Laboratory, Department of Geology, Banaras Hindu University, Varanasi, 221005, India
Imran Khan & Ashutosh Kainthola
Geological Survey of India, CHQ Kolkata, Kolkata, 700016, India
Harish Bahuguna
Divecha Center for Climate Change, Indian Institute of Science Bangalore, Bangalore, 560012, India
Rayees Ahmed
Department of Geography, University of Jammu, Jammu, 180006, India
Md. Sarfaraz Asgher

Authors

Imran Khan
View author publications
Search author on:PubMed Google Scholar
Harish Bahuguna
View author publications
Search author on:PubMed Google Scholar
Ashutosh Kainthola
View author publications
Search author on:PubMed Google Scholar
Rayees Ahmed
View author publications
Search author on:PubMed Google Scholar
Md. Sarfaraz Asgher
View author publications
Search author on:PubMed Google Scholar

Contributions

IK conceptualized and designed the study, processed and modeled the data, and prepared the initial draft of the manuscript. HB provided supervision and contributed to the interpretation of the findings. AK supervised the research and contributed to manuscript revision. RA critically reviewed and revised the manuscript with a focus on the Indian context. MSA supported editing and refinement of the manuscript’s overall presentation. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Ashutosh Kainthola or Rayees Ahmed.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Khan, I., Bahuguna, H., Kainthola, A. et al. National-scale landslide susceptibility and risk mapping of India using a hybrid data-driven approach. Sci Rep 16, 3519 (2026). https://doi.org/10.1038/s41598-025-33446-0

Download citation

Received: 19 July 2025
Accepted: 18 December 2025
Published: 31 December 2025
Version of record: 27 January 2026
DOI: https://doi.org/10.1038/s41598-025-33446-0

Subjects

Abstract

Similar content being viewed by others

A combined analysis of frequency ratio and analytical hierarchy process for landslide susceptibility assessment in Tenta, South Wollo, Ethiopia

Evaluation of landslide susceptibility of mountain highway based on RF and SVM models

Enhancing landslide disaster prediction by evaluating non landslide area sampling in machine learning models for Spiti Valley India

Introduction

Study area

Materials and methods

Multicollinearity

Analytic hierarchy process (AHP)

Frequency ratio (FR)

Yules coefficient (Yc)

Landslide inventory

Landslide predictor maps

Aspect

Convexity

Draiange

Earthquake

Fault

Geomorphon

LULC

Rainfall

Slope

Lithology

Results

Factors vs landslide distribution

Significance of factors

National landslide susceptibility zonation

Identification of the top 10 susceptible SUTs of India

Validation

Validation of maps through landslide distribution

Ground validation

Best fit model

National landslide risk map (NLRM) at qualitative scale

Discussion

Comparison with previous studies at global and National scale

Significance, limitations, uncertainty, and future research directions

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links