Introduction

Landform is a dominant factor governing the occurrence and development of surface processes on the Earth, reflecting the objective geometrical characteristics of the surface and the ultimate expression of the long-term integrated action of the Earth’s internal and external forces on the surface1,2. Landform classification divides landforms based on certain classification principles and indicators and uses specific classification methods, which define geographic boundaries for the study of geomorphology, hydrology, soil science, and other related fields3. As a basic theoretical issue in geomorphology, landform classification reflects the morphology, genesis, and development of landforms, and serves as the foundation for regional landform analysis4. Hence, landform classification holds great significance in optimizing resource allocation, coordinating regional economic development and other issues of national economic construction.

The Tibetan Plateau region is a hotspot for landform classification research in China, receiving extensive attention due to its complex landform types and special geographic location5,6. The complex and varied topography of the region is a product of strong plate movements and geotectonic activities, mapping rich dynamic information on the uplift and evolution of the plateau. Moreover, the Tibetan Plateau, known as the “Water Tower of Asia”, is an essential birthplace of water resources in China, and the spatial distribution of its landform has an important impact on infrastructure construction and protection, water resource security, and ecosystem changes, etc7,8,9. Accordingly, landform classification research for this region is on the rise.

Current landform classification methods can be broadly categorized into pixel-based approaches and object-based approaches10. Pixel-based methods classify landforms at the level of individual grid cells using spectral features and topographic variables. Common techniques include threshold-based rules, supervised and unsupervised classification, clustering analysis, and machine learning algorithms11,12,13,14. However, these approaches often neglect spatial context and overall morphological patterns, resulting in fragmented or noisy outputs15. Object-based methods group pixels into meaningful landform units through image segmentation or terrain partitioning, and then perform classification at the object level16. Techniques include segmentation–merging strategies17, rule-based or knowledge-driven classification18, as well as recent applications of deep learning and knowledge-graph approaches19,20. Compared with pixel-based methods, object-based approaches take spatial context and morphological structures into account, producing results that are more consistent with the semantic understanding of landforms21. Nevertheless, they are often more complex and require careful parameterization and prior knowledge. To summarize, pixel-based methods emphasize local features and are well-suited for rapid, large-area classification, while object-based methods highlight spatial organization and semantic meaning, thereby providing more realistic representations of geomorphological structures.

Concentrating on the Tibetan Plateau, landform classification can be summarized into two main categories: region-oriented comprehensive landform classification and thematic map-oriented sectoral landform classification studies. In region-oriented landform classification studies, Lin et al.22, Xiao et al.23, and Xu et al.24 have classified landform morphology across the Tibetan Plateau. For thematic landform classification studies, scholars have focused on deciphering and analyzing single landform types or landform objects such as aeolian landforms25, periglacial landforms26, and glaciers27. These studies have explained the spatial distribution of landforms on the Tibetan Plateau to some extent. However, no effective consensus has been reached on the classification system and classification methods for landforms on the Tibetan Plateau. Especially for the complex landform unit of the Tibetan Plateau, the results of the existing landform classification make it difficult to reveal the macroscopic features of the landform effectively, and the data accuracy can hardly meet the needs of detailed landform research. Besides, the existing landform classification methods produce a lot of fragmented units when applied to high-precision DEMs, making it challenging to ensure the integrity of the landform objects. Hence, there is an urgent need to innovate landform classification methods further to realize high-precision classification in the complex landform region of the Tibetan Plateau.

On this basis, this paper takes the Qinghai-Tibetan Plateau as the study area, and the research objectives include (1) constructing a new method of landform classification based on the DEM and completing the division of landform types under the premise of considering the global topographic features and the integrity of the landform objects; (2) analyze and quantify the variability in the spatial distribution of landform types in the study area. This study aims to deepen the knowledge of macro-geomorphic patterns and spatial differences in the complex landform area of the Tibetan Plateau and to enrich the theories and methods of geomorphic classification-related studies and digital terrain analysis.

Study area and data sources

Study area

The Tibetan Plateau is the highest globally known as the “third pole”28. In this study, the Tibetan Plateau boundary data compiled by Zhang et al. were utilized to define the extent of the study area, which is the latest and most accurate Tibetan Plateau boundary data available29. According to this extent, the total area of the Tibetan Plateau is 3.0 × 106 km2, geographically located between 25°59′30″N and 40°1′0″N, 67°40′37″E and 104°40′57″E (Fig. 1), and covers nine countries, including China, India, and Pakistan30. The global topography is characterized by high elevations in the northwest and low elevations in the southeast. Mountain ranges with different orientations intersect, dividing the Tibetan Plateau into numerous basins, broad valleys, and lakes, thereby forming a complex and varied plateau landscape31.

Fig. 1
figure 1

Source: generated by the author of this work using ArcGIS Pro 3.5 (https://pro.arcgis.com).

Location map of the study area.

Data sources

  1. 1.

    DEM data.

The digital elevation model (DEM) is one of the key data sets for landform classification. In this study, the FABDEM (Forest and buildings removed Copernicus DEM) with a horizontal resolution of 30 m is employed, which is a secondary deep processing of the Copernicus DEM (Copernicus GLO-30 DEM) and utilizes the machine learning technique of a random forest regression model to remove the height bias of buildings and trees from the Copernicus DEM. The processed FABDEM dataset reduces the average absolute vertical error from 1.61 to 1.12 m in the built-up area and from 5.15 to 2.88 m in the forested area, providing the most recent and best-quality open-source DEM data available32,33,34.

  1. 2.

    Landform map materials.

The landform maps produced by previous researchers are essential references and validation materials for this paper. Currently, there are several sets of landform atlases of the Tibetan Plateau based on different scales and classification systems, among which the Geomorphological Atlas of the People’s Republic of China (1:1,000,000) by Zhou et al. has the highest recognition and popularity35. The dataset is completed through remote sensing interpretation and GIS library building methods based on TM/ETM + image data from Landsat remote sensing satellites, a DEM, 1:250,000 base map, geological maps, historical geomorphological mapping information, and other data. The dataset has been widely used and generally recognized as authoritative since its release. Therefore, the classification criteria and validation of the results in this paper are based on this dataset.

Methods

Segmentation of mountains and plains based on slope-cost distance

In landform classification, plains and mountains provide the basic combination of land surface slope patterns, representing the two most fundamental macroforms of land surface morphology, and the effective division of these two is the basis of landform classification. Plain represents a fundamental geomorphic concept with particular significance in terrestrial landscapes. Despite its widespread usage, the term lacks a rigorous scientific definition and is understood differently across studies. Following the definition in the Dictionary of Geomorphology: an extensive land surface with gentle relief, lower in elevation than plateaus and less undulating than hills36. This study employs a broad notion of plain, including various genetic types (e.g., fluvial, aeolian) as well as gently sloping landforms such as alluvial fans and terraces. After excluding plain from the entire region, the residual areas are delineated as mountains. Cost distance is the calculation of the minimum cumulative cost distance from each image pixel on the element source (element raster) to the cost surface. It is typically used to create the minimum cost path between the source and the target37,38. The cost distance method has also been employed in the extraction of certain landform features, such as dunes39, catchments40, karst landforms41, and loess landforms42. The principle of plains and mountains segmentation based on cost distance is that in areas with small slope values, the cumulative value of slope cost distance increases slowly. When the image pixel moves to the foothill line area, the slope value increases significantly, and the cumulative slope distance rises abruptly. The flow of the cost distance algorithm is shown in Fig. 2.

Fig. 2
figure 2

Schematic flow of the cost-distance algorithm.

For the slope cost distance constructed in this paper, the value of the image pixel on the slope cost distance raster reflects the value of the slope depletion when moving from a specific image pixel (source image pixel). This study utilizes the streamflow network extracted from DEM-based hydrological analysis as the input source and the slope layer as the cost raster within the ArcGIS Pro platform. The constructed slope cost distance surface reflects the cost of slope depletion required to reach a specific raster pixel from the source image element. Larger cost values indicate greater cumulative slope values. In practical implementation, the extraction of the streamflow network was conducted in ArcGIS Pro through the sequence of “Fill–Flow Direction–Flow Accumulation” operations to generate a flow accumulation layer. Subsequently, a suitable threshold was applied to the flow accumulation layer using the ‘Raster Calculator’ to delineate the river network. The choice of threshold influences the density of the extracted river network and introduces a certain level of uncertainty. In this study, the threshold of flow accumulation was determined to be 8,000 after iterative adjustments combined with visual inspection. Additionally, it was necessary to determine an appropriate slope cost depletion threshold where the image elements must start from the source image elements and accumulate along the rasters with smaller slopes until that threshold is reached. Since the terrain is flatter where the slope value is lower, these areas formed are the extent of the plains.

Calculation method of mountain relief index using the mountain unit object

Mountain units are the direct object of classification when classifying mountains and are physical units of geography that actually exist. Referring to the watershed segmentation concept, if the DEM is inverted to obtain an inverse DEM, the hydrological characteristics of the inverse terrain are opposite to those of the original terrain43. The watershed boundary line extracted from the inverse terrain, which corresponds to the sink line of the original terrain, divides the inverse terrain into a closed area without water inflow or outflow. This area corresponds to the hilltop control area of the original terrain, thus realizing the natural dissection of the terrain. The dissected independent surface object is the mountain unit, and the entire process is shown in Fig. 3.

Fig. 3
figure 3

Schematic diagram of mountain unit segmentation.

Mountain relief is defined as the difference between the elevation of each raster pixel within a given mountain unit and the minimum elevation value for that mountain unit. Since the segmentation of the mountain unit under the constraint of the river network ensures that the starting point of a single mountain unit is located on an intermountain valley, the lowest value of the elevation within the mountain unit can be regarded as the starting point of the mountain uplift, i.e., the datum for the development of the mountain. Under this calculation method, the value of the uplift at any point on the mountain reflects the difference in elevation relative to the base of the mountain44, reflecting the actual vertical change in the height of the mountain, which is more consistent with human perceptions than absolute elevation and topographic relief (Fig. 4).

Fig. 4
figure 4

Schematic diagram for calculating the mountain uplift index.

The formula for calculating the mountain relief is as follows:

$$MU{I_P}={P_{ele}} - {Q_{ele}}$$
(1)

Where MUIp is the value of mountain relief at point p, Pele is the value of absolute elevation at point p, and Qele is the value of absolute elevation at the calculated point within the mountain unit, i.e., the minimum elevation value within that mountain unit.

Criteria for landform classification

The classification criteria and nomination of landform types in this paper are consistent with the “Geomorphological Atlas of the People’s Republic of China (1:1000000)” by Zhou et al.45. According to altitude, the landforms are first divided into a six-level system: plains, hills, low mountains, middle mountains, high mountains, and the highest mountains. Secondly, this system is further subdivided according to mountain relief, thus constituting 22 secondary landform types on the Tibetan Plateau. The specific landform types and classification criteria are provided in Table 1.

Table 1 Classification criteria for basic landform types on the Tibetan Plateau.

Results

Spatial distribution pattern of plains and mountains in the study area

Plains and mountains are the main control structures of the landform system. Based on the slope cost distance method proposed in this study, effective classification of plains and mountains is realized by setting a slope cost distance threshold, and the results are shown in Fig. 5.

Fig. 5
figure 5

Source: generated by the author of this work using ArcGIS Pro 3.5 (https://pro.arcgis.com). Note: All areas not in green are mountains.

Classification results and localized map of the plains.

The extracted plains of the proposed method were compared with those delineated in the ‘Geomorphological Atlas of the People’s Republic of China (1:1,000,000)’ and further verified using remote sensing imagery. Since the Atlas only provides landform classification results for China, the comparison was limited to the Chinese part of the Tibetan Plateau (Fig. 6). The results indicate that the plains extracted in this study cover a broader extent and provide a more detailed representation. The proposed method is also effective in extracting river valley plains in mountainous areas, and even narrow river valley plains in steep canyons can be accurately extracted. The area statistics show that the area of plains is 739,438 km2, accounting for nearly 25% of the total study area; the area of mountains is 2,264,800 km2, accounting for 75%; and the ratio of mountains to plains in the Tibetan Plateau is about 3:1, with mountains occupying an absolutely dominant position.

Fig. 6
figure 6

Source: generated by the author of this work using ArcGIS Pro 3.5 (https://pro.arcgis.com).

Classification results and localized map of the plains.

Segmentation results of mountain units in the study area

The river network in the study area was extracted through hydrological analysis, and then the mountain unit division was constrained based on spatial topological relationships. This operation was executed in the ArcGIS platform. Finally, the results and local effects of mountain unit delineation in the Tibetan Plateau under the constraint of the river network are shown in Fig. 7.

Fig. 7
figure 7

Segmentation results of mountain units and localized maps.

The Tibetan Plateau region was finally divided into 178,286 mountain units, and the four indices of elevation, area, slope, and maximum height difference within the mountain units were measured. The results are shown in Fig. 8.

Fig. 8
figure 8

Statistical information on mountain units in the Tibetan Plateau. Note: The red dotted line represents the mean value of this group of data.

The sizes of the segmented mountain units ranged from 0.5 to 225.2 km2, with a mean value of 12.7 km2. As for the average elevation, due to its location on the highest plateau in the world, the average elevation value of the mountain unit is relatively high, with an overall average elevation of 4,346 m. The existence of a certain number of low hills leads to an average elevation value of no more than 2,000 m in some of the mountain units. The mountain unit has significantly higher average slope values, with an overall average of 15.5°. Certain differences exist in the range of elevation within the mountain unit, with elevation differences ranging from 0 to 5,135 m and an average of 728 m, which indirectly reflects the steep morphology of the mountains within the Tibetan Plateau.

Landform classification results based on mountain relief and elevation

Based on the mountain relief calculation results, the mountains in the Tibetan Plateau are categorized into five types: hills, low-relief mountains, middle-relief mountains, high-relief mountains, and highest-relief mountains. Their spatial distribution is shown in Fig. 9, and the statistical information is presented in Table 2.

Fig. 9
figure 9

Source: generated by the author of this work using ArcGIS Pro 3.5 (https://pro.arcgis.com).

Classification results of primary landform types in the study area.

The mountains of the Tibetan Plateau are dominated by the low-relief mountains (28%) and middle-relief mountains (27%), whereas the highest-relief mountains are small in size (1%) and show little difference between the areas of the other types of mountains. Significant variability was observed in the average relief, which reached 2108 m in the highest-relief mountain, far exceeding the high-relief (1190 m) and middle-relief mountains (646 m), and relatively small differences in relief between the hills (303 m) and low-relief mountains (110 m). In terms of spatial distribution, hills and low-relief mountains are fragmented and sporadically embedded, with a more concentrated distribution on the northern Tibetan plateau in the center. Middle-relief and high-relief mountains are distributed along the periphery of the study area in a circular pattern. The concentrated areas include the Karakoram Mountains in the northwest and the vicinity of the Hengduan Mountains in the southeast. In contrast, the highest relief mountains are concentrated in the southeast of the study area, and their spatial distribution is relatively sporadic due to the tiny area.

Table 2 Statistical information on the characteristics of primary landform types.

Mountain relief describes the steepness of a mountain; another topographic characteristic of mountains is height or elevation. According to the classification criteria for landform types, elevation is classified into levels and then spatially superimposed with the results of the mountain relief classification, thus obtaining the results of the second-level classification under the two indicators of elevation and mountain relief (Fig. 10).

Fig. 10
figure 10

Source: generated by the author of this work using ArcGIS Pro 3.5 (https://pro.arcgis.com).

Classification results of secondary landform types in the study area.

According to the statistical results of the secondary mountain classification on the Tibetan Plateau (Fig. 10), the study area is dominated by the three types of low-relief mountains (18%), middle-relief mountains (15%), and high-relief mountains (14%), which account for 47% of the total area of the mountain regions. Other mountain types show greater variability in spatial distribution.

Discussion

Uncertainty analysis of landform classification results

Uncertainty is one of the inherent properties of geography46,47. Because the main data source for landform classification is a DEM, its resolution directly affects the accuracy of the classification results48. The advantage of a high-resolution DEM lies in the ability to provide more topographic information, accurately reflecting small topographic changes, suitable for recognizing small-scale geomorphic features, and especially better performance in complex terrain. Yet, a high-resolution DEM has a large data load, longer computation and processing times, and high hardware requirements49. The advantage of a low-resolution DEM includes the ability to intuitively reflect the global topographic change characteristics and spatial differentiation laws in a large region, as well as the small data load, small processing workload, and ease of realization. However, the surface information expressed by a low-resolution DEM is coarse, making it easy to confuse similar landforms, resulting in insufficient analytical accuracy.

Regarding the Tibetan Plateau region, scholars tend to use 90-m resolution DEMs for landform classification, while some scholars use 1 km resolution DEMs. For example, Drăguţ and his colleagues produced a 1 km resolution global landform classification dataset based on an object-oriented approach50,51; Iwahashi et al. produced a global landform dataset using the MERIT DEM (Multi-error-removed improved-terrain DEM) resampling to 280-m resolution52. Some scholars also use 30-m resolution DEMs to classify and extract local areas and special landforms, such as landslides53,54, dunes25,55, and glaciers56,57. Considering data applicability, 1-km resolution DEMs are too coarse to obtain high-precision landform classification results. Thus, this study compares the results of landform classification at two resolutions: 30 m and 90 m. It is observed that in the experiment of local landform classification, the 30-m resolution can better distinguish between plain and mountain areas. The extracted plain has a wider range, including the intermountain valley plain, with some plains located near mountain summits (Fig. 11a). According to the definition of plain, relatively flat terrain near mountain summits should also be classified as a plain. In contrast, the plains extracted using the 90-m resolution DEM have a more limited extent (Fig. 11b), with part of the plain misclassified as hills. The recognition effect of different resolution DEMs on individual mountains also varies significantly. The 30-m resolution DEM shows finer recognition of mountain contours (Fig. 11c) and more complex shapes of mountain patches, and the low-elevation hilly areas in the northeast corner can also be recognized. However, in the 90-m resolution DEM, the mountain patch boundaries are simplified, and the patches are more extensive. The low-elevation hilly area in the northeast corner is not identified. The classification refinement is coarse (Fig. 11d). Upon comprehensive comparison, the landform classification results based on the 30-m resolution DEM are more refined, providing more accurate landform boundaries and revealing richer spatial pattern laws and knowledge of the landforms on the Tibetan Plateau.

Fig. 11
figure 11

Comparison of landform classification results based on 30-m and 90-m resolution DEMs. Note: (a) and (c) are the classification results based on the 30-m DEM, (b) and (d) are the classification results based on the 90-m DEM.

Limitations and future directions

This paper proposes an innovative automated method and technical framework for the classification of basic landform types, featuring strong generalizability and applicability at the global scale. The proposed approach can provide methodological support for related geomorphological studies. Furthermore, this study produced a 30-m resolution dataset of basic landform classifications for the Tibetan Plateau, effectively enhancing and supplementing the current data resources in this region. This dataset serves as a valuable reference for geological surveys, land resource investigations, landform evolution studies, and related research.

Although this method significantly improves upon the fragmented classification results and low automation levels inherent in traditional landform classification approaches, certain shortcomings still remain. Firstly, this study does not separately classify tablelands as a fundamental landform type but instead merges them into the hill category. This decision is based on the fact that, although tablelands have been mentioned as one of the basic landform types in many studies, the classification criteria remain ambiguous, and no authoritative and unified definition has been established. In the ‘Geomorphological Atlas of the People’s Republic of China (1:1,000,000)’45, tablelands are defined as having a relief generally greater than 30 m, while hills are defined as having a relief of less than 200 m. As can be seen, both tablelands and hills fall within the relief interval of 30 to 200 m; however, no explicit threshold distinguishing the two has been provided, making it challenging to separate tablelands from hills. Additionally, this study proposed a slope cost-distance approach as an alternative to the traditional slope-threshold segmentation method. The proposed approach effectively avoids the fragmentation and internal void effects frequently encountered in slope-threshold segmentation. Nevertheless, similar to the slope-threshold method, the slope cost-distance method still requires the specification of a threshold value to separate plains from mountains. In practice, the threshold is determined by iteratively adjusting the slope cost-distance value, while referencing hillshade maps and remote sensing imagery, to ensure that the segmentation boundary closely aligns with mountain margins. Although this visually assisted approach improves the accuracy of classification results, the objectivity and scientific basis of threshold setting still require further discussion. Future research will further explore classification methods specifically for tablelands and develop genetic landform classification approaches and workflows by integrating multi-source data such as remote sensing imagery, geological maps, and hydrological information.

Conclusion

This paper takes the Tibetan Plateau as the study area, and based on multi-source data such as DEMs, remote sensing images, geomorphological maps, and other data, comprehensively utilizes the ideology of object segmentation and classification oriented to geomorphological entities, digital terrain analysis, and other methods, and carries out systematic research on the indices and classification methods of landform classification. The main conclusions of this paper are as follows:

  1. (1)

    Methods of mountain relief index and slope-cost distance are proposed in this paper to realize the classification of landform types on the Tibetan Plateau. The new landform classification data are generated based on the FABDEM with a 30-m resolution, which significantly improves the quality of landform classification. Meanwhile, compared with the traditional image-element-oriented landform classification, the concept of landform entity-oriented classification improves the fragmentation phenomenon of the classification results and ensures the integrity of the landform classification objects; the calculation of topographic relief based on the mountain relief index avoids the “myopic” effect of the traditional neighborhood analysis, and more realistically reflects the global topographic changes.

  2. (2)

    Statistical analysis of the landforms of the Tibetan Plateau shows that the proportion of plains within the mountainous Tibetan Plateau has reached 25%, with a ratio of mountains to plains of nearly 3:1. Among the mountain types, low-relief mountains (21.1%) and middle-relief mountains (20.8%) dominate, with the smallest proportion of the highest-relief mountains (0.5%). In secondary landform types, mountains are dominated by high-elevation mountains, including low-relief high-altitude mountains (11.7%), middle-relief high-altitude mountains (11.1%), high-relief high-altitude mountains (10.2%), and high-altitude hills (9.0%), and the average elevation of mountains reaches 4,346 m. Among plain types, high-altitude plains are also dominant (15.5%).

  3. (3)

    Regarding the spatial distribution characteristics, the plain has a high degree of fragmentation and a low degree of spatial aggregation, and the overall distribution is characterized by an “from inside to outside” dispersion. The spatial distribution of the mountains, the Himalayas, the Kunlun Mountains, the Qilian Mountains, and the Hengduan Mountains, comprises the main “three horizontal and one vertical” elements of the overall topographic skeleton. This distribution pattern indicates that the landforms in the study area show significant interclass variability in spatial distribution and landscape characteristics, confirming the complex and diverse overall landscape pattern of the Tibetan Plateau.