Abstract
Continuous time-series of land cover is critical for attributing runoff, sediment and carbon changes on the Chinese Loess Plateau (CLP). However, current land cover products with annal temporal resolution lack spatial identification accuracy, particularly in capturing authentic changes of cropland, forest and grassland. To address these issues, a 30 m annual land cover dataset was proposed by the Yellow River Conservancy Commission (YRCC_LPLC) for the CLP from 1990 to 2022. Different levels of land cover were classified using different combinations of spectral, monthly and annual temporal and topographic features and Random Forest classifier. Compared to other land cover products (45.64%–73.38%), the accuracy of YRCC_LPLC has a better performance with an overall accuracy of 85.16%. The YRCC_LPLC is capable of capturing not only the explicit spatial variation but also the change direction and change time of land cover, especially for the most critical conversion of cropland into forest and grassland induced by implementation of Grain to Green Program on the CLP.
Similar content being viewed by others
Background & Summary
Land cover plays a crucial role in the earth system, serving as a vital link between the biosphere, atmosphere, and hydrosphere1. It is also fundamental for simulating land surface processes and serves as a key variable in ecological and hydrological models2,3,4. Currently, human activities have increasingly caused dramatic land cover changes, impacting on water cycles, air quality, energy balance and biogeochemical cycles, biodiversity and the provision of ecosystem services5,6,7. Consequently, obtaining precise and continuous time series of land cover data over a long-term period is critical for understanding climate and environmental dynamics8 and studying the interplay between human activities and global changes9. With the rapid economic development in China over recent decades, the country has encountered numerous environmental challenges, including desertification, sandstorms, soil erosion, and land degradation10,11. The Chinese Loess Plateau (CLP) is situated in northwestern China, spanning approximately 64,000 square kilometers. The mean annual precipitation across the CLP are between 300 mm and 600 mm, with the mean annual temperature ranging from 4 °C to 14 °C. This region is a typical arid and semi-arid environment where water scarcity has become a critical constraint on the regional socio-economic development and ecological progress. Historically, both climate change and excessive land cultivation have caused severe ecological degradation and water and soil erosion in the CLP12,13,14, leading to serious sedimentation and severe socioeconomic issues. To mitigate devastating environmental problems, the Chinese government has initiated several ecological restoration programs, including the ‘Three Norths’ Shelterbelt Program (TNSP)15 and the Grain to Green Program (GTGP)16. As a result, significant land cover and land use changes have been observed in the CLP since the GTGP began in 199917, particularly in the loess hills and gully areas where there has been a notable decrease in cropland and an increase in forest and grassland18,19. Recently, it has been reported that land cover changes in the CLP contribute significantly to a marked reduction of runoff and sediment within the Yellow River Basin20,21, thus quantitatively assessing the impact of vegetation type changes on soil and water loss in the Loess Plateau has become a prominent research focus22,23. Consequently, land cover data has become essential for attributing the runoff and sediment reduction in tributaries and evaluating the ecological benefits of vegetation restoration efforts in the CLP.
There are several commonly used land cover products in the CLP, including the European Space Agency Climate Change Initiative (ESA-CCI) land cover product (300 m)24, the MCD12Q1 product (500 m)25, and the University of Maryland (UMD) land cover map (1 km)26. However, the spatial resolution of these products is too coarse to capture the highly heterogeneous land cover distribution caused by the fragmented terrain across the Loess Plateau. The free availability and accessibility of high-resolution Earth Observation (EO) data (e.g., Landsat)27 has facilitated fine-scale land cover monitoring on a large scale, leading to the development of many products with a resolution of 30 meters or higher. Some examples include the Finer Resolution Observation and Monitoring of Global Land Cover product (FROM-GLC) in 2010, 2015, and 201728, GlobeLand3029 in 2000, 2010, 2020, annual China land cover dataset (CLCD)30 from 1990 to 2022, Global land-cover product with fine classification (GLC_FCS30)31 for every five years from 1985 to 2000, and in every year from 2001 to 2022. However, it has been reported that current available land cover products still lack classification and change detection accuracy, particularly in capturing authentic changes of cropland, forest and grassland induced by ecological restoration projects over the CLP18,32, which have severely affected the unveil of response mechanisms of eco-hydrological processes to underlying surface change in the CLP, and have significantly limited the accuracy of flood and sediment simulation and prediction in the Yellow River Basin18. To address these challenges, the Yellow River Conservancy Commission Loess Plateau Land cover (YRCC-LPLC) annual dataset from 1990 to 2022 was generated by YRCC. In this dataset, water bodies and snow/ice are directly from the CLCD product, whereas terrestrial areas were reclassified using the proposed method. Land cover samples were firstly collected using the combination of CLCD, Landsat time-series stack, high resolution satellite imagery and Google Earth images. Then the different specific remote sensing features were input into the Random Forest (RF) classifier to hierarchically classify land cover types. Subsequently, the accuracy of YRCC_LPLC was evaluated using testing samples, and further compared with the widely used global and national land cover products to demonstrate its advantages of detecting spatial and temporal land cover variations. Based on the YRCC_LPLC dataset, the long-term trends in land cover changes and conversions were detected across the CLP over the past three decades.
Methods
Satellite data
Landsat satellites have been providing 30-m global Earth Observation data, making them a widely recognized data source for high-resolution and large-scale land cover and land change mapping. Firstly, all available Landsat 5, 7 and 8 surface reflectance (SR) time-series stacks were collected from the T1_L2 (USGS Level 2, Collection 2, Tier 1) dataset, which contains 4 visible and near-infrared (VNIR) bands and 2 short-wave infrared (SWIR) bands processed to atmospherically corrected and orthorectified surface reflectance, in the Google Earth Engine (GEE) platform. Secondly, cloud and cloud shadow pixels in the SR were automatically detected using the program code of CF-mask algorithm33 in the GEE to remove the pixels contaminated by cloud and cloud shadow. In addition, ALOS Global Digital Surface Model “ALOS World 3D-30m (AW3D30)”, which is able to accurately capture topographic changes over Chinese mountainous area34, were used as the digital elevation model (DEM) to derive other topographic features, as shown in the Table 1.
Land cover classification
Overall scheme
This study aimed to generate the YRCC-LPLC dataset through a comprehensive processing workflow, which included the generation of training and testing samples, the development of input features for different land cover types, the RF classification model, spatial-temporal consistency validation, and accuracy evaluation along with the comparison between different land cover products (Fig. 1). The temporal composite metrics were obtained using the GEE platform providing long-term Earth observation data, and samples for land cover mapping were collected using the combination of current land cover products, high resolution imagery and GEE. Six land cover types including cropland, forest, shrub, grassland, barren, impervious were classified using the RF algorithm. The YRCC-LPLC’s accuracy was validated using independent test samples. Additionally, the YRCC-LPLC was compared with the widely used global land cover products.
Training and testing samples collection
For the supervised large-scale land cover mapping, the accurate and sufficient training samples are critically important35. Referring to the study of Yang and Huang30, the visual interpretation method and automatic sample extraction via existing LC products were both used to collect training samples in this study. Firstly, since annual CLCD achieved a higher overall accuracy than other global land cover products and has been widely used in various studies in China30, it was taken as a valuable source for collecting samples. The pixels without land cover change during 1990–2022 were extracted based on this dataset. Then, the CLP region was divided into 986 grids with 30 km sides, and 30 samples were randomly collected within each grid from pixels without land cover change in last step. In such a way, we obtained a total of 29,580 candidate samples of the CLP. Thirdly, if NDVI time-series curve of a candidate sample was stable (there is no significant change trend in annual 25th, 50th, 75th percentile NDVI time-series), and its land cover type was the same as that visually interpretated using GF-2, BJ-2, Google Earth images from 2020 to 2022, this sample will be used to input classification model. For grid with less than 5 final samples, we visually interpretated some samples based on GF-2, BJ-2, Google Earth images and Landsat NDVI time-series trajectory in order to ensure uniform spatial distribution and diversity of all samples. It should be noted that we used Google Maps photo sphere to distinguish similar land covers (e.g. bare soil and dried grassland) when Google Earth images are not helpful for identifying land cover types. Finally, a total of 17,448 samples were finally determined from potential sample pool, and the training and testing samples at a ratio of 7:3 were illustrated in Fig. 2.
Input features for classification
The input features for classification were derived from spectral features and topographic characteristics. Different land cover types exhibit distinct reflectance characteristics across various spectral bands, allowing the use of band-specific reflectance to distinguish between them. To maximize the utility of Landsat data across all temporal phases, the 50th percentile values of all available epochs were calculated for each spectral bands (Blue, Green, Red, NIR, SWIR1 and SWIR2) year by year. Due to spectral indexes can effectively enhance the difference among different vegetation types36, NDVI (Normalized Difference Vegetation index)37, NDMI (Normalized Difference Moisture Index)38, NDTI (Normalized Difference Tillage Index)39, NBR (Normalized Burnt Ratio)40 and EVI (Enhanced Vegetation index)41 were composited into the 50th percentiles using the annual composite method. Because the annual maximum NDVI (NDVImax) can effectively differentiate between vegetation and non-vegetation, as well as various vegetation types30,42, the SR of different spectral bands corresponding to the maximum NDVI (Red_NDVImax) were also used as the input features.
Given that the spectral characteristics of various land covers fluctuate throughout the year, the standard deviation (SD) of NDVI and NBR was also calculated to reflect the phenological information. It should be noted that based on prior knowledge about the spectral phenology characteristics of cropland and forest-grass vegetation on the Loess Plateau18, we used the monthly maximum NDVI in May and its corresponding SR values as input features to enhance the classification accuracy of cropland and forest-grass vegetation. Currently, numerous studies have shown that DEM and its derived variables, such as slope and aspect, are essential and significant auxiliary variables for land cover mapping30,43. Due to spatial distribution of vegetation types depends on local site condition determined by soil moisture, temperature and solar radiation in the hilly and gully areas, topographic wetness index (TWI)44 and illumination local angle (Cosi)45 also were considered to improve classification accuracy. Detailed feature descriptions were shown in the Table 2.
Classification and post-processing
To minimize classification errors for each land cover type to the greatest extent possible, we used a hierarchical classification strategy to classify terrestrial area of CLP, which could control the classification errors in different levels18, as illustrated in Fig. 1. The RF classifier is widely employed for large-scale land cover mapping36 because of its numerous advantages, including the capability to manage high-dimensional input features, tolerance to sampling errors, and robustness in the presence of missing data46,47. Therefore, the RF classifier was utilized to produce the YRCC-LPLC. Terrestrial area was firstly identified from CLCD, and then forest/grassland, cropland and non-vegetation were classified using the combination of NDVImax in May and its corresponding RS, annual NDVImax and its corresponding RS, and topographic features, Finally forest, shrub, grassland, barren and impervious were further classified based on annual NDVImax and its corresponding SR, annual 50th percentile composites in vegetation indexes and SR, annual SD of vegetation indexes and topographic features.
To ensure the consistency of the classification across spatial and temporal variations, post-classification processing workflow including spatial filter and temporal logical reasoning was individually used to refine the YRCC-LPLC data. Due to the samples used in this study were visually interpreted by high resolution imagery mostly acquired in 2022, the classification for 2022 using these samples must be the most accurate among 30 years. Firstly, A 2-D Gaussian low pass filter with 3 × 3 filtering window48 was applied into land cover map of 2022 to reduce some noise induced by misclassification. In the process of temporal filtering for land cover products, the land cover from 2022, which has higher confidence, should be used as a benchmark to filter the land cover data from earlier years. This ensures that the data from previous years are adjusted based on the more reliable classification from 2022. Therefore, we then considered the classification result of 2022 filtered by Gaussian low pass filter as benchmark, and temporal logical reasoning with a sliding 3-year window was employed from 2019 to 1990. Specifically, for year t, if the land cover labels for years t−2, t−1, and t within 3-year window were inconsistent, the label for year t would be revised into that of year t + 1; if the land cover labels for years t−2, t−1, and t were the same, the label for year t would remain unchanged.
Accuracy assessment
5234 testing samples (Fig. 2(b)) were employed to evaluate YRCC-LPLC, which are independent with the RF training samples. The accuracy of YRCC-LPLC was evaluated using confusion matrix which is a table used to evaluate the performance of a classification algorithm. Confusion matrix shows the number of correct and incorrect predictions made by the model, segmented by each class. It helps to visualize the accuracy of the model and identify where it is making errors. From the confusion matrix, various performance metrics including producer’s accuracy (PA), user’s accuracy (UA), overall accuracy (OA), and Kappa coefficient were derived. Since several land cover types in the study area have undergone significant changes, the F1 score was used to assess the classification accuracy of these specifical types that have experienced substantial changes. The F1 score reflects the balance between UA and PA, with a maximum value of 1 indicating the best accuracy and a minimum value of 0 indicating the worst49,50. The calculation formula is as follows:
For more comprehensive quality evaluation, we intercompared the YRCC-LPLC with four state-of-the-art global or national land cover products, including the MCD12Q125, ESACCI24, Globeland3051, CLCD30 and GLCFCS3031, as demonstrated in the Table 3. To ensure comparability with the YRCC-LPLC, these four products were reclassified according to the YRCC-LPLC classification system to enable a more straightforward comparison. In addition, high-resolution google earth images were overlaid on different land cover products to evaluate their local classification performance.
Data Records
The annual land cover classification data from 1990 to 2022 is available at Zenodo (https://doi.org/10.5281/zenodo.10225564)52. The ZIP file labeled by different year (e.g. 2020_landcover.zip) contains Geotiff file with a spatial reference of the WGS84 coordinate system. The accompanying Excel file, “landcover_classificationsystem.xlsx”, provides a comprehensive description of the land cover classification system utilized in the dataset. Different integers correspond to various types of land cover: 1 cropland; 2 forest; 3 shrub; 4 grassland; 5 water; 6 ice/snow; 7 barren; and 8 impervious. This classification system is similar to that of CLCD30 and can be easily reclassified to align with the FAO (Food and Agriculture Organization) classification system. The Geotiff data can be imported into standard remote sensing processing software and geographical information system software (e.g., ENVI and ArcGIS), and can be easily read by MATLAB, IDL, Python, etc. Figure 3 shows the land cover maps of 1990, 1995, 2000, 2010, 2015, 2022. Figure 4 shows the change time of specific land cover derived from this dataset.
Technical Validation
Accuracy assessment of YRCC_LPLC
The classification accuracy of different land cover product was first assessed by 5234 testing samples for each year, and the multi-year average of accuracy evaluation metrics for each land cover product was calculated respectively, as illustrated in Table 4. It should be noted that due to the difficulty in obtaining accurate information of changed land cover (including change time, land cover before change and land cover after change), we only obtained the samples of unchanged land cover. Therefore, the assessment using these samples represents accuracy in characterizing spatial distribution of land cover, whereas the accuracy of temporal change detection cannot be directly assessed.
Overall, the accuracy of YRCC_LPLC has a better performance with OA of 85.16% compared to the CLCD (73.27%), the GLC_FCS30 (70.38%), the Globeland30 (65.86%) the ESACCI (57.16%) and the MCD12Q1 (45.64%), respectively, as shown in Table 4. Specifically, cropland had the highest average F1 score at 90.39%, followed by the grassland (86.91), barren (85.4%)and impervious (83.41) classes. Both grassland and cropland showed relatively high accuracy, with mean F1 scores exceeding 86%. Although YRCC_LPLC didn’t always have the highest UA or PA compared to other products, it attained the highest F1 scores across nearly all land cover types. For the land covers with a relatively large proportion in area, such as cropland, forest and grassland, CLCD and GLC_FSC30 exhibited higher F1 scores with respect to the Globeland30, ESACCI and MCD12Q1, whereas YRCC_LPLC demonstrated higher F1 score than CLCD and GLC_FCS30 across most land cover types, with particularly notable improvements in cropland and grassland, which underwent the most significant changes in the CLP.
Comparison with other landcover products
Spatial distribution of land covers
Except for the quantitative assessment, four regions and their local enlargements, covering various climate and landscape environment, were selected to directly illustrate the performance of each land-cover product in Fig. 5. The MCD12Q1 completely fails to capture the spatial heterogeneity of land cover, classifying nearly all areas as grassland. In contrast, the ESACCI can reflect the spatial distribution of land cover on a large scale, but it still exhibits noticeable classification errors. From the perspective of land cover diversity, it was obvious that the YRCC_LPLC, CLCD, Globeland30 and GLC_FCS30 had significant advantages over the other two products. Although Globeland30 can depict a more detailed spatial distribution of land cover, it over-identifies cropland and fails to identify the forest and grass vegetation derived from cropland retirement. The overall patterns of land cover types of GLC_FCS30, CLCD, and YRCC_LPLC are similar, however, there are still noticeable differences in the spatial distribution of various land cover types between them.
Given the combined effects of climate conditions and human activities on crop growth, along with its fragmented distribution and rugged terrain, accurately classifying cropland is quite challenging. Consequently, we chose two cropland areas in typical hilly regions for comparative analysis. On Google Earth imagery (2020-8-28), the extent of terraced fields is clearly visible. Ground surveys and prior knowledge confirm that crops such as corn and potatoes are grown on these terraced fields, allowing them to be identified as cropland. As shown in Fig. 6, Globeland30 classifies a significant amount of forest and grass vegetation as cropland, and both CLCD and GLC_FCS30 exhibit notable omission and misclassification issues. In contrast, YRCC_LPLC demonstrates a significantly superior performance in cropland classification compared to the other products, indicating that it can accurately depict the fine-scale spatial variations of different land cover types.
Temporal change of land covers
Based on different land cover products, we analyzed the dynamic changes in the area proportions of various land cover types from 1990 to 2022. As shown clearly in Fig. 7, the changes of cropland, grassland, and barren land in YRCC_LPLC are more pronounced compared to other products. Over 32 years, the proportion of cropland in YRCC_LPLC decreased by 13.58% of the whole CLP, whereas the cropland proportion in CLCD decreased by only 3.69%. The proportions in Globeland30 and ESACCI remained relatively stable, while MCD12Q1 even showed an increasing trend in cropland area. Similarly, the proportion of grassland in YRCC_LPLC increased by 14.73%, while ESACCI and CLCD showed only slight increases of 1.94% and 1.55%, with relatively high values. GLC_FCS30’s grassland area remained largely unchanged, and MCD12Q1 and Globeland30 even exhibited a decreasing trend in grassland area. For barren land, YRCC_LPLC showed a reduction of 5.98% over 30 years, whereas CLCD, GLC_FCS30, MCD12Q1, and ESACCI showed slight decreases of 2.58%, 2.94%, 3.4%, and 1.27%, respectively. The barren land proportion in Globeland30 remained essentially unchanged. YRCC_LPLC also demonstrated some correlation with other products for some specific periods. For instance, after 2010, YRCC_LPLC’s cropland area closely aligned with GLC_FCS30 and CLCD, and in 2000, it was more similar to Globeland30. The grassland area in YRCC_LPLC after 2005 closely matched GLC_FCS30, and the barren land area after 2010 aligned with Globeland30. Shrubland exhibited a similar decreasing trend found in CLCD. These findings indicate that other products only reflected slight change of cropland, grassland and barren land that should be changed due to extensive implementation of GTGP and soil and water conservation efforts49. In contrast, the YRCC_LPLC can accurately captured these obvious land cover changes over the CLP during 35 years.
Additionally, we selected two 30-m products covering 1990 and 2020 (CLCD and GLC_FCS30) for a detailed local comparison with YRCC_LPLC to evaluate their accuracy in detecting cropland loss in a typical area of land conversion from cropland to forest and grassland. Three typical and known GTGP areas (red polygon) were displayed in the Google Earth imagery (2020-8-28), as shown in the Fig. 8. From the distinct texture features of terraces and prior knowledge about cropland cultivation in this area, we can safely conclude that this area used to be cropland during the 1990s. In addition, the imagery also shows scattered tree canopies and large areas of grassland within the terraces, indicating that this area has been abandoned and reforested due to the implementation of GTFP. However, CLCD and GLC_FCS30 identified only a small portion of the farmland in 1990, and showed little change in the cropland by 2020. In contrast, YRCC_LPLC accurately identified the entire farmland area in 1990 and detected a significant loss of cropland by 2020 (Fig. 8). This demonstrates that YRCC_LPLC has a very high accuracy in detecting main land cover changes on the CLP.
Changed and unchanged land covers
Two 30-m products covering 1990 and 2022 (CLCD and GLC_FCS30) for a comparison with YRCC_LPLC to evaluate their accuracy in detecting unchanged and changed land covers. Figure 9 depict the spatial distribution of six unchanged land cover types including cropland, forest, shrubland, grassland, barren land, and impervious surfaces and four types of land cover changes including cropland to forest and grassland (C-Fsg), grassland to forest and shrubland (G-Fs), barren land to forest and grassland (B-Fsg), and other types to impervious surfaces (O-I). Figure 9(b–d) is the local zoomed-in image of changed land cover land unchanged land cover in a typical GFGP area. From the image, we can observe that cropland and grassland have undergone significant changes in the YRCC_LPLC, while the CLCD shows less change in land cover. Additionally, there is little change in land cover for GLC_FCS30. The statistical results for the area proportions of these different types are shown in Fig. 10. For the unchanged types, YRCC_LPLC has the highest proportion of cropland, accounting for 30%, followed by grassland at 27.3%, however, in CLCD and GLC_FCS30, the proportion of grassland is greater than that of cropland. For the changed types, the largest proportion in YRCC_LPLC is C-Fsg, making up 14.2%. As shown in the Fig. 9(c), C-Fsg was mostly distributed in the primary sediment-producing area in Northern Shaanxi region. B-Fsg was mainly located in the Mu Us Sandy Land and Kubuqi Desert, which accounts for 5.4% of the whole CLP. Obviously, it can be seen that the area proportions of these changed land covers are significantly higher than those in CLCD and GLC_FCS30. Overall, it can be inferred that land cover changes derived from YRCC_LPLC are more reliable in the context of implementing the GFGP over 30 years than those derived from CLCD and GLC_FCS30.
Chang year of cropland converting to forest and grassland
30-m annual land cover from 1990 to 2022 of CLCD was selected for a comparison with YRCC_LPLC to evaluate their accuracy in detecting change time of cropland converted into forest and grassland. For YRCC_LPLC, if the land cover in year t-1 is classified as cropland, and in year t it is classified as forest, grassland, or shrubland, then year t is defined as the change year. For CLCD, we utilized the abandoned cropland change detection method proposed by Zhang et al.50, and implemented an IDL program to automatically extract change year for each pixel. Figure 11 shows that the change years derived from YRCC_LPLC are primarily concentrated between 1998 and 2005, which aligns closely with the implementation period of China’s GTGP. Additionally, the areas of detected change largely correspond to the regions where GTGP was carried out. In contrast, the change years derived from CLCD are evenly spread across the period from 1991 to 2019, with most changes in the GTGP regions occurring after 2008, which clearly does not match the actual implementation timeline of GTGP. This finding also suggests that YRCC_LPLC offers superior accuracy in detecting land cover changes, particularly in identifying the critical land cover conversion of cropland into forests and grasslands on the Loess Plateau, compared to other existing products.
Code availability
The source code used the Python and IDL language. The source code contains four sections: changetime_class_filter.pro, detect_changetime.pro, clcd_landcover_change_time2.pro, RFClassify.py. The source code can be downloaded at https://github.com/wzh8588/YRCC_LPLC.
References
Herold, M., Mayaux, P., Woodcock, C. E., Baccini, A. & Schmullius, C. Some challenges in global land cover mapping: An assessment of agreement and accuracy in existing 1 km datasets. Remote Sens. Environ. 112, 2538–2556 (2008).
Wulder, M. A., Coops, N. C., Roy, D. P., White, J. C. & Hermosilla, T. Land cover 2.0. Int. J. Remote Sens. 39, 4254–4284 (2018).
Schewe, J. et al. State-of-the-art global models underestimate impacts from climate extremes. Nat. Commun. 10, 1–14 (2019).
Li, Z., Zhou, P., Shi, X. & Li, Y. Forest effects on runoff under climate change in the Upper Dongjiang River Basin: insights from annual to intra-annual scales. Environ Res Lett. 16(1), 014032 (2021).
Leng, G., Tang, Q. & Rayburg, S. Climate change impacts on meteorological, agricultural and hydrological droughts in China. Glob. Planet. Change. 126, 23–34 (2015).
Xiao, R. et al. Exploring the driving forces of farmland loss under rapid urbanization using binary logistic regression and spatial regression: A case study of Shanghai and Hangzhou Bay. Ecol. Indic. 95, 455–467 (2018).
Li, J., Gao, Y. & Huang, X. The impact of urban agglomeration on ozone precursor conditions: A systematic investigation across global agglomerations utilizing multisource geospatial datasets. Sci. Total Environ. 704, 135458 (2020).
Yang, Q., Huang, X. & Tang, Q. The footprint of urban heat island effect in 302 Chinese cities: Temporal trends and associated factors. Sci. Total Environ. 655, 652–662 (2019).
Andrew, K. S. et al. Environmental science: Agree on biodiversity metrics to track from space. Nature. 523, 403–405 (2015).
Liu, J. G. & Diamond, J. China’s environment in a globalizing world. Nature 435, 1179–1186 (2005).
He, G., Lu, Y., Mol, A. P. & Beckers, T. Changes and challenges: China’s environmental management in transition. Enniron Dev 3, 25–38 (2012).
Zheng, F. L. Effect of vegetation changes on soil erosion on the Loess Plateau. Pedosphere 16, 420–427 (2006).
Liu, B. et al. The assessment of soil loss by water erosion in China. Int Soil Water Conse 8(4), 430–439 (2020).
Feng, X. et al. Revegetation in China’s Loess Plateau is approaching sustainable water resource limits. Nat. Clim. Change. 6, 1019–1022 (2016).
Zhang, Y. et al. Multiple afforestation programs accelerate the greenness in the ‘Three North’ region of China from 1982 to 2013. Ecol. Indic. 61, 404–412 (2016).
Liu, D. et al. The contribution of China’s Grain to Green Program to carbon sequestration. Landsc. Ecol. 29, 1675–1688 (2014).
Zhao, A. Z. et al. Spatiotemporal variation of vegetation coverage before and after implementation of Grain for Green Program in Loess Plateau. China. Ecol. Eng. 104, 13–22 (2017).
Wang, Z. H. et al. Continuous Change Detection of Forest/Grassland and Cropland in the Loess Plateau of China Using All Available Landsat Data. Remote Sens 10, 1775 (2018).
Li, J., Peng, S. & Li, Z. Detecting and attributing vegetation changes on China’s Loess Plateau. Agric. For. Meteorol. 247, 260–270 (2017).
Wang, S., Fu, B. J., Liang, W., Liu, Y. & Wang, Y. F. Driving forces of changes in the water and sediment relationship in the Yellow River. Sci. Total Environ. 576, 453–461 (2017).
Liu, X. Y., Yang, S. T., Jin, S. Y., Luo, Y. & Zhou, X. The method to evaluate the sediment reduction from forest and grass land cover large area in the Loss hilly area. J. Hydraul. Eng. 45, 135–141 (2014).
Liu, X. Y. et al. Influences of shrubs-herbs-arbor vegetation coverage on the runoff based on the remote sensing data in Loess Plateau. Acta Geogr. Sin. 69, 1595–1603 (2014).
Wang, S. et al. Reduced sediment transport in the Yellow River due to anthropogenic changes. Nat. Geosci. 9, 38–41 (2017).
Harper, K. L. et al. A 29-year time series of annual 300 m resolution plant-functional-type maps for climate models,. Earth Syst. Sci. Data. 15, 1465–1499 (2023).
Sulla-Menashe, D., Gray, J. M., Abercrombie, S. P. & Friedl, M. A. Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product. Remote Sens. Environ. 222, 183–194 (2019).
Hansen, M. C., DeFries, R. S., Townshend, J. R. G. & Sohlberg, R. Global land cover classification at 1 km spatial resolution using a classification tree approach. Int. J. Remote Sens. 21, 1331–1364 (2000).
Woodcock, C. E., Allen, R. G. & Anderson, M. C. Free access to Landsat imagery. Science 320, 1011 (2008).
Gong, P. et al. Stable Classification with Limited Sample: Transferring a 30-m Resolution Sample Set Collected in 2015 to Mapping 10-m Resolution Global Land Cover in 2017. Sci Bull 64, 370–373 (2019).
Chen, J. et al. Global land cover mapping at 30m resolution: A POK-based operational approach. ISPRS J. Photogramm. 103, 7–27 (2015).
Yang, J. & Huang, X. The 30 m annual land cover dataset and its dynamics in China from 1990 to 2019. Earth Syst. Sci. Data. 13, 3907–3925 (2021).
Zhang, X. et al. GLC_FCS30: global land-cover product with fine classification system at 30 m using time-series Landsat imagery. Earth Syst. Sci. Data. 13, 2753–2776 (2021).
Cheng, K. et al. Mapping China’s planted forests using high resolution imagery and massive amounts of crowdsourced samples. ISPRS J. Photogramm. 196, 356–371 (2023).
Zhu, Z. & Woodcock, C. E. Object-based cloud and cloud shadow detection in Landsat imagery. Remote Sens. Environ. 118, 83–94 (2012).
Liu, K. et al. Global open-access DEM performances in Earth’s most rugged region High Mountain Asia: A multi-level assessment. Geomorphology 338, 16–26 (2019).
Foody, G. M. & Mathur, A. Toward intelligent training of supervised image classifications: Directing training data acquisition for SVM classification. Remote Sens. Environ. 93, 107–117 (2004).
Zhang, X. et al. Development of a global 30 m impervious surface map using multisource and multitemporal remote sensing datasets with the Google Earth Engine platform. Earth Syst. Sci. Data. 12, 1625–1648 (2020).
Tucker, C. J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 8, 127–150 (1979).
Wilson, E. H. & Sader, S. A. Detection of Forest Harvest Type Using Multiple Dates of Landsat TM Imagery. Remote Sens. Environ. 80, 385–396 (2002).
Gelder, B. K., Kaleita, A. L. & Cruse, R. M. Estimating mean field residue cover on midwestern soils using satellite imagery. Agron. J. 101, 635–643 (2009).
Miller, J. D. & Thode, A. E. Quantifying burn severity in a heterogeneous landscape with a relative version of the delta Normalized Burn Ratio (NBR). Remote Sens. Environ. 109, 66–80 (2007).
Huete, A., Justice, C. & Van Leeuwen, W. MODIS vegetation index (MOD13). Algorithm theoretical basis document 3, 213 (1999).
Li, W. et al. Integrating Google Earth imagery with Landsat data to improve 30-m resolution land cover mapping. Remote Sens. Environ. 237, 111563 (2020).
Gomariz-Castillo, F., Alonso-Sarría, F. & Cánovas-García, F. Improving Classification Accuracy of Multi-Temporal Landsat Images by Assessing the Use of Different Algorithms, Textural and Ancillary Information for a Mediterranean Semiarid Area from 2000 to 2015. Remote Sens 9, 1058 (2017).
Sørensen, R., Zinko, U. & Seibert, J. On the calculation of the topographic wetness index: evaluation of different methods based on field observations. Hydrol. Earth Syst. Sci. 10, 101–112 (2006).
Gu, D. & Gillespie, A. Topographic Normalization of Landsat TM Images of Forest Based on Subpixel Sun-Canopy-Sensor Geometry. Remote Sens. Environ. 64(2), 166–175 (1998).
Bauer, E. & Kohavi, R. Empirical comparison of voting classification algorithms: bagging, boosting, and variants. Mach. Learn. 36, 105–139 (1999).
Wessels, K. J. et al. Rapid land cover map updates using change detection and robust random forest classifiers. Remote Sens 8, 888 (2016).
Tu, Y. et al. A 30 m annual cropland dataset of China from 1986 to 2021. Earth Syst. Sci. Data. 16, 2297–2316 (2024).
Li, C. et al. The first all-season sample set for mapping global land cover with Landsat-8 data. Sci. Bull. 62, 508–515 (2017).
Zhang, M. et al. Reveal the severe spatial and temporal patterns of abandoned cropland in China over the past 30 years. Sci Total Environ 857, 159591 (2023).
Chen, J. & Chen, J. GlobeLand30: Operational global land cover mapping and big-data analysis. Sci. China Earth Sci. 61, 1533–1534 (2018).
Wang, Z. et al. A 30 m annual land cover dataset of Chinese Loess Plateau from 1990 to 2020. Zenodo https://doi.org/10.5281/zenodo.10225564 (2023).
Acknowledgements
This research was funded by the National Key Research and Development Program of China (2023YFC3209303), National Natural Science Foundation of China (U2243210), National Key Research and Development Program of China (2022YFC3205200).
Author information
Authors and Affiliations
Contributions
Z.W. led the study and designed the work. X.S. organized the student team and revised the manuscript. S.D. performed the analysis, and contributed to the writing of the manuscript. M.C. and L.M. developed the classification algorithms, wrote partial code for change detection and assessment of land cover, and drafted the manuscript. L.M. participated in the annotation and review of the development dataset.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, Z., Shi, X., Dou, S. et al. The 30 m land cover dataset for capturing land cover changes induced by ecological restoration from 1990 to 2022 on the Chinese Loess Plateau. Sci Data 12, 252 (2025). https://doi.org/10.1038/s41597-025-04575-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-025-04575-y
This article is cited by
-
The impact of the Grain-for-Green Programme on carbon storage in the Upper Yangtze River Basin based on the PLUS-InVEST model
Carbon Balance and Management (2025)