High-resolution (10 m) dataset of multi-crop planting structure on the Loess Plateau during 2018–2022

Zhao, Xining; Wang, Jichao; Ding, Yelu; Gao, Xiaodong; Li, Changjian; Huang, Hongwei; Gao, Xuerui

doi:10.1038/s41597-025-05529-0

Download PDF

Data Descriptor
Open access
Published: 10 July 2025

High-resolution (10 m) dataset of multi-crop planting structure on the Loess Plateau during 2018–2022

Xining Zhao¹,
Jichao Wang^2,3,
Yelu Ding^2,3,
Xiaodong Gao¹,
Changjian Li¹,
Hongwei Huang^2,3 &
…
Xuerui Gao¹

Scientific Data volume 12, Article number: 1190 (2025) Cite this article

2356 Accesses
Metrics details

Subjects

Abstract

Accurate and timely information on planting intensity and crop rotation is essential for guiding agricultural policies and ensuring food security. However, reliable and up-to-date maps for major crops—wheat, maize, rapeseed, soybean, and potatoes—are lacking in the Loess Plateau, a key grain-producing region in western China. To address this gap, this study aims to generate a high-resolution (10 m) crop planting pattern dataset for the Loess Plateau from 2018 to 2022. The research methodology involved four key steps: (1) Enhancing the sample dataset using phenological indices and the Dynamic Time Warping (DTW) algorithm; (2) Identifying crop planting intensity based on phenological growth curves; (3) Developing independent random forest classifiers tailored to agricultural climate zones; and (4) Constructing an optimal feature subset for crop classification. The resulting maps demonstrated high overall accuracies (OA) is greater than 0.81, with satellite-based estimates showing strong agreement with municipal statistical data (R² ≥ 0.60). These results provide crucial insights for the management of agricultural ecosystems in the Loess Plateau and can support more informed decision-making in regional agriculture.

Maps of cropping patterns in China during 2015–2021

Article Open access 05 August 2022

French crop yield, area and production data for ten staple crops from 1900 to 2018 at county resolution

Article Open access 03 February 2022

Incorporating soil information with machine learning for crop recommendation to improve agricultural output

Article Open access 12 March 2025

Background & Summary

Food security is significantly influenced by the escalating population growth, ongoing social and economic development, and climate change^1,2. The Loess Plateau in western China stands out for its abundant arable land resources and rich light and heat resources³, showcasing vast potential for agricultural production. It holds a prominent position as one of China’s key grain production hubs. Being a typical dryland agricultural region, the Loess Plateau primarily relies on precipitation for agricultural water supply, but limited rainfall and high evaporation rates pose challenges to agricultural progress. Optimizing the cropping structure, enhancing water resource efficiency and boosting grain production capacity in dryland regions are essential for safeguarding national food security⁴. Therefore, accurately mapping the planting pattern of the Loess Plateau is vital in promoting agricultural sustainability and ensuring food security⁵.

In recent years, the accessible and freely available remote sensing data with global coverage and satisfactory spatial resolution have significantly bolstered the data support for identifying crop types on national and global scales^6,7,8. The main data types currently used include moderate resolution imaging spectroradiometer (MODIS) with a resolution of 250 m/8 days, Landsat data with a resolution of 30 m/15 days, Sentinel-2 image with a resolution of 10 m/5 days. China’s adoption of a household contract responsibility system has resulted in highly dispersed arable land, enabling farmers to selectively cultivate crops, leading to a marked diversity in crop types. The spatial resolution of MODIS data is too coarse to identify crop types in highly fragmented areas⁹. Furthermore, due to the brief crop growing season and the 16-day revisit interval frequency of Landsat data, there is an incapacity to provide consistent and stable crop phenology characteristics¹⁰. Sentinel-2 provides higher spatial and temporal resolution as well as more spectral bands, making it the optimal free satellite data source for large-scale crop mapping. For example, You et al.⁸ utilized Sentinel-2 data to produce 10-m crop type maps in Northeast China over 2017–2019, while Li et al.¹¹ used Sentinel-2 data to map annual 10-m maize cropland changes in China during 2017–2021.

Remote sensing-based crop mapping methods can be categorized as phenology-based and machine learning (ML)-based approaches. Phenology-based methods focus on capturing the distinct phenology of various crop species through time-series growth curve characteristics¹². This approach necessitates representative crop sample data and adequate satellite observation data to characterize crop phenology. However, satellite data with high temporal spatial resolution is susceptible to weather conditions, intraclass variability, and interclass similarity of spectral and temporal features across large spatial regions and multiple years, impacting crop classification accuracy. Machine learning methods rely on machine learning models to classify crops using remote sensing data. Popular machine learning classifiers such as Support Vector Machine (SVM)¹³, Random Forest (RF)⁸, and Neural Network (NN)¹⁴ are commonly employed. The advantage of machine learning lies in its ability to consider various recognition features like spectral reflectance, vegetation indices, temporal, and textural features comprehensively, enabling quick and autonomous exploration of crop-specific characteristics. However, machine learning methods are also subject to certain limitations in crop mapping in large areas. Firstly, there is a lack of reliable sample data in large areas, and the quantity and quality of sample data determine the identification accuracy of crop mapping. Secondly, the high dimensionality of feature variables employed in classification increases computational complexity and reduces classifier efficiency. Currently, several studies have integrated phenology with machine learning to address these challenges. Belgiu et al.¹⁵ utilized phenological information-based time-series similarity Dynamic Time Warping to create crop sample data for machine learning algorithms. Zhao et al.¹⁶ and Yin et al.¹⁷ selected optimal features based on phenological information to establish an optimal feature subset, maintaining classification accuracy while reducing computational costs. Therefore, it is promising to map large areas crop type using a combination of phenology and machine learning methods.

Accurate and updated data on planting intensity and crop rotation are essential for advancing sustainable agricultural intensification, mitigating negative agricultural impacts, and ensuring food security. However, there is a lack of reliable updated maps with detailed descriptions on cropping intensity and crops rotations in Loess Plateau. Therefore, the objective of this study is to produce a 10 m resolution crop planting pattern dataset from 2018–2022 on the Loess Plateau. Our contributions encompass four main aspects: (1) Enhancement of the sample dataset based on phenological indices and DTW algorithm; (2) Discrimination of crop planting intensity based on crop phenological curves; (3) Development of independent random forest classifier by considering agricultural climate zones; (4) Construction of optimal feature subset for crop classification. The results of this study can provide important information for the management of agricultural ecosystem in the Loess Plateau.

Methods

Study area

The Loess Plateau is situated in the middle and upper reaches of the Yellow River Basin (https://www.geodata.cn/), covering an area of 6.22 × 10⁵km² and spanning latitude 33°69′ to 41°28′N, and longitude 100°86′ to 114°56′E (as shown in Fig. 1). It is characterized by a transitional climate from semi-humid to semi-arid, representing a typical continental monsoon climate with uneven spatial and temporal distribution of precipitation. Rainfall in the southeastern region exceeds 600 mm/day, while the northwestern arid region receives only 150–250 mm/year¹⁸. The region experiences high interannual variability in rainfall, with dry in winter and spring, and wet in autumn. Water scarcity and soil erosion act as significant constraints to the agricultural development and ecological restoration of the area. The Loess Plateau has historically been among China’s vital agricultural zones¹⁹, with cultivated land covering more than 20% of the total area, predominantly reliant on rain-fed agriculture²⁰. The major crops are wheat, maize, rapeseed, potatoes, and soybean.

The Loess Plateau encompasses 7 provinces, 44 cities and 332 counties (https://www.ngcc.cn/). Due to the influence of rainfall, terrain, and temperature, there are spatial differences in planting phenology, so the agricultural region is used for mapping crop planting patterns^21,22. According to the “2019 National Cultivated Land Quality Grading Report” (https://www.gov.cn/), the study area spans six agricultural regions (as shown in Fig. 1). It includes the Jindong-Yuxi Hilly Mountain Agriculture, Forestry and Pastoral Zone (JAFP), Fenwei Valley Agricultural Zone (FVA), Jin-Shaan-Gan Loess Hills Gullies Pastoral and Forest Zone (LPF), Longzhong-Qingdong Hilly Agriculture and Pastoral Zone (LAP), Agriculture and Pastoral Zone along the Great Wall (GAP), and Meng-Ning-Gan Agriculture and Pastoral Zone (MAP). Generally speaking, the crop planting patterns within each agricultural region exhibit similarities. The phenological characteristics of 8 main crops of the Loess Plateau was obtained from fieldwork and referred to Phenology calendar data³, as shown in Fig. 2.

Overview of the crop classification method

Figure 3 illustrates the workflow of this paper. Firstly, we completed the pre-processing of Sentinel-2 data from 2018 to 2022, and extracted cropland data based on FROM_GLC10 product. Secondly, we collected sample points in field surveys and visual interpretation, and the DTW algorithm is used to expand and enhance the data of crop sample points. Thirdly, we derived the cropping intensity based on crop phenological curve method. Fourthly, we used optimal crop features as inputs to train the crop classifier based on the RF algorithm, and then mapped crop pattern map with 10 m resolution from 2018 to 2022 on the Loess Plateau.

Sentinel-2 images and pre-processing

Sentinel-2 data were obtained from the European Space Agency’s Copernicus Open Access Hub (https://scihub.copernicus.eu/). The Sentinel-2 Earth observation mission currently comprises two satellites, Sentinel-2A and Sentinel-2B, which were launched in 2015 and 2017, respectively. Together, the two satellites provide global coverage of the Earth’s surface every five days. Each satellite is equipped with a multispectral instrument capable of acquiring imagery across 13 spectral bands, spanning the visible, near-infrared, and shortwave infrared regions of the electromagnetic spectrum, with spatial resolutions of up to 10 meters.

Due to the unavailability of Level-2A surface reflectance (SR) products on the Google Earth Engine (GEE) platform for the Loess Plateau region prior to 2019⁸, Level-1C top-of-atmosphere (TOA) reflectance data were used for the year 2018. From 2019 to 2022, we used the more reliable Level-2A SR data as they became accessible on the platform.

In this study, five key spectral bands were selected, including Red-edge 1 (RE1), Red-edge 2 (RE2), Red-edge 3 (RE3), Short-Wave Infrared 1 (SWIR1), and Short-Wave Infrared 2 (SWIR2). The red-edge bands are important indicators of plant pigment concentration and vegetation health status, while SWIR bands are sensitive to water content and other biochemical components in crop leaves. Among these, RE2, SWIR1, and SWIR2 demonstrated significant discriminatory power for distinguishing between maize and soybean (You et al., 2021).

In addition, four complementary vegetation indices were derived from different combinations of these spectral bands: the Normalized Difference Vegetation Index (NDVI)^23,24, Enhanced Vegetation Index (EVI)²⁵, Normalized Difference Senescent Vegetation Index (NDSVI)²⁶, and Green Chlorophyll Vegetation Index (GCVI)²⁷. These indices were selected to better capture the phenological characteristics of diverse crops under complex cropping structures. NDVI and EVI have been widely used to extract crop phenological indicators, with numerous studies identifying NDVI as a dominant vegetation index for vegetation monitoring²⁸. EVI is more effective in minimizing background soil noise, making it particularly suitable for arid and semi-arid regions where vegetation cover is sparse and bare soil is prevalent. It also helps mitigate NDVI’s saturation issues in areas with high biomass. NDSVI is useful for monitoring crop growth and identifying senescence stages such as leaf yellowing and wilting, which supports improved analysis of crop phenological cycles²⁹. GCVI, on the other hand, reflects crop photosynthetic capacity and nutrient status, and is particularly effective in the identification of soybean cultivation³⁰. The functions of NDVI, EVI, NDSVI and GCVI are provided in Eqs. (1)–(4) as follows:

$${\rm{NDVI}}=\frac{{p}_{{nir}}-{p}_{{red}}}{{p}_{{nir}}+{p}_{{red}}}$$

(1)

$${\rm{EVI}}=2.5\frac{{p}_{{nir}}-{p}_{{red}}}{{p}_{{nir}}+6{p}_{{red}}-7.5{p}_{{blue}}+1}$$

(2)

$${\rm{NDSVI}}=\frac{{p}_{{swir}1}-{p}_{{red}}}{{p}_{{swir}1}+{p}_{{red}}}$$

(3)

$${\rm{GCVI}}=\frac{{p}_{{nir}}}{{p}_{{green}}}-1$$

(4)

where, ${p}_{{nir}}$, ${p}_{{red}}$, ${p}_{{blue}}$, ${p}_{{swir}1}$, and ${p}_{{green}}$ are the near-infrared, red, blue, shortwave infrared bands, and green in the Sentinel-2A images, respectively.

Cropland mask

Wang et al.³¹ conducted a comparative analysis of six global high-resolution land use/land cover products (WorldCover10, FROM_GLC10, ESRI GLC10, FROM_GLC30, GLC_FCS30, and GlobeLand30), and found that FROM_GLC10 exhibited relatively high accuracy over China, with an overall accuracy of 65.57%, particularly in the classification of croplands. Similarly, Bie et al.³² evaluated three major global land cover products (FROM_GLC10, ESA World Cover, and ESRI Land Cover) over northwestern China and reported that FROM_GLC10 achieved the highest overall accuracy of 77.83%.

Considering that the Loess Plateau is one of China’s primary agricultural production regions, and that cropland areas did not undergo significant large-scale expansion or reduction between 2018 and 2022, this study focused on the classification of specific crop types—including maize, wheat, soybean, rapeseed, and potatoes—rather than distinguishing between cropland and non-cropland. To exclude non-cropland areas, we employed the FROM_GLC10 dataset, which has been shown to perform well in northwestern China.

The FROM_GLC10 product is a global scale land use dataset with 10 m resolution in 2017, which was obtained from the Department of Earth System Science, Tsinghua University, China (https://data-starcloud.pcl.ac.cn/zh).

Sample data collection and augmentation

The sample data used in this study were derived from both field surveys and augmentation based on the Dynamic Time Warping (DTW) algorithm.

(i)
Samples collected from field surveys. From April to October 2021, extensive field surveys were conducted across the main agricultural areas of the Loess Plateau, as shown in Fig. 1. A UniStrong G138BD handheld GNSS receiver with a horizontal accuracy of 2–5 meters was used to record the geographic coordinates of various crop types. After the field investigation was completed, all samples were visually inspected using high-resolution images from Google Earth to ensure quality and accuracy. The obtained sample points include: winter wheat, spring wheat, summer maize, spring maize, soybean, potatoes, winter rapeseed, spring rapeseed, and other cropland.
(ii)
Sample preparation based on phenology method. Based on the field-validated sample points, standard NDVI phenological curves were extracted for eight major crops: winter wheat, spring wheat, summer maize, spring maize, winter rapeseed, spring rapeseed, soybean, and potatoes. On the Google Earth Engine (GEE) platform, pixel-wise NDVI time series were compared with these reference phenological curves using the Dynamic Time Warping (DTW) algorithm to calculate temporal similarity. The resulting DTW distance rasters were then used to generate spatially distributed sample points through stratified random sampling within agricultural zones, thereby augmenting the training dataset with high-quality, geographically representative samples.

Sample point processing and Dynamic time warping

Before conducting sample point amplification using the DTW algorithm, it is necessary to distinguish the collected sample points as wheat in the study area is divided into winter wheat and spring wheat, maize includes summer maize and spring maize, and rapeseed includes winter rape and spring rape, and the planting and harvest times of these crops are different. Using the GEE platform and based on sentinel-2 remote sensing images, the NDVI of the collected sample points of wheat, maize and rapeseed during their growth periods was calculated, and a 10-day NDVI time series was synthesized. The NDVI range values of different types of crops in the study area could be obtained, and the standard NDVI curves of different types of crops could be drawn. As shown in Fig. 4, based on the NDVI curve, the NDVI characteristics of the samples at different times are utilized to distinguish crops such as winter wheat and spring wheat, forming sample points of winter wheat, spring wheat, summer maize, spring maize, winter rape, and spring rape.

Dynamic Time Warping (DTW) algorithm measures the similarity between two non-linear time series curves by calculating the distance value. Assuming that the time series X = {x1, x2, …, xn} is the curve of an unknown pixel, and the time series Y = {y1, y2, …, ym} is the standard curve of a known maize pixel, and the lengths of the two curves are n and m respectively. The DTW algorithm measures the similarity between the two given time series using the Euclidean distance and can flexibly warp and stretch the time series X to align with the time series Y. Use dbase(i, j) to represent the distance matrix obtained by calculating the Euclidean distance between any two points in sequence X and sequence Y. The calculation is as follows:

$${\rm{DTW}}({\rm{X}},{\rm{Y}})={\rm{Min}}\sqrt{\mathop{\sum }\limits_{{\rm{i}}=1}^{{\rm{n}}}{{\rm{D}}}_{{\rm{i}}}}$$

(4)

Where Di is the i-th element in the regular path, and n is the total number of elements in the regular path. The expanded crop sample distribution map based on the DTW algorithm is shown in Fig. 5.

Cropping intensity mapping method

This study employed the framework proposed by Liu et al.³³ to generate a 10 m resolution crop planting intensity dataset for the Loess Plateau, consisting of two main steps: (1) phenological phase identification and (2) cropping cycle detection.

In the first step, the Savitzky–Golay (SG) filter was applied to smooth NDVI time series, from which the annual NDVI minima and maxima were derived. Transition points were then identified when NDVI values exceeded 50% of the seasonal amplitude³⁴, marking mid-greenup (increasing NDVI) and mid-greendown (decreasing NDVI) stages. The period between mid-greenup and mid-greendown was defined as a growing phase, while the reverse marked a non-growing phase.

For cropping cycle detection, the number of potential cropping cycles (${N}_{{pc}}$) was defined as the minimum between the number of mid-greenup (${N}_{{up}}$) and mid-greendown (${N}_{{down}}$) points:

$${N}_{{pc}}=\min \left\{{N}_{{up}}\right.,\left.{N}_{{down}}\right\}$$

To reduce false detections caused by NDVI anomalies, we followed Sakti and Takeuchi³⁵ in setting a minimum crop growth duration of 48 days. Growth periods shorter than this threshold were classified as false cycles (${N}_{{fc}}$), and the actual cropping intensity (${CI}$) was calculated as:

$${CI}={N}_{{pc}}-{N}_{{fc}}$$

In this study, cropping intensity data for each year (2018–2022) were produced based on this method.

Feature selection

According to the growth period of crops, different crops are distinguished by using spectral index, vegetation index and texture features. First, during the crop growth period, we combined remote sensing images into a 10-day time series. We selected five spectral reflection bands: RE1(B5), RE2(B6), RE3 (B7), SWIR1(B11) and SWIR2(B12). Secondly, the maximum, minimum and mean values of the vegetation indices (NDVI, EVI, NDSVI, GCVI) in each time series were calculated as another classification basis. Finally, we used the gray-level co-occurrence matrix function to add the texture features of the image, including the Angular second moment (ASM), Contrast, Correlation, and Entropy indicators (ENT). We take these three indices together as the characteristic values for crop classification (as shown in Table 1).

Table 1 The characteristic value of crop planting structure identification.

Full size table

Random Forest algorithm

Random forest (RF) algorithm is an assembly method that builds multiple randomly uncorrelated decision trees (DT) to achieve an integrated classifier with better predictive performance³⁶. A decision tree is a data structure that each path from the root (the attribute that contributes the most to the final classification result) to the leaf node (the final classification result) represents a rule for the decision³⁷. RF is robust to noise and outlier data, and it solves the overfitting problem that plagues DT. The GEE platform has provided RF algorithm, which has been used to classify the crop types⁸, detect wetlands changes³⁸, and detect forest fires³⁹.

In this study, the RF algorithm was adopted to training crop classifiers, two parameters were set. (1) The parameter “numberOfTrees” (the number of trees determines the number of binary CART trees used to construct the RF model) was set to 100. (2) minLeafPopulation (the minimum sample size required for leaf nodes) is set to 10, and the remaining parameters are default in GEE.

Data Records

This dataset provides crop distribution maps at 10 meters on the Loess Plateau from 2018 to 2022 (Fig. 6), available in the figshare repository in Geotiff format⁴⁰. The dataset is provided in the ESPG: 4326 (WGS_1984) spatial reference system. The range of the dataset is from 33°69 ‘to 41°28’ N latitude and 100°86 ‘to 114°56’ E longitude. The values of these crops types are: (1: winter wheat; 2: spring wheat; 3: winter rape; 4: spring rape; 5: spring maize; 6: summer maize; 7: soybean; 8: potatoes; 9: single others,; 10: winter wheat-summer maize; 11: winter rape-summer maize;12: winter wheat-others; 13: winter rape-others; 14: others-summer maize; 15: other double-cropping patterns). These data can be visualized and analyzed in ArcGIS, QGIS or other similar software.

Technical Validation

First of all, we compared the on-site data with the real data on the ground. Based on the evaluation of the sample points, we took 30% of the sample points as verification samples to generate a confusion matrix and evaluate the classification accuracy of eight crops. The results show that the overall accuracy from 81.08% to 84.54%, and the Kappa coefficient ranged from 78.35% to 82.26%. Secondly, we compared our crop data with the data from the municipal agricultural census. As can be seen from the Fig. 7, the coefficient of determination (R²) of the main crops of the Loess Plateau, maize and wheat, with the data in the statistical yearbook are relatively high, greater than 0.83. The R² of soybeans, potatoes and are rapeseed greater than 0.61. These findings show that our crop products are consistent with the records in the statistical yearbook. However, the farmland mask data we used (Sources of error of FROM_GLC10 product) has an overall accuracy of 72.76%. Although the overall and Kappa coefficients are higher than other products, there are still errors, which will reduce the accuracy of crop map recognition. In the future, we plan to use all crop sample points to create a set of China’s latest detailed farmland maps based on current farmland data.

Usage Notes

The Loess Plateau is one of China’s important grain production areas. The crop planting area information of the Loess Plateau is of great significance to regional food security and agricultural development. In our study, we used field survey samples combined with DTW algorithm expansion, proposed a large-scale multi-category crop recognition model based on random forests, and provided a 10 m resolution crop distribution map of the Loess Plateau region from 2018 to 2022. This high-precision crop map can be used to predict agricultural productivity, evaluate the water suitability of dryland farmland, etc. Timely and high-resolution crop data is of great significance to the study of regional food balance and food security.

Code availability

The JavaScript code used to generate the crop type and crop type maps can be obtained from the figshare repository⁴⁰.

References

Tariq, A., Yan, J., Gagnon, A. S., Riaz Khan, M. & Mumtaz, F. Mapping of cropland, cropping patterns and crop types by combining optical remote sensing images with decision tree classifier and random forest. Geo-Spat Inf Sci. 26(3), 302–320 (2023).
Article Google Scholar
Jiang, S. et al. Sustainability of water resources for agriculture considering grain production, trade and consumption in China from 2004 to 2013. J Clean Prod. 149, 1210–1218 (2017).
Article Google Scholar
Chen, X. et al. Tracking the spatio-temporal change of the main food crop planting structure in the Yellow River Basin over 2001–2020. Comput Electron Agric. 212, 108–102 (2023).
Article Google Scholar
Liu, Q., Niu, J., Du, T. & Kang, S. A. Full-Scale Optimization of a Crop Spatial Planting Structure and its Associated Effects. Engineering-Prc. 28, 139–152 (2023).
Google Scholar
Qiu, B. et al. Maps of cropping patterns in China during 2015–2021. Sci Data. 9(1), 479 (2022).
Article PubMed PubMed Central Google Scholar
Chen, X. et al. Tracking the spatio-temporal change of the main food crop planting structure in the Yellow River Basin over 2001–2020. Comput Electron Agr 212, 108102 (2023).
Article Google Scholar
Wang, S., Di Tommaso, S., Deines, J. M. & Lobell, D. B. Mapping twenty years of corn and soybean across the US Midwest using the Landsat archive. Sci Data. 7(1), 307 (2020).
Article PubMed PubMed Central Google Scholar
You, N. et al. The 10-m crop type maps in Northeast China during 2017–2019. Sci data. 8(1), 41 (2021).
Article PubMed PubMed Central Google Scholar
Liu, Y., Zhang, Z. & Wang, J. Regional differentiation and comprehensive regionalization scheme of modern agriculture in China. Acta Geogr Sin. 73(2), 203–218 (2018).
Google Scholar
Yang, N. et al. Large-scale crop mapping based on machine learning and parallel computation with grids. Remote Sens-Basel. 11(12), 1500 (2019).
Article ADS Google Scholar
Li, Z. et al. Performance of GEDI data combined with Sentinel-2 images for automatic labelling of wall-to-wall corn mapping. Int J Appl Earth Obs. 127, 103643 (2024).
Google Scholar
Zhang, C., Zhang, H. & Tian, S. Phenology-assisted supervised paddy rice mapping with the Landsat imagery on Google Earth Engine: Experiments in Heilongjiang Province of China from 1990 to 2020. Comput Electron Agr. 212, 108105 (2023).
Article Google Scholar
Ni, R. et al. An enhanced pixel-based phenological feature for accurate paddy rice mapping with Sentinel-2 imagery in Google Earth Engine. ISPRS J Photogramm Remote Sens. 178, 282–296 (2021).
Article ADS Google Scholar
Wei, L. et al. Deep Convolutional Neural Network for Rice Density Prescription Map at Ripening Stage Using Unmanned Aerial Vehicle-Based Remotely Sensed Images. Remote Sens. 14(1), 46 (2021).
Article ADS Google Scholar
Belgiu, M., Bijker, W., Csillik, O. & Stein, A. Phenology-based sample generation for supervised crop type classification. Int. J. Appl. Earth Obs. Geoinf. 95, 102264 (2021).
Google Scholar
Zhao, L. et al. In-season crop type identification using optimal feature knowledge graph. ISPRS J Photogramm Remote Sens. 194, 250–266 (2022).
Article ADS Google Scholar
Yin, L., You, N., Zhang, G., Huang, J. & Dong, J. Optimizing Feature Selection of Individual Crop Types for Improved Crop Mapping. Remote Sens. 12(1), 162 (2020).
Article ADS Google Scholar
Gao, X. et al. The spatial and temporal evolution of the actual evapotranspiration based on the remote sensing method in the Loess Plateau. Sci. Total Environ 708, 135111 (2020).
Article CAS PubMed Google Scholar
Fu, B. Soil erosion and its control in the Loess Plateau of China. Soil Use Manage. 5(2), 76–82 (1989).
Article Google Scholar
Jin, N. et al. Effects of water stress on water use efficiency of irrigated and rainfed wheat in the Loess Plateau, China. Sci Total Environ. 642, 1–11 (2018).
Article ADS CAS PubMed Google Scholar
Li, Z. et al. Performance of GEDI data combined with Sentinel-2 images for automatic labelling of wall-to-wall corn mapping. International journal of applied earth observation and geoinformation 127, 103643 (2024).
Article Google Scholar
Liu, J. et al. Spatiotemporal characteristics, patterns, and causes of land-use changes in China since the late 1980s. Journal of geographical sciences 24, 195–210 (2014).
Article ADS Google Scholar
Tucker, C. J. Red and photographic infrared linear combinations for monitoring vegetation. Remote sensing of environment 8, 127–150 (1979).
Article ADS Google Scholar
Watanabe, D. S. Z., Barboza-Pinzon, E. G., Hesp, P. A., Schossler, V. & Salgado, E. T. Evolution of coastal transgressive aeolian sand sheets over 75 years (1948-2023) at Concheiros Barrier - Southern Brazilian coast. Geomorphology 470, 109520 (2025).
Article Google Scholar
Huete, A. R., Liu, H. Q. & Batchily, K. van Leeuwen, W. A comparison of vegetation indices over a global set of TM images for EOS-MODIS. Remote sensing of environment 59, 440–451 (1997).
Article ADS Google Scholar
Zheng, G. et al. The Potential of Multispectral Vegetation Indices Feature Space for Quantitatively Estimating the Photosynthetic, Non-Photosynthetic Vegetation and Bare Soil Fractions in Northern China. Photogrammetric engineering and remote sensing 85, 65–76 (2019).
Article Google Scholar
Zhi, F. et al. Rapid and Automated Mapping of Crop Type in Jilin Province Using Historical Crop Labels and the Google Earth Engine. Remote sensing 14, 4028 (2022).
Article ADS Google Scholar
Radočaj, D., Šiljeg, A., Marinović, R. & Jurišić, M. State of Major Vegetation Indices in Precision Agriculture Studies Indexed in Web of Science: A Review. Agriculture 13, 707 (2023).
Article Google Scholar
Huang, L. et al. Rapid mapping of soybean planting areas under complex crop structures: A modified GWCCI approach. Computers and electronics In agriculture 235, 110326 (2025).
Article Google Scholar
Huang, X., Vrieling, A., Dou, Y., Belgiu, M. & Nelson, A. A robust method for mapping soybean by phenological aligning of Sentinel-2 time series. Isprs journal of photogrammetry and remote sensing 218, 1–18 (2024).
Article Google Scholar
Wang, Y. et al. Evaluation of six global high-resolution global land cover products over China. International Journal of Digital Earth 17(1), 2301673 (2024).
Article ADS Google Scholar
Bie, Q., Luo, J. & Lu, G. Accuracy performance of three 10-m Global Land Cover products around 2020 in an Arid Region of Northwestern China. IEEE Access 11, 133215–133228 (2023).
Article Google Scholar
Liu, C. et al. A new framework to map fine resolution cropping intensity across the globe: Algorithm, validation, and implication. Remote Sens Environ. 251, 112095 (2020).
Article Google Scholar
Bolton, D. K. et al. Continental-scale land surface phenology from harmonized Landsat 8 and Sentinel-2 imagery. Remote Sens Environ. 240, 111685 (2020).
Article Google Scholar
Sakti, A. D., Takeuchi, W. Estimation of global crop calendar and intensity using the MODIS NDVI Time Series from 2001 to 2015. In: International Symposium on Remote Sensing (2018).
Razavi-Termeh, S. V., Sadeghi-Niaraki, A. & Choi, S. A new approach based on biology-inspired metaheuristic algorithms in combination with random forest to enhance the flood susceptibility mapping. J Environ Manage. 345, 118790 (2023).
Article PubMed Google Scholar
Zhu, L., Qiu, D., Ergu, D., Ying, C. & Liu, K. A study on predicting loan default based on the random forest algorithm. Procedia Comput Sci. 162, 503–513 (2018).
Article Google Scholar
Zhao, F., Feng, S., Xie, F., Zhu, S. & Zhang, S. Extraction of long time series wetland information based on Google Earth Engine and random forest algorithm for a plateau lake basin – A case study of Dianchi Lake, Yunnan Province, China. Ecol. Indic. 146, 109813 (2023).
Article Google Scholar
Bar, S., Parida, B. R. & Pandey, A. C. Landsat-8 and Sentinel-2 based Forest fire burn area mapping using machine learning algorithms on GEE cloud platform over Uttarakhand, Western Himalaya. Remote Sens Appl. 18, 100324 (2020).
Google Scholar
Zhao, X. et al. High-resolution (10 m) dataset of multi-crop planting structure on the Loess Plateau during 2018-2022. figshare. https://doi.org/10.6084/m9.figshare.27643137 (2025).
Article Google Scholar

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (42125705, 42377346), and the National Key Research and Development Program of China (2021YFD1900700).

Author information

Authors and Affiliations

Institute of Soil and Water Conservation, Northwest A&F University, Yangling, Shaanxi, 712100, China
Xining Zhao, Xiaodong Gao, Changjian Li & Xuerui Gao
College of Water Resources and Architectural Engineering, Northwest A&F University, Yangling, Shaanxi, 712100, China
Jichao Wang, Yelu Ding & Hongwei Huang
Key Laboratory of Agricultural Soil and Water Engineering in Arid and Semiarid Areas, Ministry of Education, Northwest A&F University, Yangling, Shaanxi, 712100, China
Jichao Wang, Yelu Ding & Hongwei Huang

Authors

Xining Zhao
View author publications
Search author on:PubMed Google Scholar
Jichao Wang
View author publications
Search author on:PubMed Google Scholar
Yelu Ding
View author publications
Search author on:PubMed Google Scholar
Xiaodong Gao
View author publications
Search author on:PubMed Google Scholar
Changjian Li
View author publications
Search author on:PubMed Google Scholar
Hongwei Huang
View author publications
Search author on:PubMed Google Scholar
Xuerui Gao
View author publications
Search author on:PubMed Google Scholar

Contributions

X.Z. and J.W. designed the study and the methodology, Y.D. wrote the code and generated the data, X,G., C.L., H.H. and X.G. checked samples and evaluate the resulting maps. All authors analyzed the data, wrote, and edited the manuscript.

Corresponding author

Correspondence to Xuerui Gao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhao, X., Wang, J., Ding, Y. et al. High-resolution (10 m) dataset of multi-crop planting structure on the Loess Plateau during 2018–2022. Sci Data 12, 1190 (2025). https://doi.org/10.1038/s41597-025-05529-0

Download citation

Received: 21 November 2024
Accepted: 03 July 2025
Published: 10 July 2025
DOI: https://doi.org/10.1038/s41597-025-05529-0