Abstract
The black soil region of China is one of the four major black soil regions in the world and is an important grain production base. High-accuracy soil moisture data are therefore crucial for understanding the mechanisms of drought and waterlogging on crop yields. Here, we present a 16 years (2008–2023) soil moisture dataset (DMSN) with both high spatiotemporal resolution and multi-layer spatiotemporal consistency (2 km, daily, 0–10, 10–20, 20–50, 50–100 cm),using Community Land Model 3.5 (CLM3.5). The DMSM showed excellent performance, with validation at seven field monitoring sites in the black soil region, the correlation coefficient (R) was generally greater than 0.7, the RMSE varied between 0.035 and 0.07, the MAE ranged from 0.03 to 0.06, and the bias remained between −0.02 and 0.02. The DMSM effectively captured daily, inter-annual, and seasonal soil moisture variations, providing important insights into soil moisture dynamics in this critical black soil region. These results help in understanding the potential drivers of drought and waterlogging and their impact on crop yields.
Similar content being viewed by others
Background & Summary
Multi-layer soil moisture serves as a vital water source for root water uptake and crop transpiration, playing a crucial role in crop growth and yield formation1,2,3,4,5. High-resolution and high-accuracy multi-layer soil moisture data provide essential support for predicting agricultural droughts and waterlogging, as well as for guiding farmland management6,7,8. Northeast China is a major agricultural region for the cultivation of soybeans, maize, and rice9. Notably, the black soil region, one of the world’s four major black soil areas10, is a key commercial grain production base. However, in recent years, climate change has markedly altered the spatial and temporal distribution of precipitation across the region, with a notable rise in the frequency of extreme weather events. These changes have amplified the temporal variability and spatial heterogeneity of soil moisture, posing significant challenges to crop stability and agricultural productivity. Concurrently, intensified agricultural activities have heightened the sector’s reliance on water resources, thereby increasing the region’s vulnerability to both drought and waterlogging events11. To effectively confront these challenges, there is an urgent need to harness high-resolution soil moisture data to strengthen agricultural risk early warning systems and resource management. Multi-layered soil moisture datasets have broad applicability in real-world contexts, including drought monitoring, water stress detection, irrigation scheduling, and crop planning. Accurately characterizing the spatiotemporal dynamics of soil moisture can support timely adjustments in irrigation and sowing practices by producers, while also providing a robust scientific foundation for regional food security assessments and the formulation of agricultural policy. Understanding soil moisture dynamics in this region is therefore crucial for ensuring stable food production and optimizing agricultural resource management.
In recent years, the development of soil moisture datasets has garnered significant attention. Theoretical methods for soil moisture retrieval using visible, infrared, and microwave bands have become well established12,13,14. Common approaches include the moisture deficit index, soil thermal inertia, and the temperature-vegetation drought index15,16,17,18. Microwave-based retrieval methods typically derive soil moisture by establishing relationships between backscatter coefficients and soil moisture content13,19,20. However, while visible and infrared-based retrieval methods offer high spatial resolution, they are severely limited by cloud cover, which hinders soil moisture estimation under cloudy conditions. Conversely, microwave-based retrieval can penetrate cloud cover but suffers from low spatial resolution, typically in the range of tens of kilometers21,22,23. To overcome these limitations, researchers have increasingly adopted land surface models and data assimilation techniques to generate soil moisture products at both regional and global scales24,25,26. Notable examples include the ERA5-Land reanalysis dataset and the Global Land Data Assimilation System (GLDAS). Additionally, several widely used global soil moisture datasets, such as SMOS, SMAP, and the latest ESA CCI global soil moisture dataset, provide valuable resources for large-scale soil moisture monitoring7,27,28,29. Despite advancements in integrating multi-source remote sensing data for soil moisture estimation, existing datasets still face considerable limitations. Most struggle to simultaneously achieve long-term continuity, high spatial and temporal resolution, and multi-layer soil moisture information27,30,31. Furthermore, due to data constraints, our understanding of multi-layer soil moisture dynamics in certain regions remains insufficient, introducing uncertainties and challenges in soil moisture prediction. Therefore, further research is essential to develop large-scale soil moisture monitoring technologies that can provide more comprehensive, continuous, and high-resolution multi-layer soil moisture data.
The Community Land Model (CLM) allows direct integration of remote sensing observations of land surface conditions to generate accurate and spatiotemporally consistent land surface state fields. By employing ensemble-based methods, CLM effectively reduces uncertainties in land surface simulations, making it possible to incorporate high-resolution satellite-derived soil moisture data to update soil moisture estimates within the model26. In this study, we enhanced the CLM3.5 (Community Land Model version 3.5) framework by replacing its default global datasets with high-resolution regional datasets tailored for Northeast China. Specifically, the original FAO-based soil texture dataset was replaced with data from the Second National Soil Survey (SNSS), and the MODIS-derived land cover dataset was substituted with the China Regional Land Cover Dataset (CLCV). These modifications provide a more accurate representation of regional soil and land cover characteristics.
To further optimize soil moisture simulation, we adopted the scheme proposed by Song et al. which employs atmospheric forcing data from the China Meteorological Administration Land Data Assimilation System (CMA-LDAS) to drive the improved CLM3.5 model. This scheme was applied to simulate multi-layer soil moisture in the Inner Mongolia Autonomous Region of China, yielding varying degrees of improvement in soil moisture simulation. Specifically, the simulated soil moisture under this scheme best captures the spatiotemporal variation characteristics of observed soil moisture in the Inner Mongolia region, leading to a significant enhancement in simulation performance. Building upon this scheme, we developed a high-resolution, multi-layer daily soil moisture dataset spanning 2008–2023, with a spatial resolution of 2 km and covering four depth layers (0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm). This dataset not only significantly improves the accuracy of soil moisture estimation but also enhances the spatiotemporal resolution, thereby more precisely capturing the dynamic changes in multi-layer soil moisture.
The black soil region in northeast China is one of the major black soil belts in the world (Fig. 1), distinguished by a variety of soil types, including black soil, meadow soil, and chestnut soil. These soils are rich in organic matter and essential nutrients, making the region highly suitable for agricultural production. The primary crops cultivated in this area include rice, maize, soybeans, and sorghum. This region encompasses Heilongjiang province, Jilin province, Liaoning province, and parts of eastern Inner Mongolia, covering a total area of approximately 1.24 million square kilometers. Of this, the area of typical black soil cultivated land spans 18.53 million hectares. The Northeast Black Soil Region serves as China’s largest grain production base, contributing around one-quarter of the nation’s total grain output. Between 2020 and 2022, the implementation of conservation tillage in the black soil region expanded from 4.6 million hectares to 8.3 million hectares. Despite these efforts, soil degradation remains a persistent issue, with varying degrees of severity across the region.
Methods
CLM3.5 introduce
The Community Land Model (CLM) is one of the most advanced and well-developed land surface models globally. It serves as the land surface component of the Community Climate System Model (CCSM) and has been developed through collaboration among various research institutions, building on the foundations of the Common Land Model (CoLM) and the NCAR Land Surface Model (NCAR LSM), among others. CLM incorporates insights from the Biosphere-Atmosphere Transfer Scheme (BATS)32, the IAP94land surface model developed at the Institute of Atmospheric Physics, Chinese Academy of Sciences33, and the NCAR LSM, evolving into a third-generation land surface process model. Significant improvements have been made to the model’s land surface parameters and hydrological processes, and updates have been implemented to integrate MODIS-based surface datasets as well as enhance the canopy interception scheme34. These advancements culminated in the development of CLM3.5 available at http://www.cgd.ucar.edu/tss/clm/distribution/clm3.5/index.html)35. Numerous offline simulations have demonstrated that CLM3.5 significantly improves the distribution of global evapotranspiration compared to observed runoff, soil moisture, and total water storage36. The model simulates wetter soil moisture conditions, lower vegetation water stress, enhanced transpiration, and increased photosynthesis. Additionally, improvements were observed in the interannual variability of total land water storage, as well as the phase and amplitude of runoff interannual variations36. In this study, the CLM3.5 was employed to simulate multi-layer soil water content, using updated land use and soil texture data. The simulations cover four distinct soil depth layers: 0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm, as Fig. 2.
The driving data for CLM3.5 encompass fundamental datasets such as soil texture, DEM37, land cover (https://doi.org/10.5067/MODIS/MCD12Q1.061), and leaf area index (LAI) (https://doi.org/10.5067/MODIS/MOD15A2H.061) derived from MODIS product data, alongside meteorological variables including precipitation and temperature. Daily precipitation and temperature data are derived from the ERA5-Land reanalysis dataset provided by the European Centre for Medium-Range Weather Forecasts (ECMWF; available at https://cds.climate.copernicus.eu/). These ERA5-derived data have been corrected using observational datasets from the National Qinghai-Tibet Plateau Science Data Center, which cover China with a 1 km spatial resolution and monthly temporal resolution spanning 1961–2014 (accessible at https://data.tpdc.ac.cn/home). In the data processing workflow, factors such as longitude, latitude, and altitude are explicitly incorporated. Compared to analogous products, this dataset exhibits higher resolution and lower uncertainty38.
Delta downscaling method
The Delta downscaling method is a commonly used statistical downscaling technique that has been widely applied in the downscaling process of General Circulation Models (GCMs)39. In this study, the Delta method was applied to temperature and precipitation data to enhance the spatial accuracy of climate driving data. Specifically, the original low-resolution data were first interpolated onto the target high-resolution grid. Based on monthly climatological means from a historical reference period, systematic biases between model observations and history were then quantified at each grid point, and spatial correction factors were computed. These factors were then applied to bias-correct the downscaled climate data. This downscaling approach was used to preprocess the driving data for the CLM3.5 land surface model, providing high-resolution input data that supports accurate simulation and validation of soil moisture. While the Delta downscaling method can generate input data with higher resolution, two key considerations led us to finalize a 2 km resolution for the soil moisture dataset: the CLM3.5 model requires extensive inputs, including meteorological variables and underlying surface parameters, yet some of these datasets are unavailable at resolutions finer than 2 km, making it challenging to uniformly upscale all input data to a resolution higher than this threshold, and the primary objective of this dataset is to support future analyses of drought and flood impacts on agriculture, for which a 2 km resolution is sufficient to meet the requirements; taking these factors into account, we opted to produce the soil moisture dataset at a 2 km resolution.
Where Pobs represents precipitation data generated from observations, Phis represents historical precipitation, Delta(P) represents precipitation difference coefficient, Prcp represents uncorrected precipitation, Prcp is the bias-corrected precipitation, Tobs represents air temperature data generated from observations, This represents historical air temperature, Delta(T) represents temperature difference, Trcp represents uncorrected temperature, and Trcp is bias-corrected temperature.
Error analysis method
The DMSM (Data Management and Simulation Model) aims to quantify system and random errors by comparing simulated values with field observation data. To comprehensively assess the performance of DMSM, four metrics were used, including the correlation coefficient (R), bias, mean absolute error (MAE), and root mean square error (RMSE)7.
Data Records
This dataset includes soil moisture data across four vertical layers within the 0–100 cm depth range. Each layer is archived as a compressed file in .7Z format, which requires decompression software for extraction, and the compressed file for each layer is named “DMSM 1.7z”. The data files are freely available at figshare40. The dataset is publicly accessible via the https://doi.org/10.6084/m9.figshare.2852308740. All files are stored in GeoTiff format, projected under the World Geodetic System 1984 (WGS84) coordinate system, with a spatial resolution of 0.019. The file naming convention follows the example “NELDAS_V1_Land_2km._YYYYMMDD_10.tif”, where “YYYYMMDD” denotes the date of data acquisition. The suffixes in the filenames correspond to specific depth layers: “_10” for the first layer (0–10 cm), “_20” for the second layer (0–20 cm), “_50” for the third layer (20–50 cm), and “_100” for the fourth layer (50–100 cm). Files are organized into and stored within respective subfolders based on their corresponding layers.
Technical Validation
We conducted accuracy validation of the newly developed DMSM dataset at different spatial and temporal scales, which consisted of four steps. First, accuracy analysis was performed using the R, RMSE, MAE, and Bias metrics, comparing the DMSM dataset with data from 7 field observation stations. Second, a systematic comparison was made between ERA5 from the European Centre For Medium-Range Weather Forecasts, ECWMF (https://cds.climate.cop-ernicus.eu), and GL-DAS from the Land Data Assimilation System (https://ldas.gsfc.nasa.gov/gldas/GLDASpublicati-ons.php) soil moisture data and DMSM, as well as precipitation data, to verify the applicability and accuracy of CLM3.5 under temporal variation trends. Third, at the regional scale, we analyzed the correlation between DMSM data and both ERA5 and GLDAS on a gridded scale to assess the spatial accuracy of soil moisture representation between DMS-M and existing soil moisture data. Fourth, we analyzed the spatial variation trends of D-MSM in the northeastern black soil region to understand its spatiotemporal variability at this regional scale.
Validation using field observation station data
In this study, soil moisture data from 8 in-situ measurement sites within the Northeast Black Soil Region were obtained for the period 2008–2023 (Fig. 1). The data were collected at 10-day intervals from May to September throughout this period. A comprehensive verification analysis of the DMSM was performed (Fig. 3). Overall, the coefficient of determination (R²) between DMSM and the in-situ data was 0.65, indicating that DMSM generally exhibited good consistency with the measured soil moisture data. Additionally, we collected data from seven field observation sites, encompassing soil layers of 0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm. For each of these seven sites, precision verification and comparative analysis of soil moisture across different depths were conducted.
To assess the accuracy of the CLM3.5 generated soil moisture data at depths of 0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm, the observed soil moisture data from 7 field observation sites were compared with the corresponding depth simulated soil moisture data from CLM3.5. Figures 4 and 5 present the R², Bias, MAE, and RMSE. Among the seven observation sites, three sites (Barigasitai, Dashizhai, Julihen) provided data for all four layers (0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm), while four sites (Alideer, Eti, Guiliuhe, Suolun) only provided data for three layers (0–10 cm, 10–20 cm, and 20–50 cm). All observation stations are located in relatively homogeneous areas within their respective 2 km×2 km grids. Specifically, the surrounding environment of each station is characterized by flat terrain with no significant topographic relief, dominant land cover types cropland consistent with the main surface type of the grid (Fig. 1). Accuracy validation was performed separately for the four-layer and three-layer data from different site locations. The results showed that, in general, DMSM exhibited the lowest R value of 0.7, RMSE values between 0.035 and 0.07, MAE values between 0.03 and 0.06, and BIAS values between −0.02 and 0.02(Fig. 6). Furthermore, we analyzed the correlation between GLDAS-SM and ERA5-SM with the observed soil moisture data, and found that the R values were all lower than the correlation coefficient R of DMSM and with the observed soil moisture data. Based on the four metrics, the soil moisture simulations for 0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm in the northeastern black soil region generated by CLM3.5 are highly reliable in terms of accuracy.
Temporal variation of the DMSM
To further analyze whether DMSM can effectively reflect the temporal variation of soil moisture, the temporal performance of DMSM was evaluated. We selected the growing season (May to September) from 2021 to 2022 and compared DMSM with ERA5 and GLDAS for the seven observation sites to analyze their temporal consistency. Since the 0–100 cm soil moisture data from ERA-SM and GLDAS-SM are divided into three layers—ERA-SM: Layer 1 (0–7 cm), Layer 2 (7–28 cm), and Layer 3 (28–100 cm); GLDAS-SM: Layer 1 (0–10 cm), Layer 2 (10–40 cm), and Layer 3 (40–100 cm)—whereas DMSM consists of four layers—Layer 1 (0–10 cm), Layer 2 (10–20 cm), Layer 3 (20–50 cm), and Layer 4 (50–100 cm), a comparative analysis was conducted to align soil moisture values across different depths. To facilitate this comparison, the average soil moisture from the 20–50 cm and 50–100 cm layers of DMSM was calculated to match the three ERA-SM layers, while the average soil moisture from the 10–20 cm and 20–50 cm layers of DMSM was computed to align with the GLDAS-SM depths. Additionally, a comparison was made with the corresponding precipitation data. Figure 7 shows that DMSM, ERA-SM, and GLDAS-SM exhibit good consistency at the daily time scale across all sites. DMSM effectively captures both the daily and seasonal variations of soil moisture and is able to accurately reflect the temporal changes of soil moisture and precipitation events. Furthermore, the relatively higher DMSM values are mainly attributed to major precipitation events.
To further explore the ability of the CLM3.5 simulated soil moisture data (DMSM) to represent spatial soil moisture variation trends, we created spatial distribution maps of the correlation coefficients between DMSM, ERA5 soil moisture data (ERA-SM), and GLDAS soil moisture data (GLDAS-SM) at the pixel scale, as Fig. 8. Since DMSM is divided into 0–10 cm, 10–20 cm, 20–50 cm, and 50–100 cm layers, ERA-SM uses 0–7 cm, 7–28 cm, and 28–100 cm layers, while GLDAS uses 0–10 cm, 10–40 cm, and 40–100 cm layers. We further analyzed DMSM with ERA-SM and GLDAS-SM for different depths and obtained corresponding three-layer soil moisture data for DMSM with ERA-SM and DMSM with GLDAS-SM. We then conducted a correlation analysis for the three layers of soil moisture data to demonstrate the consistency and differences between different data sources at the spatial scale. Previous studies have demonstrated that bilinear interpolation determines the new value of a pixel by calculating the weighted average of the four nearest input pixel centers, with weights assigned based on their respective distances. This method yields higher image quality after scaling, avoids discontinuities, and is thus well-suited for processing continuous datasets without distinct boundaries41. Therefore, in the context of grid-scale correlation analysis, we applied bilinear interpolation to the higher-resolution DMSM data to align its spatial resolution with that of the ERA-SM and GLDAS-SM datasets. It can be seen that in most areas of the northeastern black soil region, the correlation coefficient R is greater than 0, and in most areas, R exceeds 0.7, indicating that DMSM is highly consistent with ERA-SM and GLDAS-SM in representing the spatial variation of soil moisture (Fig. 7).
The trend of soil moisture
The trend of soil moisture in the northeastern black soil region from 2008 to 2023 was analyzed using the least squares method, as shown in Fig. 9. The pixel values in the figure range from −0.01 to 0.01. Pixels with values greater than 0 indicate an increasing trend, whereas those with values less than 0 represent a decreasing trend. A larger absolute value corresponds to a more pronounced changing trend. The trend of DMSM at the 0–10 cm depth indicates that most areas in the northeastern black soil region show an increasing trend in soil moisture (Fig. 8), with the most noticeable increase occurring in the central and eastern parts of the study area, suggesting a trend towards wetter conditions. In comparison, the increasing trend is slightly weaker for the other three soil moisture layers (10–20 cm, 20–50 cm, and 50–100 cm). The areas with a decreasing trend in soil moisture are mainly located in the southwestern part of the eastern black soil region, which is mainly the sandy soil area.
Data availability
The driving data for CLM3.5 encompass fundamental datasets such as soil texture available from the National Qinghai-Tibet Plateau Scientific Data Center (https://data.tpdc.ac.cn/zh-hans/data/611f7d50-b419-4d14-b4dd-4a944b141175), DEM, land cover (https://doi.org/10.5067/MODIS/MCD12Q1.061), and leaf area index (LAI) (https://doi.org/10.5067/MODIS/MOD15A2H.061) derived from MODIS product data, available from GEE (https://code.earthengine.google.com/), alongside meteorological variables including precipitation and temperature. Daily precipitation and temperature data are derived from the ERA5-Land reanalysis dataset provided by the European Centre for Medium-Range Weather Forecasts (ECMWF; available at https://cds.climate.copernicus.eu/). The soil moisture dataset published in this study is available from figshare40. This dataset includes soil moisture data across four vertical layers within the 0–100 cm depth range. Each layer is archived as a compressed file in .7Z format, which requires decompression software for extraction, and the compressed file for each layer is named “DMSM 1.7z”. The dataset is publicly accessible via the https://doi.org/10.6084/m9.figshare.2852308740.
Code availability
No custom code was generated for this work.
References
Sahaar, S. A., Niemann, J. D. & Elhaddad, A. Using regional characteristics to improve uncalibrated estimation of rootzone soil moisture from optical/thermal remote-sensing. Remote Sens. Environ. 273, 112982, https://doi.org/10.1016/j.rse.2022.112982 (2022).
Li, X. Z. et al. Estimation of root zone soil moisture at point scale based on soil water measurements from cosmic-ray neutron sensing in a karst catchment. Agric. Water Manag. 289, 108511, https://doi.org/10.1016/j.agwat.2023.108511 (2023).
Zhu, Q., Wang, Y. S., Luo, Y. L. Improvement of multi‐layer soil moisture prediction using support vector machines and ensemble Kalman filter coupled with remote sensing soil moisture datasets over an agriculture dominant basin in China. Hydrol. Process. 35, https://doi.org/10.1002/hyp.14154 (2021).
Babaeian, E. et al. Estimation of root zone soil moisture from ground and remotely sensed soil information with multisensor data fusion and automated machine learning. Remote Sens. Environ. 260, 112434, https://doi.org/10.1016/j.rse.2021.112434 (2021).
Karthikeyan, L. & Mishra, A. K. Multi-layer high-resolution soil moisture estimation using machine learning over the United States. Remote Sens. Environ. 266, 112706, https://doi.org/10.1016/j.rse.2021.112706 (2021).
Meng, X. J. et al. A fine-resolution soil moisture dataset for China in 2002–2018. Earth Syst. Sci. Data. 13, 3239–3261, https://doi.org/10.5194/essd-13-3239-2021 (2021).
Abowarda, A. S. et al. Generating surface soil moisture at 30 m spatial resolution using both data fusion and machine learning toward better water resources management at the field scale. Remote Sens. Environ. 255, 112301, https://doi.org/10.1016/j.rse.2021.112301 (2021).
Huang, S. Z. et al. A Novel Fusion Method for Generating Surface Soil Moisture Data with High Accuracy, High Spatial Resolution, and High Spatio‐Temporal Continuity. Water Resour. Res. 58, https://doi.org/10.1029/2021WR030827 (2022).
Qi, W., Feng, L., Liu, J., Yang, H. Snow as an Important Natural Reservoir for Runoff and Soil Moisture in Northeast China. Journal of Geophysical Research: Atmospheres. 125, https://doi.org/10.1029/2020JD033086 (2020).
Xu, Z. Y. et al. Evaluating the Capability of Satellite Hyperspectral Imager, the ZY1-02D, for Topsoil Nitrogen Content Estimation and Mapping of Farmlands in Black Soil Area, China. Remote Sens. 14, 1008, https://doi.org/10.3390/rs14041008 (2022).
Liang, S. et al. Effects of Winter Snow Cover on Spring Soil Moisture Based on Remote Sensing Data Product over Farmland in Northeast China. Remote Sens. 12, 2716, https://doi.org/10.3390/rs12172716 (2020).
Arias, M. et al. Evaluation of soil moisture estimation techniques based on Sentinel-1 observations over wheat fields. Agric. Water Manag. 287, 108422, https://doi.org/10.1016/j.agwat.2023.108422 (2023).
Ebrahimi-Khusfi, M. et al. Comparison of soil moisture retrieval algorithms based on the synergy between SMAP and SMOS-IC. Int. J. Appl. Earth Obs. Geoinf. 67, 148–160, https://doi.org/10.1016/j.jag.2017.12.005 (2018).
Wang, Y. W. et al. Surface Soil Moisture Retrieval Using Optical/Thermal Infrared Remote Sensing Data. IEEE Trans. Geosci. Remote Sensing. 56, 5433–5442, https://doi.org/10.1109/TGRS.2018.2817370 (2018).
Bhuiyan, C., Singh, R. P. & Kogan, F. N. Monitoring drought dynamics in the Aravalli region (India) using different indices based on ground and remote sensing data. Int. J. Appl. Earth Obs. Geoinf. 8, 289–302, https://doi.org/10.1016/j.jag.2006.03.002 (2006).
Sandholt, I., Rasmussen, K. & Andersen, J. A simple interpretation of the surface temperature/vegetation index space for assessment of surface moisture status. Remote Sens. Environ. 79, 213–224, https://doi.org/10.1016/S0034-4257(01)00274-7 (2002).
Wei, W. et al. Temperature Vegetation Precipitation Dryness Index (TVPDI)-based dryness-wetness monitoring in China. Remote Sens. Environ. 248, 111957, https://doi.org/10.1016/j.rse.2020.111957 (2020).
Zhou, L. et al. Comparison of remotely sensed and meteorological data-derived drought indices in mid-eastern China. Int. J. Remote Sens. 33, 1755–1779, https://doi.org/10.1080/01431161.2011.600349 (2012).
Li, X. J. et al. The first global soil moisture and vegetation optical depth product retrieved from fused SMOS and SMAP L-band observations. Remote Sens. Environ. 282, 113272, https://doi.org/10.1016/j.rse.2022.113272 (2022).
Rao, P. Z. et al. Daily soil moisture mapping at 1 km resolution based on SMAP data for desertification areas in northern China. Earth Syst. Sci. Data. 14, 3053–3073, https://doi.org/10.5194/essd-14-3053-2022 (2022).
Fan, K. K. et al. Global soil moisture drought identification and responses to natural and anthropogenic forcings. J. Hydrol. 610, 127993, https://doi.org/10.1016/j.jhydrol.2022.127993 (2022).
Xing, Z. et al. Seasonal-scale intercomparison of SMAP and fused SMOS-SMAP soil moisture products. Front. Remote Sens. 5, 1440891, https://doi.org/10.3389/frsen.2024.1440891 (2024).
Pignotti, G. et al. SMAP soil moisture data assimilation impacts on water quality and crop yield predictions in watershed modeling. J. Hydrol. 617, 129122, https://doi.org/10.1016/j.jhydrol.2023.129122 (2023).
Dennis, E. J. & Berbery, E. H. The Effects of Soil Representation in WRF-CLM on the Atmospheric Moisture Budget. J. Hydrometeorol. 23, 681–696, https://doi.org/10.1175/JHM-D-21-0101.1 (2022).
Yan, D. H. et al. A data set of global river networks and corresponding water resources zones divisions v2. Sci. Data. 9, https://doi.org/10.1038/s41597-019-0243-y (2022).
Naz, B. S., Kollet, S., Franssen, H. H., Montzka, C., Kurtz, W. A 3 km spatially and temporally consistent European daily soil moisture reanalysis from 2000 to 2015. Sci. Data. 7, https://doi.org/10.1038/s41597-020-0450-6 (2020).
Chen, Y. Z., Feng, X. M. & Fu, B. J. An improved global remote-sensing-based surface soil moisture (RSSSM) dataset covering 2003–2018. Earth Syst. Sci. Data. 13, 1–31, https://doi.org/10.5194/essd-13-1-2021 (2021).
Song, P. L. et al. A 1 km daily surface soil moisture dataset of enhanced coverage under all-weather conditions over China in 2003–2019. Earth Syst. Sci. Data. 14, 2613–2637, https://doi.org/10.5194/essd-14-2613-2022 (2022).
Cheng, F. et al. ChinaCropSM1 km: a fine 1 km daily soil moisture dataset for dryland wheat and maize across China during 1993–2018. Earth Syst. Sci. Data. 15, 395–409, https://doi.org/10.5194/essd-15-395-2023 (2023).
Mhawej, M. & Abunnasr, Y. Daily Ten-ST-GEE: an open access and fully automated 10-m LST downscaling system. Comput. Geosci. 168, 105220, https://doi.org/10.1016/j.cageo.2022.105220 (2022).
Zhang, Y. F. et al. Generation of global 1 km daily soil moisture product from 2000 to 2020 using ensemble learning. Earth Syst. Sci. Data. 15, 2055–2079, https://doi.org/10.5194/essd-15-2055-2023 (2023).
Dickinson, R. E. et al. Biosphere-atmosphere transfer scheme (BATS) for the NCAR community climate model. NCAR Tech. Note NCARE/TN-275+STR. https://doi.org/10.5065/D6668B58 (1986).
Dai, Y. J. & Zeng, Q. C. A. land surface model (IAP94) for climate studies part I: Formulation and validation in off-line experiments. Adv. Atmos. Sci. 14(4), 433–460, https://doi.org/10.1007/BF02918716 (1997).
Niu, G. Y. et al. Development of a simple groundwater model for use in climate models and evaluation with Gravity Recovery and Climate Experiment data. J. Geophys. Res. Atmos. 112, https://doi.org/10.1029/2006JD007522 (2007).
Lawrence, P. J., Chase, T. N. Representing a new MODIS consistent land surface in the Community Land Model (CLM 3.0). J. Geophys. Res. Biogeosci. 112, https://doi.org/10.1029/2006JG000168 (2007).
Oleson, K. W. et al. Improvements to the Community Land Model and their impact on the hydrological cycle. J. Geophys. Res. Biogeosci. 113, https://doi.org/10.1029/2007JG000563 (2008).
Miliaresis, G. C. & Paraschou, C. V. E. Vertical accuracy of the SRTM DTED level 1 of Crete. Int J. Appl Earth Obs 7, 49–59, https://doi.org/10.1016/j.jag.2004.12.001 (2005).
Peng, S., Ding, Y., Liu, W. & Li, Z. 1 Km Monthly Temperature and Precipitation Dataset for China From 1901 to 2017. Earth Syst Sci Data 11, 1931–1946, https://doi.org/10.5194/essd-11-1931-2019 (2019).
Peng, S. et al. Spatiotemporal change and trend analysis of potential evapotranspiration over the Loess Plateau of China during 2011–2100. Agric. For. Meteorol. 233, 183–194, https://doi.org/10.1016/j.agrformet.2016.11.129 (2017).
Chen, L. W. et al. Generating high accuracy multi-layer soil moisture at daily scale in the black soil region of China. Figshare. https://doi.org/10.6084/m9.figshare.28523087 (2025).
Iman bin Hussain, M. D. et al. Enhancing Satellite Image Co-Registration Using Mirror Array as Artificial Point Source for MultiSource Image Harmonization. IEEE J-STARS 18, 16983–16996, https://doi.org/10.1109/JSTARS.2025.3582238 (2025).
Acknowledgements
This study was supported by the Strategic Priority Research Program (Class A) of the Chinese Academy of Sciences (No.XDA28020501) and Science and technology research project of Education Department of Jilin Province (938038) and Key Research and Development Project of Jilin Province (20240304135SF) and Joint Research Project on Improving Meteorological Capability of China Meteorological Administration (23NLTSQ008) and Songliao Basin Meteorological Science and Technology Innovation Project (SL202401). Northeast regional science and technology collaborative innovation joint fund project (2024ZD004).
Author information
Authors and Affiliations
Contributions
L.C. and J.R. and H.S. designed the study. L.C. and J.R., and H.S. performed the data assimilation experiments. H.S. contributed to the data assimilation experiments setup. L.C. and J.R. and L.W. helped with data validation. L.C. and Y.W. and G.Z. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Chen, L., Wang, Y., Ren, J. et al. Generating high accuracy multi-layer soil moisture at daily scale in the black soil region of China. Sci Data 12, 1705 (2025). https://doi.org/10.1038/s41597-025-05986-7
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-025-05986-7











