Background & Summary

The leaf area index (LAI), defined as one-half of the total leaf area per unit ground surface area1, is a critical parameter used to characterize the structure and function of vegetation2. Since the LAI directly relates to the acquisition and utilization of sunlight by leaves, it is a key parameter in terrestrial ecosystem models and closely related to the carbon cycle as well as to photosynthesis, respiration and transpiration in leaves3.

Many global and regional LAI products with different temporal and spatial resolutions exist that are derived using various retrieval algorithms and can be applied in studies addressing ecophysiology, atmosphere-ecosystem interactions and global change4,5. However, due to the limitations resulting from radiometric calibration, the atmospheric correction of raw data, the scale effect, and retrieval algorithms, errors inevitably exist in satellite products. Thus, to make appropriate use of satellite products, it is essential to investigate and quantify the uncertainties associated with these products6,7.

Field measurements serve as ‘reference’ values and constitute an important part of the validation of remote sensing products8,9. LAI measurement methods are generally categorized into direct and indirect methods10. Indirect methods include optical methods based on Beer’s law and inclined-point quadrat methods, in which the LAI is calculated by measuring other variables, such as the gap fraction, light transmission, and the contact number. However, the influences of the clumping effect, woody components and the leaf angle distribution (LAD) also need to be considered11,12,13. However, correcting for these variables is challenging because difficulties in their accurate measurment14. Several methods have been developed to correct the clumping index, including the finite-length averaging method15, the gap-size distribution method16,17, a combination of the gap-size distribution and finite-length averaging methods18, and the path length distribution method11. These methods, which have been applied for decades, should increase accuracy and be able to be used for new applications. Many comparisons of direct and indirect methods of LAI measurement for crops and forests have also been made19,20. The results of these comparisons have indicated that the indirect methods can underestimate the LAI, which may be due to the clumping of branches and stems, especially in forested areas19,21. In the case of corn, the indirect measurements made by the AccuPAR ceptometer, which measures photosynthetically active radiation (PAR) and inverts these readings to acquire canopy LAI, were shown to give higher mean values of LAI than those collected using destructive methods22,23. Moreover, with the exception of techniques such as downwards-facing digital hemispherical photography (DHP)24,25, it is very challenging to measure the LAI of low vegetation, such as wheat and paddy rice in the early growth stages, using indirect optical methods due to difficulties in collecting the downward radiation through canopies.

In direct methods, plants are collected in a destructive way, and the LAI is determined by measuring the area of the sampled leaves and then dividing by the sampling area. Values of the LAI determined using direct methods are thus considered to be the most accurate4, and they are therefore used to calibrate indirect measurements. Although direct methods are much more time consuming and labour intensive than indirect optical-instrument methods, the use of destructive direct measurement methods is still feasible for small samples of low vegetation such as crops.

In general, an individual pixel of a satellite product covers a certain range on the ground that does not match the area represented by the sampling point on the ground. To overcome the spatial mismatch between field measurements and coarse-resolution LAI products26, multiscale validation based on fine-resolution satellite or airborne remote sensing imagery is employed to bridge the gap between ground measurements and coarse-resolution satellite data. Several previous validation efforts have partially addressed the scale problem in remote sensing: for example, the Bigfoot27 program links field measurements and Landsat-7 ETM data to generate high spatial-resolution maps to overcome spatial mismatch, and in the Cold Land Processes Field Experiment28 (CLPX), a multiscale dataset based on a nested sampling strategy for upscaling was built. A series of protocols and good practices for the validation of global LAI products have been established by the CEOS/WGCV LPV subgroup29. The strategy of validation proposed by the CEOS/WGCV LPV subgroup consists of direct validation and intercomparison approaches2,30,31. The On Line Interactive Validation Exercise (OLIVE)32 platform hosted by the European Space Agency (ESA) Cal/Val portal followed the guidelines of CEOS/WGCV LPV and provided two independent datasets for validation: BELMANIP233 and DIRECT2, which contained 445 and 113 sites, respectively. The Validation of Land European Remote sensing Instruments (VALERI) project focuses on validation activity to obtain consistent approaches and acquire data in a synergistic way. For the purpose of validating coarse-resolution satellite products, this validation project has developed high spatial-resolution (10–30 m) maps of biophysical variables including LAI that were calibrated using ground measurements34. In the FP7 ImagineS project, field measurements have been collected to evaluate the products of the Copernicus Global Land Service (CGLS) derived from satellites since 2013, and the in situ measurements were processed according to the guidelines defined by the CEO/WGCV LPV subgroup31. High spatial-resolution imagery was employed to upscale the local measurements by EOLAB to generate reference maps of LAI based on the protocols established by the VALERI project35. In addition, many validation efforts have also been carried out in China. The Heihe Integrated Observatory Network was established for long-term observations in 2007. Additionally, a series of multiscale observation experiments over heterogeneous land surfaces were conducted in the Heihe River Basin (HRB)36. Among those long-term observations in the HRB, the Heihe Watershed Allied Telemetry Experimental Research (HiWATER) team investigated the LAI on the basis of regular manual observation during 2013–201537, and automatic observation devices for monitoring the LAI were installed and have been operational at three superstations since 2018. In addition, a seasonal field campaign was carried out by Fang et al. to collect LAI measurements of paddy rice, maize, soybean and sorghum using indirect optical methods in Northeast China in 2012–2013 and 2016, which were used to evaluate satellite-based LAI products38,39.

Direct validation of coarse-resolution LAI products derived from remote sensing data works in concert with the comparison of satellite products with upscaled field LAI maps on the basis of spatial–temporal synchronization40. Numerous efforts have been conducted to validate coarse-resolution LAI products using fine-resolution LAI maps calibrated with in situ measurements24,25,41,42,43. Unfortunately, almost all the validation datasets for coarse-resolution LAI products are based on indirect field measurements, and the uncertainties in these data could be transferred to the products to be validated. Various conclusions regarding the validation of LAI products over croplands have thus far been drawn. Early studies found that the Moderate Resolution Imaging Spectroradiometer (MODIS) LAI generally underestimates the LAI of crops at the senescence stage35,44. By evaluating the GLASS, MODIS (V6), and VIIRS products, Fang et al. (2019) found that the LAI was underestimated at a paddy rice site, especially when LAI > 3.0; the results also indicated an overestimation of GEOV2 for rice39. However, Campos-Taberner et al. (2018) recently presented results showing that GEOV1, MODIS (V5) and EPS performed well for rice in southern Europe (root-mean-square error (RMSE) ≤ 0.80)45. Luke et al. (2020) assessed the CGLS 300 m V1, MODIS (V6), and VIIRS (V6) LAI products in North America, and the results indicated that the CGLS 300 m V1 gave the best agreement (root-mean-square deviation (RMSD) = 0.57) in comparison with RMSD values of 0.81 and 0.89 for VIIRS and MODIS (V6) products, respectively43. Xu et al. (2018) assessed the uncertainties/relative uncertainties of VIIRS and MODIS LAI products using ground measurements, with observed values of 0.60/42.2% and 0.55/39.3%, respectively46.

The accurate and comprehensive validation of coarse-resolution LAI products is still very difficult due to the lack of sufficient direct field measurements. The aim of this study was to develop a highly accurate LAI validation dataset with fine-resolution for Chinese croplands to validate coarse-resolution satellite products based on direct field measurements. The fine-resolution reference LAI maps were generated from Landsat imagery using a local semiempirical model; these maps were then used as a bridge to evaluate the coarse-resolution products. Here, reference maps were applied to validate the accuracy of the MODIS LAI product.

Methods

Study area

Field LAI measurements were collected in four areas: Beijing, Henan Province, Heilongjiang Province, and Anhui Province, as illustrated in Fig. 1. Online-only Table 1 shows detailed information about the field measurements and selected Landsat surface reflectance images in the four study areas. A total of 1010 samples corresponding to 43 growth stages were collected during the experiments. The collected samples included wheat, barley, paddy rice and soybean. The specific sampling dates, numbers of samples, and types of crops are listed in Online-only Table 1.

Fig. 1
figure 1

(a) Location and vegetation type of the study areas in (b) Beijing, (c) Youyi Farm in Heilongjiang Province, (d) Longkang Farm in Anhui Province, (e) Zhoukou, Henan Province, and (f) Jiaozuo, Henan Province.

The experiments in Beijing were carried out during the winter wheat growing seasons from 2004 to 2007. Beijing is located in the north of the North China Plain, which is a warm temperate zone with a semihumid and semiarid monsoon climate.

The study sites in Henan Province were located in Jiaozuo and Zhoukou, which have temperate monsoon climates with abundant sunshine and a clear difference between the summer and winter temperatures. The average annual temperature in these areas is between 12.8 °C and 14.8 °C. The annual average precipitation is 644.3 mm, with 45%–60% of the precipitation falling from June to August. The crop grown at these study sites is winter wheat.

The study area in Heilongjiang Province was located at Youyi Farm, which is situated on the Sanjiang Plain. The total cultivated area of this study area is 1104.29 km2, and the main crops are wheat, barley, paddy rice and soybean. The region has a temperate continental monsoon climate with a mean annual temperature of 3.4 °C. The annual average precipitation is approximately 540 mm, and the precipitation is concentrated in the summer. The Sanjiang Plain is one of the most well-known black soil plains worldwide and is characterized by a low soil albedo.

The fourth field experiment was conducted at Longkang Farm (33°06′45.2″N, 116°51′44.8″E), Anhui Province, in 2017. This study area is located in the southern part of the Huaibei Plain. The study area has an elevation of approximately 22.7–25.9 m above sea level and covers a cultivated area of approximately 20 km2. It is located in a transition zone between the subtropics to the south and the warm temperate zone to the north. The site itself lies in the warm temperate semihumid monsoon agricultural zone and receives moderate rainfall and sufficient sunshine. The annual average amount of sunshine is approximately 2000 hours, which is approximately 54% of the possible maximum. The annual average temperature is 14.84 °C, and the average annual precipitation is approximately 789 mm.

LAI measurements

All of the field LAI measurements were collected using a destructive sampling method. The locations of sampling points and vegetation types of study areas were illustrated in Figure S1 in the Supplementary Information. Plant samples were taken from areas of 1 m × 1 m; after being cut, they were quickly taken to the laboratory. All of the fresh leaves were quickly weighed, and 10 typical leaves were scanned to determine the leaf area. These 10 typical leaves and the remaining leaves were then dried in an oven until a constant weight (the dry weight, DW) was reached so that the leaf DW could be obtained. The specific leaf weight (SLW) and LAI were determined as follows:

$${\rm{SLW}}=\frac{{\left({\rm{DW}}\right)}_{0}}{{{\rm{A}}}_{0}}\left(g.c{m}^{-2}\right)$$
(1)
$${\rm{LAI}}=\frac{{\rm{DW}}}{{\rm{SLW}}\times {{\rm{A}}}_{{\rm{s}}}}$$
(2)

where DW is the total dry weight of the leaves; A0 and (DW)0 are the area and dry weight of the typical leaves, respectively, which were used to calculate the SLW; and As is the sampling area (1 m × 1 m). Here, the elementary sampling unit (ESU) method29,31 was not employed to collect LAI measurements due to the large amount of effort required to implement the destructive method. The crops were relatively uniform in comparison to the natural vegetation. According to investigations by Song et al.47, the spatial heterogeneity of winter wheat is relatively small, with a variation coefficient less than 6% for the optimized soil-adjusted vegetation index (OSAVI). Thus, only one uniform plot with a size of 1 m × 1 m was sampled to represent a Landsat TM pixel. In addition, more than 20 samples were collected to build a semiempirical model to retrieve the LAI in each growth stage, with which a fine-resolution LAI map can be generated.

Landsat surface reflectance data and normalization

The Landsat-5 TM and Landsat-8 OLI surface reflectance (SR) products, for which a sufficient number of satellite images acquired at the same time as the field measurements were available, were used as a ‘bridge’ for upscaling the field LAI measurements to match the coarse-resolution LAI products. All of the Landsat TM and OLI SR images were downloaded from the United States Geological Survey (USGS) EarthExplorer website (https://earthexplorer.usgs.gov). All of these data consisted of SR products that had been derived from Level-1 data by atmospheric correction. Landsat TM/ETM SR data are generated with specialized software called the Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS)48. Landsat-8 OLI SR data are generated from the Land Surface Reflectance Code (LaSRC), which makes use of the coastal aerosol band to perform aerosol inversion tests and uses MODIS auxiliary climate data and a unique radiative transfer model49. The criteria for the selection of the Landsat SR images were that the imagery should be cloud free and acquired within seven days of the field measurements50. As a result, a total of eight Landsat-5 TM imagers that matched the field measurements (path 123/row 32) were collected for the Beijing area. For Henan Province, four clear Landsat-5 TM images of the Zhoukou area (path 123/row 37) and one Landsat-5 TM image of the Jiaozuo area (path 125/row 36) that matched the field measurements were found. Five clear Landsat-5 TM images (path 114/row 28, path 115/row 27) that covered Youyi Farm, Heilongjiang Province, and two clear Landsat-8 images of Longkang Farm, Anhui Province, were selected. The acquisition dates of the satellite imagery are listed in Online-only Table 1. Because of the limitations on the observation time and degree of cloud contamination in the Landsat satellite imagery, data from only 20 of the 43 field experiments listed in Online-only Table 1 were used to generate the fine-resolution reference LAI maps.

The satellite-based NDVI is a crucial variable in the semiempirical model during the upscaling procedure. To reduce the uncertainty related to the data quantification and determine the parameters in semiempirical models more accurately, the Landsat-5 TM SR imagery was normalized using the MODIS (MCD43A4) version 6 Nadir Bidirectional Reflectance Distribution Function (BRDF)-Adjusted Reflectance (NBAR) product51, which provides 500 m reflectance data adjusted using a bidirectional reflectance distribution function to model the reflectance values as if they were taken at nadir view.

Relative radiation normalization is widely used to eliminate the radiation differences among images acquired at different epochs or collected by different space-borne instruments. A clear SR image was generally selected as a reference to normalize the target image using a linear regression model band by band52. Here, it was employed to normalize the Landsat TM SR image using the MODIS SR data as a reference. To obtain the linear regression model for normalization processing, the 30 m TM images were aggregated to a resolution of 500 m and converted to the same sinusoidal projection as the MODIS product used; then, linear regression models were built to link Landsat TM data to MODIS SR data band by band. If the determination coefficient (R2) was greater than 0.75, the TM SR data were normalized using the linear regression model; otherwise, the ratio of the mean values of the TM and MODIS SR data was used to normalize the Landsat TM SR data.

A comparison of the MODIS and Landsat TM SR products (including the reflectance at the red and near-infrared bands and the NDVI) was therefore performed to normalize the Landsat SR products. Figure 2 shows the scatterplot of the normalized Landsat SR product data against the MCD43A4 data on April 1, 2004, in Beijing. The results show that the regression lines deviate from the 1:1 line, indicating that the TM red-band reflectance was higher than that of the MODIS data and that the Landsat NDVI values were smaller than the corresponding MODIS values. The normalization functions for Landsat TM red and near-infrared bands in the Beijing, Henan, and Heilongjiang study areas are illustrated in Tables S1S3 in the Supplementary Information. The corresponding scatterplots are also provided in Figures S2S7 in the Supplementary Information.

Fig. 2
figure 2

Normalization of the Landsat-5 TM SR data using the MCD43A4 product for the Beijing study area on April 1, 2004. (a) Plot of TM red-band reflectance against MODIS red-band reflectance; (b) plot of TM near-infrared reflectance against MODIS near-infrared reflectance; (c) correlation between MODIS and TM NDVI values before calibration; (d) correlation between MODIS and TM NDVI values after calibration.

MODIS LAI product (MCD15A2H)

In this study, we applied the fine-resolution validation dataset to assess the MODIS LAI product with coarse-resolution, one of the most commonly used global LAI products. The MODIS LAI product version 6 (MCD15A2H) was devised by Myneni et al.53 in 2015. This product is widely known as a mainstream global LAI product and has been applied to the modelling of atmospheric carbon assimilation, crop growth, and evapotranspiration. It is produced using a combination of Terra and Aqua data acquired every 8 days at a 500-m spatial resolution. The algorithm used to produce this product is based on three-dimensional radiative transfer theory, which is ultimately optimized using a look-up table (LUT) to solve the radiative transfer equation54. In addition to the main LUT method, a back-up algorithm based on directional vegetation indices can be employed to retrieve the LAI for different biomes55.

Semiempirical NDVI-based model for generating fine-resolution LAI validation maps

A semiempirical model was employed to model the relationship between the NDVI and LAI. This model was based on the Beer-Lambert Law56:

$$NDVI=NDV{I}_{\infty }+(NDV{I}_{bs}-NDV{I}_{\infty })\ast exp(-{K}_{ndvi}\ast LAI)$$
(3)

where NDVIbs is the NDVI value of bare soil, NDVI is the NDVI value corresponding to saturation of the LAI, and Kndvi is the extinction coefficient, which is related to the structure of the scattering community (in particular, the leaf inclination distribution) and the leaf optical properties. The parameters in Eq. (3) were optimized to produce the best accuracy for the Landsat scenes covering the different study areas using the local experimental data at different growth stages and a curve-fitting algorithm to give the lowest fitting error57. For instance, NDVI = 0.93, NDVIbs = 0.15 and Kndvi = 1.58 were derived from the experimental data obtained on April 1, 2004, in Beijing, as illustrated in Fig. 3.

Fig. 3
figure 3

Relationship between NDVI and LAI using measurements of winter wheat in Beijing. NDVI represents the asymptotic value of NDVI when LAI tends towards infinity, NDVIbs represents the NDVI value corresponding to that of the bare soil, and Kndvi is the extinction coefficient in the NDVI-based LAI model.

Once the parameters in Eq. (3) had been determined using the field data, the NDVI-based regression model could be used to generate the fine-resolution LAI maps using the equation

$$LAI={K}_{ndvi}\ast ln\left((NDV{I}_{\infty }-NDV{I}_{bs})/\left(NDV{I}_{\infty }-NDVI\right)\right)$$
(4)

The fine-resolution 30 m LAI maps were first generated using Landsat SR images for different growth stages and areas using the appropriate NDVI-based model. Cloud-free reference LAI maps with a size of 3 km × 3 km centred on the field sampling points were then acquired for use as potential validation maps. Finally, the proportion of cropland in each 3 km × 3 km reference map was calculated using the GLOBELAND30–2010 land cover product58, as shown in Fig. 1. Only the potential LAI validation maps with a proportion of cropland larger than 75% were selected for use as validation maps.

LOOCV validation method

Due to limited field measurements in each growth stage, the leave-one-out cross-validation (LOOCV) approach59 and curve-fitting algorithm were employed to generate the NDVI-based LAI model. The LOOCV method splits a dataset into a training set and a testing set using all but one observation as part of the training set. For example, there were 22 samples in the Beijing field experiment performed on April 1, 2004. The LOOCV approach chose 21 observations as training samples and one observation as a validation sample. This procedure was repeated 22 times. For each repeat, 21 field measurements were used to determine the parameters in Eq. (4) based on the curve-fitting algorithm. This algorithm is in the Python scipy.optimize module, which uses nonlinear least squares to fit a function57. Due to the limitation of sample size, we were required to set the bounds for the parameters, and the algorithm derives the optimal values for the parameters through iteration so that the sum of the squared residuals of the function is minimized. The value range of NDVI is 0.91–0.97, NDVIbs ranged between 0.01 and 0.18, and Kndvi is in the range of 1.3–1.8. Thus, 22 statistical equations were obtained during the procedure. All the field measurements were separately brought into the 22 equations to identify the equation with the lowest RMSE, which was selected as the equation to generate the fine-resolution LAI map.

The equations used to generate the fine-resolution LAI map for each growth stage in the different study areas are shown in Table 1.

Table 1 NDVI-based statistical models used to generate fine-resolution LAI at each growth stage in the different study areas.

Several quality indicators were employed to assess the reference maps and LAI products, including the RMSE, relative root mean square error (RRMSE), coefficient of determination (R2), and relative bias. Relative bias is the relative difference between the corresponding reference LAI and field LAI. It was defined as follows:

$${\rm{Relative}}\;{\rm{bias}}=(mea{n}_{LA{I}_{ref}}-mea{n}_{LA{I}_{field}})/mea{n}_{LA{I}_{field}}$$
(5)

where \(mea{n}_{LA{I}_{ref}}\) represents the mean value of the estimated reference LAI in each growth stage and \(mea{n}_{LA{I}_{field}}\) represents the mean value of the field LAI in each growth stage.

Uncertainty is one of most important indicators used to represent the accuracy of reference maps and is of great significance for product validation. The uncertainty was defined as follows:

$${\rm{Uncertainty}}={{\rm{LAI}}}_{{\rm{mean}}}\times {\rm{RRMSE}}$$
(6)

where LAImean represents the mean value of LAI within the 3 km × 3 km reference map and RRMSE represents the relative root mean square error between the generated and field-measured LAI in each growth stage.

Determination of scaling difference using different upscaling methods

In the absence of scaling errors, Tian et al. (2003) found that the LAI obtained from coarse-resolution satellite data should be equal to the arithmetic average of values obtained from fine-resolution data60. Due to the heterogeneity of the land surface and nonlinearity of the inversion model, scaling errors are inevitable in retrieving LAI at coarse spatial resolution61,62,63. To investigate the scaling errors inherent to the coarse-resolution LAI product, the differences in the U1 and U2 upscaling methods were obtained to partly quantify the errors in product validation. The upscaling method U1 is the so-called ‘invert first and then average’ method, in which the fine-resolution NDVI is calculated first and the fine-resolution LAI is then retrieved based on the semiempirical NDVI-based model. The fine-resolution LAI maps are then aggregated (i.e., upscaled) to generate the coarse-resolution LAI. The upscaling method U2 is the so-called ‘average first and then invert’ method. Using this method, the fine-resolution SR image is aggregated to a coarse-resolution image to derive the coarse-resolution NDVI. The semiempirical NDVI-based model is then used to retrieve the coarse-resolution LAI. The difference in pixel value between the coarse-resolution LAI images obtained using the two different upscaling methods can be regarded as the spatial-scale difference26,61. Details regarding scaling differences are provided in the Supplementary Information.

Data Records

On the basis of the selection rules introduced in the Semiempirical NDVI-based model for generating fine-resolution LAI validation maps section, a total of 80 fine-resolution LAI validation maps with a size of 3 km × 3 km were generated from the Landsat-5 TM and Landsat-8 OLI reflectance data; these maps are provided in the Supplementary Information, Figures S9S13. Detailed statistical metrics for these 80 fine-resolution maps are summarized in Tables 25.

Table 2 Statistical metrics of the fine-resolution LAI maps of the Beijing study area.
Table 3 Statistical metrics of the fine-resolution LAI maps of the study areas in Jiaozuo and Zhoukou, Henan Province.
Table 4 Statistical metrics of the fine-resolution LAI maps of the Youyi Farm study area, Heilongjiang Province.
Table 5 Statistical metrics of the fine-resolution LAI maps of the Longkang Farm study area, Anhui Province.

The scaling difference was taken as the difference between the mean LAI values generated using the two different upscaling methods that were introduced in Figure S8 in the Supplementary Information. The standard deviation reflects the spatial heterogeneity of the 3 km × 3 km fine-resolution LAI maps. The underestimation caused by the scaling difference for the Henan, Beijing, and Anhui study areas (which have relatively light soil substrates) and the overestimation for the Heilongjiang study area (where the soil background is dark) agree with the results of the investigation performed by Liu et al. (2014) and Chen et al. (2002), who found that there was an “underestimation for mixed pixels with bright non-vegetation components and an overestimation for those with dark non-vegetation components ”26,64.

Table 2 lists the statistical metrics of the fine-resolution LAI validation maps for Beijing. A total of 32 reference maps corresponding to eight growth stages were used between 2004 and 2007. The LAI for the 32 reference maps is relatively low, ranging from 0.273 to 2.257, with a mean uncertainty of 0.290. The spatial heterogeneity is relatively large and has a mean standard deviation of 0.720, which gives a relatively large scaling difference with a mean value of 0.046.

Table 3 lists the statistical metrics of the fine-resolution LAI validation maps in the study areas of Henan Province. Twenty reference maps corresponding to five growth stages were used from 2003 to 2004. The LAI for these 20 reference maps varies from 1.615 to 4.310, with a mean uncertainty of 0.364. The spatial heterogeneity is higher than that for the Beijing study area and has a mean standard deviation of 1.361. The scaling difference is still obvious and has a mean value of 0.302.

Table 4 lists the statistical metrics of the fine-resolution LAI validation maps for Youyi Farm, Heilongjiang Province. Here, 20 reference maps corresponding to five growth stages were used from 2005 to 2006. The LAI in these maps is relatively low, ranging from 0.293 to 1.338, with a mean uncertainty of 0.189. At Youyi Farm, the size of the fields was much larger than that in the other study areas; the spatial heterogeneity is thus relatively small and has a mean standard deviation of 0.413. The scaling difference is the smallest among all the study areas and has a mean value of 0.013.

Table 5 lists the statistical metrics of the fine-resolution LAI validation maps for Longkang Farm, Anhui Province. These statistics are for eight reference maps corresponding to two growth stages in 2017. The LAI for these eight reference maps is relatively large, ranging from 2.190 to 4.651, with a mean uncertainty of 0.685. The spatial heterogeneity is similar to that in the Henan study area, with a mean standard deviation of 1.528. The scaling difference has a mean relative value of 0.553.

The field measurements, published for public use, are available at Zenodo, https://doi.org/10.5281/zenodo.5091251. The dataset contains readme files, compressed files of the fine-resolution LAI maps, and files of statistics for the reference maps. The intermediate NDVI files and reference LAI maps derived using the U2 upscaling methods are also provided65.

Technical Validation

Performance of the semiempirical models

The semiempirical NDVI-based models used to generate the fine-resolution reference LAI maps were validated using field measurements and the LOOCV method for the four study areas. This process is illustrated in Figs. 47. The results of a statistical comparison of the field-measured and generated LAI are also displayed in the figures.

Fig. 4
figure 4

Comparison of the fine-resolution reference LAI and the field-measured data for wheat in different stages of growth in the Beijing study area.

Fig. 5
figure 5

Comparison of the fine-resolution reference LAI values and the field-measured data for wheat in different stages of growth in Jiaozuo and Zhoukou, Henan Province.

Fig. 6
figure 6

Comparison of the fine-resolution reference LAI values and the field-measured data for wheat and barley in different stages of growth at Youyi Farm, Heilongjiang Province.

Fig. 7
figure 7

Comparison of the fine-resolution reference LAI values and the field-measured data for wheat in different stages of growth at Longkang Farm, Anhui Province.

In Figs. 47, the field-measured LAI values are compared with the LAI values derived by applying the semiempirical LAI model to Landsat TM/OLI SR data for the four study areas (Beijing, Henan, Heilongjiang, and Anhui). The results shown in Fig. 4 are characterized by slopes that are close to the 1:1 line, with RMSE values ranging from 0.25 to 0.72. As the results are displayed separately for each growth stage, the LAI values measured during the early growth stage have a wide distribution, with the result that the coefficient of determination for the regreening stage is low. Figure 5 displays the relationship between the field-measured LAI and the predicted LAI values for the Henan test area based on the formal semiempirical model: in this case, the RMSE ranges from 0.31 to 0.92, and the RRMSE is less than 23.16%. Figure 6 shows a comparison of the field-measured and predicted LAI values for Youyi Farm, Heilongjiang Province. On May 5th, 2005, and June 6th, 2006, field measurements of both wheat and barley were performed at this site; the samples collected on June 14th, 2007, were of barley only. Since barley and wheat are crops with similar vegetation structures, the two crop types are not separated in this comparison. The RMSE for these data has a range of 0.22 to 0.37, and the RRMSE has a range of 18.25% to 36.78%. The plots displayed in Fig. 7 show the relationship between the field-measured and predicted LAI values for Longkang Farm, Anhui Province. The slopes here are close to the 1:1 line, and the RMSE has a range of 0.67 to 0.95.

Validation of MODIS LAI

The 80 reference LAI maps with a size of 3 km × 3 km derived from the two upscaling methods (Figure S8 in Supplementary Information) and the corresponding field LAI measurements were employed to validate the MODIS LAI V6 product (MCD15A2H) for the four study areas. The validation results are illustrated in Fig. 8 and Table 6.

Fig. 8
figure 8

Validation results for the MCD15A2H LAI products obtained by applying (a) the U1 upscaling method to the LAI data from the 80 fine-resolution LAI maps and (b) the U2 upscaling method to LAI data from the 80 reference maps. (c) Validation results obtained using the corresponding field measurements.

Table 6 Validation metrics for the MODIS LAI product using data from the fine-resolution reference LAI maps and field-measured LAI values in the four study areas. R2: coefficient of determination, RMSE: root mean square error, RRMSE: relative RMSE, RB (relative bias): ratio of the difference in the MODIS and fine-resolution LAI to the fine-resolution LAI.

In Fig. 8(a), the fine-resolution reference LAI maps (30 m) derived from Eq. (4) were compared with the MODIS LAI in the range of 3 km × 3 km, which refers to the U1 upscaling method. To investigate how the scaling difference contributes to the discrepancies between the fine-resolution maps and the coarse-resolution products, the reference LAI maps at 500 m resolution were obtained based on the ‘average first and then invert’ (U2 upscaling) method with a size of 3 km × 3 km (as described in Figure S8). These reference LAI maps at 500 m resolution were compared with the MODIS LAI, as illustrated in Fig. 8(b). In addition, the field LAI measurements were directly compared with the corresponding MODSI LAI, as illustrated in Fig. 8(c).

The results illustrated in Fig. 8(a) indicate that the MODIS LAI values are underestimated in comparison to the fine-resolution reference LAI data in the range of 3 km × 3 km, especially in the case of the Henan study area. Table 6 shows that the accuracy of the MODIS LAI product varies among the study areas: the values are severely underestimated for crops in Beijing, Henan, and Anhui (relative bias = –27.0%, –48.9%, and –10.8%, respectively), whereas the values are overestimated for the crops with a black soil background in Heilongjiang Province (relative bias = 56.9%).

Due to the existence of surface heterogeneity, applying the model developed with 30 m data to 500 m data could result in some discrepancies. Since coarse-resolution LAI should be equal to aggregated fine-resolution LAI in the absence of scaling errors, validation using the reference LAI derived from the U2 method will result in artificially high accuracy60. However, by comparing the validation results from the U1 and U2 methods, the error due to the scale effect inherent to the coarse-resolution product can be at least partly quantified. In Fig. 8(b), the results gave an RMSE of 0.78 against the value of 0.91 that was obtained by applying the U1 (‘invert first and then average’) upscaling method to the reference LAI dataset in Fig. 8(a), which indicates that the scaling difference also contributes to the error in the coarse-resolution MODIS LAI product. When the scaling difference was taken into consideration and compensated for by applying the U2 upscaling method to the reference LAI dataset, the underestimates for the Beijing, Henan, and Anhui areas were reduced, giving relative biases of −24.0%, −43.0%, and 6.0%, respectively, compared with –26.9%, −48.9%, and −10.8% in Fig. 8(a), respectively. In terms of the accuracy of MODIS LAI in Heilongjiang, since the land cover in Heilongjiang is relatively uniform, the mean scaling difference among the four study areas is lowest, and the RMSE and relative bias thus slightly increased from 0.52 to 0.53 and 56.9% to 59.8%, respectively. A direct comparison with the field measurements (Fig. 8(c)) produced much higher uncertainties (RMSE = 1.99, RRMSE = 76.8%, relative bias = −49.3%) than were found by using the upscaled reference LAI dataset.

In this study, a highly accurate fine-resolution LAI dataset for Chinese croplands that could be used as a reference for coarse-resolution LAI products was derived from field measurements and fine-spatial-resolution satellite imagery (Landsat-5 TM and Landsat-8 OLI data). A semiempirical statistical model based on the Beer–Lambert law was used to derive fine-resolution LAI data that could be used for validation of the coarse-resolution LAI product at each growth stage. The parameters of each semiempirical model were estimated using the field LAI at each growth stage based on the curve-fitting algorithm and LOOCV approach. During this procedure, the performance of each semiempirical model was also investigated. Finally, eighty fine-resolution reference LAI maps with a size of 3 km × 3 km were generated for the study areas in four Chinese provinces. This fine-resolution reference LAI dataset was applied to assess the accuracy of MODIS LAI among these four study areas using the U1 upscaling method. The MODIS LAI was also compared to the reference LAI generated using the U2 upscaling method, through which the error due to the scale effect inherent to the coarse-resolution LAI product can be partly quantified. The direct comparison of the LAI data collected in the field and MODSI LAI showed considerable uncertainty. Therefore, this study contributes to the validation of remote sensing LAI products by providing a set of fine-resolution reference LAI datasets based on destructive sampling methods and highlights the importance of using a fine-resolution reference LAI dataset based on direct field measurements. Such a dataset can bridge the gap between field measurements and coarse-resolution pixel data.