Abstract
Numerous validation efforts have been conducted over the last decade to assess the accuracy of global leaf area index (LAI) products. However, such efforts continue to face obstacles due to the lack of sufficient high-quality field measurements. In this study, a fine-resolution LAI dataset consisting of 80 reference maps was generated during 2003–2017. The direct destructive method was used to measure the field LAI, and fine-resolution LAI images were derived from Landsat images using semiempirical inversion models. Eighty reference LAI maps, each with an area of 3 km × 3 km and a percentage of cropland larger than 75%, were selected as the fine-resolution validation dataset. The uncertainty associated with the spatial scale effect was also provided. Ultimately, the fine-resolution reference LAI dataset was used to validate the Moderate Resolution Imaging Spectroradiometer (MODIS) LAI product. The results indicate that the fine-resolution reference LAI dataset builds a bridge to link small sampling plots and coarse-resolution pixels, which is extremely important in validating coarse-resolution LAI products.
Measurement(s) | leaf area index |
Technology Type(s) | destructive sampling method |
Sample Characteristic - Environment | area of cropland |
Sample Characteristic - Location | China |
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.15124524
Similar content being viewed by others
Background & Summary
The leaf area index (LAI), defined as one-half of the total leaf area per unit ground surface area1, is a critical parameter used to characterize the structure and function of vegetation2. Since the LAI directly relates to the acquisition and utilization of sunlight by leaves, it is a key parameter in terrestrial ecosystem models and closely related to the carbon cycle as well as to photosynthesis, respiration and transpiration in leaves3.
Many global and regional LAI products with different temporal and spatial resolutions exist that are derived using various retrieval algorithms and can be applied in studies addressing ecophysiology, atmosphere-ecosystem interactions and global change4,5. However, due to the limitations resulting from radiometric calibration, the atmospheric correction of raw data, the scale effect, and retrieval algorithms, errors inevitably exist in satellite products. Thus, to make appropriate use of satellite products, it is essential to investigate and quantify the uncertainties associated with these products6,7.
Field measurements serve as ‘reference’ values and constitute an important part of the validation of remote sensing products8,9. LAI measurement methods are generally categorized into direct and indirect methods10. Indirect methods include optical methods based on Beer’s law and inclined-point quadrat methods, in which the LAI is calculated by measuring other variables, such as the gap fraction, light transmission, and the contact number. However, the influences of the clumping effect, woody components and the leaf angle distribution (LAD) also need to be considered11,12,13. However, correcting for these variables is challenging because difficulties in their accurate measurment14. Several methods have been developed to correct the clumping index, including the finite-length averaging method15, the gap-size distribution method16,17, a combination of the gap-size distribution and finite-length averaging methods18, and the path length distribution method11. These methods, which have been applied for decades, should increase accuracy and be able to be used for new applications. Many comparisons of direct and indirect methods of LAI measurement for crops and forests have also been made19,20. The results of these comparisons have indicated that the indirect methods can underestimate the LAI, which may be due to the clumping of branches and stems, especially in forested areas19,21. In the case of corn, the indirect measurements made by the AccuPAR ceptometer, which measures photosynthetically active radiation (PAR) and inverts these readings to acquire canopy LAI, were shown to give higher mean values of LAI than those collected using destructive methods22,23. Moreover, with the exception of techniques such as downwards-facing digital hemispherical photography (DHP)24,25, it is very challenging to measure the LAI of low vegetation, such as wheat and paddy rice in the early growth stages, using indirect optical methods due to difficulties in collecting the downward radiation through canopies.
In direct methods, plants are collected in a destructive way, and the LAI is determined by measuring the area of the sampled leaves and then dividing by the sampling area. Values of the LAI determined using direct methods are thus considered to be the most accurate4, and they are therefore used to calibrate indirect measurements. Although direct methods are much more time consuming and labour intensive than indirect optical-instrument methods, the use of destructive direct measurement methods is still feasible for small samples of low vegetation such as crops.
In general, an individual pixel of a satellite product covers a certain range on the ground that does not match the area represented by the sampling point on the ground. To overcome the spatial mismatch between field measurements and coarse-resolution LAI products26, multiscale validation based on fine-resolution satellite or airborne remote sensing imagery is employed to bridge the gap between ground measurements and coarse-resolution satellite data. Several previous validation efforts have partially addressed the scale problem in remote sensing: for example, the Bigfoot27 program links field measurements and Landsat-7 ETM data to generate high spatial-resolution maps to overcome spatial mismatch, and in the Cold Land Processes Field Experiment28 (CLPX), a multiscale dataset based on a nested sampling strategy for upscaling was built. A series of protocols and good practices for the validation of global LAI products have been established by the CEOS/WGCV LPV subgroup29. The strategy of validation proposed by the CEOS/WGCV LPV subgroup consists of direct validation and intercomparison approaches2,30,31. The On Line Interactive Validation Exercise (OLIVE)32 platform hosted by the European Space Agency (ESA) Cal/Val portal followed the guidelines of CEOS/WGCV LPV and provided two independent datasets for validation: BELMANIP233 and DIRECT2, which contained 445 and 113 sites, respectively. The Validation of Land European Remote sensing Instruments (VALERI) project focuses on validation activity to obtain consistent approaches and acquire data in a synergistic way. For the purpose of validating coarse-resolution satellite products, this validation project has developed high spatial-resolution (10–30 m) maps of biophysical variables including LAI that were calibrated using ground measurements34. In the FP7 ImagineS project, field measurements have been collected to evaluate the products of the Copernicus Global Land Service (CGLS) derived from satellites since 2013, and the in situ measurements were processed according to the guidelines defined by the CEO/WGCV LPV subgroup31. High spatial-resolution imagery was employed to upscale the local measurements by EOLAB to generate reference maps of LAI based on the protocols established by the VALERI project35. In addition, many validation efforts have also been carried out in China. The Heihe Integrated Observatory Network was established for long-term observations in 2007. Additionally, a series of multiscale observation experiments over heterogeneous land surfaces were conducted in the Heihe River Basin (HRB)36. Among those long-term observations in the HRB, the Heihe Watershed Allied Telemetry Experimental Research (HiWATER) team investigated the LAI on the basis of regular manual observation during 2013–201537, and automatic observation devices for monitoring the LAI were installed and have been operational at three superstations since 2018. In addition, a seasonal field campaign was carried out by Fang et al. to collect LAI measurements of paddy rice, maize, soybean and sorghum using indirect optical methods in Northeast China in 2012–2013 and 2016, which were used to evaluate satellite-based LAI products38,39.
Direct validation of coarse-resolution LAI products derived from remote sensing data works in concert with the comparison of satellite products with upscaled field LAI maps on the basis of spatial–temporal synchronization40. Numerous efforts have been conducted to validate coarse-resolution LAI products using fine-resolution LAI maps calibrated with in situ measurements24,25,41,42,43. Unfortunately, almost all the validation datasets for coarse-resolution LAI products are based on indirect field measurements, and the uncertainties in these data could be transferred to the products to be validated. Various conclusions regarding the validation of LAI products over croplands have thus far been drawn. Early studies found that the Moderate Resolution Imaging Spectroradiometer (MODIS) LAI generally underestimates the LAI of crops at the senescence stage35,44. By evaluating the GLASS, MODIS (V6), and VIIRS products, Fang et al. (2019) found that the LAI was underestimated at a paddy rice site, especially when LAI > 3.0; the results also indicated an overestimation of GEOV2 for rice39. However, Campos-Taberner et al. (2018) recently presented results showing that GEOV1, MODIS (V5) and EPS performed well for rice in southern Europe (root-mean-square error (RMSE) ≤ 0.80)45. Luke et al. (2020) assessed the CGLS 300 m V1, MODIS (V6), and VIIRS (V6) LAI products in North America, and the results indicated that the CGLS 300 m V1 gave the best agreement (root-mean-square deviation (RMSD) = 0.57) in comparison with RMSD values of 0.81 and 0.89 for VIIRS and MODIS (V6) products, respectively43. Xu et al. (2018) assessed the uncertainties/relative uncertainties of VIIRS and MODIS LAI products using ground measurements, with observed values of 0.60/42.2% and 0.55/39.3%, respectively46.
The accurate and comprehensive validation of coarse-resolution LAI products is still very difficult due to the lack of sufficient direct field measurements. The aim of this study was to develop a highly accurate LAI validation dataset with fine-resolution for Chinese croplands to validate coarse-resolution satellite products based on direct field measurements. The fine-resolution reference LAI maps were generated from Landsat imagery using a local semiempirical model; these maps were then used as a bridge to evaluate the coarse-resolution products. Here, reference maps were applied to validate the accuracy of the MODIS LAI product.
Methods
Study area
Field LAI measurements were collected in four areas: Beijing, Henan Province, Heilongjiang Province, and Anhui Province, as illustrated in Fig. 1. Online-only Table 1 shows detailed information about the field measurements and selected Landsat surface reflectance images in the four study areas. A total of 1010 samples corresponding to 43 growth stages were collected during the experiments. The collected samples included wheat, barley, paddy rice and soybean. The specific sampling dates, numbers of samples, and types of crops are listed in Online-only Table 1.
The experiments in Beijing were carried out during the winter wheat growing seasons from 2004 to 2007. Beijing is located in the north of the North China Plain, which is a warm temperate zone with a semihumid and semiarid monsoon climate.
The study sites in Henan Province were located in Jiaozuo and Zhoukou, which have temperate monsoon climates with abundant sunshine and a clear difference between the summer and winter temperatures. The average annual temperature in these areas is between 12.8 °C and 14.8 °C. The annual average precipitation is 644.3 mm, with 45%–60% of the precipitation falling from June to August. The crop grown at these study sites is winter wheat.
The study area in Heilongjiang Province was located at Youyi Farm, which is situated on the Sanjiang Plain. The total cultivated area of this study area is 1104.29 km2, and the main crops are wheat, barley, paddy rice and soybean. The region has a temperate continental monsoon climate with a mean annual temperature of 3.4 °C. The annual average precipitation is approximately 540 mm, and the precipitation is concentrated in the summer. The Sanjiang Plain is one of the most well-known black soil plains worldwide and is characterized by a low soil albedo.
The fourth field experiment was conducted at Longkang Farm (33°06′45.2″N, 116°51′44.8″E), Anhui Province, in 2017. This study area is located in the southern part of the Huaibei Plain. The study area has an elevation of approximately 22.7–25.9 m above sea level and covers a cultivated area of approximately 20 km2. It is located in a transition zone between the subtropics to the south and the warm temperate zone to the north. The site itself lies in the warm temperate semihumid monsoon agricultural zone and receives moderate rainfall and sufficient sunshine. The annual average amount of sunshine is approximately 2000 hours, which is approximately 54% of the possible maximum. The annual average temperature is 14.84 °C, and the average annual precipitation is approximately 789 mm.
LAI measurements
All of the field LAI measurements were collected using a destructive sampling method. The locations of sampling points and vegetation types of study areas were illustrated in Figure S1 in the Supplementary Information. Plant samples were taken from areas of 1 m × 1 m; after being cut, they were quickly taken to the laboratory. All of the fresh leaves were quickly weighed, and 10 typical leaves were scanned to determine the leaf area. These 10 typical leaves and the remaining leaves were then dried in an oven until a constant weight (the dry weight, DW) was reached so that the leaf DW could be obtained. The specific leaf weight (SLW) and LAI were determined as follows:
where DW is the total dry weight of the leaves; A0 and (DW)0 are the area and dry weight of the typical leaves, respectively, which were used to calculate the SLW; and As is the sampling area (1 m × 1 m). Here, the elementary sampling unit (ESU) method29,31 was not employed to collect LAI measurements due to the large amount of effort required to implement the destructive method. The crops were relatively uniform in comparison to the natural vegetation. According to investigations by Song et al.47, the spatial heterogeneity of winter wheat is relatively small, with a variation coefficient less than 6% for the optimized soil-adjusted vegetation index (OSAVI). Thus, only one uniform plot with a size of 1 m × 1 m was sampled to represent a Landsat TM pixel. In addition, more than 20 samples were collected to build a semiempirical model to retrieve the LAI in each growth stage, with which a fine-resolution LAI map can be generated.
Landsat surface reflectance data and normalization
The Landsat-5 TM and Landsat-8 OLI surface reflectance (SR) products, for which a sufficient number of satellite images acquired at the same time as the field measurements were available, were used as a ‘bridge’ for upscaling the field LAI measurements to match the coarse-resolution LAI products. All of the Landsat TM and OLI SR images were downloaded from the United States Geological Survey (USGS) EarthExplorer website (https://earthexplorer.usgs.gov). All of these data consisted of SR products that had been derived from Level-1 data by atmospheric correction. Landsat TM/ETM SR data are generated with specialized software called the Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS)48. Landsat-8 OLI SR data are generated from the Land Surface Reflectance Code (LaSRC), which makes use of the coastal aerosol band to perform aerosol inversion tests and uses MODIS auxiliary climate data and a unique radiative transfer model49. The criteria for the selection of the Landsat SR images were that the imagery should be cloud free and acquired within seven days of the field measurements50. As a result, a total of eight Landsat-5 TM imagers that matched the field measurements (path 123/row 32) were collected for the Beijing area. For Henan Province, four clear Landsat-5 TM images of the Zhoukou area (path 123/row 37) and one Landsat-5 TM image of the Jiaozuo area (path 125/row 36) that matched the field measurements were found. Five clear Landsat-5 TM images (path 114/row 28, path 115/row 27) that covered Youyi Farm, Heilongjiang Province, and two clear Landsat-8 images of Longkang Farm, Anhui Province, were selected. The acquisition dates of the satellite imagery are listed in Online-only Table 1. Because of the limitations on the observation time and degree of cloud contamination in the Landsat satellite imagery, data from only 20 of the 43 field experiments listed in Online-only Table 1 were used to generate the fine-resolution reference LAI maps.
The satellite-based NDVI is a crucial variable in the semiempirical model during the upscaling procedure. To reduce the uncertainty related to the data quantification and determine the parameters in semiempirical models more accurately, the Landsat-5 TM SR imagery was normalized using the MODIS (MCD43A4) version 6 Nadir Bidirectional Reflectance Distribution Function (BRDF)-Adjusted Reflectance (NBAR) product51, which provides 500 m reflectance data adjusted using a bidirectional reflectance distribution function to model the reflectance values as if they were taken at nadir view.
Relative radiation normalization is widely used to eliminate the radiation differences among images acquired at different epochs or collected by different space-borne instruments. A clear SR image was generally selected as a reference to normalize the target image using a linear regression model band by band52. Here, it was employed to normalize the Landsat TM SR image using the MODIS SR data as a reference. To obtain the linear regression model for normalization processing, the 30 m TM images were aggregated to a resolution of 500 m and converted to the same sinusoidal projection as the MODIS product used; then, linear regression models were built to link Landsat TM data to MODIS SR data band by band. If the determination coefficient (R2) was greater than 0.75, the TM SR data were normalized using the linear regression model; otherwise, the ratio of the mean values of the TM and MODIS SR data was used to normalize the Landsat TM SR data.
A comparison of the MODIS and Landsat TM SR products (including the reflectance at the red and near-infrared bands and the NDVI) was therefore performed to normalize the Landsat SR products. Figure 2 shows the scatterplot of the normalized Landsat SR product data against the MCD43A4 data on April 1, 2004, in Beijing. The results show that the regression lines deviate from the 1:1 line, indicating that the TM red-band reflectance was higher than that of the MODIS data and that the Landsat NDVI values were smaller than the corresponding MODIS values. The normalization functions for Landsat TM red and near-infrared bands in the Beijing, Henan, and Heilongjiang study areas are illustrated in Tables S1–S3 in the Supplementary Information. The corresponding scatterplots are also provided in Figures S2–S7 in the Supplementary Information.
Normalization of the Landsat-5 TM SR data using the MCD43A4 product for the Beijing study area on April 1, 2004. (a) Plot of TM red-band reflectance against MODIS red-band reflectance; (b) plot of TM near-infrared reflectance against MODIS near-infrared reflectance; (c) correlation between MODIS and TM NDVI values before calibration; (d) correlation between MODIS and TM NDVI values after calibration.
MODIS LAI product (MCD15A2H)
In this study, we applied the fine-resolution validation dataset to assess the MODIS LAI product with coarse-resolution, one of the most commonly used global LAI products. The MODIS LAI product version 6 (MCD15A2H) was devised by Myneni et al.53 in 2015. This product is widely known as a mainstream global LAI product and has been applied to the modelling of atmospheric carbon assimilation, crop growth, and evapotranspiration. It is produced using a combination of Terra and Aqua data acquired every 8 days at a 500-m spatial resolution. The algorithm used to produce this product is based on three-dimensional radiative transfer theory, which is ultimately optimized using a look-up table (LUT) to solve the radiative transfer equation54. In addition to the main LUT method, a back-up algorithm based on directional vegetation indices can be employed to retrieve the LAI for different biomes55.
Semiempirical NDVI-based model for generating fine-resolution LAI validation maps
A semiempirical model was employed to model the relationship between the NDVI and LAI. This model was based on the Beer-Lambert Law56:
where NDVIbs is the NDVI value of bare soil, NDVI∞ is the NDVI value corresponding to saturation of the LAI, and Kndvi is the extinction coefficient, which is related to the structure of the scattering community (in particular, the leaf inclination distribution) and the leaf optical properties. The parameters in Eq. (3) were optimized to produce the best accuracy for the Landsat scenes covering the different study areas using the local experimental data at different growth stages and a curve-fitting algorithm to give the lowest fitting error57. For instance, NDVI∞ = 0.93, NDVIbs = 0.15 and Kndvi = 1.58 were derived from the experimental data obtained on April 1, 2004, in Beijing, as illustrated in Fig. 3.
Relationship between NDVI and LAI using measurements of winter wheat in Beijing. NDVI∞ represents the asymptotic value of NDVI when LAI tends towards infinity, NDVIbs represents the NDVI value corresponding to that of the bare soil, and Kndvi is the extinction coefficient in the NDVI-based LAI model.
Once the parameters in Eq. (3) had been determined using the field data, the NDVI-based regression model could be used to generate the fine-resolution LAI maps using the equation
The fine-resolution 30 m LAI maps were first generated using Landsat SR images for different growth stages and areas using the appropriate NDVI-based model. Cloud-free reference LAI maps with a size of 3 km × 3 km centred on the field sampling points were then acquired for use as potential validation maps. Finally, the proportion of cropland in each 3 km × 3 km reference map was calculated using the GLOBELAND30–2010 land cover product58, as shown in Fig. 1. Only the potential LAI validation maps with a proportion of cropland larger than 75% were selected for use as validation maps.
LOOCV validation method
Due to limited field measurements in each growth stage, the leave-one-out cross-validation (LOOCV) approach59 and curve-fitting algorithm were employed to generate the NDVI-based LAI model. The LOOCV method splits a dataset into a training set and a testing set using all but one observation as part of the training set. For example, there were 22 samples in the Beijing field experiment performed on April 1, 2004. The LOOCV approach chose 21 observations as training samples and one observation as a validation sample. This procedure was repeated 22 times. For each repeat, 21 field measurements were used to determine the parameters in Eq. (4) based on the curve-fitting algorithm. This algorithm is in the Python scipy.optimize module, which uses nonlinear least squares to fit a function57. Due to the limitation of sample size, we were required to set the bounds for the parameters, and the algorithm derives the optimal values for the parameters through iteration so that the sum of the squared residuals of the function is minimized. The value range of NDVI∞ is 0.91–0.97, NDVIbs ranged between 0.01 and 0.18, and Kndvi is in the range of 1.3–1.8. Thus, 22 statistical equations were obtained during the procedure. All the field measurements were separately brought into the 22 equations to identify the equation with the lowest RMSE, which was selected as the equation to generate the fine-resolution LAI map.
The equations used to generate the fine-resolution LAI map for each growth stage in the different study areas are shown in Table 1.
Several quality indicators were employed to assess the reference maps and LAI products, including the RMSE, relative root mean square error (RRMSE), coefficient of determination (R2), and relative bias. Relative bias is the relative difference between the corresponding reference LAI and field LAI. It was defined as follows:
where \(mea{n}_{LA{I}_{ref}}\) represents the mean value of the estimated reference LAI in each growth stage and \(mea{n}_{LA{I}_{field}}\) represents the mean value of the field LAI in each growth stage.
Uncertainty is one of most important indicators used to represent the accuracy of reference maps and is of great significance for product validation. The uncertainty was defined as follows:
where LAImean represents the mean value of LAI within the 3 km × 3 km reference map and RRMSE represents the relative root mean square error between the generated and field-measured LAI in each growth stage.
Determination of scaling difference using different upscaling methods
In the absence of scaling errors, Tian et al. (2003) found that the LAI obtained from coarse-resolution satellite data should be equal to the arithmetic average of values obtained from fine-resolution data60. Due to the heterogeneity of the land surface and nonlinearity of the inversion model, scaling errors are inevitable in retrieving LAI at coarse spatial resolution61,62,63. To investigate the scaling errors inherent to the coarse-resolution LAI product, the differences in the U1 and U2 upscaling methods were obtained to partly quantify the errors in product validation. The upscaling method U1 is the so-called ‘invert first and then average’ method, in which the fine-resolution NDVI is calculated first and the fine-resolution LAI is then retrieved based on the semiempirical NDVI-based model. The fine-resolution LAI maps are then aggregated (i.e., upscaled) to generate the coarse-resolution LAI. The upscaling method U2 is the so-called ‘average first and then invert’ method. Using this method, the fine-resolution SR image is aggregated to a coarse-resolution image to derive the coarse-resolution NDVI. The semiempirical NDVI-based model is then used to retrieve the coarse-resolution LAI. The difference in pixel value between the coarse-resolution LAI images obtained using the two different upscaling methods can be regarded as the spatial-scale difference26,61. Details regarding scaling differences are provided in the Supplementary Information.
Data Records
On the basis of the selection rules introduced in the Semiempirical NDVI-based model for generating fine-resolution LAI validation maps section, a total of 80 fine-resolution LAI validation maps with a size of 3 km × 3 km were generated from the Landsat-5 TM and Landsat-8 OLI reflectance data; these maps are provided in the Supplementary Information, Figures S9–S13. Detailed statistical metrics for these 80 fine-resolution maps are summarized in Tables 2–5.
The scaling difference was taken as the difference between the mean LAI values generated using the two different upscaling methods that were introduced in Figure S8 in the Supplementary Information. The standard deviation reflects the spatial heterogeneity of the 3 km × 3 km fine-resolution LAI maps. The underestimation caused by the scaling difference for the Henan, Beijing, and Anhui study areas (which have relatively light soil substrates) and the overestimation for the Heilongjiang study area (where the soil background is dark) agree with the results of the investigation performed by Liu et al. (2014) and Chen et al. (2002), who found that there was an “underestimation for mixed pixels with bright non-vegetation components and an overestimation for those with dark non-vegetation components ”26,64.
Table 2 lists the statistical metrics of the fine-resolution LAI validation maps for Beijing. A total of 32 reference maps corresponding to eight growth stages were used between 2004 and 2007. The LAI for the 32 reference maps is relatively low, ranging from 0.273 to 2.257, with a mean uncertainty of 0.290. The spatial heterogeneity is relatively large and has a mean standard deviation of 0.720, which gives a relatively large scaling difference with a mean value of 0.046.
Table 3 lists the statistical metrics of the fine-resolution LAI validation maps in the study areas of Henan Province. Twenty reference maps corresponding to five growth stages were used from 2003 to 2004. The LAI for these 20 reference maps varies from 1.615 to 4.310, with a mean uncertainty of 0.364. The spatial heterogeneity is higher than that for the Beijing study area and has a mean standard deviation of 1.361. The scaling difference is still obvious and has a mean value of 0.302.
Table 4 lists the statistical metrics of the fine-resolution LAI validation maps for Youyi Farm, Heilongjiang Province. Here, 20 reference maps corresponding to five growth stages were used from 2005 to 2006. The LAI in these maps is relatively low, ranging from 0.293 to 1.338, with a mean uncertainty of 0.189. At Youyi Farm, the size of the fields was much larger than that in the other study areas; the spatial heterogeneity is thus relatively small and has a mean standard deviation of 0.413. The scaling difference is the smallest among all the study areas and has a mean value of 0.013.
Table 5 lists the statistical metrics of the fine-resolution LAI validation maps for Longkang Farm, Anhui Province. These statistics are for eight reference maps corresponding to two growth stages in 2017. The LAI for these eight reference maps is relatively large, ranging from 2.190 to 4.651, with a mean uncertainty of 0.685. The spatial heterogeneity is similar to that in the Henan study area, with a mean standard deviation of 1.528. The scaling difference has a mean relative value of 0.553.
The field measurements, published for public use, are available at Zenodo, https://doi.org/10.5281/zenodo.5091251. The dataset contains readme files, compressed files of the fine-resolution LAI maps, and files of statistics for the reference maps. The intermediate NDVI files and reference LAI maps derived using the U2 upscaling methods are also provided65.
Technical Validation
Performance of the semiempirical models
The semiempirical NDVI-based models used to generate the fine-resolution reference LAI maps were validated using field measurements and the LOOCV method for the four study areas. This process is illustrated in Figs. 4–7. The results of a statistical comparison of the field-measured and generated LAI are also displayed in the figures.
In Figs. 4–7, the field-measured LAI values are compared with the LAI values derived by applying the semiempirical LAI model to Landsat TM/OLI SR data for the four study areas (Beijing, Henan, Heilongjiang, and Anhui). The results shown in Fig. 4 are characterized by slopes that are close to the 1:1 line, with RMSE values ranging from 0.25 to 0.72. As the results are displayed separately for each growth stage, the LAI values measured during the early growth stage have a wide distribution, with the result that the coefficient of determination for the regreening stage is low. Figure 5 displays the relationship between the field-measured LAI and the predicted LAI values for the Henan test area based on the formal semiempirical model: in this case, the RMSE ranges from 0.31 to 0.92, and the RRMSE is less than 23.16%. Figure 6 shows a comparison of the field-measured and predicted LAI values for Youyi Farm, Heilongjiang Province. On May 5th, 2005, and June 6th, 2006, field measurements of both wheat and barley were performed at this site; the samples collected on June 14th, 2007, were of barley only. Since barley and wheat are crops with similar vegetation structures, the two crop types are not separated in this comparison. The RMSE for these data has a range of 0.22 to 0.37, and the RRMSE has a range of 18.25% to 36.78%. The plots displayed in Fig. 7 show the relationship between the field-measured and predicted LAI values for Longkang Farm, Anhui Province. The slopes here are close to the 1:1 line, and the RMSE has a range of 0.67 to 0.95.
Validation of MODIS LAI
The 80 reference LAI maps with a size of 3 km × 3 km derived from the two upscaling methods (Figure S8 in Supplementary Information) and the corresponding field LAI measurements were employed to validate the MODIS LAI V6 product (MCD15A2H) for the four study areas. The validation results are illustrated in Fig. 8 and Table 6.
In Fig. 8(a), the fine-resolution reference LAI maps (30 m) derived from Eq. (4) were compared with the MODIS LAI in the range of 3 km × 3 km, which refers to the U1 upscaling method. To investigate how the scaling difference contributes to the discrepancies between the fine-resolution maps and the coarse-resolution products, the reference LAI maps at 500 m resolution were obtained based on the ‘average first and then invert’ (U2 upscaling) method with a size of 3 km × 3 km (as described in Figure S8). These reference LAI maps at 500 m resolution were compared with the MODIS LAI, as illustrated in Fig. 8(b). In addition, the field LAI measurements were directly compared with the corresponding MODSI LAI, as illustrated in Fig. 8(c).
The results illustrated in Fig. 8(a) indicate that the MODIS LAI values are underestimated in comparison to the fine-resolution reference LAI data in the range of 3 km × 3 km, especially in the case of the Henan study area. Table 6 shows that the accuracy of the MODIS LAI product varies among the study areas: the values are severely underestimated for crops in Beijing, Henan, and Anhui (relative bias = –27.0%, –48.9%, and –10.8%, respectively), whereas the values are overestimated for the crops with a black soil background in Heilongjiang Province (relative bias = 56.9%).
Due to the existence of surface heterogeneity, applying the model developed with 30 m data to 500 m data could result in some discrepancies. Since coarse-resolution LAI should be equal to aggregated fine-resolution LAI in the absence of scaling errors, validation using the reference LAI derived from the U2 method will result in artificially high accuracy60. However, by comparing the validation results from the U1 and U2 methods, the error due to the scale effect inherent to the coarse-resolution product can be at least partly quantified. In Fig. 8(b), the results gave an RMSE of 0.78 against the value of 0.91 that was obtained by applying the U1 (‘invert first and then average’) upscaling method to the reference LAI dataset in Fig. 8(a), which indicates that the scaling difference also contributes to the error in the coarse-resolution MODIS LAI product. When the scaling difference was taken into consideration and compensated for by applying the U2 upscaling method to the reference LAI dataset, the underestimates for the Beijing, Henan, and Anhui areas were reduced, giving relative biases of −24.0%, −43.0%, and 6.0%, respectively, compared with –26.9%, −48.9%, and −10.8% in Fig. 8(a), respectively. In terms of the accuracy of MODIS LAI in Heilongjiang, since the land cover in Heilongjiang is relatively uniform, the mean scaling difference among the four study areas is lowest, and the RMSE and relative bias thus slightly increased from 0.52 to 0.53 and 56.9% to 59.8%, respectively. A direct comparison with the field measurements (Fig. 8(c)) produced much higher uncertainties (RMSE = 1.99, RRMSE = 76.8%, relative bias = −49.3%) than were found by using the upscaled reference LAI dataset.
In this study, a highly accurate fine-resolution LAI dataset for Chinese croplands that could be used as a reference for coarse-resolution LAI products was derived from field measurements and fine-spatial-resolution satellite imagery (Landsat-5 TM and Landsat-8 OLI data). A semiempirical statistical model based on the Beer–Lambert law was used to derive fine-resolution LAI data that could be used for validation of the coarse-resolution LAI product at each growth stage. The parameters of each semiempirical model were estimated using the field LAI at each growth stage based on the curve-fitting algorithm and LOOCV approach. During this procedure, the performance of each semiempirical model was also investigated. Finally, eighty fine-resolution reference LAI maps with a size of 3 km × 3 km were generated for the study areas in four Chinese provinces. This fine-resolution reference LAI dataset was applied to assess the accuracy of MODIS LAI among these four study areas using the U1 upscaling method. The MODIS LAI was also compared to the reference LAI generated using the U2 upscaling method, through which the error due to the scale effect inherent to the coarse-resolution LAI product can be partly quantified. The direct comparison of the LAI data collected in the field and MODSI LAI showed considerable uncertainty. Therefore, this study contributes to the validation of remote sensing LAI products by providing a set of fine-resolution reference LAI datasets based on destructive sampling methods and highlights the importance of using a fine-resolution reference LAI dataset based on direct field measurements. Such a dataset can bridge the gap between field measurements and coarse-resolution pixel data.
Code availability
In the data repository65, the readme files explain the location of the files and folders. All raw measurements records can be found in one Excel sheet. All the field data and satellite images were processed and analysed in IDL and Python. The source codes are available at the Github. https://github.com/BowenSong123/Code.
References
Chen, J. & Black, T. A. Defining leaf area index for non-flat leaves. Plant Cell Environ 15, 421–429 (1992).
Garrigues, S. et al. Validation and intercomparison of global Leaf Area Index products derived from remote sensing data. J. Geophys. Res. Biogeosci 113 (2008).
Bonan, G. B. Land-Atmosphere interactions for climate system Models: coupling biophysical, biogeochemical, and ecosystem dynamical processes. Remote Sens. Environ 51, 57–73 (1995).
Chen, J., Rich, P., Gower, S., Norman, J. & Plummer, S. Leaf Area Index of Boreal Forests: Theory, Techniques, and Measurements. J. Geophys. Res. Atmos 102, 429–429,443 (1997).
Liang, S. & Wang, J. Advanced Remote Sensing: Terrestrial Information Extraction and Applications 2nd edn (Elsevier Academic Press, 2020).
Lafont, S. et al. Modelling LAI, surface water and carbon fluxes at high-resolution over France: comparison of ISBA-A-gs and ORCHIDEE. Biogeosciences 9, 439–456 (2012).
Fang, H., Baret, F., Plummer, S. & Schaepman-Strub, G. An overview of global leaf area index (LAI): Methods, products, validation, and applications. Rev. Geophys 57, 739–799 (2019).
Yan, K. et al. Evaluation of MODIS LAI/FPAR Product Collection 6. Part 2: Validation and Intercomparison. Remote Sens 8, 460 (2016).
Yin, G. et al. Derivation of temporally continuous LAI reference maps through combining the LAINet observation system with CACAO. Agric. For. Meteorol 233, 209–221 (2017).
Norman, J. M. & Campbell, G. S. in Plant Physiological Ecology: Field methods and instrumentation (eds. Pearcy, R., Mooney, H. & Rundel, P.) Ch. 14 (Springer NL Press, 1989).
Hu, R., Yan, G., Mu, X. & Luo, J. Indirect measurement of leaf area index on the basis of path length distribution. Remote Sens. Environ 155, 239–247 (2014).
Ryu, Y. et al. How to quantify tree leaf area index in an open savanna ecosystem: A multi-instrument and multi-model approach. Agric. For. Meteorol 150, 63–76 (2010).
Zou, J., Yan, G., Zhu, L. & Zhang, W. Woody-to-total area ratio determination with a multispectral canopy imager. Tree Physiol 29, 1069–1080 (2009).
Yan, G. et al. Review of indirect optical measurements of leaf area index: Recent advances, challenges, and perspectives. Agric. For. Meteorol 265, 390–411 (2019).
Lang, A. Estimation of leaf area index from transmission of direct sunlight in discontinuous canopies. Agric. For. Meteorol 37, 229–243 (1986).
Chen, J. & Cihlar, J. Quantifying the effect of canopy architecture on optical measurements of leaf area index using two gap size analysis methods. IEEE Trans. Geosci. Remote Sens 33, 777–787 (1995).
Leblanc, S. G. Correction to the plant canopy gap-size analysis theory used by the Tracing Radiation and Architecture of Canopies instrument. Appl. Opt 41, 7667 (2002).
Leblanc, S.G., Chen, J.M. and Kwong, M. Tracing Radiation and Architecture of Canopies - TRAC manual, version 2.1.3. http://faculty.geog.utoronto.ca/Chen/Chen’s%20homepage/PDFfiles/tracmanu.pdf (2002).
Bréda, N. J. J. Ground‐based measurements of leaf area index: a review of methods, instruments and current controversies. J. Exp. Bot 54, 2403–2417 (2003).
Levy, P. E. & Jarvis, P. G. Direct and indirect measurements of LAI in millet and fallow vegetation in HAPEX-Sahel. Agric. For. Meteorol 97, 199–212 (1999).
Mason, E. G., Diepstraten, M., Pinjuv, G. L. & Lasserre, J. P. Comparison of direct and indirect leaf area index measurements of Pinus radiata D. Don. Agric. For. Meteorol 166–167, 113–119 (2012).
Stroppiana, D., Boschetti, M., Confalonieri, R., Bocchi, S. & Brivio, P. A. Evaluation of LAI-2000 for leaf area index monitoring in paddy rice. Field Crops Res 99, 167–170 (2006).
Facchi, A., Baroni, G., Boschetti, M. & Gandolfi, C. Comparing optical and direct methods for leaf area index determination in a maize crop. J. Agric. Eng 41, 33 (2010).
Demarez, V., Duthoit, S., Baret, F., Weiss, M. & Dedieu, G. Estimation of leaf area and clumping indexes of crops with hemispherical photographs. Agric. For. Meteorol 148, 644–655 (2008).
Garrigues, S. et al. Intercomparison and sensitivity analysis of Leaf Area Index retrievals from LAI-2000, AccuPAR, and digital hemispherical photography over croplands. Agric. For. Meteorol 148, 1193–1209 (2008).
Liu, L. Simulation and correction of spatial scaling effects for leaf area index (in Chinese). J. Remote Sens 18, 1158–1168 (2014).
Cohen, W. B. & Justice, C. O. Validating MODIS terrestrial ecology products: Linking in situ and satellite measurements. Remote Sens. Environ 70, 1–3 (1999).
Cline, D. et al. Overview of the NASA cold land processes field experiment (CLPX-2002). Proc.SPIE 4894, 361–372 (2003).
Fernandes, R. et al. Global Leaf Area Index Product Validation Good Practices Version 2.0 (eds. Schaepman-Strub, G., Román, M. & Nickeson, J.) Land Product Validation Subgroup (WGCV/CEOS) https://lpvs.gsfc.nasa.gov/PDF/CEOS_LAI_PROTOCOL_Aug2014_v2.0.1.pdf (2014).
Weiss, M., Baret, F., Garrigues, S. & Lacaze, R. LAI and fAPAR CYCLOPES global products derived from VEGETATION. Part 2: validation and comparison with MODIS collection 4 products. Remote Sens. Environ 110, 317–331 (2007).
Morisette, J. T. et al. Validation of global moderate-resolution LAI products: a framework proposed within the CEOS land product validation subgroup. IEEE Trans. Geosci. Remote Sens 44, 1804–1817 (2006).
Weiss, M. et al. On Line Validation Exercise (OLIVE): A Web Based Service for the Validation of Medium Resolution Land Products. Application to FAPAR Products. Remote Sens 6 (2014).
Baret, F. et al. Evaluation of the representativeness of networks of sites for the global validation and intercomparison of land biophysical products: proposition of the CEOS-BELMANIP. IEEE Trans. Geosci. Remote Sens 44, 1794–1803 (2006).
Rossello, P., Baret, F. VALidation of Land European Remote Sensing Instruments. VALERI Project http://w3.avignon.inra.fr/valeri/Meeting_Reports/Davos_2007/Rossello_Davos_VALERI.pdf (2007).
Camacho, F., Cernicharo, J., Lacaze, R., Baret, F. & Weiss, M. GEOV1: LAI, FAPAR essential climate variables and FCOVER global time series capitalizing over existing products. Part 2: Validation and intercomparison with reference products. Remote Sens. Environ 137, 310–329 (2013).
Liu, S. et al. The Heihe Integrated Observatory Network: A Basin-Scale Land Surface Processes Observatory in China. Vadose Zone J 17, 180072 (2018).
Li, X. et al. Heihe Watershed Allied Telemetry Experimental Research (HiWATER): Scientific Objectives and Experimental Design. Bull. Am. Meteorol. Soc 94, 1145–1160 (2013).
Fang, H., Li, W., Wei, S. & Jiang, C. Seasonal variation of leaf area index (LAI) over paddy rice fields in NE China: Intercomparison of destructive sampling, LAI-2200, digital hemispherical photography (DHP), and AccuPAR methods. Agric. For. Meteorol 198–199, 126–141 (2014).
Fang, H. et al. Validation of global moderate resolution leaf area index (LAI) products over croplands in northeastern China. Remote Sens. Environ 233, 111377 (2019).
Wu, X., Xiao, Q., Wen, J., You, D. & Hueni, A. Advances in quantitative remote sensing product validation: Overview and current status. Earth Sci. Rev 196, 102875 (2019).
Majasalmi, T., Rautiainen, M., Stenberg, P. & Rita, H. Optimizing the sampling scheme for LAI-2000 measurements in a boreal forest. Agric. For. Meteorol 154–155, 38–43 (2012).
Campos-Taberner, M. et al. Multitemporal and multiresolution leaf area index retrieval for operational local rice crop monitoring. Remote Sens. Environ 187, 102–118 (2016).
Brown, L. A. et al. Evaluation of global leaf area index and fraction of absorbed photosynthetically active radiation products over North America using Copernicus Ground Based Observations for Validation data. Remote Sens. Environ 247, 111935 (2020).
Claverie, M. et al. Validation of coarse spatial resolution LAI and FAPAR time series over cropland in southwest France. Remote Sens. Environ 139, 216–230 (2013).
Campos-Taberner, M. et al. A Critical Comparison of Remote Sensing Leaf Area Index Estimates over Rice-Cultivated Areas: From Sentinel-2 and Landsat-7/8 to MODIS, GEOV1 and EUMETSAT Polar System. Remote Sens 10, 763 (2018).
Xu, B. et al. Analysis of global LAI/FPAR products from VIIRS and MODIS sensors for spatio-temporal consistency and uncertainty from 2012–2016. Forests 9, 73 (2018).
Song, X. et al. The delineation of agricultural management zones with high resolution remotely sensed data. Precis. Agric 10, 471–487 (2009).
Masek, J. G. et al. A Landsat surface reflectance dataset for North America, 1990-2000. IEEE Geosci. Remote Sens. Lett 3, 68–72 (2006).
Vermote, E., Justice, C., Claverie, M. & Franch, B. Preliminary analysis of the performance of the Landsat 8/OLI land surface reflectance product. Remote Sens. Environ 185, 46–56 (2016).
Wang, J., Zhao, C. & Huang, W. The basis and application of agricultural quantitative remote sensing (Science Press, 2008).
Schaaf, C. B. et al. First operational BRDF, albedo nadir reflectance products from MODIS. Remote Sens. Environ 83, 135–148 (2002).
Hu, Y., Liu, L., Liu, L. & Jiao, Q. Comparison of absolute and relative radiometric normalization use Landsat time series images. Proc.SPIE 8006, 80016 (2011).
Myneni, R., Knyazikhin, Y., Park, T. MCD15A2H MODIS/Terra+Aqua Leaf Area Index/FPAR 8-Day L4 Global 500m SIN Grid Version 006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MCD15A2H.006 (2015)
Knyazikhin, Y. et al. Estimation of vegetation canopy leaf area index and fraction of absorbed photosynthetically active radiation from atmosphere-corrected MISR data. J. Geophys. Res. Atmos 103, 32239–32256 (1998).
Myneni, R. B., Ramakrishna, R., Nemani, R. & Running, S. W. Estimation of global leaf area index and absorbed par using radiative transfer models. IEEE Trans. Geosci. Remote Sens 35, 1380–1393 (1997).
Baret, F. & Guyot, G. Potentials and limits of vegetation indices for LAI and APAR assessment. Remote Sens. Environ 35, 161–173 (1991).
I Griva, Stephen G. N & Sofer, A. in Linear and Nonliner Optimization Ch.13 (SIAM Press, 2008).
Jun, C., Ban, Y. & Li, S. Open access to Earth land-cover map. Nature 514, 434–434 (2014).
Shao, J. Linear Model Selection by Cross-validation. J. Am. Stat. Assoc 88, 486–494 (1993).
Tian, Y. et al. Radiative transfer based scaling of LAI retrievals from reflectance data of different resolutions. Remote Sens. Environ 84, 143–159 (2003).
Garrigues, S., Allard, D., Baret, F. & Weiss, M. Influence of landscape spatial heterogeneity on the non-linear estimation of leaf area index from moderate spatial resolution remote sensing data. Remote Sens. Environ 105, 286–298 (2006).
Zhu, X., Feng, X., Zhao, Y. & Song, X. Scale effect and error analysis of crop LAI inversion. J. Remote Sens 14, 579–592 (2010).
Sun, C., Liu, L., Guan, L., Jiao, Q. & Peng, D. Validation and error analysis of the MODIS LAI product in Xilinhot grassland. J. Remote Sens 18, 518–536 (2014).
Chen, J. et al. Derivation and validation of Canada-wide coarse-resolution leaf area index maps using high-resolution satellite imagery and ground measurements. Remote Sens. Environ 80, 165–184 (2002).
Liu, L. & Song, B. ValLAI_Crop: Validation dataset for coarse-resolution satellite LAI product over Chinese Cropland. Zenodo https://doi.org/10.5281/zenodo.5091251 (2021).
Acknowledgements
This research was funded by the National Key Research and Development Program of China, grant number 2017YFA0603001, and the National Natural Science Foundation of China, grant number 41825002. The authors also thank Editorial Board Member Prof. Jingjing Liang and reviewers for advices about the manuscript.
Author information
Authors and Affiliations
Contributions
B.S. and L.L. conceived and designed the research. B.S. conducted the data analysis and prepared the original manuscript. L.L. reviewed and edited the manuscript. S.D., X.Z., X.C. and H.Z. also contributed to the data analysis. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Online-only Table
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
About this article
Cite this article
Song, B., Liu, L., Du, S. et al. ValLAI_Crop, a validation dataset for coarse-resolution satellite LAI products over Chinese cropland. Sci Data 8, 243 (2021). https://doi.org/10.1038/s41597-021-01024-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-021-01024-4