Table 2 LAGOS-US LANDSAT workflow decisions and fit statistics for the Full-data model for each water quality predictive model.
From: LAGOS-US LANDSAT: Remotely sensed water quality estimates for U.S. lakes over 4 ha from 1984 to 2020
Variable | Workflow components tested and selected | Final model fit statistics | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Image and pixel processing by lake | Modeling steps | Matchup training data | |||||||||||
Pixel retrievals | Band ratio calculation | Scene quality: Negative reflectances | Scene quality: Scene clouds > 50% | In situ data transformed | # of bands and ratios | Training model type | Matchup window (±days) | Final matchups (N) | Variance explained (%) | RMSE | MAPE (%) | MAPE (%) Improvement of log10 model | |
CHL | Whole-lake | Pixel-specific | Removed | Removed | Log10 | All bands & ratios | Random forest | 1 | 43,755 | 44.1 | 0.43 | 23.3 | (+9.8) |
Secchi | Whole-lake | Pixel-specific | Removed | Removed | none | All bands & ratios | Random forest | 7 | 586,368 | 63.7 | 1.23 | 22.8 | — |
True color | Whole-lake | Pixel-specific | Removed | Removed | Log10 | All bands & ratios | Random forest | 7 | 33,194 | 48.6 | 0.26 | 7.5 | (+12.7) |
DOC | Whole-lake | Pixel-specific | Retained | Retained | Log10 | All bands & ratios | Random forest | 7 | 7,466 | 42.0 | 0.24 | 10.2 | (+6.1) |
TSS | Whole-lake | Pixel-specific | Removed | Removed | Log10 | All bands & ratios | Random forest | 1 | 10,845 | 20.7 | 0.62 | 27.0 | (+16.3) |
Turbidity | Whole-lake | Pixel-specific | Removed | Removed | Log10 | All bands & ratios | Random forest | 1 | 58,999 | 52.1 | 0.37 | 8.8 | (+20.9) |