Extended Data Fig. 9: Nearest-neighbor-distance-matching leave-one-out cross-validation (NNDM LOO CV) examples for rainfed wheat.

The top panel shows the distribution of site-specific water-limited yield potential (Yw) of rainfed wheat from the Global Yield Gap Atlas (GYGA) and the prediction grid (land harvested with rainfed wheat as reported by SPAM17). The lower left and middle panels show, for two iterations of the NNDM LOO CV10, the GYGA Yw sites used for model testing and training and the sites excluded because of their proximity to the testing site. Neighbors are excluded so that the cumulative frequency of distances between testing sites and their nearest training site in the NNDM LOO CV procedure matches the cumulative frequency of distances between the prediction grid cells and their nearest GYGA Yw site, as shown in the lower right panel.
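
The distance-matching idea described above can be illustrated with a minimal sketch. It is a deliberate simplification and not the procedure used for this figure: it assumes planar coordinates with Euclidean distances, and it applies one shared exclusion radius to all testing sites rather than choosing excluded neighbors site by site. All function names (`nearest_distance`, `loo_nearest_distance`, `ecdf`, `match_radius`) and the synthetic data are hypothetical.

```python
import numpy as np


def nearest_distance(a, b):
    """Distance from each row of `a` to its nearest row of `b` (planar Euclidean)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    return d.min(axis=1)


def loo_nearest_distance(sites, radius):
    """For each testing site, distance to the nearest remaining training site
    after excluding every other site within `radius` of it."""
    d = np.linalg.norm(sites[:, None, :] - sites[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)   # a testing site never trains on itself
    d[d <= radius] = np.inf       # drop neighbors inside the exclusion buffer
    return d.min(axis=1)          # inf if no training site remains


def ecdf(values, grid):
    """Empirical cumulative frequency of `values` evaluated at each point of `grid`."""
    return np.searchsorted(np.sort(values), grid, side="right") / len(values)


def match_radius(sites, grid_cells, candidate_radii):
    """Pick the candidate radius whose leave-one-out nearest-neighbor distance
    curve is closest to the prediction-grid-to-site distance curve."""
    target = nearest_distance(grid_cells, sites)
    eval_grid = np.linspace(0.0, target.max(), 200)
    target_cdf = ecdf(target, eval_grid)
    gaps = [
        np.abs(ecdf(loo_nearest_distance(sites, r), eval_grid) - target_cdf).mean()
        for r in candidate_radii
    ]
    best = int(np.argmin(gaps))
    return candidate_radii[best], gaps[best]


# Synthetic example: clustered "sites" and a much wider "prediction grid".
rng = np.random.default_rng(0)
demo_sites = rng.normal(0.0, 1.0, size=(40, 2))
demo_cells = rng.uniform(-10.0, 10.0, size=(300, 2))
demo_radii = np.linspace(0.0, 5.0, 26)
demo_radius, demo_gap = match_radius(demo_sites, demo_cells, demo_radii)
print(demo_radius, demo_gap)
```

With clustered sites and a wider prediction grid, a radius of zero (ordinary leave-one-out) leaves testing sites much closer to their training data than the grid cells are to the sites; enlarging the buffer moves the two cumulative curves toward each other. For real coordinates, geodesic rather than planar distances would be needed.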