Introduction

Environmental variability due to changing climate poses one of the greatest threats to agricultural productivity1. Increasingly, researchers aim to predict the effects of changing climate on agriculture, projecting constraints on crop production and anticipated decreases in yield2,3. For regions and crop species identified as vulnerable under future climates, strategies to increase agricultural resilience may include adapting management practices and substituting varieties or crop species4.

A catastrophic incident is defined by the National Response Framework as, “any natural or manmade incident, including terrorism, that results in extraordinary levels of mass casualties, damage, or disruption severely affecting the population, infrastructure, environment, economy, national morale, and/or government functions”. Aerosol-producing global catastrophic events, such as nuclear war, or an asteroid strike, are expected to produce significant climate change5 through deflecting solar radiation, preventing sunlight from reaching the Earth’s surface and causing global cooling. Since the spread of nuclear weapons during the twentieth century, there has been significant focus on assessing the consequences of a nuclear conflict on both society and the environment6. Published climate models have been used to consider the impacts of nuclear wars on the growth of major grain crops7,8,9 and summarize the degree to which the rapid environmental change induced by a black carbon aerosol (soot) producing catastrophe would impact global crop production. The soot produced during a nuclear war or an asteroid strike is created by massive fires and in the case of nuclear war, the black carbon carbon emission is from city burn. To date, the impact of such a soot-producing catastrophe on agricultural systems has not accounted for intraspecific diversity present in crop species, including landraces, and how this diversity may aid in increasing agricultural resilience. Cereal crops account for the most calories consumed by humans10 and maintaining their production post-global catastrophe is of the utmost importance.

Crop landraces (local traditional varieties) contain most of the genetic diversity within many crops, much of which is not represented in modern breeding varieties11 and are still widely cultivated in the developing world. The continual cultivation and selection of crops by farmers gives rise to these local varieties that often carry locally adapted alleles and phenotypes12. Historically, landraces have contributed to plant breeding through the identification of traits and alleles for adaptation to stressful environments (e.g. water stress, salinity, and high temperatures)13. Many thousands of landrace varieties are now stored in germplasm banks and represent untapped adaptive diversity that may increase agricultural resilience under changing environments14.

The genetic basis of adaptation to local environments can be characterized through geographic associations between genotype and environment, known as genotype-environment associations15. Genotype-environment associations have been used to study the adaptive potential of species16, estimate optimal range shifts17, and identify genes that may be advantageous for organisms under future climates18. Genotype-environment associations may also give insights into which specific environmental pressures drive local adaptation19,20,21. For landraces, a large portion of genomic variation can be explained by environments of origin22,23,24, making them good systems for considering the environmental gradients driving local adaptation25 and the geographic distribution of locally adapted alleles26,27.

An emerging approach for predicting adaptation to novel environments is first fitting genotype-environment models that describe how current allele frequencies change across environments under an assumption of local adaptation. Next, the fitted model is applied to a novel environment to determine the change in genomic composition required for adaptation to that environment, known as genomic offset (reviewed in ref. 28). The genotype-environment models can further be extended to identify optimal genotypes or varieties for specific environments24,29 and guide movement of genotypes to minimize maladaptation to the novel climates. Such modeling methods capture long-term signals of adaptation and may provide insights into genotypes that are the most vulnerable or are sources of resilience to climatic variability30.

We studied the climate impacts of a soot-producing catastrophe on broadly distributed globally important cereal crops for which landrace cultivation is important for smallholder farmers: Sorghum bicolor (L.) Moench (sorghum), Zea mays L. (maize), Oryza sativa L. subsp. indica and japonica (rice), and Hordeum vulgare L. (barley). These crops represent four of the top five cereal species in global production31. For each crop species included in this study, we independently implemented crop growth models to identify environmental stressors and genomic models to estimate the degree of disruption to current landrace adaptation under several post-catastrophic scenarios differing in the amount of soot injected into the climate model. We validated our genomic models through comparing predicted local adaptation and published maize landrace performance data collected in common gardens across diverse climates in Mexico. We further extended our genomic models to identify landrace varieties best matched to specific post-catastrophic conditions, supporting the management strategy of substituting vulnerable landrace genotypes for more resilient ones.

Our study aims to evaluate the environmental forces that have historically shaped genomic variation in landraces and to assess how landrace adaptation may be disrupted by novel catastrophic events. There is little research investigating the impacts of changing climate on diverse genotypes of multiple species. Thus, the literature may be oversimplifying climate change effects on agricultural and ecological systems. Utilizing a multi-species genomics approach allows us to confront this challenge, acknowledging the distinct impacts on various species that are vital for food production. We hypothesized that the magnitude of maladaptation would be largely determined by the magnitude of environmental change, that substitutions of genotypes from different locations could partially ameliorate effects of climate change, and that countries with greater climatic diversity would have better adapted genotype substitutions within their borders compared to less climatically diverse countries. Finally, the approach developed in this study may be extended to and prove valuable for understanding impacts of greenhouse gas induced climate change. Here, we show that soot-induced disruptions to climate would drive global landrace maladaptation and suggest current adaptive diversity is insufficient for agricultural resilience under the most severe post-catastrophic climate disruptions.

Results

Climate scenarios

We studied disruptions to current landrace adaptation for six nuclear war scenarios that simulate the impact of varying amounts of stratospheric soot on global climate (Fig. 1b) using previously published climate simulation data6,32. The published weather files describe the climate impacts for five India-Pakistan nuclear war scenarios (soot injections of 5 Tg, 16 Tg, 27.3 Tg, 37 Tg, and 46.8 Tg), one United States-Russia scenario with a soot injection of 150 Tg, and one control simulation ensemble member to describe normal fluctuations in climate.

Fig. 1: Landrace genotype point of origin and study workflow.
figure 1

a Global distribution of genotyped landrace accessions (All accessions, n = 6384; Barley, n = 215; Maize, n = 3404; Rice subsp. indica, n = 677; Rice subsp. japonica n = 309; Sorghum, n = 1779). b Modeling and statistical pipeline used in this study.

Genotyped landrace accessions

To assess maladaptation in cereal crop landraces following a soot-producing catastrophe, we identified species for which landrace relatives are currently grown in the developing world that also had publicly available, high quality sequencing data of geographically diverse accessions. From these criteria, we selected four crop species: barley (n = 215), maize (n = 3404), rice (n = 677 of the subsp. indica; n = 309 of the subsp. japonica), and sorghum (n = 1,779). The distribution of accessions covered most of the agricultural areas in the developing world (Fig. 1a) across diverse climate regimes.

Using crop growth models to estimate integrated local climatic stressors under control and post-war conditions

Traditional implementations of genotype-environment associations typically use off-the-shelf climate parameters without connection to organismal biology and without consideration of phenology. However, actual climate-driven stress likely emerges from a combination of conditions (e.g. precipitation and temperature) and depends on organismal phenology and development. To address these issues, we used the Cycles agroecosystem model2,33 to simulate growth and stress parameters for our full set of genotyped, georeferenced landrace accessions (n = 6384) under control and six nuclear war conditions that differed in the quantity of stratospheric soot simulated (5 Tg, 16.1 Tg, 27.3 Tg, 36.6 Tg, and 46.8 Tg, 150 Tg). Cycles simulations selected a planting date in a designated planting window when the weather and soil conditions are suitable for the specific crop, and simulated crop growth until the time of harvest or termination, using parameters specific to each of our four species. Cycles simulations that accounted for species-specific growth parameters were run independently for each crop species and climate scenario (control and six soot scenarios).

We used outputs from Cycles simulations to infer emergent environmental, growth, and stress values experienced during key phenological stages of crop, constrained to the growing period for each simulated accession under the different climate scenarios (Supplementary Table 1). Thus, the selected model outputs characterized differences in environment and potential stress experienced by a given landrace accession under control and post-war climates, while accounting for crop-specific growth parameters. For each climate scenario and accession, we extracted 13 Cycles-derived variables representative of average temperature, coldest temperature, water stress, and solar radiation experienced by simulated landrace accessions across the vegetative, reproductive, and total growth and days to reach maturity (hereafter, Cycles-derived environmental variables, Fig. 2, Supplementary Table 1). While in reality landrace accessions likely exhibit variation in response to environmental variability, modeling this genetic variation was not our goal at this stage. Rather, our goal was to use Cycles to estimate integrative environmental stressors through space and time for later use in modeling genotype-environment associations.

Fig. 2: Cycles estimated local climatic stressors.
figure 2

Cycles-derived environmental variables for (a) barley (b) maize (c) rice subsp. indica (d) rice subsp. japonica and (e) sorghum. Control lines are plotted as the average value across all accessions that were projected to reach maturity. The shading around averaged control represents the standard deviation of yearly averages, indicating yearly fluctuations in environmental variables. For all soot scenarios, lines are plotted as yearly averaged values across accessions that were projected to reach maturity. The vertical dotted line indicates the time of soot injection into the climate models.

As described by other groups, in all war scenarios regardless of detonation location, produced soot spreads globally and causes disruptions to solar radiation reaching the earth’s surface, resulting in global cooling6,32. Stratospheric soot from each post-war scenario dissipated over the course of a decade and the climate anomalies caused by atmospheric soot decreased proportionally, with respect to severity of the scenario. Across all scenarios, surface shortwave radiation reached its all-time low 2 years post-war, corresponding to the point at which Cycles modeled crops were simulated with the lowest average solar radiation (Fig. 2). Consequently, global surface temperature immediately and rapidly declined after the catastrophe and on average reached its lowest point in the 3rd year post-war, with more extreme cooling in the Northern Hemisphere6,34. Our crop models summarized this cooling trend. Daily average temperature for landraces modeled by Cycles reached its lowest point two to 3 years post catastrophe. Barley, the crop with a primary distribution in the Northern Hemisphere, experienced the coolest post-war temperatures (Figs. 1a, 2). In the coolest year of the 150 Tg Russia-US scenario, daily average temperature of the growing season across all simulated accessions decreased by 11.3 °C for maize and sorghum, 13.1 °C for rice subsp. japonica, 14.3 °C for rice subsp. indica, and 12.1 °C for barley as compared to the averaged control daily temperature across years, indicating the severity of this scenario.

For maize, rice, and sorghum, whose landraces modeled in this study were mostly tropical, declines in temperature across the simulated growing season led to an increase in the number of days required for a plant to reach maturity. The strength of this relationship increased with the more severe soot scenarios (Supplementary Fig. 1). Failure to accumulate enough thermal time during the growing season was recorded as the crop not reaching maturity. As the Cycles set up did not account for genetic and adaptive variation among landraces, an individual not projected to reach maturity can be interpreted as environmental conditions that are relatively inhibitory for growth. The simulated number of days to maturity generally corresponded to the severity of the climate anomaly of the post-war soot scenario. The most extreme environmental effects of the 150 Tg scenarios at least doubled the number of days to reach maturity for all tropical crops (Fig. 2). In the second post-war year of this scenario, 90% of barley, 62% of rice subsp. indica, 51% of rice subsp. japonica, 54% of maize, and 33% of sorghum accessions were projected to not reach maturity.

Identification of environmentally adaptive genetic loci

For each crop species, we acquired published genotype data of landrace accessions used in Cycles simulations above for use in modeling and predicting disruptions to current genotype-environment relationships. The final set included 6384 accessions with genotype data represented by various sequencing and genotyping methods: 215 barley accessions with exome sequencing (1,688,807 single nucleotide polymorphisms, (SNPs))35, 3404 maize accessions with genotyping-by-sequencing (GBS) (946,072 SNPs)36, 986 rice accessions with whole genome resequencing (WGS) (677 subsp. indica, 309 subsp. japonica; 9.78 million SNPs)22, and 1,779 sorghum accessions with GBS (459,304 SNPs)37. Though differences in genotyping methods and the distribution of genotyped accessions may influence our ability to model adaptation, we sought to identify datasets that most represented the diversity of genotypes and environments that landraces of our focal species originate from and are likely adapted to.

We built gradient forest (GF) models that were used to represent current genotype-environment relationships and for predicting maladaptation in crop landraces following post-war soot induced change in climate. For use in GF models, we first identified a subset of genomic loci that we hypothesized were more likely to underlie local adaptation. Specifically, we identified genetic loci that were associated with landrace climate of origin and flowering time quantitative trait loci (QTL) for use in GF models. Following methods described in ref. 38 and for each crop species, we used partial redundancy analysis (pRDA) to identify the top 1000 genetic loci associated with variation in 13 Cycles-derived environmental variables under the control scenario while also accounting for population structure (methods; Supplementary Fig. 2). To ensure potentially critical phenology QTL were accounted for in our models, we further identified single nucleotide polymorphisms (SNPs) of loci found within and in cis-regulatory regions (+/- 5 kilobase (kb) pairs) of known flowering time network genes (Supplementary Data 2). We identified loci known to be involved in flowering time for each crop species by literature review, obtained gene coordinates for each flowering time gene, extracted all SNPs that overlapped within and in cis-regulatory regions of the genomic region, and filtered each species’ set of flowering time loci to account for patterns of linkage disequilibrium. The number of flowering time SNPs included in our GF models for each of our focal species included 636 for barley, 608 for maize, 314 for rice subsp. indica, 323 for rice subsp. japonica, and 116 for sorghum (differences in number are a product of marker density). In total, the final genetic dataset used to build each species’ GF model included the top 1,000 SNPs associated with variation in the control Cycles-derived environmental variables identified by pRDA and the SNPs found in and near flowering time network genes.

Control scenario GF models describe existing genome-environment associations

We built GF models representative of current genotype-environment associations using the loci described above and Cycles-derived environmental variables of the control simulation, averaged across all years of Cycles-simulated growth, for each crop species. GF is a nonparametric multivariate approach that fits an ensemble of regression trees using Random Forest39 and models changes in allele frequency along environmental gradients40. GF’s functions provide a means to rescale environmental predictors from their normal units (e.g., °C, mm) into a unit of cumulative importance for describing variation in a genetic dataset. For all GF models, the emergent environmental parameter of simulated days to maturity was in the top five most important predictors for describing variation in the genetic dataset of loci we hypothesized to contribute to environmental adaptation (Supplementary Fig. 3). Across all crop species, no single environmental variable was substantially more related to turnover in allele frequencies of tested loci, indicating that GF models captured genome-wide relationships to multiple environmental gradient signals rather than a high impact at a single locus30. The differing importance of environmental variables specific to a growth stage of plants (variable constrained to the vegetative or reproductive stage of growth) indicated that stress experienced by plants changes across the different phenological stages of growth and genetic variation can be associated with life-stage specific stress.

GF models capture adaptation in landraces

To test if GF models (constructed using Cycles-derived environmental outputs from the control scenario and the set of loci we hypothesized to be important in local adaptation) captured current environmental adaptation in landraces, we compared published performance data of 11,762 maize landraces grown across 13 common gardens in Mexico27,36 to predicted GF offset of genotype-environment relationships of landraces grown in common gardens. The common gardens maize landraces were grown and phenotyped in spanned the geographic and environmental range of maize cultivation (Supplementary Fig. 4a). We predicted how ‘adapted’ maize landraces accessions were to the common gardens they were grown in through calculation of GF offset. We calculated GF offset for each maize landrace accession grown in each common garden as the Euclidean distance between the accession’s control GF modeled genotype-environment association (representing current genotype-environment relationships) and the expected genomic composition at the common garden (representing the optimal genotype-environment relationship for a common garden). As offsets are calculated from current genotype-environment relationships in the GF model, they are weighted by the contribution of different loci that are involved in current landrace adaptation and indicate what amount of genetic change would be required for adaptation to a common garden. Accessions with a low GF offset are expected to be better adapted to the conditions at the common garden, as they require less genetic change to be adapted to the environment of a common garden. We found that, indeed, accessions performed best (height and yield measures) when grown in sites where they had low GF offset (Fig. 3a; Supplementary Fig. 4b). Furthermore, anthesis silking interval (ASI, synchronicity of male and female flower maturity) was reduced when accessions were grown at sites in which they had lower GF offset. ASI is a reliable predictor of stress in maize41, indicating maize landraces were less stressed when grown in common gardens to which they were predicted to be relatively more adapted to.

Fig. 3: GF models captured a signal of landrace adaptation and predicted maladaptation under post-catastrophic climates.
figure 3

GF models capture current genotype-environment associations in maize landraces and were used to predict maladaptation (GF offset) under post-catastrophic climate scenarios. a Phenotypic residuals (remaining variation after accounting for experimental design) plotted against the logged GF offset of maize landrace accessions grown in common gardens. GF offset is calculated for each phenotyped accession as the Euclidean distance of the expected genotype-environment relationship at a common garden common vs the genotype-environment relationship from the accessions’ point of origin. For both phenotypes (anthesis silking interval, n = 7833; bare cob weight, n = 3175), the reported statistic is a two-sided test from a linear regression model between the logged GF offset to common gardens and the residuals for each phenotype. No adjustments were made for multiple comparisons. Points with a more negative logged GF offset indicate maize landrace accessions that are expected to be adapted to the common garden they were grown in. Yearly logged GF offset for (b) maize (c) barley (d) rice subsp. indica (e) rice subsp. japonica and (f) sorghum. The control line represents the mean logged GF offset across all years, with the shaded region representing the standard deviation of yearly means to indicate fluctuations in maladaptation (GF offset) due to normal variability in climate. For each soot scenario, lines are the averaged logged GF offset across all accessions of a species and for each year. The vertical dotted line indicates the time of soot injection into the climate models. Inlaid scatter plots are the averaged logged GF offset across all accessions of a species two years after the incident.

The degree of GF offset post-catastrophe follows the magnitude of climate disruption

After confirming the fitted control GF models captured adaptation in landraces, we then used GF models to predict the expected locally-adapted genomic composition for landraces across space and time under the six post-war scenarios. To predict the magnitude of maladaptation, we calculated GF offset as the Euclidean distance between a given landrace source location’s expected genomic composition between control (representing current genotype-environment relationships) and the six soot scenarios (representing the optimal genotype-environment relationship for a soot scenario) separately. High GF offset values corresponded to a greater degree of maladaptation and represented a greater shift in allelic composition required for adaptation to persist in the climate produced by the soot scenario. For all crops and scenarios, GF offset values followed the trend in post-war climate disruptions, with a sharp increase and gradual recovery after 10–15 years (Fig. 3b–f; Supplementary Fig. 5). GF offset for all crops reached its highest point two to 3 years post-catastrophe, indicating that crops were expected to have the highest degree of maladaptation when global solar radiation and temperatures reached their all-time low. Maximum GF offset of each target scenario linearly corresponded to the amount of soot simulated for the 5 Tg to 46.8 Tg soot scenarios. In the most extreme 150 Tg scenario, the trend was more pronounced and deviated from the linear pattern (Fig. 3b–f). Across all crop species, we detected a strong latitudinal pattern associated with GF offset values, equatorial regions which experienced less adverse climate impacts were predicted to be less maladapted to post-war conditions (Fig. 4).

Fig. 4: Global distribution of GF offset under post-catastrophic climates.
figure 4

Global distribution of logged GF offset (predicted maladaptation) under the 150 Tg scenario 2, 9, and 16 years after the incident for (a) barley (b) maize (c) rice subsp. indica (d) rice subsp. japonica and (e) sorghum landraces (open colored circles indicating offset). Higher GF offset values correspond to a larger degree of predicted maladaptation under the post-war scenario.

Identification of landrace substitutions for post-catastrophe adaptatio

We next leveraged our GF models to identify landrace genotypes best matched to specific post-catastrophic conditions, indicating potential varietal substitutions for locations with landraces that were the most maladapted to post-catastrophic climates. Under post-catastrophic conditions, many locations will not have climate suitable for the cultivation of crops and we constrained our analyses to only look for substitutions for locations that were projected to have a crop reach maturity in the worst year (year 2 post-strike) of the 150 Tg scenario. After filtering for locations that were not expected to be suitable for agriculture, 10% of barley, 38% of rice subsp. indica, 49% of rice subsp. japonica, 46% of maize, and 67% of sorghum landraces source locations were retained to search for a suitable substitution. For the remaining locations, we identified the most vulnerable locations as those with the highest GF offset (predicted maladaptation). We then searched for the most optimal substitution globally as well as the best within country substitution, identifying the landrace accession with the lowest GF offset to the post-catastrophic climate in the vulnerable location (Fig. 5). Though the identification of landraces with lower levels of maladaptation to post-catastrophic conditions may be valuable for finding the genotypes most resilient to post-catastrophic climates, it is important to note that our calculation of maladaptation is a relative metric and to approach these findings with caution. The post-catastrophic climate of the 150 Tg scenario may be sufficiently extreme to dramatically reduce the absolute production of accessions that is identified as a suitable substitution and predicted to have a low GF offset to the novel climate conditions.

Fig. 5: Substitution trajectories to minimize maladaptation under post-catastrophic climates.
figure 5

Substitution trajectories for the most vulnerable landrace populations in year 2 of the 150 Tg scenario. For each crop, substitution trajectories are provided for the most optimal substitution across all available germplasm for (a) barley (c) maize (e) rice subsp. indica (g) rice subsp. japonica (i) sorghum and the best within-country substitution for (b) barley (d) maize (f) rice subsp. indica (h) rice subsp. japonica (j) sorghum. For each substitution, arrows connect the source location of a landrace accession that is the most optimal to the vulnerable location (arrowhead) and are colored by the remaining GF offset (maladaptation) of the substitution. Substitutions are colored by how well-matched the moved landrace is to the vulnerable location, where colors corresponding to lower GF offset of substitution indicate a substitution that has a low degree of maladaptation to the novel environment. Inlaid histograms represent the frequency of substitutions of different distances and are colored by the remaining GF offset (maladaptation) of the substitution.

Across all crops, the most optimal substitution was often far away (~1000–10,000 km) and across country borders. For many locations, the best substitution still had a high degree of GF offset, indicating that there was not a genotype that was expected to be adapted to the post-catastrophic climate at the vulnerable location included in our dataset (Fig. 5a, c, e, g, i). This could be due to the severity of the novel climate at the vulnerable location, the absence of a landrace accession that was expected to be adapted to the novel environment, or some combination of both. For all crops, the best substitution trajectories typically moved landrace accessions from poles and high elevations towards the equator and low elevations, indicating that landrace germplasm adapted to currently cooler climates may be sources of resilience for locations that may be more likely to support agriculture post-catastrophe. For all crop species, there were instances where one genotype was the most optimal substitution for multiple vulnerable locations, suggesting genotypes that may be particularly valuable for post-catastrophic agriculture.

In the case of a catastrophe, substitutions across long distances may not be possible due to socioeconomic disruptions, e.g. in transport and trade. We further searched for the optimal within-country substitution. For all crops, within country substitutions with a low GF offset were rare; within country substitutions always had a higher GF offset, corresponding to higher expected maladaptation, than the optimal global substitution (Fig. 5b, d, f, h, j). Though maintaining a high degree of maladaptation (GF offset), most within-country substitutions included trajectories moving individuals towards the equator and lower elevations.

The within-country current diversity of environments to which landraces are adapted may be important for finding a suitable substitution. To test this hypothesis, we compared the remaining GF offset for global substitutions versus within-country substitutions within each country. We focused on sorghum because it was the crop with the most countries having viable cultivation in year 2 of the 150 Tg scenario, giving power to compare countries. As expected, all 32 countries with at least 5 sorghum accessions had greater GF offset for within-country substitutions as compared to global substitutions, indicating that global substitutions were always superior to substitutions constrained within a country. Next, we tested for the proportional inferiority of within-country substitutions compared to global substitutions, defined as the remaining GF offset of the optimal global substitutions divided by the remaining GF offset of within country substitutions. The proportional inferiority of within-country compared to global substitutions was not significantly related to the number of landraces from each country (r = 0.17, p = 0.3622). We next tested if the control climate mean and variance influenced the inferiority of within-country substitutions in a multiple regression, while accounting for the number of landraces in each country. We found that the countries with colder mean temperatures (multiple regression, mean cold t = −4.90, p < 0.0005) and greater variability of cold temperatures (variance in cold t = 2.59 p = 0.015) had significantly higher quality of within-country substitutions relative to global. Again, the number of sorghum landraces per country was not a significant predictor of the quality of the within-country substitution (number of landraces t = 1.39 p = 0.177). The overall model explained 58% of the variance in substitution quality (R2 = 0.58) and highlighted the potential value of diversity for regions and nations housing landraces adapted to diverse climates.

Discussion

The resilience of agricultural systems to changing climate determines global food security. In this study, we used information on landrace genetic variation and environment of origin for agronomically important cereal crops to predict disruptions to their adaptation/cultivation and to explore if the diversity of landraces may be beneficial sources of resilience in the case of a soot-producing climate catastrophe. Consistent with other groups who have investigated the consequences of a soot-producing catastrophe on global agriculture7,9 and fisheries34, we find the climate impacts would be devastating to global subsistence agriculture, many locations would become unsuitable for agriculture, and for the most extreme soot scenario, the locations that remain suitable may not have sufficient local landrace diversity within a species to enable a successful substitution of a resilient variety.

Our crop model results correspond to previous estimates of the climate impacts of soot-producing catastrophes6,7,9,32,34,42 while also providing an assessment of the diversity of environments to which crop landraces of globally important cereal crops are adapted. Increases in the number of days simulated to reach maturity corresponded to the climate anomalies of reduced daily temperature and solar radiation. In the years and locations with the greatest climate impacts, landraces in higher latitudes rarely achieved full maturity. Colder temperatures slow down phenological development, and can diminish photosynthetic activity and damage tissue.

We built GF models to summarize current landrace genotype-environment relationships and validated that GF models captured real adaptive differences through use of phenotypic data collected for a broad diversity panel of maize landraces grown in common gardens across Mexico. We show that predicted maladaptation, in the form of GF offset, is associated with height, yield, and stress-related traits, demonstrating a new test of these tools23. Landrace accessions had classic phenotypic patterns of local adaptation when grown in common gardens they had low GF offset (maladaptation) to, suggesting GF models captured broad adaptation of landraces’ local environments43. However, landrace performance was not perfectly predicted by our genotype-environment model. This inability to completely predict adaptation may be attributable to limitations of genotype-environment association approaches or to the maintenance of diversity within environments. In general, reciprocal transplants and common gardens often find mixed evidence for local adaptation44. Genotypes from the same environment may differ in performance in a given common garden environment if processes like migration or environmental fluctuations maintain diversity within populations or if important selective forces are not present in experimental conditions. Our validation methodology confirms that GF offset can be a powerful tool to capture current genotype-environment relationships though our inability to perfectly predict adaptation likely highlights a potential importance of maintaining genetic diversity within a site, which may complicate our ability to model these relationships.

When environments change and populations are not able to track the environmental change through plasticity or rapid shifts in genetic composition, populations may become maladapted and have reduced fitness in a novel climate45. In our case, landraces were predicted to be the most maladapted, or have the highest GF offset, in the locations where climate was the most disrupted from long-term averages, corresponding to the most extreme soot scenarios and the years post-war where atmospheric soot was the most abundant. The strong relationship we observed between GF offset and soot-induced change in climate is perhaps unsurprising. GF models are trained using current genotype-environment associations and any shift in the environment will likely require a change in genomic composition to track adaptation to a novel climate. The ability to interpret the magnitude of offsets derived from GF-derived functions in an ecologically meaningful way has recently become a point of discussion. Genetic-based quantifications of adaptation38 and offset46 can be biased for unsampled areas or if the projected environment exceeds what is used to train the model. Though we have a broad sampling of landrace accessions for each focal crop species that are adapted to a diversity of environments and used in GF genotype-environment models, the extremeness and novelty of post-catastrophic climates used in this study likely make predicting maladaptation difficult47. At the same time, though the true magnitude of maladaptation may be difficult to quantify, our GF models allow us to incorporate measures of climate-associated genomic variation for the identification of the most vulnerable locations that will likely require a varietal substitution. Additionally, our GF models provide insights to the aspects of the environment that may be most related to a crop’s current adaptation, which is likely related to the evolutionary history and cultivation practices of the crop. For example, GF identified average temperature and solar radiation experienced in the reproductive growth phase as most related to rice subsp. indica genome-wide allelic turnover, suggesting these variables may be important in driving local adaptation within this species. Rice landraces of the indica variety are traditionally cultivated in warm, tropical to subtropical locations and may have limited cold tolerance48. While cold and solar radiation are the variables most altered by nuclear winter, perhaps suggesting vulnerability of this species, the GF model also suggests that indica genotypes vary in their adaptation to temperature and light, suggesting there is some mitigation possible with genotype substitutions.

Crop diversity has been suggested as a potential solution to mitigate climate impacts on agriculture24,49. For all crops included in this study, we found that landraces accessions with a distribution farther from the equator were most maladapted to post-catastrophic climates and were most often selected as the best varieties for substitutions. Most substitutions that were well matched to vulnerable locations required long migration distances and for many locations, a landrace adapted to the novel environment at the vulnerable location does not exist within our dataset. Substitutions that maintained a high level of GF offset indicated landrace varieties that may remain maladapted to the novel climate, and no other varieties were better adapted to the vulnerable, tested location. At the same time, for locations where the cultivation of crops remains possible, the identification of multiple suitable genotypes may be important for the maintenance of crop diversity within a site. For smallholders, the development of elite farmer-preferred varieties and the introgression of alleles adapted to novel climates is a priority50, and genotype substitutions identified here could be potential donors of such alleles. For vulnerable locations that were not predicted to have a well-adapted substitution, switching cultivation to faster-maturing crop varieties, or other non-cereal crop species that tolerate lower temperatures (e.g. potato)51, may be a strategy for increased resilience. However, the adoption of a new crop species requires a significant investment by farmers and substantial modifications of farmer and consumer behavior52. It is worth noting that there may be some diversity in response to post-catastrophic conditions in modern elite crop varieties cultivated in wealthier nations, which are not accounted for in this study. Other studies have considered changes to global crop productivity under nuclear conflict, including Jagermeyer et al. (2020) who showed that even a relatively small nuclear strike (e.g., 5 Tg of soot) would drastically impact crop production.

Though our study highlights maladaptation in cereal crop landraces following a soot-producing catastrophe, methodology used in this study can also be leveraged to understand disruptions to adaptation and possible genotype substitutions (also known as assisted gene flow) given any change in climate, including greenhouse gas induced climate change53. Our results highlight the global maladaptation of landraces under post-catastrophic climates and indicate that for the landrace populations most vulnerable to a climate catastrophe, the within-species genetic diversity in a country may not be sufficient for resilience and substitutions across country borders of further distances may be required.

Methods

We used landraces to characterize global disruptions to adaptation and identify resilient accessions in the case of a climate catastrophe that produces soot. Selected landrace crop species fulfilled two criteria - 1. Landrace relatives of the species account for a large portion of accessions currently grown and 2. High quality sequencing data of geographically diverse accessions were publicly available. From these criteria, we selected four cereal crop species-Hordeum vulgare L. (barley), Oryza sativa L. (rice) subsp. indica and japonica, Zea mays L. (maize), and Sorghum bicolor (L.) Moench (sorghum). For all analyses, the rice subsp. indica and japonica were run separately. Altogether, the species cover most of the agricultural areas of the globe and are cultivated in and adapted to diverse climate regimes.

Weather data

Previously published weather data described in Toon et al. (2019) and Coupe et al. (2019) simulate the climate impacts of India-Pakistan and US-Russia wars using the Community Earth System Model (CESM, version 1.3) with the Whole Atmosphere Community Climate Model Version 4 (WACCM4, version 4) as the atmospheric component, or CESM-WACCM454. To more accurately represent the evolution of smoke injection, the Community Aerosol and Radiation Model for Atmospheres (CARMA55,56) is coupled with WACCM to simulate the injection, lofting, advection, and removal of soot aerosols42,57.

The climate impacts of nuclear war were simulated by injecting varying quantities of black carbon aerosol (soot) into the stratosphere in a layer between 100 and 300 hPa over a 1 week period starting on 15 May above the U.S. and Russia, or the South Asian subcontinent6,32,42. In total, six nuclear war scenarios were simulated, and we refer to the year soot was injected as year “0”. For the five India-Pakistan nuclear war scenarios (soot injections of 5 Tg, 16.1 Tg, 27.3 Tg, 36.6 Tg, and 46.8 Tg, representing a range of arsenal sizes) simulations were each run for 19 years. One United States-Russia scenario with a 150  Tg soot injection was also considered, and the simulation was run for 21 years. This scenario assumes both countries use most of their nuclear arsenals58 and is still possible given modern nuclear arsenals. Additionally, we used one ensemble member from the three in the control simulation to represent the natural weather variability. The control simulation was forced by year 2000 radiative forcing and ran for 20 years6,32.

Cycles

The Cycles agroecosystem model was used to infer growth and stress variables of landrace accessions’ point of origin using conditions accessions are expected to be adapted to (control scenario) and post-catastrophe (six post-nuclear war soot scenarios). Cycles is a process-based multi-year and multi-species agroecosystem model2,33 that requires a number of input files to simulate crop growth. All simulations were carried out using Cycles v0.13.0 (https://github.com/PSUmodeling/Cycles). The crop description file defines the physiological and management parameters that control the growth and harvest of crops used in the simulation. For each of our crop species, we used Cycles default crop parameters from the default crop description file. The management (operation) file defines the daily management operations to be used in a simulated crop rotation. We activated conditional planting where Cycles “plants” a simulated crop once certain soil moisture and temperature levels are satisfied within a window of planting dates. For many of the scenarios where planting conditions are not met (i.e. daily temperature remains too low) Cycles forced planting on the last day of the planting window. We turned on the automatic nitrogen fertilization option and set planting density to 67% for all crops in the simulation to be grown without nitrogen limitations so that stress observed in model outputs was due entirely to climatic factors. Weather files were built using the CESM-WACCM4 outputs as described in refs. 6,32 for one control and six post-nuclear soot scenarios, formatted for use in Cycles. The weather files were generated by aggregating the three-hourly CESM output to daily time steps at all CESM grids, which have a 1.9° latitude × 2.5° longitude resolution (https://doi.org/10.6084/m9.figshare.28732919). Weather files were matched to landrace point of origin for each simulated accession, where the climatic parameters used to simulate growth match the location accessions were sourced from. Weather files included variables describing variation in daily precipitation, temperature, solar radiation, humidity, and wind. Soil physical parameters were obtained from the ISRIC SoilGrids global database59 via the HydroTerre data system60,61,62 for all simulation locations. Soil files were also matched to landrace point of origin for each simulated accession and describe the average soil characteristics and land use for crop cultivation types. For accessions designated as paddy rice by ref. 22 we used the irrigated or post-flooding land use type. Rainfed land use type was used for all other simulated crop accessions.

For all simulated accessions of each crop species, seven Cycles simulations, including the control scenario and six soot scenarios were implemented separately. Cycles models simulated 20 years of crop growth for the control scenario, 15 years of crop growth after impact for the India-Pakistan scenarios (5 Tg, 16.1 Tg, 27.3 Tg, 36.6 Tg, and 46.8 Tg, and 150 Tg), and 17 years of crop growth for the US-Russia scenario (150 Tg). From the outputs of each Cycles simulation and for each year growth was simulated, we extracted variables summarizing the environmental stress and simulated growth plants experienced for each of our focal crop species (Cycles-derived environmental variables). Variables included information on the number of days to reach maturity, water stress, cold stress, and light stress experienced across simulated plant growth and when in the vegetative and reproductive phase (Supplementary Table 1). For accessions not projected to reach maturity, certain environmental summary variables were not extractable, and we imputed the 95% stress of the variable for each accession with missing environmental values, specific to crop species and the year growth was simulated for.

Genotyped datasets

As differences in genotyping resolution across species might influence the detection of genomic signals of adaptation, we selected datasets with high density genomic markers and a distribution of sequenced landraces accessions that most represented the environments that landraces of our focal species originate from and are likely adapted to. Advances in technology have made low-coverage whole-genome sequencing (WGS) relatively inexpensive, providing datasets that are particularly well-suited for research exploring polygenic signals. All genotype files were processed in PLINK v1.90b6.13, an established software for analyzing and filtering genotypic data63. For each landrace species, raw genotype files were filtered for minor allele frequency (MAF) removing all SNPs with <5% MAF and for linkage disequilibrium (LD) to reduce the number of SNP candidates we tested for environmental association. As the initial genotype files differed in size, the LD filter step included different conditions to thin files. We used --indep-pairwise 30 10 .1 for both rice subsp. (indica and japonica) and sorghum, and --indep-pairwise 100 10 .05 for the maize and barley data files. This filtering step resulted in 74,430 SNPs for barley, 43,818 SNPs for rice subsp. japonica, 61,430 SNPs for rice subsp. indica, 67,522 SNPs for maize, and 20,387 SNPs for sorghum to test for association to the species-specific Cycles-derived environmental variables of the control simulation.

Genome scan for environmentally related SNPs

Genotype-environment associations test for genetic variation that is statistically correlated to environmental predictors. We followed partial redundancy analysis (pRDA) methods developed by ref. 38 to identify loci putatively involved in environmental selection for our focal crop species. For each crop species, pRDA models were built using population allele frequencies (population defined as accessions from the same geocoordinates) from the filtered genetic dataset as response variables and the 13 Cycles-derived environmental variables from the control simulation, averaged across the 20 years of modeled growth as explanatory variables. Neutral genetic structure was accounted for by including the first three axes of a population PCA as conditional covariables. Using the rdapat function described in ref. 64, we identified the top environmentally related (outlier) loci based on the extremeness of their loading along a Mahalanobis distance distribution calculated between each marker and the center of the first two pRDA axes. P-values for each marker were derived as this distance, corrected for the inflation factor using a chi-squared distribution with two degrees of freedom. We then selected the top 1000 markers with the lowest P-values as candidate outliers to represent loci that may be important for environmental adaptation. The analysis was carried out using R/vegan v2.6-465.

To assess whether the top loci selected by pRDA are unique to the method, we further implemented Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK, https://github.com/YaoZhou89/BLINK) and compared the significant loci as identified by BLINK and pRDA for sorghum. BLINK is a package commonly used for genome-wide association studies (GWAS) and improves upon traditional GWAS methods by addressing limitations such as computational inefficiency and reduced statistical power66. We ran BLINK separately for the same 13 Cycles-derived environmental variables used in the sorghum pRDA model and extracted the set of significant loci (p-value < 0.05) for each BLINK model that were built separately for each climate variable. For all BLINK models, the first three axes of a population PCA were used as covariates to account for population structure. We then compared the set of BLINK-identified significant loci across all 13 models (8728 unique SNPs) to the 1000 most significant loci as identified by pRDA and found that 556 SNPs were present in both datasets. Thus, the overlap between genotype-environment association methods for identifying loci that are related to variation in environmental gradients confirm that the results are not highly sensitive to the approach.

Identification of flowering time SNPs

We further accounted for genetic variation that may capture important plant phenological processes by including SNPs of known flowering time network loci for each focal crop species. We conducted a literature search to identify genes known to be involved in the flowering time network for each crop (Supplementary Data 1). Gene coordinates of each flowering time gene were gathered from the gff3 files that corresponded to each reference genome used to call SNPs (maize reference B73v2, https://figshare.com/articles/dataset/GTF_and_GFF_for_maize/895628; rice reference R498 IGDBv3, http://mbkbase.org/R498/; sorghum reference S. bicolorv3.1, https://phytozome-next.jgi.doe.gov/info/Sbicolor_v3_1_1)67,68,69. For maize, rice subsp. indica and japonica, and sorghum we also included SNPs found +/- 5 kilobase (kb) of each flowering time gene to account for variation in cis-regulatory elements. Barley sequence information was reported as contigs and we extracted SNPs located in contigs previously identified to overlap with homologs of well-characterized genes in Arabidopsis thaliana35 (barley reference whole-genome shotgun assembly of barley cv. Morex NCBI accession: CAJW01, https://www.ebi.ac.uk/ena/browser/view/CAJW010476474.1)70. Gene coordinates for the location of each flowering time gene region or flowering time related contig extracted using—extract in PLINK63. To account for patterns of linkage disequilibrium, we further filtered each species’ set of flowering time loci (gene and sites up and downstream of the gene) and only retained SNPs with an r2 value <0.2 within the flowering-time genic window and flanking region.

Gradient forest models and calculation of offset

Gradient forest (GF) is a machine learning algorithm extended from random forest which searches for genotypic patterns as associated with environmental descriptors. Using R/gradientForest::gradientForest40, we built GF models to associate current adaptive allelic diversity (the combined set of pRDA-identified environmentally related loci and flowering time network loci) with Cycles-derived environmental variables from the control simulation, averaged across the 20 years of modeled growth (hereafter, control GF model). Models were built separately for each of our focal crop species to describe control species-specific genotype-environment relationships. The control GF model parameters were tuned to increase the number of trees built to ntree = 500.

Genomic offset (also known as genomic vulnerability) is one metric used to characterize maladaptation with a genomic context (reviewed in ref. 28). The distance between current and expected genotype-environment associations under some change in environment is representative of the genomic offset, or the genetic shift required in a population to adapt to the future climate. Comparing the control genotype-environment association captured by GF models (control GF model), and the projected genotype-environment association for different scenarios (common garden, target scenarios, vulnerable locations) we made several measurements of GF offset to summarize predicted maladaptation. For all GF offset calculations, we followed methods described in ref. 24.

To validate that our control GF models captured current genotype-environment associations, we first used the maize control GF model to predict the genomic composition expected at common gardens maize landraces had been phenotyped in for a previous study36. Here, the GF offset was defined as the Euclidean distance of current genotype-environment relationships at the common garden site from the genotype-environment relationship of each landrace’s point of origin. This measure summarized how genetically well matched a landrace was to the common garden it was grown in (measure of predicted maladaptation to a common garden) and was compared to the phenotypic breeding values for each landrace grown in a common garden.

Validation of gradient forest-predicted adaptatio

We used phenotypic data of maize landraces grown in 23 trials across 13 common garden locations over 2 years to confirm that our control GF models captured real differences in current landrace genotype-environment relationships. We restricted our analysis to include the phenotypic data of landraces accessions that were simulated in Cycles models.

Briefly, phenotyped accessions are a part of the broader seed evaluation of the maize landrace collection36. Accessions were planted in multiple environments under a replicated F1 crossing. Importantly, two features of the crossing design ensure that phenotype data is not overly biased by elevational adaptation. Crossed plants were preferentially grown in locations that were of similar adaptation (highland tropical, sub-tropical or lowland tropical) to their home environment and each plant was crossed to a tester that was adapted to the environment that the F1 seeds were grown in. These design features allowed for comparison of a larger sample of accessions, but also led to an unbalanced experimental design. As a further consequence of the experimental design, apparent adaptive differences among landraces may be reduced and make phenotypic estimates of adaptation more conservative27. We extracted phenotypic information capturing differences in plant height (PH), the total weight of ears (kernels and cob) measured in the field (field weight; FW), bare cob weight (BCW), moisture adjusted grain weight per hectare (GWH), days to anthesis (DA), days to silking (DS), and anthesis-silking interval (ASI) for plants grown in trials (https://data.cimmyt.org/dataset.xhtml?persistentId=hdl:11529/10548233). The number of phenotypic measurements of maize landraces ranged from n = 4851 (BCW) to n = 11,762 (ASI) across the common garden field sites. Following methods from Gates et al. (2019) and Romero Navarro et al. (2017), we estimated breeding values controlling for tester, checks, and field position in a complete nested model. We further accounted for the random effect of the tester and year.

Calculation of offset under post-catastrophic condition

Once we confirmed that our maize control GF model captured phenotypic differences representative of adaptation to common gardens with conditions most like the source locations landraces are sourced from, we extended our control GF models to predict maladaptation in crop landraces under the six post-war target climate scenarios. Here, GF offset was calculated as the Euclidean distance between the current predicted genotype-environment association (control GF model) and for each soot scenation, the future projected genotype-environment association across all 13 Cycles-derived environmental variables.

We confirmed that GF offset values were correlated to relative changes in environment that were most related to genotype-environment associations, as summarized by the control GF model, by comparing the difference in GF offset (target subtracted from control) versus the change in Cycles-derived environmental variables (Supplementary Fig. 6). Environmental variables were scaled and adjusted by their relative contribution to GF models.

Identification of landrace substitutions for post-catastrophe adaptation

To understand if existing landrace diversity may be a source of resilience following a climate catastrophe, we used our GF models to identify the best suited substitutions for locations with the landraces that are the most maladapted24,29 under the most extreme target scenario (150 Tg) and for the year where climate is the most disrupted (year 2 after soot injection). We first excluded all locations not predicted to reach maturity, so as not to identify substitutions to locations where agriculture would not likely be possible. For the remaining locations, we defined the most vulnerable locations as those with the highest GF offset to search for both the most optimal and the best within-country substitution. Clusters of vulnerable pixels were identified using R/DBscan::dbscan71, which groups pixels based off proximity. Clustering was based on the geographic distance between vulnerable pixels measured with R/geosphere::distm72. Only clusters separated by <1000 km were retained for further analysis.

For each vulnerable cluster, a GF offset (i.e., Euclidean distance) was calculated between the projected genotype-environment association of the vulnerable location under the 150 Tg target scenario and the control GF modeled genotype-environment association (control GF model) across all landrace accessions included in the model. The lowest GF offset was defined as the minimum Euclidean distance and identified the current landrace accession predicted to be best adapted to the future climate conditions of the vulnerable area. For landrace accessions that were not perfectly adapted to the locations they were substituted to (i.e GF offset is >0), this measure represented the genomic gap that still needs to be filled for the migrated varieties to be fully adapted to the conditions of their new location (assuming current genotype-environment relationships are representative of perfect adaptation). High GF offset indicated substitutions where accessions are not predicted to be well adapted to the locations they were substituted to, and no other landraces included in the model were better adapted to the projected climate of the vulnerable area.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.