Main

Natural climate solutions (NCS) are deliberate human actions that protect, restore and improve management of land and oceans to mitigate climate change1. They are considered an essential tool for mitigating climate change in this century2,3. However, a possible consequence of widespread adoption of these actions on crop-lands is yield losses, leading to land-use change to meet food demand, undermining climate change mitigation and other sustainable development goals. Crop-land NCS should preferably sustain or increase crop yields to avoid additional land conversion4.

In experimental studies, crop-land NCS practices have been reported to produce variable outcomes for yield and GHG mitigation, especially in the initial years after adoption5,6,7,8,9. At larger scales, long-term trade-offs between yield and mitigation goals are largely unknown. Prior empirical analyses provided some protections for the global food supply by avoiding reforestation in crop-lands, but, nonetheless, assessments are lacking about how crop-land NCS directly impact crop yields1,10. Process-based modelling can help fill this gap.

Additionally, assessments of the climate change mitigation potential of these practices often focus only on soil organic carbon (SOC) responses1,10,11,12,13,14,15,16. Meta-analyses and regional modelling suggest that the net GHG mitigation benefit of adopting NCS for SOC storage, which removes CO2 from the atmosphere, may be partially negated by growth in soil N2O emissions6,17,18, a GHG that has a global warming potential of 273 relative to CO2 over a 100-year time horizon. Despite these research gaps, there is growing momentum across the public and private spheres to rapidly adopt NCS practices to address climate change with the assumption that yield and other ‘regenerative’ co-benefits will follow.

Cover cropping is viewed as a widely favourable NCS in crop-lands. Cover crops increase SOC through higher net primary production in annual cropping system rotations7,19,20. Cover crops also uptake excess nitrogen fertilizer, oftentimes reducing N2O (and nitrate) losses7,21. But the choice of cover crop functional type—especially the distinction between leguminous or non-leguminous species—can impact soil C sequestration and N dynamics in different management and soil conditions19,22. Observed impacts of cover crops on crop yields vary depending on initial SOC, cover crop functional type, cropping system and tillage management7,8,9. Combining cover crops and no tillage together, which reduces soil disturbance, may lead to better environmental and production outcomes5,23. However, the mitigation potential for cover crops and no tillage is either reported separately or assumed to be additive by combining global meta-analysis effect sizes for individual practices11.

Here we compared global and regional outcomes of immediate 100% adoption of these NCS practices compared to GHG and yield responses under continued, conventional crop-land management, that is, fallow and tillage, using a process-based model. This resulted in four crop-land NCS scenarios, adoption of grass or legume cover crops with conventional tillage or no tillage, all of which included full residue retention of cover and cash crop biomass. We also evaluated the efficacy of these individual and combined NCS practices in the context of a changing climate and over multi-decadal time frames to 2100, these extended time frames being critical to assessing the longer-term impacts of practices that must be maintained indefinitely to ensure that carbon remains stored in SOC24,25,26.

The objectives of our study were to: (1) identify scenarios that led to favourable GHG mitigation and yield outcomes over the near and medium terms (defined as mid-century and end of century, respectively), (2) determine environmental and management drivers of favourable outcomes and (3) quantify the potential costs (trade-offs) that arise from adopting crop-land NCS maximizing for GHG mitigation or yield outcomes or adoption that balances both climate change mitigation and crop production goals.

Favourable climate and production outcomes are not ubiquitous

Our model analysis shows that it is not possible to simultaneously maximize global GHG mitigation and yield benefits from crop-land NCS (Fig. 1). The highest modelled GHG mitigation potential is under the adoption of grass cover crops with no tillage (universal adoption), with a near-term (2016–2050) cumulative mitigation of 32.6 Pg CO2 equivalent (CO2-eq), 95% confidence interval (CI) (29.5, 35.7) and a medium-term (2016–2100) cumulative mitigation of 41.3 Pg CO2-eq, 95% CI (35.4, 47.2) (annual values in Fig. 1). However, this GHG mitigation comes at the cost of reducing crop yields by a cumulative 4.8 Pg of dry mass, 95% CI (4.0, 5.7) and 8.2 Pg, 95% CI (6.2, 10.1), respectively. Whereas legume cover crops with no tillage results in higher yields, 6.2 Pg, 95% CI (4.8, 7.7), and 19.3 Pg, 95% CI (15.4, 23.2), in near and medium terms, respectively, the mean GHG mitigation potential is 7% lower than grass cover crops with no tillage in the near term (significant at a 95% CI based on one-sided difference bootstrap significance test). Notably, the GHG mitigation potential of legume cover crops with no tillage is 69% lower than grass cover crop with no tillage over the medium term because of elevated soil N2O emissions and declining soil carbon storage (Fig. 1 and Supplementary Figs. 13).

Fig. 1: Global and regional annual mean GHG mitigation potential by annual yield difference for crop-land NCS relative to continued management.
figure 1

ad, Data are presented as mean values with uncertainty (± s.e.m.) for near term (a,b) and medium term (c,d); horizontal and vertical bars are annual GHG mitigation and yield uncertainty, respectively (n = 24). CC, cover crop; Ntill, no tillage.

The nature of this GHG mitigation and crop yield trade-off varies spatially (Fig. 2 and Extended Data Figs. 13). Around 90% of the modelled GHG mitigation potential from grass cover crops with no tillage is in developed countries (DEV; 13.1 Pg CO2-eq, 95% CI (12.2, 14.0), in the near term and 20.0 Pg CO2-eq, 95% CI (18.3, 21.7), in the medium term) followed by Latin America and Caribbean (LAC) countries and Asia and Developing Pacific (ADP) countries. However, these regions also experience the highest yield reductions, 2.4 Pg, 95% CI (2.2, 2.7), 0.9 Pg, 95% CI (0.8, 1.0) and 0.6 Pg, 95% CI (0.3, 1.0), lower in the near term, respectively, and similar differences in the medium term. The GHG mitigation potential of grass cover crops with no tillage is lowest in Africa and Middle East (AME) countries and East Europe and West–Central Asia (EEWCA) countries, but crop yield impacts are more heterogeneous, and generally, there are smaller yield reductions in these regions. The magnitude of mitigation potential in each region largely arises due to the amount of crop-land area, but per-hectare responses usually follow a similar trend across all regions (Supplementary Figs. 14). We note that the coefficient of variation of the modelled hectare-level responses tends to be lower in DEV countries (typically across all scenarios) but is much higher in other regions (Supplementary Figs. 5 and 6).

Fig. 2: Near-term, 2016–2050, mean annual GHG emissions and yield differences for crop-land NCS relative to continued management practices.
figure 2

a,b, Grass CC (GHG (a); yield (b)). c,d, Grass CC + Ntill (GHG (c); yield (d)). e,f, Legume CC (GHG (e); yield (f)). g,h, Legume CC + Ntill (GHG (g); yield (h)). Base map data in ah from The World Bank under a Creative Commons license CC BY 4.0.

Grass cover crops with tillage have a lower global GHG mitigation potential than with no tillage but also a reduced (albeit still negative) impact on crop productivity (Figs. 1 and 2). Cumulative GHG mitigation potential is highest in DEV and LAC countries in the near term, 4.0 Pg CO2-eq, 95% CI (3.5, 4.4), and 3.6 Pg CO2-eq, 95% CI (2.9, 4.3), respectively, but medium-term GHG mitigation potential in LAC countries is 56% lower compared with the near term because of decreasing soil C sequestration rates. The negative effect of grass cover crops with tillage on crop yields in DEV countries is reduced by 74% in the near term and 80% in the medium term compared to grass cover crops with no tillage (Supplementary Fig. 4). The heterogeneity in yield responses to grass cover crops in AME, ADP and EEWCA countries results in smaller yield reductions at the regional scale in the near term and a small crop yield benefit in AME and EEWCA countries in the medium term (Fig. 1 and Extended Data Figs. 13).

Adopting no tillage with legume cover crops attenuates the elevated soil N2O emissions associated with biological N fixation offsetting these emissions through higher soil carbon storage. Modelled near-term global GHG mitigation potential is 30.3 Pg CO2-eq, 95% CI (27.9, 32.7), whereas in the medium term, it is 12.9 Pg CO2-eq, 95% CI (8.0, 17.8) (Fig. 1). Unlike grass cover crops, legume cover crops have a positive effect on crop yields; global modelled crop yields in the near term are 6.2 Pg, 95% CI (4.8, 7.7), higher than continued crop-land management practices, and this effect increases over time where medium-term crop yields are 19.3 Pg, 95% CI (15.4, 23.2), higher. Regionally, DEV countries are almost 40% of the global GHG mitigation potential in the near term, and moreover, DEV countries account for nearly all of the global GHG mitigation potential in the medium term. While lower in the near term, crop yields are about 1.0 Pg, 95% CI (0.3, 1.8), higher in DEV countries in the medium term. However, while crop yield benefits become more apparent over time with this scenario, especially in LAC and AME countries, legume cover crops with no tillage do not always lead to medium-term GHG mitigation (that is, AME and ADP countries).

Whereas legume cover crops with tillage lead to higher yields than with no tillage, especially over the near term, GHG mitigation potential is considerably lower and even a source of GHG emissions. Specifically, global crop yields are 38% higher and 22% higher in the near and medium terms, respectively, compared to legume cover crops with no tillage. While highly variable, legume cover crops with tillage are a small global net source of emissions in the near term, 0.4 Pg CO2-eq, 95% CI (−1.7, 2.4). Over the medium term there is a stronger global signal with GHG emissions that are 36.9 Pg CO2-eq, 95% CI (31.3, 42.6), higher than continued crop-land management practices, largely arising from higher emissions in ADP countries.

Drivers of favourable climate and crop yield outcomes

The variation in outcomes suggests that climate, soil and management features drive GHG mitigation and crop responses to crop-land NCS. To identify and rank the importance of these features, we used SHapley Additive exPlanations (SHAP values) to explain feature importance for predicting joint-outcome classifications in random forest models for each crop-land NCS scenario (Supplementary Table 1). Below, we focus on the conditions that lead to favourable outcomes for end-of-century climate change mitigation and crop yield benefits (additional outcomes in Supplementary Fig. 7).

The most important features for predicting favourable outcomes across all the crop-land NCS scenarios are related to soil properties, cropping system and nitrogen (Fig. 3). Management-related features, such as cash crop type, water management, nitrogen application and initial residue retention fraction, frequently contribute to the likelihood of favourable outcomes. Soil bulk density, which is highly correlated with soil texture (that is, clay content), and initial soil nitrate content are also key features probably because of the influence of soil physical properties on soil C stabilization and on crop growth in the model. Climate-related variables, such as mean diurnal range, are not ranked among the top features for any of the scenarios.

Fig. 3: Top five features for predicting medium-term, 2016–2100, favourable outcomes ranked by mean absolute SHAP value estimated from random forest models.
figure 3

a, Grass CC. b, Grass CC + Ntill. c, Legume CC. d, Legume CC + Ntill.

There is an increased likelihood of favourable outcomes in maize and wheat compared to soybean cropping systems, which is probably a factor of additional N from N fixation provided by legume cover crops (Fig. 4). Irrigated cropping systems are also more likely to have favourable outcomes by alleviating water limitation. The response across crop-land NCS scenarios to nitrogen applications (fertilizer and manure) and initial residue retention amounts are more varied; the highest rates of N application and residue retention are more likely to lead to benefits for grass cover crops with tillage and legume cover crops with no tillage but not for legume cover crops with tillage.

Fig. 4: Partial dependence plots of top five features for predicting medium-term, 2016–2100, favourable outcomes.
figure 4

Positive SHAP value indicates higher likelihood of favourable outcome. Lines and points are mean SHAP values for continuous and categorical features, respectively. Figure 3 provides key to feature colours. ae, Grass CC (soil bulk density (a); initial soil nitrate (b); initial residue fraction (c); water management (d); cash crop (e)). fj, Grass CC + Ntill (soil bulk density (f); cash crop (g); initial soil nitrate (h); water management (i); initial residue fraction (j)). ko, Legume CC (cash crop (k); water management (l); nitrate fertilizer fraction (m); nitrogen inputs (n); initial soil nitrate (o)). pt, Legume CC + Ntill (cash crop (p); initial soil nitrate (q); soil bulk density (r); nitrogen inputs (s); initial residue fraction (t)).

Balancing climate and crop benefits from crop-land NCS

Rapid scaling of crop-land NCS practices can be supported with spatially explicit adoption recommendations based on mutually beneficial mitigation and crop production outcomes. The results above suggest that these recommendations will vary in space and time depending upon desired goals; more sustainable outcomes will avoid compromising yield for GHG mitigation (or vice versa).

We find that pursuing a goal of maximum GHG mitigation, but ignoring crop yield impacts, results in global near-term GHG mitigation of 35.6 Pg CO2-eq, 95% CI (32.8, 38.5) and medium-term GHG mitigation of 44.0 Pg CO2-eq, 95% CI (38.4, 49.6) (Extended Data Fig. 4). Most of this potential is attributed to the adoption of grass cover crops with no tillage across all regions. However, while the mean crop yield impact is positive in the near term, 2.3 Pg, 95% CI (1.4, 3.3), global crop yields are 3.6 Pg, 95% CI (1.6, 5.6), lower in the medium term. Regionally, the highest near- and medium-term GHG mitigation potential is concentrated in DEV countries, but crop yields are 1.6 Pg, 95% CI (1.3, 1.8), lower in the near term and 3.5 Pg, 95% CI (2.9, 4.0), lower in the medium term (Fig. 5). EEWCA is the only region with clear GHG and yield benefit outcomes through the medium term.

Fig. 5: Regional practice recommendations for maximizing GHG mitigation potential while balancing yield outcomes.
figure 5

Annual mean estimates with 95% CI are reported for the near term, 2016–2050, and medium term, 2016–2100 (n = 24). Positive values indicate GHG mitigation or higher yields whereas negative values indicate GHG emissions or lower yields relative to continued management practices. Base map data from The World Bank under a Creative Commons license CC BY 4.0.

Likewise, pursuing a goal of maximum crop yield but ignoring GHG impacts leads to 9.6 Pg, 95% CI (8.3, 10.9), and 25.5 Pg, 95% CI (21.9, 29.1), higher crop yields in the near and medium terms, respectively (Extended Data Fig. 4). Under this goal, the recommended scenario for most crop-land is to adopt legume cover crops with tillage. Most of this potential is found in LAC countries (10.4 Pg, 95% CI (9.4, 11.4), in medium term) followed by AME countries (5.6 Pg, 95% CI (4.8, 6.4), in medium term). However, pursuing a maximum crop yield goal has modest (and variable) GHG mitigation potential of 1.4 Pg CO2-eq, 95% CI (−0.2, 2.9), in the near term but increases GHG emissions by 20.1 Pg CO2-eq, 95% CI (15.7, 24.5), over the medium term, with almost 40% of these emissions attributed to AME countries.

Balancing both climate and production goals substantially reduces the benefits of crop-land NCS (Fig. 5). Restricting adoption recommendations to Pareto-optimal practices that maximize climate benefits without sacrificing any reduction in crop yields, leads to near-term global GHG mitigation of 4.4 Pg CO2-eq, 95% CI (4.2, 4.6) (Extended Data Fig. 4). Conversely, adopting only practices that maximize crop yields without sacrificing an increase in GHG emissions leads to a near-term global GHG mitigation of 4.1 Pg CO2-eq, 95% CI (3.7, 4.4). Near-term global crop benefits range from 3.8 Pg, 95% CI (3.7, 3.9), under a goal of maximum GHG mitigation (no crop yield reductions) to 5.6 Pg, 95% CI (5.3, 6.0) under a goal of maximum crop yields (no GHG emissions increases; Extended Data Fig. 4). This is a reduction of about 87% of the maximum GHG mitigation potential from crop-land NCS and of up to 60% of the maximum yield benefit in the near term; this is because in many instances no crop-land NCS practices lead to favourable outcomes, and we assume a continuation of conventional management practices. Under the balanced goals in the medium term, global GHG mitigation potential from crop-land NCS are 85–98% lower than the maximum GHG mitigation estimate, and global yield benefits are 79–90% lower than the maximum yield benefit estimate. LAC and DEV countries generally have the highest mitigation and crop production benefits (Fig. 5 and Extended Data Fig. 5).

Discussion

Global food and animal feed demand has been predicted to grow more than 50% over the next three decades27. This growth must occur on current crop-lands to avoid deforestation and other land-use changes that release additional CO2 to the atmosphere. We find that modelled crop-land NCS provide crop co-benefits in some, but far from all, circumstances. Adopting crop-land NCS can boost crop yields, albeit at a far lower climate benefit when balancing mitigation with the need for higher crop production for food and feed.

Previous assessments largely excluded soil N2O, future climate change and other dimensions of food production such as yield1,10,11,12,13,16,28. By including these components, our estimates of GHG mitigation potential from cover crops and no tillage are considerably lower than some other previous studies, which include estimates of crop-land GHG mitigation as high as 7 Pg CO2-eq yr−1 (refs. 16,29), whereas more recent estimates focused on individual or combined practices place the potential at 1.0–2.5 Pg CO2-eq yr−1 (refs. 11,12,13). Our near-term cumulative estimate of 35.6 Pg CO2-eq (1.0 Pg CO2-eq yr−1, 95% CI (0.94, 1.1)) based on a goal of maximizing GHG mitigation (ignoring crop yield impacts) is within the lower end of this range; however, our estimate based on balanced GHG mitigation and crop yield benefits is substantially lower. Regardless, direct comparisons should be interpreted cautiously because of differences in estimation methods, assumptions around practice adoption rates, cover crop types and applicable crop-land area in previous assessments.

There is appreciable uncertainty in our modelled GHG mitigation and crop yield estimates. Similar to other recent DayCent studies, grid-level GHG uncertainty exceeded 100% in many areas30,31. This uncertainty was still large at regional and global scales (reported as a 95% CI), but we were able to detect a clearer signal at these aggregate scales from NCS practices. Model uncertainty is large because of variability in experimental site data that is fully captured by DayCent30,32. We also found some evidence for a synergistic SOC effect (that is, more than additive) of combining residue retention, no tillage and cover crops for which there is limited empirical evidence and needs further investigation23. Further, the model has not been validated across all the geographies included in this study due to limited research, particularly in developing countries. Long-term benchmark datasets, of which there are few, of SOC, N2O and crop yield response to individual and combined practices are a critical need for model improvement and uncertainty reduction. In future work, multi-model ensembles—still uncommon in this field—could produce more precise estimates although this approach may not produce results more robust than single models, especially in the absence of large-scale empirical model calibration efforts33.

Crop-land NCS as an emissions reduction and carbon dioxide removal strategy presents additional risks. Newly sequestered soil C from crop-land NCS is vulnerable to re-release back into the atmosphere if management subsequently reverts to less sustainable practices. Recent work indicates that CO2 storage of less than 1,000 years is inadequate for negating fossil fuel emissions34, and the fate of SOC storage over these extended timescales is impossible to predict with confidence. Crop-land NCS with less long-term durability can still support land-based emissions reductions targets and build back the soil carbon debt35,36,37,38 in the shorter term. However, it is important to recognize that this is a removal that would need to be continually managed rather than an inherently durable form of sequestration39.

Yield benefits from cover cropping with no tillage were more consistent than GHG mitigation outcomes over the next several decades. This points to a clearer opportunity to address growing food and animal feed demand with these practices over a carbon-first agenda40. We found only modest GHG mitigation benefits—cumulative near-term mitigation (35 years) amounted to less than 10% of all annual GHG emissions in 202341—after accounting for soil N2O and crop production impacts. Hence, better soil management will have larger benefits for food production and only constitute a fraction of food system decarbonization; reaching climate goals in the food system will entail closer scrutiny of GHG-intensive foods.

Methods

DayCent model

We used the DayCent v. 380 ecosystem model to simulate future crop yields, SOC to a depth of 0.3 m and soil nitrogen dynamics to the bottom of the soil profile in maize, soybean and spring wheat crop-lands. The Daily Century model, DayCent, simulates the dynamics and flows of carbon, nitrogen, phosphorus and sulfur between soils, vegetation and the atmosphere at a daily time step. For this study we only included the elemental flows of carbon and nitrogen. The model requires daily weather, soil physical properties, selection of crop type and cultivar and crop-land management information as input data. DayCent estimates soil carbon and nitrogen balances, trace gas emissions, plant productivity and soil water balances in terrestrial ecosystems42.

Here we simulate soil C dynamics to a depth of 0.3 m and water and nitrogen flows to the bottom of the soil profile. The soil C submodel conceptualizes SOC turnover in three separate pools: active, slow and passive43. Microbial processes and products are implicitly represented in the model. In the active pool SOC turnover time is months to years, and it contains more easily metabolized plant materials and soil microbes. The slow pool contains more chemically complex plant material and microbial products with a turnover time of decades. Finally, the passive pool represents physically and chemically stabilized SOC with centuries-long turnover times. Climate variables and soil properties influence the decomposition rates of all three pools, and physical disturbance, particularly by tillage, impacts decomposition of the active and passive pools. Soil texture, that is, clay and sand content, affects C transfer between pools.

Recent improvements to the DayCent model include its plant production44 and soil organic matter modules43 and regionalized crop parameters45. The plant production module better simulates plant canopy production using a green leaf area index based on growing degree days. It also incorporates the evapotranspiration calculation recommended by the Food and Agriculture Organization of the United Nations (FAO)42,44,46. A Bayesian calibration method applying the sampling importance resampling was used to parameterize the soil organic matter module using a global long-term dataset reducing model uncertainty for predicting soil organic carbon stocks43. This Bayesian framework was recently applied to improve maize crop production parameters resulting in regional crop types that better agreed with crop production data relative to a single maize type45. This same approach was applied to soybean and spring wheat crops for this study. Additional details about the recent calibration and validation of DayCent are discussed in Supplementary Methods.

Geospatial input datasets

DayCent simulations were completed at a 0.5° × 0.5° spatial resolution (approximately 55.5 km at the Equator). The required grid-specific input variables included daily maximum and minimum temperatures; precipitation; soil properties; historic and current land use; management practices including cultivation, planting and harvest; nitrogen inputs (manure and fertilizer); irrigation and tillage43. Climate input data for simulations came from daily downscaled Climate Model Intercomparison Project (CMIP6) climate data for Shared Socioeconomic Pathway (SSP) 3–7.0. We chose not to model SSP5–8.5 because it is no longer considered a plausible future climate scenario given current GHG mitigation commitments by governments leaving SSP3–7.0 as an upper-end estimate47. However, we note the assumptions about future aerosols in SSP3–7.0 used for CMIP6 limit its utility as an upper-end future climate scenario48.

We included 24 global climate models (GCMs) as SSP3–7.0 climate variants in our simulations (Supplementary Table 2); data were obtained from Thrasher et al.49. More information about this dataset and how it was used as input into DayCent is available in Supplementary Methods. Annual atmospheric CO2 concentrations over the simulation period, 2016–2100, for SSP3–7.0 were from Meinshausen et al.50. Our SSP3–7.0 climate variants included ‘hot’ models classified by Hausfather et al.51 in the Sixth Assessment report by the Intergovernmental Panel on Climate Change (IPCC AR6).

Data on crop-land extent for the three crop types (maize, soybean and spring wheat) and water management (rain-fed or irrigated) were from ISMIP52 and soil data from the Harmonized World Soil Database53. Crop-land management information for planting and harvest dates are based on Sacks et al.54 and nitrogen and manure inputs from Jägermeyr et al.55. Because we used a growing degree day version of DayCent, exact crop harvest dates varied slightly from Sacks et al.54. The crop calendars were applied to both rain-fed and irrigated cropping systems because of a lack of disaggregated calendars for irrigated cropping systems. To derive carbon content of manure inputs, we used information about livestock species56 and manure C:N ratios57,58. The fraction of mineral nitrogen fertilizer from ammonium, nitrate and urea was estimated at the national-level from International Fertilizer Association (IFA)59. Because we lacked information about irrigation scheduling, model simulations were run allowing DayCent to apply water up to field capacity through irrigation events whenever soil water fell below 92.5% over the growing season. The proportion of crop residue returned to the soil at time of harvest was determined by country-level estimates for cereal crops from Wirsenius60 and Wirsenius et al.61. We used the crop type ‘cultivars’ developed in Yang et al.45 for maize and applied the same methodology for soybean and wheat. Crop-specific tillage management was from Porwollik et al.62.

Model simulations

The model simulation design included three stages: spin-up, historical baseline and future management with or without crop-land NCS. First, there was a model spin-up period to allow SOC stocks to reach equilibrium with historical climate and soil characteristics. The spin-up period lasted approximately 10,000 years under native vegetation63. For the historical baseline, crops were simulated as continuous rain-fed or irrigated monocultures because of expected GHG emissions differences with water management starting in 1700 or the year of initial cultivation in regions with expanding agricultural production64,65,66. The exact year of agricultural land conversion varied by grid cell and the land conversion year corresponded to the first year with a majority or more of the final crop-land area in 2015. Tillage, fertilization, crop residue return and model crop ‘cultivars’ varied over this period according to historical management from the geospatial datasets described above. Second, we simulated the historical baseline crop-land management period (1700–2015) to approximate initial C and N pools at the start of the future simulation (Supplementary Methods). Finally, we explored future crop-land management through either continued management or the global adoption of crop-land NCS (2016–2100). For crop-land NCS we assumed immediate adoption over the full crop-land area to assess the technical potential consistent with other studies. We did not include expansion or contraction of crop-land area in this analysis.

We simulated five management scenarios in the future period: (1) continued crop-land management under current, conventional practices, (2) grass cover crop with conventional tillage, (3) legume cover crop with conventional tillage, (4) grass cover crop with no tillage and (5) legume cover crop with no tillage. Continued management (scenario 1) includes tillage and residue retention as described above; cover crops and no tillage were not included in this scenario. Crop residue return rates were maintained at the country-level estimates60,61 for the continued management scenario. For the crop-land NCS scenarios (scenarios 2–4), all crop residue was returned to the soil at harvest for a 100% residue retention rate.

Both the legume and grass cover crop were simulated during the fallow period between the main crop harvest and planting dates. Because of the short fallow period between winter wheat crop harvest and planting the following fall, we excluded cover cropping in this cropping system and only included cover crops in spring wheat. Rye was selected to represent the grass cover crop whereas clover was simulated as the legume cover crop. The parameter values for these crops are based on the calibration described in McClelland et al.67. We selected these specific cultivars as functional type representatives because they are informed by the highest number of field observations. Cover crops were planted one week after the main crop harvest and terminated with an herbicide event two weeks before main crop planting. We simulated cover crops on more than 405 million hectares of crop-land.

Nitrogen inputs were unchanged in the five management scenarios. Reducing N inputs even when a legume cover crop is added to a crop rotation is still rarely practised68,69. In a national survey of US growers that adopted cover crops, most reported no N fertilizer savings70. Realizing the potential for reducing N fertilization with cover crops may also take several years to allow for a build-up of soil available N, although this may be more achievable in fields that practise both legume cover cropping with no tillage71,72. It will be important to establish recommended N fertilizer rates to support the reduction of N inputs that minimize N2O emissions without compromising crop yields over time; however, global datasets of recommended N fertilizer rates are lacking. Irrigation scheduling was unchanged but because it is a dynamic estimate in the model based on soil water content, total irrigation water applied varied between scenarios.

Uncertainty estimation

The variation in crop yield responses were estimated from the DayCent simulations for each of the 24 GCM climate variants for SSP3–7.0. To estimate DayCent model GHG emissions uncertainty, we applied a Monte Carlo approach as described in Ogle et al.32 and more recently in Ogle et al.30. Uncertainty associated with errors in model structure and parameters for soil organic carbon stocks and direct N2O emissions were approximated through an empirical method with a linear mixed-effect model. The linear mixed-effect model estimates ‘true’ soil organic carbon stocks or direct N2O fluxes from long-term experimental data as a function of the predicted DayCent stocks or fluxes and other covariates to address bias in DayCent predictions30,73. Random effects for site and year were not included in the uncertainty analysis.

Indirect N2O flux uncertainty associated with volatilized and leached nitrogen was also analysed with a Monte Carlo approach. Indirect N2O emissions, leached and volatilized N, were estimated according to the 2019 Refinement to the 2006 IPCC Guidelines for National Greenhouse Gas Inventories74,75. We applied the disaggregated Tier 1 emission factors (EF) by climate (wet or dry) from Table 11.3 to calculate indirect N2O emission associated with N volatilization and redeposition (EF4) whereas indirect N2O emissions from inorganic and organic N leaching were estimated using EF575.

The Monte Carlo analysis was conducted with 500 iterations to propagate error from the model structural and Tier 1 sources. For each iteration we randomly selected one of the climate variants from the 24 GCMs. Soil organic carbon stocks and direct N2O emissions for each iteration were estimated by a random selection of parameter values from the empirical estimator for each grid cell, crop type (maize, soybean, spring wheat), irrigation management (rain-fed or irrigated) and management scenario. Indirect N2O emissions for volatilization and leaching were estimated for each iteration by a random selection of EF values from a truncated normal distribution based on the standard deviation for EF4 and EF574,75.

Because of failed runs or missing geospatial input data, some grid cells containing a small amount of crop-land area were missing from our simulations. We imputed these missing values with uncertainty to generate regional and global estimates over all eligible crop-land areas in this study (~405 million hectares). These imputed values were not used in analyses of hectare-level responses or for the explainable machine learning. Missing values were imputed by a Monte Carlo approach with 500 iterations for GHG difference estimates and 24 iterations (representing the climate variants) for yield difference estimates. A random value for each iteration was selected from a normal distribution of GHG or yield estimates by region for each crop type, irrigation management and management scenario. We grouped regions according to the IPCC classification used in Roe et al.13: Asia and developing Pacific (ADP); Africa and Middle East (AME); developed (DEV); East Europe and West–Central Asia (EEWCA) and Latin America and Caribbean (LAC).

Soil carbon, direct N2O, indirect N2O emissions and yields were estimated for each iteration, grid cell, crop type, irrigation management and scenario from 2016 to 2100. GHG estimates are the sum of soil carbon storage, direct N2O and indirect N2O emissions (converted to CO2 equivalents) for each year, and GHG mitigation potential for each scenario is the difference between estimates for the crop-land NCS scenario and the continued, conventional practices scenario. Positive GHG values for a crop-land NCS scenario reflect GHG mitigation (or lower emissions) whereas negative values represent higher GHG emissions relative to continued management. Crop yield differences similarly reflect higher or lower crop yields for a crop-land NCS scenario relative to continued management.

We reported data for two time periods, 2016–2050 and 2016–2100, which we refer to as the near term and medium term, respectively. Because of annual variability in GHG and yield responses, annual estimates are reported by dividing the cumulative responses by the number of years since the start of simulation over the time period of interest: 35 (near term) or 85 years (medium term). Reported GHG mitigation estimates are the mean and standard error of the Monte Carlo analysis and yield estimates the mean and standard error of climate variant simulations. Global and regional estimates for each scenario were calculated according to the following equations:

$$\overline{{x}_{r}}=\frac{{\sum }_{i=1}^{n}({d}_\mathrm{{{r}},{{c}},{{m}},{{g}}}\times {h}_\mathrm{{{r}},{{c}},{{m}},{{g}}})}{n}$$
(1)
$${\sigma }_\mathrm{{{r}}}=\sqrt{\frac{{\sum }_{i=1}^{n}{({x}_{\mathrm{{i}},{{r}}}-\overline{{x}_{r}})}^{2}}{(n-1)}}$$
(2)
$$\mathrm{SE}=\frac{\sigma }{\sqrt{N}}$$
(3)

where d is the response variable, h is the number of hectares, r is the region, c is the crop type, m is the irrigation management and g is the grid cell. For equation (3), N is the independent sample size, which we treated as 24 for the number of unique climate variants. These estimates were then used to calculate the 95% confidence interval.

Hectare-level GHG and yield potential estimates are based on crop-area-weighted means and weighted standard deviations and standard error. Weights were calculated as the proportion of hectares for each crop and irrigation management type in a grid cell, giving more weight to responses with a larger share of crop-land area.

Finally, we estimated maximum global and regional GHG mitigation and yield potential based on four possible goals: (1) maximum GHG mitigation potential (ignoring yield impacts), (2) maximum yield potential (ignoring GHG impacts), (3) maximum GHG mitigation potential restricted to scenarios that do not reduce crop yields and (4) maximum yield potential restricted to scenarios that do not increase GHG emissions. Goals 3 and 4 are considered sustainable, and they identify the Pareto-optimal solution (or scenario) whereby GHG mitigation (or crop yield) does not worsen the outcome of the other, that is, crop yield losses or GHG emissions. For each grid cell, crop type and irrigation level, a GHG and yield difference estimate was selected for a single scenario according to the following:

  1. 1.

    Maximum GHG mitigation potential (ignoring yield impacts). For each grid cell, crop and irrigation management type, we selected the scenario that most frequently had the maximum GHG mitigation potential across all Monte Carlo iterations. If GHG was negative, indicating higher emissions, continued management was selected resulting in no change in GHG or yield.

  2. 2.

    Maximum yield potential (ignoring GHG impacts). For each grid cell, crop and irrigation management type, we selected the scenario that most frequently had the maximum yield potential across all climate variants. If yield was negative, indicating lower crop yields, continued management was selected resulting in no change in yield or GHG emissions.

  3. 3.

    Maximum GHG mitigation potential restricted to scenarios that do not reduce crop yields. For each grid cell, crop and irrigation management type, we first constrained available scenarios to ones that resulted in no crop yield differences or higher crop yields across the climate variants. If all scenarios led to lower crop yields, continued management was selected. We then selected from the remaining available scenarios the one that most frequently had the maximum GHG mitigation potential across all Monte Carlo iterations. Again, if GHG was negative, indicating higher emissions, continued management was selected resulting in no change in GHG or yield.

  4. 4.

    Maximum yield potential restricted to scenarios that do not increase GHG emissions. For each grid cell, crop and irrigation management type, we first constrained available scenarios to ones that resulted in no GHG differences or GHG mitigation across the Monte Carlo iterations. If all scenarios led to higher GHG, continued management was selected. We then selected from the remaining available scenarios the one that most frequently had the maximum yield potential across all climate variants. Again, if yield was negative, indicating lower crop yields, continued management was selected resulting in no change in yield or GHG emissions.

Goal-based global and regional estimates were then calculated according to equations (1)–(3). A complete workflow of the uncertainty assessment applied in this study is provided in Supplementary Fig. 8.

Explainable machine learning

Explainable machine learning using SHapley Additive exPlanation (SHAP) values76 was performed to assess how climate, site and management variables contributed to modelled differences in GHG emissions and crop yields between the NCS scenarios and the continued crop-land management scenario. SHAP values were estimated using the R package fastshap (version 0.1.0)77 from random forest models using ranger (version 0.14.1)78. The target variable was one of four classes estimated from the mean cumulative end-of-century GHG emissions and yield differences from the Monte Carlo analysis: both favourable (GHG mitigation and higher yields), mitigation-favourable (GHG mitigation and lower yields), yield-favourable (GHG emissions and higher yields) and both unfavourable (GHG emissions and lower yields). Features in the random forest models were both categorical and continuous including bioclimatic variables, which were the mean response across all 24 GCM climate variants for SSP3–7.0, site variables including initial soil C stocks and soil texture and management variables such as N inputs and initial residue retention rates.

We evaluated random forest model performance with the Out-of-Bag prediction error; prediction error is the percentage of misclassified samples in Out-of-Bag sample (Supplementary Table 1). Random forest models were run with default parameters at 500 trees, mtry of 4 and node size of 10, and models included 24 features. We removed extreme outliers in two features, initial soil mineral nitrogen and soil nitrate stocks, using a cut-off of values <1% and >99% percentiles. Highly correlated features were removed from the dataset before creating the random forest models. SHAP values were estimated across classes and for each individual class with nsim set to 50 Monte Carlo repetitions.

Data analysis and processing

We used the R programming language to automate DayCent model simulations, the uncertainty analyses, data processing and data analysis (version 4.1.2)79. All geospatial datasets were aggregated to a 0.5° × 0.5° spatial resolution using the terra package (version 1.7-46)80. The high performance cluster at the Cornell University Center for Advanced Computing was used to run all DayCent model simulations.

Combined GHG emissions from SOC and N2O emissions were first converted to CO2 equivalents (CO2-eq) using the 100-year global warming potential (GWP100) values from the IPCC Sixth Assessment Report before summing2. We used a C content of 42% to convert modelled grain yield C into dry matter81.

We applied an one-sided differenced bootstrap significance test to evaluate the mean difference in near-term GHG mitigation between grass cover crops with no tillage and legume cover crops with no tillage. A sample distribution for the mean difference was created by randomly selecting 24 mean differences (representative of the independent sample size) from the Monte Carlo distribution with replacement 1,001 times. Significance was evaluated with a one-sided 95% confidence interval.

Data were reprojected to Eckert IV for all map figures. Map figures do not include imputed estimates. Shape files in base maps for all figures use World Bank-approved administrative boundaries82.