Main

Terrestrial ecosystems serve as a land carbon sink that offsets roughly a quarter of the anthropogenically CO2 emissions1. Enhancing the land carbon sink through forest conservation and restoration are regarded as the most cost-effective solutions for climate change mitigation2,3,4, as protected old-growth trees and newly planted trees can remove CO2 from the air and fix it into biomass in their lifetime. Many global and regional initiatives, such as reducing emissions from deforestation and degradation (REDD+)5 and the Trillion Tree Campaign6, are leveraging the carbon fixation ability of trees to slow down climate change. However, successful implementations of forest conservation and restoration initiatives for carbon sequestration, and more broadly the estimation of the land carbon sink, require knowledge on how efficient different types of vegetation fix and store carbon.

Ecological studies have historically used the concept of carbon use efficiency (CUE) to describe the efficiency of vegetation to convert fixed carbon in potential storage. Trees take up carbon through leaf photosynthesis (gross primary productivity (GPP)) and lose a portion of the fixed carbon owing to autotrophic respiration (Ra). The net amount of gross productivity, also referred as the net primary productivity (NPP = GPP − Ra), is then used for biomass production and other secondary products (soluble organic compounds and biogenic volatile organic compounds), and its ratio to GPP is defined as CUE7. Recent studies have updated CUE to biomass production efficiency (BPE), which strictly refers to the ratio of biomass production (BP = GPP − Ra—other secondary carbon products) and GPP, although at the annual scale CUE and BPE vary similarly considering that BP is the major fraction of NPP8. Given the same rate of photosynthetic assimilation, a higher CUE or BPE often indicates faster accumulation of biomass. Previous studies have suggested that global CUE converges to a constant around 0.47 with little variation (±0.04; mean ± s.d., n = 12)9, although this notion has been challenged recently10, as regional and global syntheses have reported that CUE or BPE varies considerably due to physical (climate and soil fertility)8,11, biological (biome type and stand age)7,12,13,14 and management15,16 factors. While these studies reveal substantial variation in CUE, they differ in the inferred responses of CUE to temperature11,17,18,19, dryness11,20,21 and biome types7,8,12,15, leading to conflicting interpretations of the global variation in CUE.

This lack of understanding of CUE variation hinders our ability to predict the land carbon sink20,21,22,23 and calls into question whether current conservation and reforestation practices are optimal for mitigating climate change. Part of the difficulty stems from the scarcity of direct CUE observations, as globally we only have ~300 CUE records to represent its variation so far11,15, mostly because of the difficulties related to continuous measurements of NPP or Ra. In this study, we propose to fill the knowledge gap by leveraging distributed and high-frequency carbon flux observations of the land surface from a global network of eddy covariance towers. On the basis of the strong coupling between plant photosynthesis and respiration and recent theoretical advances in estimating Ra (ref. 14), we develop a method to estimate CUE for >2,700 site-years of observations across diverse biomes and climates worldwide (Methods). Specifically, our method assumes that the rate of change in ecosystem respiration relative to photosynthesis during growing seasons contains information on CUE (Supplementary Fig. 1). We used independent estimates of CUE and BPE from inventory or biometric measurements (all referred to as CUEIN)11,15, site measured NPP, BP11,15 and heterotrophic respiration24, and the basal area increment (BAI) of trees estimated from the international tree-ring data bank (ITRDB)25 to validate the CUE inferred from eddy covariance observations (CUEEC). The newly derived CUEEC dataset enables us to examine the variation in CUE and its dependence on biotic and abiotic factors. Using CUEEC, we further map the spatial variability of global CUE, which is a critical element in guiding best practices of forest conservation and restoration for effective and efficient offset of CO2 emissions.

Results

Vegetation CUE derived from eddy covariance observations

Across biomes and years, the CUE derived from eddy covariance observations (CUEEC) was 0.43 ± 0.12 (mean ± s.d.), comparable to the CUE and BPE estimated on the basis of inventory data (CUEIN), which was 0.47 ± 0.12 (Fig. 1a). When comparing CUEEC and CUEIN across nine sites that have both records, we found that the two metrics were similar at seven sites, aligning closely along the 1:1 line (Fig. 1b). CUEEC was available for both forest biomes (n = 1,365) and non-forest biomes (n = 1,372). The CUEEC of forest biomes was 0.41 ± 0.11, significantly lower (P < 0.01; one-way analysis of variance (ANOVA)) than the CUEEC of 0.45 ± 0.12 for non-forest biomes (Fig. 1a). This was consistent with CUEIN, which also showed a significantly (P < 0.01) lower CUEIN for forests (0.46 ± 0.12) than for non-forests (0.53 ± 0.15). Among the non-forest biomes, which include grasslands (GRA), wetlands (WET), shrublands (SH), savannas (SAV) and croplands (CRO), SAV had significantly lower (P < 0.01) CUEEC (CUEEC = 0.32 ± 0.12) than other non-forest biomes (CUEEC = 0.46 ± 0.11).

Fig. 1: CUE derived from eddy covariance observations.
figure 1

a, CUE of different PFTs, which include EBF, ENF, DBF, MF, SH, SAV, GRA, WET and CRO. CUEEC represents the CUE derived from eddy covariance observations; CUEIN represents the CUE derived from inventory data. The letters over each bar indicate whether the groups are significantly different (P < 0.01; one-way ANOVA). The number under each bar means the number of samples. In the boxplot, the centre line represents the median, the circle within the box indicates the mean, the limits of the box indicate the 25th and 75th percentiles and the whiskers represent the 10th and 90th percentiles. b, Comparison of CUEEC and CUEIN at nine sites: Harvard Forest (US-Ha1, DBF), Morgan Monroe State Forest (US-MMS, DBF), University of Michigan Biological Station (US-UMB, DBF), Willow Creek (US-WCr, DBF), Turkey Point TP02 (CA-TP1, ENF), Turkey Point TP89 (CA-TP2, ENF), Turkey Point TP74 (CA-TP3, ENF), Turkey Point TP39 (CA-TP4, ENF) and Hyytiälä (FI-Hyy, ENF); the dark dots indicate the seven sites where CUEEC agrees with CUEIN, the light dots indicate the two sites where CUEEC is different from CUEIN. The r and P values indicate the correlation between CUEEC and CUEIN excluding the two outlier sites, while the value in brackets indicate the correlation between CUEEC and CUEIN including the two outlier sites. The error bars represent 1 s.d. of the CUEEC and CUEIN for each site. c, The paired basal area increment (pBAI) of DBF and ENF (Methods). pBAI is the ratio of the BAI of deciduous trees and evergreen trees at the same locations. The shaded area indicates 1 s.d. of the pBAI values under each stand age. The pBAI is indicative of the relative advantage of DBF to ENF in terms of carbon use for stem growth.

Source data

Among forest biomes, we noticed that the CUEEC of deciduous broadleaf forests (DBF) was 0.46 ± 0.12, significantly higher (P < 0.01) than other forest types, such as evergreen needleleaf forests (ENF; CUEEC = 0.39 ± 0.10) and evergreen broadleaf forests (EBF; CUEEC = 0.32 ± 0.12). In total, CUEEC of DBF was 15% greater than that of evergreen forests. Across all biomes, CUEEC of DBF was among the largest, trailing slightly behind those of CRO (CUEEC = 0.50 ± 0.09). Inventory data also showed a relatively higher CUEIN of DBF (CUEIN = 0.49 ± 0.13) than other forest types (CUEIN = 0.45 ± 0.11), although the difference was not statistically significant, unlike the difference in CUEEC.

We further used an independent metric—paired basal area increment (pBAI) of deciduous trees and evergreen trees at the same site—as a proxy for the relative difference in CUE between DBF and ENF (‘Quantify relative CUE from pBAI’ in the Methods). A pBAI >1 indicates that the CUE of DBF is greater than that of ENF. Despite the wide range of pBAI estimates, especially for young stands, the average of pBAI was consistently >1 across all stand ages, further confirming that the CUE of DBF is greater than that of ENF. Across all stand ages, pBAI averaged to 1.09, suggesting that the CUE of DBF for stem was 9% greater than ENF, which was consistent with the relative difference in CUEEC between DBF and ENF.

The drivers of variation in vegetation CUE

Using the newly derived CUEEC estimates, we applied a linear mixed effect (LME) model to quantify the impacts of various biotic and abiotic factors on CUE (Methods). We found that CUE decreased significantly (P < 0.01) with mean annual air temperature (MAT) and insignificantly (P > 0.01) with mean annual precipitation (MAP) and mean annual photosynthetically active radiation (PAR). CUE also decreased with stand age (P = 0.57), although its impact was not significant (Fig. 2a–e). One important contributor to CUE variability is the inherent difference in baseline CUE associated with each biome, revealing that CRO, DBF and GRA exhibit greater CUE compared to other plant functional types (PFTs) (Fig. 2e). In particular, the baseline CUE of DBF was 0.10 and 0.08 higher than that of EBF and ENF, respectively, accounting for almost all the observed differences in CUE between DBF and other forest types (Fig. 2e).

Fig. 2: The responses of CUE to abiotic and biotic factors.
figure 2

a,e, The biotic factors include stand age (a) and plant functional types (PFTs) (e). bd, The abiotic factors include mean annual air temperature (MAT; °C) (b), mean annual photosynthetically active radiation (PAR; W m−2) (c) and mean annual precipitation (MAP; mm year−1) (d). e,f, The effect of PFTs on CUE (e) and the relative importance of the abiotic and biotic factors in determining CUE using a random forest approach (f). The shaded areas in ad represent the 95% confidence interval of the CUE responses to explanatory factors in the LME, while the error bars in e represent 1 s.d. of the random effects for each PFT in LME. The error bars in f represent 1 s.d. of the variable importance obtained by bootstrapping CUEEC 1,000 times in the random forest.

Source data

Understanding autotrophic respiration through CUE

The newly derived CUEEC allows the calculation of NPP from GPP and further enables us to partition the ecosystem respiration (Reco) into Ra and heterotrophic respiration (Rh) at eddy covariance sites. We found that for all PFTs, areas sampled by eddy covariance sites act as carbon sinks, with GPP exceeding Reco by 10–50% on average (Fig. 3a). Upon partitioning Reco into Ra and Rh components, our analysis revealed that Rh was typically smaller than Ra at eddy covariance sites. On site-year average, Rh was about 39 ± 10% (mean ± 1 s.d.), 31 ± 11%, 45 ± 10%, 41 ± 10%, 45 ± 11%, 32 ± 12%, 44 ± 10%, 45 ± 11% and 50 ± 9% of Reco for ENF, EBF, DBF, mixed forests (MF), SH, SAV, GRA, WET and CRO, respectively. The estimated NPP and Rh were also validated against independent observations and showed reasonable accuracy (Supplementary Fig. 2).

Fig. 3: Using CUE to improve the understanding of respiration components and processes across flux sites.
figure 3

a, Partitioning ecosystem respiration (Reco) into autotrophic respiration (Ra) and heterotrophic respiration (Rh). The panel presents annual GPP, Ra and Rh normalized by Reco. The raw unit for carbon fluxes is g m−2 year−1. The number above each group of bars indicates the number of site-years for that PFTs. The error bars represent 1 s.d. of the variables, calculated from the values of all site-years within each PFT. bd, The relationships between CUE and the key parameters for Ra, including the coefficient of growth respiration (gR; unitless) (b), the fraction of live biomass turned into non-respiratory biomass per unit time (τ; unitless) (c) and the temperature sensitivity of the maintenance respiration (Ea (eV)) (d). The shaded areas indicate 1 s.d. of the fitting by bootstrapping CUE, gR, τ and Ea 1,000 times according to their uncertainties.

Source data

We also examined the relationship between CUE and the key parameters determining Ra (ref. 14) (Methods), including the coefficient of growth respiration (gR; unitless), the fraction of live biomass turned into non-respiratory biomass per unit time (τ; unitless) and the temperature sensitivity of maintenance respiration (Ea (eV)). These parameters were concurrently derived alongside CUE using our method (Methods). We found that CUE was negatively related to gR, while gR was constrained in a narrow range from 0.2 to 0.22. With a 0.01 increase in gR, CUE decreased by 0.15. Meanwhile, τ, a factor determining how fast live biomass turns into non-respiratory materials (sapwood to hardwood), was positively related to CUE. The parameter τ predominantly falls within the range of 0.7 to 0.9 across various PFTs. Swift turnover, indicated by a higher τ, suggests a reduced respiratory cost to maintain live biomass, leading to a corresponding increase in CUE. Finally, we observed that the average site-year Ea was 0.28 ± 0.14 eV. When Ea was <0.50 eV (93% of the cases), there was a negative relationship between Ea and CUE; however, the negative relationship disappeared when Ea was >0.50 eV. This indicates that in most site-years when maintenance respiration is more responsive to temperature (greater Ea), more carbon is probably lost through maintenance respiration, thereby reducing CUE.

The global distribution of vegetation CUE

We generated a global map of CUE by using a random forest model trained by CUEEC (Methods). Our estimates of CUE ranged from 0.3 to 0.6, with a clear latitudinal gradient—increasing from the tropics to high latitudes (Fig. 4a). For >50% of the global vegetated land surface, CUE converged in a relatively small range between 0.4 and 0.5. Meanwhile, the CUE derived from an ensemble of dynamic global vegetation models (DGVMs) exhibited a similar latitudinal gradient, although with a broader distribution, ranging from 0.3 to 0.7 (Fig. 4c and Supplementary Fig. 3). Notably, the CUE estimated from DGVMs showed substantial spread across models (Fig. 4d and Supplementary Fig. 6), with an average uncertainty of 0.15 (1 s.d. of DGVMs CUE) and the largest uncertainty of 0.21 in the high latitudes. Meanwhile, the uncertainty from CUE estimated from the random forest was spatially homogeneous and generally <0.10 (Fig. 4d).

Fig. 4: The global distribution of CUE and its uncertainty.
figure 4

ad, The global mean CUE estimated by a random forest model trained on CUEEC (a) and its associated uncertainty (c); the global mean CUE (b) and its uncertainty (d), calculated from the CUE estimates from an ensemble of DGVMs in the TRENDY v.9 project. The uncertainty of CUE in c is defined as 1 s.d. of 1,000 random forest runs, with a bootstrapped CUE value in each run. The uncertainty of CUE in d is defined as 1 s.d. of the CUE estimated from DGVMs.

Discussion

In this study, we leveraged a global network of eddy covariance sites to estimate CUE and revealed that CUE averaged 0.43 ± 0.12 across 2,737 site-years. We then analysed the biotic and abiotic factors responsible for the global variation in CUE. Among these factors, PFTs emerged as the pivotal contributor to CUE variability, with deciduous broadleaf forests exhibiting notably higher CUE compared to other forest ecosystems. Meanwhile, non-forest biomes have greater CUE than forest biomes. We found consistently negative impacts of temperature, radiation, rainfall and stand age on CUE. On the basis of the CUE we obtained from eddy covariance data, we further partitioned ecosystem respiration into heterotrophic and autotrophic components, and estimated the global pattern of CUE. Our study provides crucial insight into the global variation in vegetation CUE, which can greatly support our estimates of the land carbon sink and its optimization through land management for climate change mitigation.

Differences in CUE between PFTs

Previous studies have examined whether and how CUE varies by PFT. While some found no discernible distinction in CUE among PFTs15, others identified substantial differences10. However, these studies have not reached a consensus regarding the specific effect of PFTs on CUE. For instance, temperate deciduous forests have been reported to have either greater CUE7,10 or lower CUE12 than boreal forests. Importantly, our findings indicate that DBF exhibits a significantly (P < 0.01) higher CUE than other forest ecosystem types.

From a physiological perspective, the higher CUE was jointly determined by lower gR and higher τ (Supplementary Fig. 4). The lower gR means that the carbon cost of DBF building new tissues and components is less than other forests. This difference may manifest in the form of a lower leaf mass per area in DBF compared to other forest ecosystems26,27. Meanwhile, higher τ indicates that there is less live biomass to incur maintenance respiration for DBF than for other biomes.

Our study also reported significantly (P < 0.01) higher CUE for non-forests than for forests, except the low CUE of the C4 grass-dominated savanna ecosystems. The generally higher CUE of non-forests is probably caused by the low carbon stocks and fast turnover rate of their live biomass (Supplementary Fig. 4) and thus less maintenance respiration. CRO has the highest CUE, reflecting the selection of productive varieties for efficient biomass accumulation (crop yield) and better management15,28. Consistent with our results, a study using an emergent constraint approach to estimate CUE also reported greater CUE (0.55) for CRO than forests12. Interestingly, we found that SAV demonstrates significantly (P < 0.01) lower CUE than other herbaceous species (Fig. 1a), potentially reflecting the high energy and carbon costs of C4 photosynthesis pathway compared to the more prevalent C3 pathway in temperate grasslands. Some leaf-level studies indicated that C4 plants show higher dark respiration rates than C3 plants29, indicating a higher respiratory need for C4 and therefore a lower CUE. This high demand of respiration of C4 vegetation is also indicated by the great gR values of savanna ecosystems (Supplementary Fig. 4). Meanwhile, we also hypothesize that the warm and dry conditions associated with the growth of C4 grasses30,31 may suppress photosynthesis and enhance respiration, potentially resulting in a lower CUE for savanna ecosystems compared to other ecosystems.

Climate, soil and stand age impacts on CUE

Other than PFTs, we found that CUE is also impacted by climate (temperature, precipitation and light) and stand age. CUE derived from eddy covariance exhibited a linear decline with stand age, although this trend was not statistically significant (P = 0.57). This finding aligns with outcomes reported in earlier research studies7,11. However, we noted that a linear model might not be the most suitable way to describe the CUE–stand age relationship, since our estimated CUE generally increased with stand age up to ~30–50 years and then gradually decreased after the peak (Fig. 2a). A few studies have noted this nonlinearity in the CUE–stand age relationship11,13,32. Considering that GPP is not as sensitive to stand age as NPP33, we hypothesize that the nonlinear association between CUE and stand age suggests an initial rise in NPP with increasing stand age followed by a subsequent decline as in Odum’s ecological theory. In fact, this pattern has previously been documented for the primary tree species in the continental USA34.

Previous studies using survey data have reported a positive effect of soil N content on CUE8,19, as nutrient limitation may hinder photosynthesis and biomass production or demand stronger metabolic activities (respiration) for nutrient acquisition. In our attribution analysis (Fig. 2), we did not include soil N as an explanatory factor, as the soil N extracted from Soilgrids for eddy covariance sites demonstrated large biases when validated against the observed soil N (Supplementary Fig. 5) and the statistical performance of our attribution model declined after including soil N (Supplementary Table 1). If we indeed include soil N in our attribution analysis (Supplementary Fig. 6), a positive effect of soil N on CUEEC was detected, consistent with the previous understanding. The increase of CUE with soil N also seems nonlinear, with CUE steeply increasing with soil N under low nitrogen conditions and plateauing when soil N is >0.02 gN kg−1 (Supplementary Fig. 6f).

The negative impacts of climate variables on CUE collectively signify that under warmer, wetter and brighter conditions, a greater portion of photosynthate is lost through Ra (Fig. 2). Increasing temperatures are expected to enhance Ra, whereas their effect on GPP can be either positive or negative. If there is a decline in GPP, or if the rise in Ra outpaces that of GPP, it is plausible that rising temperatures could result in a decrease in CUE17. It is worth noting that the response of CUE to temperature we observed differs from those reported in a recent synthesis11. We speculate that this difference may arise from three potential factors as follows.

  1. (1)

    The previous synthesis omitted PFTs as an explanatory variable. Given that PFTs are influenced by climate variations and have an important role in CUE dynamics, the absence of PFTs could potentially alter the CUE–MAT relationship. For example, we attributed the low CUE in the boreal zone to the presence of ENF, which has a lower CUE baseline than temperate DBF (Fig. 2f). However, if ENF is not accounted for in our attribution analysis, the low CUE in the boreal zone may be attributed to low temperature, thus resulting in a positive CUE–MAT relationship.

  2. (2)

    The shift of CUE–MAT relationship with MAT. We analysed the univariate regressions of CUEIN and CUEEC against MAT and found that both CUEIN and CUEEC exhibit a change in their relationship with MAT around 10 °C (Supplementary Fig. 7). Specifically, when MAT is <10 °C, CUEIN increases with MAT, and CUEEC only slightly decreases with MAT. However, when MAT is >10°C, both CUEIN and CUEEC decrease rapidly with MAT. The shift indicates that the positive effect of MAT on CUEIN reported in the previous synthesis may primarily arise from the CUE records obtained under low temperatures. The shift in CUE–MAT relationship around 10 °C, identified by both CUEIN and CUEEC, merits further examination (Supplementary Fig. 7).

  3. (3)

    Thermal acclimation of respiration. Experimental evidence has suggested that thermal acclimation—the declines of base respiration rate or the temperature sensitivity of respiration with the increase of growth temperature—could impact CUE35,36. Although the exact effect of growth temperature on CUE is still subject to debate, it is conceivable that CUE may increase with MAT if GPP increases with growth temperature while the increase of Ra is mitigated by thermal acclimation37. This seems to be consistent with our point (2) above where the increase of CUE with temperature is noted predominantly within the low temperature range. Additionally, we found that Ea (the temperature sensitivity of maintenance respiration) decreases with MAT (Supplementary Fig. 7c), providing further evidence for the existence of temperature acclimation of respiration across global eddy covariance sites.

Water availability directly impacts GPP, as water deficits lead to the downregulation of stomatal openness38 and light use efficiency39. However, the decrease in GPP may not translate into a decrease in NPP or CUE during droughts20. In fact, forests are likely to experience a decrease in biomass residency time (leaf shedding and high tree mortality rate) and then a decrease in Ra, ultimately leading to a CUE increase during droughts20. This mechanism offers a plausible explanation for the negative relationship observed between CUE and precipitation (Fig. 2) and is in line with the notably lower τ values found in the case of wet EBF compared to other forest types (Supplementary Fig. 4).

The influence of light on CUE has received less attention. Nonetheless, the synthesis study we mentioned11 did find a positive correlation between latitude (equivalent to a negative effect of light availability) and CUE. One plausible mechanism is that leaves tend to incur higher construction costs in regions with greater light intensity40, which can result in higher gR and consequently lower CUE. In alignment with this mechanism, our investigation revealed that EBF and SAV in low-latitude regions exhibit significantly higher gR values (P < 0.01) compared to the global average (Supplementary Fig. 4).

Interpretation of CUE derived from eddy covariance

Our method of deriving CUE assumes that there is a strong coupling between Reco and GPP (Supplementary Fig. 1), and within a short period of similar temperature in the growing season, variations in Reco are predominantly influenced by changes in Ra. This hypothesis is based on a series of studies suggesting a temporal mismatch in the variation of Ra and Rh (refs. 41,42,43), owing to the difference in the substrates and their decomposition rates (for example, Q10). Specifically, Ra uses recent photosynthetic assimilates and is tightly coupled to GPP, while Rh draws substrates from various sources, including plant residues or soil organic matter, which typically change more slowly than photosynthesis. Therefore, changes in RaRa) are more likely driven by changes in GPP (ΔGPP) whereas changes in RhRh) are more likely driven by Q10 and temperature41, supporting the assumption underlying our method. However, we acknowledge a key caveat in our assumption: certain Rh fluxes, such as rhizomicrobial respiration or priming effect (exudes from roots that increases substrate availability for Rh), also use recent photosynthetic assimilates44. These rapid changing Rh fluxes could lead to an overestimation of Ra and an underestimation of NPP and CUE (Methods). Nevertheless, the actual magnitudes of rhizomicrobial respiration and priming effect remain largely unknown across ecosystems. On the basis of our validation against in situ biometric observations (Supplementary Fig. 2), we suspect that the underestimation of CUE induced by these factors was minor and not widespread.

Given that we initially determined Ra and then subtracted it from GPP to represent NPP, our NPP estimates encompass not only the carbon available for building biomass but also the carbon used in non-structural and secondary carbon compounds (reserves, root exudates and biogenic volatile organic compound). Those secondary carbon products might use a considerable part of photosynthate45,46. Therefore, our estimate of CUE (NPP/GPP) would be greater than the narrowly defined BPE10. With the strong dependence of maintenance respiration on temperature17, it is also anticipated that CUE will exhibit seasonal fluctuation. While our method allows the derivation of seasonal CUE (Supplementary Fig. 8), we did not explore its variation as we lack independent observations to validate seasonal CUE, and the environmental dependences of CUE at the seasonal scale might be different to those at the interannual scale. We, however, have included the seasonality as a part of the uncertainty in the final annual CUEEC estimates (‘The workflow to estimate CUE from eddy covariance observations’ section in Methods; Supplementary Table 2). Another source of uncertainty in our method, the choice of temperature and time window lengths, was found to have minimal impact on our CUE estimates (Supplementary Fig. 9).

During our validation of CUEEC against CUEIN, we observed good agreement between the two metrics at seven out of the nine sites (Fig. 1b). Nevertheless, at one site (CA-TP2), we observed that CUEEC was considerably greater than CUEIN, and at another site (US-MMS), CUEEC was lower than CUEIN. CA-TP2 is one of the pine plantation sites (CA-TP1, CA-TP2, CA-TP3 and CA-TP4) distributed along a stand age gradient in southern Ontario, Canada. The stand ages for the four sites were 5, 18, 33 and 68 in 200747. Our derived CUEEC for the four sites were 0.61, 0.58, 0.47 and 0.31, respectively, demonstrating a clear negative CUE–stand age relationship. Meanwhile, CUEIN for the four sites were 0.57, 0.31, 0.51, 0.44, with CA-TP2 deviating from the negative CUE–stand age relationship. Given the similar species, climate, topography and soil conditions across the four sites, with stand age being the primary differing factor, we posit that our estimate of CUEEC at CA-TP2 might provide a more accurate reflection of CUE than CUEIN. US-MMS is a diverse deciduous forest (including 29 tree species) in southcentral Indiana, USA48. The CUEIN reported there was 0.70 and the CUEEC was 0.45. We noted that the biometric measurements for CUEIN derivation were mostly conducted in plots within 200 m of the tower49, while the footprint climatology of the US-MMS tower extending to 500 m in daytime, especially the southwest fetch was under-represented by the plots50. Considering the diverse species composition and variable topography of this site, the discrepancy in spatial representativeness between the plots and flux footprint could potentially account for the disparity between CUEIN and CUEEC.

Implications for land carbon sink

Land carbon sink has been increasing proportionally to human emissions of CO2 and playing a critical role in climate change mitigation. The increase in land carbon sink has been largely attributed to elevated CO2 (refs. 51,52), changes in other environmental variables53 and natural regrowth after disturbance54,55,56, meanwhile, the effect of shift in vegetation biomes on carbon uptake has been rarely studied, although large-scale biome shifts have been reported as the expansion of deciduous trees in boreal zones57 and savannas58. Our study indicates that the shift in biome types would potentially impact land carbon cycle by changing CUE—a synthesis of coupled model intercomparison project phase 5 and phase 6 models suggested that CUE accounts for roughly 15% of the variance in global land carbon storage23. Moreover, the disparities in CUE among biomes carry great implications for large-scale tree-planting initiatives and forest management strategies. While the restoration of native trees and natural forest regrowth is advocated as the optimal approach for forest restoration59,60, our findings suggest that we could enhance the carbon use and the land carbon sink more effectively by prioritizing native DBF species given a fixed amount of GPP, particularly in regions where deciduous and evergreen trees coexist. To evaluate the effect of CUE on the carbon sink, we considered a hypothetical scenario in which MF and ENF, along with their CUE, were replaced by those of DBF. Using a gridded global GPP and Reco product as the reference 61, we estimated that this scenario would lead to a 0.18 PgC year−1 increase in global NPP and a 0.11 PgC year−1 increase in the carbon sink. This corresponds to ~10% of annual net emissions from land-use sectors1.

In summary, our study offers a timely and critical insight into the variability of global CUE. It advances our understanding of the terrestrial carbon cycle by using a new method to derive advanced ecological metrics (for example, CUE) from eddy covariance data. It offers valuable guidance for the implementation of reforestation, afforestation and conservation plans aimed at optimizing the potential of the land carbon sink.

Methods

Theoretical basis to estimate CUE from eddy covariance observations

We used eddy covariance observations of CO2 flux between ecosystems and the atmosphere provided in the standard FLUXNET2015 dataset62, as well as the flux data acquired from Ameriflux and ICOS, processed by the FLUXNET2015 pipeline (Supplementary Table 3 and Supplementary Fig. 10a). The pipeline to produce the data includes the quality control of raw flux data and the partitioning of the net CO2 exchange into GPP and Reco using daytime partitioning method (GPP_DT_REF and Reco_DT_REF) or night-time partitioning method (GPP_NT_REF and Reco_NT_REF). In the main analysis, we used the average of GPP_DT_REF and GPP_NT_REF as the GPP time series and the average of Reco_DT_REF and Reco_NT_REF as the Reco time series. This step is meant to avoid the potential uncertainty incurred by the choice of partitioning method.

According to the definition of CUE, CUE = (GPP − Ra)/GPP = 1 − Ra/GPP. Between any two timestamps, we can obtain

$$\Delta {R_{\rm{a}}}=(1-{\rm{CUE}})\times \Delta {\rm{GPP}}$$
(1)

in which we assume CUE to be constant between the two timestamps. Meanwhile, we further expanded autotrophic respiration (Ra) to growth respiration (RG) and maintenance respiration (RM). Parameter RG is the cost of producing new biomass (G) and RM is the cost of maintaining the existing biomass (Wlive). Therefore, RG and RM can be further expanded to the product of their respective respiration coefficients (gR and mR) and biomass amounts (G and Wlive)14, leading to equation (2) below:

$$R_{\rm{a}}={{R}}_{{\rm{G}}}+{R}_{{\rm{M}}}={{g}}_{{\rm{R}}}{G}+{{m}}_{{\rm{R}}}{W}_{{\rm{live}}}$$
(2)

Substituting equation (2) into equation (1), the change in Ra between two timestamps can be described as ΔRa = gRΔG + mRΔWlive. The change in new biomass growth (ΔG) is proportional to the change in NPP between two timestamps (ΔNPP = ΔGPP × CUE). The changes in live biomass (ΔWlive) that needs maintenance are proportional to the accumulated NPP between the two timestamps ΔcNPP = ΔcGPP × CUE × (1 − τ), where cNPP and cGPP are the cumulative NPP and GPP from day one of the year, respectively. Parameter τ represents the fraction of live biomass turned to non-respiratory biomass per unit time. Therefore, we can adjust the equation (2) to

$$\Delta R_{\rm{a}}={{g}}_{{\rm{R}}}\times {\rm{CUE}}\times \Delta {\rm{GPP}}+{{m}}_{{\rm{R}}}\times {\rm{CUE}}\times \Delta {\rm{cGPP}}\times (1-\tau )$$
(3)

Note that in equation (3), we assume that all ΔGPP × CUE between the two timestamps is used for growth, neglecting the carbon allocated for root exudation or symbiosis of mycorrhizal. This assumption would not affect the estimation of CUE, but may lead to an overestimation of gR if carbon allocated for non-growth purposes is substantial between the two timestamps.

Since mR has a strong temperature dependence17 and changes substantially within a year, to facilitate the comparison between site-years we further include the temperature sensitivity of maintenance respiration (Ea in eV, the Arrhenius activation energy for respiration) into equation (3):

$$\Delta R_{\rm{a}}={g}_{\rm{R}}\times {\rm{CUE}}\times \Delta {\rm{GPP}}+{{\rm{e}}}^{(-{{E}}_{{\rm{a}}}/{{k}}_{{\rm{B}}})(1/{T}-1/{{T}}_{0})}\times {{m}}_{{\rm{R}}0}\times {\rm{CUE}}\times \Delta {\rm{cGPP}}\times (1-\tau )$$
(4)

where mR0 refers to the baseline mR of the site-year, T0 (in K) is the temperature corresponding to mR0, T is the mean temperature of the temperature bin and kB = 8.617333262 × 10−5 eV K−1 is the Boltzmann constant.

Considering the differences in the mechanisms (for example, substrates and decomposition rate) of Ra and Rh, and the consequent differences in the time scales of their changes41,42,43, we suggest it is feasible to disentangle the signals of ΔRa and ΔRh in ΔReco by meticulously organizing the data by time frames and temperature (Discussion).

Specifically, we used two criteria to organize daily flux data for the derivation of CUE: (1) we grouped flux observations using windows of 5 consecutive days in the growing season and (2) we grouped flux observations by daily mean temperature (the difference in temperature) should be <1 °C and remove the days with rainfall >2 mm day−1 to avoid sudden pulses of respiration after rewetting the soil (for example, Birch effect)63,64. The criteria are to ensure there are limited changes in soil organic matter (most soil organic matter has turnover time longer than a week) and no changes in Rh under similar temperatures, which allow us to assume ΔRa = ΔReco in these refined data groups.

The workflow to estimate CUE from eddy covariance observations

Following the theoretical basis above, we designed the following workflow to estimate CUE, gR, τ and Ea for each site-year (Supplementary Fig. 8), using MATLAB (v.2021b) as follows.

Step 1. Organize daily flux data in the growing season into groups of 5 consecutive days. Within each 5-day window, we obtained ΔGPP, ΔRa and ΔcGPP between every 2 d. For example, if in a 5-day window the daily GPP are 1.1, 1.2, 1.3, 1.4, 1.5 g m−2 day−1, then the ΔGPP between day 5 and day 1 would be 1.5 − 1.1 = 0.4 and the ΔcGPP between day 5 and day 1 would be (1.1 + 1.2 + 1.3 + 1.4 + 1.5) − 1.1 = 5.4.

For each time window, we got ~20 samples of ΔGPP, ΔRa and ΔcGPP. The growing season was defined as when daily GPP was >20% of the maximum daily GPP of the site-year.

Step 2. Insert the data obtained in step 1 to equations (1) and (3), and estimate CUE, gR, τ and mR for each 5-day time window. In this step, since we have 20 samples and 4 unknown parameters, we used the Markov Chain Monte Carlo (MCMC) method65 to estimate the optimal values of CUE, gR, τ and mR for each 5-day time window. When running MCMC, the prior of variables for CUE, gR, mR and τ were set within the ranges of (0, 1), (0.15, 0.3), (0.0, 0.5) and (0 1), respectively.

Step 3. Obtain mR0 and T0 for the site-year, where mR0 is the bottom tenth percentile of the mR values of all 5-day windows in the site-year (from step 2). T0 is the mean temperature corresponding to mR0. We used the bottom ten percentile to avoid obtaining an extreme mR0 value which is not representative of the ecosystem.

Step 4. Obtain gR and τ for the site-year, where gR and τ are the average gR and τ of all 5-day windows (from step 2). The step aligns with the observations that gR has a small range based on the construction cost of key chemical compounds in plants and should have limited seasonality40,66. There were no previous reports on the seasonality of τ, therefore we assume that it is also a constant for each site-year.

Step 5. Organize daily flux data by temperature and remove the days with precipitation >2 mm. Within each temperature bin (the range of temperature is <1 K), we obtained ΔGPP, ΔRa and ΔcGPP between every 2 days. For each temperature bin, we got ~20–200 samples of ΔGPP, ΔRa and ΔcGPP.

Step 6. Insert the data obtained in step 5, the mR0 and T0 obtained in step 3, and the gR and τ obtained in step 4, to equations (1) and (4), and estimate CUE and Ea for each temperature bin. In this step, since we have 20–200 samples and two unknown parameters, we used MCMC again to estimate the optimal values of CUE and Ea for each bin. When running MCMC, the prior values for CUE and Ea were set within the ranges of (0, 1) and (0.2, 1.5), respectively.

Step 7. Obtain CUE and Ea for the site-year, where CUE and Ea are the average CUE and Ea of all bins from step 6. We removed CUE and Ea estimates from those bins where the estimates were similar to the priors in MCMC, meaning CUE and Ea were not properly constrained. We also acknowledge that while our method provides seasonality of CUE and Ea, we only examined CUE and other parameters at the annual time scale in this study. In total, we obtained 2,737 site-years of CUE record for our analysis. We also got the average CUE for each site for validation (Supplementary Table 2).

Uncertainty

As we only reported annual average CUE of each site-year or site, the main sources of uncertainty for CUE include: (1) the temporal (seasonal and annual) variation in CUE and (2) the posterior distribution of CUE from MCMC. We used a bootstrapping approach to combine the uncertainties incurred by both sources and aggregated them into one uncertainty value for each site-year or site (Supplementary Table 2). Meanwhile, we tested alternative time and temperature windows in MCMC analysis and found they have limited impacts on the CUE derived (Supplementary Fig. 9).

Ground observations for validation

To validate the CUE derived from eddy covariance (CUEEC) and the associated carbon fluxes, we used multiple independent datasets for validation, including (1) site CUE and BPE obtained from inventory or biometric methods11,15, (2) site NPP and BP obtained from inventory or biometric methods11,15, (3) the ITRDB25 and (4) site Rh from open-source continuous soil respiration database (COSORE)24 and global soil respiration database (SRDB)67.

In dataset (1), we obtained 244 records of CUE and BPE from forest sites11 and 57 records of CUE and BPE from non-forest sites15. These studies used inventory or biometric methods to estimate NPP and BP, which were further used to estimate CUE and BPE (all referred to as CUEIN in this study). Meanwhile, the GPP measured at those sites were obtained from various types of methods (biometric, eddy covariance and site-parameterized modelling). In most cases, we used GPP measured by biometric methods, if they are available, for the derivation of CUEEC. We found that there were 12 sites with collocated biometric and eddy covariance observations. The validation based on dataset (1) was presented in Fig. 1a,b.

In dataset (2), we investigated the same data source as dataset (1) but only used their NPP and BP records. These values are used to assess the robustness of NPP estimated based on CUEEC. Specifically, after deriving a global map of CUE from CUEEC using machine learning, we estimated NPP at those site-years in dataset (2) by multiplying extrapolated CUEEC and the site measured GPP. Then we compared NPPEC against the measured NPP and BP. The validation based on dataset (2) was presented in Supplementary Fig. 2.

In dataset (3), we acquired 15,890 of tree-ring records from 4,479 sites globally. Since our target to assess the relative growth rate of deciduous trees (DB) and evergreen trees (EV) under the similar climate, we only selected those sites with both DB and EV in proximation. At the end we used 1,095 tree-ring records (Supplementary Fig. 10b). Please see ‘Quantify relative CUE from pBAI’ section in the Methods on using tree-ring data to derive relative CUE. The validation based on dataset (3) was presented in Fig. 1c.

In dataset (4), we use the Rh observations from the COSORE and the SRDB datasets. The Rh was usually obtained by combing measurements from chamber, isotopic, girdling, trench or root exclusion approaches. For COSORE, we estimated annual Rh by multiplying reported Rh values with the length of year. SRDB provided site-level annual Rh. The validation based on dataset (4) was presented in in Supplementary Fig. 2a.

Quantify relative CUE from pBAI

In parallel to CUE, the carbon accumulation rate of individual trees is assessed by another metric—BAI. BAI is indicative of NPP, as NPP allocated to stem, NPPstem, can be estimated by scaling BAI with stem density (number of trees per unit area), wood density and tree height. Similar to CUE, many studies have examined BAI and its associated changes in biomass production with abiotic (temperature68 and droughts69) and biotic factors (stand age70, tree size71, mortality72 and forest types73), however, they often had a regional focus and thus lack global implications. Following these studies, we have

$${{\rm{NPP}}}_{{\rm{stem}}}={\rm{BAI}}\times {\rm{tree}}\,{\rm{height}}\times {\rm{wood}}\,{\rm{density}}\times {\rm{stem}}\,{\rm{density}}$$
(5)

Meanwhile, we estimated that the CUE for stem is CUEstem = NPPstem/GPP, and assume that the difference in CUEstem is indicative of the difference in total CUE. To compare the relative difference between the CUE of DB and EV trees at the same locations (similar climate and thus hypothetically similar GPP), we have

$$\begin{array}{l}{{\rm{CUE}}}_{{\rm{stem}}\_{\rm{DB}}}/{{\rm{CUE}}}_{{\rm{stem}}\_{\rm{EV}}}\\={{\rm{BAI}}}_{{\rm{DB}}}/{{\rm{BAI}}}_{{\rm{EV}}}\times {{\rm{height}}}_{{\rm{DB}}}/{{\rm{height}}}_{{\rm{EV}}}\times {{\rm{density}}}_{{\rm{DB}}}/{{\rm{density}}}_{{\rm{EV}}}\end{array}$$
(6)

where heightDB and heightEV are the height of the DB and EV trees at the same location, and densityDB and densityEV are the stem density of the DB and EV trees at the same location. Relying on recent global syntheses of tree height74,75 and wood density76, we found that on global average, heightDB is 5% smaller than heightEV, and densityDB is 5% greater than densityEV. Therefore, we assumed heightDB/heightEV × densityDB/densityEV is ~1, which reduced the equation (6) to

$${{\rm{CUE}}}_{{\rm{stem}}\_{\rm{DB}}}/{{\rm{CUE}}}_{{\rm{stem}}\_{\rm{EV}}} \approx{{\rm{BAI}}}_{{\rm{DB}}}/{{\rm{BAI}}}_{{\rm{EV}}}={\rm{pBAI}}$$
(7)

where pBAI means the paired BAI of DB and EN trees at the same locations. To obtain pBAI, we used a clean and high-quality version of tree-ring width data from ITRDB25 and estimated BAI from tree-ring width using the functions in an open R package dlpR. We classified the trees into DB and EV, based on the species information compiled at https://www1.ncdc.noaa.gov/pub/data/paleo/templates/tree-species-code.txt (accessed in 2021, now archived at https://github.com/lxzswr/CUE_fluxnet/blob/main/tree-species-code.txt). Note that the EV trees are mostly evergreen needleleaf trees in ITRDB while evergreen broadleaf trees in the tropics are under-represented. The pBAI provides an independent examination of the difference in CUEstem between DB and EV.

Ancillary climate, soil and model datasets

For site-level analysis, we used the daily meteorological observations from the eddy covariance datasets. The climate variables we used include air temperature, precipitation and photosynthetic active radiation. The stand age of flux sites is acquired from a previous study77 wherever there is a record. For sites lacking stand age data, we sourced the required information from the global forest age dataset (GFAD v.1.1)78. The stand age of non-forest biomes is assumed to be 1 in all attribution analyses. We extract the soil total nitrogen content (soil N content) for each flux site from Soilgrids at 250-m resolution79.

We further used the TS4.04 version of monthly gridded air temperature, precipitation and solar radiation data at 0.5° provided by the Climate Research Unit80, along with the aforementioned GFAD v.1.1 stand age dataset and the soil N content from soilgrids, to estimate global CUE using a random forest model (next section of Methods). All gridded datasets were resampled to 0.5°. The gridded climate data were averaged to annual values, including MAT, MAP and mean annual PAR.

Meanwhile, we studied the CUE estimated by 15 DGVMs participating in the TRENDY v.9 project in our study81,82. For each DGVM, we estimated their CUE as the ratio of mean annual NPP to mean annual GPP from 1980 to 2019 (Supplementary Fig. 3). The simulations of NPP and GPP were conducted under the S3 scenario (considering elevated CO2, climate change and land-use and land-cover changes).

Linear mixed effect model and random forest model

We used the LME model83 to quantify the sensitivities of CUE to abiotic and biotic factors. The LME model suffers less from correlations of samples from similar groups and can quantify the impacts of grouping as the random effect term. This is especially useful in our study to understand the impact of PFTs on CUE, while obtaining the sensitivity of CUE to continuous variables, such as climate and soil properties. We tested several LME models and selected the one that has the lowest Akaike information criterion and Bayesian information criterion, and considers the major factors found in previous studies (Supplementary Table 1). The final LME model to estimate CUE was based on MAT, MAP, PAR, stand age and PFTs, where PFTs were set as a random effect, and the others were fixed effect. The analysis was conducted using the function fitlme in MATLAB.

We then trained a random forest model using the 2,737 records of CUEEC and the in situ variables identified by the LME model as inputs. Subsequently, the random forest model was used to estimate global CUE using gridded MAT, MAP, PAR, stand age, soil N and PFTs. External validation of the random forest model shows good performance in extrapolating CUE (R = 0.72, RMSE = 0.08). We chose the random forest model as it can use both continuous variables (climate) and categorical variables (PFT) as inputs, capture the nonlinear dependence of CUE on some variables and demonstrate stronger statistical performance in predicting CUE compared to LME. The random forest model has been widely used for upscaling in ecology and is often regarded as one of the better-performing models84,85,86.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.