Main

Tropical forests are the most biodiverse terrestrial ecosystems on Earth, and account for a large proportion of global diversity, including up to two-thirds of the approximately 73,000 tree species found on Earth1. They are responsible for key ecological functions, such as carbon exchange, nutrient cycling and the provision of water and energy2, and they contribute to the livelihoods of more than a billion people around the world5. Despite the importance of canopy functional traits (morphological, physiological or phenological attributes that determine function) for forest responses to environmental change, our knowledge of the distribution of functional traits and of functional diversity at large spatial scales is limited, and this knowledge gap is particularly acute for tropical forests6,7,8. Although abiotic factors such as water availability, temperature and soil conditions are expected to drive variation in plant functional traits across spatial scales9,10,11, we do not fully understand how these factors modulate canopy trait distributions and function4. Most global vegetation modelling efforts represent tropical forests as functionally uniform green slabs of canopy, incorporating little geographical variation in canopy functional properties3. This is due partially to the lack of spatially distributed functional trait data from across these regions12. In reality, the combination of climate, geology, evolutionary history and biogeography leads to complex but poorly understood trait variation13. There is, therefore, a fundamental need to describe and map how plant functional traits vary across tropical forests, because this variation has direct implications for ecosystem functioning and resilience to environmental change14,15,16.

Predicting plant trait distributions across large spatial extents has generally focused on a few traits for which more observational data might be available, such as leaf nitrogen, leaf phosphorus and specific leaf area (SLA), and, in fewer cases, other leaf traits, such as leaf dry mass and leaf potassium17,18,19. Some advances in mapping trait distributions have been made by integrating plant functional type information with statistical modelling17,19 and, more recently, satellite remote sensing4,8. However, most predictive models still make use of predefined plant functional types to estimate the distribution of single plant trait values, and still use coarse-resolution satellite data (for example, MODIS at 500 m) to map coarse indicators of community-level trait values—and often, few ground observations are available for tropical forests. This suggests the need to generate tools and methods that facilitate the tracking of functional traits across large spatial extents with high spatial and temporal resolution. Moreover, there is a need to develop methods to compare predictions of plant functional trait values created by different approaches20. Although plant trait databases21,22 might help to model the distribution of functional traits as a function of biotic and abiotic conditions, we are far from having a full representation of the trait values for most tree species across the tropics, or even for single regions, such as Amazonia, with around 15,000 tree species23. Understanding functional trait variability across continents is crucial for predicting ecosystem responses to environmental change, including climate change and land-use alterations9. Previous work24 revealed substantial variation in functional traits across different ecosystems, both within and between plant communities. This variation highlights the relationship between plant trait strategies and environmental conditions, which allows species to occupy distinct ecological niches.

Tree traits across the tropics

Here we present the distribution of plant traits across the entirety of the planet’s tropical forests by expanding on a methodology6 that uses an approach to predict functional traits using the European Space Agency’s Sentinel-2 satellite data. We used data for 13 tree functional traits (hereafter referred to as plant traits), spanning leaf morphological (leaf area, SLA, thickness, fresh and dry mass, also including leaf water content) and chemical (mass-based calcium, carbon, magnesium, nitrogen, potassium and phosphorus concentrations) traits, and also including predictions for wood density24,25. These plant traits were gathered from across tropical forests from the Americas, Africa and Asia, here including northeast Australia in our broad definition of Asian tropical forests (Fig. 1a). We focus on upper-canopy leaf traits, which are the main interface for forest–atmosphere exchange (in that they are part of key processes such as transpiration and photosynthesis26) and which are directly detectable by spectral remote sensing. The plant traits are hence related to fundamental aspects of leaf morphology, chemistry and tree structure (Extended Data Table 1).

Fig. 1: Study area and PCA.
figure 1

a, Study area, showing the distribution of 1,814 vegetation plots across the original biome space for tropical forests (grey background) in the Americas (659.6 ha), Africa (124.6 ha) and Asia (15.4 ha). b,c, PCA (PC1 and PC2, b; PC3, c) depicting the environmental space found across the tropics (yellow and green colours show higher map pixel counts representing area covered) on the basis of mean maximum air temperature (Tmax), soil moisture (SM), solar radiation (SR), slope, MCWD, soil cation-exchange capacity (CEC), soil pH, sand amount and clay amount. The grey, violet and orange points show the location of the sampling plots in environmental space found across the tropics. PC1 accounts for 27% of explained variance, PC2 for 24% and PC3 for 14%, with all three accounting for 65% of the total explained variance. PC1 is loaded mainly by water deficit index (MCWD) (−0.47), SR (0.50) and soil pH (0.59); PC2 by the soil sand (0.57), clay (−0.53) and CEC (−0.44); and PC3 by SM (−0.63) and Tmax (−0.49). Climate data were derived for each pixel from the TerraClimate project34 and soil data were derived from SoilGrids.org.

Overall, we expect that acquisitive traits, which enhance the efficient capture and use of resources (for example, high SLA and leaf nutrient content), will be more prominent in locations with pronounced seasonal variation and nutrient-rich soils. By contrast, conservative traits (for example, thicker, nutrient-poor leaves, high wood density) are likely to dominate in areas with less seasonal variability and poorer soils. In forests dominated by deciduous species, such as drier tropical forests, we expect species with acquisitive traits to become more prevalent, thereby making these traits more common in the ecosystem. African forests, which have experienced a long-term drying trend27, generally exhibit lower species diversity28 and distinct soil conditions29 compared with American and Asian tropical forests. We expect these differences to result in a narrower distribution of plant trait values when compared with the wetter tropical forests of the Americas and Asia. In addition, Asian tropical forests contain the widespread distribution of the Dipterocarpaceae family30, which we anticipate will mainly define the particular set of dominant traits in those areas, such as those associated with large, tough leaves, which are characteristic of this tree family.

Traits were collected from the Global Ecosystems Monitoring (GEM) network31, ForestPlots.net32, BIEN (https://bien.nceas.ucsb.edu/bien/), TRY (www.try-db.org) and a previous study33. We incorporated vegetation census data from the GEM and Monitoreo Nacional Forestal (MONAFOR) networks and contributing networks to ForestPlots.net, with geolocated tree individuals from 1,814 permanent vegetation plots (Fig. 1a), spanning a wide set of environmental conditions across tropical forests (Fig. 1b) and covering a total of 799.5 ha (Extended Data Table 2). We used the plant traits and vegetation censuses to create pixel-level (from the Sentinel-2 satellites) community weighted mean (CWM) trait values using a previously described method6. The total number of CWM pixels used in our analysis was 79,955, which were distributed across 18 countries in the 4 tropical continents (Extended Data Table 2). Our vegetation plots are more abundant in the tropical forests of the Americas and could be thought to represent the environmental conditions in this region rather than those in Africa and Asia. Our principal component analysis (PCA) (Fig. 1b,c) shows that although our sampling sites do not cover all environmental space available across the tropics—especially those climates that are less common in the tropics (dark purple zone in Fig. 1b,c)—we fundamentally cover the most prominent environmental conditions found across tropical forests.

For each pixel for which we calculated trait CWM, we also extracted surface reflectance data from the Sentinel-2 satellite bands covering the years 2019–2022. On the basis of these spectral bands, we also generated the modified chlorophyll absorption reflectance index (MCARI), modified soil adjusted vegetation index 2 (MSAVI2) and normalized difference red edge index (NDRE). Using the grey-level co-occurrence matrix (GLCM) for these indices, we calculated their entropy and correlation as canopy texture variables. We extracted soil texture and chemistry (clay percentage, sand percentage, pH and cation-exchange capacity (CEC)) across the sampling plots from SoilGrids.org and joined these with terrain (slope) and climate (maximum climatic water deficit (MCWD) and maximum temperature (Tmax)) from the TerraClimate dataset34. We used the above-mentioned covariates in random forest models that have produced accurate plant-trait-mapping results4,6 to predict CWM plant functional traits at a pantropical scale. Our analysis hence provides insights into the variation in plant trait composition across climatic and soil gradients across tropical forests. We tested for the prediction accuracy and uncertainty in trait predictions while accounting for potential spatial autocorrelation using a plot-level spatial block leave-one-out cross-validation35 (Supplementary Table 1). We present the spatial predictions from the statistical models described above for canopy-level morphological traits, wood density (Fig. 2 and Supplementary Figs. 17) and chemistry (Fig. 3 and Supplementary Figs. 813). Using our 13 plant trait model predictions (maps), we tested fundamental knowledge gaps on the functional composition across tropical American, African and Asian forests.

Fig. 2: Predicted distribution of CWM morphological and structural plant traits.
figure 2

a, Predicted distribution of a selection of CWM morphological and structural plant traits. Red to orange show areas with low to intermediate trait values; light to dark blue depict areas with intermediate to high trait values. The remaining morphological traits and the spatial predictions of their uncertainty are shown in Supplementary Figs. 1–7. b, Box plots showing the CWM trait distribution values for tropical American (AM), African (AF) and Asian (AS) forests, extracted from the spatial predictions. The horizontal black lines depict the median CWM trait value and vertical lines show the whiskers extending to the largest CWM trait value or not further than 1.5 times the interquartile range. For visualization purposes, we excluded the extreme lowest and highest 1% of values in the maps in a and outliers in b. AreaL, leaf area; ThicknessL, leaf thickness; WD, wood density. For statistical model results, see Supplementary Table 1. For the significance of differences between CWM trait mean values, obtained using a t-test with Bonferroni correction, see Supplementary Table 2.

Fig. 3: Predicted distribution of CWM leaf nutrient plant traits.
figure 3

a, Predicted distribution of a selection of CWM leaf nutrient plant traits. Red to orange show areas with low to intermediate trait values; light to dark blue depict areas with intermediate to high trait values. The remaining chemistry traits and the spatial predictions of their uncertainty are shown in Supplementary Figs. 8–13. b, Box plots showing the CWM trait distribution values for tropical American (AM), African (AF) and Asian (AS) forests, extracted from the spatial predictions. The horizontal black lines depict the median CWM trait value and vertical lines show the whiskers extending to the largest CWM trait value or not further than 1.5 times the interquartile range. For visualization purposes, we excluded the extreme lowest and highest 1% of values in the maps in a and outliers in b. CL, leaf carbon concentration; CaL, leaf calcium concentration; NL, leaf nitrogen concentration; PL, leaf phosphorus concentration. For statistical model results, see Supplementary Table 1. For the significance of differences between CWM trait mean values, obtained using a t-test with Bonferroni correction, see Supplementary Table 2.

Models for leaf chemistry and wood density showed higher accuracy (mean R2 = 0.66 and 0.48, respectively) than did those for leaf morphology traits (mean R2 = 0.25; Supplementary Table 1). Among these, leaf nitrogen (mean R2 = 0.53, root mean squared error (RMSE) = 0.29), phosphorus (0.50, 0.02) and calcium (0.64, 0.22) concentrations had the highest prediction accuracy, followed by leaf carbon (0.40, 1.42) and potassium (0.46, 0.17). Models for SLA (0.32, 19.95), leaf dry mass (0.32, 0.58) and leaf fresh mass (0.31, 2.24) showed moderate accuracy scores. By contrast, leaf magnesium concentration (0.27, 0.06), leaf area (0.22, 66.15), leaf water content (0.18, 3.92), and leaf thickness 0.17, 0.79) had lower accuracy. As expected, lower explanatory values were found when testing the models with the plots from Africa or Asia separately, because fewer data were available (Supplementary Table 1). The individual surface reflectance of the Sentinel-2 bands, the derived vegetation indices and the climate and terrain variables obtained on average the highest importance scores across traits, with texture and soil metrics obtaining on average lower importance values (Extended Data Fig. 1). We report variable importance scores per variable and plant trait in Supplementary Figs. 1f13f.

We make available our trait mapped predictions across the tropics as an online resource in which more detail can be obtained across the tropical region (https://pantropicalanalysis.users.earthengine.app/view/pantropical-traits-aguirre-gutierrez-2025). Using the modelled trait maps (Figs. 2a and 3a and Supplementary Figs. 113), we compared the CWM trait values among continents, which provided insights into the variations in plant traits across continents (Supplementary Table 2, Figs. 2b and 3b and Supplementary Figs. 113). Following our predictions, for most traits, Asian forests show some of the highest average canopy-level trait values; specifically, average leaf area (119.3 cm2), leaf calcium (0.88%), potassium (0.79%) and magnesium (0.28%) concentrations, leaf water content (54.8%), leaf fresh (3.9 g) and dry (1.06 g) mass. These findings are supported by local plot-level data6. However, similar values were found for leaf phosphorus for Asia and Africa (0.11%) and slightly lower for the Americas (0.10%), and also for leaf carbon (around 47%) and leaf nitrogen concentrations (around 2.15%). African forests are predicted to have, on average, the smallest leaves (average of 100 cm2), highest leaf thickness (0.85 mm) and SLA (133.9 cm2 g−1). Wood density is predicted to be, on average, higher in tropical American and African forests (around 0.60 g cm−3), as suggested previously36. These results emphasize Asia’s unique trait spectra and how the African flora is adapted to a wide range of current and past environmental conditions37.

Traits in wet and dry tropical forests

A changing climate affects the distribution and persistence of forests across the tropics. There is an ongoing debate about the capacity of wet and dry tropical forests to adapt or shift their functional composition given global environmental change38. Studies have shown that drier tropical forests could be responding faster to a changing climate by shifting their trait composition39, but also that such drier tropical forests might be becoming more functionally homogeneous, which could negatively affect their capacity to respond to further environmental change37. Hence, understanding the distribution of key tree functional traits across tropical forests is crucial to understanding their potential response to environmental change, including climate.

We determined the extent of tropical broadleaf wet and dry forests using the RESOLVE Ecoregions dataset40. On the basis of this division, wet forests, on average, had higher leaf area and leaf carbon concentration than did dry forests (Supplementary Fig. 1c and Supplementary Table 3). By contrast, dry forests, which are characterized by the presence of stronger and longer dry seasons, had higher average values for leaf chemistry traits such as leaf magnesium, nitrogen, calcium, potassium and phosphorus, and also for SLA (Supplementary Figs. 1d13d and Supplementary Table 3). These distinct strategies possibly ensure optimal nutrient use for drought avoidance, on the basis of the leaf economics spectrum of ‘low’ leaf construction costs for fast energy gains amid challenging environmental conditions41. Notably, both wet and dry tropical forests converge in certain traits, with comparable mean levels of leaf thickness, dry and fresh mass, leaf water content and wood density underscoring their shared strategies. However, these similar average trait values could also be due to the fact that both strategies—drought avoidance and drought tolerance—can be present across both wet and dry forests, potentially ensuring resilience across tropical forest types42. These findings from our comprehensive trait predictions provide crucial insights into the intricate linkages between environmental factors and plant traits across continents, contributing to our understanding of ecological diversity and adaptation strategies in diverse tropical forest ecosystems. Our findings shed light on the diverse plant trait patterns observed across continents, enhancing our understanding of global ecological variation24.

Areas across the wet tropics, which are highly species-diverse, tended to have slightly more uncertain predictions (that is, higher standard error; s.e.) for most traits than did drier tropical forests (Supplementary Figs. 113, middle panel). Our results for leaf morphology and tree structural traits such as fresh mass and wood density showed higher uncertainty in predictions (s.e. = 0.4–1.6 g and 0.02–0.05 g cm−3 respectively) across wetter locations such as central Amazonia, central Africa and Borneo. However, for most other morphological and leaf nutrient traits, their prediction uncertainty was low in most of the tropics (Supplementary Figs. 113, middle panel). Overall, the uncertainty in the predictions of some traits might result from searching for simple relationships between individual traits and the environment, whereas tree individuals represent a combination of traits and trait values that might be interpreted as functional strategies or syndromes. It is the syndrome rather than the individual trait that is selected for in nature. Our findings on the uncertainty of trait predictions give an insight into areas across the tropics that could benefit the most from more extensive field trait campaigns (Supplementary Figs. 113, middle panel and Extended Data Fig. 2).

Functional diversity of tropical forests

The resilience of an ecosystem to environmental change can be partially assessed by the diversity of its functional trait values. According to the insurance hypothesis about biodiversity and ecosystem functioning43, ecosystems with greater taxonomic and functional diversity could potentially be less affected by changes in the environment. Recent studies support this, showing that tropical forests with higher functional diversity and high functional redundancy tend to be less adversely affected by extreme weather events such as El Niño than do less functionally diverse and redundant forests44. Hence, functional diversity indicators such as functional richness and functional divergence can shed light on the capacity of ecosystems to respond to global environmental change. Determining the functional diversity of tropical forest ecosystems will therefore enhance our understanding of their resilience and the possible effects of environmental change on ecosystem functioning and its services to people.

To generate a pantropical understanding of the functional diversity of tropical forests across the Americas, Africa and Asia, and to ascertain how these three regions compare, we first built a PCA that offers insights into the distribution of ecological strategies or syndromes of plant communities45 across tropical forests. This PCA was based on the pixel values from the spatial predictions (maps) of canopy and wood density traits (Figs. 2 and 3 and Supplementary Figs. 113). The first two PCA axes (Fig. 4a,b), explain 44% (PC1) and 20.6% (PC2) of the pantropical trait variance, respectively, and highlight key traits that drive the functional space across tropical forests at a pantropical extent. In our analysis, leaf nutrients such as calcium, nitrogen, phosphorus, potassium and magnesium are the main traits loading PC1 (−0.39, −0.25, −0.39, −0.39 and −0.38, respectively; Supplementary Table 4), with carbon (0.35) and wood density (0.27) in opposite directions. PC2 is loaded mainly by leaf structural and morphological characteristics such as dry mass (0.52), fresh mass (0.43), area (0.47) and SLA (−0.32) (Fig. 4a,b).

Fig. 4: Functional diversity of tropical forests in the Americas, Africa and Asia.
figure 4

a, Functional trait space of trees across tropical forests in the Americas, Africa and Asia (including Australia), with principal component PC1 explaining 44% and PC2 20.6% of the variance in plant traits distributions. Arrows indicate the contribution and direction of each trait for the PCA. b, Distribution of functional trait space for the tropical American (left), African (middle) and Asian (right; including Australia) forests separately. a and b show the probabilistic density distribution defined by the PC1 and PC2 space of the 13 plant functional traits used: area, leaf area; C, leaf carbon concentration; Ca, leaf calcium concentration; K, leaf potassium concentration; Mg, leaf magnesium concentration; N, leaf nitrogen concentration; P, leaf phosphorus concentration; DM, leaf dry mass; FM, leaf fresh mass; SLA, specific leaf area; thickness, leaf thickness; WC, leaf water content; WD, wood density (see Extended Data Table 1 for a description of the trait used). The inner colour gradient represents the density of pixels in the PC trait space. Thick contour lines depict the 0.5 and 0.99 quantiles. FRich shows the functional richness and FDiv the functional divergence for the global trait space across continents (a) and for tropical American (b, left), African (b, middle) and Asian (b, right) forests. c, PC1 (top), PC2 (middle) and PC3 (bottom, explaining 13% of the variance) from a predicted across tropical forests. Co-occurring trait syndromes or strategies are shown, with insets magnified to show greater details of the predicted plant strategies.

Following the PCA results, central-west Amazonia, central Africa and to some extent some areas of Southeast Asia show areas with trait syndromes related to higher wood density and leaf carbon (Fig. 4c, top, PC1), but also higher leaf area and leaf fresh and dry mass (Fig. 4c, middle, PC2). Wood density is closely related to plant mechanical and hydraulic properties, and has been shown to have a negative relationship with mortality given increased physical strength and resistance to drought-induced embolism46. The highest leaf carbon concentration (C) values are predicted to be found in wet regions with relatively infertile soils in the Americas, Africa and Asia, such as northwest Amazonia, Central Africa and much of Borneo, and tend to decline towards drier tropical forests (Fig. 3a and Supplementary Fig. 8). An alternative strategy for dry forest tree species is deciduousness, which leads to low leaf carbon concentration because of lower investment in leaf defence and longevity. In dry forests with fertile soils, we expect deciduousness as a dominant strategy (thus low C), but in less fertile soils we would expect a transition to an evergreen strategy (higher C) to conserve resources. Higher leaf carbon, and generally also higher leaf fresh and dry mass, reflect an increased investment in leaf structural and physical defences47, which favours longer leaf life span and thus higher investment in compounds such as lignin, tannins and soluble phenolics that contain high levels of carbon48.

Syndromes related to higher leaf nutrients (Fig. 4c, top, PC1) and higher SLA (Fig. 4c, middle, PC2) are opposed to the patterns explained above, with higher leaf nutrients and intermediate SLA values found across tropical dry forests and increasing leaf water content predicted across the Andes and high elevations of Southeast Asia (Fig. 4c, bottom, PC3). Leaf nutrients are generally lowest in wet central-west Amazon, Central Africa and wet forests of insular Southeast Asia (Fig. 3), and tend to increase across dry forests in south and southeastern Brazil, West Africa, eastern Madagascar and most of the tropical forests in India and northern Southeast Asia (Figs. 3 and 4, PC3). This suggests that soil physical and chemical properties have an important role in shaping leaf phosphorus distributions49 (Figs. 3a, bottom and 4a,c and Supplementary Fig. 13). We predict a consistently high leaf area across much of insular Southeast Asia (Fig. 4c). This agrees with previous plot-level analyses31 that found a larger leaf area for forests in Malaysian Borneo than for those from other tropical regions. Many of the wet Bornean forest canopies are dominated by a single family (Dipterocarpaceae)30 with a particular set of traits, such as large, tough leaves, and this biogeographical feature might explain some of the leaf morphological differences between Asian and other forests. In the tropical Americas, syndromes related to lower SLA values are found across the Andes, the mountains of southern Brazil and also in the extremely wet and nutrient-poor areas of northwest Amazonia; for example, across the sandy soils of upper Rio Negro. Lower SLA can be found across Central Africa and in Asia across the mountains of New Guinea (Fig. 4c, bottom). Plants with a lower SLA tend to have thicker leaves, which are more resistant to herbivory and decomposition, and lower SLA values indicate a conservative strategy in which resources are invested in long-lasting leaves but often with a lower photosynthetic capacity25.

Building on our PCA analysis, we calculated the trait functional diversity, here by means of the trait functional richness (FRich) and functional divergence (FDiv), across tropical forests (Fig. 4a), and calculated how these FRich and FDiv values differ between the forests of the tropical Americas, Africa and Asia (Fig. 4b). FRich represents the size of the functional trait space and FDiv indicates the distribution of CWM trait abundances within the functional trait space45. The overall FRich across tropical forests is calculated to be 111.7, with a pantropical FDiv of 0.46 (Fig. 4a). The observed FRich values of 109.2 for the Americas, 66.5 for Africa and 63.5 for Asia point to large differences in the diversity of functional trait values in these regions (Fig. 4b). The higher FRich of the Americas suggests that these forests have a broader array of plant strategies and adaptations, potentially influenced by diverse environmental niches and historical factors44, and congruent with the fact that the most taxonomically diverse tropical forests are in the tropical Americas1,23. By contrast, the lower FRich in Africa and Asia suggests that specific environmental filters or historical biogeographical constraints shape the functional traits of plant communities in these regions50. The FDiv values ranged from 0.42 for the Americas to 0.61 for Africa, and 0.57 for Asia, revealing varying degrees of dissimilarity in functional trait space among tropical forests (Fig. 4b). Higher FDiv values imply greater divergence, suggesting stronger niche differentiation or competitive interactions. The comparatively lower FDiv in the Americas might imply a higher degree of functional redundancy across communities. Conversely, the higher FDiv in Africa and Asia suggests a more specialized pattern of resource use, owing potentially to intense interspecific competition or specific ecological constraints in these regions. Regions with higher functional divergence might exhibit higher ecosystem stability because of niche complementarity, whereas regions with lower divergence might face challenges in adapting to changing environmental conditions. The observed patterns have implications for ecosystem functioning, biodiversity conservation and ecosystem services provision.

Understanding the tree trait composition and functional diversity across the tropics is of pivotal importance for global biodiversity and ecosystem modelling and for conservation efforts51. Although dynamic global vegetation models (DGVMs) and species distribution models (SDMs) help to assess the effects of a changing climate, DGVMs often rely on broad plant functional types and SDMs commonly overlook functional trait composition and diversity (but see ref. 52). By incorporating trait-based mechanisms and functional trait diversity, models can better capture the variability in plant responses, potentially making more realistic predictions related to carbon cycling53, vegetation distribution54 and ecosystem composition and resilience44. DGVMs and SDMs could include plant traits and plant functional diversity estimates to advance our understanding of ecosystem functioning and responses to global environmental change.

Our capacity to use artificial intelligence (AI) to map plant functional traits by means of deep-learning models applied to field trait55 data or even photographs56 is quickly developing. These models can process vast amounts of remote-sensing data to identify and classify diverse biodiversity metrics57. Some models—particularly convolutional neural networks—have been integrated with spectral data to map plant traits using field data58 and also, recently, citizen-science approaches56. New satellites with hyperspectral capabilities and high spatial resolution are in development, and the availability of tree censuses and trait data across the tropics is increasing. This will open new avenues for testing the capabilities of large machine-learning models, possibly involving deep learning, for using data across time and space from multiple sources. However, to obtain robust and reliable indicators of plant functional diversity and biodiversity levels across ecosystems, AI models should complement and not replace conventional ecological methods—especially the direct field sampling and botanical identification of individual trees by experts. There is a need for tools that can generate predictions of biodiversity at high temporal resolution, and our approach represents a way forward in this direction. Looking ahead, there is the potential to track plant functional diversity across time (for example, on a yearly basis) using satellite remote-sensing data, such as that from the Sentinel-2 satellites. Such an application would require major efforts in terms of field ecological data collection, availability of new satellite data, modelling algorithms, computing power and storage capabilities. All of this can be achieved by strong and fair collaborations between field researchers, universities and other public and private research organizations.

Our study reveals and maps the geographical variation in the functional composition of the tropical moist and dry forests, where at least two-thirds of Earth’s tree species are found1. Our trait predictions indicate deep physiological constraints of adaptation to long-term climate; the predictions could provide the basis for forecasting how shifting climates will affect the functional composition of tropical forests, and could help to develop a more mechanistic understanding and realistic predictive ecology across spatio-temporal scales. Built from unique, geolocated field records combined with an array of spectral, textural and environmental data, our maps represent data-informed spatial hypotheses that will assist in the identification of priority areas for further field data collection, especially across tropical forests in Africa and Asia, where fewer data are available. The ultimate accuracy of the plant functional trait predictions depends on the sample coverage, the accuracy of the field measurements and the quality of the pantropical covariates that are used to spatially extrapolate our models. Undoubtedly, predictions will improve as new environmental datasets become available and as vegetation census and trait data expand further over space and time. Nevertheless, these maps represent a major improvement on previous site-based speculation for analysing geographical variation in the ecophysiology of the entire tropical forest biome, and they thereby inform our understanding of how tropical forests function in the context of the whole Earth system.

Methods

Vegetation plots and plant traits

We gathered vegetation census data from the GEM and MONAFOR networks and contributing networks to ForestPlots.net, being geolocated tree individuals from 1,814 demarcated and identified vegetation plots (Fig. 1a). The vegetation plots covered a wide set of the environmental conditions found across tropical forests (Fig. 1b) and spanned 799.5 ha (Extended Data Table 2). We aimed to match each individual tree to a trait value. All plant functional traits used are part of the Global Ecosystems Monitoring network (GEM; gem.tropicalforests.ox.ac.uk)31, the MONAFOR network, the ForestPlots (www.forestplots.net)32,59,60, BIEN (https://bien.nceas.ucsb.edu/bien/) and TRY (www.try-db.org)22 databases and from local collaborators and Diaz et al.33, and were collected following a standardized methodology described in Both et al.61, Martin et al.62, Enquist et al.63, Oliveras et al.50 and Gvozdevaite64. For the traits provided by the GEM network and ForestPlots.net, the tree species that contributed the most to plot basal area were sampled with three to five replicate individuals per species. Species representing 80% or more of the basal area were sampled for traits in low-diversity sites and at least 70% in high-diversity sites. For each selected tree, a sun and a shade branch were sampled, and in each branch, three to five leaves were used for trait measurements. Leaf samples were analysed for chemistry (nitrogen, phosphorus, carbon, calcium, potassium and magnesium concentration) and morphological and structural traits (area: area, specific leaf area (SLA); thickness: thickness; fresh mass (FM); and water content (WC); see Extended Data Table 1 for units and definitions). If more than one value per trait per species was available, we used the trait mean at the species level for subsequent analysis. Our approach aimed to cover at least 70% of the canopy area of a pixel within a plot with trait data at species or genus level, often covering more than that (Extended Data Fig. 3). Because when species-level trait data were unavailable we used the mean genus-level data, our analysis could be seen as more representative of the genus-level trait responses. When achieving at least 70% coverage was not possible for a given trait in a given pixel, that pixel was left out of the analysis for the specific trait. All species names were standardized following the Taxonomic Name Resolution Service (TNRS; https://tnrs.biendata.org).

Calculating community-level trait values

We used the pixel-level CWM trait method from Aguirre-Gutiérrez et al.6 in our analysis, in which they calculated the CWM of each trait for each 10 × 10-m pixel of the Sentinel-2 imagery on the basis of the canopy area occupied by the single tree crowns of each species encompassed in a given pixel. The total number of CWM pixels used in our analysis was 79,955, from 1,814 unique permanent forest plots distributed across 18 countries in the 4 tropical continents (Extended Data Table 2). A full detailed description of the methods can be found in Aguirre-Gutiérrez et al.6, and we summarize it here. We calculated the CWM trait values for each 10 × 10-m Sentinel-2 pixel falling into a vegetation plot. We first geolocated the vegetation plot and the distribution of each individual tree in the plot. Some of the plots already had their tree crowns mapped. When this was not the case, we calculated the crown area using regional allometric equations, from which we generated a crown polygon. Then, for each pixel we calculated the trait CWM using the individual tree crown horizontal area as the weighting factor. We used only pixels that had at least a 70% basal area coverage with trait value to generate the trait CWM.

Sentinel-2 spectral data

The European Space Agency Sentinel-2 satellites (sentinel.esa.int/web/sentinel/missions/sentinel-2) have high multispectral (13 spectral channels covering the visible, near-infrared and short-wave infrared), spatial (10 m for visible and near-infrared 835 nm, 20 m for other near-infrared and short-wave infrared) and temporal coverage (revisit period of 5 days), in addition to open data availability. This high spatial, radiometric and temporal resolution provides the backbone to scale functional traits, such as leaf morphology, water content and covalent chemical bonds, without the logistical and field constraints that are common across the tropics6 and other regions65. The extraction of Sentinel-2 Level-2A data on surface reflectance bands, vegetation indices and canopy texture metrics has been fully described previously6, and here we give a summary of the main steps. We extracted Sentinel-2 Level-2A spectral data at the pixel level for each vegetation plot using the raw band values for bands B2 to B12, excluding bands B9 and B10 because those are used for cirrus, water vapour and cloud detection for the images and dates specified in Supplementary Table 5. Next, we calculated the vegetation indices MCARI, MSAVI2 and NDRE.

We also incorporated spatial information by using the spectral indices to derive neighbourhood canopy texture, entropy and correlation with a 9 × 9-pixel GLCM (ref. 66). The GLCM metrics are computed from a matrix that is spatially dependent. The co-occurrence matrix relies on the angular orientation and distance between adjacent pixels, illustrating the frequency of associations between a pixel and its neighbouring pixels. We applied a 9 × 9-pixel kernel window because this window size proved sufficient to capture ample canopy contrast information during the modelling stage without incurring substantial computation time.

We generated spatially explicit predictions across tropical forests in Google Earth Engine (GEE)67 using surface reflectance Sentinel-2 Level-2A images from June to March of 2019–2022, because these months show the lowest cloud cover across most of our study areas. We applied the maskS2clouds and maskEdges to increase the quality of the imagery, especially to detect and mask clouds and cirrus. On the basis of the images selected, we calculated a median spectral reflectance composite value per band and used it for generating the predictive maps. The reader can run the GEE code (Supplementary Table 5) to obtain the number and identity of the imagery used.

Climate, topography and soil data

We used the TerraClimate climate dataset34 to extract climate data for the study area. These data have an original spatial resolution of around 4.6 km at the Equator and a large temporal range (from 1951 to the present). In general, the TerraClimate dataset builds on the Climatic Research Unit climate data, CRU (refs. 54,68), downscales it and swaps the JRA55 reanalysis product69 for CRU when there is insufficient station data to inform CRU. From the TerraClimate dataset, we calculated the 30-year (encompassing 1988–2017) mean annual Tmax and the MCWD for each vegetation plot. The MCWD is a metric for drought intensity and severity defined as the most negative value of the climatological water deficit (CWD) of a given year, and we calculated it following a previous study70 but using the potential evapotranspiration instead of a fixed evapotranspiration value. We derived topography (slope) from the Shuttle Radar Topography Mission (SRTM) digital elevation model V3 product (SRTM Plus) provided by NASA JPL at an original spatial resolution of around 30 m at the Equator71. Soil characteristics such as texture and fertility also determine the distribution of plant species47. Moreover, drier tropical forests tend to be distributed on more nutrient-rich soils than do wetter forests72, which would therefore also select for species adapted to such conditions. Maps of soil data—that is, per cent sand and clay, pH and CEC—were obtained from the SoilGrids project (https://soilgrids.org)73 at a spatial resolution of 250 m per pixel. All climate, topography and soil datasets were scaled to the Sentinel-2 pixel resolution to take advantage of its spectral reflectance pixel size. All spatial analyses were performed in the GEE platform.

Mapping plant traits

We modelled each plant functional trait CWM as a function of the spectral, soil, topography and climatic variables using the random forests (RF) machine-learning algorithm74 in the R platform75 with the Ranger function in a high-performance computing system. RF stands out as a nonparametric algorithm known for its capabilities against overfitting and for its flexibility with respect to variations in the type and number of variable inputs. This robustness is attributed to the bagging process and the inclusion of random feature selection. In addition, RF has been widely and successfully applied for modelling and predicting ecological and remote-sensing data, both within individual ecosystems and across diverse environments6,65,76,77,78. To parametrize the RF models, we performed a comprehensive series of model optimization and regularization techniques to mitigate overfitting6. We determined the number of trees through a cross-validation analysis, exploring a range between 500 and 1,500 trees. Similarly, we varied the number of variables randomly sampled as candidates at each split (also referred to as ‘mtry’ in the RF) in the range of 1–10. The final model incorporated the combination of parameters that yielded the lowest RMSE. We then obtained a map by applying the fitted model to make predictions for the full tropics, where tropical wet and tropical dry broadleaf forests are located (because the data used for model fitting belong to these forest types). We determined the extent of the tropical broadleaf wet and dry forest using the RESOLVE Ecoregions dataset40 (https://ecoregions.appspot.com/) and the tropical countries boundaries dataset (for the GEE app)79. We further used the Land-use Cover map from the European Space Agency80 to delimit the areas classified as forest and a previously described81 30-m forest cover product to further delimit the predictions to areas with a threshold value of a minimum of 25% forest cover in a given pixel. Hence, although an area might be included in the trait maps, this does not mean that it is entirely forested. The accuracy of the predictions was quantified by the explained variance using R2. Variable importance was calculated as the decrease in node impurities, from splitting on the focus variable, derived from the out-of-bag error. We scaled the variable importance values per covariate to a 0–1 scale for comparison purposes.

To assess the uncertainty in model predictions in a spatially explicit manner, we used spatial leave-one-out cross-validation35 for the full dataset. When predicting the RF models, we also obtained their s.e. using the infinitesimal jackknife approach as a measure of prediction uncertainty. From these s.e.-mapped predictions, we also calculated a final map of new field sampling needs by standardizing each trait s.e.-mapped prediction from 0 to 1 and obtaining an average value of the sum of those standardized s.e. maps. From this final field sampling needs map, we calculated the areas belonging to the lowest, middle and highest 33 percentiles and classified these as low, intermediate and high, respectively. This final map could aid in generating field sampling priorities for the traits used in this study.

We tested for differences in the among-continent mean CWM trait values using t-test analysis with Bonferroni correction for significance values. Because we are working with the pixel predictions per continent (here using 100 × 100 m pixels), we have several millions of pixel-level estimates, which makes it possible to obtain significant P values (P < 0.05) just because of the high number of pixels involved. Therefore, we performed the t-test for the full dataset (comparing continents) and also by first randomly sampling 10% and 1% of the data per continent for the comparisons so as to obtain an indication of the possible effect of sample size on the among-continent comparison results.

Functional richness and divergence

We calculated the FRich and FDiv at a pantropical extent and also for the tropical Americas, Africa and Asia. To this end, we took the mapped CWM trait predictions and performed a PCA with them and calculated the trait probability density (TPD) as described before45,82 Using the mapped predictions, and not only the pixels used to build the trait CWM, allowed us to avoid having a larger representation of the tropical forests in the Americas in comparison to those found in Africa and Asia. To perform the PCA, we used the Princomp function in R with the data from the mapped predictions of the 13 traits. We then used the Funspace function in R to create the TPDs, with which we would obtain the functional trait space available at a pantropical extent. We also calculated the TPDs for each continent on the basis of the pantropical TPD so that these could be compared between each other45,82. On the basis of these, we then calculated the FRich and FDiv metrics at a pantropical extent and also for each continent. In our analysis we represent the global TPD (100%) and also highlight the contours containing 50% and 99% of the total probability.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.