A general rule on the organization of biodiversity in Earth’s biogeographical regions

Bernardo-Madrid, R.; González-Suárez, M.; Rosvall, M.; Rueda, M.; Revilla, E.; Carrete, M.; Tella, J. L.; Astigarraga, J.; Calatayud, J.

doi:10.1038/s41559-025-02724-5

Download PDF

Article
Open access
Published: 04 June 2025

A general rule on the organization of biodiversity in Earth’s biogeographical regions

Nature Ecology & Evolution volume 9, pages 1193–1204 (2025)Cite this article

30k Accesses
9 Citations
377 Altmetric
Metrics details

Subjects

Abstract

Life on Earth is a mosaic distributed across biogeographical regions. Their regional species pools have experienced distinct historical and eco-evolutionary pressures, leading to an expected context-dependent organization of biodiversity. Here we identify a general spatial organization within biogeographical regions of terrestrial and marine vertebrates, invertebrates and plants (more than 30,000 species). We detect seven types of areas in these biogeographical regions that reflect unique combinations of four fundamental aspects of biodiversity (species richness, range size, endemicity and biogeographical transitions). These areas form ordered layers from the core to the transition zones of the biogeographical regions, reflecting gradients in the biodiversity aspects, experiencing distinct environmental conditions, and exhibiting taxonomic dissimilarities due to nestedness. These findings suggest this ubiquitous organization is mainly driven by the action of two complementary environmental filters, one acting on species from regional hotspots and the other on species from permeable biogeographical boundaries. The influence of these regional filters extends across spatial scales and shapes global patterns of species richness. Regional biodiversity follows a universal core-to-transition organization governed by general forces operating across the tree of life and space.

GenDivRange: A global dataset of geo-referenced population genetic diversity across species ranges

Article Open access 11 June 2025

Climate change will redefine taxonomic, functional, and phylogenetic diversity of Odonata in space and time

Article Open access 17 November 2022

Environmental filtering, not dispersal history, explains global patterns of phylogenetic turnover in seed plants at deep evolutionary timescales

Article 29 November 2024

Main

Biogeographical regions—or bioregions—reflect regional species pools with different origins, compositions and ecological and evolutionary characteristics^1,2,3,4,5,6. These regional species pools are, in part, isolated by geological and climatic barriers^7,8,9 and have experienced distinct historical, ecological and evolutionary pressures^{9,10,11,12,13}. Therefore, it is reasonable to expect a context-dependent organization of regional biodiversity with idiosyncratic variations across biogeographical regions and taxa. However, the existence of global factors, processes and biodiversity patterns that transcend biogeographical region boundaries^14,15, or tend to be consistent across regions^16,17, open the door for general processes and mechanisms to exert a stronger influence than the idiosyncrasies of geographical areas and life forms; forcing regional biodiversity to organize into limited, convergent and more predictable ways. These unevaluated alternatives revolve around the predominance of context-dependent versus general processes and mechanisms, offering contrasting perspectives on the principal forces driving the organization of biodiversity across multiple scales and, thus, on our understanding of the distribution of life on Earth.

Species biodiversity can be described using complementary aspects. For instance, geographical areas: may teem with species or harbour only a few¹⁸ (species richness); may contain a mixture of biotas from different biogeographical regions, as observed in transitional areas, or maintain mainly species from a single biogeographical region⁸ (biota overlap); may host species occupying relatively small or large areas within the biogeographical region¹⁹ (species range size or occupancy); and may predominantly host endemic or non-endemic species¹⁹ (species endemicity). These four complementary aspects—species richness, biota overlap, species occupancy and species endemicity—play pivotal roles in ecology and biogeography^{9,18,20,21,22}. Hence, distinct combinations of values of these complementary aspects and their spatial distributions in the biogeographical regions can be understood as the biogeographical organization of biodiversity. The study of such organization can offer new insights into how regional biodiversity is assembled and what its drivers may be. Moreover, by comparing the organization of biodiversity across biogeographical regions and taxa, we can determine whether biodiversity organization and its drivers are context-dependent or general.

To address this, we assessed the joint spatial patterns of the four complementary biodiversity aspects in biogeographical regions of seven ecologically contrasting taxa: amphibians, non-marine birds, dragonflies, non-volant mammals, rays, reptiles and trees. Our results reveal a general pattern in the organization of regional biodiversity across global biogeographical regions of terrestrial and marine vertebrates, invertebrates and plants. In this study, we describe this regional spatial pattern across the tree of life and space, provide empirical evidence for its underlying mechanisms, and show how these regionally operating processes influence species richness patterns at the global scale.

Life forms and data studied

We studied the biogeographical regions of seven contrasting life forms using global distribution maps for five taxa: amphibians (6,563 species), non-marine birds (9,752 species), non-volant terrestrial mammals (4,200 species), reptiles (8,219 species) and rays (360 species). We also used distribution maps for dragonflies in Eurasia (648 species) and forest inventory data for trees in North America (307 species). These more than 30,000 species occupy distinct habitats (marine and terrestrial), and have distinct mobilities (sessile, aquatic, aerial and terrestrial), life histories (slow or fast) and physiologies (ectotherm or endotherm). Furthermore, the dataset encompasses different extent, resolution and collection methodologies. Thus, obtaining consistent results would support the generality of our findings to variations in life forms and data characteristics. To delineate biogeographical regions and identify their most characteristic species so as to estimate their biodiversity aspects, we projected species distribution data (distribution maps or inventory data) onto a distinct regular grid for each taxonomic group. For each taxon, we built a bipartite network, linking species to the grid cells where they occur (Extended Data Fig. 1), and applied the widely used community detection algorithm Infomap^23,24. This algorithm simultaneously identifies modules of highly connected grid cells and species in an integrated approach^19,25. Grid cells within a module define a biogeographical region¹⁹ (Extended Data Figs. 2–8). Species assigned to the same module are considered characteristic of that biogeographical region¹⁹, meaning their distribution range is largely confined to that region, contributing to its unique biotic identity. By contrast, species present in a bioregion’s grid cells but not grouped in the same module are deemed non-characteristic¹⁹, either because they are more strongly associated with another region or display an even distribution that precludes a clear regional association (see graphical description in Extended Data Fig. 1).

General organization of biodiversity

In each grid cell representing regular geographical areas on Earth, we measured four complementary biodiversity aspects: the ratio of characteristic and non-characteristic species, as well as the richness, occupancy and endemicity of characteristic species (hereafter, biota overlap, species richness, occupancy and endemicity) (Supplementary Figs. 1–7 and Supplementary Tables 1–8). These four aspects were used in a single k-means clustering analysis to group grid cells with similar combinations of biodiversity values across biogeographical regions and taxa (see workflow in Extended Data Fig. 1). This k-means clustering included all grid cells from the seven taxa, treating each grid cell as a distinct analytical unit, regardless of its grid membership. The resulting clusters represent geographical areas (groups of grid cells) with similar combinations of biodiversity values across biogeographical regions and taxa, which we refer to as ‘biogeographical sectors’. In theory, the potential number of biogeographical sectors could be large. For instance, even if the four studied biodiversity aspects only had three values each—low, medium and high—there would be 81 potential value combinations (3⁴). If biodiversity is organized differently across the taxa, grid cells for each of the seven studied life forms would be clustered separately from each other. Conversely, if biodiversity is organized similarly across the taxa, clusters would combine grid cells from all the taxa. Our k-means clustering showed an optimal number of seven biogeographical sectors, each encompassing grid cells from all taxa (Fig. 1a). The generality and low number of biogeographical sectors across the seven taxa support the hypothesis that biodiversity at regional scales is arranged in a consistent and limited number of ways, and that the mechanisms governing regional biodiversity transcend the particularities of individual life forms.

**Fig. 1: Seven general and spatially structured biogeographical sectors characterized from four biodiversity aspects across taxa.**

To interpret the biogeographical meaning of the identified sectors, we analysed the distribution of biodiversity aspects across them using values from all the grid cells of the seven taxa (Fig. 1a). We also assessed the similarity of sectors in a multidimensional space defined by four biodiversity aspects using principal component analysis (Fig. 1b). Finally, we mapped sectors and, for each taxon and biogeographical region, assessed whether neighbouring relationships between sector pairs occurred more frequently than expected by chance (one-sided binomial proportion tests, expected probability = 1 in 6, P < 0.05) (Fig. 1c and Supplementary Table 9). Our results show that the biogeographical sectors exhibit a spatial neighbouring pattern with an ordered layered scheme (Figs. 1–3). Shifts between neighbouring sectors reflected spatial gradients in two or more of the biodiversity metrics (Fig. 3a,b). On one hand, the ordered layered scheme of the biogeographical sectors largely captured two opposing gradients of variation in regional species richness and the overlap of biotas (Figs. 1 and 2). At one extreme of these gradients lie the regional species hotspots, likely areas with favourable conditions for the diversification and persistence of most regional species²⁶. At the other extreme are the most transitional areas located near the permeable boundaries of biogeographical regions. The sectors between these extremes partially reflect the geographical proximity of areas to those regional hotspots²⁷ and the permeable boundaries of biogeographical regions^9,28. On the other hand, the biogeographical sectors also mirrored two inverse gradients involving species occupancy and endemicity, with hotspots harbouring the most endemic and least widespread species, whereas the permeable boundaries harboured the least endemic and most widespread species (Fig. 1a). Thus, biodiversity in biogeographical regions generally aligns with what we term a ‘core-to-transition’ organization.

**Fig. 2: Seven general and spatially structured biogeographical sectors characterized from four biodiversity aspects across diverse taxa.**

**Fig. 3: Core-to-transition sequence of biogeographical sectors.**

The biogeographical sectors comprising the sequence of ordered layers can vary across biogeographical regions, probably reflecting the challenges in discretizing what may be a continuum¹⁰ (Fig. 3c). For instance, some regions only contain sectors with a high overlap of biotas, probably reflecting the widely recognized transitional nature of some biogeographical areas^9,29,30. Examples include the Mexican, South American, Saharo–Arabian and Oriental transitional zones (Figs. 1d–g and 2). Similarly, the sequence of layers can start from different points, such as the centre or boundaries of biogeographical regions, or vary in their orientation, ordering latitudinally or longitudinally, probably reflecting the idiosyncrasies of geographical areas and biotas (Figs. 1d–g and 2). However, the biogeographical meaning of all the observed sequences of the layers prevails, supporting the generality of the conceptual core-to-transition organization (Fig. 3). Sensitivity analyses using k-means clustering with 2 to 8 clusters (general biogeographical sectors) also show the core-to-transition organization and the gradient in the four biodiversity aspects (Supplementary Figs. 8–13).

Underlying mechanism

Which mechanism could account for this general organization of biodiversity across biogeographical regions in different parts of the world and across life forms? Gradients in species richness³¹, range size³² and transitional zones⁹ have been associated with environmental filters^31,33. Thus, hypothetically, the observed decreasing richness and increasing occupancy covariation patterns might reflect that only a reduced subset of tolerant species can colonize some areas while expanding their distribution ranges^1,34. Similarly, a decreased overlap of biotas from the boundaries can result from a filtering of species from other biogeographical regions^1,35. If biogeographical sectors reflect environmental filtering from regional hotspots and permeable barriers, we would expect the biogeographical sectors in a biogeographical region to be associated with distinct environmental conditions. Additionally, differences in species compositions among these biogeographical sectors would predominantly arise from one sector’s species being a subset of those present in another, representing nestedness patterns rather than species turnover^36,37.

In line with our expectations, multinomial logistic regressions for each biogeographical region and taxon showed that the biogeographical sectors occupy areas with distinct environmental conditions in 97.7% of the cases (median McFadden’s pseudo-R² in multinomial logit models across taxa = 0.32 (Fig. 4a) using temperature and precipitation as explanatory variables in the terrestrial taxa and temperature and salinity at the sea surface in the marine taxon (Supplementary Figs. 14–20)). Complementarily, the partitioning of species dissimilarity into nestedness and turnover components³⁸ across biogeographical sectors within a given biogeographical region revealed that taxonomic dissimilarity is more attributed to nestedness in 77 ± 2% of the biogeographical regions across all taxa (mean ± standard error (s.e.) of the proportion of biogeographical regions with higher nestedness than turnover across all taxa (Fig. 4b)). In some cases, our environmental variables exhibited limited explanatory power in explaining the distinct biogeographical sectors, probably due to their association with other non-studied environmental, historical and geographical factors^1,36,39. For example, changes in temperature and precipitation since the Last Glacial Maximum also correlate with the core-to-transition pattern (sensitivity analysis) (Supplementary Information Appendix A and Supplementary Figs. 21–26). Similarly, in certain cases, species turnover was higher than nestedness, probably due to the intricate overlap of biogeographical processes and biotas across spatio-temporal scales^2,40 as well as to changes in biogeographical patterns resulting from human-mediated extinctions and species introductions^19,41. However, most of the results across the biogeographical regions and taxa aligned with our predictions about the different environmental conditions associated with distinct biogeographical sectors, with nestedness being the key component of their biotic dissimilarity. The results from the multinomial and nestedness analyses remained consistent when varying the number of biogeographical sectors from seven to 2–8 (sensitivity analyses) (Supplementary Figs. 8–14 and Supplementary Tables 10 and 11). The congruency between our predictions and our results supports the important and general role of environmental filters acting on both characteristic and non-characteristic species in shaping biodiversity within biogeographical regions.

**Fig. 4: Biogeographical sectors are consistent with environmental filtering showing associations with environmental variables and community dissimilarity attributed to nestedness.**

Taken together, these findings support the hypothesis that regional hotspots, probably representing centres of diversification or past climatic refugia^42,43, act as species sources from which species with better dispersive capabilities²⁷ and greater environmental tolerances would have colonized other areas in the biogeographical region, expanding their ranges^34,35. This hypothesis²⁷ is supported by the fact that the most core areas of each biogeographical region cover approximately 30% of the region’s surface, but harbour more species than the remaining 70% of the area (holding around 90% of the species, a species richness higher than expected by chance, based on the geographical extent of biogeographical sectors for reptiles, amphibians, dragonflies and trees; P < 0.05 in models permuting a sector’s identity). Similarly, the colonization of species from other biogeographical regions seems to have also been constrained by environmental factors^9,28,34, producing the observed gradient in the biota overlap from permeable borders. Thus, regional biodiversity could be largely conceptualized as species sorting from two predominant sources, entailing dispersion from the most representative or suitable areas in the biogeographical region and the colonization of biotas from other biogeographical regions.

Influence across spatial scales

The ubiquity of the core-to-transition organization of biodiversity worldwide, in conjunction with the interplay of biogeographical processes across spatial scales, raises a new question: can the core-to-transition organization help us understand global variations in local species richness?—a long-debated topic in ecology and biogeography^18,39,44. Variations in species richness at local scales, such as our grid cells, are expected to be regionally determined by the number of species in the biogeographical region—that is, the size of the regional species pool resulting from the balance of speciation, extinction and biogeographical dispersal^45,46—and by the sorting of the regional species pool across the biogeographical regions, largely influenced by environmental filters^47,48 and associated here with the core-to-transition organization. However, the relative importance of these two drivers remains uncertain. If the size of the regional species pool primarily explains the variance in species richness, then global patterns may be predominantly influenced by biogeographical isolation and connectivity behind the formation of biogeographical regions, as well as the context-dependent speciation and extinction events within them. Conversely, if the sorting of characteristic and non-characteristic species better explains the local variation in species richness, this may underscore the predominant role of regional environmental filters across the planet.

To address the relative importance of core-to-transition organization in shaping global patterns of species richness, we modelled the variance in species richness across grid cells using three variables: the size of the regional species pool, the variations due to the sorting of characteristic species and the variations due to the sorting of non-characteristic species. We measured the size of the regional species pool as the total number of characteristic species in each bioregion, while the sorting of characteristic and non-characteristic species was measured as the observed richness of characteristic and non-characteristic species minus their respective mean values in the biogeographical regions—that is, centred values to discount for the effects of the regional species pool size⁴⁶. Note that the size of the species pool plus the centred species richness of the characteristic and non-characteristic species can be viewed as an approximated decomposition of the observed species richness allowing us to explore the independent effect of each component on the local species richness (Methods). Linear regressions and variance partitioning showed that the influence of species sorting can be comparable to that of the regional species pool size in some cases (dragonflies and mammals) or even greater in others (rays) (mean ± s.e. of the non-shared variance explained by the sorting of characteristic species = 0.25 ± 0.06, the sorting of non-characteristic species = 0.06 ± 0.01 and the size of the regional species pool = 0.39 ± 0.06) (Fig. 5). Thus, the processes underlying species sorting in biogeographical regions, probably tied to regional environmental filters and responsible for the core-to-transition organization, may be, on average, as important as those variations in the regional pool size driven by the balance of speciation, extinction and biogeographical dispersal. These results advocate for broadening our attention beyond the traditionally evaluated size of the regional species pool when exploring how regional effects drive global biodiversity⁴⁶ and, in particular, to also consider the processes and mechanisms driving the core-to-transition organization.

**Fig. 5: Species sorting within biogeographical regions largely explains local variations in species richness across the planet.**

Conclusions

Species biodiversity in biogeographical regions tends to be spatially sorted into a core-to-transition organization. This finding aligns with a concept that has been implicit in the minds of biogeographers for centuries^5,9,28,29,49. The generality of the core-to-transition organization across biogeographical regions with distinct origins, ages, conditions and histories, as well as of taxa with distinct eco-evolutionary characteristics and requirements, suggests that the organization of regional biodiversity tends to converge in a predictable way. The cumulative evidence across space and taxa suggests the action of general mechanisms, which seem to be related to species filtering from regional hotspots and from the permeable boundaries of biogeographical regions. These findings and hypotheses align with established theory on how species are assembled locally from hypothetical species sources^45,47. These source areas, which occupy a relatively small area on Earth, probably have an invaluable influence on the biodiversity of the entire biogeographical region, making them potential targets for international conservation⁵⁰. Furthermore, our core-to-transition hypothesis and results show that global variations in species richness can be better understood by unravelling the genesis of regional hotspots and the subsequent filtering of species to the rest of the biogeographical region. In conclusion, this apparent rule on the organization of biodiversity in biogeographical regions, coupled with its relevance for understanding global variations in species richness, supports the processes and mechanisms underlying the core-to-transition organization reflecting some fundamental principles governing life on Earth.

Methods

Delineation of biogeographical regions

We obtained the species distributions of amphibians, dragonflies, mammals, rays and reptiles from the International Union for Conservation of Nature’s Red List (www.redlist.es), of birds from BirdLife (www.birdlife.org) and of trees from the Forest Inventory and Analyses National Program of the United States (www.fia.fs.usda.gov). We projected the species occurrences of the seven taxa onto seven distinct regular grids, with a resolution of 111 × 111 km for terrestrial animals¹⁹, 55.5 × 55.5 km for trees⁵¹ and 444 × 444 km rays for similarity to previous studies⁵². The number of grid cells varied across each taxon, depending on the species distribution and grid resolution (amphibians = 8,907 grid cells, birds = 10,757, mammals = 10,744, reptiles = 9,507, dragonflies = 5,110, trees = 3,019 and rays = 826, with the total = 48,870 grid cells). Although grid cells from different taxa could overlap geographically, each taxon was assigned its own distinct set of grid cells. All 48,870 grid cells were treated as independent analytical units in the subsequent k-means clustering analyses.

To calculate the biodiversity aspects in the grid cells, such as biota overlap, it was essential to delineate the biogeographical regions and assign species to the biogeographical region with which they were most closely associated. We used a well-established biogeographical method that both delineates the regions and assigns species to them in a single integrated process. Specifically, we employed Infomap^24,53, a community detection algorithm based on network theory^{19,24,25,53,54} (www.mapequation.org/infomap/; see details below). In network approaches, the species and grid cells are treated as two types of nodes in a bipartite network, linked based on species occurrence^19,54 (Extended Data Fig. 1). Infomap identifies modules that represent groups of highly connected grid cells and species. Infomap is based on information theory, and these modules represent the best compression of the systems’ information, capturing the key structural patterns within the network^24,53, which, in our case, was the co-occurrence patterns of species across the globe^19,54. The grid cells within a module are considered the geographical areas of a biogeographical region, whereas the species within the same module represent its characteristic species pool¹⁹. Characteristic species have their entire distribution range, or most of it, in their associated biogeographical region (thus including endemic, but also non-endemic, species). Those species present in a grid cell of a biogeographical region but not assigned to the same module were considered as non-characteristic species¹⁹ (Extended Data Fig. 1). Non-characteristic species can be more affined to another biogeographical region—that is, clustered in another module¹⁹. The absolute and relative occurrences of these characteristic and non-characteristic species allow the measurement of biodiversity aspects (see below). The biogeographical regions detected are congruent with those proposed in previous studies and other methodological approaches^{5,6,25,52,54,55,56,57,58,59} (Extended Data Figs. 2–8).

Identifying patterns in large communities is a hard problem. Infomap, as with most clustering and network community algorithms, identifies solutions using a heuristic procedure^19,23,24,60. To consider the heuristic search of Infomap and, thus, all possible biogeographical region delineations (avoiding local minima), we conducted 150,000 analyses for each taxon¹⁹, selecting for subsequent analyses the best delineation based on an information-theoretic criterion of Infomap called codelength^{19,24,53,54,61}.

We selected the community detection algorithm Infomap over alternative algorithms and clustering approaches for several reasons. First, identifying both the bioregions and their characteristic species is essential for calculating biodiversity metrics and evaluating general biodiversity patterns—our primary goal. Infomap integrates the clustering of grid cells and species into a unified methodological framework¹⁹, thereby avoiding additional steps that could introduce methodological complexity or subjectivity. Second, Infomap ensures one-to-one correspondence between regional species pools and biogeographical regions, aligning with the definition of the biogeographical region—Earth’s areas identified as distinct due to the presence of different species pools. Third, Infomap is widely used and accepted in the biogeographic community^19,25,53,61, and its regions have even served as benchmarks for new methods of biogeographical delineation⁶². In particular, Infomap produces biogeographical regions that are comparable to those from well-established biogeographical methods, such as agglomerative hierarchical clustering and modularity-based approaches^25,52,59, as well as being comparable to alternative clustering methods used in other fields of science, such as the stochastic block model⁶³ (Supplementary Information Appendix B). Moreover, Infomap-based biogeographical regions adequately represent the co-occurrence patterns of characteristic species (for example, the spatial congruence between bioregion boundaries and overlapping distribution ranges of characteristic species²⁵). Fourth, Infomap inherently determines the optimal number of clusters—here biogeographical regions—during its search process^23,53, eliminating the need for additional threshold-setting steps that could introduce methodological complexity and subjectivity^9,25. Fifth, Infomap offers additional advantages over other community detection algorithms²⁴. For instance, Infomap may be less affected by the resolution limit²⁴, which refers to the challenges of accurately identifying communities in datasets with high complexity or large volumes of data (see analyses on sensitivity to data extent below). Moreover, its heuristic search yields more stable, and then reliable, solutions⁶⁰. Finally, in methodological studies comparing method performance against ground-truthing, Infomap consistently ranks among the top-performing methods^64,65,66. For a non-specialized description of Infomap and its application in biogeography, see refs. ^54,61 and the supplementary material of ref. ¹⁹. In summary, we chose Infomap because it provides reliable biogeographical regions and also identifies their characteristic species—a critical step in addressing our primary goal of describing biodiversity patterns in biogeographical regions.

Biogeographical sectors

We identified geographical areas in biogeographical regions that exhibited similar biodiversity values, calling them biogeographical sectors. To characterize these biogeographical sectors, we used four biodiversity aspects that captured how biogeographical regions are the result of processes acting on both characteristic and non-characteristic species. These four biodiversity aspects were: the relative richness of characteristic species, which quantified how well a grid cell represented the characteristic species pool of a biogeographical region compared to other grid cells; the overlap of biotas, which measured the proportion of non-characteristic species in a grid cell; the relative occupancy of characteristic species, which measured the extent to which a characteristic species occupied its biogeographical region in comparison to other characteristic species of its biogeographical region; and the endemicity of characteristic species, which indicated the proportion of a characteristic species’ distribution range outside of its associated biogeographical region.

To quantify these four biodiversity aspects, we employed network cartography⁶⁷, a widely used approach in multiple fields of science used for gaining insights into the organization and connectivity of elements in complex systems (in our case, species and grid cells). Specifically, we used two measures: the within-module degree (z) and the connectivity across modules (C) for both grid cells and species. These metrics informed us on the link distribution of nodes inside and outside their respective module⁶⁷ (Extended Data Fig. 1). The z measures represented the relative connectivity of a given node within its associated module, measured as a z score ranging from minus infinity to infinity. For a grid cell, z_cell indicated the relative species richness of the characteristic species in a grid cell compared to other grid cells in the same biogeographical region. In each module, the grid cell with the highest z_cell best represented the characteristic species richness, and vice versa. For the species, z_spp denoted the relative occupancy of a species in its associated biogeographical region compared to the other characteristic species. In each module, the characteristic species with the highest z_spp occupied the largest area of the biogeographical region, and vice versa. Contrastingly, the connectivity across modules (C) measures the proportion of links of a node outside its module, ranging from 0 to 1. For the grid cells, C_cell measures the overlap of biotas as the proportion of non-characteristic species present in a given grid cell. Values of 0 indicate the absence of non-characteristic species with zero overlap of biotas, with lower values indicating the overlap of biotas. For example, a value of 0.4 would indicate that 40% of the species present in the grid cell were non-characteristic. For the species, C_spp measures the endemicity, quantified as the proportion of the characteristic species’ distribution area that fell within its biogeographical region. A value of 1 indicates that the characteristic species is endemic, with lower values indicating that part of its distributional area is also present in other biogeographical regions. To assign a value of occupancy, z_spp, and endemicity, C_spp, to each grid cell, we selected their present characteristic species and calculated the median values of z_zpp and C_spp (Extended Data Fig. 1). To quantify the four biodiversity aspects per grid cell, we defined:

$${\mathrm{Relative}}\,{\mathrm{species}}\,{\mathrm{richness}}=({I}_{c}-{\bar{I}}_{{m}_{c}})/{\sigma }_{{I}_{{m}_{c}}}$$

(1)

where I_c is the number of links of grid cell c to the species in its module, m, and ${\bar{I}}_{{m}_{c}}$ and ${\sigma }_{{I}_{{m}_{c}}}$ are the mean value and the standard deviation, respectively, of I_c over all grid cells in module m.

$${{\mathrm{Biota}}\; {\mathrm{overlap}}}={O}_{c}/{L}_{c},$$

(2)

where O_c is the number of links of grid cell c connecting with species outside its module, and L_c is the total number of links of grid cell c, including links inside (I_c) and outside (O_c) its module.

$${{\mathrm{Endemicity}}}={{\mathrm{Mdn}}}\left(\frac{{I}_{s}}{{L}_{s}}\right)\,,$$

(3)

where I_s is the number of links of the present characteristic species, s, connecting with grid cells inside its module, and L_s is the total number of links of the present characteristic species, s, including links inside and outside its module, and Mdn is the median of the values of the present characteristic species in the grid cell.

$${{\mathrm{Relative}}\; {\mathrm{occupancy}}}={{\mathrm{Mdn}}}\left(\frac{{I}_{s}-\,{\bar{I}}_{{m}_{s}}}{{\sigma }_{{I}_{{m}_{s}}}}\right),$$

(4)

where ${\bar{I}}_{{m}_{s}}$ and ${\sigma }_{{I}_{{m}_{s}}}$ are the mean value and the standard deviation, respectively, of I_s over all characteristic species in module m. We calculated the four biodiversity metrics in all the grid cells of each of the seven grids associated with the seven taxa.

The four biodiversity aspects did not strongly correlate (|Pearson’s coefficient| < 0.7 (ref. ⁶⁸)), and provided complementary information (see correlation values for each taxon and overall in Supplementary Tables 1–8).

To divide the grid cells into biogeographical sectors, we followed a two-step approach. In the first step, we conducted the following procedure seven times, once per taxon. We selected the grid cells of a given taxon and performed a k-means analysis to identify clusters of geographical areas with similar combinations of the four biodiversity aspects. To determine the optimal number of clusters, for each taxon, we used an elbow-like⁶⁹ method commonly employed in biogeography^25,70. We tested cluster numbers ranging from two to 30, and calculated the goodness-of-fit (GoF), defined as the ratio of the between-cluster variance to the total variance. Recognizing the heuristic nature of k-means, we repeated each k-means clustering 100 times and selected the partition with the highest GoF in each partition with cluster numbers ranging from two to 30. Then, in a piecewise regression, we modelled the best 29 GoF values (dependent variable) with the number of clusters (explanatory variable) to identify the inflection point or point of sharpest decrease⁷¹, by searching for the cluster count that resulted in the lowest residual standard error. This point reflects where adding more clusters no longer substantially increases the GoF, and thus represents an optimal balance between model complexity and clustering quality. We identified this point using a brute force iterative search⁷². At this stage, we obtained seven outcomes, one per taxon, indicating the optimal partition of grid cells to taxon-specific biogeographical sectors (Supplementary Table 12). We used these outcomes in the second step.

In the second step, to address whether the biogeographical sectors were taxon-specific or general across the taxa, we performed a general k-means clustering analysis that included all 48,870 grid cells from the seven taxa jointly. We evaluated 13 potential partitions, ranging from two to 14 clusters, with the maximum number constrained by the results from the taxon-specific analyses (first step) and computational feasibility. These clusters, representing geographical areas with similar biodiversity characteristics across all taxa, were termed general biogeographical sectors. To determine the optimal number of clusters, we sought the partition in the general k-means that showed the highest similarity to the taxon-specific results. For each of the 13 potential partitions conducted with all grid cells from the seven taxa (2–14 clusters), we separated the grid cells of each taxon to create seven individual sets. For each partition and set, we quantified the similarity in grid-cell grouping between the general k-means (second step) and the taxon-specific k-means (first step) using adjusted mutual information (AMI), a widely accepted metric for assessing clustering similarity⁷³. For each partition, we obtained seven AMI values, one per taxon, which were averaged to provide an overall similarity measure per partition. Our results showed that the partition with seven clusters produced the highest average AMI value. We considered that the seven-cluster partition provided the best description of biodiversity organization across the seven taxa, and used that as the basis for our subsequent analyses. Nevertheless, all subsequent results remained consistent, regardless of the number of clusters (sensitivity analyses using cluster 2–8 counts) (Supplementary Figs. 8–13 and Supplementary Tables 10 and 11). Although we chose the number of clusters that best represented all taxa, on average, the grid cells were free to group based on their biodiversity aspects. If the combinations of biodiversity aspects in the grid cells differed between taxa, the grid cells of each taxon were expected to form separate clusters. Conversely, if the combinations of biodiversity aspects were similar across taxa, the clusters would contain grid cells from all taxa.

All k-means clustering analyses, both taxon-specific and general, used the four biodiversity aspects of species richness, biota overlap, occupancy and endemicity as features. Before clustering, these metrics were standardized to account for different value ranges. We excluded those modules with missing values in any of the four metrics, including modules consisting solely of species without a clear biogeographical affinity, modules in which all the grid cells exhibited identical species richness and modules where all the species occupied an identical number of grid cells. These modules represented tiny biogeographical regions with non-biogeographical relevance to our goals.

Sensitivity analyses accounting for the distinct number of grid cells per taxon in the k-means clustering provided similar results (Extended Data Figs. 2–8, Supplementary Figs. 27–34 and details in Supplementary Information Appendix C). To maintain consistency in the k-means analyses in both the main and sensitivity analyses, we used the function kmeans.weight from the R package SWKM⁷⁴ in both cases (Supplementary Information Appendix C). Additionally, sensitivity analyses accounting for the distinct geographical extent of the data (global and continental) when delineating the biogeographical regions and calculating the biodiversity metrics also provided similar results (Supplementary Information Appendix D).

In the Supplementary Material, we provide R code for estimating the four biodiversity aspects of biota overlap, species richness, species occupancy and species endemism. This code also enables the clustering of grid cells based on these aspects to identify biogeographical sectors. Furthermore, the online tool Infomap Bioregions, for mapping biogeographical regions^54,61, now provides the values of these four biodiversity aspects (www.mapequation.org/bioregions2/).

Neighbour analyses

To examine the spatial relationship among biogeographical sectors, we assessed whether these sectors exhibited a higher degree of neighbourhood or adjacency to other sectors than expected by chance. With seven sectors, the expected chance probability of neighbouring a different sector was 1 in 6. To calculate the observed probability, we identified neighbouring grid cells belonging to different biogeographical sectors. Then, we evaluated whether the observed neighbouring probability surpassed the expected chance probability through binomial proportional tests. To account for variations in the number of neighbouring grid cells across different biogeographical sectors, we performed tests in both directions, assessing the neighbourhood between hypothetical sectors A and B (A → B and B → A). We deemed there was evidence of significant neighbourhood if either the A → B or B → A tests yielded P < 0.05. These analyses were carried out for each biogeographical region, depicting the frequency of instances where two sectors exhibited greater neighbouring than expected by chance. To ensure robust statistical analysis and avoid issues with expected values below five events in binomial proportional tests, we limited the analyses to combinations of biogeographical regions and sectors with over 30 neighbouring grid cells (five events or grid cells multiplied by the potential six neighbouring sectors). To perform the binomial proportion tests, we used the function prop.test in the R package stats⁷⁵. The results are illustrated in Fig. 1c, where the circle sizes represent the relative proportion of statistically significant neighbouring events across sector pairs in all biogeographical regions and taxa. Supplementary Table 9 provides details on the total number of biogeographical regions, across the seven taxa, where the two compared biogeographical sectors were represented—that is, at least one grid cell from each of the two sectors. Supplementary Table 9 provides details on the proportion of those latter regions where neighbouring between pairs of sectors was higher than expected.

Multinomial models

We examined whether the biogeographical sectors corresponded to geographical areas with distinct environmental conditions by using multinomial logit models with biogeographical sectors as the response variable. We assessed the existence of a correlation between the core-to-transition pattern with two widely used variables for explaining species diversity at the global scale: mean annual temperature and precipitation for the terrestrial biota and mean surface temperature and salinity for the marine biota. The data on mean annual temperature and precipitation were obtained from the Climatic Research Unit time series dataset v.4.06 (ref. ⁷⁶) and downscaled using WorldClim v.2.1 (ref. ⁷⁷), with the mean sea-surface temperature and sea-surface salinity data being obtained from the National Aeronautics and Space Administration’s Ocean Color Web website (http://oceancolor.gsfc.nasa.gov/) and the National Oceanic and Atmospheric Administration’s World Ocean Atlas 2009, and prepared by Sbrocco and Barber⁷⁸. To align the raster data with our grid, we computed the mean values of corresponding pixels within each grid cell.

We performed multinomial models for each taxon and biogeographical region, comparing a model including the two respective explanatory variables against a null model including only an intercept. Evidence was considered statistically significant if the difference in the corrected Akaike information criterion value exceeded 10 (ref. ⁷⁹). We analysed biogeographical regions with two or more sectors. To mitigate issues with sample size in multinomial models, we only evaluated biogeographical sectors with more than 15 grid cells. To ensure the representativeness of the complete biogeographical region, we only evaluated those biogeographical regions where the sum of the biogeographical sectors with more than 15 grid cells represented at least 90% of the entire region. To assess the fit of the multinomial models, we used McFadden’s pseudo-R². We performed multinomial models using the multinom function from the R package nnet⁸⁰, and we calculated McFadden’s R² using the PseudoR2 function from the R package DescTools⁸¹.

We also explored the potential correlation between present biodiversity and past climatic conditions (Supplementary Information Appendix A). We conducted sensitivity analyses to consider the spatial autocorrelation in the model residuals, obtaining similar results (Supplementary Information Appendix E).

Taxonomic dissimilarity: nestedness versus turnover

We examined whether the variation in species composition among biogeographical sectors, measured by Sørensen pairwise dissimilarity, was more attributed to nestedness or to species turnover components³⁸. In our case, nestedness refers to dissimilarity arising because the species composition in one biogeographical sector is a subset of the species found in another sector, whereas turnover captures dissimilarity due to species replacement³⁸. For each bioregion and taxon, considering all present biogeographical sectors together, we computed the proportion of the taxonomic dissimilarity between sectors attributed to nestedness. This proportion was calculated as the ratio of the nestedness component to the total Sørensen dissimilarity. We calculated the distinct components using the beta.multi function from the R package betapart⁸². We calculated the nestedness for characteristic and non-characteristic species separately. Because the non-characteristic species can be affined to distinct biogeographical regions, we conducted independent analyses for the non-characteristic species affiliated with each biogeographical region.

Local variation in species richness at the global scale

At our grid-cell resolution, variations in species richness were expected to depend on both the size of the regional species pool—resulting from the balance of speciation, extinction and biogeographical dispersal^45,46—and the sorting of species from that pool^47,48—in this study, associated with the core-to-transition organization and environmental filters. Previous studies has approximated the size of the regional species pool as the average species richness across locations within a biogeographical region⁴⁶. Thus, the centred richness, calculated as the observed species richness in each grid cell minus the average richness in that biogeographical region, may be considered as representing species sorting within the region independently of the pool size. Therefore, to assess the relative importance of these factors, we modelled species richness across grid cells using three explanatory variables: (1) the total number of characteristic species per biogeographical region, obtained from the network analyses (a proxy for the size of the regional species pool); (2) the centred richness of characteristic species; and (3) the centred richness of non-characteristic species. These three variables provide an approximate decomposition of species richness. While these three variables together should explain a large proportion of the variance in species richness, this decomposition allowed us to explore the independent effect of each variable, and thus which regional effect was more relevant in explaining the global variance in local species richness. The size of the regional species pool would be more important in explaining global patterns in species richness if there were large differences in the size of the regional species pool among the biogeographic regions and the core-to-transition organization was, in general, driven by relatively few species. By contrast, the sorting of characteristic and non-characteristic species would be more important if there were small variations in the size of the regional species pools, and the core-to-transition organization was driven by relatively many species. These alternatives informed on whether global variations in species richness are more associated with the formation of the biogeographical regions and the species diversification and dispersals (size of regional species pool) or with the sorting of species due to environmental filters (core-to-transition organization).

To evaluate the relative importance of each factor, we fitted linear models of the grid-cell richness as a function of the three aforementioned variables. Then, we calculated the variance partitioning of the three explanatory variables, focusing on the individual fraction explained by those—that is, the variance explained by a variable after removing the variance shared with the other explanatory variables. This individual fraction of variance of a given variable was measured as the difference between the R² of the saturated model and the R² of a model without the focal variable. To prevent inference biases stemming from larger biogeographical regions, we used weighted regression, where the cell values were weighted by the inverse size of the biogeographical region. The results from this sensitivity analysis were qualitatively similar using unweighted regression (Supplementary Fig. 35). We performed linear regressions using the lm function from the R package stats⁷⁵, and obtained the adjusted R² by applying the R function summary to the output of the linear regression models.

Species richness in core area higher than expected

The core areas of each biogeographical region tend to comprise 30% of the biogeographical region while harbouring more species than the remaining 70%. This observation suggests a potential conservation value for these core areas⁵⁰. To assess whether the species richness was higher than expected by chance, we conducted a randomization test for each combination of biogeographical region and taxon. We randomized the identity of the biogeographical sectors in the grid cells while maintaining the total number of grid cells per biogeographical sector and region. This process was repeated 100 times. We then evaluated whether the observed species richness in the core areas was higher than that in the randomized core.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data supporting the findings of this study are publicly available from established repositories. The species distribution maps for amphibians, mammals, reptiles, rays and dragonflies were obtained from the International Union for Conservation of Nature (https://www.iucnredlist.org). The bird species distributions were sourced from BirdLife International (https://www.birdlife.org) and the tree occurrence data from the United States Forest Inventory and Analysis Program (https://www.fia.fs.usda.gov). The climate data used in this study were obtained from the Climatic Research Unit time series dataset v.4.06 and WorldClim v.2.1 databases for the terrestrial taxa and from the National Aeronautical and Space Administration’s Ocean Color Web website and the National Oceanic and Atmospheric Administration’s World Ocean Atlas 2009 for the marine taxa. All raw distribution data are freely accessible for academic use upon request from the respective repositories or via their websites.

Code availability

The R code used to calculate the biodiversity metrics, including the species richness, biota overlap, occupancy and endemicity, and to identify the biogeographical sectors through k-means clustering is provided as part of the Supplementary Information. The scripts are annotated and reproduce the core analytical procedures described in the manuscript. All scripts are intended for academic use and include documentation of input formats and parameter configurations.

References

Calatayud, J. et al. Pleistocene climate change and the formation of regional species pools. Proc. R. Soc. B: Biol. Sci. 286, 20190291 (2019).
Article Google Scholar
Hazzi, N. A., Moreno, J. S., Ortiz-Movliav, C. & Palacio, R. D. Biogeographic regions and events of isolation and diversification of the endemic biota of the tropical Andes. Proc. Natl Acad. Sci. USA 115, 7985–7990 (2018).
Article CAS PubMed PubMed Central Google Scholar
Harris, T., Ottaviani, G., Mulligan, M. & Brummitt, N. Trait hypervolumes based on natural history collections can detect ecological strategies that are distinct to biogeographic regions. J. Ecol. 111, 314–326 (2023).
Article Google Scholar
Sclater, P. L. On the general geographical distribution of the members of the Class Aves. J. Proc. Linn. Soc. Lond. Zool. 2, 130–136 (1858).
Article Google Scholar
Wallace, A. R. The Geographical Distribution of Animals (Cambridge Univ. Press, 1876).
Holt, B. G. et al. An update of Wallace’s zoogeographic regions of the world. Science (1979) 339, 74–78 (2013).
CAS Google Scholar
Ficetola, G. F., Mazel, F. & Thuiller, W. Global determinants of zoogeographical boundaries. Nat. Ecol. Evol. 1, 0089 (2017).
Article Google Scholar
Ficetola, G. F., Mazel, F., Falaschi, M., Marta, S. & Thuiller, W. Determinants of zoogeographical boundaries differ between vertebrate groups. Glob. Ecol. Biogeogr. 30, 1796–1809 (2021).
Article Google Scholar
Morrone, J. J. in Neotropical Biogeography (ed. Morrone, J. J.) Ch. 4 (CRC, 2017).
Ricklefs, R. E. Disintegration of the ecological community. Am. Nat. 172, 741–750 (2008).
Article PubMed Google Scholar
Gouveia, S. F., Hortal, J., Cassemiro, F. A. S., Rangel, T. F. & Diniz-Filho, J. A. F. Nonstationary effects of productivity, seasonality, and historical climate changes on global amphibian diversity. Ecography 36, 104–113 (2013).
Article Google Scholar
Antonelli, A. et al. Amazonia is the primary source of neotropical biodiversity. Proc. Natl Acad. Sci. USA 115, 6034–6039 (2018).
Article CAS PubMed PubMed Central Google Scholar
Leslie, A. B. et al. Hemisphere-scale differences in conifer evolutionary dynamics. Proc. Natl Acad. Sci. USA 109, 16217–16221 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gaston, K. J. Latitudinal gradient in species richness. Curr. Biol. 17, R574 (2007).
Wright, I. J. et al. Global climatic drivers of leaf size. Science (1979) 357, 917–921 (2017).
CAS Google Scholar
Brown, L. E. et al. Functional diversity and community assembly of river invertebrates show globally consistent responses to decreasing glacier cover. Nat. Ecol. Evol. 2, 325–333 (2018).
Article PubMed Google Scholar
MacArthur, R. H. & Wilson, E. O. The Theory of Island Biogeography, Vol. 1 (Princeton Univ. Press, 2001).
Gaston, K. J. Global patterns in biodiversity. Nature 405, 220–227 (2000).
Bernardo-Madrid, R. et al. Human activity is altering the world’s zoogeographical regions. Ecol. Lett. 22, 1297–1305 (2019).
Article PubMed Google Scholar
Willig, M. R., Kaufman, D. M. & Stevens, R. D. Latitudinal gradients of biodiversity: pattern, process, scale, and synthesis. Annu. Rev. Ecol. Evol. Syst. 34, 273–309 (2003).
Orme, C. D. L. et al. Global patterns of geographic range size in birds. PLoS Biol. 4, e208 (2006).
Orme, C. D. L. et al. Global hotspots of species richness are not congruent with endemism or threat. Nature 436, 1016–1019 (2005).
Article CAS PubMed Google Scholar
Rosvall, M. & Bergstrom, C. T. Maps of random walks on complex networks reveal community structure. Proc. Natl Acad. Sci. USA 105, 1118–1123 (2008).
Article CAS PubMed PubMed Central Google Scholar
Smiljanić, J. et al. Community detection with the map equation and infomap: theory and applications. Preprint at https://arXiv.org/abs/2311.04036 (2023).
Vilhena, D. A. & Antonelli, A. A network approach for identifying and delimiting biogeographical regions. Nat. Commun. 6, 6848 (2015).
Article CAS PubMed Google Scholar
Cox, C. B., Moore, P. D. & Ladle, R. J. Biogeography: An Ecological and Evolutionary Approach (Wiley-Blackwell, 2016).
Mora, C., Chittaro, P. M., Sale, P. F., Kritzer, J. P. & Ludsin, S. A. Patterns and processes in reef fish diversity. Nature 421, 933–936 (2003).
Article CAS PubMed Google Scholar
White, A. E., Dey, K. K., Mohan, D., Stephens, M. & Price, T. D. Regional influences on community structure across the tropical-temperate divide. Nat. Commun. 10, 2646 (2019).
Article PubMed PubMed Central Google Scholar
Kreft, H. & Jetz, W. Comment on ‘An update of Wallace’s zoogeographic regions of the world’. Science 341, 343 (2013).
Article CAS PubMed Google Scholar
Holt, B. G. et al. Response to comment on ‘An update of Wallace’s zoogeographic regions of the world’. Science 341, 343 (2013).
Article CAS PubMed Google Scholar
Kirk, M. A., Rahel, F. J. & Laughlin, D. C. Environmental filters of freshwater fish community assembly along elevation and latitudinal gradients. Glob. Ecol. Biogeogr. 31, 470–485 (2022).
Article Google Scholar
Morales-Castilla, I., Rodríguez, M. Á., Kaur, R. & Hawkins, B. A. Range size patterns of New World oscine passerines (Aves): insights from differences among migratory and sedentary clades. J. Biogeogr. 40, 2261–2273 (2013).
Article Google Scholar
Calatayud, J., Neuman, M., Rojas, A., Eriksson, A. & Rosvall, M. Regularities in species’ niches reveal the world’s climate regions. eLife 10, 1–25 (2021).
Article Google Scholar
Pinkert, S. et al. Evolutionary processes, dispersal limitation and climatic history shape current diversity patterns of European dragonflies. Ecography 41, 795–804 (2018).
Article Google Scholar
Sommer, B., Harrison, P. L., Beger, M. & Pandolfi, J. M. Trait-mediated environmental filtering drives assembly at biogeographic transition zones. Ecology 95, 1000–1009 (2014).
Article PubMed Google Scholar
Hortal, J. et al. Ice age climate, evolutionary constraints and diversity patterns of European dung beetles. Ecol. Lett. 14, 741–748 (2011).
Article PubMed Google Scholar
Chiarucci, A. et al. Exploring patterns of beta‐diversity to test the consistency of biogeographical boundaries: a case study across forest plant communities of Italy. Ecol. Evol. 9, 11716–11723 (2019).
Article PubMed PubMed Central Google Scholar
Baselga, A. Partitioning the turnover and nestedness components of beta diversity. Glob. Ecol. Biogeogr. 19, 134–143 (2010).
Article Google Scholar
Dunn, R. R. et al. Climatic drivers of hemispheric asymmetry in global patterns of ant species richness. Ecol. Lett. 12, 324–333 (2009).
Article PubMed Google Scholar
Silva, S. M. et al. A dynamic continental moisture gradient drove Amazonian bird diversification. Sci. Adv. 5, eaat5752 (2019).
Article PubMed PubMed Central Google Scholar
Capinha, C., Essl, F., Seebens, H., Moser, D. & Pereira, H. M. The dispersal of alien species redefines biogeography in the Anthropocene. Science 348, 1248–1251 (2015).
Article CAS PubMed Google Scholar
Harrison, S. & Noss, R. Endemism hotspots are linked to stable climatic refugia. Ann. Bot. 119, 207–214 (2017).
Article PubMed PubMed Central Google Scholar
Igea, J. & Tanentzap, A. J. Multiple macroevolutionary routes to becoming a biodiversity hotspot. Sci. Adv. 5, 8067–8073 (2019).
Article Google Scholar
Mittelbach, G. G. et al. Evolution and the latitudinal diversity gradient: speciation, extinction and biogeography. Ecol. Lett. 10, 315–331 (2007).
Article PubMed Google Scholar
Ricklefs, R. E. Intrinsic dynamics of the regional community. Ecol. Lett. 18, 497–503 (2015).
Article PubMed Google Scholar
Ricklefs, R. E. & He, F. Region effects influence local tree species diversity. Proc. Natl Acad. Sci. USA 113, 674–679 (2016).
Article CAS PubMed PubMed Central Google Scholar
Harrison, S. & Cornell, H. Toward a better understanding of the regional causes of local community richness. Ecol. Lett. 11, 969–979 (2008).
Article PubMed Google Scholar
Zobel, M. The relative of species pools in determining plant species richness: an alternative explanation of species coexistence? Trends Ecol. Evol. 12, 266–269 (1997).
Article CAS PubMed Google Scholar
Müller, S. Ueber den Charakter der Thierwelt auf den Inseln des indischen Archipels, ein Beitrag zur zoologischen Geographie. Arch. Naturgesch. 12, 109–128 (1846).
Google Scholar
Myers, N. et al. Biodiversity hotspots for conservation priorities. Nature 403, 853–858 (2000).
Article CAS PubMed Google Scholar
Olson, R. J., Johnson, K. R., Zheng, D. L. & Scurlock, J. M. O. Global and Regional Ecosystem Modeling: Databases of Model Drivers and Validation Measurements. Technical Memorandum ORNL/TM-2001/196 (Oak Ridge National Laboratory, 2001).
Costello, M. J. et al. Marine biogeographic realms and species endemicity. Nat. Commun. 8, 1–9 (2017).
Article CAS Google Scholar
Rosvall, M., Axelsson, D. & Bergstrom, C. T. The map equation. Eur. Phys. J.: Spec. Top. 178, 13–23 (2009).
Google Scholar
Edler, D., Guedes, T., Zizka, A., Rosvall, M. & Antonelli, A. Infomap bioregions: interactive mapping of biogeographical regions from species distributions. Syst. Biol. 66, 197–204 (2017).
PubMed Google Scholar
Takhtajan, A. Floristic Regions of the World (Univ. of California Press, 1986).
Google Scholar
Kreft, H. & Jetz, W. A framework for delineating biogeographical regions based on species distributions. J. Biogeogr. 37, 2029–2053 (2010).
Article Google Scholar
He, J., Lin, S., Li, J., Yu, J. & Jiang, H. Evolutionary history of zoogeographical regions surrounding the Tibetan Plateau. Commun. Biol. 3, 415 (2020).
Article PubMed PubMed Central Google Scholar
Morrone, J. J. & Ebach, M. C. Toward a terrestrial biogeographical regionalisation of the world: historical notes, characterisation and area. Aust. Syst. Bot. 35, 187–224 (2022).
Bloomfield, N. J., Knerr, N. & Encinas‐Viso, F. A comparison of network and clustering methods to detect biogeographical regions. Ecography 41, 1–10 (2018).
Article Google Scholar
Calatayud, J., Bernardo-Madrid, R., Neuman, M., Rojas, A. & Rosvall, M. Exploring the solution landscape enables more reliable network community detection. Phys. Rev. E 100, 052308 (2019).
Article CAS PubMed Google Scholar
Edler, D. et al. Infomap Bioregions 2 – exploring the interplay between biogeography and evolution. Preprint at https://arXiv.org/abs/2306.17259 (2023).
Maestri, R. & Duarte, L. Evoregions: mapping shifts in phylogenetic turnover across biogeographic regions. Methods Ecol. Evol. 11, 1652–1662 (2020).
Article Google Scholar
Peixoto, T. P. Descriptive vs. Inferential Community Detection in Networks: Pitfalls, Myths and Half-Truths (Cambridge Univ. Press, 2023).
Aldecoa, R. & Marín, I. Exploring the limits of community detection strategies in complex networks. Sci. Rep. 3, 2216 (2013).
Article PubMed PubMed Central Google Scholar
Lancichinetti, A. & Fortunato, S. Community detection algorithms: a comparative analysis. Phys. Rev. E 80, 056117 (2009).
Article Google Scholar
Tandon, A. et al. Community detection in networks using graph embeddings. Phys. Rev. E 103, 022316 (2021).
Article CAS PubMed Google Scholar
Guimera, R. & Amaral, N. Functional cartography of complex metabolic networks. Nature 433, 895–900 (2005).
Article CAS PubMed PubMed Central Google Scholar
Dormann, C. F. et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36, 27–46 (2013).
Article Google Scholar
Salvador, S. & Chan, P. Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In Proc. 16th IEEE International Conference on Tools with Artificial Intelligence 576–584 (IEEE, 2004).
Daru, B. H., Karunarathne, P. & Schliep, K. phyloregion: R package for biogeographical regionalization and macroecology. Methods Ecol. Evol. 11, 1483–1491 (2020).
Article Google Scholar
Bernardo-Madrid, R., Vera, P., Gallardo, B. & Vilà, M. Stopping winter flooding of rice fields to control invasive snails has no effect on waterbird abundance at the landscape scale. Front. Ecol. Evol. 10.3389/fevo.2021.688325 (2022).
Crawley, M. J. The R Book (Wiley, 2012).
Vinh, N. X., Epps, J. & Bailey, J. Information theoretic measures for clusterings comparison: is a correction for chance necessary? In Proc. 26th Annual International Conference on Machine Learning (eds Bottou, L. & Littman, M.) 1073–1080 (Association for Computing Machinery, 2009).
Zhang, W., Wangwu, J. & Lin, Z. in Statistical Modeling in Biomedical Research: Contemporary Topics and Voices in the Field (eds Zhao, Y. & Chen, D.-G.) 37–64 (Springer, 2020).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2023).
Harris, I., Osborn, T. J., Jones, P. & Lister, D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci. Data 7, 109 (2020).
Article PubMed PubMed Central Google Scholar
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1‐km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
Article Google Scholar
Sbrocco, E. J. & Barber, P. H. MARSPEC: ocean climate layers for marine spatial ecology. Ecology 94, 979 (2013).
Article Google Scholar
Anderson, D. & Burnham, K. Model Selection and Multi-Model Inference 2nd edn (Springer-Verlag, 2004).
Venables, W. N. & Ripley, B. D. Modern Applied Statistics with S (Springer, 2002).
Signorell, A. DescTools: tools for descriptive statistics. R package version 0.99.51 https://CRAN.R-project.org/package=DescTools (2023).
Baselga, A. et al. betapart: partitioning beta diversity into turnover and nestedness components. R package version 1.6 https://CRAN.R-project.org/package=betapart (2023).

Download references

Acknowledgements

We thank J. Hortal, A. Benítez-López, V. Hermoso and C. Venditti’s group for their comments on the text. We thank D. Edler for commenting on the text and incorporating the estimation of the four biodiversity aspects in the online tool Infomap Bioregions (www.mapequation.org/bioregions2/). We also thank J. Smiljanic for help with the stochastic block modelling analyses. We thank K. Stewart, P. Romero-Vidal and F. Hiraldo for discussions on this work. We also thank I. Afán and D. Aragonés for their help with grid-cell creation. Finally, we thank E. Gerber for his insights on residual estimation in multinomial models. R.B.-M. was supported by an Olle Engkvist Foundation grant (220-0175) and the Spanish Ministry of Economy and Competitiveness (BES-2013-065753). M. Rosvall was supported by the Swedish Research Council under grant 2023-03705. J.A. was supported by the Basque Government’s Postdoctoral Programme for the Improvement of Doctoral Research Staff (POS_2024_1_0026) and by Basque Government funding for the FisioKlima-AgroSosT research group (IT1682–22). E.R. was funded by the Spanish Ministry of Economy and Competitiveness (CGL2012-35931/BOS). M. Rueda was funded by European Union project 2020/125-US/JUNTA/FEDER EU (Programa Operativo FEDER/Junta de Andalucía 2014–2020). This research was also funded by the Spanish Ministry of Science and Innovation (MICINN) through the European Regional Development Fund (SUMHAL, LIFEWATCH-2019-09-CSIC-13, POPE 2014–2020) and the Ministry of Universities through the University of Seville, as part of the call for the Requalification of the Spanish University System 2021–2023, funded by the European Union’s NextGenerationEU.

Funding

Open access funding provided by Umea University.

Author information

Authors and Affiliations

Integrated Science Lab, Department of Physics, Umeå University, Umeå, Sweden
R. Bernardo-Madrid & M. Rosvall
Departmento de Biología Vegetal y Ecología, Universidad de Sevilla, Seville, Spain
R. Bernardo-Madrid & M. Rueda
Ecology and Evolutionary Biology, School of Biological Sciences, University of Reading, Reading, UK
R. Bernardo-Madrid & M. González-Suárez
Department of Conservation Biology, Doñana Biological Station, Consejo Superior de Investigaciones Cientificas (CSIC), Seville, Spain
R. Bernardo-Madrid, E. Revilla & J. L. Tella
Department of Physical, Chemical and Natural Systems, Universidad Pablo de Olavide, Seville, Spain
M. Carrete
Universidad de Alcalá, Department of Life Sciences, Forest Ecology and Restoration Group (FORECO), Alcalá de Henares, Spain
J. Astigarraga
Department of Physical Geography and Ecosystem Science, Lund University, Lund, Sweden
J. Astigarraga
Departamento de Biología y Geología, Física y Química Inorgánica, Escuela Superior de Ciencias Experimentales y Tecnología (ESCET), Universidad Rey Juan Carlos, Madrid, Spain
J. Calatayud
Global Change Research Institute, Rey Juan Carlos University, Madrid, Spain
J. Calatayud

Authors

R. Bernardo-Madrid
View author publications
Search author on:PubMed Google Scholar
M. González-Suárez
View author publications
Search author on:PubMed Google Scholar
M. Rosvall
View author publications
Search author on:PubMed Google Scholar
M. Rueda
View author publications
Search author on:PubMed Google Scholar
E. Revilla
View author publications
Search author on:PubMed Google Scholar
M. Carrete
View author publications
Search author on:PubMed Google Scholar
J. L. Tella
View author publications
Search author on:PubMed Google Scholar
J. Astigarraga
View author publications
Search author on:PubMed Google Scholar
J. Calatayud
View author publications
Search author on:PubMed Google Scholar

Contributions

R.B.-M. and J.C. conceived the study. R.B.-M., M.G.-S., J.C., M. Rueda and J.A. curated the data. R.B.-M., J.C. and M. Rosvall developed the biodiversity metrics. R.B.-M. and J.C. designed the statistical framework. R.B.-M. performed the data analysis and produced all the figures and visualizations, with contributions from J.C. and M.G.-S. R.B.-M., J.C. and M.G.-S. developed the manuscript outline. R.B.-M. wrote the first draft. All authors provided critical revisions and approved the final version of the manuscript.

Corresponding author

Correspondence to R. Bernardo-Madrid.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Juan José Morrone, Marta Sales-Pardo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Workflow to obtain biogeographical regions, characteristic and non-characteristic species and biogeographical sectors.

(Step 1) Hypothetical species distribution in grid cells. We called grid cells using chess terminology (columns letters and rows numbers). (Step 2) Representation of species distribution in a network format. Species and grid cells are different types of nodes where links represent the occurrence of a species in a grid cell. (Step 3) Output of the community detection algorithm (also called network clustering algorithms) as a set of modules, represented with distinct colours, containing the nodes assigned. In the case of Infomap, this clustering is based on Map Equation a flow-based and information-theoretic method^23,24. See Supplementary Information of Bernardo-Madrid et al.¹⁹ for a non-specialized description. (Step 4) Visualization of biogeographical regions, characteristic species, and non-characteristic species. (Step 5) Calculation of the four-biodiversity metrics based on network cartography. (Step 6) Clustering of the grid cells with a k-means using the four-biodiversity metrics scaled as features.

Extended Data Fig. 2 Biogeographical regions and sectors of amphibians.

(a) Amphibian’s bioregions. White areas indicate geographical regions where this taxon is absent, or grid cells overlap with ≤ 50% of the Earth’s surface. (b) General biogeographical sectors within amphibian bioregions. Solid black lines represent the boundaries of the biogeographical regions, while colours denote different biogeographical sectors. Warmer colours indicate geographical areas with relatively low overlap of biotas, reflecting distinct biogeographical affinities (i.e. overlap of distinct characteristic species pools). Cooler colours represent areas with higher overlap. Darker and lighter shades within the warm and cold colour ranges, respectively, indicate high and low richness of characteristic species. See Fig. 1 for relationship between the biogeographical sectors and the four-biodiversity metrics. Some biogeographical regions, mostly depicted by blue colours, may represent widely recognized transitional zones. Nonetheless, these biogeographical regions can also correspond to lower biogeographical hierarchical levels, such as subregions or domains⁵⁸. The identification of these highly transitional biogeographical regions opens possibilities for clustering them into higher biogeographical hierarchical scales, such as the Western subregion of the Neartic or the Chacoan subregion of the Neotropical⁵⁸. However, to ensure objectivity in delineating bioregions and avoid subjective criteria for thresholds1^9,56, all bioregions with statistical support were studied independently of their potential biogeographical hierarchical level⁵⁸.

Extended Data Fig. 3 Biogeographical regions and sectors of birds.

(a) Bioregions of birds. (b) Biogeographical sectors within biogeographical regions of birds. See details on caption of Fig. 1 and Extended Data Fig. 2.

Extended Data Fig. 4 Biogeographical regions and sectors of dragonflies.

(a) Bioregions of dragonflies. (b) Biogeographical sectors within biogeographical regions of dragonflies. See details on caption of Fig. 1 and Extended Data Fig. 2.

Extended Data Fig. 5 Biogeographical regions and sectors of mammals.

(a) Bioregions of mammals. (b) Biogeographical sectors within biogeographical regions of mammals. See details on caption of Fig. 1 and Extended Data Fig. 2.

Extended Data Fig. 6 Biogeographical regions and sectors of rays.

(a) Bioregions of rays. (b) Biogeographical sectors within biogeographical regions of rays. See details on caption of Fig. 1 and Extended Data Fig. 2.

Extended Data Fig. 7 Biogeographical regions and sectors of reptiles.

(a) Bioregions of reptiles. (b) Biogeographical sectors within biogeographical regions of reptiles. See details on caption of Fig. 1 and Extended Data Fig. 2.

Extended Data Fig. 8 Biogeographical regions and sectors of trees.

(a) Bioregions of trees. (b) Biogeographical sectors within biogeographical regions of trees. See details on caption of Fig. 1 and Supplementary Fig. 2.

Supplementary information

Supplementary Information (download PDF )

Supplementary Figs. 1–35, Tables 1–12 and Appendices A–E.

Reporting Summary (download PDF )

Peer Review File (download PDF )

Supplementary R Code 1 (download ZIP )

R code and objects to replicate to estimate biogeographical sectors.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bernardo-Madrid, R., González-Suárez, M., Rosvall, M. et al. A general rule on the organization of biodiversity in Earth’s biogeographical regions. Nat Ecol Evol 9, 1193–1204 (2025). https://doi.org/10.1038/s41559-025-02724-5

Download citation

Received: 14 February 2024
Accepted: 17 April 2025
Published: 04 June 2025
Version of record: 04 June 2025
Issue date: July 2025
DOI: https://doi.org/10.1038/s41559-025-02724-5