Introduction

The number of invasive wood-boring beetles is growing due to increasing global transportation and climate change1,2. These beetles often spend their entire life cycle or at least their immature stages within plants, making them difficult to detect during quarantine inspections3. They cause direct damage to the flora of invaded regions, resulting in significant economic impacts on forestry, agriculture, and urban street trees4. Among these species, longhorn beetles (Coleoptera: Cerambycidae) include some of the most notorious invaders, such as Anoplophora glabripennis (Motschulsky, 1854) and A. chinensis (Forster, 1771)5. More recently, additional invasive or potentially invasive longhorn beetles, including Aromia bungii (Faldermann, 1835)6, Psacothea hillaris (Pascoe, 1857)7, Anoplophora horsfieldii (Hope, 1843)8, Chlorophorus annularis (Fabricius, 1787)9, Xylotrechus chinensis (Chevrolat, 1852)10,11, and Olenecamptus bilobus (Fabricius, 1801)12, have been reported in the Northern Hemisphere.

Among them, Aromia bungii (Cerambycidae: Callichromatini Swainson, 1840), also known as the red-necked long-horned beetle, is a notable emerging invasive species that causes significant damage in invaded regions6. Native to Eastern Asia, including China, the Korean Peninsula, Mongolia, northern Vietnam, and the Russian Far East, A. bungii has established populations in Japan, Germany, and Italy and has been intercepted in shipments arriving in the UK, USA, and Australia6. While the extent of economic damage in both native and invaded regions has not been accurately estimated, it is clear that the agricultural (e.g., peach production) and cultural (e.g., cherry blossoming festivals in Korea and Japan) economies are experiencing significant impacts13,14.

Understanding the history of past invasions and predicting future dispersal are crucial when dealing with emergent invasive species such as A. bungii. Past dispersal routes can be inferred through population genetics. For example, population genetics suggests that Japanese invasive populations are likely the result of multiple independent introductions, with China being the most plausible source, despite the conclusions being limited due to insufficient genetic data and sample coverage13,14. On the other hand, future distribution and dispersal paths can be estimated via species distribution modelling (SDM).

SDMs have been widely applied in invasion biology to assess the potential spread and risk posed by invasive species on the basis of records from their native areas15,16. These models remain valuable tools for predicting areas that might be suitable for colonization by invasive species beyond their native ranges, particularly during the early stages of invasion or even before invasion occurs17.

Specifically, for phytophagous beetles, such as A. bungii, host plant availability significantly influences their future distribution18, and host plant abundance and diversity are crucial in enhancing their establishment and dispersal19,20. Therefore, integrating host plants in modelling produces more reliable distribution predictions of potentially suitable areas outside the native range of a target species21,22.

In this study, we conducted an SDM using host plants, land cover, and bioclimatic variables to predict the future distribution and potential invasiveness of Aromia bungii on a global scale.

Methods

Aromia bungii occurrence data

Occurrences of Aromia bungiiin its native and invaded ranges were manually downloaded from the Global Biodiversity Information Facility (GBIF, 2024)23 and iNaturalist (https://www.inaturalist.org/). The occurrence data used in the analyses are updated to December 16th, 2024. Only occurrences with coordinates in degrees with at least two decimal places of resolution were included in the analyses. This allowed us to associate each occurrence with habitat data derived from land cover cartography at a grid resolution of 0.04167° (see below), corresponding to approximately 5 km at the equator. Additionally, we checked for all potential issues (e.g., duplicate records, misidentifications, etc.). The dataset was further enhanced by incorporating species occurrences from S.L.‘s field surveys conducted in South Korea.

We acknowledge that prence-only data derived from repositories are inherently incomplete. Furthermore, for many species, these data may be subject to certain forms of bias. However, these biases can be identified when the species’ ecology is adequately documented. Consequently, the effect of bias can be mitigated to some extent, for example, by using background points allocations based on bias maps when modelling presence-only data.

The native range of A. bungii was retrieved from the Catalogue of Palaearctic Coleoptera, Chrysomeloidea I24, and cross-checked with Horrocks et al.6. The countries where the species is native are the People’s Republic of China, the Democratic People’s Republic of Korea, the Republic of Korea, and Vietnam. Overall, 408 species occurrences were recovered from the native range (raw data used for model calibration on native range), and 39 were retrieved from the invaded areas. Due to the limited number of occurrences available for the species’ native range and the fact that spatial thinning does not necessarily enhance model performance, we opted not to apply spatial thinning to the occurrence points in agreement with Ten Caten & Dallas25.

Environmental data

Bioclimatic and land cover data

Bioclimatic (19 strata) and elevation (i.e., digital elevation model (DEM)) data were downloaded from WorldClim 2.1 (available at: http://www.worldclim.com/version2) and averaged for the years 1970–2000, with a grid resolution of 0.00833°.

For the land cover layers, we used the Global 1-km Consensus Land Cover, updated to 2006 (available at http://www.earthenv.org26). The dataset contains 12 layers (evergreen/deciduous needleleaf trees [EDNT], evergreen broadleaf trees [EBT], deciduous broadleaf trees [DBT], mixed/other trees [MOT], shrubs [SHR], herbaceous vegetation [HERB], cultivated and managed vegetation [CMV], regularly flooded vegetation [RFV], urban/built-up [BU], barren lands [BR], open waters [OW] and Snow/Ice [SI]), each of which provides consensus information on the prevalence of one land-cover class, expressed as a percentage in a pixel having a resolution of 30 arc-seconds (i.e., 0.00833° resolution).

Both the bioclimatic data and land cover layers were rescaled to a 0.04167° resolution via the “aggregate” function included in the terrapackage27in R version 4.3.128.

To avoid collinearity issues, a preselection of bioclimatic covariates (removing those strongly correlated, i.e., those with an absolute value of Pearson’s correlation coefficient higher than 0.729) was performed via the “vifcor” function of the usdmR package30.

Host plant taxa

Aromia bungii host plants were obtained from Horrocks et al.6. If more than one species belonging to the same genus was known as a host plant or if congeneric species could be used as vicariant host plant species in areas potentially exposed to invasion risk, we used the genus as the host taxon. Otherwise, we used a single species as the host plant. The host taxa were Prunus spp., Malus spp., Pyrus spp., Eriobotrya japonica (Thunb.) (Rosaceae), Populus spp. (Salicaceae), Albizia julibrissin Durazz. (Fabaceae), Camellia sinensis (L.) Kuntze (Theaceae), Castanea spp. Blume (Fagaceae), Diospyros kaki Thunb. (Ebenaceae), Ilex spp. Sims (Aquifoliaceae), Juglans spp. L. (Juglandaceae), Morus spp. L. (Moraceae), and Punica granatum L. (Lythraceae).

For host plants at the genus level, data for all occurrences of each single species pertaining to the genus were downloaded from GBIF via the “occ_data” function available from the rgbifR package31. The same procedure was adopted for host plants at the species level.

The data for each host plant taxon were then combined into a single layer of georeferenced points (through the “as.ppp” function from the spatstatpackage32,33) to produce a density map with a kernel interpolation procedure via the “density.ppp” function from spatstat. The kernel interpolation radius of the points was estimated via the “bw.ppl” function from spatstat. We used the logarithm of the kernel density maps (i.e., the quantitative range map of each host plant taxon) to reduce possible bias in the host sampling effort. To avoid overestimating the distribution of each host taxon, the logarithm of their density layer was subsequently multiplied for the land cover type associated with the taxon habit (e.g., deciduous broadleaf trees) and/or the land cover type in which the taxon can be found if of economic interest (e.g., cultivated and managed vegetation); see Table 1 for the associations between the host plant taxon and land cover. This approach allows for the generation of a proxy for the quantitative distribution map of each host plant. Finally, as the last step, we produced the overall host plant layer, extracting the maximum density value retrieved in each grid cell among those of all the host plant layers. Since host plants can replace each other on a local scale, to avoid possible overestimation of the host plant density, we used the maximum density value instead of calculating the sum among all density layers.

Data on host plant taxa were downloaded from the GBIF on June 30, 2024 (range maps for all host taxa are available in the “.tiff” format in the supplementary materials).

Table 1 Schematic associations between the host plant taxon and land cover (DBT: deciduous broadleaf trees; MOT: mixed/other trees; SHR: shrubs; CMV: cultivated and managed vegetation; EBT: evergreen broadleaf trees; BU: urban/built-up).

Species distribution modelling

The modelling procedure was based on the ensemble modelling approach34,35 since the combined output derived from an ensemble model should tendentially have higher explanatory and predictive performances than a model obtained from a single algorithm36,37,38,39,40).

To better evaluate the contribution of environmental variables in explaining the spatial distribution of the species, bioclimatic and habitat variables were modelled into two separate ensemble models, and their respective output maps were combined in their spatial product, following the framework given in Ruzzier et al.22. This choice stems from the evidence that combining scenopoetic (bioclimatic; sensu Hortal et al.41) variables with those of a local character (habitat) would result in an underestimation of variables’ importance (see Ruzzier et al.22 for further details).

SDMs based on presence-only data rely directly on pseudoabsence (PA) or background data42. Therefore, PA selection was performed separately for the bioclimatic and habitat modelling procedures, given their different requirements in terms of PA allocation for the HSM with respect to the bioclimatic suitability model (BSM)22.

To assess the effects of habitat covariates on HSM, PAs should be identified strictly within known native species ranges22,43,44,45. Starting from this consideration, the HSM was developed over a narrow geographic area around occurrence data (narrow native calibration range; Fig. S1 in Sup. Mat. 1; for more details, refer to the PA selection for the HSM in the Methods section). Japan and Taiwan were excluded from the HSM calibration area (i.e., countries excluded from PA allocation) because they are outside the putative native range of the species; in these countries, A. bungiiis naturally absent, presumably because of the existence of a major biogeographic barrier (ocean) that the species is unable to overcome, whereas present occurrences are the result of human introduction13. For BSM, we assessed the effects of bioclimatic variables over a wider geographic area centred on A. bungii occurrences (wide calibration range, between 0° and 70° N and 70° E and 160°E; Fig. S2 in Sup. Mat. 1); in this case, we considered all areas geographically contiguous with A. bungii occurrences (i.e., mainland Central and East Asia), removing islands belonging to the Indomalayan bioregion due to biogeographic discontinuity rather than the actual lack of suitable bioclimatic conditions.

The HSM and BSM were developed via a use-vs-availability approach35, and both models return a map of suitability in terms of species presence probability. From the spatial product of the two maps of species presence probabilities, we obtained an overall suitability map, which returns the probability of finding the species in a given location on the basis of both habitat and bioclimatic covariates (see Ruzzier et al.22 for details).

The modelling procedure was performed via the biomod2 package version 4.2–6-246. In order to have just one occurrence per grid cell, we set the option “filter.raster = T” in the formatting data step (“BIOMOD_FormatingData” function).

The generalized additive model (GAM), generalized boosting model (GBM), random forest (RF), and maximum entropy (MAXNET) were used as modelling algorithms. The number of PAs was set to 4 times the number of occurrences, and the sets of PAs used were 20 for the HSM and 5 for the BSM22,47; the models were tested via 5-fold cross-validation runs22.

While developing the HSM, the “disk” strategy was adopted to randomly select PAs within a buffer ranging from 2.5 to 500 km to species occurrence. The lower limit of 2.5 km was established to prevent PAs from being placed too close to occurrence points, thereby minimizing the likelihood of locating PAs within suitable habitats. The upper limit of 500 km was set to avoid placing PAs in regions where differences are predominantly influenced by variations in bioclimatic conditions22.

For the BSM, a random selection of PAs was adopted in the entire selected area (other than the ocean). This approach of PAs allocation allows for a more consistent evaluation of the effect of bioclimatic covariates through the ratio created between presences and PAs along the latitudinal gradient22.

Models were calibrated using the settings included in the “bigboss” strategy, in accordance with the recommendations provided by the biomod2 developers46. As an alternative, we conducted a tuning process for the algorithms (GAM, GB, and RF; not for MAXNET, as the “bm_tuning” function cannot be applied in the biomod2 package). However, this approach proved to be less effective compared to the “bigboss” strategy. This conclusion was based on our empirical evidence. Specifically, in the tuning process, all sets of PAs are used to optimize the algorithms. Conversely, when using the “bigboss” strategy, the sets of PAs that are ineffective at distinguishing differences in environmental composition between occurrence points and background points are discarded.

Given the lack of external data to evaluate the models’ accuracy, a 5 cross-validations data-splitting procedure was performed using 80% of the data as the training set with the remaining 20% as the validation set. Each of the models obtained from both the HSM and BSM was validated via three validation metrics: the area under the receiver operating characteristic (ROC) curve (AUC48), the true skill statistic (TSS49), and Cohen’s kappa50, with thresholds of 0.9, 0.7, and 0.65, respectively49,51,52.

Habitat (HSM) and bioclimatic (BSM) ensemble models were calculated via the “EMwmean” function. To account for the performance of each individual model used in building the ensemble model, we weighted its contribution by setting a “proportional” option decay46. A three-run permutation procedure was used to assess the importance of each variable included in the ensemble model. The same ensemble model framework was adopted for both the HSM and the BSM.

The HSM and BSM were projected to the following four extents: (a) native countries only; (b) global, (c) Eastern and Southeast Asia (ESA), and (d) Western Palaearctic (WP; excluding Fennoscandia). The ESA projection provides a detailed map of the native range and the closely invaded area. The WP projection provides a map of where adventive populations were found.

The Boyce index53,54 was used to evaluate the modelling performance and was applied to the HSM, BSM, and overall suitability map for the narrow native calibration range (modEvA package55).

As the final step, the multivariate environmental similarity surface index (MESS15), implemented in the dismo package56, was used to detect possible extrapolation effects of the HSM and BSM at the global scale.

Results

Assessment of the performance of species distribution models and determination of influential variables

Overall, we used 202 occurrences to calibrate HSM and BSM in the A. bungii native range. Among the 12 land use layers, we excluded a priori from the HSM procedures of the SI and OW. None of the remaining land use covariates were pairwise correlated. After the VIF analysis, six of the 19 bioclimatic variables were retained for the BSM procedure (Bio2, Bio3, Bio8, Bio15, Bio18, and Bio19).

Overall, we developed 1200 models for the HSM (4 modelling algorithms × 20 sets of PAs × 5 cross-validation runs × 3 validation metrics) and 300 models for the BSM (4 modelling algorithms × 5 sets of PAs × 5 cross-validation runs × 3 validation metrics).

For the HSM, according to the ROC curve, 250 out of 400 (62.5%) models were outstanding (ROC > 0.9); following the TSS, 66 out of 400 (16.5%) models exhibited excellent performance (TSS > 0.7), and according to the KAPPA, 140 out of 400 (35.0%) models exhibited substantial performance (KAPPA > 0.6). Nevertheless, according to the three metrics of validation, virtually all the models were adequate since they exceeded the critical thresholds that define a successful model (ROC > 0.7, TSS > 0.5, KAPPA > 0.4). For the BSM, according to the ROC curve, all the models were outstanding (ROC > 0.9). Following the TSS, 97 out of 100 achieved excellent performance (TSS > 0.7), and according to the KAPPA, 97 out of 100 achieved substantial performance (KAPPA > 0.6). Additionally, for the BSM, all the models were found to be adequate, exceeding the critical thresholds for the three validation metrics (see Tables S1 and S2 in Sup. Mat. 2 for the validation metrics obtained for the HSM and BSM).

The best-performing models (456 out of 1200 for the HSM and 294 out of 300 for the BSM) were retained to realize the HSM and BSM ensemble models projected for the four extents. The ensemble model process allowed us to obtain the importance of covariates (Tables S3 and S4 in Sup. Mat. 2) and the corresponding response curves for the HSM and BSM; to obtain each single variable effect, we set all other variables to their median value (Figs. S3 and S4 in Sup. Mat. 1). Since all three HSM maps obtained via the ROC, TSS, and KAPPA metrics were highly correlated (pairwise Pearson’s correlation > 0.997), we decided to use the HSM built after ROC model selection. The same applied to the BSM maps, for which the pairwise Pearson’s correlation was greater than 0.996. Habitat suitability was affected mostly by host plants (variable importance HP.max: 0.504), followed by evergreen broadleaf trees (EBT: 0.149), cultivated and managed vegetation (CMV: 0.144), built-up areas (BU: 0.124), and herbaceous vegetation (0.104). Host plants and built-up areas had positive effects on the probability of species presence, whereas we found negative effects for evergreen broadleaf trees, cultivated and managed vegetation and herbaceous vegetation. Bioclimatic suitability was substantially and positively affected by the mean temperature of the wettest quarter (BIO8: 0.431) and the precipitation of the warmest quarter (variable importance BIO18: 0.277) and; in contrast, isothermality (BIO3: 0.360) was found to negatively influence the probability of species presence.

The Boyce indices used for the validation of both the HSM map (habitat suitability map and its 95% confidence intervals; Fig. S5 in Sup. Mat. 1) and the BSM map (bioclimatic suitability map and its 95% confidence intervals; Fig. S6 in Sup. Mat. 1) projected on their calibration ranges returned 0.983 and 0.997, respectively (see Figs. S7 and S8 in Sup. Mat. 1 for the Boyce graphs).

The overall suitability map (Fig. S9 in Sup. Mat. 1), which was calculated as the spatial product of the habitat suitability and bioclimatic suitability maps for the narrow native calibration range, yielded a Boyce index of 0.988 (see Fig. S10 in Sup. Mat. 1 for the Boyce graph). The good modelling performance can also be observed in the overall suitability map, where the presence points of A. bungii clustered within areas with a high probability of occurrence of the species (Fig. S9 in Sup. Mat. 1).

MESS analyses of habitat and bioclimatic variables revealed that projections were not substantially subject to extrapolation (Figs. S11 a and b in Sup. Mat. 1).

SDM projections at the global, ESA, and WP levels

The projections of HSM and BSM at the global, ESA, and WP scales allowed us to obtain habitat, bioclimatic, and overall species suitability maps at the three mentioned geographic extents (see Figs. 1 and 2, and 3, respectively).

In addition to the native range of A. bungii, according to the global habitat suitability map, the most favourable areas for the species were in Japan, different parts of northern and southern Australia, the Cape Region in South Africa, from the lower to high latitudes of the Western Palaearctic, central parts of European Russia and the Caucasus, eastern North America from the Gulf of Mexico to the Great Lakes Region, Pacific Southwest North America, including Texas and Baja California, central Mexico, Southeast Brazil, and multiple areas in central Argentina (Fig. 1a). Notably, although many areas appear potentially suitable due to the availability of habitats, ecological knowledge regarding A. bungii has led to the exclusion of areas located at relatively high latitudes that may be suitable for the species due to bioclimatic constraints. In contrast, the global bioclimatic suitability map highlights suitable areas only in Japan. However, moderate suitability was observed from the area south of the Great Lakes region towards the east. Slightly suitable areas appeared in central and eastern Europe and southern territories of European Russia (Fig. 1b). The overall suitability map, in addition to the native area, revealed three areas with different levels of suitability at the global scale (Fig. 1c): (a) Japan was the area with the highest suitability, except for the island of Hokkaido; (b) the southern area of the Great Lakes Region was moderately suitable, whereas other areas in eastern North America were only slightly suitable; and (c) central and eastern Europe up to the southern territories of European Russia appeared suitable, although to a very limited extent.

Fig. 1
figure 1

World projections of A. bungii suitability maps; (a) habitat suitability map; (b) bioclimatic suitability map; (c) overall suitability map (maps were created using the libraries sf57,58, rnaturalearth59 and ggplot260 in R).

The habitat suitability map of the ESA extent revealed a large availability of suitable areas (Fig. 2a), although in the native range, the habitat availability was rather limited to some areas along the coasts of the East China Sea and the Republic of Korea. Outside the native range of A. bungii, suitable areas were identified (a) along the land strip from Tyumen Oblast to Prymorsky Kray (Russia), which was found to be moderately to highly suitable; (b) in the Indian Peninsula, which was extensively but only moderately suitable; (c) in Japan, which was highly suitable overall; and (d) on the western coasts of Taiwan, which was highly suitable. The bioclimatic suitability map (Fig. 2b) highlights a wide and continuous suitable area but is limited to eastern China, the Democratic People’s Republic of Korea, and the Republic of Korea. A separate highly favourable bioclimatic area corresponds to Japan (Hokkaido excluded). Considering the overall suitability map for SE (Fig. 2c), the most suitable areas were limited to the Republic of Korea and coastal areas of China, where the species is native. In addition, highly suitable areas were also found in the invaded range of Japan, particularly on Honshu, Shikoku, and Kyushu Islands.

Fig. 2
figure 2

Eastern and Southeast Asia (ESA) projections of A. bungii suitability maps; (a) habitat suitability map; (b) bioclimatic suitability map; (c) overall suitability map (maps were created using the libraries sf57,58, rnaturalearth59 and ggplot260 in R).

The habitat suitability map of the Western Palaearctic (excluding Fennoscandia), the only area outside Asia where A. bungii presented adventive populations, appeared highly suitable to a large extent, except for Ireland and the western parts of Great Britain and central and Eastern Turkey. All desertic as well as alpine habitats were not suitable (Fig. 3a). However, the area was poorly suitable from a bioclimatic perspective (Fig. 3b). The overall suitability map of the western Palaearctic (Fig. 3c) displayed generally scarce suitability for the entire geographic extent, with the probability of species presence rarely exceeding 0.10.

Fig. 3
figure 3

Western Palaearctic (WP) projections of A. bungii suitability maps; (a) habitat suitability map; (b) bioclimatic suitability map; (c) overall suitability map (maps were created using the libraries sf57,58, rnaturalearth59 and ggplot260 in R).

Discussion

It is important to highlight that the species suitability must be interpreted according to the overall suitability map, which is defined by the combination of the HSM and BSM maps.

Based on the bioclimatic suitability map derived for the native range, the overall distribution of A. bungii appears to be largely constrained by bioclimatic conditions. However, within the bioclimatically suitable areas, habitat emerged as a limiting factor in the species distribution pattern, as shown by the habitat suitability map. When considering the overall suitability map, the combined influence of bioclimatic and habitat covariates on the species’ distribution pattern becomes even more apparent.

As expected, as an obligate endophytophagous and xylophagous species, A. bungii responds primarily to the presence of its host plants when bioclimatic needs are satisfied. Indeed, the importance of the host plant covariate is slightly more than four times greater than that of the only other covariate with positive effect, the built-up area (BU). A possible bias in species data collection may also contribute to the positive relationship between Aromia bungii and BU; occurrences of A. bungii might be more easily recorded in areas frequently visited by humans, who are therefore more likely to provide records associated with urban contexts. Notably, the species’ native area has been densely inhabited by humans since historical times; therefore, progressive anthropisation may also have affected its distribution. Indeed, it is possible that, at least in some areas, A. bungii might behave as a synanthropic species since the distribution of its host plants has been widely modified because of economic interest in gardening or agriculture. However, the importance of BU is substantially lower than that of host plants. The effect of evergreen-broadleaf trees (EBT) is negative, as most of the host species of A. bungii are deciduous broadleaf trees; this could be linked to the negative effects of these areas located outside the host plant range. Furthermore, cultivated and managed areas (CMV) negatively influence the presence of A. bungii. This could be linked to the negative effects of these areas located outside the host plants range. The effect of habitat management, together with the connectivity between suitable areas as factors influencing the probability of species presence, is consistent with the findings of Zhang et al.61 for a native context, and Osawa et al.62 for an invaded area.

Aromia bungii was positively influenced by warm temperatures, requiring 25–30 °C degrees during the warmest quarter of the year (Bio8), in areas with a marked seasonality (i.e., low isothermality index; Bio3 < 25), and with cumulative precipitation above 600 mm in the warmest quarter (Bio18). This result is partially in agreement with that of Horrocks et al.6. Indeed, in both our and Horrocks et al. studies, Bio18 was the most important variable defining the bioclimatic suitability for the species. However, discrepancies in the importance of other bioclimatic variables could derive from the different criteria used for the identification of the calibration area (calibration area identified within the bioregion of the species native range, in the present study, vs. worldwide, in Horrocks et al.6), which in turn may have influenced both the set of bioclimatic variables used in the modelling framework and the allocation of PAs; furthermore, the different algorithms may have produced appreciable differences between the two studies (ensemble model based on four algorithms, present study, vs. single algorithm, Horrocks et al.6). Some discrepancies in the variables used in the model developed by Zhang et al.61 and that of our study can be attributed to the two different calibration areas. Furthermore, the algorithms employed also differ between the two studies. Nonetheless, the suitable areas for A. bungii in China are essentially equivalent.

However, it should be highlighted how our and Horrocks et al.6 studies, even though one is calibrated in an area only slightly larger than the native range (i.e., within the biogeographical region of the species) and the other on a global scale, have produced similar bioclimatic suitability maps. These maps indicated that there were no highly bioclimatic suitable areas (e.g., probability of occurrence > 0.20) outside the native range that invaded Japan. The similarity of the results could be because, although Horrocks et al.6 allocated PAs on a global scale, PAs were never located in areas bioclimatically similar to those found in the native range and Japan, as such conditions strictly occur only there.

The global-scale habitat suitability map revealed large areas of potentially favourable establishment of A. bungii in various biogeographic areas of the world. This aspect has been analysed by us for the first time, and therefore it is not possible to compare it with other studies. This is probably due to the wide native distribution of host plants, combined with the fact that these plants have expanded in range and are largely modified by humans due to their economic interests. Conversely, the global-scale bioclimatic suitability map highlighted generally poor favourable conditions for A. bungii establishment, except for a few areas in East and Southeast Asia. The availability of suitable areas, according to habitat and bioclimatic conditions, highlights that favourable areas for A. bungii globally are restricted to some coastal areas of China towards the Yellow Sea, the Korean Peninsula, and Japan.

The predicted habitat suitability map for the ESA revealed multiple favourable areas for A. bungii. These areas encompass southern parts of eastern Siberia, the Indian subcontinent, coastal areas of China, and Japan. This pattern of suitability differed completely from that of the bioclimate, where the most suitable areas were strictly limited to the central and coastal areas of China, the Korean peninsula, and Japan. This highlights how bioclimatic conditions are an ecological forcing driver of particular importance in shaping the A. bungii range. This can be appreciated by observing the distribution of occurrences, both native and invasive, that fell right within the most suitable bioclimatic areas in the ESA. However, within this bioclimatically suitable area, habitat conditions play an important role in defining areas that are truly suitable for A. bungii. When the bioclimatic requirements were met, the species was detected only in areas with suitable habitats. The presence of A. bungii in Japan, with introduced and stable populations, is linked to the overall high suitability of this territory for the species in terms of both habitat and climate. The natural absence of A. bungiiin the area is most likely attributable to the presence of the sea, which acts as an important biogeographic barrier and separates Japan from the native range of the species. The role of the sea as a barrier is consistent with genetic data obtained for native Korean populations and invasive populations in Japan13,14.

The WP is the only area on a global scale where two adventive populations have been detected: Rosenheim (Germany63) and the Gulf of Naples (Italy64). The habitat suitability map revealed that the populations were in highly potentially favourable areas, likely due to the high availability of host plants within the wide anthropised landscapes in Europe. Given the nature of the host plants, which are of ornamental and fruit-growing interest, their abundance could be favoured even more by anthropised areas, as evidenced by the positive effects of land use, built-up areas and cultivated and managed vegetation. Conversely, the bioclimatic suitability map in the WP showed how the adventive populations were in poor to unfavourable areas. The very low bioclimatic suitability in the Mediterranean area is probably due mostly to the scarcity of rainfall during the warm period of the year, contrary to the requirements of the ecology of the species.

Conclusions

To properly assess the effects of covariates acting at local scales compared to those operating at broader scales, a two-step modeling approach is required. This ensures that the two types of variables do not obscure each other’s effects. Our approach has thus enabled a more effective evaluation of the impact of host plants and habitat context (included in the HSM), independently from bioclimatic variables (included in the BSM). The latter are, in fact, the variables most used to assess the invasiveness of non-native species at a global scale. However, this approach allows for the definition of suitable ranges but does not identify the specific areas within them that are invadable by a non-native species. By combining the two models (HSM x BSM), the two-step approach provides a way to assess, on a global scale, the areas that are truly invadable by A. bungii within bioclimatically favourable ranges.

In conclusion, based on the modelling findings, A. bungii would appear to lack the adaptive plasticity to become a particularly invasive species in contexts other than Southeast Asia. This limitation is inferred from the fact that in native areas, the species remains confined to bioclimatically favourable regions and does not spread outside them despite the wide availability of suitable habitats.

The fact that A. bungii is widely invasive in Japan is explained by the simultaneous natural presence of both bioclimatic and habitat conditions favourable to the species. Therefore, although Japan is geographically close to the mainland range of the species, the sea acted as an important geographic barrier that prevented island natural colonization.

It is therefore highly probable that the invasive populations of A. bungii in Japan, unlike those observed with other invasive species, did not undergo any ecological niche shift having already found in the invaded area all the conditions most favourable to them and not having been subjected to any selection events during the invasion process.

At present, there do not seem to be any ideal areas for the species globally outside of those identified in Southeast Asia. Aromia bungii appears to have very little ability to spread naturally in the Western Palaearctic. In fact, it seems improbable that introduced populations of Europe, despite the large availability of suitable habitat, could significantly expand their range because of inadequate bioclimatic conditions. However, it will be necessary to consider the areas of North America south of the Great Lakes, where models indicate areas whose suitability, though almost negligible, should not be ignored.

In general, introduced populations of A. bungii could hardly be expected to expand their range, except through a change in their bioclimatic niche, a process that, however, would seem unlikely based on the model calibrated on the native area and its projection on Southeast Asia.