Abstract
Species Distribution Models are commonly used with surface dynamic environmental variables as proxies for prey distribution to characterise marine top predator habitats. For oceanic species that spend lot of time at depth, surface variables might not be relevant to predict deep-dwelling prey distributions. We hypothesised that descriptors of deep-water layers would better predict the deep-diving cetacean distributions than surface variables. We combined static variables and dynamic variables integrated over different depth classes of the water column into Generalised Additive Models to predict the distribution of sperm whales Physeter macrocephalus and beaked whales Ziphiidae in the Bay of Biscay, eastern North Atlantic. We identified which variables best predicted their distribution. Although the highest densities of both taxa were predicted near the continental slope and canyons, the most important variables for beaked whales appeared to be static variables and surface to subsurface dynamic variables, while for sperm whales only surface and deep-water variables were selected. This could suggest differences in foraging strategies and in the prey targeted between the two taxa. Increasing the use of variables describing the deep-water layers would provide a better understanding of the oceanic species distribution and better assist in the planning of human activities in these habitats.
Similar content being viewed by others
Introduction
The use of Species Distribution Models (SDMs), that allow to model the distribution of species by establishing relationships between species occurrence or abundance and environmental data1, has increased considerably in recent decades. They are commonly used to address ecological issues as well as for conservation purposes, e.g. to identify areas of concentration and inform policies for the designation of protected areas2,3,4. SDMs have been extensively applied to marine top predators (e.g.5,6,7,8), as these species are threatened by multiple anthropogenic activities (e.g. collisions, pollutants, bycatch, underwater noise, habitat loss) leading to the decline of many populations9,10,11.
The distribution of marine top predators is expected to be mainly driven by prey abundance or availability12,13,14,15; yet the limited spatial and temporal extent of prey data available from in situ sampling is a major impediment to model predator distributions over large oceanic regions. Most studies solve this lack of prey data by using static bathymetric variables depicting the seafloor together with surface dynamic environmental variables depicting the water masses, as proxies for prey distribution. Surface dynamic variables include inter alia sea surface temperature, sea surface height, sea surface currents, surface chlorophyll a concentration irrespective of the actual depth range used by the species of interest5,8,16,17.
These surface environmental data are easily accessible (obtained from satellite imagery or numerical models) and available at various spatial (local to global) and temporal scales and resolutions (e.g. day, month, year, decades)6,18,19. However, most prey targeted by marine top predators live in the water column at variable depths operating on diel migration patterns20. Using environmental variables depicting the water column would seem more relevant than using surface variables. This would be particularly true for oceanic species that spend most of their time in deep waters and generally feed on mesopelagic and bathypelagic prey like deep-diving cetaceans21,22, here referring to the beaked whales (family Ziphiidae, represented in the north Atlantic by Ziphius cavirostris, Hyperoodon ampullatus and Mesoplodon spp.) and the sperm whale (family Physeteridae, Physeter macrocephalus). Most studies that model the habitat of deep-diving cetaceans use static and dynamic surface variables (e.g.5,16,23,24). However, deep-divers regularly perform long dives, sometimes to great depths25,26,27 and consequently, surface variables may be partly irrelevant to predict the distribution of their deep-dwelling prey.
Brodie et al.28 included two subsurface dynamic variables (isothermal layer depth and bulk buoyancy frequency) in SDMs to describe the habitat of pelagic predators (swordfish Xiphias gladius, blue sharks Prionace glauca, common thresher sharks Alopias vulpinus, and shortfin mako sharks Isurus oxyrinchus) in the California Current System. The authors found these variables increased the explanatory power and predictive performance of the models for most species. Along the same lines, Becker et al.29 fitted SDMs to depth environmental variables provided by an ocean circulation model and to in situ and remotely sensed oceanic variables, both considered as measured data. They showed similar performance between the two models, highlighting the good performance of environmental modelled data to describe species distribution.
The development and increasing availability of environmental variables that describe the ocean in three dimensions opens the possibility of a better understanding of the distribution of species such as deep-diving cetaceans. Dynamic variables that characterise deeper oceanic layers are expected to have a greater influence on their prey fields at depth than surface parameters alone. We propose to test this hypothesis by combining static variables (that characterise seafloor topography) with dynamic variables integrated over different depth classes, such as temperature, gradients of temperature and eddy kinetic energy, in Generalised Additive Models (GAMs30;) to model the distribution of deep-diving cetaceans in the Bay of Biscay, eastern North Atlantic. GAMs have already demonstrated their effectiveness in predicting the distribution of deep-diving cetaceans5,23,31. In this study, several depth classes, corresponding to different environments in the water column, were defined and relevant environmental variables were extracted for each class. Then, GAMs were fitted to assess the explanatory power of these variables. Finally, the distribution of deep-divers in the Bay of Biscay was predicted using the best models obtained. We expected that the use of environmental variables at depth would result in a more accurate modelling of top predator densities and a better understanding of the mechanisms that influence their distributions.
Results
During the model selection process, it turned out that the Akaike weights32 of all tested models were much lower than 0.9, probably because some variables were very similar and correlated. Consequently, we tried to approximate the average prediction obtained from all tested models by the prediction obtained from the model that combined the four most important variables for each species (Fig. 1). For both beaked and sperm whales, the coefficient of determination (R2) was close to 1 (respectively 0.93 and 0.97) so the predictions were very similar and the average prediction of all models could be approximated by the prediction of the model fitted to the four most important variables.
Comparison between the average prediction obtained from the models that explained 80% of the total Akaike weight and the prediction obtained from the model fitted to the four most important variables. If the coefficient of determination (R2) is close to 1, predictions are similar and the average prediction of all models can be approximated by the prediction of the model fitted to the four most important variables.
The importance of the variables was determined by summing the Akaike weights of the models in which they were selected. For beaked whales, the most important variables were the average temperature at the surface and the standard deviation of the temperature between 0 and 200 m (mTsurface and sdT0–200) and also static variables (roughness, slope, depth, surface of canyons). For sperm whales, the most important variables were the average temperature and the standard error of the gradients of temperature at the surface (mTsurface and sdGrTsurface) but also variables at depths between 200 and 600 m, the mean and standard deviation of the eddy kinetic energy, the average temperature and the average gradients of temperature (mEKE200–600, sdEKE200–600, mT200–600 and mGrT200–600), with mEKE200–600 being the most important variable. With the four most important variables, we obtained a unique model that combined the most important environmental variables in the pool of available variables and thus obtained functional relationships with these variables. The final models (Table 1 and Fig. 2) did not necessarily use the four most important variables as some of them were correlated (e.g., roughness and slope).
The functional relationships between beaked whale and sperm whale individual densities and the four most important and uncorrelated variables. Solid lines represent the estimated smooth functions and the blue shaded regions show the approximate 95% confidence intervals. The relative density of individuals (individuals per 100 km2) is shown on the y-axis, where a zero indicates no effect of the covariate. The black rug plot on the x-axis represents the distribution of the data. The percentages indicate the importance of the variables calculated by summing the Akaike weights of the models in which they were selected. D*: explained deviance; T: temperature; GrT: gradients of temperature; EKE: eddy kinetic energy; m: mean; sd: standard deviation.
Beaked whales
The final model of beaked whales explained 31.4% of the deviance and the root mean square error (RMSE) was equal to 0.46. The highest densities were obtained for a mTsurface above 15 °C, sdT0–200 less than 0.3 °C, a high roughness (> 200 m) and depths around 2000 m (Fig. 2). In the Bay of Biscay, the highest densities of beaked whales were predicted near the continental slope and in the abyssal plain, with maxima predicted in the canyons of the south-eastern Bay of Biscay off the Spanish coast (Capbreton, Torrelavega and Llanes canyons; Fig. 3; map of the Bay of Biscay canyons available in Appendix C). The uncertainties associated with the predictions were low (Appendix D).
Sperm whale
The explained deviance of the final sperm whale model was 24.3% and the RMSE was equal to 0.36. The highest densities were obtained for mEKE200–600 higher than 0.0015 m2·s−2, sdGrTsurface higher than 0.075 °C, mTsurface higher than 15 °C and mT200–600 less than 11.5 °C (Fig. 2). The highest densities in the Bay of Biscay were predicted in the abyssal plain, near the continental slope and near canyons and seamounts, such as the North Spanish Marginal Trough, the Biscay Seamount and the Cap Ferret and Capbreton canyons, where water movements were very dynamic (Fig. 3; map of the Bay of Biscay canyons available in Appendix C). Although the uncertainties associated with the functional relationships were large, the uncertainty associated with the prediction was fairly low except in the west where the area was less sampled (Appendix D).
Discussion
For animals that spend most of their time below the surface and feed mostly at great depths, such as beaked and sperm whales21,22, using surface environmental variables in SDMs, as commonly done to understand the mechanisms that influence their distribution5,16,23,24, may seem ecologically incomplete. The presence of animals at the surface is also probably related to mechanisms at depth. Our objective was to use, in addition to surface environmental variables, variables that depicted the water column to identify which variables were the most important for deep-divers, since we argued that surface variables might not explain well the physical processes occurring at depth that would influence the beaked and sperm whale distributions. Our results highlighted new relationships with the environment allowing to predict the highest densities of beaked whales and sperm whales near the continental slope, near canyons and seamounts and in the abyssal plain of the Bay of Biscay. Interestingly, we identified different responses between beaked whales, for which surface, subsurface and static variables were selected as the most important variables, and sperm whales for which no static variable was selected but only surface and deep-water variables.
It may seem inappropriate to use deep-water variables to explain the distribution of surface sightings but here we assumed that animals were observed at the surface where they were breathing or resting before or after having followed a prey aggregation at depth12,13,14,15. We expected that variables characterising the water masses at depth would be selected for beaked whales and that static variables would be selected for sperm whales. Brodie et al.28, showed that the addition of dynamic subsurface variables might not significantly improve the model predictive performance for species with strong responses to static variables, which would explain our findings for beaked whales. We could also hypothesise that beaked whales might forage on organisms living close to the seabed, explaining why they were more associated to physiographic features around 2000 m depth. By contrast, sperm whales would be less constrained by the presence of the slope or physiographic structures and would prey on organisms that are truly meso-to-bathypelagic; and as a consequence, the dynamic processes in the water column would thus be more important. From the analysis of stomach contents, Spitz et al.22 showed that the diets of beaked and sperm whales were different in the Bay of Biscay. Cuvier’s beaked whales fed largely on the cephalopods Teuthowenia megalops and Galiteuthis armata while the diet of sperm whales was mostly composed of the cephalopods Histioteuthis bonnellii. Teuthowenia megalops and Galiteuthis armata seem to reach greater depths (respectively 1000–2700 m, and 500–2500 m33,) than Histioteuthis bonnellii for which larger individuals have been recorded to occur between 200 and 800 m in the Atlantic33. At-depth dynamic variables should be used in other areas such as the North West Atlantic or the Mediterranean Sea to determine whether the difference in importance of static, surface and deep-water variables between beaked whales and sperm whales would also be observed and would be consistent with whale dietary data available in these areas.
By using deep-water variables with conventional static and surface variables, we highlighted that the highest beaked whale densities were obtained in areas associated with great depths, steep escarpments and high and stable surface temperatures. They appeared to be related to areas of quite high temperatures, to structures present near the 2000 m depth isobath, as shown by Rogan et al.16, and to rugged terrain characterised by higher prey richness34. The distribution of sperm whales seemed to be more related to dynamic processes at depth with a concentration of animals in dynamic zones where eddies were strong, gradients of temperature were variable, surface temperature was high and temperature at depth was low. This preference for deep waters associated with eddies could be linked to Slope Water Oceanic eDDIES (SWODDIES) structures which are eddies present at 200 m depth that strongly modify the mixing and dispersion of organic matter and thus lead to higher prey concentration35,36.
The relationships established in our models generated predicted distributions consistent with field observations and previous studies, in spite of the different variables used. No beaked whales and sperm whales were predicted on the continental shelf where no sightings were recorded, as previously shown by Roberts et al. and Rogan et al.5,16, but this does not exclude the occasional presence of these species in this area. The highest densities of beaked and sperm whales were predicted near slope discontinuities where the seafloor is steep and where prey aggregate37,38 and near canyons in the south-eastern part of the Bay of Biscay. Sperm whale distribution was slightly more homogeneous and they appeared to be less linked to these structures than beaked whales. These predictions in the Bay of Biscay were consistent with Kiszka et al.39, Rogan et al.16 and Virgili et al.31 studies; there are very few studies carried out on deep-diver habitats in the Bay of Biscay. The concentration of animals around the canyons and shelf edge was consistent with other studies in the Mediterranean Sea or in waters off the western North Atlantic continental shelf edge where higher sighting rates have been recorded in canyon areas40,41,42,43. The use of smaller segments in the analyses could help refine the understanding of the processes that influence the concentration of these species within the canyons.
Although the explained deviances were relatively high (24.3% and 31.4%) and comparable to other studies (e.g. between 30.6 and 33.8% for Rogan et al.16 or 34% for Cañadas et al.23), the selected models did not entirely explain the distribution of deep-divers as shown by the quite high RMSEs (0.36 and 0.46). This may be due to the methodological choices we made. We chose to create depth classes (in which the variables were averaged) compatible with the water masses identified in the Bay of Biscay but a finer stratification could have been considered, for example every 200 m36,44. The predictive power of the beaked whale model might have been reduced by grouping several beaked whale species with different links with the environment or different targeted prey. However, the group of beaked whales was probably mainly composed of Cuvier’s beaked whales39,45 so the effect on the model was limited. We chose to fit 'year-round' models because the studied taxa have been reported to show little or no seasonal variation in their habitats (e.g.46,47). Perhaps we could have increased the predictive power of the models and reduced the uncertainties associated with the predictions by considering only sightings recorded in spring and summer, the most sampled seasons, but we chose to keep the maximum number of sightings to fit the models. In addition, a prior selection of environmental variables, based on our knowledge, had to be made to limit the number of variables to be tested and associated computational burden. We chose to test only three dynamic variables that seemed relevant, temperature, gradients of temperature and eddy kinetic energy, each of them being available for the four depth classes. Other variables could have been considered, such as dissolved oxygen concentration48, salinity, mixed layer depth29 or isothermal layer depth28. Mannocci et al.48 have included the depth of the minimum dissolved oxygen concentration because a shallow depth has effects on the physiology of cetacean prey and cause lethargic behaviour which may ease their capture by deep-divers. Brodie et al.28 tested two subsurface variables, isothermal layer depth and bulk buoyancy frequency, to model the distribution of four predatory fish species because these variables quantify the structure and stability of the water column. Similarly, Becker et al.29 used depth environmental data (temperature, salinity, mixed layer depth) provided by an ocean circulation model, the Regional Ocean Modelling System, to model distribution of cetaceans in the California Current Ecosystem. They both showed that variables obtained from ocean circulation models improved the explanatory power of the distribution models.
The final model, built from the four most important variables was an arbitrary construction from the variables that were the most prevalent in the tested models and which had the highest Akaike weights49. We developed a methodology to identify a single model (which only works if predictions are comparable) because it can be difficult, particularly for managers, to interpret the model results to propose conservation measures, especially if the results are an average of several models. Here, the predictions of our final models were very similar or, even equivalent, to the average predictions of all tested models. So, even if our final models were not the only possible ones, the predictions we obtained were quite convincing and the results were easier to interpret than an average of several models. The addition of deep-water variables alone did not entirely explain the distribution of deep-divers but improved the explanatory power of the models and highlighted other processes that could influence their distribution at depth. More direct parameters such as prey distribution data simulated from numerical models could further improve our distribution models50,51. However, prey models need refinements to better characterise the prey of deep-diving cetaceans and thus to improve whale distribution models.
The increasing use of deep-water variables or ocean circulation models in SDMs28,29,52 can greatly improve the tools available for conservation planning and the management of human activities. The more we are able to understand the mechanisms that influence a species habitat, the more we will be able to predict its distribution and identify areas of animal concentration where efforts must be concentrated to limit the impacts of human activities on the species. By using a finer spatial resolution of the environmental variables and by considering the vertical dimension of the variables, we refined the predictions of the deep-diver distributions compared to our previous study31. Our results identified areas of concentration that had not been identified until now in some canyons of the Bay of Biscay (e.g. Capbreton, Cap Ferret, Torrelavega and Llanes canyons). As previously mentioned, many improvements can be considered, such as the use of other variables that characterise the water column, but our results confirmed the utility of deep-water variables in SDMs to model the distribution of top predators linked to meso and bathypelagic areas for a better characterising of their habitats. Increasing the use of these variables should be considered to improve the tools available for the planning of human activities, especially for species that would be closely linked to processes at depth. The availability of modelled variables describing deep water-ocean layers should be incorporated into future studies to improve the characterisation of the habitats of top predators.
Methods
Study area
The study area encompassed the Bay of Biscay and adjacent waters in the North Atlantic Ocean, from 43 to 50° North and 0 to 10° West. The width of the continental shelf increases from South to North (from 30 to 180 km). The oceanic circulation is characterised by a weak anti-cyclonic circulation in the central zone and becomes cyclonic near the continental shelf. The water column of the Bay of Biscay is divided into four major water masses: (1) between 100 and 600 m, the water column has the characteristics of the central waters of the North Atlantic Ocean; (2) between 600 and 1500 m, Mediterranean waters flow from Gibraltar; (3) between 1500 and 3000 m, there are the deep waters of the Northeast Atlantic and (4) beyond 3000 m the deep Antarctic waters flow northward36. Although sperm whales and beaked whales are able to dive beyond 2000 m, many authors [e.g.53,54,55,56] showed that most of the dives made by deep-divers do not exceed 1500–2000 m thus we considered only waters from 0 to 2000 m in this work .
Data collection and collation
In this study, we used a part of the dataset assembled in Virgili et al. (31; Fig. 4), we only considered beaked and sperm whale sightings and effort data recorded in the Bay of Biscay, eastern North Atlantic. Visual shipboard and aerial surveys performed by seven independent organisations between 1998 and 2016 were assembled (details of the surveys are listed in Appendix A). A single common dataset was created, aggregating all survey datasets standardised for units and formats. Effort data were linearized and divided into 5 km segments using ArcGIS 10.357 and the Marine Geospatial Ecology Tools software58.
Study area showing assembled survey effort (a), along with beaked whale (b) and sperm whale (c) sightings recorded during all surveys. Surveys were carried out along transects following a line-transect methodology (survey details are provided in Appendix A). Sightings were classified by group sizes with each point representing one group of individuals and point size representing the number of animals in a group. In the analyses, we used the number of individuals to estimate densities of individuals.
Cetacean sightings were recorded following line-transect methodologies that allow Effective Strip Width (ESW) to be estimated from the measurement of the perpendicular distances to the sightings59, except for the JNCC-ESAS surveys that used a 300‐m strip‐transect methodology. For each sighting, the number of individuals, the distance from the transect and the conditions of observation were recorded, allowing to build a model which estimated the ESW for beaked whales and sperm whales, depending on observation conditions and survey types (following31).
To account for the difficulty to identify them at species level, beaked whale species were pooled into one group including Cuvier’s beaked whales (Ziphius cavirostris), mesoplodonts (Mesoplodon spp.) and northern bottlenose whales (Hyperoodon ampullatus). Although species were grouped, Kiszka et al.39 and Robbins et al.45 have shown that the Cuvier’s beaked whale is the most abundant ziphiid in the study area (encounter rates 12 to 16 times higher for Cuvier’s beaked whale than for northern bottlenose whale and Sowerby’s beaked whale (Mesoplodon bidens)45).
A total of 113 sightings representing 236 individuals of beaked whales and 52 sightings totalling 106 individuals of sperm whales were assembled for the present study (Fig. 4). Aggregated data represented about 150,400 km of on-effort transects (i.e. following a transect at specified speed and altitude or height above sea level, with a specified level of visual effort) of which 53% was carried out by boat and the rest by plane (Fig. 4, Table 2).
Moran’s and Geary’s indexes were calculated to ensure there was no spatial autocorrelation in the data using the ‘spdep’ R-package60.
Data processing
Delimitation of depth classes
To determine whether the presence of deep-divers would be related to surface oceanographic processes or to processes taking place in the water column, four depth classes were delimited according to the water masses reported in the Bay of Biscay (see 4.1) in association with the dive profiles of deep-divers (Fig. 5). Environmental variables were extracted for each of the selected depth classes.
The first class was the “surface zone”; the deep-diver distribution may be related to mechanisms at the surface or these mechanisms may influence processes at depth. The second class was the epipelagic zone (named “0–200 m”), extending vertically from 0 to 200 m. We delineated this class between 0 and 200 m because most of beaked whale recovery dives occur between 0 and 200 m61 and the upper 200 m of the open ocean correspond to the euphotic zone where primary production is concentrated. Between 200 and 2000 m, we considered two water masses, the central waters of the North Atlantic Ocean between 200 and 600 m (named “200–600 m”) and the Mediterranean waters between 600 and 2000 m (named “600–2000 m”). Although water masses between 600 and 1500 m and between 1500 and 3000 m are different (see 4.1), we considered no changes in the water masses from 1500 to 2000 m and grouped them into one class.
Static and oceanographic variables
To model species distributions, it was necessary to extract environmental variables. We considered static and dynamic variables that can affect the distribution of beaked whales and sperm whales (Table 3). Static variables remained stable over time and were independent from depth classes while dynamic variables were extracted in each depth class and varied over time.
For static variables, we extracted the depth at a 15 arc second resolution (≈ 500 m; https://www.gebco.net/) and then computed the slope (inclination of the seafloor) and roughness (difference between the maximum and minimum depth of the pixels surrounding the central pixel) with the function ‘terrain’ from the R package ‘raster’63. We also extracted the surface area of canyons listed in the study area because deep‐divers are often associated with canyon structures (62; www.bluehabitats.org). All static variables were resampled at a 0.083° resolution to match the resolution of the dynamic variables.
For dynamic variables, we extracted monthly water temperatures and current vectors (U and V) for each depth class (surface, 0–200 m, 200–600 m and 600–2000 m) directly from the numerical model “Iberian Biscay Irish Ocean Reanalysis” of Copernicus (itself based on the NEMO v3.6 ocean general circulation model; https://doi.org/10.48670/moi-00029). From these variables, we computed spatial gradients of temperatures and EKE (0.5*(U2 + V2)). Gradients of temperatures were calculated as the difference between the minimum and maximum temperature values found in the eight pixels surrounding any given pixel of the grid (function ‘detectFronts’ from the R package ‘grec’64). Climatological variables were computed by calculating the means and standard deviations over the study period for each variable of each depth class (Appendix B).
Habitat-based density modelling
To model the distribution of beaked whales and sperm whales, we fitted GAMs30 with a Tweedie distribution to account for over-dispersion in the cetacean count data65 with the ‘mgcv’ R-package66. GAMs extend generalised linear models to allow for smooth, nonlinear functions of predictor variables to be determined by observed data rather than by strict parametric relationships30. The mean number of individuals per segment was linked to the additive predictors with a log-function with four degrees of freedom. An offset equal to segment length multiplied by twice the ESW, or twice the 300 m-strip for JNCC-ESAS surveys, was included (67; refer to Virgili et al.31 for the ESW estimation); ESWs were calculated for each combination of platform, class of Beaufort sea-state and class of observation height. We removed combinations of variables with Pearson correlation coefficients higher than |0.5| and tested all models with combinations of one to four variables to avoid excessive complexity68.
To assess the correlation between the variables we created a correlation matrix using the R package ‘corrplot’ (69; Fig. 6). Many variables with depths greater than 200 m were correlated with each other but also with surface variables (depth < 200 m) and with static variables such as slope, roughness and surface of canyons; showing the importance of considering correlations in the model selection.
Correlation matrix of environmental variables. This matrix was calculated using the Pearson coefficient; the larger and darker the circle, the higher the correlation between the two environmental variables; variables are considered correlated for values below − 0.5 and above 0.5. CanArea: surface of canyons; T: temperature, GrT: gradients of temperature; EKE: eddy kinetic energy; m: mean; sd: standard deviation; surf: surface; 0–200: 0–200 m; 200–600: 200–600 m; 600–2000: 600-2000 m.
The Akaike information criterion (AIC, the lower the better32) and Akaike weight (w; ‘akaike.weights’ function from ‘qpcR’ R package70;) were used for model selection. The Akaike weight of a model is the probability that this model is the best among all tested models32,71. If the Akaike weight of a model is high (w > 0.9), it can certainly be identified as the best model and inference can be made from this model alone32,49. In contrast, if w < 0.9, a model averaging is recommended, which consists in producing parameter estimates from the weighted average of several models and not from a single model49. If w < 0.9, some variables are probably very similar and correlated. In this case, it is not possible to choose a best model among all tested models since the models are equivalent and they must all be integrated to produce an average prediction of species distribution. This can be cumbersome in terms of calculations and difficult to interpret, so if possible, it is more suitable to obtain a single model.
To be able to identify this single suitable model, we aimed to approximate the average prediction obtained from the models by the prediction obtained from the model that combined the four most important variables. Following Symonds & Moussalli49, we determined the importance of each variable by summing the Akaike weights of the models in which the variable was selected and ranked all variables. A model using the first four variables was then fitted while ensuring a non-correlation of the variables (if variables were correlated, the next uncorrelated ranked variable was chosen). A prediction of relative densities (in number of animals per pixel) was produced from this model at a 0.083° resolution and compared to the average prediction obtained from the models that explained 80% of the total Akaike weight. We considered only the models that explained 80% of the Akaike weight because beyond this threshold, the models were negligible (very low explained deviances and very high AICs). If the coefficient of determination (R2) of the regression line established between the values of the average prediction and the values of the prediction obtained from the four most important variables was close to 1, predictions were similar and the average prediction of all models could be approximated by the prediction of the model fitted to the four most important variables. The four variables could therefore be used to obtain functional relationships and to predict the relative densities of beaked whales and sperm whales in the Bay of Biscay. If not, all models had to be considered to predict the species densities.
There were not enough data to fit a model by month or by season (the number of individuals in winter was too low) so we fitted models to all data of beaked whales and sperm whales and obtained climatological predictions maps for all seasons combined in the 1998–2016 period, although most of sighting data were collected in summer and the prediction maps most likely reflected the summer species distribution. The uncertainties associated with the predictions were also estimated as the standard deviation associated with the predicted relative densities; high values indicated high errors associated with density estimates. However, it should be noted that the uncertainty associated with the model prediction with the four most important variables was certainly lower than the uncertainty associated with the mean prediction of all models and it was therefore underestimated.
Finally, the model fit of each model was assessed thanks to the percentage of explained deviance30,72, the Root Mean Squared Error (RMSE) which measured the prediction errors and the model accuracy (the lower, the better73; ‘qpcR’ R-package70) and a visual inspection of predicted and observed distributions74.
Data availability
All sighting and effort data used in this study are available in the GitHub: https://github.com/avirgi01/DeepVariables.git. All data providers can be contacted via the email addresses provided in the data files.
References
Elith, J. & Leathwick, J. R. Species distribution models: Ecological explanation and prediction across space and time. Annu. Rev. Ecol. Evol. Syst. 40, 677–697 (2009).
Esselman, P. C. & Allan, J. D. Application of species distribution models and conservation planning software to the design of a reserve network for the riverine fishes of northeastern Mesoamerica. Freshw. Biol. 56, 71–88 (2011).
Valle, M. et al. Comparing the performance of species distribution models of Zostera marina: Implications for conservation. J. Sea Res. 83, 56–64 (2013).
Robinson, N. M., Nelson, W. A., Costello, M. J., Sutherland, J. E. & Lundquist, C. J. A systematic review of marine-based species distribution models (SDMs) with recommendations for best practice. Front. Mar. Sci. 4:421, (2017).
Roberts, J. J. et al. Habitat-based cetacean density models for the US Atlantic and Gulf of Mexico. Sci. Rep. 6, 22615 (2016).
Lambert, C. et al. How does ocean seasonality drive habitat preferences of highly mobile top predators? Part I: The north-western Mediterranean Sea. Deep Sea. Res. Part II Top. Stud. Oceanogr. 141, 115–132 (2017).
Lan, K.-W., Shimada, T., Lee, M.-A., Su, N.-J. & Chang, Y. Using remote-sensing environmental and fishery data to map potential Yellowfin Tuna habitats in the tropical pacific Ocean. Remote Sens. 9, 444 (2017).
Austin, R. A. et al. Predicting habitat suitability for basking sharks (Cetorhinus maximus) in UK waters using ensemble ecological niche modelling. J. Sea Res. 153, 101767 (2019).
Hobday, A. J. et al. Impacts of climate change on marine top predators: Advances and future challenges. Deep Sea Res. Part II Top. Stud. Oceanogr. 113, 1–8 (2015).
Avila, I. C., Kaschner, K. & Dormann, C. F. Current global risks to marine mammals: Taking stock of the threats. Biol. Conserv. 221, 44–58 (2018).
Panti, C. et al. Marine litter: One of the major threats for marine mammals. Outcomes from the European Cetacean Society workshop. Environ. Pollut. 247, 72–79 (2019).
Grémillet, D. et al. Spatial match-mismatch in the Benguela upwelling zone: should we expect chlorophyll and sea-surface temperature to predict marine predator distributions?. J. Appl. Ecol. 45, 610–621 (2008).
Österblom, H., Olsson, O., Blenckner, T. & Furness, R. W. Junk-food in marine ecosystems. Oikos 117, 967–977 (2008).
Hazen, E. L., Nowacek, D. P., Laurent, L. S., Halpin, P. N. & Moretti, D. J. The relationship among oceanography, prey fields, and beaked whale foraging habitat in the tongue of the ocean. PLoS ONE 6, e19269 (2011).
Louzao, M. et al. Marine megafauna niche coexistence and hotspot areas in a temperate ecosystem. Cont. Shelf Res. 186, 77–87 (2019).
Rogan, E. et al. Distribution, abundance and habitat use of deep diving cetaceans in the North-East Atlantic. Deep Sea Res. Part II Top. Stud. Oceanogr. 141, 8–19 (2017).
Bangley, C. W., Curtis, T. H., Secor, D. H., Latour, R. J. & Ogburn, M. B. Identifying important juvenile dusky shark habitat in the northwest atlantic ocean using acoustic telemetry and spatial modeling. Mar. Coast. Fish. 12, 348–363 (2020).
Yen, P. P. W., Sydeman, W. J. & Hyrenbach, K. D. Marine bird and cetacean associations with bathymetric habitats and shallow-water topographies: implications for trophic transfer and conservation. J. Mar. Syst. 50, 79–99 (2004).
Redfern, J. V et al. Techniques for cetacean–habitat modeling. Mar. Ecol. Prog. Ser. 310, 271-295 (2006).
Robison, B. H. Deep pelagic biology. J. Exp. Mar. Bio. Ecol. 300, 253–272 (2004).
Reijnders, P. J. H., Aguilar, A. & Borrell, A. Pollution and marine mammals. In Encyclopedia of Marine Mammal (eds Perrin, W. F. et al.) 890–898 (Academic Press, New York, 2009).
Spitz, J. et al. Prey preferences among the community of deep-diving odontocetes from the Bay of Biscay, Northeast Atlantic. Deep Sea Res. Part I Oceanogr. Res. Pap. 58, 273–282 (2011).
Cañadas, A. et al. The challenge of habitat modelling for threatened low density species using heterogeneous data: The case of Cuvier’s beaked whales in the Mediterranean. Ecol. Indic. 85, 128–136 (2018).
Pirotta, E., Brotons, J. M., Cerdà, M., Bakkers, S. & Rendell, L. E. Multi-scale analysis reveals changing distribution patterns and the influence of social structure on the habitat use of an endangered marine predator, the sperm whale Physeter macrocephalus in the Western Mediterranean Sea. Deep Sea Res. Part I Oceanogr. Res. Pap. 155, 103169 (2020).
Watwood, S. L., Miller, P. J. O., Johnson, M., Madsen, P. T. & Tyack, P. L. Deep-diving foraging behaviour of sperm whales (Physeter macrocephalus). J. Anim. Ecol. 75, 814–825 (2006).
Warren, V. E. et al. Spatio-temporal variation in click production rates of beaked whales: Implications for passive acoustic density estimation. J. Acoust. Soc. Am. 141, 1962–1974 (2017).
Shearer, J. M. et al. Diving behaviour of Cuvier’s beaked whales (Ziphius cavirostris) off Cape Hatteras, North Carolina. R. Soc. Open Sci. 6: 181728 (2019).
Brodie, S. et al. Integrating dynamic subsurface habitat metrics into species distribution models. Front. Mar. Sci. 5:219 (2018).
Becker, E. et al. Moving towards dynamic ocean management: How well do modeled ocean products predict species distributions?. Remote Sens. 8, 149 (2016).
Wood, S. N. On confidence intervals for generalized additive models based on penalized regression splines. Aust. N. Z. J. Stat. 48, 445–464 (2006).
Virgili, A. et al. Combining multiple visual surveys to model the habitat of deep-diving cetaceans at the basin scale. Glob. Ecol. Biogeogr. 28, 300–314 (2019).
Burnham, K. P. & Anderson, D. R. Model Selection and Multimodel Inference (Springer, New York, 2004).
Voss, N. A., Vecchione, M., Toll, R. B. & Sweeney, M. J. Systematics and Biogeography of Cephalopods (Smithsonian Institution Press, New York, 1998).
Kostylev, V. E., Erlandsson, J., Ming, M. Y. & Williams, G. A. The relative importance of habitat complexity and surface area in assessing biodiversity: Fractal application on rocky shores. Ecol. Complex. 2, 272–286 (2005).
Pingree, R. D. & Cann, B. L. Three anticyclonic slope water oceanic eDDIES (SWODDIES) in the Southern Bay of Biscay in 1990. Deep Sea Res. Part A Oceanogr. Res. Pap. 39, 1147–1175 (1991).
Koutsikopoulos, C. & Cann, B. L. Physical processes and hydrological structures related to the Bay of Biscay anchovy. Sci. Mar. 60, 9–19 (1996).
Bost, C. A. et al. The importance of oceanographic fronts to marine birds and mammals of the southern oceans. J. Mar. Syst. 78, 363–376 (2009).
Woodson, C. B. & Litvin, S. Y. Ocean fronts drive marine fishery production and biogeochemical cycling. Proc. Natl. Acad. Sci. 112, 1710–1715 (2015).
Kiszka, J., Macleod, K., Van Canneyt, O., Walker, D. & Ridoux, V. Distribution, encounter rates, and habitat characteristics of toothed cetaceans in the Bay of Biscay and adjacent waters from platform-of-opportunity Data. ICES J. Mar. Sci. 64, 1033–1043 (2007).
Waring, G. T., Hamazaki, T., Sheehan, D., Wood, G. & Baker, S. Characterization of beaked whale (Ziphiidae) and sperm whale (Physeter macrocephalus) summer habitat in shelf-edge and deeper waters off the Northeast U.S.. Mar. Mammal Sci. 17, 703–717 (2001).
Moulins, A., Rosso, M., Nani, B. & Würtz, M. Aspects of the distribution of Cuvier’s beaked whale (Ziphius cavirostris) in relation to topographic features in the Pelagos Sanctuary (north-western Mediterranean Sea). J. Mar. Biol. Assoc. U. K. 87, 177–186 (2007).
Mussi, B., Miragliuolo, A., Zucchini, A. & Pace, D. S. Occurrence and spatio-temporal distribution of sperm whale (Physeter macrocephalus) in the submarine canyon of Cuma (Tyrrhenian Sea, Italy). Aquat. Conserv. Mar. Freshw. Ecosyst. 24, 59–70 (2014).
Moors-Murphy, H. B. Submarine canyons as important habitat for cetaceans, with special reference to the Gully: A review. Deep Res. Part II Top. Stud. Oceanogr. 104, 6–19 (2014).
Millot, C. & Taupier-Letage, I. Circulation in the Mediterranean Sea. In: Saliot, A. (eds) The Mediterranean Sea. Handbook of Environmental Chemistry, vol 5K. Springer, Berlin, Heidelberg. (2005). https://doi.org/10.1007/b107143
Robbins, J. R., Bell, E., Potts, J., Babey, L. & Marley, S. A. Likely year-round presence of beaked whales in the Bay of Biscay. Hydrobiologia https://doi.org/10.1007/s10750-022-04822-y (2022).
McSweeney, D. J., Baird, R. W. & Mahaffy, S. D. Site fidelity, associations, and movements of Cuvier’s (Ziphius cavirostris) and Blainville’s (Mesoplodon densirostris) beaked whales off the island of Hawai’i. Mar. Mammal Sci. 23, 666–687 (2007).
Wimmer, T. & Whitehead, H. Movements and distribution of northern bottlenose whales, Hyperoodon ampullatus, on the Scotian Slope and in adjacent waters. Can. J. Zool. 82, 1782–1794 (2004).
Mannocci, L., Monestiez, P., Spitz, J. & Ridoux, V. Extrapolating cetacean densities beyond surveyed regions: Habitat-based predictions in the circumtropical belt. J. Biogeogr. 42, 1267–1280 (2015).
Symonds, M. R. E. & Moussalli, A. A brief guide to model selection, multimodel inference and model averaging in behavioural ecology using Akaike’s information criterion. Behav. Ecol. Sociobiol. 65, 13–21 (2011).
Lambert, C., Mannocci, L., Lehodey, P. & Ridoux, V. Predicting cetacean habitats from their energetic needs and the distribution of their prey in two contrasted tropical regions. PLoS ONE 9, e105958 (2014).
Virgili, A. et al. Towards a better characterisation of deep-diving whales’ distributions by using prey distribution model outputs?. PLoS ONE 16, e0255667 (2021).
Scales, K. L. et al. Fit to predict? Eco-informatics for predicting the catchability of a pelagic fish in near real time. Ecol. Appl. 27, 2313–2329 (2017).
Amano, M. & Yoshioka, M. Sperm whale diving behavior monitored using a suction-cup-attached TDR tag. Mar. Ecol. Prog. Ser. 258, 291–295 (2003).
Irvine, L., Palacios, D. M., Urbán, J. & Mate, B. Sperm whale dive behavior characteristics derived from intermediate-duration archival tag data. Ecol. Evol. 7, 7822–7837 (2017).
Shearer, J. M. et al. Diving behaviour of Cuvier’s beaked whales (Ziphius cavirostris) off Cape Hatteras, North Carolina. R. Soc. Open Sci. 6, 181728 (2019).
Towers, J. R. et al. Movements and dive behaviour of a toothfish-depredating killer and sperm whale. ICES J. Mar. Sci. 76, 298–311 (2019).
ESRI. ArcGIS Desktop: Release 10.3. Redlands, CA: Environmental Systems Research Institute (2016).
Roberts, J. J., Best, B. D., Dunn, D. C., Treml, E. A. & Halpin, P. N. Marine geospatial ecology tools: An integrated framework for ecological geoprocessing with ArcGIS, Python, R, MATLAB, and C++. Environ. Model. Softw. 25, 1197–1207 (2010).
Buckland, S. T., Rexstad, E. A., Marques, T. A. & Oedekoven, C. S. Distance Sampling: Methods and Applications (Springer International Publishing, New York, 2015).
Bivand, R. S. & Wong, D. W. S. Comparing implementations of global and local indicators of spatial association. TEST 27, 716–748 (2018).
Baird, R. W. et al. Diving behaviour of Cuvier’s (Ziphius cavirostris) and Blainville’s (Mesoplodon densirostris) beaked whales in Hawai‘i. Can. J. Zool. 84, 1120–1128 (2006).
Harris, P. T., Macmillan-Lawler, M., Rupp, J. & Baker, E. K. Geomorphology of the oceans. Mar. Geol. 352, 4–24 (2014).
Hijmans, R. J. raster: Geographic data analysis and modeling. R package version 3. 4–5. https://CRAN.R-project.org/package=raster (2020).
Lau-Medrano, W. grec: Gradient-based recognition of spatial patterns in environmental data. R package version 1.4.1. (2020).
Foster, S. D. & Bravington, M. V. A Poisson-Gamma model for analysis of ecological non-negative continuous data. Environ. Ecol. Stat. 20, 533–552 (2013).
Wood, S. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J. R. Stat. Soc (B), 73(1), 3-36 (2011).
Hedley, S. L. & Buckland, S. T. Spatial models for line transect sampling. J. Agric. Biol. Environ. Stat. 9, 181–199 (2004).
Mannocci, L. et al. Predicting cetacean and seabird habitats across a productivity gradient in the South Pacific gyre. Prog. Oceanogr. 120, 383–398 (2014).
Wei, T. & Simko, V. R package ‘corrplot’: Visualization of a correlation matrix (Version 0.84). https://github.com/taiyun/corrplot (2017).
Spiess, A. qpcR: Modelling and analysis of real‐time PCR data. R package version 1.4‐1. https://CRAN.R-project.org/package=qpcR (2018).
Fabozzi, F. J., Focardi, S. M., Rachev, S. T. & Arshanapalli, B. G. The basics of financial econometrics: Tools, concepts, and asset management applications. John Wiley & Sons (2014).
Becker, E. A. et al. Habitat-based density models for three cetacean species off southern california illustrate pronounced seasonal differences. Front. Mar. Sci. 4:121 (2017).
Neill, S. P. & Hashemi, M. R. Fundamentals of ocean renewable energy: Generating electricity from the sea. Academic Press (2018).
Becker, E. A. et al. Performance evaluation of cetacean species distribution models developed using generalized additive models and boosted regression trees. Ecol. Evol. 10, 5759–5784 (2020).
Acknowledgements
We are grateful to the many observers who participated in the surveys and collected all the data but also the ships’ captains, crews and pilots. We thank Phil Hammond and his team for providing SCANS and CODA survey data. THUNNUS survey was carried out thanks to the collaboration of the General Directorate of Fisheries and Maritime Affairs, Government of Galicia. We thank the Direction Générale de l'Armement (DGA), including Odile Gérard, for funding AV’s research grant. ML was funded by a Ramón y Cajal (RYC-2012-09897) postdoctoral contract of the Spanish Ministry of Economy, Industry and Competitiveness. This study is a contribution to the CHALLENGES (CTM2013-47032-R) project of the Spanish Ministry of Economy, Industry and Competitiveness. We would particularly like to thank Matthieu Authier for his advice in carrying out the analyses. We thank the two reviewers for their careful reading of our manuscript and for their helpful comments.
Author information
Authors and Affiliations
Contributions
A.V.: Conceptualization, Methodology, Formal analysis & Writing—Original Draft. V.T.: Methodology, Formal analysis, Review. G.D., T.E.D., S.L., M.L., M.L., J.M.C., E.P., L.R., C.S., M.B.S., O.V.C., J.A.V.B.: Investigation & Review. V.R.: Conceptualization, Review & Supervision.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Virgili, A., Teillard, V., Dorémus, G. et al. Deep ocean drivers better explain habitat preferences of sperm whales Physeter macrocephalus than beaked whales in the Bay of Biscay. Sci Rep 12, 9620 (2022). https://doi.org/10.1038/s41598-022-13546-x
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-022-13546-x
This article is cited by
-
A Comparative Genomics Approach to Understanding the Evolution of Olfaction in Cetaceans
Journal of Molecular Evolution (2024)








