Introduction

Indian coastal waters form an important habitat for a variety of marine mammal species, including the Indian Ocean humpback dolphin (Sousa plumbea) and the Indo-Pacific finless porpoise (Neophocaena phocaenoides). However, these coastlines are also marked by densely populated human settlements that depend on the ocean as a primary source of resources1,2. Consequently, cetaceans in this region are increasingly at risk of local population declines due to direct and indirect human-induced changes to their environment3. These threats are further heightened for non-migratory species with restricted habitat preferences and small fragmented populations4. While conservation measures are best derived from empirical assessment of animal populations, information on abundance, distribution, and population trends are currently lacking for most cetacean species from Indian waters. These data gaps are particularly concerning for the Indian Ocean humpback dolphin and the Indo-Pacific finless porpoise, two species commonly occurring off the West Coast of India. Both species face imminent threats due to proximity to anthropogenic activities. In addition to increasing vessel traffic, coastal infrastructure development, and chemical and noise pollution, a major cause of concern is interactions with fisheries, often leading to incidental mortalities5,6. Based on a range-wide decrease in population trends, Indian Ocean humpback dolphins are already recognized as ‘Endangered’ by the IUCN Red List7. An assessment of Indian Ocean humpback dolphin populations indicates that current range-wide mortality events could result in a 50% decline in numbers within \(\approx\) 75 years spanning both past and future generations7. Data on overall population trends for the Indo-Pacific finless porpoise are lacking throughout their distribution range. However, given the high degree of anthropogenic pressure in their habitat8,9, it is suggested the species faces considerable population decline, thus placing them under the ‘Vulnerable’ category of the IUCN Red List5. Stranding and mortality events for both species are frequently reported along the Indian coast6,10. Finless porpoises are one of the most recorded species in mortality events, particularly across the west coast of India10. Within the past decade (2013–2023), the west coast of India alone has reported 159 mortalities of humpback dolphins and 139 for finless porpoises (Marine Mammal Research and Conservation Network of India [MMRCNI], http://www.marinemammals.in, accessed on 24 January 2024). These stranding data only represent documented events, and many mortalities potentially go unreported. Despite the availability of limited stranding and mortality records, the long-term consequences of these numbers cannot be quantified due to a lack of systematic population assessment of both species.

Abundance estimation efforts for humpback dolphins using photographic mark-recapture were initiated along the coast of Goa and Gujarat in 200211 and in Maharashtra between 2014 and 20166. However, population estimates from these surveys are unavailable due to low resighting rates and the absence of continuous surveys11. Systematic monitoring for finless porpoises has only been conducted along the Maharashtra coast (2014–2016)9. These surveys were primarily designed to study humpback dolphin populations and did not cover the habitat range of finless porpoises. As a result, robust abundance data are unavailable to derive a population assessment. Recommendations of the IWC subcommittee for small cetaceans12 and the IUCN species assessment reports7 all point towards an urgent need to gather population-level data for both species across Indian waters to address rising conservation concerns.

In Indian waters, most population estimation studies have been restricted to conventional methods such as visual line transects or photographic mark-recapture surveys11,13. However, these methods prove challenging for coastal cetaceans inhabiting shallow turbid waters, particularly for finless porpoises, due to their elusive behavior, small group sizes, and irregular surfacing patterns14. Besides habitat and species-based limitations, visual surveys also require adequately trained observers15. Cetacean research in India is still in the early stages, and ensuring continuous availability of trained marine mammal observers for long-term monitoring efforts is a challenge. Shortcomings in visual surveys can be addressed by integrating passive acoustic monitoring (PAM) tools into the survey protocol, offering solutions for species detection independent of environmental conditions and human observer bias16. Line transects surveys using hydrophone arrays towed behind vessels have successfully been implemented for beaked whales17, baleen whales18, sperm whales19,20 and smaller odontocete species21,22. Most studies that utilized a combined visual and acoustic survey protocol report a higher number of acoustic encounters than visual-only methods23. With the advancement in autonomous recording technology and improved hardware, towed array designs are now compact and relatively low-cost24. Towed arrays with autonomous recording capabilities eliminate the need for expensive data acquisition systems onboard research vessels25. These compact arrays can be easily deployed off smaller boats, making them more accessible to researchers working in areas where monitoring efforts face multiple logistical and financial constraints.

However, effectively identifying animal presence using an acoustic platform depends on the vocal behavior of the target species and the detectability of the acoustic signal in its environment. For species and habitats where the use of acoustic line transects is new, there is first a need to assess the efficacy of the platform using simultaneous visual and acoustic surveys to estimate detections missed by either platform26.

In our study area off the Sindhudurg coast, resident populations of humpback dolphins and finless porpoises co-occur, though both species are starkly different in morphology and behavior, creating variation in the two species’ visibility. Humpback dolphins are bigger, more surface active, and form big groups, whereas finless porpoises are extremely elusive and have irregular surfacing patterns with smaller group sizes27. Finless porpoises also lack a dorsal fin, which is usually a prominent visual cue when conducting cetacean surveys. The two species also exhibit distinct acoustic behavior27. Finless porpoises only produce narrow-band high-frequency echolocation clicks \(\ge\) 90 kHz28. These clicks are highly directional and attenuate significantly at distances greater than 500 m in shallow turbid environments29. Humpback dolphins, on the other hand, produce broadband echolocation clicks in frequencies ranging between 15 kHz to \(\ge\) 160 kHz, along with whistles and burst pulses30. The broadband nature of these sounds allows for humpback dolphins to be detected across greater distances31. Although surveys to monitor both species together are preferred due to logistical and financial limitations, these species-level variations in detectability can hamper field data collection. Increased effort to visually detect cryptic species may compromise the accuracy of data collection for other species. Our study aimed to evaluate the performance of combined acoustic and visual line transect surveys to assess whether an integrated approach may improve overall data collection for both target species. We compared differences in the species encounter rates across the two detection platforms as a metric to assess their utility in further monitoring studies. As species detectability is also closely associated with environmental conditions8, we used environmental variables corresponding to the species encounters to measure their influence on the detections across the two platforms. We use our results to help streamline methods for broadscale population-level data collections for these two coastal species that share a habitat but vary significantly in behavior and ecology. Additionally, our findings can inform best practices for allocating human effort and logistics toward vessel-based monitoring for the focal species across data-poor regions with limited resources.

Results

Acoustic and visual encounter rates were collected during 38 transects conducted between 2020 and 2023, which covered a total distance of 1233.2 km (Table 1) and recorded 114.1 h of acoustic data. Throughout the surveys, the acoustic and visual platforms operated simultaneously, ensuring the total duration of the sighting effort was always consistent with acoustic data collection. We logged a total of 216 (both acoustic and visual) encounters for the two species.

Table 1 Survey effort per month with the total number of encounters for each species.

Indian Ocean humpback dolphin

A total of 63 on-effort humpback dolphin encounters were recorded throughout the study (Fig. 1). Of these, the mixed encounters accounted for 63.4% (n = 40), visual-only encounters corresponded to 14.2% (n = 9), and acoustic-only encounters accounted for 22.2% (n = 14). The overall encounter rate for humpback dolphins was 0.05 groups/km. The visual encounter rate for humpback dolphins was 0.03 groups/km, and the acoustic encounter rate was 0.04 groups/km. The difference between acoustic and visual encounter rates was not significant (Wilcoxon rank-sum test: W = 639.5, p-value = 0.3). Visual estimates of group sizes for humpback dolphins were between 1 and 25, with a mean group size of 5.4 (standard deviation, SD: ±5.2), and the acoustic estimates were between 1 and 16, with a mean group size of 3.2 (SD: ±2.7) (Fig. 2). While 65% (n = 26) of the visual group size estimates were higher than acoustic, 22.5% (n = 9) were the same, and 12.5% (n = 5) (Wilcoxon rank-sum test: W = 577.5, p-value < 0.01).

Fig. 1
figure 1

Indian Ocean humpback dolphin encounters by detection type. Depth contours are indicated by the blue lines. The transect route is marked in red.

Fig. 2
figure 2

Variation in Indian Ocean humpback dolphin group size estimates using acoustics and visual data. Boxes show the lower and upper quartile values. The solid line inside the box indicates the median value. Dots represent potential outliers.

Distance from shore (km), vessel traffic, and the tensor product of swell and sea state were the variables explaining the most variation in humpback dolphin encounters across detection types (Table 2). These variables had a significant effect on either visual or mixed detection types, but none of the variables showed any significance for acoustic detections. Distance from shore had a significant effect on mixed detections (p-value < 0.01), which were recorded within a range of 0.4 to 4.9 km from the shore. The highest rate of mixed detections occurred between 1.5 to 2 km from the shore. Acoustic detections were recorded between distances of 0.5 to 3.0 km from the shore, and visual detections were concentrated between 0.5 to 3.5 km. We observed an increase in mixed detections as distance from shore increased, whereas both acoustic and visual detections were seen to consistently decrease, especially beyond a distance of 1.5 km from the coastline (Fig. 3). Vessel traffic showed a borderline effect (p-value = 0.05) only on mixed detections, which were seen to gradually increase with an increase in the number of boats present during the encounters. We did not observe any significant effect of vessel traffic on acoustic and visual detections (Fig. 4). The tensor product of swell and sea state produced a non-significant result for the acoustic and mixed detection types, although the result for visual detections was borderline (p-value = 0.07). However, any suggested relationship could not be concluded from the tensor plots (Supplement).

Fig. 3
figure 3

Smooth for distance from shore (km) predicted by the GAMM for encounter rates of Indian Ocean humpback dolphins across detection type. Shaded portions indicate the 95% confidence intervals, color-coded according to the detection types. Vertical ticks on the X-axis indicate the distribution of encounters.

Fig. 4
figure 4

Smooth for the effect of vessel traffic predicted by the GAMM for encounter rates of Indian Ocean humpback dolphins across detection type. Shaded portions indicate the 95% confidence intervals, color-coded according to the detection types. Vertical ticks on the X-axis indicate the distribution of encounters.

Indo-Pacific finless porpoise

We recorded 153 encounters of finless porpoises (Fig. 5). Out of the total encounters, the acoustic-only encounters accounted for 74.5% (n = 114), and mixed encounters made up for 22.8% (n = 35). Of the total encounters, 2.6% (n = 4) were visual-only. The overall encounter rate for finless porpoises was 0.12 groups/km. The visual encounter rate was 0.03 groups/km, and the acoustic encounter rate was 0.12 groups/km. A comparison of the acoustic and visual encounter rates showed a significant difference between encounter rates estimated from the two detection types (Wilcoxon rank-sum test: W = 230.5, p-value < 0.0001).

Fig. 5
figure 5

Indo-Pacific finless porpoise encounters by detection type. Depth contours are indicated by the blue lines. The transect route is marked in red.

Visual group estimates of finless porpoises were between 1 to 5 individuals with a mean group size of 2.0 (SD: ±1.1), whereas the acoustic group sizes ranged from 1 to 9 individuals with a mean group size of 3.3 (SD: ±1.9) (Fig. 6). 62% (n = 22) of the acoustic group estimates were higher than those estimated visually, and 17% (n = 6) were the same, whereas 20% (n = 7) were lower than visual estimates. A significant difference was observed between the acoustic and visual group size estimates (Wilcoxon rank-sum test: W = 869, p-value < 0.01).

Fig. 6
figure 6

Variation in Indo-Pacific finless porpoise group size estimates using acoustics and visual. Boxes show the lower and upper quartile values. The solid line inside the box indicates the median value. Dots represent potential outliers.

Table 2 Humpback dolphin GAMM results. The table includes smooths for the predictor variables, distance shore is the measured distance from shore (km), boats indicate the vessel traffic within 1 km of the survey track, and the tensor product for swell and sea state measured in Beaufort. Significant predictor variables are highlighted in bold text.

Distance from shore (km), water depth (m), and the tensor product of swell and sea state were the most important variables for explaining finless porpoise encounters by detection type (Table 3). Distance from shore had a significant effect only on the acoustic detections (p-value < 0.05). Most acoustic detections were recorded between a distance of 0.3 to 5.3 km from shore, with the highest concentration of acoustic detections between 1 to 2 km. Mixed/visual detections were recorded at distances of 0.4 to 3.9 km from the shore, with most detections concentrated between 1.4 to 1.9 km. While we observed a slight increase in the mixed/visual detections up to 2 km from the coastline followed by a gradual decrease towards the 5 km boundary, distance from shore did not have a significant effect on this detection type (Fig. 7). Similar to distance from shore, depth (p-value < 0.01) had a significant effect only for acoustic detections. The water depth associated with the highest concentration of acoustic detections ranged between 9 and 11 m. The model showed an increase in acoustic detections across water depth, and acoustic detections remained constant in waters deeper than 15 m. The highest concentration of the mixed/visual detections was also similar to acoustic detections, mainly occurring between 9 to 11 m. However, we note a gradual decrease in the mixed/visual detections around 11 m, with very few records beyond 13 m (Fig. 8). The tensor product of swell and sea state had a borderline effect only on the mixed/visual detections (p = 0.06) (Table 3) and had no effect on the acoustic detections (Supplement).

Fig. 7
figure 7

Smooth for distance from shore (km) predicted by the GAMM for encounter rates of Indo-Pacific finless porpoises across detection type. Shaded portions indicate the 95% confidence intervals, color-coded according to the detection types. Vertical ticks on the X-axis indicate the distribution of encounters.

Fig. 8
figure 8

Smooth for depth (m) predicted by the GAMM for encounter rates of Indo-Pacific finless porpoises across detection type. Shaded portions indicate the 95% confidence intervals, color-coded according to the detection types. Vertical ticks on the X-axis indicate the distribution of encounters.

Table 3 Finless porpoise GAMM Results. The table includes smooths for the predictor variables. Distance shore is the measured distance from shore (km), depth (m), and the tensor product for swell and sea state measured in Beaufort. Significant predictor variables are highlighted in bold text.

Discussion

We present a comparison of acoustic and visual encounter rates for humpback dolphins and finless porpoises from the Sindhudurg region of India. While acoustic and visual methods showed complementary results for humpback dolphin detections, passive acoustic monitoring was most effective in detecting finless porpoises.

A key concern in comparing and matching encounter rates between acoustic and visual surveys is the difference between the detection ranges of the two platforms32. If the detection range of the acoustic platform is significantly greater than the visual, it may record more encounters simply from an ability to detect signals across wider distances. This could potentially inflate encounter rates from one platform or make it difficult to match detections across the two if such differences in detection range were not considered. Acoustic signals of some cetacean species are likely to be detected across greater ranges than visual cues32,33. However, for most smaller species of odontocetes, the visual and acoustic detection ranges are fairly comparable34. Visual detections of humpback dolphins can occur at greater distances, given their large body size and the presence of a fin. Similarly, the acoustic detection range for humpback dolphins could extend further than 1 km from the survey track, influenced by the broadband nature of their echolocation clicks. However, given the shallow bathymetry and high sediment content of the habitat, echolocation clicks are unlikely to be detected clearly beyond 1 km31. While tonal sounds of humpback dolphins (whistles and burst pulses) would propagate further, these were not considered in the analysis since we used click trains as the only detection unit to define acoustic encounters. A preliminary examination of localized distances from humpback dolphin click trains confirmed that most detections were within the same range as visual detections. Given these conditions, the likelihood of including acoustic encounters recorded beyond the visual detection range was rare. Examination of localized acoustic distances for finless porpoises showed accurate detections up to a maximum distance of 300–350 m from the towed array. Our findings are similar to observations made for acoustic detection distances for finless porpoises in China29. Given the narrow-band high-frequency properties of their echolocation clicks, sound propagation in a shallow turbid environment is often compromised. Visual detection distances for finless porpoises also fall within a similar range, as sighting the species further away from the survey track is difficult. Hence, acoustic and visual detection ranges were not starkly different for finless porpoises. Thus, for both species, we assume a good overlap between the acoustic and visual detection ranges within our study area.

Most humpback dolphin encounters were simultaneously recorded on both platforms, with neither method demonstrating significantly superior performance. Visual group sizes for humpback dolphins were generally higher than acoustic group estimates. This trend was consistent with a study from Malaysia comparing visual and acoustic group estimates for Indo-Pacific humpback dolphins27. A similar variation between acoustic and visual group size estimates has been observed in Irrawaddy dolphins35, suggesting larger groups may vocalize at lower rates. Preliminary analysis of acoustic data from our study area showed a similar trend with fewer individual echolocation tracks observed in the presence of larger groups compared to smaller group sizes. The encounter rate model for humpback dolphins explains factors responsible for variations across mixed detection types. Mixed detections were most influenced by distance from shore and the presence of vessels. Across their distribution range, humpback dolphin populations exhibit a habitat preference for coastal waters, primarily occurring within 2–3 km from the shoreline36,37. During our surveys, most humpback dolphin groups recorded beyond a distance of 2–3 km from the coast were either lone individuals or smaller groups traveling. A higher number of click trains and increased vocalization rate within smaller groups could be why most of these encounters were detected simultaneously on both platforms38. Mixed detections were also correlated with the presence of vessel traffic, indicating a potential association with the presence of preferred prey distribution and foraging behavior. The study area has high fishing intensity throughout the year, and vessel traffic primarily comprises active fishing boats (purse seiners, trawlers, and gill nets). While we do not discuss predominant group behavior in this study, foraging and feeding humpback dolphin groups have previously been recorded around fishing vessels in the study area6. Mixed events suggest the presence of actively foraging individuals using both acoustic cues for coordination and surface-active behavior, making animals detectable on both platforms39,40. The combined effect of swell and sea state had a borderline effect only on visual detections. However, a conclusive relationship could not be drawn from the GAMM model. A primary reason for this could be that most surveys were conducted in Beaufort sea states ranging between 0–3 and rarely exceeding 4. The data, therefore, may not present enough variation to draw an inference. The humpback dolphin model explained low variation in the data (17.4% deviance explained); however, this may be due to the relatively small sample size (n = 63) of encounters. Due to their extremely coastal distribution, a zig-zag survey designed closer to the shoreline, with more effort concentrated up to 2–3 km from the shore, would be preferable for monitoring this species. Overall, our findings indicate that combining visual and acoustic surveys is most effective for studying humpback dolphin populations. These findings are consistent with survey method recommendations derived for monitoring humpback dolphins in Malaysia27.

Acoustic encounter rates for finless porpoises were significantly higher compared to the visual encounter rates. These results align with studies for finless porpoises across China and Malaysia that report higher numbers of acoustic detections compared to visual27,29. We also observed variation in the group sizes estimated via acoustic and visual methods for finless porpoise encounters. However, unlike the study from Malaysia, which recorded larger group sizes (mean: 6.5 SD: ± 8.5) using visual detections as compared to acoustic (mean: 1.8 SD: ± 1.6)27, our estimates suggest the opposite. Potential factors for higher acoustic group size estimates can be attributed to the highly directional nature of the acoustic signal and reduced visibility due to turbidity in the study area. Both factors can cause individuals to echolocate constantly, contributing towards the detection and identification of distinct individual echolocation tracks. The finless porpoise encounter rate model suggests certain factors contributing to a higher frequency of acoustic detections. Acoustic detections were mainly influenced by distance from shore and depth. A higher frequency of acoustic detections recorded further from the coastline can be linked to an increased presence of porpoises in these areas. These sites could be associated with potential foraging areas. Studies show foraging porpoises spend an extensive amount of time echolocating41, which could explain the increase in acoustic detections in these locations. However, the variable was not found to have a significant effect on the mixed/visual detection types. A potential reason for this could be that the relatively small sample size of the mixed/visual detections (n = 39) compared to the acoustic detections (n = 114) might not provide enough variation in the data to observe a noticeable pattern. We also note that distance from shore was retained in the final models of both species, and the variable is probably a stronger indicator for overall habitat preference42,43 than the efficiency of the monitoring platform. Finless porpoise detections varied significantly across the depth profile, with a higher number of acoustic detections recorded beyond depths of 11 m than mixed/visual. These differences can be associated with longer dive times in depths beyond 2.5 m, as observed during studies on tagged individuals35. Studies have found finless porpoises also tend to swim slower at the end of dives beyond 2.5 m, suggesting these dives are linked to foraging events35. Finless porpoises also tend to use echolocation uniformly across different depths. Previous studies have established that depth is not a limiting factor for acoustic detections of this species44. An absence of visual detections across the depth profile could indicate individuals or groups actively foraging and feeding, characterized by longer dives and underwater duration. We found that the acoustic platform outperformed the visual across the depth profile. Although the combined effect of sea state and swell affected mixed/visual detections, it only showed a borderline effect. Visual monitoring methods and visual detection rates for most cetacean species are greatly influenced by oceanic weather conditions like increasing sea states, unfavorable wind speeds, and high wave height8,45. Although finless porpoises in certain geographic areas are reported to be visually detectable in sea states up to Beaufort 246,47,48, we had difficulty in detecting individuals when sea state \(\ge\)2 were combined with swell. Similar complications have been noted during visual surveys for finless porpoises in the Yellow Sea off China49. Range-wide, finless porpoises are known to be cryptic, with irregular surfacing patterns and small group sizes47. These behavioral traits, along with their small size and lack of a dorsal fin, provide sparse cues for visual monitoring. Overall, our data had a low number of visual detections (n = 39) for finless porpoises, and the model explained only 14.5% deviance in the data. This suggests some other factors not included in the model influence the visual and acoustic detectability of the species. Due to the elusive nature of finless porpoises, the absence of visual detections may be associated with behavioral ecology rather than external environmental conditions alone. Acoustic monitoring proved favorable, as the species vocalize constantly, irrespective of group sizes and behavioral states. Given the significantly high number of acoustic detections, we conclude that PAM is a more efficient tool for monitoring finless porpoises in the study area.

While our study provides insights into variations in acoustic and visual encounter rates of the two species, some limitations of the dataset should be noted. One constraint was the small sample size of visual-only encounters of finless porpoises (n = 4). As a result, we merged the visual-only and mixed detection types. This decision could potentially have some effect on the ability of the GAMM to detect variations within the data. Another limitation we recognize is that the sampling duration is restricted to the same hours throughout the study period. It is possible that animals may display variations in surface and acoustic behavior at different times of the day31,50. Future studies can address this by conducting surveys at different times of the day to account for potential diel variations in species behavior.

Implications for species monitoring

The Sindhudurg coast is part of a 4327 km\(^2\) region in the Arabian Sea that was demarcated as an Important Marine Mammal Area (IMMA) by the IUCN Marine Mammal Protected Area Task Force51,52 in 2019. The area primarily qualified for an IMMA based on the presence of resident populations of humpback dolphins and finless porpoises, which are vulnerable to anthropogenic pressures. The IMMA Task Force has highlighted the importance of continuously monitoring designated areas and vulnerable animal populations to detect any changes in distribution patterns and identify decreasing population trends51. The methods described in this study offer a robust framework that can be integrated within monitoring protocols to generate long-term population trends and provide rapid species assessment in response to increasing anthropogenic pressures. These encounter rates can be further analyzed to provide estimates of population densities and abundances, which can strengthen IUCN Red-List assessment reports for both species from Indian waters5,53.

Given the limited resources and trained observers available for studying cetaceans in developing countries like India, our approach of integrating acoustic and visual surveys provides a more feasible method to monitor co-occurring species together. We suggest such joint survey methods should account for varying detectability (acoustic and visual) of the target species to make monitoring efforts most efficient.

Population estimation surveys for humpback dolphins would best benefit from a combination of acoustic and visual monitoring. While we found the two detection platforms complement each other, it is important to highlight that PAM alone cannot provide adequate information on group sizes. Group size estimates can be essential when converting encounter rates to abundance and density estimates54. A combined acoustic and visual method would offer more reliable abundance estimates as we find visual data provides better information on group size estimates for this species. However, if the sole aim of a monitoring survey is to detect the presence of humpback dolphins, both standalone PAM or visual surveys would provide equally robust data. In the absence of trained observers, acoustic surveys offer another advantage of having data processed in a lab and thus require minimal personnel onboard for vessel-based surveys. In contrast, for finless porpoise surveys, acoustic detections were three-fold higher than visual detections. This difference was irrespective of weather conditions. Considering the elusive behavior of the species, even favorable sea states did not yield a high success rate for the visual platform in detecting the species. For finless porpoises, we recommend conducting acoustic surveys, even if trained observers are not available for simultaneous visual surveys. Acoustic data reliably provided robust encounter rates for the species, with minimal need for simultaneous visual validation. If localization distances are added, these standalone acoustic detections can also be converted into density and abundance estimates18. Overall, acoustic surveys also provide greater flexibility when conducting surveys in terms of weather conditions or daylight.

While this study does not focus on informing abundance or density estimates, our results highlight species-specific pros and cons of the two observation platforms, which should help to inform long-term monitoring efforts. These co-occurring species have significantly different ecology and acoustic behavior, and combined monitoring efforts would benefit from considering these differences when designing population estimation surveys.

Methods

Survey area

Surveys were conducted in Indian waters along the Sindhudurg coastline from Vijaydurg (16°33′24.72″N, 73°19′5.47″E) to Redi (15°44′43.95″N, 73°39′7.29″E) (Fig. 9). The survey area has a primarily sandy seafloor interspersed with rocky outcrops. The coastline is extremely shallow, with a gentle slope gradient, and the overall water depth within the survey region ranges from 2 m to 25 m. The study area extended \(\approx\)6km from the shoreline, forming a polygon of 376 km\(^2\). The polygon was stratified into seven blocks and labeled numerically from north to south (1–7) (Fig. 9). Following initial trials, blocks 4 and 7 were excluded from the survey effort due to their extremely shallow and rocky bathymetry, which made maneuvering the research vessel with the towed acoustic array difficult.

Fig. 9
figure 9

Study area marked with seven blocks showing transect lines in red. Blocks marked in gray indicate areas not surveyed due to bathymetry constraints. Blue lines in different shades indicate the depth (m) contours.

Field data collection

Combined acoustic and visual transects following a distance sampling protocol were conducted between 2020–2023 (Table 1). The zigzag line transects were designed in Distance 7.255 and were laid across the gradient of distribution of the focal species. An 11-meter modified trawler was used as a survey vessel throughout the study period. The vessel was equipped with a viewing platform approximately 2.5 m above the water surface. Transect speed was maintained between 11 and 12 km/h throughout the data collection period. Transects were on-effort only during daylight hours between 06:00 to 14:00 IST and in sea state \(\le\)4 Beaufort. Transects were conducted in passing mode, where the vessel continued to move along the survey track and did not approach or stop when the observers sighted a focal group56. The vessel was temporarily navigated off-track only in events where active trawling boats were directly in line with the transect.

A team of three trained observers was positioned on the platform (located at port, center, and starboard, respectively) to scan the water surface continuously. The observers covered a 140\(^\circ\) view between the starboard and port observers, and the center covered 40\(^\circ\) (20\(^\circ\) on either side) of the bow. An angle board was placed at the vessel’s bow, with the bow marking 0\(^\circ\). Observers continuously recorded environmental conditions, visibility (glare), sea state (in Beaufort), and swell height (in meters), along with presence, activity, and type of vessel traffic within a 1 km radius of the transect. Although we recorded glare at the beginning of each transect, it was not found to affect detectability as all observers wore appropriate polarized sunglasses. At every visual encounter of the focal species, observers recorded a sighting angle and radial distance to the focal group. Radial distances were estimated by eye, and all observers were trained in distance estimation throughout the survey effort. A GPS location was marked for each visual encounter on a handheld GPS device (Garmin 12H), and observers noted a minimum, maximum, and best group size estimate for the sighting57. Data on group compositions and predominant group behavior were collected opportunistically.

Acoustic data collection

A custom-made linear four-channel acoustic array was towed behind the vessel continuously to record the focal species independently of visual observations. The array was customized based on a design developed by Barkley et al. (2016)24 with modifications to the housing. The recording setup was contained in a rigid 6.35 mm PVC tube enclosing a thinner 1.5 mm PVC tube with four slots for hydrophones. The array had slots towards the opening and closing caps for flooding with seawater. Two pairs of HTI 96-Min hydrophone elements (HighTech Inc, Long Beach, MS, USA) were suspended in the inner tube using O-rings, taking care to ensure the cable did not touch the wall of the housing tube. The distance between each element within a pair was 25 cm, and the pairs were spaced 50 cm apart. All hydrophones had a flat frequency response of 2 Hz - 30 kHz, with a preamp sensitivity of \(-164\,\text {dB re:} 1\,\text {V/}\mu \text {Pa}\). The hydrophones were equipped with a 1 kHz high-pass filter to partially reduce flow and vessel noise in recordings. The hydrophones had a sensitivity range between \(-165\,\text {dB re:} 1\,\text {V/}\mu \text {Pa}\) to \(-162\,\text {dB re:} 1\,\text {V/}\mu \text {Pa}\). The hydrophones were connected to a four-channel autonomous recorder type SoundTrap ST4300 (Ocean Instruments, Auckland, NZ). The ST4300 had a frequency bandwidth of 20 Hz to 90 kHz ±3dB and an internal gain setting of 4dB. All four channels were sampled simultaneously and continuously at 288 kHz sampling rate and 16-bit resolution. File size was limited to 5 min per file for ease of downloading the data. The array was equipped with a downrigger type 2152 High-Speed Planer (Pro-Troll, Concord, CA, USA) at \(\approx\)4 m behind the vessel, suspended via a 1.5 meter slack line to maintain a steady horizontal position in the water column and to obtain a consistent towing depth between 4 and 5 m. Array depth was controlled by the tow line in certain shallow sections of the survey area and monitored with a depth sensor type Sensus UltraPro (ReefNet, Mississauga, ON, Canada). The array was towed between 23 and 25 m behind the survey vessel. All ST4300 data files were downloaded in the raw SUD format for further analysis. SUD files were decompressed to generate a set of WAV files for each survey.

Acoustic encounters

Acoustic data were manually screened in Raven Pro 1.6 (K. Lisa Yang Center for Conservation Bioacoustics, Cornell University, USA) to detect the presence of the focal species. Humpback dolphins and finless porpoises produce distinct acoustic cues which can be differentiated easily. For both species, the first and last detected acoustic cues were marked in each file. Echolocation clicks, together with tonal sounds, were used to detect the presence of humpback dolphins. An acoustic encounter for finless porpoises was defined by a silence period of 5 min between successive detections. A 5-min window was chosen considering an average swimming speed of 4.5 km/h for finless porpoises29. Since the survey vessel maintained a constant speed of 11–12 km/h, it was fast enough to avoid duplicate counts. Considering the acoustic detection range of finless porpoise clicks29, in addition to their tendency to echolocate constantly26, a silence period of 5 min was a good indicator for a new acoustic encounter. Encounters for humpback dolphins were separated by a silence period of 15 min between acoustic detections, given a longer detection range for their broadband signals31. The survey vessel speed was also fast enough to avoid duplicate encounters with humpback dolphins, considering their swimming speed between 2.5–6 km/h58,59.

Acoustic group sizes

Data for both species were further processed in PAMGuard (version 1.15.17)60 using the click detector and click classifier module. Echolocation click trains of both species were manually marked as events. Each event corresponded to a series of clicks, with changing bearing angles as the vessel crossed the vocalizing animals. Individual echolocation tracks within the predefined time bins matching visual encounters were counted to estimate the best group size from the acoustic data.

Visual encounters

Visual encounters were defined as a sighting of one or more individuals of the same species in an area. To keep data consistent and comparable between the acoustic and visual methods, the same time bins of 5 min and 15 min were used to segregate successive events. For visual encounters, as with acoustic encounters, no attempts were made to identify individuals and encounters beyond the defined time bins were treated as independent events61. Since the recording system was autonomous and all acoustic data were post-processed, the acoustic and visual encounters did not influence each other. After data processing, each encounter was labeled with a corresponding detection type (acoustic, visual, or mixed). Encounters were labeled “mixed” when simultaneous acoustic and visual detection occurred in the same timeframe. Time taken to travel between zigzag lines was taken as off-effort, and visual encounters during this time were marked accordingly. Acoustic encounters for off-effort sections were identified using timestamps from the WAV files. All off-effort data were excluded from the analysis. Transects were segmented into halves, and midpoints for each segment were marked in QGIS using the QChainage plugin version 3.10.10. Segmented transects with the half nearest to shore were labeled “in,” and the half extending to the boundary of the survey polygon was labeled “out.” Depth (in meters) and distance from shore (in kilometers) for each encounter and segment midpoints were calculated post-hoc using the GEBCO bathymetry chart http://www.gebco.net and NNJoin plugin version 3.1.3 in QGIS 3.24.2 respectively.

Data analysis

Statistical analysis was performed in R (ver 4.3.1, R Core Team 2021)62. The overall acoustic encounter rate for both species was derived by combining acoustic-only detections with mixed detections. Similarly for the visual encounter rate, visual-only and mixed detection types were both considered. A Wilcoxon rank-sum test was used to compare the acoustic and visual encounter rates and estimated group sizes using acoustic and visual methods. Visual and acoustic group estimates were only compared for mixed detection types.

Generalized Additive Mixed Models (GAMMs) were used to assess the relationship between encounters as a function of detection category and predictor variables. GAMMs were chosen as they are flexible and do not assume a linear relationship between the response and predictor variables63. GAMMs are also suitable for non-normally distributed data, which are common in ecological studies. GAMMs were fitted to the data using the mgcv package64 by including a random effect within the Generalized Additive Model (GAM) framework. Separate GAMMs were fitted for both species, with the number of encounters per segment across each detection category as the response variable. Detection type (acoustic, visual, or mixed) was included as a factor. Smooth terms for the explanatory variables were estimated separately for each detection type using the “by = factor” argument (Supplement). Segments without encounters were also included and were given a count of zero. Predictor variables included swell (m), sea state (Beaufort), depth (m), distance from shore (km), and vessel traffic (number of boats) within 1 km of the transect. To keep data consistent between segments corresponding to the presence and absence of encounters, measures for depth and distance from shore were taken using the midpoint of each segment. To test the combined effect of swell (m) and sea state, we included these as a full tensor product interaction since the two variables operated at different scales. The transect segment length (effort) was log-transformed and included as an offset. The number of knots for each smooth term was manually restricted between three and five to avoid overfitting the data. Variables with a value of p < 0.05 were considered to have a significant effect, and variables with p < 0.1 were considered borderline.

As the data structure included encounters per segment across each detection type, a unique transect label (including month, year, transect ID, and segment) was included as a random effect. Multicollinearity between predictor variables was tested by computing variance inflation factor (VIF) scores using the car package65, with a cutoff of VIF \(\ge\) 366. The model was checked for autocorrelation following the method outlined by Zuur et al. 2010 using the acf function in the itsadug package67.

The visual and mixed detection types for finless porpoise encounters were combined into a single category (type = mixed/visual) as the number of only visual detections was extremely small (n=4). Group size was not included as a variable in the GAMMs as the model could not be fitted accurately with missing values across detection types.

Different distribution families, including Poisson, negative binomial, quasi-Poisson, and Tweedie, were tested. The Tweedie family is flexible to accommodate zero-inflated and overdispersed data and was considered the most appropriate fit for both models based on diagnostic plots (Supplement). Tweedie GAMMs were built using the tw function in mgcv, which allows the function to estimate p automatically during fitting, where p assesses the relation between the mean and the variance of the distribution. The value of p ranges between 1 and 2, where p = 1 indicates a Poisson distribution and p = 2 indicates a gamma distribution. We used the restricted maximum likelihood (REML function) as the optimization method. Model diagnostics were performed using the mgcViz package68. The dredge function in the MuMIN package69 was used to conduct backward model selection. The final model for each species was chosen based on the Akaike Information Criterion (AIC) values within a \(\Delta\)AIC score of \(<2\) units of the top model.