Background & Summary

Underwater radiated noise (URN) from shipping activity has been identified as a significant component of ocean ambient noise1,2,3, with recent studies indicating that URN doubles in intensity every decade. The effects of URN on various components of the marine ecosystem, from mussels to marine mammals, have been extensively studied across disciplines4. Several reports (e.g.5,6) suggest that blunt tissue trauma comparable to blast injuries and behavioral changes reflected in social stability and foraging ability have the potential to affect aquatic animals residing in close proximity to a vessel emitting high sound intensities from its propellers or onboard engines. Research has shown that URN from vessel activity leads to a myriad of adverse effects on fish, crustaceans, and especially on cetaceans4,7,8,9,10. In particular, the responses of a given aquatic animal to anthropogenic noise can be divided into five main categories11: (I) audibility; (II) behavioral responses reflected in changes in the intensity, frequency and intervals of the animal’s vocalizations as well as stress behavior; (III) masking of sounds required for communication, localization and foraging; (IV) physiological auditory threshold shifts due to inner ear hair cell fatigue; and (V) physical damage (injury) to the auditory system. In addition, there are direct risks associated with physical disturbance, as in the cases where sea turtles12 and large baleen whales13 colliding with vessels, partly because the low-frequency sounds from ships interfere with their navigation. Because of these effects, URNs generated by ships are considered a source of pollution and should be monitored regularly by measuring ship noise.

Shipping URN includes high-power impulsive transient waves generated during ignition14, narrow-band noises generated from thrusters, engines as well pumps and generators15, and cavitation noises. The latter is caused by the fast enough turn of the propeller to allow low-pressure areas of the propeller to drop below the vapor pressure and the seawater to boil at ambient temperature. When the bubbles reach ambient pressure behind the propeller, they implode, yielding broadband stationary noise. The results are: (1) low-frequency noise (around 50 Hz, distributed over a huge area, which impacts the communication of large marine mammals like baleen whales and dolphins); (2) noise from 4-stroke engines (around 200-800 Hz, independent of speed, medium distribution, which likely impacts toothed whales); and (3) high-frequency noise with higher harmonics due to the Lloyd’s mirror effect16 (1 kHz-10 kHz, speed-dependent, which can bear a significant amount of acoustic energy, and likely effects small mammals and fish).

Standards have been established to limit the transmitted acoustic power per exposure time17, and regulatory organizations such as the European Union, the Convention on the Conservation of Migratory Species (CMS), and the Convention on Biological Diversity (CBD), have adopted resolutions aimed at reducing underwater noise from ships and other man-made noise sources. There have even been recent efforts by the shipping industry to reduce URN by creating specialized notations for ships that meet certain noise criteria (e.g., https://www.lr.org/en/knowledge/press-room/press-listing/press-release/lr-announces-new-underwater-noise-notation/). Although there are standards for quantifying the URN of vessels18, these require the cooperation of the ship: sailing on a fixed route and at nominal speed. The measurement is carried out at deep sea and under low-noise conditions. These conditions do not apply to the URN measurement of a vessel of opportunity (VOO). Specifically, vessels that are not obliged or avoid carrying an automatic identification system (AIS). As a result, the magnitude of noise from such vessels remains under-explored, and a quantitative study examining the extent of shipping URN has yet to be thoroughly conducted. The main barrier to such studies is the establishment of a proper approach to match the size and type of a vessel to its URN.

In this paper, we have addressed the challenge of quantifying the URN of VOO by integrating acoustic measurements with video footage processed using machine learning to estimate the vessel’s type, speed, and size. This negated the need to detect the vessel’s URN in the acoustic data, and the closest point of access (CPA) could be readily analyzed by time synchronization of the optical and acoustic data. The results of our work is an openly shared dictionary for URN of VOO. The data was collected at the entrance to St. Anthony’s Channel near Šibenik, Croatia. Through this channel, a daily traffic of hundreds of vessels exists. Most of these vessels do not carry an AIS but are easy to observe visually as they enter or exit the channel. In total, 1148 vessels were assessed over a period of 23 days. The measured radiated noise took place right below the vessel, where sound propagation is not affected by seasonal changes. Hence, we argue that the period sampled is representable. Our data comprises pictures and videos of the vessels, the spectrum of their URN at the CPA, and meta-data in the form of the vessel’s type, size, height, and speed. In the following, we outline our data collection method, the details of the database and its front-end tool that allows comparison of shipping URN.

Current Approaches in Shipping Noise Measurement

The literature regarding URN measurement focuses on assessing the noise levels and spectral characteristics of various vessel types under different operating conditions (e.g., speed, cargo load, draft, length, and machinery load). There are currently two main approaches for assessing URN from vessels; the first is a “full-control” approach where specific vessels are chartered to conduct measurements under specified operating conditions19,20,21. This approach requires near full control over the measurement and operating conditions (e.g., speed of the vessel, CPA range), thereby enabling highly accurate measurements as well as the ability to conduct repetitive measurements of the same vessel under different operating conditions. However, the cost-effectiveness of this approach also implies that usually, only a small number of vessels is assessed. The second and more prominent approach is the “opportunistic” or in situ approach, where sensors and data collecting units are placed near main waterways, and measurements of VOOs are acquired as they transit the area22,23,24. While the in situ approach lacks the ability to control the measurement or operating conditions, it does enable the collection of a larger sample number over longer periods of time.

The most common methodology utilized in both the full-control and the in situ approaches is cross-referencing acoustic data collected by hydrophones with vessel data. In this aspect, the introduction of the Automatic Identification System (AIS) in the early 2000s has created an accessible and convenient venue for collecting vessel data. The bulk of current research frameworks have primarily depended on AIS for retrieving vessel data22,25,26,27. While the widespread use of AIS creates more opportunities for full-control and opportunistic URN assessments, it is also important to point out its inherent limitations - as noted in the SOLAS regulation V/19.2 (see https://www.navcen.uscg.gov/sites/default/files/pdf/AIS/SOLAS.V.19.2.1-5.pdf), AIS transponders are a legal requirement only for large commercial vessels, e.g., cargo vessels, tankers, passenger ships. This implies that smaller commercial and recreational vessels that are abundant in coastal areas are mostly absent from the current research agenda. The importance of this absence is further emphasized by two aspects. Firstly, the numerical comparison between the number of large commercial vessels and the number of small recreational vessels - the world’s total fleet of large commercial vessels is comprised of nearly 109,000 ships (see https://unctadstat.unctad.org/datacentre/dataviewer/US.MerchantFleet), while there are nearly 12 million recreational vessels in the US alone28. Secondly, the scarce research that has addressed URN from small recreational vessels suggests that under certain conditions, URN from such vessels may be as high or even exceed those of large commercial vessels25,29,30. Accordingly, it would seem imperative that a novel approach for assessing URN from smaller vessels is required to fill this gap.

The main challenge for conducting wide-scale assessments of small vessels not equipped with AIS is the retrieval of vessel data in order to collect information on the vessel’s type, speed, size, and CPA range. This is available through sensors such as radar and HD cameras. Cope et al.29, for example, utilized a multi-sensor system (Marine Monitor - M2) comprised of AIS, Radar, and an HD camera for assessing URN levels of vessels. While this approach enables retrieving essential vessel data such as vessel speed, course, and CPA range, other important parameters, such as vessel size, cannot be accurately assessed. To account for the need to collect acoustic data as well as detailed vessel information accurately, we propose a novel approach based on the integration of acoustic and optical sensors.

Current Approaches in Vessel Detection from Optical Cameras

The methods for detecting ships using cameras are designed to cope with motion and observation noise. The vessel detection approach in31 performs background subtraction to detect primary motion in the scene while avoiding interference from wave motion using saliency detection. Similarly, in32, the authors propose a video-based port surveillance that combines context and motion detection. The context part of the algorithm involves graph-based segmentation to identify the ship within the water region, while the motion saliency algorithm utilizes the faster motion of the ship relative to its surroundings. A different approach is proposed in33, where eigenvalues of region covariance are computed to distinguish man-made objects from the natural background. Another type of scene separation using optical flow is proposed in34 to estimate the moving region in the image, and a Gabor filter is implemented for extracting texture features indicating where the ships are located. However, different possible vessel types make it difficult to achieve robust detection and classification.

Due to changing visibility conditions and different shapes of vessels, solutions have recently been developed to identify ships with optical cameras using machine learning methods. These require training with large data sets. For example, the dataset provided in35 contains 5,500 ship images of 109 different classes taken in the Porto Cesareo Marine Protected Area. Results of ship identification using a discriminatively learned CNN are presented. Another dataset is available in36, comprising 70,513 vessels, which were recorded in 48,966 images from 10 camera perspectives. As a basis for vessel identification, the authors propose the Single Shot Detector (SSD) algorithm, which supports significant variations in vessel size and aspect ratios.

Current approaches to vessel identification are based on transfer learning from already trained models. In37, the YOLOv3-tiny network is used to recognize vessels and distinguish between six vessel types. For the detection of large vessels, which are easily disturbed by surrounding buildings, waves and lights on the water surface, the Convolutional Block Attention Module (CBAM) is used to focus the model on the target vessel. In38, the YOLOv5XL model is used to fuse camera data with AIS information. Through transfer learning, only the last layer was trained with annotated data, which contributes to the generalization of the model. Another application of the YOLOv5 network is presented in39 to classify between vessel types.

What distinguishes our work from previous work in vessel detection from optical cameras is the use of the YOLOv5 network to detect different types of vessels traveling through narrow canals, and to estimate their size and speed. This system provides more information about the vessel than a bounding box within the image and can manage multiple vessels within a single image. This additional information is used to establish a better link between the vessel’s URN and its characteristics and to monitor maritime traffic in narrow areas such as canals, channels, and entrances to marinas with high shipping activity.

Methods

Description of the Testbed

Our testbed resides in the area of St. Anthony’s Channel (Sv. Ante in local dialect) Inlet near Šibenik, Croatia. During the summertime, the area is characterized by high-density VOO, which includes various classes of vessels, ranging from coastal commercial ferries to smaller recreational vessels, e.g., yachts, motorboats, and sailboats. Due to the narrow passage, there is only one lane within the channel. These conditions make the area an ideal testbed for analyzing underwater radiated noise from vessels that are not obligated to carry AIS transponders.

The testbed was deployed on August 2023, and aimed to record the URN of VOO passing through the Šibenik channel and to link it with the vessel’s size, type and speed. The testbed contained an acoustic recorder and a shore-fixed camera. An illustration of the testbed is shown in Fig. 1.

Fig. 1: Setup of the recorder.
Fig. 1: Setup of the recorder.
Full size image

Left panel: Illustration of the acoustic-optic testbed for quantifying the URN of VOO by using syncronized optical measurements from a shore based camera and an acoustic recorder positioned at the seabed. Right panel: Picture of the underwater acoustic recorder at the deployment site.

The acoustic recorder, an AMAR G3 JASCO with a single M36-V35-900 Geospectrum omnidirectional hydrophone, was placed near the eastern entrance to the Channel Inlet at coordinates 43.728400°, 15.879271°. See map in Fig. 2. The hydrophone had a flat with max 0.5 dB ripple response between 0.01-100 kHz and sensitivity of -164.9 dB re:1V/μPa. At the point of deployment, the channel is 130.3 m wide and most vessels pass in the middle of the channel. The recorder was mounted vertically to an anchor one meter above the seabed at a depth of 29 m. The hydrophone’s cage was covered by a dense yellow net to reduce flow noise and assist in locating the recorder after three months of deployment, as shown in Fig. 1. The recorder was set to record continuously for 12 hours a day between 7:00 to 19:00 (UTC+2) to account for daylight hours for the specified region at the time of deployment. The recording was made at a sampling rate of 48 kbps, in a resolution of 3 Bytes per sample. In total, 661 GB were recorded over a period of 113 consecutive days.

Fig. 2
Fig. 2
Full size image

A satellite image of the deployment site. The recorder location is marked in red (right next to the vessel in the image).

The shore-fixed camera was a DAHUA SD-59230U-HNI video camera with an 30x optical zoom. The camera was mounted on a designated stand roughly 800 meters from the canal entrance at coordinates 43.736305°, 15.876754°. The camera was configured to record ships passing through the canal at a continuous rate of 30 FPS at the resolution of 1920 × 1080 pixels. Its 67.8° field of view (FOV) provided a wide perspective of the canal. The camera, which is equipped with an IR night vision device, recorded data continuously, both day and night. However, during the inspection, it was found that the night vision performance did not meet the required range. Therefore, the night data was excluded from the data set, resulting in a total of 340.5 GB of data being collected over a period of 26 days. The camera data was continuously transmitted over a local network and recorded on a PC with sufficient internal memory. The data recording was monitored to avoid possible malfunctions and to ensure uninterrupted data recording. The recorded data was saved in the DAV file format, a compressed and encrypted video format developed by Dahua. To facilitate access and subsequent processing, the DAV files were converted to the more commonly used MP4 format.

Example image from the camera containing two vessels is shown in Fig. 3. For size estimation, the length and height of two landmarks were measured. These are the cretaceous limestone layer, both visible in Fig. 3.

Fig. 3
Fig. 3
Full size image

Examples of an image with two vessels obtained from the video camera.

Visual Analysis

To analyze the raw optical data, we develop a machine learning based model for vessel detection, tracking and classification from video footage. The model provides a bounding box around the vessel. The algorithm then estimates the meter/pixel ratio and determines the size of the boat in meters from the width of the bounding box. In turn, the tracking procedure follows the bounding box through consecutive frames to estimate the vessel’s velocity.

Vessel Detection and Classification

For detection, we used the YOLOv5-based trained model which is a fast training model. The details of this implementation are presented in40 and are given here in brief for completion. The YOLOv5s model is the second smallest of the YOLOv5 family (see code in https://github.com/ultralytics/yolov5. The model includes a backbone based on Darknet53 with Cross Stage Partial (CSP)41, namely 2 BottleNeckCSP modules and one Spatial Pyramid Pooling (SPP)42 block (SPPF) with max pooling (kernel pool size = 5) and a neck based on the Path Aggregation Network (PANet). All convolutional layers used a Sigmoid Linear Unit (SiLU) activation function and batch normalization except the three layers that form the head of the network (Sigmoid function).

The model detects and classifies vessels into 9 classes: sailboat, catamaran, small boat, yacht, big yacht, tour boat, passenger boat, trawler, medium boat. For vessel detection and classification, the model was trained using 2100 manually labeled images, observed in 4 different days, and was validated and tested with 100 and 300 images, respectively, captured on days unrelated to the training dataset. Tuned hyper parameters yielded 100 epochs and a batch size of 8. Challenges occur in edge cases when a boat exits the frame, causing the bounding box to start shrinking. Avoiding this, we define a region of interest (ROI) that isolates edge positions.

Vessel Size Estimation

To determine the meter/pixel conversion rates, we used as ground truth 10 vessels from the collected visual data for which the size is known by factory specifications. This small dataset included speedboats, catamarans and tourist boats for which the size can be easily found online. For each boat, up to 3 images were identified showing the vessel in the beginning, middle, and end of the frame. The meter/pixel ratio was calculated for each of the 10 vessels as the relation between real vessel length and detection bounding box width. A list of all occurrences for the explored 10 vessels is given in Table 1, and examples of the used frames are given in the Fig. 4.

Table 1 Boat Dimensions and Pixel Coefficients.
Fig. 4
Fig. 4
Full size image

Image detections of the boats used for the estimation of meter/pixel coefficient.

Velocity estimation

For estimating the vessel’s speed, we applied the SORT (Simple Online and Realtime Tracking) approach43, which is based on Kalman filtering. For vessel dynamics, we consider a constant velocity model. We choose SORT due to its ability to maintain track albeit loos of detection for a few frames. This is a typical scenario in our database due to occlusion events where vessels cross each other entering or existing the canal. The speed of the vessel is calculated by the distance between the bounding boxes at consecutive frames, while accounting for the estimated pixel/meter ratio.

Vessel Noise Estimation

Quality Control

To ensure proper synchronization between the acoustic recorder and HD camera throughout the data collection period, the system was subjected to a test trial to check for any time drifts. The internal clocks of the acoustic recorder and HD camera were also compared before and after the data collection period.

Acoustic recording of vessel transits were subjected to quality control on three levels: (1) cross-referencing the data with the HD camera, (2) manually reviewing acoustic samples by an expert analyst, and (3) monitoring real-time weather information. Samples were discarded from the post-analysis under the following criteria.

  1. 1.

    A vessel transit occurred between recording segments;

  2. 2.

    Transit occurred in a time frame where wind speed or sea state exceeded ANSI and ISO standards;

  3. 3.

    If a vessel was employing an active acoustic system (e.g., echo-sounder/Fish-Finder);

  4. 4.

    If a vessel abruptly changed its course or speed;

  5. 5.

    If a vessel conducted a north/south transit and did not enter or exit the channel;

  6. 6.

    If another vessel transited the area within the same time frame;

  7. 7.

    If a marine fauna noise was detected during the vessel’s transit.

To calculate the vessel’s source level, SL, we reverse-propagate the received acoustic signal, accounting for the transmission loss, TL, to an idealized monopole @ 1m from the source according to ISO standards44, such that

$${\rm{SL}}={\rm{SPL}}+{\rm{TL}}\,.$$
(1)

This requires the calculation of the SPL and the transmission loss as follows.

Received Noise Levels

Received noise levels (RL) measurements were conducted for the full spectrum of 0.023–24 kHz. The upper-frequency limit is set by the sampling rate, while the lower limit accounts for the cutoff frequency resulting from the water depth according to Jensen et al.45,

$${f}_{0}=\frac{c}{4D\sqrt{1-{\left(c/{c}_{{\rm{b}}}\right)}^{2}}}\,,$$
(2)

where c is the sound speed in water, cb is the sound speed in the seabed, and D is the water depth.

The time window length used for URN evaluation, which is an essential aspect of the measurement, varies across different research frameworks. The approach applied in Pine et al.46 and Zhang et al.26, for instance, utilized a constant time window length with no dependence on the length, speed, or range of the vessel. Other approaches chose to apply a variable time window depending on the vessel’s physical characteristics. For example, in Bahtiarian et al.47 and McKenna et al.22, the selected time window was dependent on the time it takes for a vessel to travel its length. Unfortunately, neither of these options is suitable for our setup. In particular, as the results in Figs. 6, 7 show, our database includes significant variances in vessel transit speeds and lengths. Instead, we chose to follow the ISO44 standard, which sets a variable time window dependent on the aspect of each vessel (60–120 deg of the bow aspect). Here, the time window length was calculated based on the vessel’s speed, as reported by the visual algorithm.

To measure the SPL, we applied (3); the variable time window used for the measurement of each vessel transit was processed in the frequency domain by a fast-Fourier transform (2048 FFT points, Hanning window with a 50% overlap). The 2048 FFT resolution was chosen to allow comparability with other research frameworks48,49.

$${\rm{SPL}}=20{\log }_{10}\left(\frac{{P}_{{\rm{RMS}}}}{{P}_{{\rm{ref}}}}\right)\,,$$
(3)

where Pref = 1 μPa and PRMS is the root mean square of the sound pressure level measured within the variable time window. For the measurement of third-octave bands, we applied the same parameters, albeit with a higher frequency resolution (32,768 FFT points), in order to comply with standard one-third octave bands according to the ANSI Standard S1.11-2014/Part 150.

Transmission Loss

The acoustic transmission loss depends on the range of the vessel, geometric spreading characteristics, and the seabed and water column properties51. Referring to the vessel’s range, while the entrance to the inlet is measured at 135.7 m, our observations showed that nearly all marine traffic passes through a  ~ 90 m lateral area in the middle of the channel inlet. These observations coincide with known hazards to safe navigation and official maritime routes described in maritime charts. Based on this information, we calculate the maximum potential CPA range as the slant between the hydrophone and the described lateral area (49 m) and the minimum potential CPA range as 29 m (water depth to hydrophone). For the geometric transmission loss calculations, we applied the mean CPA range (39 m). This ±10 m ambiguity in the vessel’s location can be compared with the location error of URN databases relying on AIS information, which is known to present a mean discrepancy of up to 97.72 m52.

In the aspect of absorption losses, applying Ainslie and McColm’s53 equation for absorption in seawater for the mean CPA range under different scenarios of temperature and salinity found in the tested area, we reach a maximum loss of roughly 0.15 dB at the mean CPA range. We, therefore, consider absorption losses as negligible. Similar conclusions about the negligibly of absorption losses in URN levels of vessels at close ranges have been made in Hermann et al.54 and McKenna et al.22. As the type of sediment in the test bed area consists primarily of fine gravel55, which possesses a high reflective coefficient21, we chose to apply the simplified equation described by Erbe56, which has also been used in similar research frameworks57,

$${\rm{TL}}=20{\log }_{10}(D)+10{\log }_{10}(R/D)\,,$$
(4)

where R is the horizontal range between the mean CPA and the receiver and we recall that D is the water depth.

Data Records

The repository data is at Science Data Bank58. We declare that the openly sharing of all videos, images and acoustic recordings of the ships in our dataset are in compliance with all ethical and legal considerations. We declare all approvals and permissions required for the open publication of the data have been granted. In particular, institute Ruder Boskovic, as a state scientific institution, has the permission to perform research on the territory of the Republic of Croatia. A permit for research in the protected area of Krka estuary, where the data was collected, has been obtained. This permit was given since 2014 as part of the project “Nautical Tourism Ecological Footprint in MPAs” financed by MEDPAN, to log data and provide local authorities with information about vessels passing through.

In this section we describe the arrangement of the data.

Arrangement of Data files

Visual Information

The optical dataset is organized into 12 folders, each named according to the starting time of the recording in the format yyyymmdd_hhmmss. Each folder contains pairs of 1-hour video files and corresponding .csv files. In total, there are 324 .csv files. Each .csv file contains detection information with the following columns:

  • time_of_detection: The time at which the detection occurred.

  • boat_class: The class of the detected boat.

  • size: The size of the detected boat.

  • velocity: The velocity of the detected boat.

An example for the visual data architecture is shown in Fig. 5.

Fig. 5
Fig. 5
Full size image

Organisation of the visual data.

Acoustic Information and Online Database

The acoustic dataset and processed acoustic information include the time and date of transit, the type of vessel, the sub-type of the vessel, the length and speed of the vessel (according to the visual algorithm), the calculated SL–SPL, and one-third octave spectrogram, and, where applicable, the full name and IMO number.

Technical Validation

Vessel Detection Results by Optical Camera

Summary of Detected Vessels

Our dataset contains 3,799,505 camera detections of various types of vessels, along with their corresponding detection times, estimated sizes, and velocities. The histogram in Fig. 6 indicates that the “small boat” class is by far the most common in this dataset, representing around 65% of all detected vessels. Additionally, sailboats are frequently observed, likely due to the presence of charter marinas in the area. A distribution of vessel sizes is presented in Fig. 7. The data reveals maxima at around 5-8 meters, which is a common size for vessels classified as small boats, and at 12-15 meters, which is typical for regular sailboats.

Fig. 6
Fig. 6
Full size image

Histogram of vessel classes the final vessel detection result.

Fig. 7
Fig. 7
Full size image

Histogram of vessels size in the final vessel detection result.

The histogram in Fig. 8 shows the density of vessels’ velocities. We determine positive speed for vessels exiting the channel towards the port and vise versa for vessels entering the channel. Results show that vessels moving towards the port move slower compared to those heading towards open sea. Most vessels travel at an absolute speeds of 2.5–4.0 ms−1 (4.85–7.7 knots). This is expected, considering that the most frequently detected class was small boats.

Fig. 8
Fig. 8
Full size image

Histogram of vessels velocity in the final vessel detection result. Positive speed reflects vessels exiting the channel, and vice versa for vessels entering the channel.

Results of Vessel Size and Speed Estimations

To examine the algorithm’s performance in estimating vessel size and velocity, a small dataset was created with the vessels of known size and speed. The picture in Fig. 9 is of a small research vessel belonging to the Institute Ruđer Bošković, with a measured waterline length of 8 m and a total length of 9.3 m. To evaluate the size estimation error, the boat performed two transactions on both the left and right sides of the channel. The images collected from 4 such passages are shown in Fig. 9.

Fig. 9
Fig. 9
Full size image

Detections on the vessel with known size for the purpose of algorithm performance evaluation.

Successful detection and classification occur in all frames. However, due to the angle of the vessel, the final part of the stern is included in the bounding box of only some of the frames. Size estimation results of the test vessel for 6 corresponding frames are given in Table 2. The average length of 8.495 m is close to the waterline length, and a standard deviation of 0.2187 m represents roughly 3% of the average value. We deem this error acceptable considering the use of a single monocular camera.

Table 2 Test results for the Ruđer Bošković Research Vessel (waterline length of 8 m and total length of 9.3 m).

Another validation of the size and velocity measurements was performed over incidences involving vessels carrying AIS. To this end, we filtered a large dataset of 65,000 AIS recorded at a low frame rate of 5 min within the channel. That resulted in only 9 recordings that aligned with the camera frame allowing us to match the AIS data with the camera data accurately. This low number of vessels emphasizes the low rate of detection available for shipping URN measurements using AIS data only. The true and estimated size and speed of these 9 vessels are listed in Table 3. An average error of roughly 7% in both length and speed is observed. Part of this error is due to the rounding operation of the AIS.

Table 3 Test results for length and speed estimations for 9 vessels identified by both camera and AIS.

Results of Shipping Noise Estimation

Of the thousands of vessel transits recorded by the visual algorithm, 1148 passed quality control (see Quality Control Section) and were processed to extract their acoustic characteristics, i.e., sound pressure level and 1/3 octave spectrum. Each vessel transit was labeled according to its type, the time and date of the transit, as well as the vessel hull name and IMO number, where applicable. The hull name and IMO no. were associated to a vessel transit in cases where the hull name was visible via the camera footage. The length and speed of each transit were associated according to the calculations made by the visual algorithm. Table 4 presents an overview of the analyzed vessels’ characteristics, i.e., class, number of observations, speed, length, and the calculated sound pressure levels at the source (SL-SPL). The largest vessel observed in the dataset was a 54-meter mega-yacht named “Premier” (IMO 9949132), while the smallest is a 3.3-meter motorboat. The fastest vessel analyzed is a motorboat sailing at 15.9 meters per second (m/sec), i.e., 31 knots. The slowest vessel is a sailboat sailing at 1.1 m/sec. The loudest vessel observed was the Postira ferry (IMO 6283202) sailing at 6.5 m/sec, which presented a calculated source level of 188.26 dB, and the quietest vessel was a sailboat sailing at 2.4 m/sec which presented a calculated source level of 151.91 dB.

Table 4 Characteristics of the Vessel Dataset (mean / Std. Dev).

Usage Notes

In addition to the dataset file, the data is available at a front end called “Hear my ship” (https://hearmyship.fer.hr/). The web tool allows separation by vessel’s type, speed, length and noise, visualization of the data (both optics and acoustics, data download and comparison between the URNs of different vessels.

Overall, 1148 vessel transits were evaluated. As mentioned in the introduction Section, URN calculations for small vessels in the coastal area have been mostly absent from the current research agenda, primarily due to the lack of any open-source information systems that broadcast vessel data (e.g., AIS found on larger commercial vessels). This, in turn, creates inherent difficulties in collecting vessel parameters, e.g., length and speed, which are crucial parameters for URN calculations. In this sense, the novel methodology implemented in this research framework enabled the formation of what can be considered the most comprehensive dataset to date, specifically focused on small to medium vessels.

In making our database of URN of VOO, our biggest concern was the averaging window around the CPA for URN estimation. The size of this window should be long enough to suppress noise but also short enough to avoid differences in the received power due to the changing distances between the receiver and the moving vessel. In our database, we have followed the method recommended in ISO 1720844 that manages this trade-off by setting the observation window as the time-frame in which the vessel passes the  ± 30° aspect tangent to the hydrophone. However, for some of the fast motorboats sailing at >10 m/sec, this resulted in relatively short time-window lengths (~4 s). A possible way to address this challenge is a statistical measure to test the stability of the acoustic intensity around the CPA and to determine the size of the observation time accordingly.

Another challenge encountered is the extensive traffic of vessels within the channel where the recordings took place. To avoid any mutual interference caused by vessels transiting the channel in close proximity, we placed a strict quality control process (see Section below) and dismissed cases identified by the video footage or acoustics of vessels transiting the channel in close proximity. As a result, roughly 90% of the vessel transits did not pass quality control. The case of too close vessels also poses a challenge for monitoring the URN of VOO in realistic conditions of near port measurements.