A High-Quality Underwater Acoustic Dataset for Algorithm Development and Analysis

Lobo, Victor; Pessanha Santos, Nuno; Moura, Ricardo

doi:10.1038/s41597-025-05564-x

Data Descriptor
Open access
Published: 30 July 2025

Scientific Data volume 12, Article number: 1323 (2025)

Abstract

As data becomes increasingly available, relying on quality datasets for algorithm analysis and development is essential. However, data gathering can be expensive and time-consuming, and this process must be optimized to allow others to reuse data with simplicity and accuracy. The Wolfset is an acoustic dataset gathered using a Bruel & Kjaer type 8104 hydrophone in an anechoic tank usually used for ships’ sonar calibration. The name Wolfset is inspired by the Seawolf submarine class, renowned for its advanced sound source detection and classification capabilities. Using an anechoic tank, we can obtain a high-quality dataset representing acoustic sources without undesired external perturbations. Several outboard motors and an electric motor from a basic remotely controlled ship model were used as sound sources, usually called targets, under many operating conditions. External transients and noise sources were then added to bring the dataset closer to the sounds present in real-world conditions. The dataset follows a systematic approach to provide the diversity and accuracy needed for effective algorithm development.

Background & Summary

Data, information, and knowledge are essential concepts in our daily lives¹. Data is critical because its processing and analysis ultimately yield information and knowledge, which can bring advantages depending on the field of action. Nowadays, data analysis is mainly performed using Machine Learning (ML) and Artificial Intelligence (AI) and can be applied to supply chain management², power electronics³, healthcare⁴, engineering design⁵, and underwater acoustics⁶, among many other fields of science and application. With the expansion of data science⁷, more data is continuously becoming available, introducing new problems such as dataset standardization⁸ that must be appropriately addressed. Data standardization involves converting data into a common format to facilitate third-party analysis and processing, increasing interoperability^9,10.

Sound waves travel by vibrating particles, and their speed and range depend on the elasticity of the propagation environment¹¹. Underwater acoustics is the branch of science that studies how sound waves propagate in the dense and elastic water environment and how they interact with it^12,13. In general, oceans and rivers contain many sources of noise that can be classified as either anthropogenic¹⁴ or natural¹⁵. Anthropogenic sounds originate from sources such as ships and coastal zone operations, while natural sounds come from animals, rain, sea movement, and other sources. A dataset of anthropogenic sounds that supports the development of new solutions and algorithms in underwater acoustics research is valuable, since acquiring such data is usually expensive and time-consuming.

In all navies worldwide, submariners share the same motto: There are only two types of ships, submarines and targets. To obtain operational advantage and be able to call others targets, it is essential to capture and classify underwater sound sources correctly, since it is necessary for many tasks, including the task of standard surveillance¹³. Coastal monitoring systems typically rely on a Vessel Traffic System (VTS) equipped with radar, electro-optical capabilities, and the ability to receive data from the Automatic Identification System (AIS)¹⁶. However, heavy maritime traffic areas pose a challenge as small vessels and submarines tend to evade surveillance¹⁷, which is a significant concern, especially with the increasing use of submarines due to armed conflicts in Eastern Europe¹⁸.

Some currently available underwater acoustic datasets^19,20 contain data acquired directly from the sea, but that uncontrolled environment is subject to undesired noise and perturbations. Acquiring acoustic data in controlled and well-defined scenarios, such as anechoic tanks that are used for ships’ sonar calibration and lined with absorbent plates made of cork agglomerates and rubber to minimize sound reflections, can be expensive (Fig. 1). It is therefore essential to ensure that the data gathered in those scenarios can be successfully reused. These ideal test conditions ensure a high-quality dataset representing the sound sources without undesired external perturbations. Hydrophonic effects are an essential aspect of underwater acoustics research, and their collection and analysis can be challenging. To aid this endeavor, we have gathered an unclassified dataset of hydrophonic effects named Wolfset²¹. Usually, this kind of dataset is classified for security purposes and gathered at sea, where several undesired noise sources are present. The name Wolfset is inspired by the Seawolf submarine class, renowned for its advanced sound source detection and classification capabilities. This dataset can benefit various research projects and support the development of techniques that automatically identify and classify these effects, since the hydrophonic effects present in the dataset closely resemble real effects observed at sea. Using real rather than synthetic data also increases the dataset's value, since it supports the development of algorithms that perform well in real-world applications. Its size and content also help provide statistical significance and the required diversity. We have acquired and pre-processed a quality dataset representing outboard motors and a basic remotely controlled ship model as targets, which can be easily used to perform, e.g., classification^22,23,24. Adding external transient effects and noise brings the dataset closer to reality, which is desirable during algorithm design and analysis. Since the dataset contains samples both with and without manually added noise, it is also possible to add different types of noise during preprocessing to better mimic the desired application.
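As a minimal sketch of the preprocessing idea above, a noise-free target recording can be combined with a noise-only recording at a chosen signal-to-noise ratio (SNR). The function names, the use of plain Python lists of samples, and the SNR definition based on mean power are assumptions made for illustration; they are not part of the dataset or of the authors' processing.

```python
import math


def mean_power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in samples) / len(samples)


def mix_at_snr(target, noise, snr_db):
    """Add noise to target, scaled so that the mix has the requested SNR in dB.

    Both inputs are truncated to the shorter length. Returns the mixed signal
    and the scaled noise that was added.
    """
    n = min(len(target), len(noise))
    target, noise = target[:n], noise[:n]
    p_target, p_noise = mean_power(target), mean_power(noise)
    if p_target == 0 or p_noise == 0:
        raise ValueError("target and noise must both have non-zero power")
    # Solve 10*log10(p_target / (gain**2 * p_noise)) = snr_db for gain.
    gain = math.sqrt(p_target / (p_noise * 10 ** (snr_db / 10)))
    scaled = [gain * v for v in noise]
    mixed = [t + s for t, s in zip(target, scaled)]
    return mixed, scaled
```

For example, the samples of a target-only file could be passed as `target` and those of a noise-only file as `noise`; the result may need rescaling before being written back to 16-bit audio to avoid clipping.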

The final dataset²¹ consists of about 1.5 gigabytes of data, encompassing 5 hours of recordings in Waveform Audio File Format (WAVE). A simplified diagram describing all the steps involved in the dataset creation is illustrated in Fig. 2. The data acquisition was performed using a Bruel & Kjaer type 8104 hydrophone²⁵ followed by a two-stage adjustable gain signal amplifier Bruel & Kjaer 2636²⁶. The Data Logging stage used a standard computer with no specific computational requirements, and the Data Pre-Processing stage only entailed adjusting the file duration and content to ensure dataset uniformity and consistency. Finally, the Dataset Validation stage consisted of a technical validation to ensure the quality of the final dataset.

Methods

The accuracy and reusability of the final dataset depend heavily on the acquisition conditions. It is crucial to consider the conditions of the acquisition environment, including the anechoic tank, the used hydrophone and signal amplifier, and the accuracy of the data acquisition process. All hardware and data acquisition processes should be carefully selected and executed to ensure the usefulness and accuracy of the dataset. Since we are dealing with a costly and time-consuming acquisition process, we must ensure the reuse capability of the acquired dataset.

Anechoic Tank

The anechoic tank was constructed in 1976 at the Lisbon Naval Base and is primarily used for calibrating ships’ sonar; the Portuguese Navy still uses it regularly to calibrate its underwater sensors and systems. Its surface is covered with absorbent plates made of cork agglomerates and rubber, which help to reduce sound reflections. In addition, floating plates can be added or removed from the tank surface. The plates have a density of approximately ρ ≅ 0.8 g/cm³. The tank is 8 meters long, 5 meters wide, and 5 meters deep. Its design includes a small auxiliary tank in one corner, separated from the main tank by a floodgate. Two movable bridges cross the tank from one end to the other, allowing better access to all the tank areas, as shown in Fig. 3.

Since the anechoic tank is in regular use, it undergoes periodic and corrective maintenance throughout the year to ensure the precision of the tests and of the sensor and system calibrations performed in it. The absorbent plates are replaced when they lose their properties, which preserves the test conditions needed to guarantee the accuracy of the gathered acoustic dataset²¹.

Target, Noise & Transient Sound Sources

As target sound sources, outboard motors were attached to the floodgate during the data acquisition, as shown in Fig. 4. An electric motor from a basic remotely controlled ship model was also used to improve the dataset diversity.

During the dataset acquisition process, one of the two existing movable bridges (Fig. 3) was used to suspend the hydrophone, and the other was used to suspend the compressed air hoses. These compressed air hoses were turned on when needed to generate bubbles as noise, so that the acquired dataset²¹ would contain sounds as close as possible to those expected in real conditions. A water pumping system with a discharge located approximately 15 cm above the surface was also used to add background noise to the recordings. Some transients were also created using, e.g., metallic bars or shots from an air rifle. To place the motors and all the necessary materials, a crane that supports up to 5 tons was used, as shown in Fig. 5.

Hydrophone & Signal Amplifier

A Bruel & Kjaer type 8104 hydrophone²⁵, commonly used in acoustic data applications^27,28,29, was employed during the dataset acquisition, located in the middle of the tank: 2.5 meters deep, 2.5 meters from the lateral walls, and 4 meters from both ends of the tank. This hydrophone is a passive omnidirectional device, 12 cm in length and 2 cm in diameter, weighing approximately 1.3 kg, as illustrated in Fig. 6 (left). The typical directivity pattern for this hydrophone model is shown in Fig. 6 (right), illustrating how the transfer function varies with the frequency and the location of the sound source relative to the hydrophone. This hydrophone can capture signals ranging from 0.1 Hz to 200 kHz and features a constant directivity pattern up to 20 kHz, as confirmed during calibration before dataset acquisition. The frequency response of the transducer is almost flat up to the sampling frequency we used.

During the data acquisition, a two-stage adjustable gain signal amplifier Bruel & Kjaer 2636²⁶ was used, and the gain used on each stage, in dB, is listed in the dataset annotation file²¹, as described in the Data Records section. During all the performed tests, the amplifier's programmable filters were configured as low-pass filters with a selected cutoff frequency of 22.4 kHz.

Data Logging

Following the amplifier, we used a high-quality Hewlett-Packard (HP) oscilloscope and a spectral analyzer to monitor the signals being measured. This setup allowed us to fine-tune the gain manually so that the important (stationary) signals used most of the dynamic range. However, because the gain was then kept fixed to mimic the expected operational conditions, loud transient noises sometimes saturate the signal.
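Because loud transients can saturate the fixed-gain recordings, it may be useful to quantify how much of a file sits at full scale before using it. The following is a minimal sketch using only the Python standard library; the function names and the `margin` parameter are hypothetical, and it assumes the files are 16-bit mono PCM as described in the text.

```python
import sys
import wave
from array import array

FULL_SCALE_MAX = 32767   # largest 16-bit signed sample value
FULL_SCALE_MIN = -32768  # smallest 16-bit signed sample value


def read_mono_pcm16(path):
    """Read a 16-bit mono PCM WAVE file; return (samples, sample_rate_hz)."""
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected a 16-bit mono PCM file")
        rate = wf.getframerate()
        samples = array("h")
        samples.frombytes(wf.readframes(wf.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAVE sample data is little-endian
    return samples, rate


def clipped_fraction(samples, margin=0):
    """Fraction of samples at (or within `margin` counts of) full scale."""
    if len(samples) == 0:
        return 0.0
    hi = FULL_SCALE_MAX - margin
    lo = FULL_SCALE_MIN + margin
    clipped = sum(1 for s in samples if s >= hi or s <= lo)
    return clipped / len(samples)
```

A non-zero fraction on a recording with transients would be consistent with the saturation described above; what threshold counts as acceptable depends on the intended application.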

We recorded the audio signal using a 16-bit sound card that is currently used for standard ship sonar calibration operations conducted at the anechoic tank. Laboratory tests revealed that the card’s transfer function was almost flat from 50 Hz to 20 kHz and that it could sample and quantize signals as low as 1 Hz. Since the same sound card was used for all recordings, all very low-frequency signals were affected in the same way. All recordings were made at a 44.1 kHz sampling rate in mono channel mode.
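The recording parameters stated above (16-bit samples, 44.1 kHz, mono) can be checked against the reported dataset size of about 1.5 gigabytes for about 5 hours of audio. The short calculation below is only a consistency check on uncompressed PCM data; it ignores file headers, and the 5-hour figure is the approximate total given in the text.

```python
SAMPLE_RATE_HZ = 44_100   # sampling rate stated in the text
BITS_PER_SAMPLE = 16      # sound card resolution stated in the text
CHANNELS = 1              # mono recordings
TOTAL_HOURS = 5           # approximate total duration stated in the text

bytes_per_second = SAMPLE_RATE_HZ * (BITS_PER_SAMPLE // 8) * CHANNELS
total_bytes = bytes_per_second * TOTAL_HOURS * 3600
nyquist_hz = SAMPLE_RATE_HZ / 2

print(bytes_per_second)              # 88200 bytes of audio per second
print(total_bytes / 1e9)             # 1.5876 (decimal gigabytes)
print(round(total_bytes / 2**30, 2)) # 1.48 (binary gibibytes)
print(nyquist_hz)                    # 22050.0 Hz
```

Both ways of counting gigabytes round to roughly 1.5, in line with the reported size. The Nyquist frequency of 22.05 kHz is also worth keeping in mind when interpreting content near the upper end of the spectrum.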

Data Acquisition

During the data acquisition, we used as targets an electric motor from a basic remotely controlled ship model (Fig. 7) and four different outboard Mercury motors³⁰ (Fig. 8) with the following characteristics: (i) a 4.5 horsepower motor with a right pitch propeller having three blades, (ii) an 18 horsepower motor with a right pitch propeller having three blades, (iii) an 8 horsepower motor with a right pitch propeller having three blades, and (iv) a 3.6 horsepower motor with a right pitch propeller having three rubber blades.

To create some transients and background noise, we used metallic bars (Fig. 9), compressed air (Fig. 10), a water bucket (Fig. 11, left), and shots from an air rifle (Fig. 11, right). All the sound sources and the respective added noise and transients are described in the Data Records section.

The targets were chosen based on their prevalence in small vessels, including four combustion motors and one electric motor, to ensure variability in acoustic signatures, which is crucial for developing and testing algorithms. Additionally, the manually added noise and transients provide authentic sound sources reflective of typical operational interferences in real-world underwater systems, rather than relying solely on simulated data. This approach ensures diverse acoustic profiles, featuring distinct frequency bands and transient characteristics. The quality of the dataset, derived from all the adopted acquisition procedures, ensures its suitability for reuse in future studies.

Data Records

The dataset²¹ comprises approximately 1.5 gigabytes and includes 168 WAVE audio format files, totaling about 5 hours of audio recordings. It also contains two annotation files named Wolfset_Index.xlsx and Wolfset_Index.csv, each providing a summary of the contents of the respective WAVE files. The file names indicate the content of the corresponding recordings according to the following template XxxxxxTttNnn.WAV, with respective code given by:

X - It can take two values: (i) A - which will correspond to a regular recording, and (ii) E - which will correspond to a recording where some error or occurrence happened during the test, which will be described in the annotation file;
xxxxx - Corresponds to the intensity code of each target. The first digit refers to motor 1 (4.5 horsepower), the second to motor 2 (18 horsepower), the third to motor 3 (8 horsepower), the fourth to motor 4 (3.6 horsepower), and the fifth to motor 5 (electric motor model). For motors 1 to 4, the annotation can take one of the following values:
- Value 0 - Absent (or disconnected);
- Value 1 - Idle disengaged;
- Value 2 - Idle engaged forward;
- Value 3 - Slow forward;
- Value 4 - Medium forward;
- Value 5 - Varying (15 seconds out of gear with small accelerations, then accelerations with the propeller engaged).
For motor 5, since it is a remotely controlled electric ship model, the annotation can take one of the following values:
- Value 0 - Absent (or disconnected);
- Value 1 - Slow forward;
- Value 2 - Fast forward;
- Value 3 - Slow reverse;
- Value 4 - Fast reverse.
T - It is the initial for Transient;
tt - The transient codes form a number that is the sum of the weights of the different effects, according to:
- Decimal 0 - Without any transients;
- Decimal 1 - Compressed air discharges;
- Decimal 2 - Water bucket discharge;
- Decimal 4 - Hitting the metallic tube with a mallet;
- Decimal 8 - Hitting the metallic tube with a hammer;
- Decimal 16 - Air rifle shot.
N - It is the initial for Noise;
nn - The noise codes form a number that is the sum of the weights of the different effects, according to:
- Decimal 0 - Without noise;
- Decimal 1 - Compressed air bubbling very intensely (tap fully open);
- Decimal 2 - Compressed air bubbling at low intensity (tap open at 1/4);
- Decimal 4 - Water hose with low flow (tap open with just one turn);
- Decimal 8 - Water hose with a lot of flow (tap fully open).
General Notes:
1. If there are multiple recordings under the same conditions, they will have an additional suffix denoted by xNN, where NN represents the recording number;
2. For instance, if we have a recording labeled A01020T00N05.WAV, we have:
  - A01020 - Recording includes motor 2 at Idle disengaged and motor 4 at Idle engaged forward;
  - T00 - The recording does not present any transient fluctuations;
  - N05 - Recording including noise from Compressed air bubbling very intensely and Water hose with low flow.
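The naming convention above can be decoded programmatically. The following is a minimal sketch of such a parser; the function names, the returned dictionary layout, and the shortened label strings are hypothetical. It also assumes the repetition suffix is written as `x` followed by two digits, and accepts the `×` character that appears in some renderings of the file names.

```python
import re

# X, five motor digits, T + two digits, N + two digits, optional xNN suffix.
FILENAME_RE = re.compile(
    r"^([AE])(\d{5})T(\d{2})N(\d{2})(?:[x×](\d{2}))?\.WAV$", re.IGNORECASE
)

COMBUSTION_STATES = {  # motors 1 to 4
    0: "Absent", 1: "Idle disengaged", 2: "Idle engaged forward",
    3: "Slow forward", 4: "Medium forward", 5: "Varying",
}
ELECTRIC_STATES = {  # motor 5
    0: "Absent", 1: "Slow forward", 2: "Fast forward",
    3: "Slow reverse", 4: "Fast reverse",
}
TRANSIENT_FLAGS = {
    1: "Compressed air discharges", 2: "Water bucket discharge",
    4: "Metallic tube hit with a mallet", 8: "Metallic tube hit with a hammer",
    16: "Air rifle shot",
}
NOISE_FLAGS = {
    1: "Compressed air bubbling very intensely",
    2: "Compressed air bubbling at low intensity",
    4: "Water hose with low flow", 8: "Water hose with a lot of flow",
}


def decode_flags(value, table):
    """Split a summed code into the individual effects it contains."""
    return [name for weight, name in table.items() if value & weight]


def parse_name(filename):
    """Decode a Wolfset-style file name into a dictionary of annotations."""
    match = FILENAME_RE.match(filename)
    if match is None:
        raise ValueError(f"unrecognized file name: {filename!r}")
    kind, motors, tt, nn, rep = match.groups()
    states = {}
    for number, digit in enumerate(motors, start=1):
        table = ELECTRIC_STATES if number == 5 else COMBUSTION_STATES
        if int(digit) not in table:
            raise ValueError(f"invalid state {digit} for motor {number}")
        states[number] = table[int(digit)]
    return {
        "kind": "regular" if kind.upper() == "A" else "error/occurrence",
        "motors": states,
        "transients": decode_flags(int(tt), TRANSIENT_FLAGS),
        "noise": decode_flags(int(nn), NOISE_FLAGS),
        "repetition": int(rep) if rep is not None else None,
    }
```

Applied to the worked example, `parse_name("A01020T00N05.WAV")` reports motor 2 at Idle disengaged, motor 4 at Idle engaged forward, no transients, and the two noise sources with weights 1 and 4.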

A summary of the dataset filename structure and annotation codes is presented in Table 1. The dataset²¹ includes a diverse collection of recordings categorized by noise, transients, targets, and combinations. Table 2 details the subset containing only noise, encompassing four distinct noise types and one combined scenario, resulting in 34.13 minutes of dedicated noise recordings.

Table 1 Summary of the dataset filename structure and annotation codes.

Table 2 Summary of the dataset recordings containing only noise.

Another essential component of the dataset²¹ is the inclusion of transient acoustic events, which enhance its realism by reflecting common occurrences in real underwater environments. Table 3 summarizes the recordings containing only transients, comprising five distinct types of sounds. Among these, the air rifle shot was recorded 32 times in 10-second segments. In contrast, the remaining four transient types (compressed air discharges, water bucket discharges, and impacts with a mallet or a hammer) were each recorded over five-minute sessions with repeated events, offering a diverse set of non-stationary acoustic signatures.

Table 3 Summary of the dataset recordings containing only transients.

As stated, four combustion motors were employed to simulate the acoustic signatures typically associated with small vessels. The dataset includes recordings for motors 1 through 4, each captured under five defined intensity levels, initially without any added noise or transient interference. To further enrich the dataset, additional recordings were conducted with each motor operating at the Idle engaged forward and Medium forward states, incorporating controlled noise and transient events; these transients were repeated six times to simulate realistic disturbances. A detailed summary of all recordings performed with these motors is presented in Table 4, with total durations of 54 minutes for motor 1, 52 minutes for motor 2, 57 minutes for motor 3, and 45.8 minutes for motor 4.

Table 4 Summary of the recordings performed with motors 1 to 4 under various operating conditions.

Motor 5, an electric unit representative of low-noise propulsion systems used in small-scale platforms, was recorded at its four predefined intensity levels. To capture specific behaviors, additional one-minute recordings were included featuring rudder-induced effects (one at Slow forward and two at Fast forward), resulting in a total of seven minutes of data, as detailed in Table 5. These comprehensive recordings, across varying motor types and conditions, contribute to the dataset’s robustness and applicability for underwater acoustic analysis and machine learning applications.

Table 5 Summary of the recordings performed with motor 5 - Electric.

These recordings cover various operational conditions and are further enriched by including controlled noise and transient events, as outlined in the corresponding tables. This combination enhances the dataset’s diversity and realism, making it well-suited for research in underwater acoustics and advanced signal processing techniques. A subset of the data also includes recordings where all motors operated simultaneously at the Idle engaged forward intensity, as shown in Table 6.

Table 6 Summary of the recordings performed combining the motors.

The dataset was developed under highly controlled conditions using an anechoic tank and a calibrated hydrophone, ensuring high-fidelity recordings suitable for precise acoustic analysis. It also features recordings of combined motor operations to simulate complex real-world scenarios. This includes all ten possible pairings of motors 1 through 5 and one scenario with three motors (1, 3, and 4) operating simultaneously. In the latter, motor 3 stopped running at the 20-second mark of the one-minute recording. For all combined recordings, motors 1 to 4 were generally set to Idle engaged forward, and motor 5 to Fast forward. Two exceptions were made: in the pairing of motors 2 and 4, motor 2 operated at Idle engaged forward while motor 4 ran at Medium forward, and in the combination of motors 4 and 5, motor 4 was set to Idle disengaged and motor 5 to Slow forward. These combined recordings do not introduce noise or transients, allowing for clean analysis of motor interaction sounds.

Technical Validation

This section further validates the dataset’s quality by analyzing only the active motor states. Fig. 12 illustrates each motor’s total duration in seconds across intensity levels 1 to 5, excluding the inactive state. The data reveals that intensity level 2 (idle engaged forward) dominates across most motors, since this operational mode was the primary focus during testing, as it reflects a common propulsion state in small vessels loitering in a certain area. Intensity levels 3 (slow forward) and 4 (medium forward) are also well represented, providing valuable examples of transitional and moderate thrust behavior. Including all defined active intensities contributes to a more comprehensive dataset, supporting robust development and validation of underwater acoustic target recognition models. Moreover, the differences in duration distributions among motors at the same intensity indicate a diverse and realistic collection of acoustic conditions, likely achieved through purposeful test variation and scenario-based data acquisition.

Additionally, the total duration of the dataset’s transient and noise recordings²¹ (excluding code 00) is illustrated in Fig. 13. The transient code chart reveals a well-balanced distribution across all transient types, including air discharges, water bucket drops, metallic impacts, and air rifle shots, each contributing similarly to the overall dataset duration. This uniformity indicates deliberate test design to cover various impulsive acoustic events adequately. In contrast, the noise code distribution displays a more heterogeneous profile, with codes 01 (intense air bubbling) and 08 (high-flow water hose) representing most of the noise-related duration. This suggests a particular emphasis on simulating turbulent or high-energy background conditions. These figures confirm that, although transients and noise comprise a smaller portion of the dataset, their inclusion is purposeful and sufficiently diverse to enhance its applicability in testing acoustic models under complex, real-world scenarios involving sudden events and dynamic background noise.

To complement the validation of the data, visualizing the Fast Fourier Transform (FFT)³¹ and the Spectrogram³² can assist in analyzing and drawing conclusions about the dataset’s quality and content by identifying dominant frequencies, assessing noise levels, detecting anomalies, and understanding the temporal evolution of frequency components, which are essential for evaluating the dataset’s consistency and integrity.

The recording A00500T00N00x01.WAV includes motor 3 operating at varying speeds without any additional noise source, whose spectrogram is illustrated in Fig. 14. Low-frequency bands showcase the motor’s primary rotational speed and its harmonics. Frequency shifts represent variations in speed, with brighter (red) regions signifying a high presence of specific frequencies in the analyzed recording.

When analyzing the recording A00500T00N00x01.WAV by viewing the FFT of the entire signal, it is possible to see a component at approximately 50 Hz originating from the electrical supply network, as illustrated in Fig. 15. By filtering the 50 Hz component and its harmonics up to 200 Hz with a standard second-order Infinite Impulse Response (IIR) notch filter, we eliminate this undesired signal component without compromising the rest of the spectrum. The FFT was computed using a Hanning (more precisely, Von Hann³³) window of N = 65,536 samples, yielding a frequency resolution of approximately 0.67 Hz, and the result is shown as the average over 40 time blocks. An FFT representation of the A00500T00N00x01.WAV recording using a decibel scale is shown in Fig. 16.
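The mains-hum removal described above can be sketched with a cascade of second-order IIR notch sections at 50, 100, 150, and 200 Hz. The text does not state the filter design or its quality factor, so the coefficient formulas below (the widely used audio "biquad" notch design) and the value `q=30.0` are assumptions, as are all function names. In practice a library routine such as `scipy.signal.iirnotch` would normally be used; the pure-Python version is given only to make the structure explicit.

```python
import math


def notch_coefficients(f0_hz, fs_hz, q):
    """Normalized (b, a) coefficients of a second-order notch at f0_hz."""
    w0 = 2.0 * math.pi * f0_hz / fs_hz
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (1.0 / a0, -2.0 * math.cos(w0) / a0, 1.0 / a0)
    a = (-2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)  # a1, a2 (a0 == 1)
    return b, a


def biquad(x, b, a):
    """Apply one second-order section (direct form I) to a sample sequence."""
    y = []
    x1 = x2 = y1 = y2 = 0.0
    for xn in x:
        yn = b[0] * xn + b[1] * x1 + b[2] * x2 - a[0] * y1 - a[1] * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y


def remove_mains(x, fs_hz, mains_hz=50.0, max_hz=200.0, q=30.0):
    """Notch out the mains frequency and its harmonics up to max_hz."""
    for k in range(1, int(max_hz // mains_hz) + 1):
        b, a = notch_coefficients(k * mains_hz, fs_hz, q)
        x = biquad(x, b, a)
    return x
```

With these assumed settings each notch is only a few hertz wide, so a nearby component such as the roughly 31 Hz motor line discussed later passes almost unchanged. The filter needs a fraction of a second to settle, so the very start of a filtered recording still contains some residual hum.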

The recording A00000T00N01x01.WAV contains only the noise caused by compressed air bubbling intensely, without any target, with the spectrogram illustrated in Fig. 17. A comparison of this spectrogram with the one shown in Fig. 14 reveals that it reflects a stable signal characterized by sustained low-frequency energy over an extended time frame. In contrast, the earlier spectrogram shows considerable fluctuations in frequency content over a shorter duration, illustrating the dynamic changes in the motor speed.

When analyzing the FFT of the recording A00000T00N01x01.WAV, as illustrated in Fig. 18, it is possible to observe a wider spread of signal energy across the lower frequencies, mainly caused by the noise source included in the recording. Since this is a deliberately added external noise source and not noise from the environment, its content can be used alone or, e.g., to isolate the target’s signal in other recordings or to test algorithm performance in the presence of noise. As with the previous recording, the FFT was computed using a Hanning window of N = 65,536 samples. Figure 19 shows the FFT of A00000T00N01x01.WAV represented on a decibel scale.
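The FFT parameters quoted in this section can be reproduced with a few lines of arithmetic. The sketch below computes the frequency resolution, the duration covered by 40 blocks, and the bins in which the 50 Hz and 31 Hz components discussed in this section would fall. It assumes the 40 blocks do not overlap and uses the symmetric form of the Hann window; neither detail is stated in the text, and the helper names are hypothetical.

```python
import math

FS_HZ = 44_100   # sampling rate of the recordings
N = 65_536       # window length in samples (2**16)
BLOCKS = 40      # number of averaged time blocks

resolution_hz = FS_HZ / N                 # spacing between FFT bins
block_seconds = N / FS_HZ                 # duration of one block
total_seconds = BLOCKS * block_seconds    # assumes non-overlapping blocks


def hann(n, length=N):
    """Symmetric Hann window weight for sample index n (0 <= n < length)."""
    return 0.5 - 0.5 * math.cos(2.0 * math.pi * n / (length - 1))


def bin_index(freq_hz):
    """Index of the FFT bin closest to freq_hz."""
    return round(freq_hz / resolution_hz)


print(round(resolution_hz, 3))   # 0.673
print(round(block_seconds, 3))   # 1.486
print(round(total_seconds, 2))   # 59.44
print(bin_index(50.0), bin_index(31.0))  # 74 46
```

Under the non-overlap assumption, 40 blocks span about 59.4 seconds, that is, essentially a full one-minute recording.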

The recording A30000T00N00x01.WAV includes motor 1 at a slow forward setting without any additional noise, with the spectrogram illustrated in Fig. 20. Here, we observe the presence of some lower frequencies and a spread into higher frequencies, although they remain very low.

When analyzing the FFT of the recording A30000T00N00x01.WAV, as illustrated in Fig. 21, it is possible to identify a component of approximately 31 Hz in the filtered recording, indicating the motor speed at slow forward. The same FFT parameters as previously mentioned were applied to A30000T00N00x01.WAV, and its spectrum, using a decibel scale, is represented in Fig. 22.

The performed analysis illustrates the quality and content of the Wolfset, a dataset covering a wide range of scenarios that can aid in successful algorithm design and testing. By collecting data in ideal conditions, free from interference by other sound sources, we can ensure high-quality data that can be used confidently to classify sound sources accurately.

Data-driven approaches are increasingly essential for advancing algorithm design, especially in acoustic signal processing. However, accessing accurate, well-structured, and freely available datasets remains challenging, as data acquisition is often resource-intensive. The Wolfset dataset addresses this gap by providing high-quality underwater acoustic recordings obtained through scientifically controlled procedures. Its structure supports various research applications, providing a reliable source for algorithm development and analysis. The intentional manual inclusion of noise and transient events enhances the dataset’s realism, making it more suitable for modeling complex real-world scenarios.

Usage Notes

This dataset²¹ can be readily used with deep learning approaches for a range of acoustic signal processing tasks, such as event monitoring³⁴, unsupervised classification³⁵, and event detection in noisy environments³⁶. Its structured annotations and inclusion of target sources and background interferences make it especially suitable for training supervised and unsupervised models under realistic underwater conditions. Furthermore, the dataset enables experimentation in anomaly detection, acoustic scene classification, and signal enhancement. Due to its standardized naming convention and controlled acquisition process, it can also serve as a robust benchmark for comparing algorithm performance. Researchers interested in exploring advanced machine learning techniques, such as self-supervised learning, domain adaptation, or few-shot learning, will find the dataset particularly useful for method development and validation.

Code availability

No dedicated scripts or custom code were used to generate or process the dataset.

References

Alavi, M. & Leidner, D. E. Knowledge management and knowledge management systems: Conceptual foundations and research issues. MIS quarterly 107–136, https://doi.org/10.2307/3250961 (2001).
Pournader, M., Ghaderi, H., Hassanzadegan, A. & Fahimnia, B. Artificial intelligence applications in supply chain management. International Journal of Production Economics 241, 108250, https://doi.org/10.1016/j.ijpe.2021.108250 (2021).
Article Google Scholar
Zhao, S., Blaabjerg, F. & Wang, H. An overview of artificial intelligence applications for power electronics. IEEE Transactions on Power Electronics 36, 4633–4658, https://doi.org/10.1109/TPEL.2020.3024914 (2020).
Article ADS Google Scholar
Bullock, J., Luccioni, A., Pham, K. H., Lam, C. S. N. & Luengo-Oroz, M. Mapping the landscape of artificial intelligence applications against covid-19. Journal of Artificial Intelligence Research 69, 807–845, https://doi.org/10.1613/jair.1.12162 (2020).
Article MathSciNet Google Scholar
Yüksel, N., Börklü, H. R., Sezer, H. K. & Canyurt, O. E. Review of artificial intelligence applications in engineering design perspective. Engineering Applications of Artificial Intelligence 118, 105697, https://doi.org/10.1016/j.engappai.2022.105697 (2023).
Article Google Scholar
Zhufeng, L., Xiaofang, L., Na, W. & Qingyang, Z. Present status and challenges of underwater acoustic target recognition technology: A review. Frontiers in Physics 10, 1044890, https://doi.org/10.3389/fphy.2022.1044890 (2022).
Article ADS Google Scholar
Kim, M., Zimmermann, T., DeLine, R. & Begel, A. Data scientists in software teams: State of the art and challenges. IEEE Transactions on Software Engineering 44, 1024–1038, https://doi.org/10.1109/TSE.2017.2754374 (2017).
Article Google Scholar
Pessanha Santos, N. The expansion of data science: Dataset standardization. Standards 3, 400–410, https://doi.org/10.3390/standards3040028 (2023).
Article Google Scholar
Villarreal, E. R. D., García-Alonso, J., Moguel, E. & Alegría, J. A. H. Blockchain for healthcare management systems: A survey on interoperability and security. IEEE Access 11, 5629–5652, https://doi.org/10.1109/ACCESS.2023.3236505 (2023).
Article Google Scholar
Hodapp, D. & Hanelt, A. Interoperability in the era of digital innovation: An information systems research agenda. Journal of Information Technology 37, 407–427, https://doi.org/10.1177/02683962211064304 (2022).
Article Google Scholar
Kinsler, L. E., Frey, A. R., Coppens, A. B. & Sanders, J. V. Fundamentals of acoustics (John wiley & sons, 2000).
Coates, R. F. & Coates, R. Underwater acoustic systems (Springer, 1990).
Dias, A. R., Pessanha Santos, N. & Lobo, V. Implementation of a passive acoustic barrier for surveillance. In OCEANS 2023 - Limerick, 1–6, https://doi.org/10.1109/OCEANSLimerick52467.2023.10244682 (2023).
Hildebrand, J. A. Impacts of anthropogenic sound. Marine mammal research: conservation beyond crisis 101–124, https://escholarship.org/uc/item/8997q8wj (2005).
Pollow, M., Behler, G. & Masiero, B. Measuring directivities of natural sound sources with a spherical microphone array. In Proceedings of the Ambisonics Symposium, 166–169 (2009).
Robards, M. et al. Conservation science and policy applications of the marine vessel automatic identification system (AIS)–a review. Bulletin of Marine Science 92, 75–103, https://doi.org/10.5343/bms.2015.1034 (2016).
Oder, T. The dimensions of Russian sea denial in the Baltic Sea. Center for International Maritime Security 1 (2018).
Frascà, D. & Galantini, L. The issue of submarine cable security. Towards a New European Security Architecture 51 (2023).
Bobbitt, A. M. & Nieukirk, S. A collection of sounds from the sea, https://oceanexplorer.noaa.gov/explorations/sound01/background/seasounds/seasounds.html, Accessed: 2025-04-05 (2001).
Felgate, N. Open access underwater acoustics data, https://acoustics.ac.uk/open-access-underwater-acoustics-data/, Accessed: 2025-04-05 (2023).
Lobo, V., Pessanha Santos, N. & Moura, R. Wolfset: A High-Quality Underwater Acoustic Dataset for Algorithm Development and Analysis, https://doi.org/10.6084/m9.figshare.25791978 (2025).
Lobo, V. Ship noise classification: A contribution to prototype-based classifier design (2002).
Oliveira, P. M., Lobo, V., Barroso, V. & Moura-Pires, F. Detection and classification of underwater transients with data driven methods based on time-frequency distributions and non-parametric classifiers. In OCEANS’02 MTS/IEEE, vol. 1, 12–16, https://doi.org/10.1109/OCEANS.2002.1193241 (IEEE, 2002).
Oliveira, P. M. & Barroso, V. Data driven underwater transient detection based on time-frequency distributions. In OCEANS 2000 MTS/IEEE Conference and Exhibition. Conference Proceedings (Cat. No. 00CH37158), vol. 2, 1059–1063, https://doi.org/10.1109/OCEANS.2000.881741 (IEEE, 2000).
Brüel & Kjær. Bruel & Kjaer - Transducers (2014).
Brüel & Kjær. Bruel & Kjaer measuring amplifiers wide range 2610 and 2636 (1988).
Amorim, M. C. P. et al. Painted gobies sing their quality out loud: acoustic rather than visual signals advertise male quality and contribute to mating success. Functional Ecology 27, 289–298, https://doi.org/10.1111/1365-2435.12032 (2013).
Bevelhimer, M. S., Deng, Z. D. & Scherelis, C. Characterizing large river sounds: Providing context for understanding the environmental effects of noise produced by hydrokinetic turbines. The Journal of the Acoustical Society of America 139, 85–92, https://doi.org/10.1121/1.4939120 (2016).
Vedenev, A. I., Kochetov, O. Y., Lunkov, A. A., Shurup, A. S. & Kassymbekova, S. S. Airborne and underwater noise produced by a hovercraft in the North Caspian region: Pressure and particle motion measurements. Journal of Marine Science and Engineering 11, 1079, https://doi.org/10.3390/jmse11051079 (2023).
Mercury Marine. Mercury Marine outboards, https://www.mercurymarine.com/us/en/engines/outboard (2014).
Nussbaumer, H. J. & Nussbaumer, H. J. The fast Fourier transform (Springer, 1982).
Kingsbury, B. E., Morgan, N. & Greenberg, S. Robust speech recognition using the modulation spectrogram. Speech communication 25, 117–132, https://doi.org/10.1016/S0167-6393(98)00032-6 (1998).
Kahlig, P. Some Aspects of Julius Von Hann’s Contribution to Modern Climatology, 1–7, https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/GM075p0001 (American Geophysical Union (AGU), 1993).
McQuay, C., Sattar, F. & Driessen, P. F. Deep learning for hydrophone big data. In 2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM), 1–6, https://doi.org/10.1109/PACRIM.2017.8121894 (IEEE, 2017).
Stinco, P., Tesei, A. & LePage, K. D. Unsupervised active sonar contact classification through anomaly detection. EURASIP Journal on Advances in Signal Processing 2023, 59, https://doi.org/10.1186/s13634-023-01016-z (2023).
Sattar, F., Driessen, P., Tzanetakis, G. & Page, W. A new event detection method for noisy hydrophone data. Applied Acoustics 159, 107056, https://doi.org/10.1016/j.apacoust.2019.107056 (2020).
Brüel & Kjær. Product data – Hydrophones – Types 8103, 8104, 8105 and 8106 (2019).

Acknowledgements

We want to express our gratitude to the team who worked on acquiring the dataset, especially Herculano Deusdado, António José Guerreiro Rêgo, José Miguel Reis Brás da Silva, José Pedro Ferreira da Costa, Victor Santos, Titio Matias, António Mendes, and all others who contributed to obtaining this dataset with such accuracy and content. Without this team’s assistance and help through all the phases, the dataset acquisition task would not have been possible. We would also like to thank Arsenal do Alfeite shipyard for providing the necessary infrastructure and personnel for the test development. This collaborative effort highlights the importance of strong partnerships and multidisciplinary teamwork in advancing research and developing practical, scientifically relevant resources. This work received support from the national project MArIA - Plataforma Integrada de desenvolvimento de modelos de Inteligência artificial para o mar, with grant number POCI-05-5762-FSE-000400. This work also received support from the European Union’s European Defence Fund (EDF) under project number 101110375, named FIBERMARS - FIBER optic technology for Maritime Awareness and ReSilience. The research carried out by Nuno Pessanha Santos was supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) under the projects - LA/P/0083/2020, UIDP/50009/2020, and UIDB/50009/2020 (DOI: 10.54499/LA/P/0083/2020, 10.54499/UIDP/50009/2020, and 10.54499/UIDB/50009/2020) - Laboratory of Robotics and Engineering Systems (LARSyS). Ricardo Moura received support under the projects UIDB/00297/2020 and UIDP/00297/2020 (DOI: 10.54499/UIDB/00297/2020 and 10.54499/UIDP/00297/2020) - Center for Mathematics and Applications. The research carried out by Victor Lobo was supported by national funds through FCT under the project - UIDB/04152/2020 (DOI: 10.54499/UIDB/04152/2020) - Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.

Author information

Authors and Affiliations

Portuguese Navy Research Center (CINAV), Portuguese Naval Academy (Escola Naval), Almada, 2810-001, Portugal
Victor Lobo, Nuno Pessanha Santos & Ricardo Moura
NOVA Information Management School (Nova IMS), Universidade Nova de Lisboa, Lisbon, 1070-312, Portugal
Victor Lobo
Portuguese Military Research Center (CINAMIL), Portuguese Military Academy (Academia Militar), Lisbon, 1169-203, Portugal
Nuno Pessanha Santos
Institute for Systems and Robotics (ISR), Instituto Superior Técnico (IST), Lisbon, 1049-001, Portugal
Nuno Pessanha Santos
Centro de Matemática e Aplicações (NovaMath), Universidade Nova de Lisboa, 2829-516, Caparica, Portugal
Ricardo Moura

Authors

Victor Lobo
Nuno Pessanha Santos
Ricardo Moura

Contributions

Conceptualization, V.L.; methodology, V.L., N.P.S., and R.M.; software, V.L. and N.P.S.; validation, V.L. and N.P.S.; formal analysis, V.L., N.P.S., and R.M.; investigation, V.L., N.P.S., and R.M.; resources, V.L., N.P.S., and R.M.; data curation, V.L.; writing—original draft preparation, N.P.S. and R.M.; writing—review and editing, V.L., N.P.S., and R.M.; visualization, N.P.S. and R.M.; supervision, V.L., N.P.S., and R.M.; project administration, V.L., N.P.S., and R.M.; funding acquisition, V.L., N.P.S., and R.M. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Nuno Pessanha Santos.

Ethics declarations

Competing interests

The authors have no conflicts of interest to declare. All authors have seen and agree with the manuscript’s contents, and there is no financial interest to report.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Lobo, V., Pessanha Santos, N. & Moura, R. A High-Quality Underwater Acoustic Dataset for Algorithm Development and Analysis. Sci Data 12, 1323 (2025). https://doi.org/10.1038/s41597-025-05564-x

Received: 10 April 2025
Accepted: 07 July 2025
Published: 30 July 2025
DOI: https://doi.org/10.1038/s41597-025-05564-x