Introduction

In worldwide, over 800 species of fish and mammals from 109 families are identified to be soniferous1, although this is likely to be an underestimate. Over 150 species of these are found in the northwest Atlantic2. Amongst the soniferous fish and mammals, some of the most abundant and vital commercial species are codfish, drum fish, grunts, groupers, snappers, jacks, catfish3, hambuck wheals, dolphins, etc.

Utilizing the sounds of fish and mammals is recognized as passive acoustics which has been used for over 50 years in fish biology and fisheries surveys2. In fact, it is used routinely to delineate and monitor spawning areas, and study the behavior of fish and mammals4,5. The researchers use hydrophone to listen this sound from fish and mammals to identify species6,7 and estimate their population size5. They use signal processing and spectral analysis computer algorithms to perform their tasks for this purpose5,6. Generally, passive acoustic monitoring is a technique which is distinguished from other types of bioacoustics since it utilizes naturally occurring sounds to gather information on fish and mammals, rather than using artificially generated sounds. It offers a number of benefits to survey fishery population in a certain marine area including being a method of non-optically observing fish activity and distribution, being a non-invasive and non-destructive observational tool, providing the capability of continuous or long-term monitoring as well as remote monitoring, etc6. In addition, investigators use passive acoustics to monitor the sources of noise pollution, and to study the impact of man's activities on marine communities6.

A straightforward, cross-correlation based population estimation technique was proposed in Refs.8,9, was a passive acoustic monitoring technique, can solve some major drawbacks of conventional fish population estimation techniques. Researchers investigated different practical impacts which were associated with this technique. Hossain and Hossen have investigated the impact of increasing number of acoustic sensors, i.e., increasing number of cross-correlation function (CCF) in Ref.10. Hossain and Hossen also have shown the effect of underwater bandwidth and SNR on this technique in Ref.11. Impact of dispersion coefficient on this technique was investigated in Ref.12. Similarly, impact of different distribution on estimation was demonstrated in Ref.13. Hossain and Hossen have investigated the procedure of calculating statistical error of this technique illustrated in Ref.14. However, in the former researches, all the researchers have considered equal distance among the acoustic sensors. This is sometimes impossible in some practical cases because it is somewhat troublesome to put the sensors at the desired locations in the randomly distributed fish and mammals. So, the constraint of equal spacing among sensors makes this technique difficult to implement practically, which is recognized as a limitation of this technique. With an aim to overcome this limitation, we started our investigation. In this paper, we have considered three acoustic sensors, where the sensors maintain unequal distances. Three acoustic sensors can be organized by two types of topologies, i.e., acoustic sensors in line (ASL) scheme and acoustic sensors in a triangle (AST) scheme9. We have worked with both schemes to investigate the impact of unequal distances among the acoustic sensors. From diverse types of fish sounds, we have considered chirp sound and use its frequency for simulations. This type of sound is very common in damselfish (Dascyllus aruanus), humpback whales (Megaptera novaeangliae), dugongs (Dugong dugon), etc., species. Firstly, we have worked to establish a theoretical impact and then we have verified the theory by simulation. In this research, MATLAB R2010a was used as our simulation tool.

Background

In this section, the methodology of cross-correlation based population estimation technique9,10,15 is deliberated. We have considered three acoustic sensors array in this study. This type of array can be either ASL form or AST form as in Ref.10. In this section, first, we will describe the methodology of ASL scheme and then AST scheme. As we have considered chirp sound generating fish and mammals in this study, this section will describe the estimation process with respect to this chirp signal.

Let us consider a three-dimensional spherical area containing N fish and mammals and three acoustic sensors. In this area, the fish and mammals are evenly distributed over the whole region. The fish and mammals are the sources of chirp signals and the acoustic sensors are the receivers.

In the ASL scheme, the three equally separated acoustic sensors (H1, H2 and H3) stay in a straight line at the center of the sphere as shown in Fig. 1. The distances among the acoustic sensors are such that, dDBS12 (distance between H1 and H2) = dDBS23 (distance between H2 and H3) = dDBS (distance between the equidistant pair of acoustic sensors).

Figure 1
figure 1

A uniform random distribution of fish and mammals with three equally spaced acoustic sensors using ASL scheme. Here, the axes units are in meter. We used MATLAB R2010a to achieve this figure.

Now, if chirp signal is produced by a fish or mammal, it will be recorded by three sensors with the corresponding time delays and attenuations. Henceforth, we can express the CCFs of the chirp signals received at each pair of sensors individually by a delta function16,17. For each fish or mammal, we will find a delta function after cross-correlation. These delta functions inhabit any position in the sample space of the corresponding pair of acoustic sensors. Now, we can consider each delta function as a ball and the samples between the corresponding pair of acoustic sensors as the bins into which the balls may fall16,17.

We will define the number of bins, b (as shown in Fig. 2) as twice the number of samples between the acoustic sensors, minus one as16,17:

$$b = \frac{{2 \times d_{DBS} \times S_{R} }}{{S_{P} }} - 1.$$
(1)
Figure 2
figure 2

Bins, b in the cross-correlation function. We used MATLAB R2010a to achieve this figure.

Here, SR is the sampling rate and SP is the speed of propagation.

R of CCF is defined as the ratio of standard deviation (σ) to the mean (µ) is chosen as the estimation parameter of this process, as it requires no prior knowledge of the signal strength16. Two estimation parameters, R12 and R23 are derived from two CCFs, C12(τ) and C23(τ), respectively (where, τ is the time delay), to calculate the final estimation parameter of ASL scheme10. Then, the final estimation parameter, \(R_{Equal:ASL}^{2CCF}\) of ASL scheme is obtained by taking the average of R12 and R23, and can be expressed as10:

$$R_{Equal:ASL}^{2CCF} = \frac{{R_{12} + R_{23} }}{2} = \frac{{\frac{{\sigma_{12} }}{{\mu_{12} }} + \frac{{\sigma_{23} }}{{\mu_{23} }}}}{2},$$
(2)

where, σ12 and σ23 represent standard deviation and µ12 and µ23 represent the mean of the CCFs, i.e., C12(τ) and C23(τ), respectively.

It is very complex to calculate the R of CCF using mathematical expressions. So, the cross- correlation related problem is reframed into a probability problem using the well-known occupancy problem16. After reframing, R of CCF can be written as16:

$$R = \frac{\sigma }{\mu } = \frac{{\sqrt {N \times \frac{1}{b} \times (1 - \frac{1}{b})} }}{\frac{N}{b}} = \sqrt {\frac{b - 1}{N}} .$$
(3)

Hence, Eq. (2) can be expressed as10:

$$R_{Equal:ASL}^{2CCF} = \frac{{R_{12} + R_{23} }}{2} = \frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} }}{2}.$$
(4)

Here, b12 and b23 denote the number of bins of the CCFs, C12(τ) and C23(τ), respectively. In this case, b12 = b23 = b according to Eq. (1); as the values of SR and SP are constant during the estimation process and dDBS12 = dDBS23 = dDBS. Hence, Eq. (4) becomes

$$R_{Equal:ASL}^{2CCF} = \frac{{R_{12} + R_{23} }}{2} = \sqrt {\frac{b - 1}{N}} .$$
(5)

Using Eq. (5), we can estimate N, as b is known from Eq. (1) and \(R_{Equal:ASL}^{2CCF}\) can be calculated from the CCFs.

In AST scheme, the three sensors (H1, H2 and H3) form an equilateral triangle for estimation purpose, where the centroid of that triangle stays at the center of the sphere as shown in Fig. 3. So, the distances among the acoustic sensors are such that, dDBS12 (between H1 and H2) = dDBS23 (between H2 and H3) = dDBS31 (between H3 and H1) = dDBS (between the equidistant pair of acoustic sensors).

Figure 3
figure 3

A distribution of fish and mammals with three equally spaced acoustic sensors for AST scheme. Here, the axes units are in meter. We used MATLAB R2010a to achieve this figure.

In the AST scheme, three CCFs, C12(τ), C23(τ), and C31(τ) are used for estimation10. The estimation parameters, R12, R23, and R31 are derived from three CCFs, i.e., C12(τ), C23(τ), and C31(τ), respectively, to calculate the ultimate estimation parameter of the AST scheme. Then, the estimation parameter \(R_{Equal:AST}^{3CCF}\) of the AST scheme is obtained by averaging the R12, R23, and R31, and can be expressed as10:

$$R_{Equal:AST}^{3CCF} = \frac{{R_{12} + R_{23} + R_{31} }}{3} = \frac{{\frac{{\sigma_{12} }}{{\mu_{12} }} + \frac{{\sigma_{23} }}{{\mu_{23} }} + \frac{{\sigma_{31} }}{{\mu_{31} }}}}{3},$$
(6)

where, σ12, σ23, and σ31 signify the standard deviation and µ12, µ23, and µ31 signify the mean of the CCFs, i.e., C12(τ), C23(τ), and C31(τ), respectively.

Similar to Eq. (4), we can write

$$R_{Equal:AST}^{3CCF} = \frac{{R_{12} + R_{23} + R_{31} }}{3} = \frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} + \sqrt {\frac{{b_{31} - 1}}{N}} }}{3}.$$
(7)

Here, b12, b23, and b31 represent the number of bins of the CCFs, i.e., C12(τ), C23(τ), and C31(τ), respectively. Since the acoustic sensors are equidistant pair and b12 = b23 = b31, we can rewrite the Eq. (7) as10

$$R_{Equal:AST}^{3CCF} = \frac{{R_{12} + R_{23} + R_{31} }}{3} = \sqrt {\frac{b - 1}{N}} .$$
(8)

Impact of unequal distances among acoustic sensors

The previous section showed the estimation process with equal distances among the acoustic sensors. This section will investigate a theoretical and simulated impact of unequal distance among the acoustic sensors on population estimation. At first, we will investigate the theoretical approach and then simulated approach to justify the theory.

Here, from diverse types of fish sounds, we have considered chirp sound which is usually generated by damselfish (Dascyllus aruanus)18, humpback whales (Megaptera novaeangliae)19, dugongs (Dugong dugon)20, etc., species. A sound analysis of Plectroglyphidodon lacrymatus and Dascyllus aruanus species of damselfish (family pomacentridae) demonstrated that their generated chirp sounds consisted of trains of 12–42 short pulses of three to six cycles, with a durations from 0·6 to 1·27 ms; and the peak frequency varied from 3400 to 4100 Hz illustrated in Ref.21. The expression of this signal is found in Refs.9,10,15 as:

$$X(t) = A\cos \left[ {\left\{ {2\pi \left( {\frac{{(f_{2} - f_{1} )t^{2} }}{2d} + f_{1} t} \right)} \right\} + P} \right],$$
(9)

where, f1 is the starting frequency in Hz, f2 is the ending frequency in Hz, d is the duration in second, P is the starting phase, and A is the amplitude.

Figure 4 shows simulated form of chirp signal which represents a simple form of chirp with duration of 1 s.

Figure 4
figure 4

Chirp signal from simulation. We used MATLAB R2010a to achieve this figure.

Theoretical impact

In this subsection, we will describe the theoretical impact of unequal distances among the acoustic sensors on cross-correlation based population estimation technique. Firstly, we will investigate the theory for the ASL and AST schemes.

Three unequally spaced sensors denoted by H1, H2, and H3 are considered along a line for estimation. Here, the middle sensor H2 is placed at the center of the estimation area. Unequal distances signify that dDBS12 ≠ dDBS23 ≠ dDBS31. However, the process is analogous to the estimation process described in “Background”.

Therefore, we can write

$$R_{Unequal:ASL}^{2CCF} = \frac{{R_{12} + R_{23} }}{2} = \frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} }}{2}.$$
(10)

Now, b12 and b23 can be expressed using Eq. (1) as:

$$b_{12} = \frac{{2 \times d_{DBS12} \times S_{R} }}{{S_{P} }} - 1.$$
(11)

And,

$$b_{23} = \frac{{2 \times d_{DBS23} \times S_{R} }}{{S_{P} }} - 1.$$
(12)

It is not possible to simplify Eq. (10) because b12 ≠ b23 and SR and SP are fixed. Hence, Eq. (10) will be

$$N = \left( {\frac{{\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} }}{{2 \times R_{Unequal:ASL}^{2CCF} }}} \right)^{2} .$$
(13)

Now, to establish a relationship between equal and unequal distances among the acoustic sensors, if we consider the ratio \(\frac{{R_{Equal:ASL}^{2CCF} }}{{R_{Unequal:ASL}^{2CCF} }}\), i.e.,\(\frac{{\sqrt {\frac{b - 1}{N}} }}{{\frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} }}{2}}}\), equals 1, we find

$$\frac{{2 \times \sqrt {b - 1} }}{{\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} }} = 1.$$
(14)

After reframing Eq. (14), we can find

$$b = \frac{{(\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} )^{2} + 4}}{4}.$$
(15)

This is the condition between the equal and unequal sensor separation cases among the bins for three acoustic sensors in the ASL scheme.

However, there is another way to establish a special condition for this scheme. If we consider C31, i.e., CCF due to cross-correlation between acoustic signals received at H3 and H1, another condition will be found. This CCF is not considered in equal sensor separation case, as only the CCFs due to the equidistant pair of sensors are used for that case. Considering the additional CCF, C31(τ), the ultimate estimation parameter of the ASL scheme with unequally separated sensors can be achieved by averaging the estimation parameters, R12, R23, and R31, derived from the CCFs, C12(τ), C23(τ), and C31(τ), respectively, can be expressed as:

$$R_{Unequal:ASL}^{3CCF} = \frac{{R_{12} + R_{23} + R_{31} }}{3} = \frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} + \sqrt {\frac{{b_{31} - 1}}{N}} }}{3}.$$
(16)

Here, b31 is the number of bins of C31(τ), which can be written as

$$b_{31} = \frac{{2 \times d_{DBS31} \times S_{R} }}{{S_{P} }} - 1.$$
(17)

In the ASL scheme, dDBS31 = dDBS12 + dDBS23 and b12 ≠ b23 ≠ b31. Hence, from Eq. (17), we can write as

$$b_{31} = b_{12} + b_{23} + 1.$$
(18)

Therefore, from Eq. (16), we can write

$$R_{Unequal:ASL}^{3CCF} = \frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} + \sqrt {\frac{{b_{12} + b_{23} }}{N}} }}{3}.$$
(19)

After reframing Eq. (19), we can write

$$b = \frac{{(\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} + \sqrt {b_{12} + b_{23} } )^{2} + 9}}{9},$$
(20)

which implies a special condition for unequal distances among acoustic sensors using three sensors in ASL scheme.

Now, we will establish a condition for unequal distances among acoustic sensors using three sensors in AST scheme. Analogous to special condition of ASL, AST scheme also has the similar property, i.e., dDBS31 = dDBS12 + dDBS23 and b12 ≠ b23 ≠ b31.

If we take the ratio \(\frac{{R_{Equal:AST}^{3CCF} }}{{R_{Unequal:AST}^{3CCF} }}\) equals 1, we will find

$$\frac{{\sqrt {\frac{b - 1}{N}} }}{{\frac{{\sqrt {\frac{{b_{12} - 1}}{N}} + \sqrt {\frac{{b_{23} - 1}}{N}} + \sqrt {\frac{{b_{31} - 1}}{N}} }}{3}}} = \frac{{3 \times \sqrt {b - 1} }}{{\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} + \sqrt {b_{31} - 1} }} = 1.$$
(21)

After reframing Eq. (21), we can write

$$b = \frac{{(\sqrt {b_{12} - 1} + \sqrt {b_{23} - 1} + \sqrt {b_{31} - 1} )^{2} + 9}}{9}.$$
(22)

This is the condition for the equal and unequal sensor separation cases among the bins for three acoustic sensors of the AST scheme.

Simulated impact

In this subsection, we will verify the theory by using simulated results. A MATLAB simulation environment was considered to obtain the simulations of proposed scheme. To acquire simulations, a uniform random distribution of fish and mammals was considered. The entire simulations were obtained with respect to chirp generating fish and mammals. No signal parameters were changed in the simulation. We have generated fish sounds considering different real-time parameters, i.e., frequency, time duration, bandwidth, etc.9. Similarly, fish signals can be represented as swift frequency wave9. The toolbox of MATLAB provides functions to generate swept-frequency waveforms such as the chirp function.

The following parameters were used in the MATLAB simulation as stated in Table 1. The motivation of this parameter setting is to give a clear idea about the total simulation, i.e., procedure and results, of this research.

Table 1 Parameters used in the MATLAB simulation.

A negligible amount of power difference among the acoustic pulses transmitted by each fish or mammal was considered to ease the simulations. We have used 500 iterations to achieve these simulated results, which mean the simulations run for 500 times and then average to achieve the plots. A SNR of 30 (26.02059 dB SNR) was considered to accomplish the simulation11

Figure 5 shows the comparison of simulated and theoretical results. Figure 5a corresponds to the ASL scheme of three acoustic sensors and Fig. 5b corresponds to the AST scheme of three acoustic sensors to show the effect of unequal sensor separations. The values of dDBS12 = 1 m and dDBS23 = 1.5 m; b12 = 39 and b23 = 59 in Fig. 5a. The values of dDBS12 = 1 m, dDBS23 = 1.5 m and dDBS31 = 2 m; b12 = 39, b23 = 59 and b23 = 79. It is seen from the Fig. 5 that the simulated results are very close to the theoretical results. This signifies the effectiveness of cross-correlation based passive monitoring method in the case of unequal distances among the acoustic sensors. Now simulations are performed to validate the conditions which we achieved in Eqs. (15) and (22). Analogous to the Fig. 5, simulated results of \(R_{Unequal:AST}^{3CCF}\) are plotted against the number of fish and mammals.

Figure 5
figure 5

Number of fish and mammal’s vs. R of CCFs for unequal distances among acoustic sensors, (a) three acoustic sensors ASL scheme where, dDBS12 = 1 m and dDBS23 = 1.5 m and (b) three acoustic sensors AST scheme where, dDBS12 = 1 m, dDBS23 = 1.5 m and dDBS31 = 2 m. We used MATLAB R2010a to achieve these figures.

The values used to find the results in Fig. 6c are similar as scenario shows in Fig. 6a,b, i.e., dDBS12 = 0.45 m (b12 = 17), dDBS23 = 0.95 m (b23 = 37) for unequal separation case and dDBS = 0.675 m (b = 26) for corresponding equal separation case.

Figure 6
figure 6

Scenario of unequal sensor separations and its corresponding results from simulations for three acoustic sensors ASL scheme (a) unequal sensor distances among acoustic sensors where dDBS12 = 0.45 m and dDBS23 = 0.95 m, (b) corresponding equal distances among acoustic sensors, i.e., dDBS = 0.675 m, derived from condition at Eqs. (15) and (20) and (c) Number of fish and mammals vs. R of CCFs for theoretical, and simulated with unequally and corresponding equally spaced acoustic sensors where the sensor distances are similar as in (a) and (b). Here, the axes units are in meter for (a) and (b). We used MATLAB R2010a to achieve this figure.

Figure 7 is simulated for the similar purpose as Fig. 6 but for three acoustic sensors AST scheme. Figure 7a shows an unequal sensor separation case of AST scheme where dDBS12 = 1.275 m, dDBS23 = 2.075 m and dDBS31 = 0.675 m whereas Fig. 7b shows corresponding equal sensor separation case of AST scheme with the distances among equidistant sensors is 1.3 m. Figure 7c shows the variation of simulated results of unequal sensor separation case from corresponding equal sensors separation case with respect to theoretical results. The values used to find Fig. 7c are similar as scenario shows in Fig. 7a,b, i.e., dDBS12 = 1.275 m (b12 = 50), dDBS23 = 2.075 m (b23 = 82) and dDBS31 = 0.675 m (b31 = 26) for unequal separation case and dDBS = 1.3 m (b = 26) for corresponding equal separation case.

Figure 7
figure 7

Scenario of unequal sensor separation and its corresponding results from simulations for three acoustic sensors AST scheme (a) unequal sensors separation where dDBS12 = 1.275 m, dDBS23 = 2.075 m and dDBS31 = 0.675 m (b) corresponding equal distance among acoustic sensors, i.e., dDBS = 1.3 m, derived from condition at Eq. (22) and (c) Number of fish and mammals vs. R of CCFs for theoretical, and simulated with unequally and corresponding equally spaced acoustic sensors where the sensor distances are similar as in (a) and (b). Here, the axes units are in meter for (a) and (b). We used MATLAB R2010a to achieve these figures.

We can see from Figs. 5, 6, and 7, the simulated results closely agreed with the theoretical results. Therefore, these results demonstrate the robustness of estimation even with unequally sensors separation. At the same time, the effectiveness of the conditions derived in Eqs. (15) and (22) is proved.

However, this research has certain bounds, i.e., negligence of the effect of multipath interference, consideration of uniform random distributions, assumption of a negligible amount of power difference among acoustic pulses, assuming the delays to be integer, and consideration of acoustic sensors to be stayed at the middle of the estimation area. Similarly, as a passive acoustic method, this technique can be only applicable to the soniferous species.

During practical implementation of this technique, several matters should be taken in to account, i.e., dispersion factor, limited bandwidth problem, proper signal to noise ratio (SNR), etc. Dispersion coefficient is the main factor of the distance dependent attenuation It was found that with the increase of dispersion coefficient, i.e., dispersion loss in the medium, a significant amount of deviation occurred from actual fishery quantity to the estimated12. Limited bandwidth of the underwater channel poses a barrier during acquisition of fish signals, which has infinite bandwidth. To overcome this problem, a proper scaling is a mandatory task11. Similarly, a low signal to noise ratio (SNR) is also an impediment to obtain an accurate estimation. It was found that estimation with minimum 26.02 dB SNR can perform like the noiseless estimation11. Researches are underway to overcome the barrier of center placement of acoustic sensors.

Many things can affect the speed of sounds, e.g., nature of the medium, (gas, liquid or solid), temperature, additive substances, i.e., such as salt in water, etc. Sounds travel faster through denser and hotter materials. At normal temperature, sound speed is 1493 ms−1 in fresh water, which is 1533 ms−1 in sea water22. However, in our research, we consider the propagation speed of fish sound is 1500 ms−1 during simulations

Conclusion

The researchers of cross-correlation based passive monitoring technique considered equidistant acoustic sensors during their investigations previously. This consideration was a limitation of this estimation process. In this research, we have worked to eliminate this limitation of equidistant sensors. We also have worked with three acoustic sensors and derived two conditions, i.e., one regular and one special condition, for ASL scheme and one condition for AST scheme. We found that the acoustic sensors can be unequally spaced but in a way that verifies these two conditions. The conditions for both ASL and AST schemes are verified by simulations. We also have proved the robustness of this estimation process in the case of unequal sensor separation. Finally, it was our key goal to remove the barrier of the assumption of equal distance among the acoustic sensors and we have removed that in this study which satisfies our goals properly.