Introduction

UWA channel is quite a challenging communication medium for its susceptibility to environmental factors such as temperature, salinity, and pressure1. Furthermore, due to the slow propagation speed and tidal moment of acoustic waves in underwater medium, underwater wireless acoustic communication becomes even more challenging, resulting in severe multipath, delay-spread, and Doppler spread2,3. Consequently, the challenges of inter-symbol interference (ISI) and inter-carrier interference (ICI) occur. To cope with these challenges, OFDM has been widely utilized for the last few decades and is considered a promising choice for efficiently handling the frequency-selective channel and enabling high data rate communication3,4,5. Different variants have also been proposed in literature to provide better noise immunity and bit error rate performance while maintaining low implementation costs6. One such variant is CP-OFDM that has been widely studied for underwater acoustic communication. The use of cyclic prefix and orthogonal subcarriers offers various advantages, however, it also requires delicate tackling of rapid time variation that is present due to the severe Doppler effect in the case of underwater channel2,4,7. If rapid time variation is not handled carefully, the frequency shift due to the Doppler effect compromises the orthogonality between the OFDM subcarriers, leading to ICI4. The mitigation of the Doppler effect and its compensation therefore, become essential in UWA OFDM communication to fully exploit the potential of OFDM technology.

Various methods have thus been developed in the literature to estimate the Doppler effect for underwater acoustic OFDM communication systems, including Doppler-insensitive waveforms, Doppler-sensitive waveforms, the least-square methods, cross-correlation-based methods, and auto-correlation-based methods4,7,8,9,10. Among them, the classical approach is the use of a signal structure that contains a Doppler-insensitive preamble, and postamble9 like a linear frequency modulated waveform and hyperbolic frequency modulated waveform11,12. Similarly, the Doppler sensitive waveform is developed to estimate the Doppler scale factor that uses the preamble, consisting of an m-sequence-coded waveform and the Costas waveform8. Doppler sensitive waveform method uses a cross-correlator bank, where the correlator resulting in a maximum peak is selected as the true Doppler scale estimate8. One of the common disadvantages of these Doppler-insensitive and Doppler-sensitive waveform methods is the signal overhead7. Other disadvantages include a large delay time and high computational complexity7. A least-square (LS) method is also introduced to estimate the Doppler scale factor in10, however, this method requires an iterative solution and needs to be channel equalized first. Therefore, the LS method may not be a viable option in a practical UWA communication system. On the other hand, a more common method used in UWA is auto-correlation-based Doppler scale estimation. Instead of preamble and postamble, auto-correlation method uses CP on the receiver side to estimate the Doppler scale factor4,7,13,14. The accuracy of auto-correlation-based Doppler scale estimation directly depends on the sampling rate of the received signal. A higher sampling rate of the received signal provides a better estimate of a Doppler scale factor, but at the cost of computational complexity4. To resolve this issue of computational complexity, an interpolation technique is introduced4,7,15,16. To summarize, serious efforts are made to estimate and compensate the Doppler effect through conventional approaches4,7,8,9,10, with each one having its advantages and limitations.

With the rapid development and progress in the use of deep learning algorithms, and their advantages in terms of performance, recently, non-conventional deep learning methods have been introduced for UWA OFDM communication17,18,19,20. However, deep learning-based methods so far used in UWA OFDM communication are mainly related to channel estimation. For example, Liu et. al propose a deep neural network-based model: CsiPreNet, where a convolutional neural network (CNN) and long short-term memory (LSTM) are integrated to predict channel state information for an adaptive UWA downlink orthogonal frequency division multiple access17. Similarly, Li et. al, in another study, propose a denoising autoencoder deep neural network (DAE-DNN) to improve channel estimation of a UWA OFDM communication by suppressing the impulsive noise18. Although deep learning-based methods are applied to channel estimation, little to no work is done towards time-varying channels, including the Doppler effect, a crucial factor in UWA communication. Jia et.el19 exploits the sparse nature of the UWA communication channel and considers the time-varying channel with a uniform Doppler, however, they did not explicitly estimate the Doppler scale factor. In another work21, Hassan et.al propose a deep learning-based channel estimator in the presence of Doppler shifts, however, their study also predicts the transmitted information bits directly without explicitly estimating the Doppler scale factor. We, therefore, identify this gap and consider a time-varying UWA channel having both multipath and the Doppler effect. We further exploit the novel application of a feedforward neural network for estimating the Doppler scale factor in a time-varying UWA CP-OFDM communication channel.

Feedforward neural networks have been widely used in several real-world applications such as the healthcare sector22,23,24, structural health monitoring25,26, mechanics27, environmental monitoring28, and wireless communication29,30. Driven by these diverse applications of feedforward neural networks, we in this work plan to investigate its performance in estimating the Doppler scale factor in UWA CP-OFDM communication. Toward this end, we use an FFBN with three different training algorithms. The main contributions in the paper are summarized below:

  • In this paper, we introduce the novel application of a feedforward backpropagation neural network for estimating the Doppler scale factor in UWA CP-OFDM communication. Unlike the conventional method, where two CP-OFDM symbols are used to improve the Doppler scale factor estimation accuracy, we use a single OFDM symbol approach, achieving spectral efficiency.

  • In FBNN, we utilize three different training algorithms, namely FBNN-based Fletcher-Reeves Conjugate Gradient (FBNN-CGF), FBNN-based Polak-Ribi’ere Conjugate Gradient (FBNN-CGP), and FBNN-based Conjugate Gradient with Powell/Beale Restarts (FBNN-CGB).

  • The performance comparison of the three FBNN-based training algorithms is carried out in terms of mean square error (MSE) and error histograms under dynamic channel conditions of varying SNR and number of paths.

  • A synthetic dataset is generated with all real-time and key parameters such as multipath, uniformly distributed Doppler scale factor, and signal-to-noise ratio.

  • The performance analysis of the FBNN-CGF, FBNN-CGP, and FBNN-CGB is carried out through the performance metric of RMSE of Doppler speed in m/s by estimating the difference between the true and estimated Doppler scale factor. In addition, a comparison has been made with the conventional signal processing method, such as pilot-assisted autocorrelation and pilot-assisted cross-correlation-based Doppler scale estimation. Additionally, the results are further verified using a watermark NOF1 channel dataset.

  • Finally, we calculate the bit error rate (BER) performance of the relatively best FBNN-based training algorithm achieved from the RMSE performance comparison.

UWA CP-OFDM system model

Since CP-OFDM is most commonly used in underwater acoustic communication, we therefore start with the CP-OFDM communication system model that includes time-varying information of underwater channel. Consider a CP-OFDM communication system4,7, that has a total number of subcarrier K, s[k] is the baseband modulated (phase shift keying (PSK) or quadrature amplitude modulation (QAM)) symbol on the kth subcarrier, the transmit CP-OFDM passband signal in continuous time domain \(\tilde{x}(t)\) can be written as

$$\begin{aligned} \tilde{x}(t)= \text {Re} \left\{ \sum _{k=-\frac{K}{2}}^{\frac{K}{2}-1}s[k]e^{j2\pi f_{k}t} q(t) \right\} , \end{aligned}$$
(1)

where \(f_{k}=f_{c}+\frac{k}{T}\) denotes the frequency of the kth subcarrier, here \(f_{c}\) is the carrier frequency, T represents OFDM symbol duration, and the rectangular pulse shaping window is denoted by q(t). The rectangular pulse shaping window q(t) can be written in two different signal formats based on a single OFDM symbol, that is, T or two identical OFDM symbols, that is, 2T as depicted in Fig. 14. Let’s denote the duration of CP with \(T_\textrm{cp}\), the q(t) can be written as

$$\begin{aligned} q(t) = {\left\{ \begin{array}{ll} 1, t\in \left[ -T_\textrm{cp},~2T\right] ; \\ 0, \text{ elsewhere. } \end{array}\right. } \end{aligned}$$
(2)
$$\begin{aligned} q(t) = {\left\{ \begin{array}{ll} 1, t\in \left[ -T_\textrm{cp},~T\right] ; \\ 0, \text{ elsewhere. } \end{array}\right. } \end{aligned}$$
(3)
Fig. 1
figure 1

CP-OFDM signal structures (a) Single CP-OFDM symbol and (b) Two CP-OFDM symbols, where CP represents cyclic prefix, and X is the OFDM symbol7.

We use (3) instead of (2) in our proposed FBNN-based Doppler scale estimation as it uses single OFDM symbols4 to help in efficiently utilizing the scarce underwater spectrum. Next, the channel impulse response with L number of paths can be expressed as

$$\begin{aligned} h(t)= \sum _{l=0}^{L-1} A_l \delta [(1+a_{true} )t-\tau _{l}], \end{aligned}$$
(4)

where \(A_l\), \(\tau _{l}\), and \(a_{true}\) denote the amplitude, time delay of the lth path, and true Doppler scale factor, respectively. After passing the passband CP-OFDM signal through a multipath channel, the received passband signal can be written as

$$\begin{aligned} \tilde{y}(t)&=\tilde{x}(t) \star \sum _{l=0}^{L-1} A_l \delta [(1+a_{true} )t-\tau _{l}] + \tilde{n}(t), \end{aligned}$$
(5)

where \(\tilde{n}(t)\) denotes the noise. The modified received passband signal after insertion of (1) in (5) can be expressed as4,7

$$\begin{aligned} \tilde{y}(t)&= \text {Re} \Bigg \{ \sum _{l=0}^{L-1}A_l \sum _{k=-\frac{K}{2}}^{\frac{K}{2}-1}s[k]e^{j2\pi (f_{c}+\frac{k}{T})(1+a_{true} )t}e^{-j2\pi (f_{c}+\frac{k}{T})\tau _{l}} \times q[(1+a_{true} )t-\tau _{l}] \Bigg \} + \tilde{n}(t). \end{aligned}$$
(6)

Next, the received passband signal is downshifted and passed through a low-pass filter. The final received baseband signal y(t) can then be written as4,7

$$\begin{aligned} y(t)&=\sum _{l=0}^{L-1}A_{l}^{c} e^{j2\pi f_{c}a_{true} t}\sum _{k=-\frac{K}{2}}^{\frac{K}{2}-1}s[k]e^{-j2\pi \frac{k}{T}\tau _{l}}e^{j2\pi \frac{k}{T}(1+a_{true} )t} \times q[(1+a_{true} )t-\tau _{l}] + z(t), \end{aligned}$$
(7)

where \(A_{l}^{c} = A_l e^{-j2\pi f_{c}\tau _{l}}\) denotes the baseband equivalent complex path amplitude, while z(t) is the baseband noise. With an assumption that the signal sampling frequency is \(f_s = \lambda B = \frac{\lambda K}{T}\), where B and \(\lambda\) denote the OFDM signal bandwidth and oversampling factor. When \(n \in \left[ -\frac{T_\textrm{cp}-\tau _\textrm{max}}{1+a_{true}}f_s, \frac{2T-\tau _\textrm{max}}{1+a_{true}}f_s\right]\), where \(\tau _\textrm{max}\) is the maximum delay spread, then from (7), the sampled digital signal from can be expressed as:

$$\begin{aligned} y[n]=\sum _{l=0}^{L-1} A_l^{c} e^{j2\pi \epsilon _{a_{true} }\frac{n}{\lambda K}}\sum _{k=-\frac{K}{2}}^{\frac{K}{2}-1}s[k]e^{-j2\pi \frac{k}{T}\tau _{l}}e^{j2\pi \frac{k}{\lambda K}(1+a_{true} )n} + w(n), \end{aligned}$$
(8)

where \(\epsilon _{a_{true} }=f_{c} a_{true} T\) denotes the normalized Doppler shift, and here \(f_c\) is the carrier frequency.

Proposed FBNN-based Doppler scale estimation approach

The Doppler scaling factors estimation for underwater acoustic communication requires dealing with complex data, and therefore the paper proposed employs a feedforward neural network with back-propagated training algorithm. Both the neural network and training algorithms are further explained in the following subsections:

Fig. 2
figure 2

An underwater acoustic CP-OFDM communication system with FBNN-based Doppler scales estimation.

Feedforward Backpropagation Neural Network (FBNN)

Figure 2 shows a block diagram of an OFDM receiver with Doppler scale estimation. The Doppler Estimation block utilizes FBNN to estimate Doppler scaling factors. The input of this block is y[n] (eq 8) where \(a_{true}\) is the target Doppler scaling factor needs to be estimated. FBNN takes y[n] as input, and \(a_{true}\) as target during the training phase. The FBNN consists of an input layer and one or more hidden layers to estimate the Doppler scale factor (\(a_{est}\)) in its output layer. A typical workflow at the level of neurons in different layers in an FBNN is listed below.

  • Each hidden layer neuron receives an input that is a weighted combination of the input from the prior layer, which is then passed through a nonlinear activation function. Therefore, weighted output \(z_j[k]\) of the j-th neuron in the hidden layer is specified using a nonlinear activation function g and utilizing the weight \(c_{ji}\) from input neuron i to hidden neuron j as

    $$\begin{aligned} z_j[k]=g\left( \sum _{i=1}^n c_{ji}\cdot y_i[k]+b_j\right) , \end{aligned}$$
    (9)

    where \(b_j\) is the bias of the hidden neuron j and n represents the number of input samples.

  • The output neuron r yields a value \(a_{est}^r[k]\) by using a nonlinear activation function h and utilizing the weight \(v_{rj}\) from hidden neuron j to output neuron r as

    $$\begin{aligned} a_{est}^r[k]=h\left( \sum _{j=1}^m v_{rj}\cdot z_j[k]+b_r\right) , \end{aligned}$$
    (10)

    where \(b_r\) is the bias of the output neuron r and m represents the number of hidden neurons.

The weight factor between two neurons in the adjacent layers is updated during the training phase. Next, details of the training algorithms are presented along with its different variants.

Conjugate gradient-based training algorithms

During the training phase, an FBNN uses various numerical optimization methods to optimize a cost function that measures the error between the estimated values (\(a_{est}\)) and the actual target values (\(a_{true}\)). The most preferred optimization methods are conjugate gradient (CG) algorithms due to their precision, robustness, and faster convergence31. Multiple variants of the CG algorithm include the Fletcher-Reeves conjugate gradient, the Polak-Ribiére conjugate gradient, and the conjugate gradient with Powell/Beale restarts.

A CG-based algorithm is an iterative method. Like a general numerical optimization method, it begins by defining an objective function for a minimization problem as follows:

$$\begin{aligned} \min _\textbf{u} f(\textbf{a,u}) = \frac{1}{2} \textbf{u}^T\textbf{A}\textbf{u}, \end{aligned}$$
(11)

where \(\textbf{a}\) is a vector of Doppler scales, and \(\textbf{A}\) is a Hessian matrix with elements \(\textbf{A}_{(i,j)}\)

$$\begin{aligned} \textbf{A}_{(i,j)} = \frac{\partial ^2 f(\textbf{a,u})}{\partial u_i \partial u_j}, \end{aligned}$$
(12)

and \(\textbf{u}=[u_1,u_2,\cdots ,u_N]^T\) with i-th value \(u_i\) denoted as

$$\begin{aligned} u_i = a_{true}^i - a_{est}^i \end{aligned}$$
(13)

represents the error between the i-th values \(a_{true}^i\) and \(a_{est}^i\) of the true and estimated Doppler scales, respectively.

Let \(\textbf{u}^k\) denote the current estimate of the weights; then, the solution algorithm seeks an update, \(\textbf{u}^{k+1}\), which results in: \(f(\textbf{a},\textbf{u}^{k+1})<f(\textbf{a},\textbf{u}^{k})\), termed the descent condition. Consequently, \(\textbf{u}\) is updated as per the following rule:

$$\begin{aligned} \textbf{u}^{k+1} = \textbf{u}^{k}+\alpha _k \textbf{d}^{k}, \end{aligned}$$
(14)

where \(\textbf{d}^{k}\) represents a search direction and \(\alpha _k\) is the step size that gets updated by minimizing \(f(\textbf{a},\textbf{u})\) along \(\textbf{d}^{k}\).

The iteration in (14) can be solved in two steps as listed below.

  1. 1.

    Finding the suitable search direction \(\textbf{d}^{k}\) along which the value of the function locally decreases.

  2. 2.

    Performing a line search along \(\textbf{d}^{k}\) to find \(\textbf{u}^{k+1}\) such that \(f(\textbf{u}^{k+1})\) reaches its minimum value.

Fig. 3
figure 3

A flow chart of the conjugate gradient algorithm where \(\beta _k\) is computed under three different variants: CGF, CGP, and CGB.

In the context of the CG algorithm, let \(\mathbf {d^0,d^1,d^2,\cdots ,d^{n-1}}\), where \(\textbf{d}^i \textbf{A d}^j=0, i\ne j\) denote conjugate directions with respect to the Hessian matrix \(\textbf{A}\). This ensures that the search directions are orthogonal with respect to the matrix \(\textbf{A}\), leading to an efficient descent in the solution space. Starting from \(\mathbf {d^0}\), taken to be the steepest descent direction, we can use the following procedure to generate initial and next directions denoted \(\textbf{d}^0\) and \(\textbf{d}^{k+1}\), respectively as

$$\begin{aligned} \textbf{d}^0 = -\nabla f(\textbf{u}^0), \end{aligned}$$
(15)
$$\begin{aligned} \textbf{d}^{k+1} = -\nabla f(\textbf{u}^{k+1}) + \beta _k \textbf{d}^{k}, \end{aligned}$$
(16)

where \(\textbf{d}^0\) is chosen arbitrarily and \(\nabla f(\cdot )\) denotes the gradient.

Multiplying both sides in (16) by \(\textbf{d}^k \textbf{A}\) and equating to zero gives \(\beta _k\) as

$$\begin{aligned} \beta _k = \frac{\nabla f(\textbf{u}^{k+1})^T \textbf{A d}^k}{(\textbf{d}^k)^T\textbf{A d}^k}, \end{aligned}$$
(17)

where \((\cdot )^T\) denotes the matrix transpose. Having found \(\beta _k\) leads to finding \(\textbf{d}^{k+1}\) that can be utilized in (14) to finally compute \(\textbf{u}^{k+1}\).

Figure 3 shows the learning process of the CG algorithm (as formulated in (11) to (17)) in the form of a flowchart. The expression in (17) can be further simplified if additional assumptions about the function and the line search algorithm are made, leading to different versions of the CG methods, such as CGF, CGP, and CGB, as described in the following subsections.

Fletcher-Reeves Conjugate Gradient (CGF)

For the Fletcher-Reeve update, \(\beta _k\) is calculated using the procedure listed below.

Because

$$\begin{aligned} \nabla f(\textbf{u}^{k+1})^T\textbf{d}^k= & \nabla f(\textbf{u}^{k+1})^T \times \nonumber \\ & \left( -\nabla f(\textbf{u}^k)+\beta _{k-1}\textbf{d}^{k-1}\right) = 0, \end{aligned}$$
(18)

where for quadratic functions

$$\begin{aligned} \nabla f(\textbf{u}^{k+1}) = \nabla f(\textbf{u}^{k})+\alpha _k \textbf{A}\textbf{d}_k, \end{aligned}$$
(19)

therefore by exact line search condition

$$\begin{aligned} \nabla f(\textbf{u}^{k+1})^T\nabla f(\textbf{u}^{k})= & \beta _{k-1}\times \nonumber \\ & \left( \nabla f(\textbf{u}^{k})+\alpha _k \textbf{A}\textbf{d}_k\right) ^T\textbf{d}^{k-1}=0, \end{aligned}$$
(20)

results in \(\beta _k\) as the ratio of the squared-norm of the current gradient to the squared-norm of the previous gradient using the following relation

$$\begin{aligned} \beta _k = \frac{\nabla f(\textbf{u}^{k+1})^T\nabla f(\textbf{u}^{k+1})}{\nabla f(\textbf{u}^{k})^T\nabla f(\textbf{u}^{k})}. \end{aligned}$$
(21)

Polak-Ribiére Conjugate Gradient (CGP)

For the Polak-Ribiére update, the constant \(\beta _k\) is calculated as the inner product of the previous change in the gradient with the current gradient divided by the squared norm of the previous gradient. Firstly, in the case of exact line search, we notice that

$$\begin{aligned} \nabla f(\mathbf {u^{k+1}})^T \textbf{d}^k = 0. \end{aligned}$$
(22)

Thus,

$$\begin{aligned} \beta _k=\frac{\nabla f(\textbf{u}^{k+1})^T\left( \nabla f(\textbf{u}^{k+1})-\nabla f(\textbf{u}^{k})\right) }{\nabla f(\textbf{u}^{k})^T \nabla f(\textbf{u}^{k})}. \end{aligned}$$
(23)

Typically, the CGP algorithm performs similarly to the CGF algorithm. For a given Doppler scale estimation problem, it may not be possible to predict which algorithm will perform best, however, the storage requirements for the CGP algorithm are slightly larger than those for the CGF algorithm.

Conjugate Gradient with Powell/Beale Restarts (CGB)

To improve the efficiency during the search for optimal direction, the Powell-Beale technique verifies if there is any orthogonality between the current gradient \(\nabla f(\textbf{u}^{k+1})\) and the previous gradient \(\nabla f(\textbf{u}^{k})\) using an inequality

$$\begin{aligned} |\nabla f(\textbf{u}^{k+1})^T\nabla f(\textbf{u}^{k})|\ge 0.2||\nabla f(\textbf{u}^{k})||^2. \end{aligned}$$
(24)

The search direction is reset to the negative of the gradient if this condition is satisfied. In some implementations, \(\beta _k\) might be calculated as

$$\begin{aligned} \beta _k =\frac{\nabla f(\textbf{u}^{k+1})^T \mathbf {d^{k}}}{\nabla f(\textbf{u}^{k})^T \mathbf {d^{k}}}. \end{aligned}$$
(25)

For some problems, the CGB algorithm usually performs slightly better than the CGF and CGP algorithms. In the next section, we evaluate the performance of the three training algorithms. However, the storage requirements for the CGB algorithm are slightly higher than those for the CGP algorithm.

Computational complexity of training algorithms

The three conjugate gradient methods, that is, CGF, CGP, and CGB, that have been presented as candidate training algorithms in the preceding subsections, have multiple key factors that influence their complexity. These include problem scale, computational complexity, and memory requirements. Table 1 shows a comparison of the CGF, CGP, and CGB algorithms. As can be seen, all three algorithms exhibit the same computational complexity for a given number of variables N. However, the CGF algorithm is the best in terms of memory requirement compared to CGP and CGB. However, for a specific implementation on the edge, using techniques like pruning and quantization may further reduce the size and complexity of the model while retaining performance. This will further reduce the memory requirements on the edge.

Table 1 A comparison of the training algorithms in terms of scale, computational complexity, and memory requirements.

FBNN implementation and performance evaluation

Fig. 4
figure 4

FBNN-based Doppler scale estimation implementation diagram; 3 neurons in a single hidden layer and three different backpropagated learning algorithms: CGF, CGP, and CGB are used.

Figure 4 shows FBNN implementation block diagram for Doppler scaling estimation. A healthy dataset (for CP-OFDM) is generated of 1900 complex random vectors with each vector consisting of 12992 values. The dataset contains 19 different SNR values between \(-10\) dB to 26 dB with an index range per SNR of 100. With this formulation, three different datasets are created with number of paths as: 1, 5, and 10. For channel settings, the time interval between two paths having a mean value of 1ms is exponentially distributed, while paths amplitudes are Rayleigh distributed. The average power of each path is exponentially decayed by 20dB in a delay spread of 30ms, and the true Doppler scale factor (\(a_{true}\)) is uniformly distributed. Table 2 summarizes the various parameter values that generate these datasets.

Table 2 Simulation parameter settings for CP-OFDM dataset generation.

For each dataset, about \(70\%\) of complex vectors y[n] (with combined real and imaginary parts) act as input during the training phase. Similarly, each input vector corresponds to a precomputed Doppler scale factor \(a_{true}\) as the target. Furthermore, we use linear regression to train the FBNN network. A FBNN simulation requires a) data distribution between training, testing, and validation phases, b) a training algorithm, and c) specifying the activation functions for hidden and output layers. Since the FBNN neural network architecture is not very deep, a two-layer neural network: a single hidden layer with a size of 3 neurons and an output layer is sufficient in our case. Table 3 lists the parameters and settings used to train, test, and validate the FBNN for the estimation of the Doppler scale.

Table 3 Simulation parameter settings for training, testing, and validation of the FBNN Network.

A detailed stepwise implementation of FBNN is listed below:

  • During the training phase, \(70\%\) samples are randomly selected for both input and target. The remaining \(30\%\) samples (of both input and output) are reserved for the validation and testing phase.

  • Three training algorithms (CGF, CGP, and CGB) are chosen to train our FBNN network.

  • The activation functions used in the hidden and output layers are tansig and purelin.

  • The FBNN is then tested and validated using statistical measures such as mean squared error (MSE) and error histograms.

  • Finally, Doppler scale factors are estimated using the designed FBNN network.

Fig. 5
figure 5

Performance analysis of FBNN-CGF algorithm where (a), (b), and (c) shows performance plots when channel path = 1, 5, and 10 respectively, while (d), (e), and (f) shows error histograms when channel path = 1, 5, and 10 respectively.

Performance evaluation

We evaluate the performance of the three FBNN-based training algorithms: CGF, CGP, and CGB, by examining both the performance plot and error histogram. Moreover, root mean square error vs. signal-to-noise ratio plots are also used as a performance metric for Doppler scale factor estimation evaluation for each case of multipath. The RMSE of a Doppler speed in m/s can be expressed as

$$\begin{aligned} {RMSE}=\sqrt{E[\left| ({a_{est}} - a_{true} ) \times c \right| ^{2}]}, \end{aligned}$$
(26)

where \(c=1500\) m/s is the underwater sound speed.

Training, testing and validation results

This subsection presents the performance analysis of the FBNN-CGF, FBNN-CGP, and FBNN-CGB in terms of performance plots of mean square error (MSE) and error histograms as depicted in Figs. 5, 6, and 7.

Figure 5 shows MSE and error histogram plots for the FBNN-CGF algorithm with channel paths as 1, 5, and 10. From Fig. 5a, b, and c, the performance of the FBNN-CGF algorithm against training, testing, and validation data can be seen in terms of MSE vs epochs. It may be noted that MSE is lower for a single channel path and becomes larger when channel paths are increased to 10. The error histogram plots shown in Fig. 5d, e, and f are also consistent with the MSE plots results, where the error is lower in the case of a single channel path and becomes significant when 10 multipath are used.

Fig. 6
figure 6

Performance analysis of FBNN-CGP algorithm where (a), (b), and (c) shows performance plots when channel path = 1, 5, and 10 respectively, while (d), (e), and (f) shows error histograms when channel path = 1, 5, and 10 respectively.

Fig. 7
figure 7

Performance analysis of FBNN-CGB algorithm where (a), (b), and (c) shows performance plots when channel path = 1, 5, and 10 respectively, while (d), (e), and (f) shows error histograms when channel path = 1, 5, and 10 respectively.

Figure 6 shows MSE and error histogram plots for the FBNN-CGP algorithm with channel paths of 1, 5, and 10. The result trends shown in Fig. 6 are similar to those shown in Fig. 5 in terms of both MSE and error histograms. However, it can be observed that FBNN-CGP performs relatively poorly compared to FBNN-CGF.

Figure 7 shows MSE and error histogram plots for the FBNN-CGB algorithm under channel paths as 1, 5, and 10. A similar conclusion of results with consistent trends in MSE and error histogram plots with Figs. 5 and 6 may be seen. The performance of FBNN-CGB outperforms when the channel path is set to 1 as compared to channel paths set to 5 and 10. In addition, it may also be observed that FBNN-CGB performance is relatively better than FBNN-based CGF and CGP training algorithms.

RMSE vs. SNR comparisons

The performance of various FBNN-based training algorithms for estimating the Doppler scale factor is carried out by calculating RMSE vs. SNR plots as depicted in Figs. 8, 9, and 10.

Figure 8 shows RMSE of Doppler speed in m/s vs. SNR in dB’s for a single channel path. In this Figure, the proposed FBNN-based Doppler scale estimation algorithms are further compared with conventional auto-correlation and cross-correlation-based Doppler scale estimation methods. The parameters used to calculate the RMSE of the conventional methods are kept the same as given in Table 2 to ensure a fair comparison. It can be observed from the Fig. 8 that our proposed Doppler scale estimation algorithms not only estimate the Doppler scale factor but also show significant performance, especially in the lower SNR regions, where the conventional methods: autocorrelation-based methods and pilot-based cross-correlation method performance is very poor. The autocorrelation-based method (denoted by AC-M1) performs better in higher SNR regions as compared to other conventional and non-conventional FBNN-based algorithms.

Considering the FBNN-based Doppler scale factor estimation, FBNN-CGB outperforms FBNN-CGF and FBNN-CGP. This behavior is consistent with the fact described in Sect. "Conjugate Gradient with Powell/Beale Restarts (CGB)".

Fig. 8
figure 8

RMSE vs. SNR comparison of all three FBNN-based training algorithms when the number of channel path = 1.

Fig. 9
figure 9

RMSE vs. SNR comparison of all three FBNN-based training algorithms when the number of channel paths = 5.

Figure 9 shows RMSE of Doppler speed in m/s vs. SNR in dB’s for 5 channel paths. Similar to Fig. 8 results analysis, the trend of RMSE vs. SNR of each algorithm for both conventional and non-conventional methods is the same. However, it can be observed that RMSE is significantly larger in this case, which is mainly due to the multipaths. Furthermore, in this case, FBNN-based CGB, CGF, and CGP algorithms perform almost similarly. Similarly, the non-conventional FBNN-based algorithms outperform in lower SNR as compared to conventional-based AC and CC methods.

Figure 10 shows RMSE of Doppler speed in m/s vs. SNR in dB’s for 10 channel paths. The results trend is quite similar to Fig. 9. Non-conventional FBNN-based algorithms perform better in lower SNR, with FBNN-CGB performing a bit better than FBNN-CGF and FBNN-CGP. While, the autocorrelation-based method 1 performs better in high SNR regions.

Fig. 10
figure 10

RMSE vs. SNR comparison of all three FBNN-based training algorithms when the number of channel paths = 10.

To conclude, the results presented in Figs. 8, 9, and 10, show that non-conventional FBNN-based algorithms perform better in the lower SNR region as compared to conventional auto-correlation and cross-correlation based methods. Among FBNN-based algorithms, CGB-based training algorithm performs relatively better as compared to FBNN-based CGF and CGP algorithms. Among the conventional methods, Auto-correlation based method 1 (AC-M1) performs better in high SNR regions.

BER vs. SNR analysis

Since, FBNN-CGB algorithm performs relatively better than FBNN-based CGF and CGP algorithms, therefore, we further carried out bit error rate vs SNR simulations for the CGB training algorithm only for different multipath channel settings. In Fig.11, the bit error rate vs. signal-to-noise ratio is shown for the FBNN-CGB algorithm for different numbers of paths. It can be seen from Fig. 11 that when a single channel path is used, the BER performance is better as compared to channel multipath scenarios. At SNR of 4 dB, the bits are successfully decoded and BER becomes 0. However, the BER performance is lower when channel paths are 5 and 10.

Fig. 11
figure 11

FBNN-CGB’s BER vs. SNR comparison using \(1, 5, \text { and }, 10\) paths.

Experimental evaluation through watermark channel

In this section, we perform further evaluation of the FBNN-based Doppler scale estimation through the Watermark channel dataset32. To be specific, we used Norway-Oslofjord (NOF1) data, which is a single-input-single-output (SISO) shallow water setup, where transmitter and receiver are both bottom-mounted. The NoF1 data is actually the time varying impulse response (TVIR) measurements, containing a total of 60 channels; because of repeating the experiments 60 times at an interval of 400 seconds, where one TVIR measurement lasts over 32.9 seconds, therefore, a total of 33 minutes of play time32. Table 4 summarizes the NOF1 channel and measurement condition parameters. The OFDM parameters used are listed in Table 5.

Table 4 Watermark NOF1 channel parameters and sounding conditions32.

The dataset is generated using a single NOF1 channel (channel 59 of the NOF1) and introducing a uniformly-distributed Doppler scale factors to the transmitted OFDM signal; therefore, it can be termed as a semi-realistic dataset. With the use of the SNR loop as mentioned in Table 5, which is further iterated 10 times for each SNR, a healthy dataset is generated. In our case, the packets per soundings of the transmitted OFDM signal are 155; therefore, the dimension of the transmitted OFDM signal becomes \(1550 \times 9216\) per SNR. To evaluate the FBNN-based CGP, CGF, and CGB algorithms, we divide the dataset into 70%, 15%, and 15% for training, validation, and testing. The hidden layer size used in FBNN-based CGP, CGF, and CGB models is 20. First, we evaluate the performance of the proposed FBNN-based training algorithms through performance plots and error histograms, and then calculate the RMSE of each method.

Table 5 CP-OFDM parameters used for experimental evaluation of the proposed FBNN-based Doppler scale estimation using Watermark NOF1 channel.
Fig. 12
figure 12

Performance analysis of FBNN-based algorithms using watermark NOF1 channel where (a) and (b) show performance plot and error histogram for FBNN-CGP, (c) and (d) show performance plot and error histogram for FBNN-CGF, CGF, and (e) and (f) show performance plot and error histogram for FBNN-CGB algorithm.

Fig. 13
figure 13

RMSE vs. SNR comparison of all three FBNN-based training algorithms using watermark NOF1 channel.

Figure 12 shows the performance of the FBNN-based CGP, CGF, and CGB algorithms for Doppler scale estimation in UWA OFDM communication using the NOF1 channel. It can be observed from the Fig. 12 that the performance of the FBNN-based CGP algorithm is lower, while FBNN-based CGB algorithm outperforms using the watermark NOf1 channel. These results show a similar trend and conclusion drawn using the synthetic UWA channel dataset. Additionally, Fig. 13 shows the RMSE versus SNR results for all three FBNN-based CGP, CGF, and CGB algorithms. It can be seen that FBNN-CGB performance is better compared to CGF and CGP algorithms, which is again a similar conclusion to the results achieved from using the synthetic UWA channel dataset.

Conclusion

Doppler estimation has always been a challenging task in underwater acoustic communication. This paper applies a feedforward backpropagated neural network (deep learning-based) approach to estimate Doppler scale factors for underwater acoustic CP-OFDM communication. Furthermore, different conjugate gradient training algorithms: CGF, CGP, and CGB, are applied and compared to evaluate the performance and accuracy. From the results, it can be concluded that overall, FBNN-based training algorithms perform better in lower SNR regions for single and multipath channel settings. Furthermore, FBNN-CGB outperforms CGP and CGF training algorithms. The performance of the FBNN-based algorithms is further verified using the UWA watermark channel, which draws a similar conclusion.