Introduction

Orthogonal frequency division multiplexing (OFDM) employs orthogonal subcarriers separated by a parameter called the subcarrier spacing. It is well-adapted to frequency selective channels since frequency selectivity converts the subcarriers to parallel flat-fading channels1. In 4G/5G systems the carrier spacing is chosen so that inter-carrier interference (ICI) is of the same order as the underlying Gaussian noise. The subcarriers are effectively orthogonal, and the channel matrix is effectively diagonal. This makes the complexity of equalization very low; a one-tap equalizer divides each received symbol by the channel it experiences. This defining feature of OFDM has led to its widespread adoption.

As mobility increases (or carrier spacing decreases) the wireless channel becomes increasingly time-selective, and the subcarriers experience significant frequency shifts that vary from one subcarrier to another. ICI becomes more significant, the subcarriers are no longer effectively orthogonal, and the channel matrix is no longer effectively diagonal. It becomes increasingly difficult to accurately estimate the off-diagonal entries, and OFDM performance degrades with increasing Doppler spread (defined as twice the maximum absolute Doppler shift)2,3.

Doppler spread is directly proportional to the carrier frequency and the maximum absolute relative velocity between the transmitter and receiver. Spectrum in the sub-6 GHz range is already crowded, and increased demand has increased interest in higher carrier frequencies where a modest relative velocity can introduce large Doppler spreads. Vertical applications such as communication with bullet trains also introduce large Doppler spreads. OFDM may only be able to satisfy these use cases by increasing carrier spacing (and reducing spectral efficiency).

Zak-transform based orthogonal time frequency space (Zak-OTFS) introduced in refs. 2,3 is a modulation scheme that is robust to doubly selective channels. In Zak-OTFS, the information symbols are mounted in the delay-Doppler (DD) domain. The channel is also viewed in the DD domain. The received information symbols are simply the delay & Doppler shifted and attenuated versions of the transmitted information symbols. This representation of the scattering environment changes at the speed of physics, which is much slower than the speed of wireless communication.

Figure 1 illustrates the fundamental difference between OFDM and Zak-OTFS (or MC-OTFS4,5,6,7). Consider a multi-user setup where many users are served in a slot with OFDM and Zak-OTFS. In OFDM, the subcarrier spacing is adjusted to cater to the worst Doppler spread across users, which means for all the other users, even if their Doppler spread is much smaller, the subcarrier spacing is still the same. This amounts to preventing or avoiding inter-symbol interference (ISI)/(ICI) which is illustrated in the OFDM grid points in Fig. 1 wherein the subcarrier spacing is chosen to be at least twice the Doppler spread. The benefit of avoiding ICI/ISI is the low equalization complexity. On the other hand, in Zak-OTFS, since the goal is not to choose grid points to avoid interference, we allow the information symbols to undergo ICI/ISI which results in higher equalization complexity. In a nutshell, while OFDM is designed to prevent interference, Zak-OTFS is designed to increase spectral efficiency by embracing interference.

Fig. 1: A pictorial representation of the difference between OFDM and Zak-OTFS.
Fig. 1: A pictorial representation of the difference between OFDM and Zak-OTFS.
Full size image

OFDM prevents ISI/ICI, that is the subcarrier spacing is adjusted to avoid interference, while Zak-OTFS embraces ISI/ICI. This results in poor resource utilization in OFDM but low equalization complexity while resource utilization is good in Zak-OTFS at the cost of equalization complexity.

This is illustrated in Figs. 2 and 3 where we plot the spectral efficiency (in bits/s/Hz) of OFDM and the proposed Zak-OTFS as a function of maximum delay spread \(({\tau }_{\max })\) and Doppler spread \(({\nu }_{\max })\) for a frame bandwidth of 720 kHz and frame duration of 1 ms8. It is seen that the spectral efficiency of OFDM varies between 7 and 0. The spectral efficiency being zero implies that the corresponding \(({\nu }_{\max },{\tau }_{\max })\) pair is not supported by OFDM. On the other hand, the spectral efficiency of Zak-OTFS remains constant within the considered \({\nu }_{\max }\) and \({\tau }_{\max }\) range.

Fig. 2: Spectral efficiency as a function of maximum delay and Doppler spread.
Fig. 2: Spectral efficiency as a function of maximum delay and Doppler spread.
Full size image

OFDM spectral efficiency has a large variance and drops down to zero at high Doppler spread.

Fig. 3: Spectral efficiency as a function of maximum delay and Doppler spread.
Fig. 3: Spectral efficiency as a function of maximum delay and Doppler spread.
Full size image

Zak-OTFS has constant spectral efficiency as a function of maximum delay and Doppler spread.

For a channel with P resolvable paths in the DD domain, P (delay & Doppler shifted and attenuated) copies of each transmitted symbol are received. This introduces interference along both delay and Doppler axes. The effective channel matrix is no longer sparse and does not have a structure that could be exploited to equalize the received information symbols at low complexity. The current implementations of Zak-OTFS9,10 all use the linear minimum mean square error (LMMSE) matrix1 to equalize the received information symbols. However, for large frame sizes the LMMSE matrix computation becomes impractical since its complexity is cubic in the frame size owing to the need for matrix inversion.

The key contributions of the paper are listed as follows.

  • We propose a low-complexity equalization scheme for Zak-OTFS, where equalization is carried out in the frequency-domain (FD) instead of the DD domain. Specifically, the information symbols in the DD domain are converted to the FD using the inverse discrete frequency Zak transform (IDFZT)11 at the transmitter. These symbols are transmitted through a doubly selective channel with fractional DD values. A parallel development can be done in the time-domain (TD) as well. Although equalization of channel in the TD may be beneficial in time-selective channels, channel matrix is not sparse with frequency-selective channels. Since this paper deals with fractional DD channel, the TD equalization incurs high complexity owing to the TD channel matrix being dense.

  • We derive a system model in the FD that relates the transmitted FD information symbols to the received FD information symbols. We show that this matrix is modulo banded diagonal (see Fig. 18).

  • Using the null space of the IDFZT matrix, we reduce the channel matrix to a banded diagonal that admits a low-complexity conjugate gradient method (CGM)12 for equalization.

  • We show that the complexity of the proposed low-complexity equalization scheme is linear in the frame size compared to the cubic complexity of LMMSE.

  • Finally, through extensive numerical simulations, we show that FD equalization of Zak-OTFS achieves similar performance as equalization in DD in both perfect and estimated channel knowledge scenarios across various choices of pulse shaping filters13.

A preliminary version of this method was proposed in ref. 14. The main differences between14 and this paper are as follows. In this paper we derive the matrix form of the input-output (I/O) relation in the FD, which is not present in ref. 14. Next, to reduce the modulo-banded channel matrix in the FD to a banded structure, we mount symbols on the null space of the transform matrix, which is not considered in ref. 14. We also consider CGM for equalization in this paper which has complexity linear in frame size; in contrast the complexity of the method in ref. 14 is quadratic. Lastly, we study the effect of estimated channel state information (CSI) with various pulse shapes, which is not considered in ref. 14.

Notation: x denotes a complex scalar, x denotes a vector with nth entry x[n], and X denotes a matrix with (n, m)th entry X[n, m]. ()* denotes complex conjugate, \({(\cdot )}^{T}\) denotes transpose, \({(\cdot )}^{{\rm{H}}}\) denotes complex conjugate transpose, and ()−1 denotes the matrix inverse. \({\mathbb{Z}}\) denotes the set of integers and \({\mathbb{C}}\) denotes the set of complex numbers. δ[] denotes the Kronecker delta function. \(\parallel \cdot {\parallel }_{2}^{2}\) denotes the 2-norm of a vector or Frobenius norm of a matrix. \(a\equiv 0\,mod\,N\,\Rightarrow \,a=cN\) for some \(c\in {\mathbb{Z}}\). denotes the Kronecker product. \({\mathbb{1}}\) denotes the indicator function. \(null({\bf{A}})\) denotes the null space of the matrix A, \(vec(\cdot )\) denotes column-wise vectorization, and \(diag({\bf{a}})\) denotes a diagonal matrix with a as the diagonal. A 0 implies that the matrix A is positive definite. denotes the vector gradient. \({\mathcal{O}}(\cdot )\) denotes the complexity order. denotes the ceil operation.

Prior work: There are several modulation schemes proposed in the literature11,15,16,17,18,19,20,21,22,23,24. For obvious reasons, we do not compare bit-error performance against all the schemes proposed in literature. However, we quantitatively differentiate Zak-OTFS from these schemes in the following.

The modulation schemes in literature are presented in Table 1. Multicarrier-OTFS (MC-OTFS) approaches in11,23,24 are not “predictable”, i.e., the channel response from a pilot at a given location cannot be used to predict the channel response at another location3. OFDM does not work under high mobility scenarios as described earlier and is not predictable. Zak-OTFS-OGD proposed in16 does not work with fractional DD channel and therefore is not practical. Orthogonal chirp domain modulation (OCDM)15,17 are proposed with integer delay and fractional Doppler values and is not a true representation of the channel. A generalized version of OCDM is affine frequency domain multiplexing (AFDM)18,19,21,22. Again, the system model is proposed only for integer delay shifts and fractional Doppler shifts. Orthogonal delay-Doppler multiplexing (ODDM)20 is yet another modulation scheme that works in the DD domain. However, like the previous schemes, the system model is proposed only for integer delay values. Further OCDM, AFDM, and ODDM all require cyclic prefix (CP) which Zak-OTFS does not. For channel estimation, all these methods make use of the embedded pilot as opposed to the spread pilot9,10 or even data symbols25 in Zak-OTFS which incurs no spectral efficiency loss. Equalization of Zak-OTFS in the DD domain2,3,11 incurs cubic complexity in the frame size owing to matrix inversion in the LMMSE matrix computation. FD equalization of Zak-OTFS presented in the preliminary version14 incurs quadratic complexity in the frame size. The approach presented in this paper has complexity linear in the frame size. This follows from reduction of the channel matrix to banded structure and exploiting this structure in the CGM. In the DD domain where the Zak-OTFS channel matrix does not have a defined structure, equalization via CGM allows for reduced complexity.

Table 1 Characteristics of different methods

Among the modulation schemes, we pick AFDM for performance comparison (see section “Comparison with AFDM18,19”), since the system model can be extended for fractional DD values and also since it is the generalized version of OCDM. Further, AFDM and Zak-OTFS are the only two schemes which are added into the 6G standards as study items26.

Remark

There are many methods in the literature for MC-OTFS (for example4,5,6,7) that aim to perform low-complexity equalization in different domains (including the time and/or frequency domain). These methods do not directly apply to Zak-OTFS. This is because MC-OTFS and Zak-OTFS are not related through unitary transforms (for a detailed discussion, see [ref. 3 and Figs. 14, 15]). If it were, it should have been possible to move from one system model to another through unitary matrices. This also explains why some of the salient features in Zak-OTFS like predictability is missing in MC-OTFS. Hence, all the low-complexity methods for equalization proposed for MC-OTFS cannot be applied in the context of Zak-OTFS.

Results and discussion

In this section, we show the performance of the FD equalization of Zak-OTFS under both perfect CSI and estimated CSI. Table 2 shows the meaning for each variable used below. For all the simulations we consider a Zak-OTFS frame with M = 31, N = 37 and νp = 30 kHz. We consider a Vehicular-A (Veh-A) channel27 with power-delay profile shown in Table 3. Doppler for path i is computed as \({\nu }_{i}={\nu }_{\max }\cos ({\theta }_{i})\), where θi is uniformly distributed between −π and π and \({\nu }_{\max }\) is the maximum Doppler spread (Note that our channel model considers fractional delay and Doppler shifts, which is representative of real propagation environments. The path delays in Table 3 are non-integer multiples of the delay resolution 1/B. The Doppler shifts \({\nu }_{i}={\nu }_{\max }\cos ({\theta }_{i})\) are also non-integer multiples of the Doppler resolution 1/T since \(\cos ({\theta }_{i})\) is continuous valued). We present results for four pulse shapes, sinc3, root raised cosine (RRC)9,10, Gauss13, and Gauss-sinc13. For CP-OFDM, we consider a CP of 4 delay bins. The choice of b depends on the maximum Doppler spread of the channel and the pulse shaping employed and is a design parameter. For the simulations presented in the following, the choice of b is specified in the figure captions.

Table 2 Variables and their meanings
Table 3 Power-delay profile of Veh-A channel model

Comparison with existing approaches

We compare the perfect CSI performance of Zak-OTFS in DD and FD, I/O relation in (34), and CP-OFDM in Figs. 4 and 5 under high Doppler spread (\({\nu }_{\max }=815\) Hz) and low Doppler spread (\({\nu }_{\max }=81.5\) Hz), respectively. We consider an RRC pulse with parameter βτ = βν = 0.6. For the CGM method we consider \(k=250,b=\lceil {\nu }_{\max }T\rceil +1\). For (34) and CP-OFDM we consider two receiver architectures. First a genie receiver which allows joint equalization across carriers (because in practice, it is hard to acquire channel spread occurring due to ICI) and second a practical receiver, which uses a 1-tap equalizer (which uses channel estimates along the diagonal and doesn’t acquire the ICI). As mentioned in section “Embedding information symbols in FD”, for (34) the subcarrier spacing is B/MN = 810.81 Hz, while that for CP-OFDM it is B/M = 30 kHz.

Fig. 4: Comparison of BER of different modulation schemes with perfect channel knowledge at high Doppler values.
Fig. 4: Comparison of BER of different modulation schemes with perfect channel knowledge at high Doppler values.
Full size image

FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil {\nu }_{\max }T\rceil +1\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, Veh-A channel with \({\nu }_{\max }=815\) Hz, RRC pulse shaping filter with βτ = βν = 0.6.

Fig. 5: Comparison of BER of different modulation schemes with perfect channel knowledge at low Doppler values.
Fig. 5: Comparison of BER of different modulation schemes with perfect channel knowledge at low Doppler values.
Full size image

FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil {\nu }_{\max }T\rceil +1\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, Veh-A channel with \({\nu }_{\max }=81.5\) Hz, RRC pulse shaping filter with βτ = βν = 0.6.

At high Doppler spread (Fig. 4), consequently, the performance of 1-tap equalizer for (34) floors (since Doppler spread is of the order of subcarrier spacing). However, the genie receiver performs much better since ICI is acquired at the receiver. There is no performance difference between the two receivers for CP-OFDM. This is because the subcarrier spacing is large enough to be not be affected by the Doppler spread of 815 Hz. Further, the performance of (34) is better than that of CP-OFDM with genie receiver. This is because CP-OFDM fades more than (34) (see Fig. 17 and the discussion therein). Compared to the OFDM schemes, Zak-OTFS in DD and FD perform much better, thanks to the non-fading aspect for these schemes. The performance of DD and FD equalization is essentially the same, where in DD we used the regular LMMSE incurring complexity \({\mathcal{O}}({M}^{3}{N}^{3})\) and in FD we used the CGM described in section “Low-complexity MMSE equalization” which has complexity \({\mathcal{O}}(kbMN)\) (see section “Low-complexity MMSE equalization”).

At low Doppler spread (Fig. 5), (34) and CP-OFDM have similar performance for both receivers. This is expected since the Doppler spread is now 81.5 Hz and the subcarrier spacing for both CP-OFDM and (34) supports this Doppler spread. As seen before, the performance of Zak-OTFS with LMMSE equalizer in DD and CGM equalizer in FD is similar and better than CP-OFDM and (34).

Performance with estimated CSI

Here we present the results of the proposed receiver with estimated channel. For channel estimation, we consider a Zadoff-Chu sequence with root 101 in the DD domain as the spread pilot frame10. For all the simulations, we fix the data SNR (and data energy) and vary the pilot energy. The ratio of pilot energy to the data energy is defined to be the pilot to data ratio (PDR). To evaluate the channel estimation performance, we compute the normalized mean square error (NMSE) as:

$$NMSE=\frac{\parallel {\widehat{{\bf{h}}}}_{e\mathrm{ff}}-{{\bf{h}}}_{e\mathrm{ff}}{\parallel }_{2}^{2}}{\parallel {{\bf{h}}}_{e\mathrm{ff}}{\parallel }_{2}^{2}},$$
(1)

where heff is the effective channel matrix (see (11)) and \({\widehat{{\bf{h}}}}_{e\mathrm{ff}}\) is the estimated effective channel matrix using the spread pilot.

In Figs. 69 we compare the NMSE and BER performance of Zak-OTFS in DD and FD with estimated channel for RRC and Gauss-sinc pulse shapes, respectively. For Gauss-sinc pulse shape we fix the parameters to be ατ = αν = 0.04413 and the corresponding CGM parameters to be \(b=\lceil 5{\nu }_{\max }T\rceil\). The NMSE and BER performance with turbo iterations (described in section “Turbo Iterations”) is also added.

Fig. 6: NMSE performance with estimated channel.
Fig. 6: NMSE performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil {\nu }_{\max }T\rceil +1\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB, Veh-A channel with \({\nu }_{\max }=815\) Hz, RRC pulse shaping filter with βτ = βν = 0.6. NMSE improves with turbo iterations and achieve close to perfect CSI performance.

Fig. 7: BER performance with estimated channel.
Fig. 7: BER performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil {\nu }_{\max }T\rceil +1\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB, Veh-A channel with \({\nu }_{\max }=815\) Hz, RRC pulse shaping filter with βτ = βν = 0.6. BER improves with turbo iterations and achieve close to perfect CSI performance.

Fig. 8: NMSE performance with estimated channel.
Fig. 8: NMSE performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil 5{\nu }_{\max }T\rceil\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB, Veh-A channel with \({\nu }_{\max }=815\) Hz, Gauss-sinc pulse shaping with ατ = αν = 0.044. NMSE improves with turbo iterations and achieve close to perfect CSI performance.

Fig. 9: BER performance with estimated channel.
Fig. 9: BER performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. FD equalization (FDE) of Zak-OTFS via CGM algorithm with \(k=250,\epsilon =1{0}^{-6},b=\lceil 5{\nu }_{\max }T\rceil\). Simulation parameters: M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB, Veh-A channel with \({\nu }_{\max }=815\) Hz, Gauss-sinc pulse shaping with ατ = αν = 0.044. BER improves with turbo iterations and achieve close to perfect CSI performance.

For RRC pulse shaping in Fig. 6, the NMSE performance without turbo iterations decreases linearly with PDR for FD and DD equalization of Zak-OTFS. This is expected since increasing PDR corresponds to increasing the pilot energy. With 1 turbo iteration, the NMSE performance is seen to improve with the improvement being more significant with 5 turbo iterations. The performance of both FD and DD equalization of Zak-OTFS improves similarly with turbo iterations. BER performance for the RRC pulse is plotted in Fig. 7. Performance assuming perfect CSI is also added for comparison. As observed before in Fig. 4, the perfect CSI performance for both FD and DD equalizations match and is flat across PDR, since perfect CSI allows perfect pilot cancellation (see (56)). Without turbo iterations, the BER performance is first seen to decrease with PDR and then increase. This can be explained as follows. At very low PDR, the channel estimate has high NMSE and therefore leads to poor equalization and detection performance. As the PDR increases, the accuracy of the channel estimate improves, which results in better BER performance. However, as the PDR is further increased, although the NMSE improves, the energy of the residual pilot left behind after the pilot cancellation step (56) increases and this causes increased interference during data detection. Further since at each step the pilot cancellation is not exact, the BER performance is far from perfect CSI performance. However, with turbo iterations, due to repeated pilot and data cancellation steps, the performance significantly improves and achieves the perfect CSI performance for mid PDR values. Notice that with and without turbo, the performance of both FD and DD equalization of Zak-OTFS is similar. However, the NMSE in FD is slightly better than that in DD because of decreased interference between information symbols and pilot symbols (recall that there are only MN−2b information symbols in FD compared to MN information symbols in DD). NMSE and BER results plotted in Figs. 8 and 9, respectively, for Gauss-sinc pulse shaping also show a similar trend as in the case of RRC pulse shaping.

Impact of pulse shaping filter

Here we present the simulation results comparing different pulse shapes with perfect and estimated CSI. Figure 10 shows the performance of Zak-OTFS in FD using RRC, sinc, Gauss-sinc, and Gauss pulse shapes with perfect CSI. The CGM method in Algorithm 1 is used for equalization. The performance of sinc, RRC, and Gauss-sinc are observed to be similar. Gauss pulse shaping has degraded performance owing to its non-orthogonality13.

Fig. 10: Impact of pulse shaping filter choice on BER of FD equalization (FDE) of Zak-OTFS with perfect channel knowledge.
Fig. 10: Impact of pulse shaping filter choice on BER of FD equalization (FDE) of Zak-OTFS with perfect channel knowledge.
Full size image

FD equalization of Zak-OTFS via CGM algorithm with k = 250, ϵ = 10−6. Simulation parameters: M = 31, N = 37, νp = 30 kHz, Veh-A channel with \({\nu }_{\max }=815\) Hz. For RRC, \(b=\lceil {\nu }_{\max }T\rceil +1\) and βτ = βν = 0.6. For sinc, b = N + 1. For Gauss-sinc, \(b=\lceil 5{\nu }_{\max }T\rceil\) and ατ = αν = 0.044. For Gauss, \(b=\lceil {\nu }_{\max }T\rceil +1\) and ατ = αν = 1.584.

Figures 11 and 12 show the estimated CSI performance of both Zak-OTFS in FD and DD for sinc and and Gauss pulse shapes. The NMSE performance of sinc in Fig. 11 is seen to floor. This is because, the sinc pulse decays slowly and there is significant energy outside the fundamental rectangle that is not estimated13. On the other hand, the Gauss pulse has excellent NMSE performance since it is localized within the fundamental and the NMSE decreases linearly with PDR. The BER performance of Zak-OTFS in FD and DD for sinc and Gauss pulse shapes is plotted in Fig. 12. The perfect CSI performance is also plotted for comparison. The estimated CSI performance of sinc for both FD and DD equalization is far from perfect CSI performance. This is due to the poor NMSE achieved by the sinc pulse shape. Turbo iterations also do not help since, even with turbo, we estimate the channel only within the fundamental rectangle. Gauss pulse shape also has poor estimated CSI performance. This is attributed to non-orthogonality of the Gauss pulse shape that leads to poor equalization13. Again, for all these cases, the FD and DD equalization performances match, showing its equivalence under perfect CSI, estimated CSI, and different pulse shapes.

Fig. 11: NMSE performance with estimated channel.
Fig. 11: NMSE performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. Sinc pulse shaping and Gauss pulse shaping with aτ = aν = 1.584, M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB for sinc and 40 dB for Gauss, b = N + 1 for sinc and \(b=\lceil {\nu }_{\max }T\rceil +1\) for Gauss. Veh-A channel with \({\nu }_{\max }=815\) Hz. Sinc has poor NMSE, but Gauss has better NMSE. However, the Gauss pulse is not orthogonal and has poor equalization performance with estimated channel13.

Fig. 12: BER performance with estimated channel.
Fig. 12: BER performance with estimated channel.
Full size image

A Zadoff-Chu sequence with root 101 in the DD domain10 is used as a spread pilot. Sinc pulse shaping and Gauss pulse shaping with aτ = aν = 1.584, M = 31, N = 37, νp = 30 kHz, data SNR = 20 dB for sinc and 40 dB for Gauss, b = N + 1 for sinc and \(b=\lceil {\nu }_{\max }T\rceil +1\) for Gauss. Veh-A channel with \({\nu }_{\max }=815\) Hz. Sinc has poor NMSE that does not help with the bit-error performance. However, the Gauss pulse is not orthogonal and has poor equalization performance with estimated channel13.

Comparison with AFDM18,19

The basis used to modulate information symbols in the transfer domain in AFDM is18:

$${{\boldsymbol{\phi }}}_{m}[n]=\frac{1}{\sqrt{MN}}{e}^{\,j2\pi ({c}_{1}{n}^{2}+{c}_{2}{m}^{2}+\frac{mn}{MN})},$$
(2)

where \(m,n=0,1,\cdots \,,MN-1,{c}_{1},{c}_{2}\in {\mathbb{R}}\). n is the time index and m is the transfer domain index. Substituting in (16), the received symbols in the transfer domain is:

$$\begin{array}{rcl}\breve{{\bf{r}}}[\,f] & = & \frac{1}{MN}\mathop{\sum }\limits_{i=0}^{MN-1}\breve{{\bf{s}}}[i](\mathop{\sum }\limits_{\bar{k}=0}^{MN-1}\mathop{\sum }\limits_{\bar{l}=0}^{MN-1}{e}^{-\frac{j2\pi }{MN}\bar{l}\bar{k}}{\bf{h}}[\bar{k},\bar{l}]\\ & \times & \mathop{\sum }\limits_{n=0}^{MN-1}{e}^{{c}_{2}({f}^{2}-{i}^{2})+n\left(\frac{i-f}{MN}-2{c}_{1}\bar{k}\right)-\frac{i\bar{k}}{MN}+{c}_{1}{\bar{k}}^{2}})\end{array}$$
$$=\mathop{\sum }\limits_{i=0}^{MN-1}\breve{{\bf{s}}}[i]\breve{{\bf{H}}}[\,f,i],$$
(3)

and the system model is:

$$\breve{{\bf{r}}}=\breve{{\bf{H}}}\breve{{\bf{s}}}+\breve{{\bf{n}}},$$
(4)

where \(\breve{{\bf{s}}},\breve{{\bf{r}}},\breve{{\bf{n}}}\in {{\mathbb{C}}}^{MN\times 1}\) denote the vector of transmitted symbols, received symbols and noise samples, respectively, all in the transfer domain. Since \({\bf{h}}[\overline{k},\overline{l}]\) encompasses pulse shaping and fractional DD values, the system model in (4) also holds for various pulse shapes and channel models.

Choices of c1 and c2: There are two ways of choosing c1 and c2: \((i)\,{c}_{1}MN,{c}_{2}MN\in {\rm{{\mathbb{Z}}}}\), or \((ii)\,{c}_{1}MN,{c}_{2}MN\,\notin \,{\rm{{\mathbb{Z}}}}\).

Case-I: \({c}_{1}MN,{c}_{2}MN\in {\mathbb{Z}}\). The basis can be represented as:

$${{\boldsymbol{\phi }}}_{m}^{{\prime} }[n]=\frac{1}{\sqrt{MN}}{e}^{\,j2\pi (\frac{\alpha }{MN}{n}^{2}+\frac{\beta }{MN}{m}^{2}+\frac{mn}{MN})},$$
(5)

for some \(\alpha ,\beta \in {\mathbb{Z}}\).

Lemma 1

\({{\boldsymbol{\phi }}}_{m}^{{\prime} }[n]\) is a constant amplitude zero-autocorrelation (CAZAC) sequence.

Proof

Clearly \({{\boldsymbol{\phi }}}_{m}^{{\prime} }[n]\) is constant amplitude sequence. Consider the inner product:

$$\begin{array}{rcl}\langle {{\boldsymbol{\phi }}}_{m}^{{\prime} }[n],{{\boldsymbol{\phi }}}_{m}^{{\prime} * }[n-k]\rangle & = & \frac{1}{MN}\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{j2\pi (\frac{\alpha }{MN}{n}^{2}+\frac{\beta }{MN}{m}^{2}+\frac{mn}{MN})}\\ & \times & {e}^{-j2\pi (\frac{\alpha }{MN}{(n-k)}^{2}+\frac{\beta }{MN}{m}^{2}+\frac{m(n-k)}{MN})}\\ & = & \frac{1}{MN}\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{-j2\pi (\frac{\alpha }{MN}({k}^{2}-2nk)-\frac{km}{MN})}\\ & & \mathop{=}\limits^{(a)}\left\{\begin{array}{ll}1 & \mathrm{if}\,\,2k\equiv 0\,\mathrm{mod}\,MN\\ 0 & \mathrm{otherwise}\end{array}\right.,\end{array}$$
(6)

where step (a) follows from Identity 1. □

Conventionally, Zak-OTFS is realized through the pulsone basis. Zak-OTFS can also be realized through CAZAC basis28 and both these realizations are unitarily equivalent. For the values of c1 and c2 in Case-I, AFDM bases are CAZAC sequences. Hence, for this choice of c1 and c2, Zak-OTFS and AFDM are unitarily equivalent. Note that, prior papers on AFDM18,19 choose c1MN to be integer-valued.

Case-II: \({c}_{1}MN,{c}_{2}MN\,\notin \,{\rm{{\mathbb{Z}}}}\). In this case the basis is no longer a CAZAC sequence, since the exponent cannot be written as roots of unity. For the values of c1 and c2 in Case-II, AFDM and Zak-OTFS are not unitarily equivalent.

Figure 13 shows the heatmap of the absolute value of the self-ambiguity function when c1 is chosen to be a value in Case-II, i.e., \({c}_{1}MN=91.2\,\notin \,{\rm{{\mathbb{Z}}}}\) and m = 0. Figure 14 shows the heatmap when c1 is chosen to be a value in Case-I, i.e., \({c}_{1}MN=91\in {\mathbb{Z}}\) and m = 0. For estimating the channel in the DD domain it is desirable to have extremely localized self-ambiguity function and this is possible only when c1 is chosen to be a value in Case-I, i.e., when AFDM and Zak-OTFS are unitarily equivalent. AFDM can be regarded as a general case of Zak-OTFS, however, the generalization beyond Zak-OTFS makes channel estimation more challenging. Figure 15 compares the BER performance of AFDM and Zak-OTFS under perfect channel knowledge when c1 is chosen to be a value in Case-I. The BER curves overlap corroborating the above results.

Fig. 13: Heatmap of self-ambiguity of basis function of AFDM in dB scale.
Fig. 13: Heatmap of self-ambiguity of basis function of AFDM in dB scale.
Full size image

\(M=31,N=37,m=0,MN{c}_{1}=91.2\,\notin \,{\rm{{\mathbb{Z}}}}\).

Fig. 14: Heatmap of self-ambiguity of basis function of AFDM in dB scale.
Fig. 14: Heatmap of self-ambiguity of basis function of AFDM in dB scale.
Full size image

\(M=31,N=37,m=0,MN{c}_{1}=91\in {\mathbb{Z}}\).

Fig. 15: BER performance comparing the performance of Zak-OTFS and AFDM with \({c}_{1}MN,{c}_{2}MN\in {\mathbb{Z}}\) for Gauss-sinc pulse shaping with ατ = αν = 1.584, M = 11, N = 13, νp = 30 kHz.
Fig. 15: BER performance comparing the performance of Zak-OTFS and AFDM with 
                        $${c}_{1}MN,{c}_{2}MN\in {\mathbb{Z}}$$
                        
                          
                            
                              
                                c
                              
                              
                                1
                              
                            
                            M
                            N
                            ,
                            
                              
                                c
                              
                              
                                2
                              
                            
                            M
                            N
                            ∈
                            Z
                          
                        
                       for Gauss-sinc pulse shaping with ατ = αν = 1.584, M = 11, N = 13, νp = 30 kHz.
Full size image

Veh-A channel with \({\nu }_{\max }=815\) Hz. There is essentially no difference between AFDM and Zak-OTFS.

Methods

Before we describe the FD perspective of Zak-OTFS, we present a few preliminaries that are used throughout the paper.

Useful identities

Identity 1

29 The sum of all Nth roots of unity satisfies:

$$\begin{array}{l}\mathop{\sum }\limits_{n=0}^{N-1}{e}^{\frac{j2\pi }{N}kn}=\left\{\begin{array}{cc}N & \mathrm{if}\,\,k\equiv 0\,\mathrm{mod}\,N\\ 0 & \mathrm{otherwise}\end{array}\right..\end{array}$$

Zak-OTFS system model in discrete time

Transmitter: Consider an M × N DD grid with M delay bins (indexed by k) and N Doppler bins (indexed by l), corresponding to width τp along delay and width νp along Doppler with τpνp = 1. Given 0≤k0≤(M − 1), 0≤l0≤(N − 1), the basis element corresponding to the (k0, l0)th DD bin is \({\sum }_{n,m\in {\mathbb{Z}}}{e}^{j\frac{2\pi }{N}ln}\delta [k-{k}_{0}-nM]\delta [l-{l}_{0}-mN]\). The corresponding MN-length discrete time realization is given by the inverse discrete Zak transform11,30:

$${{\bf{p}}}_{({k}_{0},{l}_{0})}[n]=\frac{1}{\sqrt{N}}\mathop{\sum }\limits_{d\in {\mathbb{Z}}}{e}^{j\frac{2\pi }{N}d{l}_{0}}\delta [n-{k}_{0}-dM],$$
(7)

where 0≤n≤(MN − 1), which we refer to as a time domain (TD) pulsone. We refer the interested reader to2,3 for detailed discussions regarding the pulsone basis.

The pulsone basis is orthonormal, and MN DD domain information symbols \({\bf{X}}[{k}_{0},{l}_{0}]\in {{\mathbb{C}}}^{M\times N}\), 0 ≤ k0 ≤ (M−1), 0 ≤ l0 ≤ (N−1), can be transmitted by modulating different pulsones as:

$${\bf{x}}[n]=\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]{{\bf{p}}}_{({k}_{0},{l}_{0})}[n]\in {{\mathbb{C}}}^{MN\times 1}.$$
(8)

Note that \({{\bf{p}}}_{({k}_{0},{l}_{0})}[n]\), and hence x[n], is periodic with period MN. Therefore, x[n] can also be represented in terms of a time domain orthonormal basis \({{\boldsymbol{\phi }}}_{i}[n]\in {{\mathbb{C}}}^{MN\times 1}\) that converts symbols in the transfer domain to time domain as:

$${\bf{x}}[n]=\mathop{\sum }\limits_{i=0}^{MN-1}{\bf{s}}[i]{{\boldsymbol{\phi }}}_{i}[n],$$
(9)

where \({\bf{s}}[i]\in {{\mathbb{C}}}^{MN\times 1}\), for 0 ≤ i ≤ (MN − 1), is a vector in the transfer domain given by:

$${\bf{s}}[i]=\mathop{\sum }\limits_{n=0}^{MN-1}{{\boldsymbol{\phi }}}_{i}^{* }[n]{\bf{x}}[n].$$
(10)

Receiver: The (noiseless) received time domain signal is given by:

$${\bf{y}}[n]=\mathop{\sum }\limits_{k,l\in {\mathbb{Z}}}{{\bf{h}}}_{e\mathrm{ff}}[k,l]{\bf{x}}[n-k]{e}^{\frac{j2\pi }{MN}l(n-k)}\in {{\mathbb{C}}}^{MN\times 1},$$
(11)

where \({{\bf{h}}}_{e\mathrm{ff}}[k,l]\in {{\mathbb{C}}}^{{\mathbb{Z}}\times {\mathbb{Z}}}\) represents the effective channel in the DD domain.

Substituting \(k=\overline{k}+pMN\), \(0\le \overline{k}\le (MN-1)\), \(p\in {\mathbb{Z}}\), \(l=\overline{l}+qMN\), \(0\le \overline{l}\le (MN-1)\), \(q\in {\mathbb{Z}}\):

$$\begin{array}{rcl}{\bf{y}}[n] & = & \mathop{\sum }\limits_{\overline{k}=0}^{MN-1}\mathop{\sum }\limits_{\overline{l}=0}^{MN-1}\mathop{\sum }\limits_{p,q\in {\mathbb{Z}}}{{\bf{h}}}_{e\mathrm{ff}}[\overline{k}+pMN,\overline{l}+qMN]\\ & & {\bf{x}}[n-\overline{k}-pMN]{e}^{\frac{j2\pi }{MN}(\overline{l}+qMN)(n-\overline{k}-pMN)}.\end{array}$$
(12)

Since x[n] is MN-periodic, (12) can be simplified as:

$$\begin{array}{l}{\bf{y}}[n]=\mathop{\sum }\limits_{\bar{k}=0}^{MN-1}\mathop{\sum }\limits_{\bar{l}=0}^{MN-1}\mathop{\underbrace{\mathop{\sum }\limits_{p,q\in {\rm{{\mathbb{Z}}}}}{{\bf{h}}}_{\mathrm{eff}}[\bar{k}+pMN,\bar{l}+qMN]}}\limits_{{\bf{h}}[\bar{k},\bar{l}]}\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\times {\bf{x}}[n-\bar{k}]{e}^{\frac{j2\pi }{MN}\bar{l}(n-\bar{k})}.\end{array}$$
(13)

where \({\bf{h}}[\overline{k},\overline{l}]\in {{\mathbb{C}}}^{MN\times MN}\) denotes the MN-periodized version of the effective channel in the DD domain. Therefore,

$${\bf{y}}[n]=\mathop{\sum }\limits_{\overline{k}=0}^{MN-1}\mathop{\sum }\limits_{\overline{l}=0}^{MN-1}{\bf{h}}[\overline{k},\overline{l}]{\bf{x}}[n-\overline{k}]{e}^{\frac{j2\pi }{MN}\overline{l}(n-\overline{k})}.$$
(14)

Hence,

$$\begin{array}{rcl}{\bf{y}}[n] & = & \mathop{\sum }\limits_{\overline{k}=0}^{MN-1}\mathop{\sum }\limits_{\overline{l}=0}^{MN-1}{\bf{h}}[\overline{k},\overline{l}]{e}^{\frac{j2\pi }{MN}\overline{l}(n-\overline{k})}\\ & \times & \mathop{\sum }\limits_{i=0}^{MN-1}{\bf{s}}[i]{{\boldsymbol{\phi }}}_{i}[n-\overline{k}].\end{array}$$
(15)

Projecting y[n] on the basis \({{\boldsymbol{\phi }}}_{f}[n]\in {{\mathbb{C}}}^{MN\times 1}\), 0≤ f ≤ (MN − 1):

$$\begin{array}{l}\begin{array}{rcl}{\bf{r}}[f] & = & \mathop{\sum }\limits_{n=0}^{MN-1}{{\boldsymbol{\phi }}}_{f}^{* }[n]{\bf{y}}[n]\end{array}\\ \begin{array}{cl}\,\,\,\,\,\,\,\,= & \mathop{\sum }\limits_{i=0}^{MN-1}{\bf{s}}[i]\left(\mathop{\sum }\limits_{\bar{k}=0}^{MN-1}\mathop{\sum }\limits_{\bar{l}=0}^{MN-1}{e}^{-\frac{j2\pi }{MN}\bar{l}\bar{k}}{\bf{h}}[\bar{k},\bar{l}]\times \mathop{\sum }\limits_{n=0}^{MN-1}{{\boldsymbol{\phi }}}_{f}^{* }[n]{{\boldsymbol{\phi }}}_{i}[n-\bar{k}]{e}^{\frac{j2\pi }{MN}\bar{l}n}\right)\end{array}\\ \begin{array}{cl}\,\,\,\,\,\,\,\,= & \mathop{\sum }\limits_{i=0}^{MN-1}{\bf{s}}[i]{\bf{H}}[f,i],\end{array}\end{array}$$
(16)

where \({\bf{r}}[f]\in {{\mathbb{C}}}^{MN\times 1}\) is in the transfer domain.

Vectorized system model

Vectorizing (16) and including additive noise at the receiver, we obtain the system model:

$${\bf{r}}={\bf{H}}{\bf{s}}+{\bf{w}},$$
(17)

where \({\bf{r}},{\bf{s}},{\bf{w}}\in {{\mathbb{C}}}^{MN\times 1}\) are all in the transfer domain and \({\bf{H}}\in {{\mathbb{C}}}^{MN\times MN}\) is the channel matrix in the transfer domain.

In the next Subsection, we describe the FD perspective of Zak-OTFS by appropriately choosing ϕ[n].

An FD perspective of Zak-OTFS

In this Subsection, we study the FD input-output (I/O) relation of Zak-OTFS. Fig. 16 shows the block diagram of the FD input-output relation of Zak-OTFS. A high level description of the transceiver processing steps of the proposed FD equalization of Zak-OTFS is presented in Table 4.

Fig. 16: Block diagram of FD input-output relation of Zak-OTFS.
Fig. 16: Block diagram of FD input-output relation of Zak-OTFS.
Full size image

The information symbols \(\mathbf{X}\) are mounted on pulsonses \(\mathbf{p}_{(k_0, l_0)}\). The resulting vector \(\mathbf{x}\) is passed through a DFT block to obtain time domain transmit vector \(\mathbf{s}\). The vector is transmitted through a fractional DD channel \(\mathbf{H}\). At the receiver, the received symbols areequalized and DFZT is applied to obtain the DD domain symbols \(\hat{\mathbf{X}}\).

Table 4 Transceiver processing steps for FD equalization of Zak-OTFS assuming perfect channel knowledge

Transmitter Processing: To derive the FD input-output relation of Zak-OTFS, we use the inverse discrete Fourier transform (IDFT) basis:

$${{\boldsymbol{\phi }}}_{i}[n]=\frac{1}{\sqrt{MN}}{e}^{j\frac{2\pi }{MN}in}.$$
(18)

where the FD vector \({\bf{s}}[i]\in {{\mathbb{C}}}^{MN\times 1}\), for 0≤i≤(MN − 1), is given by:

$$\begin{array}{rcl}{\bf{s}}[i] & = & \mathop{\sum }\limits_{n=0}^{MN-1}{{\boldsymbol{\phi }}}_{i}^{* }[n]{\bf{x}}[n]\\ & = & \frac{1}{\sqrt{MN}}\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{-j\frac{2\pi }{MN}in}{\bf{x}}[n]\end{array}$$
$$\begin{array}{rcl} & = & \frac{1}{\sqrt{MN}}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{-j\frac{2\pi }{MN}in}{{\bf{p}}}_{({k}_{0},{l}_{0})}[n]\\ & = & \frac{1}{\sqrt{M}N}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]\mathop{\sum }\limits_{d\in {\mathbb{Z}}}{e}^{j\frac{2\pi }{N}d{l}_{0}}\times \\ & & \mathop{\sum }\limits_{n=0}^{MN-1}{e}^{-j\frac{2\pi }{MN}in}\delta [n-{k}_{0}-dM].\end{array}$$
(19)

Note that since 0 ≤ n ≤ (MN−1), the delta function δ[nk0dM] implies that 0 ≤ d ≤ (N − 1) for which n = k0 + dM. Therefore,

$$\begin{array}{rcl}{\bf{s}}[i] & = & \frac{1}{\sqrt{M}N}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]\mathop{\sum }\limits_{d=0}^{N-1}{e}^{j\frac{2\pi }{N}d{l}_{0}}{e}^{-j\frac{2\pi }{MN}i({k}_{0}+dM)}\\ & = & \frac{1}{\sqrt{M}}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]{e}^{-j\frac{2\pi }{MN}i{k}_{0}}\frac{1}{N}\mathop{\sum }\limits_{d=0}^{N-1}{e}^{j\frac{2\pi }{N}d({l}_{0}-i)}.\end{array}$$
(20)

From Identity 1, the inner summation over d vanishes unless \({l}_{0}\equiv i\,mod\,N\), when it takes the value N. Therefore,

$$\begin{array}{rcl}{\bf{s}}[i] & = & \frac{1}{\sqrt{M}}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}\mathop{\sum }\limits_{{l}_{0}=0}^{N-1}{\bf{X}}[{k}_{0},{l}_{0}]{e}^{-j\frac{2\pi }{MN}i{k}_{0}}{\mathbb{1}}\{{l}_{0}\equiv i\,mod\,N\}\\ & = & \frac{1}{\sqrt{M}}\mathop{\sum }\limits_{{k}_{0}=0}^{M-1}{\bf{X}}[{k}_{0},i\,mod\,N]{e}^{-j\frac{2\pi }{MN}i{k}_{0}}.\end{array}$$
(21)

The above relation is called the inverse discrete frequency Zak transform (IDFZT) [11, Chapter 8].

Receiver Processing: The inner summation over n in (16) for the discrete Fourier transform (DFT) basis in (18) simplifies to:

$$\begin{array}{rcl} & & \mathop{\sum }\limits_{n=0}^{MN-1}{{\boldsymbol{\phi }}}_{f}^{* }[n]{{\boldsymbol{\phi }}}_{i}[n-\bar{k}]{e}^{\frac{j2\pi }{MN}\bar{l}n}\\ & & =\frac{1}{MN}\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{-j\frac{2\pi }{MN}\,fn}{e}^{j\frac{2\pi }{MN}i(n-\bar{k})}{e}^{\frac{j2\pi }{MN}\bar{l}n}\\ & & =\frac{1}{MN}{e}^{-j\frac{2\pi }{MN}i\bar{k}}\mathop{\sum }\limits_{n=0}^{MN-1}{e}^{j\frac{2\pi }{MN}(i-f+\bar{l})n}.\end{array}$$
(22)

From Identity 1, the inner summation over n vanishes unless \(\overline{l}\equiv (f-i)\,mod\,MN\), when it takes the value MN. Therefore,

$${\bf{H}}[f,i]=\mathop{\sum }\limits_{\bar{k}=0}^{MN-1}{\bf{h}}[\bar{k},(f-i)\,{\text{mod}}\,MN]{e}^{-j\frac{2\pi }{MN}\,f\bar{k}}.$$
(23)

Unitary equivalence of FD and DD I/O relations

We now demonstrate that the FD I/O relation proposed above is unitarily equivalent to conventional DD domain I/O relation with the pulsone basis per (9), \({{\boldsymbol{\phi }}}_{i}[n]={{\bf{p}}}_{({k}_{0},{l}_{0})}[n]\), i = k0 + l0M, for which \({\bf{s}}[i]={\bf{X}}[{k}_{0},{l}_{0}]\in {{\mathbb{C}}}^{MN\times 1}\). The corresponding DD domain vectorized system model is:

$${{\bf{r}}}_{{\text{DD}}}={{\bf{H}}}_{{\text{DD}}}{{\bf{x}}}_{{\text{DD}}}+{{\bf{w}}}_{{\text{DD}}},$$
(24)

where \({{\bf{x}}}_{{\text{DD}}}=vec({\bf{X}})\). To relate the FD relation with the DD relation above, observe that the IDFZT in (21) may be expressed in terms of \({{\bf{x}}}_{{\text{DD}}}\) as:

$${\bf{s}}={\bf{R}}{{\bf{x}}}_{{\text{DD}}}\in {{\rm{{\mathbb{C}}}}}^{MN\times 1},$$
(25)
$${\bf{R}}={\bf{K}}({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})diag({\bf{q}})\in {{\mathbb{C}}}^{MN\times MN},$$
(26)
$${\bf{K}}=\mathop{\sum }\limits_{i=1}^{N}\mathop{\sum }\limits_{j=1}^{M}{{\bf{E}}}_{ij}^{\top }\otimes {{\bf{E}}}_{ij}\in {\{0,1\}}^{MN\times MN},$$
(27)
$${\bf{q}}=[\begin{array}{c}\vdots \\ {{\bf{q}}}_{l}\\ \vdots \end{array}],0\le l\le (N-1)\in {{\mathbb{C}}}^{MN\times 1},$$
(28)
$${{\bf{q}}}_{l}[k]={e}^{-j\frac{2\pi }{MN}lk}\in {{\mathbb{C}}}^{M\times 1},0\le k\le (M-1),$$
(29)

where \({{\bf{E}}}_{ij}\in {{\mathbb{R}}}^{N\times M}\) with the (i, j)th entry equal to one and other entries zero, \({{\bf{F}}}_{M}\in {{\mathbb{C}}}^{M\times M}\) is the M-point DFT matrix. The matrix R, therefore, represents the IDFZT operation.

Lemma 2

The IDFZT matrix R is unitary.

Proof

To prove the Lemma, we must show that \({{\bf{R}}}^{{\rm{H}}}{\bf{R}}={\bf{R}}{{\bf{R}}}^{{\rm{H}}}={{\bf{I}}}_{MN}\). We evaluate:

$$\begin{array}{l}\begin{array}{rcl}{{\bf{R}}}^{{\text{H}}}{\bf{R}} & = & {(diag({\bf{q}}))}^{{\text{H}}}{({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})}^{{\text{H}}}{{\bf{K}}}^{{\text{H}}}{\bf{K}}({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})diag({\bf{q}})\end{array}\\ \,\,\,\,\,\,\,\,\,\,\,\mathop{=}\limits^{(a)}{(diag({\bf{q}}))}^{{\text{H}}}({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M}^{{\text{H}}})({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})diag({\bf{q}})\\ \,\,\,\,\,\,\,\,\,\,\,\mathop{=}\limits^{(b)}{(diag({\bf{q}}))}^{{\text{H}}}diag({\bf{q}})\mathop{=}\limits^{(c)}{{\bf{I}}}_{MN},\end{array}$$
(30)

and similarly,

$$\begin{array}{l}\begin{array}{rcl}{{\bf{RR}}}^{{\text{H}}} & = & {\bf{K}}({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})diag({\bf{q}}){(diag({\bf{q}}))}^{{\text{H}}}{({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})}^{{\text{H}}}{{\bf{K}}}^{{\text{H}}}\end{array}\\ \,\,\,\,\,\,\,\,\,\,\,\mathop{=}\limits^{(d)}{\bf{K}}({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M})({{\bf{I}}}_{N}\otimes {{\bf{F}}}_{M}^{{\text{H}}}){{\bf{K}}}^{{\text{H}}}\\ \,\,\,\,\,\,\,\,\,\,\,\mathop{=}\limits^{(e)}{\bf{K}}{{\bf{K}}}^{{\text{H}}}\mathop{=}\limits^{(f)}{{\bf{I}}}_{MN},\end{array}$$
(31)

where steps (a) and (f) follow by observing that K is a permutation matrix and its inverse is its transpose, steps (b) and (e) follow from the orthonormality of the Fourier matrix, and steps (c) and (d) follow from the fact that the inverse of a complex phase is its conjugate. □

Lemma 3

The FD and DD domain I/O relations are unitarily eqareMNcarriers with carrier spacinguivalent.

Proof

From Lemma 2, (24) is equivalent to:

$$\begin{array}{l}\begin{array}{rcl}{{\bf{r}}}_{\text{DD}} & = & {{\bf{R}}}^{\text{H}}\mathop{\underbrace{{{\bf{RH}}}_{\text{DD}}{{\bf{R}}}^{\text{H}}}}\limits_{{\bf{H}}}\mathop{\underbrace{{{\bf{Rx}}}_{\text{DD}}}}\limits_{{\bf{s}}}+{{\bf{w}}}_{\text{DD}},\end{array}\\ \begin{array}{cl}\,\,\,\,\,\,\,= & {{\bf{R}}}^{\text{H}}{\bf{Hs}}+{{\bf{w}}}_{\text{DD}}.\end{array}\end{array}$$
(32)

Pre-multiplying (32) by R, we obtain (17):

$${\bf{r}}={\bf{R}}{{\bf{r}}}_{DD}={\bf{H}}{\bf{s}}+{\bf{w}},$$
(33)

where \({\bf{w}}={\bf{R}}{{\bf{w}}}_{DD}\). □

Embedding information symbols in FD

Instead of mounting the information symbols in the DD domain before conversion to the FD via the matrix R as in (32)–(33), an alternative is to mount information symbols directly in the FD, i.e.,

$${\bf{r}}={\bf{H}}{{\bf{x}}}_{DD}+{\bf{w}},$$
(34)

where \({\bf{H}}={\bf{R}}{{\bf{H}}}_{DD}{{\bf{R}}}^{{\rm{H}}}\) is the FD channel from (32).

Note that the input-output relation in (34) corresponds to a type of FD modulation, where there are MN carriers with carrier spacing Δf = B/MN in a single symbol of duration Ts = 1/Δf = MN/B.

We have described three system models in (17), (33), and (34). It is natural to ask which one of them is the best in terms of performance. In the next Subsection, we try to answer this question.

Recap of proposed methods

In (17), Zak-OTFS information symbols are mounted in the DD domain, which are transmitted through a DD channel. Equalization and symbol detection at the receiver also occurs in the DD domain.

In (33), Zak-OTFS information symbols are mounted in the DD domain, spreading the DD information symbols in the FD via the IDFZT matrix R, which are then transmitted through a FD channel. Equalization at the receiver occurs in the FD, followed by de-spreading the FD equalized symbols through the conjugate transpose of the IDFZT matrix, \({{\bf{R}}}^{{\rm{H}}}\) to move from the FD to DD. Subsequent symbol detection occurs in the DD domain.

In (34), information symbols are mounted in the FD domain, which are transmitted through a FD channel. Equalization and symbol detection at the receiver also occurs in the FD.

It is now natural to ask which method is better suited for time-varying fading channels. The fundamental difference between the methods is the choice of basis on which information symbols are mounted. In both FD and DD relations of Zak-OTFS, the basis used is the pulsone basis, while the DFT basis is used in (34). Therefore, the superiority of one method over another boils down to the choice of basis. We address this question next.

Which bases are non-selective?

As previously described, both FD and DD relations of Zak-OTFS mount information symbols on the pulsone basis (see (7)), whereas (34) mounts the information symbols on the IDFT basis (see (18)). In the following Lemma, we derive the condition for a basis to be non-selective. Subsequently, we show how the pulsone basis that enables both FD and DD relations of Zak-OTFS are non-selective (i.e., the received energy is constant across all carriers). On the other hand, the choice of the IDFT basis makes (34) and CP-OFDM selective (i.e., the received energy is a function of carrier index).

Lemma 4

An orthonormal basis ϕi[n] is non-selective if:

$${{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]={{\boldsymbol{\phi }}}_{j}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{j}^{* }[n-{k}_{1}],$$

for all 0 ≤ i, j ≤ (MN − 1) and 0 ≤ k1, k2 ≤ (M − 1), assuming the MN-periodized effective channel h[k, l] is supported within 0≤k≤(M − 1), 0≤l≤(N − 1).

Proof

If all carriers have equal received energy, the signal does not experience selectivity. Therefore, formally, the non-selectivity condition corresponds to:

$$({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[i,i]=({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[j,j],\forall 0\le i,j\le (MN-1),$$
(35)

that is, the value must be independent of the index (i or j).

Substituting (16) in (35) and simplifying:

$$\begin{array}{rcl} & & ({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[i,i]=\mathop{\sum }\limits_{{k}_{1},{k}_{2}}\mathop{\sum }\limits_{{l}_{1},{l}_{2}}{{\bf{h}}}^{* }[{k}_{1},{l}_{1}]{\bf{h}}[{k}_{2},{l}_{2}]{e}^{\frac{j2\pi }{MN}({k}_{1}{l}_{1}-{k}_{2}{l}_{2})}\times \\ & & \mathop{\sum }\limits_{{n}_{1},{n}_{2}}\left(\mathop{\sum }\limits_{f=0}^{MN-1}{{\boldsymbol{\phi }}}_{f}[{n}_{1}]{{\boldsymbol{\phi }}}_{f}^{* }[{n}_{2}]\right){{\boldsymbol{\phi }}}_{i}[{n}_{2}-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[{n}_{1}-{k}_{1}]\times \\ & & {e}^{\frac{j2\pi }{MN}({l}_{2}{n}_{2}-{l}_{1}{n}_{1})}.\end{array}$$
(36)

For an orthonormal basis, by definition, the inner summation over f evaluates to δ[n1n2]. Therefore, n1 = n2 and hence:

$$\begin{array}{rcl}({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[i,i] & = & \mathop{\sum }\limits_{{k}_{1},{k}_{2}}\mathop{\sum }\limits_{{l}_{1},{l}_{2}}{{\bf{h}}}^{* }[{k}_{1},{l}_{1}]{\bf{h}}[{k}_{2},{l}_{2}]{e}^{\frac{j2\pi }{MN}({k}_{1}{l}_{1}-{k}_{2}{l}_{2})}\times \\ & & \mathop{\sum }\limits_{n}{{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]{e}^{\frac{j2\pi }{MN}({l}_{2}-{l}_{1})n}.\end{array}$$
(37)

Notice that the summation over n is expansion along an orthonormal Fourier basis, and therefore \(({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[i,i]=({{\bf{H}}}^{{\rm{H}}}{\bf{H}})[j,j]\) implies that the summands must be equal. Thus, the non-selectivity condition in (35) is satisfied for all orthonormal bases ϕi[n] with the property:

$${{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]={{\boldsymbol{\phi }}}_{j}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{j}^{* }[n-{k}_{1}],$$
(38)

for all 0≤i, j, k1, k2 ≤ (MN − 1). If the MN-periodized effective channel h[k, l] is supported within the fundamental period 0≤ k ≤ (M − 1), 0 ≤ l ≤(N − 1), then (38) need only be satisfied for 0 ≤ k1, k2 ≤ (M − 1). □

Via Lemma 4, we now quantify which bases are non-selective.

Zak-OTFS in FD and DD: For Zak-OTFS in the FD and DD domains we consider the pulsone basis, \({{\boldsymbol{\phi }}}_{i}[n]={{\bf{p}}}_{({k}_{0},{l}_{0})}[n]\), i = k0 + l0M. From (7) we obtain:

$$\begin{array}{rcl} & & {{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]=\frac{1}{N}\mathop{\sum }\limits_{{d}_{1},{d}_{2}\in {\rm{{\mathbb{Z}}}}}{e}^{j\frac{2\pi }{N}({d}_{1}-{d}_{2}){l}_{0}}\times \\ & & \delta [n-{k}_{0}-{k}_{1}-{d}_{1}M]\delta [n-{k}_{0}-{k}_{2}-{d}_{2}M].\end{array}$$
(39)

For 0 ≤ k1, k2 ≤ (M − 1), the delta functions imply k1 = k2 and 0 ≤ d1 = d2 ≤ (N − 1). Therefore,

$${{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]=\delta [{k}_{1}-{k}_{2}],$$
(40)

which is independent of 0≤ i ≤ (MN − 1), i.e., Zak-OTFS does not exhibit selectivity.

FD I/O Relation in (34): In (34), we consider the IDFT basis in (18). We have:

$${{\boldsymbol{\phi }}}_{i}[n-{k}_{2}]{{\boldsymbol{\phi }}}_{i}^{* }[n-{k}_{1}]=\frac{1}{MN}{e}^{j\frac{2\pi }{MN}i({k}_{1}-{k}_{2})},$$
(41)

which depends on 0≤ i ≤ (MN − 1), i.e., it exhibits selectivity.

Similar analysis is possible also for CP-OFDM, but is not described due to space constraints.

Figure 17 shows the received energy per carrier (the squared column norm of the channel matrix) for Zak-OTFS, I/O relation in (34), and CP-OFDM, for a six-path Vehicular-A channel (Table 3) with 2.51 μs delay spread and 815 Hz Doppler spread. Consistent with the above discussion, we observe that the received energy per carrier remains constant for Zak-OTFS. On the other hand, the received energy per carrier varies for the I/O relation in (34) and CP-OFDM due to their fading characteristics. Note that the received energy is plotted in the dB scale. Therefore, although the variation for the I/O relation in (34) seems small (in dB), the variance of the received energy is significant in linear scale. Furthermore, the variation for CP-OFDM is larger than that of (34), suggesting greater resilience to Doppler of the latter. This is consistent with the results presented in Figs. 4 and 5 in section “Results and discussion”.

Fig. 17: Plot showing recieved energy per carrier as a function of carrier index.
Fig. 17: Plot showing recieved energy per carrier as a function of carrier index.
Full size image

Zak-OTFS is non-selective, whereas the FD I/O relation in (34) and CP-OFDM fade.

Remark

Why is the basis being non-selective important? For a modulation scheme with non-selective basis, the received energy is constant irrespective of the carrier index (Zak-OTFS in Fig. 17). This largely simplifies channel coding. On the other hand, if the basis is selective (CP-OFDM in Fig. 17), channel coding needs to be carried out across subcarriers since the received energy is a function of the subcarrier index. Such coding helps the symbols experience channels from different subcarriers and improves the SNR at the receiver. The encoding and decoding complexity of such codes is typically higher (for example, LDPC codes in OFDM).

In the next Subsection, we describe the proposed low complexity equalization and detection scheme that makes use of the FD perspective of Zak-OTFS.

Low complexity FD equalization and detection

To develop low complexity equalization & detection algorithms, we consider the FD channel matrix, H, in (33). The entries of H are given by (23):

$${\bf{H}}[\,f,i]=\mathop{\sum }\limits_{\bar{k}=0}^{MN-1}{\bf{h}}[\bar{k},(f-i)\,{\rm{m}}{\rm{o}}{\rm{d}}\,MN]{e}^{-\frac{j2\pi }{MN}\,f\bar{k}}.$$

Let \({\mathcal{S}}\) denote the support of the channel in h in an MN × MN region. For a given row index f, and i = 0, 1, , MN − 1, notice that as long as \((f-i)\,mod\,MN\) is within \({\mathcal{S}}\), the corresponding entry within H is non-zero. Therefore the matrix H has a modulo-banded structure. To demonstrate this, the absolute value of H is plotted in Fig. 18 for M = 3, N = 5. The sub- and super-diagonals fold back into the right-top and left-bottom corners, respectively, of the matrix. The non-zero values at the corners of the matrix disallow low complexity equalization. In the following Subsection, we describe a method to effectively handle these values and develop a low-complexity equalization algorithm.

Fig. 18: Heatmap of the FD channel matrix H, \({\rm{M}}=5, {\rm{N}=5}, {\nu}_{p} = 30\) kHz, and the Veh-A channel (Table 3).
Fig. 18: Heatmap of the FD channel matrix H, 
                        $${\rm{M}}=5, {\rm{N}=5}, {\nu}_{p} = 30$$
                        
                          
                            M
                            =
                            5
                            ,
                            N
                            =
                            5
                            ,
                            
                              
                                ν
                              
                              
                                p
                              
                            
                            =
                            30
                          
                        
                       kHz, and the Veh-A channel (Table 3).
Full size image

The heatmap shows modulo banded structure of the FD channel matrix.

Remark

The response of the channel to a tone (or pulse in the FD) is fixed irrespective of the modulation scheme considered. Estimating this response is possible as long as there is no ICI due to Doppler. However, under doubly-dispersive channels, the effective channel matrix is no longer diagonal (see Fig. 18). A pilot in the FD is only able to estimate the diagonal entries in the matrix, which do not capture the interaction between subcarriers due to Doppler (because of ICI) and leads to poor equalization and data detection performance. Here, we capture the I/O relation in the DD domain and move it to FD, so that we have the estimate of not just the diagonal values, but also off-diagonal values in the FD channel matrix.

Reduction to banded structure

To reduce the modulo-banded structure of H to an actual banded structure, we propose to zero out the initial and final few entries in s. However, directly nulling out the first and last few entries of s is not feasible due to the relation \({\bf{s}}={\bf{R}}{{\bf{x}}}_{DD}\), since the available degrees-of-freedom is only over \({{\bf{x}}}_{DD}\). To overcome this challenge, we propose the following approach.

Let b denote the width of the band on either side of the diagonal, referred to as spread width. For example, in Fig. 18, b = 2, which denotes the number of significant non-zero entries on either side of the diagonal. Notice that the significant values in the corner are also limited to the b × b square. We construct a matrix \({{\bf{R}}}^{{\prime} }\) as:

$${{\bf{R}}}^{{\prime} }=\left[\begin{array}{l}{{\bf{R}}}_{0:b-1}\\ {{\bf{R}}}_{MN-b:MN-1}\end{array}\right]\in {{\rm{{\mathbb{C}}}}}^{2b\times MN},$$
(42)

where R0:b−1 denotes the first b rows in R and RMNb:MN−1 denotes the last b rows in R. The rank of \({{\bf{R}}}^{{\prime} }\) is 2b and the dimension of the null space is (MN − 2b) (since the number of columns is MN).

Let \({\bf{N}}=null({{\bf{R}}}^{{\prime} })\in {{\mathbb{C}}}^{MN\times MN-2b}\) denote the null space of \({{\bf{R}}}^{{\prime} }\). We transmit the FD vector:

$${{\bf{s}}}^{{\prime} }={\bf{R}}{\bf{N}}{{\bf{x}}}_{DD}^{{\prime} }\in {{\rm{{\mathbb{C}}}}}^{MN\times 1},$$
(43)

where \({{\bf{x}}}_{DD}^{{\prime} }\) is an (MN − 2b) × 1 vector (although we lose spectral efficiency by using this approach, the spectral efficiency loss is still very small compared to CP in OFDM8 and guard symbols in an embedded pilot frame) of information symbols. With this approach, the system model becomes:

$${{\bf{r}}}^{{\prime} }={\bf{H}}{{\bf{s}}}^{{\prime} }+{\bf{w}}.$$
(44)

Our approach ensures that the first b and last b entries in \({{\bf{s}}}^{{\prime} }\) are zeros, and only the banded portion of the channel matrix H impacts the transmitted symbols.

Low-complexity MMSE equalization

To detect the information symbols, \({{\bf{r}}}^{{\prime} }\) in (44) is equalized using H. The banded structure of H enables the development of low-complexity equalization routines.

We consider the linear minimum mean square error (LMMSE) equalizer \({\bf{T}}\in {{\mathbb{C}}}^{MN\times MN}\) given by:

$${\bf{T}}={{\bf{H}}}^{{\rm{H}}}{({\bf{H}}{{\bf{H}}}^{{\rm{H}}}+{{\bf{R}}}_{n})}^{-1}={({{\bf{I}}}_{MN}+{{\bf{H}}}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1}{\bf{H}})}^{-1}{{\bf{H}}}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1},$$
(45)

where \({{\bf{R}}}_{n}\in {{\mathbb{C}}}^{MN\times MN}\) is the covariance matrix of noise, and H is the channel matrix. Inverting the matrix \(({{\bf{I}}}_{MN}+{{\bf{H}}}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1}{\bf{H}})\) in the absence of any structure incurs complexity \({\mathcal{O}}({M}^{3}{N}^{3})\).

However, since H is banded, T is also banded. For banded matrices, low-complexity LMMSE matrix computation is feasible31,32. In the following, we describe one feasible algorithm based on the CGM12.

Consider the LMMSE equalization operation:

$$\widetilde{{\bf{s}}}={{\bf{Tr}}}^{{\prime} }\,\Rightarrow \,\mathop{\underbrace{({{\bf{I}}}_{MN}+{{\bf{H}}}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1}{\bf{H}})}}\limits_{{\bf{Q}}}\widetilde{{\bf{s}}}=\mathop{\underbrace{{{\bf{H}}}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1}{{\bf{r}}}^{{\prime} }}}\limits_{{\bf{t}}}.$$
(46)

The linear system is of the form \({\bf{Q}}\widetilde{{\bf{s}}}={\bf{t}}\), where Q is Hermitian positive definite (Q 0) (a matrix A 0 if \({{\bf{x}}}^{{\rm{H}}}{\bf{Ax}} > 0\) for any non-zero x. \({{\bf{x}}}^{{\rm{H}}}{\bf{Q}}{\bf{x}}={{\bf{x}}}^{{\rm{H}}}{\bf{x}}+{({\bf{Hx}})}^{{\rm{H}}}{{\bf{R}}}_{n}^{-1}{\bf{Hx}} > 0\) since Rn 0). The CGM solves \({\bf{Q}}\widetilde{{\bf{s}}}={\bf{t}}\) by minimizing the quadratic energy:

$${\mathcal{J}}(\widetilde{{\bf{s}}})=\frac{1}{2}{\widetilde{{\bf{s}}}}^{{\rm{H}}}{\bf{Q}}\widetilde{{\bf{s}}}-{{\bf{t}}}^{{\rm{H}}}\widetilde{{\bf{s}}},$$
(47)

since this has a unique minimum at:

$$\nabla {\mathcal{J}}(\mathop{{\bf{s}}}\limits^{ \sim })={\bf{Q}}\widetilde{{\bf{s}}}-{\bf{t}}=0\,\Rightarrow \,\widetilde{{\bf{s}}}={{\bf{Q}}}^{-1}{\bf{t}},$$
(48)

which is the desired LMMSE output. This function in (47) is strictly convex when Q 0, and has a unique minimizer at \(\widetilde{{\bf{s}}}={{\bf{Q}}}^{-1}{\bf{t}}\). The conjugate gradient algorithm minimizes this function iteratively by moving along conjugate directions p(i), which are chosen so that \({{\bf{p}}}^{(i){\rm{H}}}{\bf{Q}}{{\bf{p}}}^{(j)}=0\) for ij. These directions are “Q-orthogonal”, meaning no search direction undoes the progress of a previous one. Next, the solution is updated via:

$${\widetilde{{\bf{s}}}}^{(i+1)}={\mathop{{\bf{s}}}\limits^{ \sim }}^{(i)}+\alpha {{\bf{p}}}^{(i)},$$
(49)

where α is chosen to minimize \({\mathcal{J}}(\widetilde{{\bf{s}}})\) along the direction p(i). Finally, the residual is updated via:

$${{\bf{c}}}^{(i)}={\bf{t}}-{\bf{Q}}{\widetilde{{\bf{s}}}}^{(i)},$$
(50)

which becomes orthogonal to all previous residuals:

$${{\bf{c}}}^{(i)H}{{\bf{c}}}^{(j)}=0\,\,{\rm{f}}{\rm{o}}{\rm{r}}\,\,i\ne j.$$
(51)

The steps of the iterative CGM are listed in Algorithm 1.

Algorithm 1

Iterative conjugate gradient algorithm

1: Inputs: Channel matrix H, received vector \({{\bf{r}}}^{{\prime} }\), covariance matrix of noise Rn, number of delay bins M, number of Doppler bins N, tolerance ϵ, maximum number of iterations \(\max \_iter\)

2: Initialize:\(\breve{{\bf{H}}}={{\bf{H}}}^{{\rm{H}}}{\bf{H}}\), \({\bf{b}}={{\bf{H}}}^{H}{{\bf{r}}}^{{\prime} }\), \({\widetilde{{\bf{s}}}}^{(0)}={{\bf{0}}}_{MN\times 1}\), \({{\bf{c}}}^{(0)}={\bf{b}}-\breve{{\bf{H}}}{\widetilde{{\bf{s}}}}^{(0)}-{{\bf{R}}}_{n}{\widetilde{{\bf{s}}}}^{(0)}\), p(0) = c(0), \({c}_{norm}^{(0)}=\parallel {\bf{c}}{\parallel }_{2}^{2}\)

3: for \(i=1:\max \_iter\)do

4: \({{\bf{a}}}_{{\bf{p}}}=\breve{{\bf{H}}}{{\bf{p}}}^{(i-1)}+{{\bf{R}}}_{n}{{\bf{p}}}^{(i-1)}\)

5: \(\alpha =\frac{{c}_{norm}^{(i-1)}}{{{\bf{p}}}^{H}{{\bf{a}}}_{{\bf{p}}}}\)

6: \({\widetilde{{\bf{s}}}}^{(i)}={\widetilde{{\bf{s}}}}^{(i-1)}+\alpha {{\bf{p}}}^{(i-1)}\)

7: c(i) = c(i−1)αap

8: \({c}_{norm}^{(i)}=\parallel {{\bf{c}}}^{(i)}{\parallel }_{2}^{2}\)

9: if\({c}_{norm}^{(i)} < {\epsilon }^{2}\)then

10: break

11: end if

12: \({{\bf{p}}}^{(i)}={{\bf{c}}}^{(i)}+\frac{{c}_{norm}^{(i)}}{{c}_{norm}^{(i-1)}}{{\bf{p}}}^{(i-1)}\)

13: \({c}_{norm}^{(i-1)}={c}_{norm}^{(i)}\)

14: end for

15: return \({\widetilde{{\bf{s}}}}^{(i)}\)

Complexity: The complexity involved in each iteration of the conjugate gradient algorithm in Algorithm 1 is presented in Table 5. The complexity is dominated by the Step 4 in Algorithm 1 which is of the order of bMN. The overall complexity of the iterative conjugate gradient algorithm is therefore \({\mathcal{O}}(kbMN)\), where k is the number of iterations before convergence. Setting \(k=\max \_iter\) gives the worst case complexity of Algorithm 1. On the other hand, the DD domain Zak-OTFS channel matrix does not have a banded structure and this incurs complexity \({\mathcal{O}}(kM{N}^{2})\)12.

Table 5 Complexity per iteration of the iterative conjugate gradient algorithm

Information symbol detection

Let \(\widetilde{{\bf{s}}}\) denote the vector post-equalization, i.e., \(\widetilde{{\bf{s}}}={\bf{T}}{{\bf{r}}}^{{\prime} }\). Information symbols are detected via minimum distance decoding:

$${\hat{{{\bf{x}}}^{{\prime} }}}_{{\mathsf{D}}{\mathsf{D}}}\,[q]\,=\,\mathop{{\arg}\,{\min}}\limits_{{\rm{a}}\in {\rm{A}}}|({{\bf{N}}}^{{\mathsf{H}}}{{\bf{R}}}^{{\mathsf{H}}}\tilde{{\bf{s}}})[q]-a|,$$
(52)

where q = 0, 1, , (MN − 2b − 1), and \({\mathcal{A}}\) is the constellation (e.g., 4-QAM) from which information symbols are drawn.

The discussion presented up to this point assumes perfect channel knowledge. However, this is not true in practice. In the next Subsection we present a channel estimation scheme to estimate the FD channel matrix.

Practical channel estimation

Channel estimation is a crucial aspect of any practical system. In this paper, we consider the spread pilot9,10 for channel estimation.

The block diagram of the channel estimation algorithm is presented in Fig. 19. At the transmitter, along with the information symbols (For ease of exposition, in this description, we ignore the step of mounting the information symbols on the null space of \({{\bf{R}}}^{{\prime} }\) presented in section “Reduction to banded structure”. However, for all the simulations, we do perform this step as \(\sqrt{{e}_{p}}{{\bf{x}}}_{s,DD}+\sqrt{{e}_{d}}{\bf{N}}{{\bf{x}}}_{DD}^{{\prime} }\)) in \({{\bf{x}}}_{DD}\in {{\mathbb{C}}}^{MN\times 1}\) the spread pilot \({{\bf{x}}}_{s,DD}\in {{\mathbb{C}}}^{MN\times 1}\) is added at each location, i.e., the spread pilot is superimposed with the information symbols. The DD domain vectors are then transformed to the FD using the IDFZT matrix R. The received vector is converted back to the DD domain using the inverse of the IDFZT matrix, \({{\bf{R}}}^{H}\). The effective system model is:

$${{\bf{y}}}_{DD}={{\bf{R}}}^{H}{\bf{H}}{\bf{R}}(\sqrt{{e}_{d}}{{\bf{x}}}_{DD}+\sqrt{{e}_{p}}{{\bf{x}}}_{s,DD}),$$
(53)

where ep and ed denote the energy in the pilot and data symbols, respectively. At the receiver, the model-free estimate of the effective channel \({\widehat{{\bf{h}}}}_{e\mathrm{ff}}\) (in (11)) is obtained through the cross-ambiguity function between \({{\bf{y}}}_{DD}\) and \({{\bf{x}}}_{s,DD}\)9,10:

$$\begin{array}{rcl}{\widehat{{\bf{h}}}}_{e\mathrm{ff}}[k,l] & = & \mathop{\sum }\limits_{{k}^{{\prime} }=0}^{M-1}\mathop{\sum }\limits_{{l}^{{\prime} }=0}^{N-1}{{\bf{y}}}_{DD}[{k}^{{\prime} },{l}^{{\prime} }]{{\bf{x}}}_{s,DD}^{* }[{k}^{{\prime} }-k,{l}^{{\prime} }-l]\times \\ & & {e}^{-j\frac{2\pi }{MN}{l}^{{\prime} }({k}^{{\prime} }-k)}.\end{array}$$
(54)

Next the estimated channel matrix \({\widehat{{\bf{H}}}}_{DD}\) (see (24)) is constructed using the estimated \({\widehat{{\bf{h}}}}_{e\mathrm{ff}}\). An estimate of the channel matrix in (33), \(\widehat{{\bf{H}}}\) is obtained as (from (32)):

$$\widehat{{\bf{H}}}={\bf{R}}{\widehat{{\bf{H}}}}_{DD}{{\bf{R}}}^{H}.$$
(55)

For data detection, the pilot is removed from the received frame using the estimated channel as:

$${{\bf{y}}}_{DD,data}={{\bf{y}}}_{DD}-\sqrt{{e}_{p}}{\widehat{{\bf{H}}}}_{DD}{{\bf{x}}}_{s,DD}.$$
(56)

The data-only frame is then converted to FD using the IDFZT matrix and low-complexity equalization and detection is performed as described in section “Low complexity FD equalization and detection” to obtain the detected information symbols \({\widehat{{\bf{x}}}}_{DD}\).

Fig. 19: Block diagram representing the flow of the channel estimation algorithm.
Fig. 19: Block diagram representing the flow of the channel estimation algorithm.
Full size image

At the transmitter, information and pilot symbols aresuperimposed and multiplied with the IDFZT matrix \(\mathbf{R}\). The resulting symbols are transmitted over a channel and DFZT iscomputed at the receiver. Using the spread pilot, the channel is estimated and the estimated channel is used to equalize the receivedsymbols. Next the pilot symbols are removed and the data symbols are detected using the estimated channel. For the turboiterations, a pilot only frame is obtained by removing data symbols and subsequently obtaining another channel estimate.

Turbo iterations

Note that, in (56), the estimated channel matrix is used to remove the pilot. However, since the estimate is obtained in the presence of interference from data symbols, the estimate is noisy and leads to inefficient pilot removal. This affects the detection performance. To improve the performance, we perform turbo iterations which alternate between data detection and channel estimation as described below.

The estimated data is removed from the received frame as:

$${{\bf{y}}}_{{\text{DD}},{\text{pil}}}={{\bf{y}}}_{{D}{D}}-\sqrt{{e}_{d}}{\widehat{{\bf{H}}}}_{{\text{DD}}}{\widehat{{\bf{x}}}}_{{\text{DD}}},$$
(57)

to obtain the pilot-only frame. Channel estimation in (54) is performed by replacing \({{\bf{y}}}_{DD}\) with \({{\bf{y}}}_{DD,pil}\) which leads to a better estimate on account of reduced interference from data symbols. The estimated channel matrices are reconstructed as shown in (55). The data only frame is obtained through (56) and data equalization and detection is performed on this frame. The detected data is used in (57) and this procedure is repeated a fixed number of times. Better channel estimation accuracy aids detection accuracy which in turn aids channel estimation accuracy. Therefore the performance of the estimation algorithm improves with turbo iterations.