Introduction

Beamforming and beam-steering of radio, optical waves, and acoustic wave are critically needed for many applications such as high-capacity radio wireless communication1,2, optical wireless communication3,4, radar5, light detection and ranging e.g., LiDAR6, arrayed ultrasonic equipment7, and sonar8. Traditionally, the electromagnetic wave is radiated (from a transmitter) or collected (by a receiver) to/from all directions. No matter whether there is an object of interest in a specific direction, the energy is transmitted in/collected from a wide range of directions. By spatially focusing (beamforming) the radio/optical signal to a specified direction, the energy required for realizing a function e.g., data transmission, data detection, energy transfer, can be efficiently reduced9. Moreover, the spatial interference phenomenon between objects radiating/receiving can be minimized by beamforming10. It yields, 1) a better spatial de-multiplexing of parallel data streams, enabling a higher data capacity9,11,12,13; 2) a better determination of angular reflection reducing multipath effects which may deteriorate system performance14. Using the same total transmit power from multiple antennas, the beamforming can provide a stronger received signal since the signals from different transmitter antennas are coherently summed up. To lead the focused wave to a desired direction, a beam steering function is thus required together with beamforming. The most popular and powerful scheme to enable beam-steering is the well-known phased array antenna (PAA), which is an array of antenna elements that are driven by signals with well-determined phase relations between those elements15. Here an antenna refers to a radiator of waves that can be regarded as propagating phase fronts, hence including electromagnetic waves in both the light and non-light, non-ionized spectrum, and acoustic waves.

Traditionally, a 1-by-N phased array antenna requires N-1 phase shifters or other active control units. Many schemes are proposed to reduce the number of phase shifters, but other kinds of active control units should be introduced as alternative16,17,18,19. Table 1 shows the state-of-art regarding the number reduction of phase shifters in a PAA. Existing approaches, such as subarray decomposition20,21,22,23,24,25,26,27, and vector synthesis28,29,30,31,32, reduce phase shifter counts but introduce trade-offs in beam-steering range with the same antenna array. Specifically, the subarray compression was first proposed in 197420. After that, many scientists started to study the optimal subarray division methods, such as random subarray size22, the interwoven network for subarray23, and recursive algorithm for subarray division24. Work in Rupakula et al.24, using 64 phase shifters and 64 adjustable amplifiers, and control the 16×16 array to complete a \(\pm 20^\circ\) beam scan. And work in Juarez et al.26 using an asymmetric distribution combination feeding network to achieve 36° scan angle. The earliest vector synthesis phased array was proposed by Amir Mortazawi in 201028. Although the amount of the phase shifter is reduced, each array element is still followed by a tuneable amplifier control. In this study, eight antenna array elements can complete a 25° scan with two-phase shifters and eight tunable amplifiers. Based on this theory, Amir led his group to conduct several studies31,32. The latest research results of their group have successfully controlled the eight antenna array elements to complete a \(\pm 90^\circ\) scan in \(Ku\)-band using only four adjustable phase shifters31.

Table 1 State-of-art to reduce phase shifters in PAA

As shown above, our work achieves the highest phase-shifter reduction (up to 87.5% fewer phase shifters) while still supporting a ~ 30° total scan range. In other words, each phase shifter in our SVD-based design effectively controls 8 antenna elements – double the grouping factor of most prior subarray or vector-synthesis techniques (which typically grouped 4 elements per phase shifter for ~75% reduction). This represents a substantial improvement over earlier efforts, which generally achieved at most 50–75% phase shifter count reduction for similar scan angles. Furthermore, our method provides a general theoretical framework (based on singular value decomposition) that is inherently scalable to large arrays.

In this work, we propose a Cascaded Angle Offset PAA (CAO-PAA) and its Dimensionality-Reduced variant (DRCAO-PAA). As shown in Fig. 1, unlike previous approaches, DRCAO-PAA treats the beamforming weight space as a matrix and applies Singular Value Decomposition (SVD) to compress its rank, effectively reducing active controller count. We further optimize the system using Particle Swarm Optimization (PSO) and Transformer-based models to construct an efficient set of basis vectors for beam steering.

Fig. 1: Dimensionality Reduced Antenna Array(DRAA) Concept:The central of figure shows the beamforming implemented by using DRAA; the bottom right corner displays the method for compressing the number of phase shifters using SVD.
Fig. 1: Dimensionality Reduced Antenna Array(DRAA) Concept:The central of figure shows the beamforming implemented by using DRAA; the bottom right corner displays the method for compressing the number of phase shifters using SVD.
Full size image

Dimensionality Reduced Antenna Array Concept.

We validate this approach experimentally: an 8-element array achieves continuous 0–30° beam steering using only three pairs of phase shifters and VGAs, and a 4-element array demonstrates 8° continuous beam sweep with just two phase shifters and two amplifiers in an anechoic chamber. These results demonstrate substantial hardware simplification without sacrificing steering precision.

Our DRCAO-PAA framework not only unifies existing methods under a generalizable mathematical model but also enables low-cost, low-complexity phased arrays, well-aligned with the demands of 6 G massive MIMO and satellite communications.

Results

Cascaded angle offset phased array antenna (CAO-PAA)

Figure 2 illustrates the basic idea of a cascaded angle offset phased array in an intuitive way. Each dashed circle denotes the initial phase of continuous waves at the same carrier frequency. The angles of the arrows denote the value of phases (angle offsets). In the first row, a cascaded angle offset group (CAO-G) is generated with the phase offset of the n-th phase shifter denoted as \(90^\circ -(n-1)\times 30^\circ\). Similarly, the second row denotes another CAO-G generated with: \(90^\circ +(n-1)\times 30^\circ\).

Fig. 2: The basic concept of cascaded angle offset phased array.
Fig. 2: The basic concept of cascaded angle offset phased array.
Full size image

The green and red arrows represent two cascaded angle offset groups. The black arrows represent the combination of the weighted cascaded angle offset groups. Arrows represent phase offsets, with direction indicating phase value and length proportional to amplitude ratios determined by power combiners.

The continuous waves with the two CAO-G are then combined by summing them with equal/unequal power ratios as shown in the third/fourth rows. The resulting combined signals are represented by the black arrows. The combined continuous waves are then launched to a linear array of antennas. As shown in the third row of Fig. 2, the power ratio is 1:1, which yields that the generated waves only have two angle offsets, namely \(\pm {90}^{\circ }\). As shown in the fourth row, when the power ratio is changed, the combined phases will be tuned quasi-linearly, which fits a linear phased antenna array.

To optimize or replace the current phased array system, it is necessary to explore the theoretical framework of CAO-PA to stretch the innate advantages of the synthesis. Matrix expression and operation provide appropriate solutions.

In this way, the beam directions of a phased array antenna with element spacing \(d\) can be determined by a \(1\times N\) linear phase shift group with phase difference \(\Delta {\varphi }_{m}\). For a beam direction at θm, the corresponding phase shifts at the n-th elementary antenna \({\varphi }_{n}\) of a PAA can be expressed as33,34:

$${\varphi }_{n}=(n-1)\left(\frac{2\pi }{\lambda }\right)\times d\times sin{\theta }_{m}=(n-1)\times \Delta {\varphi }_{m}$$
(1)

For the phase shifts of all elementary antennas, the phase shift distribution can be as an N×M matrix:

$${\Phi }_{N\times M}=\left[\begin{array}{cccc}0 & 0 & \cdots & 0\\ \Delta {\varphi }_{1} & \Delta {\varphi }_{2} & \cdots & \Delta {\varphi }_{M}\\ 2\Delta {\varphi }_{1} & 2\Delta {\varphi }_{2} & \cdots & 2\Delta {\varphi }_{M}\\ 3\Delta {\varphi }_{1} & 3\Delta {\varphi }_{2} & \cdots & 3\Delta {\varphi }_{M}\\ \vdots & \vdots & \ddots & \vdots \\ (n-1)\Delta {\varphi }_{1} & (n-1)\Delta {\varphi }_{2} & \cdots & (n-1)\Delta {\varphi }_{M}\end{array}\right]$$
(2)

Each vector \({\Phi }_{N\times 1}^{M}\) can be decomposed into a linear combination of R vectors (\({\varPhi }_{N\times R}\)) that can be referred to as the basis of \({\varPhi }_{N\times M}\) where\(R\) is the rank of \({\varPhi }_{N\times M}\). Usually, the number of scanning directions \(M\) is much larger than the number of elementary antennas\(N\). Therefore,\(R\) is not larger than\(N\). Particularly, when \(R\) is equal to \(N\), one special basis is the identity matrix. It is the case of a traditional phased array, where the phase and amplitude of the radiated signal at each antenna are independently controlled. For the rest, the number of active controllers can be reduced with the basis\({\varPhi }_{N\times R}\). Moreover, regardless of some weights of the whole basis, the whole system can be further simplified with a new fungible basis\({\varPhi }_{N\times \varLambda }\), where \(\Lambda\) is a little bit less than\(R\). In this way, a reasonable fungible basis will give rise to the performance penalty which can be reduced accuracy of beam directions or reduced antenna gain in an acceptable range.

Dimensionality reduced CAO-PAA

To achieve the goal of stretching the innate advantages of CAO-PAA, a matrix compression method is applied to simplify the basis of \({\varPhi }_{N\times M}\), avoiding the redundancy of active controllers. Assume that A is an M×N matrix indicating the input signals of the array elements. Due to the relations, each row vector denotes one combination of amplitude and phase in an\(N\)-element antenna array, which indicates one unique direction. With the singular value deposition (SVD)35, the matrix A can be expressed as

$${{\rm{A}}}={{{\rm{USV}}}}^{\ast },$$
(3)

where U is a \(M\times M\) unitary matrix, S is a \(M\times N\) rectangular diagonal matrix with non-negative real numbers on the diagonal and \(V\) is a \(N\times N\) unitary matrix. The number of non-zero singular values is equal to the rank of \(A\). Because of the non-uniqueness of SVD, it is possible to choose the decomposition with the singular values in descending order. The singular values are always ordered from largest to smallest. Once a singular value is found to be less than an arbitrarily small number \(\varepsilon\), then the remaining singular values will be replaced by 0 to generate the reasonable fungible basis that is discussed above. The rank of the new diagonal matrix \(\Lambda\) equals the number of non-zero terms in the matrix. The assessment of the singular value can be adjusted by the accuracy of the system. In this way, a compressed matrix \(B\) can be expressed with proper rank as

$${{\rm{B}}}={{{\rm{U}}}\Lambda {{\rm{V}}}}^{\ast }.$$
(4)

With the desired rank, the difference between \(A\) and \(B\) can be limited to a small range, which minimizes the performance penalty.

After matrix compression, the linear vector group(LIVG), which can be determined by matrix\(B\), is used to describe the amplitude and phase distributions. Without loss of generality, assume that the rank of \(B\) is equal to\(R\). Due to the features of the matrix, \(p\) bases can be chosen to express the rest \(M-R\) row vectors in matrix\(B\). Then all directions from M can be expressed in a linear superposition of the vectors in the LIVG matrix.

For any direction in the matrix \(B\), there is always a linear vector \(K\), which denotes the linear combination of LIVG. Assume that \(K\) is a column vector, and the relation of the linear combination is given by:

$${{\rm{CK}}}={{\rm{P}}}^{\prime} ,$$
(5)

where \(C\) is the LIVG matrix with full a rank, and \(P{\prime}\) is the selected row vector in matrix\(B\). Due to the full rank of the augmented matrix in Eq.(11), \(K\) has a solution from the system of linear equations. Therefore, based on one LIVG, any direction corresponds to one unique K. Furthermore, all the elements in \(K\) are in form of \({A}_{m}{{{\rm{e}}}}^{j\varphi }\), where \({A}_{m}\) is the amplitude, and \(\varphi\) is the phase. The practical function of \(K\) can be realized by \(R\) amplifiers and \(R\) phase shifters. In this way, the complexity of the \(M\times N\) matrix in the whole system can be decreased by using a LIVG and \(R\) active controllers.

Simulation for dimensionality reduced CAO-PAA in beam steering

Phase shifts and amplitude weights are applied to a 1-D linear evenly spaced phased array antenna array to evaluate the beam steering performance utilizing the proposed DRCAO-PAA concept.

The simulation results in Fig. 3(a–g) show that beam steering from 0 to 30° can be realized with just 16 active controllers in controlling a \(1\times 128\) array. The phase shifts of each element of the one-dimensional 128 array \(N=128\) for 128 different scan angles \({\theta }_{m}\) (\(0 < {\theta }_{m} < {30}^{\circ }\)) are shown in Fig. 3(a). A grayscale map is used to depict it; for instance, a grayscale of 0 (black) indicates −180° phase shifts, whereas a grayscale of 1 (white) indicates +180°. After SVD compression, the initial 128 eigenvalues are reduced to 16 eigenvalues, and the phase shifts matrix is reconstructed as shown in Fig. 3(b).

Fig. 3: Simulation results for 30° scan with N = 128 and the linear vector group(LIVG) with 16 vectors.
Fig. 3: Simulation results for 30° scan with N = 128 and the linear vector group(LIVG) with 16 vectors.
Full size image

a Gray scale map of the phase shifts in array N = 128 elements for M = 128 angles; b Gray scale map of the phase shifts in array N = 128 for M = 128 angles after singular value decomposition (SVD) compression; c Normalized far-field pattern for LIVG consists of equally spaced vectors; d Normalized Ideal far-field pattern from 0° to 30°; e Reconstructed matrix normalized far-field pattern from 0° to 30°; f Magnitude of K factors for each LIVG vectors; g Phase of K factors for each LIVG vectors.

To illustrate the process of dimensionality reduced antenna array in beam steering in more detail, the beam steering simulation was performed with 16 vectors of the reconstructed matrix selected equally spaced. These 16 vectors were used to compute and synthesize a far-field pattern, which is shown in Fig. 3(c). Comparing the normalized far-field pattern calculated with (Fig. 3(e)) and without (Fig. 3(d)) eigenvalue compression matrix, the mean absolute error of the beam pointing is only within 1.3°. This indicates that there is very little beam pointing loss due to the distortion appeared in Fig. 3(b).

The simulation analysis shows that it is possible to choose vectors at equal intervals to create LIVG. However, different vector selection functions will affect the factors of \(K\). Figure 3(f, g) illustrates the amplitude and phase of \(K\) while scanning 128 various angles, which indicate the magnification required by the amplifier and the range of the phase shifter to be adjusted. Especially the amplifier multiplier, the maximum value exceeds 40. The required phase shift is evenly distributed between [−180°, 180°]. Therefore, LIVG algorithms are proposed to optimize the LIVG building.

LIVG building in DRCAO-PAA

From theoretical analysis there is infinite sets of solutions LIVG matrix, however, only part of them can be used in applications to avoid obtaining improper K with an high amplification factor. Considering the cost and benefit of an array in applications, the amplifier gain is constrained, and the phase shift precision is also limited. Thus, the establishment of LIVG affects the performance of the DRCAO-PAA beam steering.

The most straightforward and effective way is to directly choose \(R\) linearly uncorrelated vectors in the \(M\times N\) matrix to build the LIVG such as 16 equally spaced vectors displayed in Fig. 3(c). In order to find out the proper LIVG, particle swarm optimization (PSO) is introduced in our algorithm27. An optimization function is provided to PSO to minimize the performance penalty in beam steering:

$$ \min \!{{\rm{imize}}}\left({\sum }_{m=1}^{M}({{\rm{abs}}}(Am{p}_{ideal,m}-Am{p}_{svdcompr,m})) \right. \\ \left. + (abs(Angl{e}_{ldeal,m}-Angl{e}_{svdcompr,m}))\right)/2M Subject \, to \max (abs(K)) < 20$$
(6)

At \(mth\) scan step, \(Am{p}_{idear,m}\),\(Angl{e}_{ldear,m}\) represent the main lobe magnitude and pointing angle of the ideal far-field pattern; \(Am{p}_{svdcompr,m}\), \(Angl{e}_{ldear,m}\) represent the main lobe magnitude and pointing angle of the LIVG reconstructed far-field pattern. The goal of the optimization is to minimize the average difference between the ideal far-field pattern and the LIVG reconstructed far-field pattern in terms of main lobe beam pointing and magnitude. The constraint is that the amplifier gain needs to be less than 20 ( ~ 13 dB). This approach minimizes errors in beam pointing and main lobe amplitude when performance penalties are inevitable, to enhance DRCAO-PAA ‘s beam steering performance.

As illustrated in Fig. 4(a–h), the process of obtaining the optimal LIVG is demonstrated by using the PSO algorithm. The simulation is based on 16 elements (\(N=16\)) line phased array with a maximum scan angle of 30°, and the number of scanning steps is 128 (\(M=128\)). Figure 4(a) displays the weights for the 16 array elements as they are scanned at various angles. An ideal linear antenna array could synthesize beams at a specific angle based on the weights of each row, as shown in Fig. 4(b). Considering the performance penalty, an appropriate degree of compression should be set after SVD. In this case, the first four eigenvalues make up 92.87% of the weight of the total eigenvalues.

Fig. 4: Schematic diagram LIVG Building and Optimization: PSO case in scan angle 0° to 30°, each step 30°/128.
Fig. 4: Schematic diagram LIVG Building and Optimization: PSO case in scan angle 0° to 30°, each step 30°/128.
Full size image

Deep Learning Case input data set consist of scan angle = m(m = 15,20,…,85); element = n(n = 4,5,6,…,16), rank = n-1;output data set was the optimized LIVG. a M = 128, N = 16 Grey-scale map represents phase shifts [−180°,180°]at element n, degree m; b Normalized Ideal Far-field pattern(red) form 0° to 30°; c Grey-scale map after SVD compressed. d Grey-scale map reconstructed by LIVG. e Reconstructed weight matrix far-field pattern(blue) from 0° to 30°. f Iterative convergence of PSO algorithm. g The optimally constructed LIVG, where each row vector of the LIVG represents a specific beam pointing: 0°(blue),8°(purple), 12°(yellow), 30°(red). h Normalized Ideal far-field pattern(red with shadow) and optimal LIVG-built normalized far-field pattern(blue with shadow). i Deep Learning Data Set building; j Topology of Transformer Deep Learning Network Structure; k Far-field pattern of the LIVG matrix generated by the deep learning model (dashed line) and PSO algorithm (solid line).

Therefore, in this case, four vectors are selected to construct LIVG (\(rank=4\)). After recreating the matrix by zeroing the remaining eigenvalues, as illustrated in Fig. 4(c), the distortion of the weights is almost negligible. The weight matrix as in Fig. 4(d) is reconstructed again after applying the constraints of Eq.(6) to the coefficients \(K\). The distortion of the weight matrix gradually becomes apparent. Such distortion could potentially affect the beam pointing.

The optimizer’s objective is to create a better collection of LIVGs such that the difference between the ideal far-field pattern Fig. 4(b) and the reconstruction’s far-field pattern Fig. 4 (e) is sufficiently minimal to improve beam pointing accuracy. The algorithm is expected to converge the optimizer function value at 2.39 after 200 iterations as shown in Fig. 4(f). As shown in Fig. 4(g) the optimized LIVG was built, and the far-field pattern formed by the restructure weight matrix is compared to the far-field pattern of the ideal antenna in Fig. 4(h). The reconstructed far-field pattern is nearly identical to the ideal far-field pattern during the 30° scan in terms of beam direction, which implies that the active devices of the array system can be cut down to 25% by regulating the phased array of 16 elements using only 4 amplifiers and phase shifters and loss the beam pointing accuracy less than 1.73°.

To eliminate the redundancy of the repetitive computation and design a general LIVG building model, a deep learning model was designed. A wider field-of-view model is required due to our data consists of a collection of numeric values. Transformer36 has the qualities we need, such as a relatively low induction bias and a correlation of all feature values, compared to Convolutional Neural Networks (CNN)37 and Recurrent Neural Networks (RNN)38.

To generate training data for this model, as depicted in Fig. 4(i), a more comprehensive dataset was constructed by traversing steering angles from 5° to 85° with 5° for each step, element numbers from 4 to 16 with 1 for each step, and LIVG with rank \(r=2\) to \(N-1\) at intervals. For each piece of data, the input is a \(128\times 32\) matrix that includes the 16 array components’ complex weights (both real and imaginary) as well as the number of scan steps of M = 128 (If the number of arrays N < 16, the remaining values are zero. The output is the optimized LIVG for each input matrix.

The topology of the transformer is elementary, as shown in Fig. 4(i). First, the weight matrix is expanded by the visual transformer into 64 blocks of size \(64\times 16\). In contrast to the standard visual transformer39, patches are one-dimensional data in DRCAO-PAA, so that the Transformer, which is commonly used to process languages, can each patch be considered as a token (like a word) that can be inserted into the Transformer.

As the model performance is admirable in the test dataset with the 89.68% cosine similarity and the 0.0748 mean absolute error (MAE), the Transformer-based model can quickly generate an optimum LIVG, when given the DRCAO-PAA design parameters (such as the number of array elements, maximum scan angle, etc.). Figure4(j) depicts the far-field pattern of the LIVG matrix generated by the deep learning model (dashed line) and the far-field pattern of the optimal LIVG matrix solved by the PSO algorithm (solid line). The deep neural network fits the PSO algorithm well. It will simplify the design process and speed up the DRCAO-PAA calculations. This concept is also applicable to a 2D model, with the derivation of the 2D DRCAO-PAA and simulation results shown in Fig. S1.

System demonstration

To verify our proposed concept, a five-step experiment is set up, including experimental setup, Implementation of DRCAO-PAA, beam steering execution, measurement, and validation. It is designed as a mm-wave phase shifter module operating at 28 GHz for Experimental Setup. Different LIVGs are used to characterize the vector space of antenna array element weights. The weights were set by adjusting the output chip’s control of amplitude and phase to implement the DRCAO-PAA. Beam steering was achieved by adjusting the amplitude and phase with variable K factors between the input layer and the middle layer. We measured the S21 parameters between each input port and output port on the DRCAO-PAA verification board using a vector network analyzer. The normalized far-field pattern of the beam was synthesized by calculating these measured S21 parameters. Finally, we carried out a far-field pattern measurement in the chamber by applying 2 pairs of phase shifters and a variable gain amplifier.

In detail, as shown in Fig. 5(a), a mm-wave phase shifter printed circuit board (PCB) module is used to verify the DRCAO-PAA theory. Figure 5(b) shows the conceptional block diagram of the phase shifter module. Three layers—the input layer, middle connection layer, and output layer—combine to form the entire module design. In this experiment, a maximum of 4 inputs and 16 outputs were accomplished. By having low-cost and commercially available silicon beamformer chips on the input and output layer PCBs, each path between input and output ports can adjust its amplitude and phase independently with high resolution.

Fig. 5: Dimensionality-reduced phase array experimental setup and prototype for concept verification.
Fig. 5: Dimensionality-reduced phase array experimental setup and prototype for concept verification.
Full size image

a Dimensionality-reduced cascaded angle offset (DRCAO-PAA) verified board prototype. b DRCAO-PAA verified board Schematic, Input layer(red), middle layer(blue), output layer(yellow). c Far-field Pattern Measurement in Spherical mm-Wave Anechoic Chamber d 28GHz mm-wave phased array. VNA: vector network analyzer.

For mapping DRCAO-PAA theory, each yellow tap represents a fixed tap, to implement the weights of each vector in the LIVG; red features a tunable tap with adjustable amplitude and phase (linear Vector K). As an illustration, consider a LIVG built with rank=2 and N = 4. The LIVG weights can be set by adjusting the output chip’s control of the amplitude and phase of the link between Middle 1-Output 1,2,3,4 and Middle 2-Output 1,2,3,4. Beam steering is achieved by adjusting the amplitude and phase of the link between Input 1-Middle 1 and Input 2-Middle 2 with variable K.

A LIVG with rank=3N = 8(referred to as r = 3) which reduces 62.5% component, and a LIVG with rank=4N = 16(referred to as r = 4) which reduces 75% component, were both utilized to evaluate the DRCAO-PAA. After finishing the fixed tap equalization via the SPI serial interface. To achieve the 0–30° beam steering, r = 3 or r = 4 taps are employed to control 3 or 4 vectors of LIVG. The \({S}_{21}\) parameters were measured between each input port and output port on the DRCAO-PAA verification board with a vector network analyzer (VNA Agilent Technologies PNA Network Analyzer E8361C).

The normalized far-field pattern of the beam was synthesized by the calculation of measured S21 parameters. Figure. 6 (a) shows the normalized far-field pattern computed and synthesized from each of the three vectors in the LIVG with rank=3, pointing at 14.4°, 25.92°, and 29.52°, respectively. Five sets of K-factors were employed to control the LIVG to achieve beam steering from 0 to 30°. As shown in Fig. 2 (b), the normalized far-field pattern computed and synthesized by 5 sets measured S21. The beam steering from 0 to 30° is accomplished based on the DRCAO-PAA theory to regulate these three non-uniformly selected vectors.

Fig. 6: Experimental results.
Fig. 6: Experimental results.
Full size image

a Calculate and synthesize far-field pattern for 3 vectors in LIVG with r = 3. b Control linear vector group (LIVG) with r = 3 to complete 0 ~ 30° beam steering. c Calculate and synthesize far-field pattern for 4 vectors in LIVG with r = 4. d Control LIVG with r = 4 to complete 0 ~ 30° beam steering. e Measured far-field pattern for 2 vectors in LIVG with r = 2. f Control LIVG with r = 2 to complete 6° ~ 14°beam steering. (Different colors means different beam pointing).

For the 4 vectors case, Fig. 6(c) shows the normalized far-field pattern computed and synthesized from each of the 4 vectors in the LIVG, pointing at 3.24°, 6.12°, 11.16°, and 28.44°, respectively. With 5 sets of K factors, the beam steering was performed from 0 to 30°. As shown in Fig. 6(d), the normalized far-field pattern computed and synthesized by 5 sets measured S21. The case of 4 vectors is also complete the beam steering from 0 to 30°, and the beam synthesized by 16 antennas is narrower and more accurate pointing.

To further verify the DRCAO-PAA hypothesis, the setup shown in Fig. 5(c) was utilized to evaluate the antenna’s far field. The whole measurement was performed in a spherical mm-wave anechoic chamber. As shown in Fig. 6(e, f), the normalized far-field pattern of the beam was measured using this setup. With rank=2, N = 4, the normalized far-field pattern of the two vectors in LIVG, pointing at 6° and 9°, was shown in Fig. 6(e). To achieve the beam steering from 6° to 14°, different K is applied with the help of 2 phase shifters, and 2 variable gain amplifiers (VGA). As shown in Fig. 6(f), six beams are pointing at 6°,7°,8°,9°,10°, and 14°, which demonstrate beam steering from 6° to 14°. This experiment verified the correctness of the DRCAO-PAA theory by controlling a 1×4 phased array with two active controllers to complete beam steering.

The above series of experiments are used to verify our theory. 3-vectors(\(rank=3\) independent vector group) are used to characterize the vector space consisting of 8(\(N=8\)) antenna array elements weights and 4-vectors(\(rank=4\) independent vector group) are used to characterize the vector space consisting of 16(\(N=16\)) antenna array elements. The experiments demonstrate that, based on DRCAO-PAA theory, any array size of a phased array can be perfectly driven with fewer active controllers to complete beam steering.

Conclusion and future aspects

We have proposed a Dimensionality-Reduced Cascaded Angle Offset Phased Array (DRCAO-PAA) theory, which introduces Singular Value Decomposition (SVD) to compress the scan and weight matrix ranks, thereby reducing the number of active components required for beam steering. By constraining the amplitude coefficients and applying Particle Swarm Optimization (PSO) and Transformer-based models, we further optimize matrix construction and enable efficient hardware implementation. Experimental validation demonstrates that DRCAO-PAA achieves effective beam steering over 0–30° using only three pairs of phase shifters and VGAs to control an 8-element array. Additionally, in a spherical anechoic chamber, we demonstrate continuous 8° beam steering with just two phase shifters and two adjustable amplifiers for a 4-element array.

This unified framework not only generalizes prior phase-shifter-reduction techniques but also reduces system complexity and cost by enabling implementation with simpler attenuators or low-gain amplifiers. While our current validation is for 1D linear arrays, the underlying matrix compression principle is extendable to 2D planar arrays, where future work will explore orthogonal cascading and tensor decomposition strategies to enable full 3D beam steering with ultra-low controller counts—supporting the vision of scalable, AI-native 6 G and satellite communication systems.

Method

Dimensionality reduced CAO-PAA in beam steering

Phase shifts and amplitude weights are applied to a 1-D linear evenly spaced phased array antenna array to evaluate the beam steering performance utilizing the proposed DRCAO-PAA concept. At angle \(m\), weights with array number \(N\) reconstructed by SVD compression can be expressed as:

$${P}^{\prime}_{m}=CK={k}_{1}{e}^{-j{\kappa }_{1}}{\vec{c}}_{1}+{k}_{2}{e}^{-j{\kappa }_{2}}{\vec{c}}_{2}+\cdots {k}_{r}{e}^{-j{\kappa }_{r}}{\vec{c}}_{r},$$
(7)

where \({k}_{r}\) denotes the \(rth\) magnitude, \({\kappa }_{r}\) denotes the \(rth\) phase, \({\vec{c}}_{r}\) denotes the \(rth\) row vector in the composed LIVG matrix, which can be expressed as

$${\vec{c}}_{r}=\left[\begin{array}{c}{C}_{r,1}{e}^{(-j{\phi }_{c(r,1)})}\\ {C}_{r,2}{e}^{(-j{\phi }_{c(r,1)})}\\ \vdots \\ {C}_{r,n}{e}^{(-j{\phi }_{c(r,n)})}\end{array}\right],$$
(8)

where \({C}_{r,n}\) denotes the amplitude of the \(rth\) vector at \(nth\) element, \({\phi }_{c(r,n)}\) denotes the phase shifts of the \(rth\) vector at \(nth\) antenna.

Equation 8 can be rewritten as

$${P}^{\prime}_{m}=\left[\begin{array}{c}{C}_{1,1}{k}_{1}{e}^{-j{\kappa }_{1}{\phi }_{c(1,1)}}+{C}_{2,2}{k}_{2}{e}^{-j{\kappa }_{2}{\phi }_{c(2,1)}}+\cdots +{C}_{r,1}{k}_{r}{e}^{-j{\kappa }_{r}{\phi }_{c(r,1)}}\\ {C}_{1,2}{k}_{1}{e}^{-j{\kappa }_{1}{\phi }_{c(1,2)}}+{C}_{2,2}{k}_{2}{e}^{-j{\kappa }_{2}{\phi }_{c(2,2)}}+\cdots +{C}_{r,2}{k}_{r}{e}^{-j{\kappa }_{r}{\phi }_{c(r,2)}}\\ \vdots \\ {C}_{1,n}{k}_{1}{e}^{-j{\kappa }_{1}{\phi }_{c(1,n)}}+{C}_{2,n}{k}_{2}{e}^{-j{\kappa }_{2}{\phi }_{c(2,n)}}+\cdots +{C}_{r,n}{k}_{r}{e}^{-j{\kappa }_{r}{\phi }_{c(r,n)}}\end{array}\right].$$
(9)

The direction vector of a one-dimensional phased array, denoted as

$$d(\theta )=[1,{e}^{\, jkd\,\cos (\theta )},{e}^{\, j2kd\,\cos (\theta )},\ldots {e}^{\, j(N-1)kd\,\cos (\theta )}],$$
(10)

\(k=2\pi /\lambda\), \(\lambda\) is the RF signal wavelength.

Based on the beamforming theory33, The far field pattern (FFP) of such an array could be expressed as:

$$A{F}_{svd,m}(\theta )=|d(\theta ){P}^{\prime}_{m}|=|{\sum }_{n=1}^{N-1}{\sum }_{r=1}^{R}{C}_{r,n}{k}_{r}\exp \left(\right.-j({\kappa }_{r}{\phi }_{c(r,n)}+nkd\,\cos (\theta ))|.$$
(11)

The pointing error can be expressed as the peak value difference between the far field pattern before and after SVD operation.

$$Erro{r}_{pointing}=|\,\max (A{F}_{svd,m}(\theta ))-\,\max \left(\right. A{F}_{ideal,m}(\theta )|$$
(12)

Particle swarm algorithm in DRCAO-PAA

The particle swarm algorithm is a swarm intelligence algorithm designed by simulating the predatory behaviour of a flock of birds. Each particle acts as a potential solution with its unique velocity and position, and each particle has an objective value that is based on its own location which indicated the vectors index of the M×N matrix. The algorithm parameters of PSO are listed in Table 2.

Table 2 Particle swarm algorithm Parameters

First, a set of population particles is produced at random. The particles can alter their own location, which alters the objective value, by keeping track of these two best positions: the population’s overall best position gBest and the historical best position of each population particle pBest.The subsequent equation accounts for the particle’s position and velocity40.

$${v}_{i,r}^{t+1} =\omega \times {v}_{i,r}^{t}+{c}_{1}\times rand()\times (pBes{t}_{i,r}^{t}-{x}_{i,r}^{t})\\ = {c}_{2}\times rand()\times (gBes{t}_{r}-{x}_{i,r}^{t})$$
(13)
$${x}_{i,r}^{t+1}={x}_{i,r}^{t}+{v}_{i,r}^{t+1}$$
(14)

where i = 1,2,……,N, N is the seed size, r is the dimensionality which is the rank of the LIVG in this algorithm, v is the particle velocity, x is the particle position which is the index in the M×N matrix, v is the particle velocity, t is the current iteration number, and c1, c2 are learning factors, and rand() is a uniformly distributed function between [0,1].

Transformer for LIVG building

16 layers of Transformer Encode were used to build our model which proposed neither a CNN nor an LSTM, but a mechanism called dot-product Attention, on top of which the model (Transformer) is already substantially better than existing methods.

Query, Key, and Value are the three variables that The Transformer employs in (dot product) attention. In essence, the algorithm multiplies each Key by its corresponding Value to get the Attention Weight of the Query and Key phrases39.

$$Attention(Q,K,V)=soft\,\max \left(\frac{Q{K}^{T}}{\sqrt{{d}_{k}}}\right)V$$
(15)

In our model, Multi-Head Attention is used as well. It uses multiple Attention Heads (in the case of MLP, the “number of hidden layers” is increased), defined as follows:

$$MultiHead(Q,K,V)=Concat(hea{d}_{1},{..}.,hea{d}_{h}){W}^{O}$$
(16)

where

$$hea{d}_{i}=Attention(Q{{W}_{i}}^{Q},K{{W}_{i}}^{K},V{{W}_{i}}^{V})$$
(17)

Each Head has its own projection matrices\({{W}_{i}}^{Q},{{W}_{i}}^{K},{{W}_{i}}^{V}\), projected features of these matrices are used to create attention.

The Transformer model’s training process is split into coarse and fine modes. With the learning rate of 1e-5, the 200 training sessions in coarse mode are carried out using the AdamW optimizer41. With the learning rate of 1e-7 and the momentum factor set to 0.1, the 300 training sessions in fine mode are carried out using the SGD optimizer42. To prevent overfitting, the MAE is used as the loss function of the model. The “cosine similarity,” which measures how similar two numerical sequences are, is selected as the metric. The model completes training with a lossy MAE of 0.0562, and a cosine similarity of 91.83%.

Design of mm-wave phase shifter module

The mm-wave phase shifter module is operating at 28 GHz, and contains 3 layers (Fig. 5a):

The input layer has 4 inputs and 16 outputs and is made using four 1-to-4 phase shifter PCBs. On each PCB, the input signal is split into 4 paths (1-to-4), where each path has an independent 6 bits 30 dB amplitude control and 6 bits 360-degree phase control. This is achieved by using a commercially available beamformer chip, which is set to TX mode in this experiment. The chip operates at 25-30 GHz and has a max output power of +10 dBm at each output. The gain and amplitude control can be done through SPI, and the chip on each PCB has a different hardware address, so that they can be controlled by a single SPI master device.

The output layer has 16 inputs and 16 outputs and is made using four 4-to-4 PCBs. Each PCB has 4 beamformer chips, which is the same chip used on the input layer PCB. In this case, the beamformer chips are set to RX mode, where each chip can combine 4 inputs signal to 1 output and have independent amplitude and phase control on each path. As a result, by having 4 chips on board in parallel and sharing the same 4 inputs, each board can operate as a 4-to-4 network.

The middle layer is built to route and distribute the signals between the input and output layers. It also serves as a master SPI controller and power supply for all the beamformer chips used in the module. The complete mm-wave phase shifter module consumes about 12 W from a single 3.3 V power supply. On the board, there are LDOs to regulate the supply voltage and SPI buffer to minimize the noise on the control signals. The SPI can operate up to 50 MHz, which allows the module to synthesize any phase shift in any paths in microseconds.

As each path in the input and output layers has an independent amplitude and phase control, the module provides enough flexibility to synthesize and implement different DRCAO-PAA algorithms, as well as calibrate any system amplitude imbalance between different antenna channels.