# nature communications **Article** https://doi.org/10.1038/s41467-025-63486-z # Programmable circuits for analog matrix computations Received: 31 March 2025 Accepted: 17 August 2025 Published online: 26 September 2025 Rasool Keshavarz **1**, Kevin Zelaya **2**, Negin Shariati **1** & Mohammad-Ali Miri **2**,3,4 ⊠ Matrix operations are at the core of signal processing in radiofrequency and microwave networks. While analog matrix computations can dramatically speed up signal processing in multiport networks, they can also reduce the size, weight, and power of radiofrequency and microwave devices by partially eliminating the need for power-hungry electronics. These computing devices exploit fundamental properties of electromagnetic waves, enabling parallel signal processing at the speed of light. Here, we propose and demonstrate a microwave-integrated circuit capable of implementing universal unitary matrix transformations. The proposed device operates by alternating non-reconfigurable and reconfigurable layers of basic RF components, comprising cascaded power dividers and programmable phase elements, respectively. The controllable multipath interference through conjunctive use of linear wave mixing with active phase control enables creating complex transformations in this device. We experimentally demonstrate this device concept using a four-port integrated circuit operating across the frequency range of 1.5–3.0 GHz and at hundreds of micro-Watt power levels. The proposed device can pave the way for universal analog radiofrequency and microwave processors and preprocessors with programmable functionalities for multipurpose applications in advanced communications and radar systems. The evolution of radiofrequency and microwave networks for advanced communications and radar systems toward increasingly complex devices demands powerful digital signal processors to enable real-time data processing and rapid decision-making. On the other hand, fundamental limitations of digital electronics in bandwidth and power consumption have resurfaced interest in analog logic. An analog computing engine can utilize fundamental properties of electromagnetic waves for massive parallel processing to enable signal processing literally at the speed of light. On the other hand, the very nature of an analog processor can facilitate dramatic reduction in size, weight and power compared to its digital counterpart. In the past decade, there has been substantial progress in the development of units that perform linear operations leveraging electromagnetic waves in free space and in guided wave structures<sup>1-7</sup>. However, less attention is paid to reconfigurable on-chip structures that could be programmed on-demand to perform the desired transfer function between a number of ports in radiofrequency and microwave networks. In this vein, linear transforms play a pivotal role in signal processing as a large number of the relevant signal manipulations lie in this regime<sup>8</sup>. In the optical domain, interest in physical realizing linear operations on multimode light beams was driven by applications in optical computing, expanded significantly after realizations with nanophotonic circuits<sup>9-13</sup>. Photonic units also find exciting new applications as a promising platform for quantum computing and <sup>1</sup>RF and Communication Technologies (RFCT) Research Laboratory, University of Technology Sydney, Sydney, NSW, Australia. <sup>2</sup>Department of Physics, Queens College of the City, University of New York, Queens, NY, USA. <sup>3</sup>Physics Program, The Graduate Center of the City, University of New York, New York, NY, USA. <sup>4</sup>Department of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USA. — mail: ali.miri@rit.edu simulations<sup>14</sup>, including photons in the microwave domain for largescale quantum communication channels15-17. Furthermore, reconfigurable photonic integrated circuits have evolved toward various architectures and configurations, coining the term programmable photonics, holding promise for applications in classical photonic signal processing<sup>18-23</sup>. In a similar fashion, the development of programmable analog matrix computing cores at radio and microwave frequencies can provide real-time, low-power data processing solutions that circumvent the fundamental limitations of purely digital systems<sup>24,25</sup>. By merging reconfigurable electromagnetic wave manipulation with analog computing, these architectures could pave the way for next-generation, high-speed signal processing applications<sup>26</sup>. Solutions in this regard include phase-array radars<sup>27,28</sup>, lensless Fourier networks<sup>29</sup>, computational imaging<sup>30</sup>, microwave beamforming<sup>31,32</sup>, and frequency filters and synthesizers<sup>33-35</sup>. There is thus an interest for integrated and programmable microwave units capable of dynamically adjusting their functionalities to accommodate various of these application scenarios. While programmable photonic systems have successfully implemented linear transformations in the optical and infrared domains, they typically require optical sources and detectors, leading to increased latency, system complexity, and power consumption. In contrast, the proposed RF-domain solution processes signals natively without optical-electrical conversion, offering reduced latency and seamless integration with existing radiofrequency (RF) front-end hardware. Additionally, the ability to implement such architectures using printed circuit boards or MMICs facilitates compact, scalable integration for embedded and low-power applications. In this work, we introduce, design, and experimentally demonstrate a programmable microwave circuit device that performs universal unitary matrix transformations. The proposed solution operates on an interlaced structure composed of fixed networks of power dividers and tunable phase shifter arrays. The programmable phase elements provide the required degrees of freedom to represent a desired unitary operation. These results are experimentally validated by a four-port device with five layers of phase shifters, operating at the central frequency of 2.1 GHz. A time-multiplexing scheme enabled design- and power-efficient programming of the phase shifter network in series using a simple microcontroller. In addition, the selfcalibration properties of the proposed circuit configuration allow for high-fidelity representation of desired matrix operations even with coarse resolution of the underlying phase shifters. The device performance is demonstrated through exemplary operations including Hadamard matrix. # **Results** #### Theoretical background To better understand the underlying structure of the proposed linear unitary transform device, Fig. 1a presents a schematic of the interlaced structure intertwining layers of non-tunable (passive) and tunable (active) components, represented through the unitary operators F and $P^{(m)}$ , respectively. In the latter, F is considered as fixed and $P^{(m)}$ is a diagonal matrix with diagonal elements $p_n^{(m)} = e^{i\phi_n^{(m)}}$ , with $m \in \{1, \ldots, M\}$ and $n \in \{1, \ldots, N\}$ , where M is the total number of layers and N the total number of ports. From these considerations, the proposed N-th port device follows from the interlaced universal unitary operation $$\mathcal{U}(\Phi) := FP^{(N+1)}F \dots FP^{(1)}F, \tag{1}$$ where $\mathcal{U}(\Phi) \in U(N)$ , and $\Phi$ is the set of trainable phase parameters. The theoretical design in Eq. (1) is versatile, as the sequence of multiplications produces a device with components that are sequentially connected, as shown in Fig. 1a Furthermore, the non-tunable layer, F, only needs to be designed once and can then be replicated throughout the device. Fabrication errors should also be considered in the design of such a layer, which changes the overall functionality of the device. As pointed out in previous works<sup>36</sup>, this interlaced device can be further tuned to mitigate fabrication errors through the phase elements. For the current design, the non-tunable layer is constructed using a sequence of two-port 50:50 Power Divider Units (PDUs). The number and placement of these units must be determined based on the required universality of the complete device. Although fewer power dividers are ideal for reducing the overall footprint, the total number shall be selected so that the resulting mixing layer F fulfills the density criterion for unitary interlaced architectures<sup>23</sup>. This procedure is done by using the S-parameter specification of each power-divider provided by the manufacturer. We fabricate a four-port device (N = 4)comprising M=5 phase layer, and consequently 20 tunable phase elements, the overall device of which is illustrated in Fig. 1b. In this case, the non-tunable mixing layer F reduces to a minimum of two layers of power dividers (dashed-white area in Fig. 1b) without jeopardizing the universality of the proposed device. Since F is non-tunable, its proper design and characterization are fundamental to the overall device performance (up to mild deviations), as once manufactured, it cannot be further modified. To assess the performance of the proposed device, 500 Haar random unitary 4 × 4 matrices are created, and the device is tuned to reconstruct each target by optimizing the training parameters $\Phi$ in Eq. (1) for each $M \in \{1, ..., 9\}$ . This is performed using gradient-based optimization algorithms, as shown in the Methods section and Supplementary Note 1. The results are summarized in Fig. 1c, where a phase transition in the figure of merit is observed at M = N + 1 = 5 layers. It is not surprising that higher errors are obtained for M < 4, as the total number of active phase elements is lower than $N^2 = 16$ , the total required to fully parameterize any arbitrary target in U(4). For M=N=4, the system is still effectively under-parameterized, as a global phase can be factored out per layer, reducing the size of the parameterization space<sup>36</sup>. We thus have selected M = N + 1 = 5 layers to balance performance with hardware complexity. This configuration maintains reconstruction errors within acceptable tolerance for key unitary operations while enabling a more compact and energy-efficient implementation suitable for practical fabrication. The acceptable error tolerance for the number of layers M is fundamentally application dependent. While communication and radar systems can tolerate moderate errors without impairing functionality, more sensitive quantum or coding applications may require lower error levels, motivating higher M values. Therefore, MMM should be selected to balance system performance requirements and hardware complexity. It is worth mentioning that the device performance is achieved even if deviation on the non-tunable layer *F* are present in the actual device. This follows from the fact that mild defects can be compensated by steering the tunable phase elements to reach the desired performance<sup>36</sup>. See Supplementary Note 1 for more information. #### **Fabrication and experimental realization** The proposed unitary matrix solver device is fabricated following the interlaced design in Fig. 1a, where six layers of mixing *F* matrices and five layers of active phase elements are implemented. The *F* matrix layers are devised with a network of power dividers. Specifically engineered for 5G applications<sup>37</sup>, these power dividers offer low loss, tight amplitude balance, and high isolation within the 1.5–2.3 GHz frequency range. Constructed from ceramic-filled Polytetra-fluoroethylene (PTFE) composites, they provide excellent electrical and mechanical stability, ensuring reliable circuit performance. For the phase layers, we opted for 6-bit digital phase shifters that feature integral CMOS drivers and low DC power consumption, making them ideal for energy-efficient designs<sup>38</sup>. These phase shifters offer a step size of 5.6°, providing complete 360° coverage with minimal attenuation variation. In addition, a low-cost, low-power consumption **Fig. 1** | **Device structure, design, and performance. a** Block diagram of the proposed universal unitary device inspired by a photonic interlaced structure where non-tunable (F) and tunable layers are intertwined in a cascaded fashion. **b** Sketch of the final device comprising non-tunable 50:50 power dividers (dashed-white) that mix incoming signals across all the channels so that the tunable phase elements (long-dashed-red) steer the signal to the required operation. The choice of the non-tunable layer is not restricted to a specific form, and the theoretical form of the corresponding transmission matrix can be deformed due to external fabrication defects. **c** Figure of merit $L(\Phi) = ||\mathcal{U}(\Phi) - \mathcal{U}_t||^2/N^2$ for the interlaced structure using the theoretical F (see Methods section). The latter is depicted as a function of the number of layers (M), showing a sudden drop in the figure of merit at M=5 layers. In this process, 500 Haard random matrices are generated for each M, presenting only the mean (dot) and standard deviation (shaded area). **d** Percentage error $(\mathcal{U}(\Phi) - \mathcal{U}_t||^2/||\mathcal{U}_t||^2) \times 100\%$ between the target and the reconstructed matrices when the previously optimized phases are replaced by 6-bit discretized phase shifters (upper panel). The lower panel presents the calibrated operation, where the discrete nature of the phase shifters is incorporated during the optimization process. system-on-chip microcontroller facilitated phase shifter control, enabling serial configuration with minimal interface requirements. This serial configuration allows connecting all the phase shifters in a cascade setup from a serial bus perspective, minimizing control pin usage. See Supplementary Notes 2, 3 for more details. The 6-bit phase shifters with 5.6° step size provide sufficient resolution for accurately implementing universal unitary operations in a 4-port device. Phase discretization errors are mitigated through digital optimization and calibration, enabling high-fidelity matrix synthesis. The upper panel in Fig. 1d depicts the percentage error of the reconstructed target when the previously optimized phases are replaced by their discrete counterpart. Indeed, a penalization is incurred in the phase discretization process, yielding an average error of 6.41% across all the 500 samples. However, by reoptimizing the device while taking into account the discrete nature of the device, the percentage error drops to a mean of 2.97%, as shown in the lower panel of Fig. 1d. Higher resolution may benefit larger-scale systems but entails increased hardware complexity. Potential mismatches between phase shifter chips are addressed through a self-calibration procedure that measures the effective device response and optimizes phase control bits to compensate for deviations. The use of integrated 6-bit digital phase shifters with CMOS drivers further enhances uniformity across phase elements, supporting stable and repeatable operation. Figure 2a shows the manufactured PCB of a $4 \times 4$ unitary matrix multiplier. A replica of the power-divider unit is separately fabricated **Fig. 2** | **Fabricated unitary universal device and its power-divider layer. a** PCB of the fabricated unitary matrix solver comprising power-divider unit layers (yellow-dashed) and the independent active phase elements (red-dot-dashed) and individually fabricated power-divider layer. This layer is experimentally tested to properly characterize the full unitary matrix solver. **b** Experimental results of the real parts of the S-matrix for the individually fabricated power-divider layer (matrix F). c Transmission matrix (normalized) of the individually fabricated power-divider unit at selected frequency points, including the target operational frequency (red-dashed). for benchmarking and calibration, which is shown in this figure and fully characterized before running the complete unitary matrix solver. In the latter, a layout of the power divider unit is presented, which was designed and simulated using the Advanced Design System (*ADS*) suit software before production. The layout for this layer was specifically designed to operate optimally at 2.1 GHz, which is the subsequent target operation of the unitary matrix solver. The individual fabricated power-divider unit is thoroughly benchmarked using a four-port vector network analyzer (VNA) to fully characterize it in the frequency spectrum 1.5-2.3 GHz. Since the power-divider layer is the only non-tunable layer, it is fundamental to properly characterize it independently so that we can estimate the required phases on the tunable layer required for the unitary matrix solver functionality. The measured transmission components of the S-parameters are summarized in Fig. 2b, where the reflection components were ignored, as their contribution is negligible. See Supplementary Note 4 for full characterization of the S-parameters. To clearly illustrate the behavior of the power-divider unit, its transmission matrix components (in normalized units) are shown in Fig. 2c for specific frequencies, including the operational frequency at 2.1 GHz used across the fully connected device. The latter shows mild deviation on the intensity of the transmission components, which is in good agreement with the ideal case show in Supplementary Note 1. In turn, deviations are expected on the phase components, as the differences in optical paths at different frequencies contribute to different phases. #### **Experimental results** Individual characterization of the fixed layer of power splitter networks permits precise measurement of the intervening operator $F_{\rm exp}$ . This allows us to use the formula in Eq. (1) by replacing $F = F_{\rm exp}$ in order to realize the desired unitary evolution $U_{\rm exp}$ by properly steering the phase elements. The latter phases are optimized according to the desired target operation by minimizing the experimental figure of merit shown in the Methods Section. Indeed, the phase elements used in the experimental setup have a limited tuning capability; however, their accuracy is sufficient for proper device operation. Since the network analyzer has four ports and both the proposed unitary matrix multiplier has eight ports (four inputs and four outputs), a set of two $2\times4$ RF switch arrays were employed at both the input and output to properly excite and gather signals from all the ports of the device. Calibration procedures utilizing RF switches ensured the precise extraction of the device's S-parameters, while calibrated cables minimized cable losses in the measurement setup. The port switching, phase-shifter microcontroller operation, and data collection from the network analyzer are ultimately performed on a personal computer. This setup is summarized and illustrated in Fig. 3a. We establish a benchmark scenario by configuring the device to perform three exemplary unitary operations. That is, we consider the $4\times 4$ identity matrix and the anti-diagonal matrix, which are among the sparsest matrices in the unitary group U(4). The identity matrix steers the device into a pass-through operation (bar configuration), whereas the anti-diagonal shuffles the channel order (cross configuration). Furthermore, we consider the Hadamard matrix, $U^{(H)}$ , which has the same density as the discrete Fourier transform but is more accessible for computation. See the Discussion section. Additionally, the $4\times 4$ Hadamard matrix is commonly used to implement Hamming codes, simple error-correcting codes that can detect single-bit errors in digital communication<sup>39</sup>. The Hadamard transform is also used in image compression and feature extraction. The optimized phases required to achieve the desired targets are loaded into the device, and the corresponding S-matrix is measured over a frequency sweep in the range of 1.5–3 GHz. The real part of the unnormalized experimental results is shown in Fig. 3b for the Hadamard matrix as a target (see Supplementary Note 3 for the full S-matrix). In the latter, it can be noted that the components $Re[S_{5,1}]$ , $Re[S_{5,2}]$ , $Re[S_{5,3}]$ , and $Re[S_{5,4}]$ reach approximately the same values at 2.1 GHz, which is expected for a Hadamard matrix (a real-valued matrix with equal intensities). To better assess the performance of the Hadamard matrix and the other two targets, the frequency response of **Fig. 3** | **Experimental setup and results. a** Schematic of the experimental setup. This setup involves a four-port vector network analyzer (VNA) composed of two inputs and two output channels. Since the device has eight ports in total (four inputs and four outputs), two $2 \times 4$ radiofrequency switches are interlaced between the VNA and the input and output ports of the device under test. The phase-shifters are electronically controlled in a series connection through the ESP32 microcontroller. **b** Measured (unnormalized) S-matrix of the constructed universal interlaced architecture operated so that the phase elements render the Hadamard matrix. ${\bf c}$ Frequency performance of the device assessed by computing the figure of merit in terms of the operational frequency for the identity, anti-diagonal, and Hadamard matrix as targets. The minimal error is located at the optimal frequency of 2.1 GHz. Theoretical (red) and measured (blue) normalized transmission matrices $T_{n_1,n_2}$ for the identity ( ${\bf d}$ ), antidiagonal ( ${\bf e}$ ), and Hadamard ( ${\bf f}$ ) configurations when the device is operated at 2.1 GHz. the reconstructed error, computed using the figure-of-merit described in Methods Section, is depicted in Fig. 3c. This reveals a minimum error for the previously optimized phases at the operational frequency of 2.1 GHz, deviations for other frequencies are due to the phases observed in Fig. 2c. For illustration, the normalized target and measured transmission matrices at 2.1 GHz are shown in Fig. 3d–f, revealing the expected performance at the optimal frequency. This not only corroborates the previous findings but also provides insight into the operational bandwidth of the device around the optimal frequency at 2.1 GHz. The fabricated four-port prototype demonstrated robust unitary transformations, including Hadamard and identity matrices, over a 1.5–3.0 GHz frequency sweep. Despite the use of 6-bit phase shifters with coarse resolution, high-fidelity performance was achieved at low power levels (-330 $\mu$ W) owing to a self-calibration scheme that compensates for fabrication-induced deviations in the fixed mixing layer. Furthermore, as shown in Fig. 3, we measured and characterized the proposed ASP system across the wide frequency range 1.5–3 GHz. Since the frequency response is well-characterized, frequency-dependent calibration can be applied. In practical scenarios, this allows us to flatten and equalize the response over a 1 GHz bandwidth centered around 2.1 GHz, making real-time wideband operation feasible and reliable within this range. #### Discussion The implementation of arbitrary unitary transformations has been explored in the context of integrated photonics for light in the optical and infrared domains. However, a fully self-contained solution that operates directly with RF signals, without the need for external optical signals, has proven elusive. Our device aims to fill this gap by introducing an integrated solution that performs unitary operations on signals in the RF domain. Unlike RAM-based analog computing<sup>40</sup>, which relies on digital memory elements and requires conversion stages, the proposed analog RF matrix processor operates natively at microwave frequencies. This approach enables real-time processing with lower latency and power consumption. By allowing direct wave-based manipulation, our device provides scalable and programmable unitary transformations, offering a complementary alternative to existing RAM-centric analog computing methods. While the present design focuses on unitary transformations, arbitrary linear (non-unitary) matrices can be implemented using a Singular Value Decomposition (SVD) approach, i.e., $A = U_1\Sigma U_2$ , where $U_1$ and $U_2$ are unitary and implemented using the current interlaced phase-coupler topology. The diagonal scaling matrix $\Sigma$ can be realized by inserting a layer of variable gain or attenuation elements. Directly replacing 3-dB couplers with variable gain amplifiers (VGAs) is theoretically possible, but presents practical limitations including stability, power consumption, and control complexity. This solution does not require any external calibration light signal and can be programmed on the fly as needed. Although the device switching capabilities are limited by the electronic elements used as the phase-element, once the architecture is fully programmed, it can fully process any number of signals. The 6-bit digital phase-shifters utilized in our experimental demonstration operate at 3.3 V. The total current for the 20 phase shifters involved was measured to be 0.1 mA, which leaves the total power of the device at 330 µW. The device was proved to achieve pass-through (diagonal matrix) and cross (antidiagonal matrix) operations within the prescribed accuracy at the operational frequency of 2.1 GHz. These matrices are particular cases of the sparsest matrices in the U(4) group, which makes the device a signal rerouting solution while also serving as a benchmark for testing the device's operation. In turn, the Hadamard matrix tested in the manuscript is an example of a dense matrix. Experimental results also show excellent agreement with the required target, achieving the same accuracy as in the sparse cases. Furthermore, the Hadamard matrix is used in image compression and feature extraction, which is computationally simpler than the Fourier transform<sup>39</sup>. Implementations of the discrete fractional Fourier transform, and its applications have been shown to be feasible in the optical and microwave domain<sup>29,41</sup>, which can be deployed in the current RF device by tuning the proper phase elements. This is particularly useful for beamforming applications<sup>31,32</sup>. Although there is a higher reconstruction error for other frequencies (Fig. 3c), one can reoptimize the phase elements by using the non-tunable F layer of Fig. 2 at a different operational frequency. Since the transmission matrix intensity of F is stable over the frequency spectrum 1.5-3.0 GHz (Fig. 2b, c), one can, in principle, program our unitary matrix solver at any other frequency if needed. This makes the proposed device flexible to operate at different frequencies. The observed performance degradation beyond 2.5 GHz is attributed to phase mismatches in the fixed-layer transmission network. Extending broadband performance can be achieved through optimized wideband power divider designs or dynamic reprogramming of the phase layers to compensate for frequency-dependent variations, an important avenue for future development. The reconstruction error between the theoretical and experimental transmission matrices at 2.1 GHz is visually illustrated in Fig. 3d-f, including the percentage error is also included for reference. This showcases a high-fidelity performance, rendering a percentage error of around 0.2% for all configurations. It is worth mentioning that such accuracy was achieved by considering both the measured amplitude and phases (up to a global factor). Moreover, robustness against temperature and fabrication variations is supported using stable materials and components in the fixed power divider layers and fine-resolution digitally controlled phase shifters. The self-calibration procedure allows dynamic compensation of performance drifts, enabling consistent high-fidelity operation under varying environmental conditions. The present device is intended as a proof of concept, designed to demonstrate its potential, and it is thus not limited to the applications discussed here. Indeed, further linear non-unitary transformations can be deployed via the SVD of the operation in question and by cascading unitary devices, such as the one proposed in this manuscript. This has been previously reported in the optical domain<sup>9,22</sup>. Beyond the current implementation, the device can be extended to millimeter-wave (mmwave) and terahertz (THz) frequencies. The demonstrated programmable analog matrix circuit is especially suitable for RF signal processing tasks that demand real-time, low-latency, and low-power operation, such as phased-array beamforming, signal routing in radar and communication systems, and microwave quantum information processing. While the device currently targets unitary transformations, its underlying principles can be extended to more complex linear operations to meet application-specific demands. Future work will explore tailoring the architecture to domain-specific requirements including bandwidth, noise resilience, and integration density. For instance, in mm-wave communications, the device can enhance data transmission rates and improve the performance of wireless links in dense urban environments and inter-satellite communication links. The device can further enable ultra-high-speed data transfer in THz systems and support scientific research in material characterization and molecular spectroscopy<sup>42</sup>. Although the present architecture scales with $O(N^2)$ phase elements, posing integration challenges for large-port systems, future implementations using monolithic microwave integrated circuits (MMICs) or hybrid photonic-assisted RF processors may offer practical pathways to scalable, low-overhead integration. Though direct amplitude and phase control on each input-output path is possible, it scales quadratically with the number of ports, leading to increased device complexity and power consumption. The proposed interlaced architecture, based on cascaded passive power dividers and programmable phase shifters, reduces the active element count to $O(N^2)$ , enabling more compact and energy-efficient implementations. Additionally, the natural enforcement of unitarity through cascaded interference ensures signal integrity and facilitates robust calibration and compensation, advantages not easily achieved in direct amplitude-phase control arrays. Scalability to higher port counts entails increased circuit size and insertion loss, as well as potentially amplified mismatch effects. Our experimental results with a 4-port device suggest that M=N+1 phase shifter layers suffice for universality, but practical system design must balance these factors. Future work will focus on optimizing component selection, minimizing loss through MMIC integration, and exploring advanced calibration schemes to support larger-scale implementations. To better contextualize the advantages of the proposed analog matrix circuit, we provide a comparative analysis against conventional digital signal processing systems. Table 1 summarizes key differences in terms of relevant metrics bandwidth, latency, power efficiency, scalability, and integration approach. ### **Methods** #### Model optimization for theoretical and experimental setup Numerical optimization techniques are used to maximize or minimize a figure of merit that quantitative measures the performance of a theoretical model or a parametric device. Throughout the manuscript, we test the theoretical device architecture in Eq. (1) and the fabricated device, which in both cases depends on the preliminary selection of testing unitary matrices as targets. For this task, we use the customary figure of merit $$L(\Phi) = \frac{||X(\Phi) - X_{\text{target}}||^2}{N^2},$$ (2) where $X(\Phi)$ is the trainable quantity, $\Phi$ the set tunable parameters, and $\|\cdot\|$ the Frobenius norm. Table 1 | Comparison between Digital Signal Processor (DSP)43,44 and the proposed Analog Signal Processor (ASP) | Metric | Typical digital signal processor (DSP) | Proposed analog signal pro-<br>cessor (ASP) | Remarks | |------------------------------------------|----------------------------------------|----------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Operating real-time bandwidth | <200 MHz | ~1 GHz | <ul> <li>ASP handles wideband signals directly in the RF domain.</li> <li>Bandwidth can be improved through frequency-dependent calibration or higher-bandwidth subcomponents.</li> <li>DSP bandwidth is strictly limited by ADC/DAC conversion and digital clock speed.</li> </ul> | | Latency | >500 ns | <10 ns | <ul> <li>ASP eliminates ADC/DAC latency and digital logic delays, relying on pure wave propagation.</li> <li>DSP suffers from significant delay due to sampling, buffering, and logic execution.</li> </ul> | | Power consumption | >300 mW | ~330 µW | <ul> <li>ASP operates at ultra-low power due to passive or low-power phase shifters.</li> <li>DSP power is dominated by high-speed switching, memory access, and digital logic complexity.</li> </ul> | | Scalability | Moderate | Moderate | <ul> <li>ASP scaling is limited by the spatial encoding, and its dimensions scale linearly with the size of the matrix operation.</li> <li>DSP scaling is challenged by increasing thermal density, timing closure, and silicon area constraints.</li> </ul> | | Precision | High | Moderate | <ul> <li>ASP achieves sufficient precision for many real-time applications like beamforming, filtering, and analog matrix operations.</li> <li>The proposed design enables compensation for variations through calibration, achieving high accuracy.</li> <li>Digital logic offers high precision limited by the DSP floating point.</li> </ul> | | Integration approach | ASICs, FPGAs | PCB/MMICs | <ul> <li>ASP leverages mature MMICs and PCB technologies with<br/>high-speed interconnects and compact integration.</li> <li>DSP relies on CMOS and expensive ASIC/FPGA platforms<br/>with long development cycles.</li> </ul> | | Functionalities & Application<br>Domains | General-purpose<br>computation | Specialized: linear operations, RF front-<br>end, direction finding, beamforming | <ul> <li>ASP processors are ideal for fast, efficient RF tasks: signal conditioning, switching, interference suppression, spatial filtering.</li> <li>DSP is general-purpose but inefficient for RF front-end processing in terms of speed and power.</li> </ul> | For the theoretical analysis, Eq. (2) is combined with Eq. (1) by using the ideal and perturbed F mixing matrix. In such a case, each power divider is represented by $U_{PD} = \frac{\sigma_0 - i\sigma_1}{\sqrt{2}}$ , with $\sigma_j$ the Paulli matrices. The latter corresponds to the ideal manufacturer specification for the power-divider unit. Thus, the ideal mixing matrix becomes $F = (\mathbb{I}_2 \otimes U_{PD})(\mathbb{I}_1 \oplus U_{PD} \oplus \mathbb{I}_1)$ , with $\otimes$ and $\oplus$ the direct-product and direct-sum operations, respectively. For the experimental calibration, the S-parameters of the F matrix are extracted and used in the optimization routine to extract the phases to be set in the experimental run. # **Data availability** The experimental datasets associated with this work are available at https://doi.org/10.6084/m9.figshare.29633726. ## References - Hu, J. et al. Diffractive optical computing in free space. Nat. Commun. 15, 1525 (2024). - Zangeneh-Nejad, F., Sounas, D. L., Alù, A. & Fleury, R. Analogue computing with metamaterials. Nat. Rev. Mater. 6, 207 (2021). - 3. Silva, A. et al. Performing mathematical operations with metamaterials. *Science* **343**, 160 (2014). - Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004 (2018). - Mohammadi Estakhri, N., Edwards, B. & Engheta, N. Inversedesigned metastructures that solve equations. Science 363, 1333 (2019). - Reck, M., Zeilinger, A., Bernstein, H. J. & Bernstein, H. J. Experimental realization of any discrete unitary operator. *Phys. Rev. Lett.* 73, 58 (1994). - 7. Tzarouchis, D., Edwards, B. & Engheta, N. Programmable wave-based analog computing machine: a metastructure that designs metastructures. *Nat. Commun.* **16**, 1–7 (2025). - 8. Zayed, A. I. Linear transformations in signal and optical systems. In *Operator theory*, 1–39 (Springer, 2015). - 9. Miller, D. A. Self-configuring universal linear optical component. *Photonics Res.* **1**, 1 (2013). - Schaeff, C., Polster, R., Huber, M., Ramelow, S. & Zeilinger, A. Experimental access to higher-dimensional entangled quantum systems using integrated optics. *Optica* 523-529, 2 (2015). - 11. Carolan, J. et al. Universal linear optics. Science 349, 711-716 (2015). - Clements, W. R., Humphreys, P. C., Metcalf, B. J., Kolthammer, W. S. & Walmsley, I. A. Optimal design for universal multiport interferometers. *Optica* 3, 1460 (2016). - 13. Harris, N. C. et al. Quantum transport simulations in a program-mable nanophotonic processor. *Nat. Photonics* **11**, 447–452 (2017). - Aghaee Rad, H., Ainsworth, T. & Alexander, R. N. Scaling and networking a modular photonic quantum computer. *Nature* 638, 912–919 (2025). - Jiang, W. et al. Optically heralded microwave photon addition. *Nat. Phys.* 19, 1423–1428 (2023). - Weaver, M. et al. An integrated microwave-to-optics interface for scalable quantum computing. *Nat. Nanotechnol.* 19, 166–172 (2024). - Sahu, R. et al. Entangling microwaves with light. Science 380, 718–721 (2023). - Harris, N. C. et al. Linear programmable nanophotonic processors. Optica 5, 1623–1631 (2018). - Bogaerts, W. et al. Programmable photonic circuits. Nature 586, 207 (2020). - Pérez, D., Gasulla, I. & Capmany, J. Field-programmable photonic arrays. Opt. Express 26, 27265–27278 (2018). - Mojaver, K. R., Zhao, B., Leung, E., Safaee, S. M. R. & Liboiron-Ladouceur, O. Addressing the programming challenges of practical interferometric mesh based optical processors. *Opt. Express* 31, 23851–23866 (2023). - Markowtiz, M., Zelaya, K. & Miri, M.-A. Learning arbitrary complex matrices by interlacing amplitude and phase masks with fixed unitary operations. *Phys. Rev. A* 110, 033501 (2024). - Zelaya, K., Markowitz, M. & Miri, M.-A. The Goldilocks principle of learning unitaries by interlacing fixed operators with programmable phase shifters on a photonic chip. Sci. Rep. 14, 10950 (2024). - Zou, X. et al. Photonics for microwave measurements. Laser Photonics Rev. 10, 711–734 (2016). - Marpaung, D., Yao, J. & Capmany, J. Integrated microwave photonics. Nat. Photonics 13, 80–90 (2019). - Yao, J. Microwave photonics. J. Lightwave Technol. 27, 314–335 (2009). - Pan, S., Ye, X., Zhang, Y. & Zhang, F. Microwave photonic array radars. *IEEE J. Microw.* 1, 176–190 (2021). - Chung, M., Lin, C. & Meiy, I. A 10×10 MIMO multiband broadband planar antenna for multiband applications. *Int. J. RF Microw. Com*put. Aided Eng. 1, 4839224 (2024). - Keshavarz, R., Shariati, N. & Miri, M.-A. Real-time discrete fractional fourier transform using metamaterial coupled lines network. *IEEE Trans. Microw. Theory Tech.* 71, 3414 (2023). - Imani, M. et al. Review of metasurface antennas for computational microwave imaging. *IEEE Trans. Antennas Propag.* 68, 1860–1875 (2020). - 31. Zhu, C. et al. Silicon integrated microwave photonic beamformer. Optica 7, 1162–1170 (2020). - Keshavarz, R., Zelaya, K., Shariati, N. & Miri, M.-A. An ultra-compact microstrip arrayed lines lens for beam steering and direction finding. (2024). (in press). - 33. Daulay, O. et al. Ultrahigh dynamic range and low noise figure programmable integrated microwave photonic filter. *Nat. Commun.* **13**, 7798 (2022). - Kudelin, I. et al. An optoelectronic microwave synthesizer with frequency tunability and low phase noise. *Nat. Electron.* 7, 1170 (2024). - 35. Tang, Z., Li, Y., Yao, J. & Pan, S. Photonics-based microwave frequency mixing: methodology and applications. *Laser Photonics Rev.* **14**, 1800350 (2020). - Markowitz, M., Zelaya, K. & Miri, M.-A. Auto-calibrating universal programmable photonic circuits: hardware error-correction and defect resilience. Preprint at https://arxiv.org/abs/2308. 09151 (2023). - TTM Technologies. Ultra Low Profile 0805, X4C20J1-03G Rev F. https://cdn.ttm.com/repository/products/wireless-xinger/3dbhybrid-couplers/X4C20J1-03G/X4C20J1-03G.pdf. - MACOM. Digital Phase Shifter 6-Bit, 1.4-2.4 GHz, Rev. 5 https://www.macom.com/products/product-detail/MAPS-010163. - 39. Seberry, J., Wysocki, B. J. & Wysocki, T. A. On some applications of Hadamard matrices. *Metrika* **62**, 221–239 (2005). - Kim, S. et al. Scaling-CIM: EDRAM in-memory-computing accelerator with dynamic-scaling ADC and adaptive analog operation,. IEEE J. Solid State Circuits 59, 2694–2705 (2024). - Honari-Latifpour, M., Binaie, A., Eftekhar, M., Madamopoulos, N. & Miri, M. Arrayed waveguide lens for beam steering, *Nanophotonics* 11. 3679–3686 (2022). - 42. Banks, P. A., Kleist, E. M. & Ruggiero, M. T. Investigating the function and design of molecular materials through terahertz vibrational spectroscopy. *Nat. Rev. Chem.* **7**, 480–495 (2023). - 43. Analog Devices, 2025. [Online]. Available: https://www.analog.com (2025). - 44. Texas Instruments, 2025. [Online]. Available: https://www.ti.com (2025). # **Acknowledgements** This project was supported by the U.S. Air Force Office of Scientific Research (AFOSR) Young Investigator Program (YIP) Award# FA955022-1-0189. #### **Author contributions** R.K. conducted the design of the microwave system and performed circuit-level simulations. R.K. and N.S. oversaw the hardware implementation and conducted the experimental testing and validation of the system. K.Z. and M.-A.M. performed the theoretical analysis, conducted the numerical simulations, developed the phase-optimization routine. K.Z. prepared the initial manuscript. M.-A.M. conceived the idea and supervised the project. All authors contributed to the review and editing of the final manuscript. # **Competing interests** The Authors declare no competing interests. #### **Additional information** **Supplementary information** The online version contains supplementary material available at https://doi.org/10.1038/s41467-025-63486-z. **Correspondence** and requests for materials should be addressed to Mohammad-Ali Miri. **Peer review information** *Nature Communications* thanks Kiat Seng Yeo and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available. **Reprints and permissions information** is available at http://www.nature.com/reprints **Publisher's note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/. © The Author(s) 2025