Scalable photonic reservoir computing for parallel machine learning tasks

Aadhi, A.; Di Lauro, L.; Fischer, B.; Dmitriev, P.; Alamgir, I.; Mazoukh, C.; Perron, N.; Viktorov, E. A.; Kovalev, A. V.; Eshaghi, A.; Vakili, S.; Chemnitz, M.; Roztocki, P.; Little, B. E.; Chu, S. T.; Moss, D. J.; Morandotti, R.

doi:10.1038/s41467-025-67983-z

Download PDF

Article
Open access
Published: 31 December 2025

Scalable photonic reservoir computing for parallel machine learning tasks

Nature Communications volume 17, Article number: 1225 (2026) Cite this article

8741 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

Neuromorphic photonics enables brain-inspired information processing with higher bandwidth and lower energy consumption than traditional electronics, addressing the growing computational demands of the Internet of Things, cloud services, and edge computing. However, even current state-of-the-art electronic and photonic platforms are incapable of delivering the scalable throughput, multitasking processing, and energy efficiency required by these applications. Here, we demonstrate a tunable photonic reservoir computing device based on a nonlinear amplifying loop mirror (NALM), leveraging a time-delayed, single-unit, all-optical architecture. By combining dense temporal encoding with wavelength-division multiplexing, the system supports concurrent multitasking across independent data channels, enabling scalable computational performance without additional hardware complexity. Experiments and theoretical validation on classification and prediction benchmarks demonstrate the device’s performance, achieving a throughput of 20 tera-operations-per-second and an energy efficiency of 4.4 fJ per operation. These results highlight a promising path towards reconfigurable, compact, and high-performance photonic processors for real-time intelligent applications.

Photonic neuromorphic architecture for tens-of-task lifelong learning

Article Open access 26 February 2024

All-optical temporal integration mediated by subwavelength heat antennas

Article Open access 23 December 2025

Integrated photonic neuromorphic computing: opportunities and challenges

Article 06 June 2024

Introduction

The rapid evolution of artificial intelligence (AI) and machine learning (ML) has revolutionized data processing, driving demand for computational platforms that are capable of handling vast amounts of data with high processing speed and minimal latency^1,2. Neuromorphic photonics presents a promising solution to these challenges^{2,3,4,5,6,7,8} as it leverages optical systems to emulate neural networks, thus providing higher bandwidth, faster speeds, lower energy consumption, and smaller footprints compared to traditional electronic processors^7,9,10,11. This solution can also lower carbon emissions, especially when training on large datasets. However, existing photonic architectures have critical limitations in computational throughput, energy efficiency, and latency, thus hindering their ability to meet the growing computational needs of AI, deep learning¹², and other real-time-based applications^13,14. Neuromorphic computing frameworks, such as extreme learning machines¹⁵, wave-based neural networks¹⁶, and reservoir computing (RC)^17,18, have emerged as compelling approaches for the efficient implementation of photonic processing devices for ML.

Optical RC systems exploit physical dynamics to process input data, thus reducing training time and complexity while improving computational efficiency. To date, single-node optical RC architectures have been demonstrated using active components (such as semiconductor lasers¹⁹, amplifiers^20,21, and electro-optic modulators^22,23), as well as passive components²⁴ (including saturable absorbers^25,26 and photodiodes²⁷). Despite significant progress, current RC implementations remain limited by their inherent architecture, in which nodes are created and interconnected through feedback mechanisms^27,28. These mechanisms simplify the design by generating virtual nodes, but also introduce latency and limit processing speed, which is defined as system throughput (i.e., the total number of operations per second across all input data streams within a single reservoir unit). Recent efforts to increase computational throughput in RC have focused on faster electro-optical modulation and advanced data encoding schemes, such as pulse-amplitude modulation^29,30,31 (PAM), often paired with wavelength-division multiplexing (WDM). However, these methods frequently rely on sequential sweeping across WDM channels^32,33,34, or time-interleaving techniques^{35,36,37,38,39}, which both introduce throughput latency and prevent the concurrent execution of independent tasks. Additional attempts at further scaling up performance have involved adding components or increasing the number and dimensionality of computational layers and readout components^5,39. Typically, these result in decreased energy efficiency and processing speeds. Furthermore, existing WDM-based RC architectures^21,38,39 also have limited tunability, when design constraints inherently fix both the reservoir memory and the activation function. At the same time, nonlinearity is introduced electronically at the input/output stage, resulting in optical-to-electrical-to-optical (OEO) latency and thus limiting the richness of reservoir dynamics. Such constraints significantly reduce system flexibility and adaptability for diverse ML tasks. Additionally, architectures employing spectral mixing approaches²¹ encounter nonlinear crosstalk⁴⁰, which, in turn, limits the number of independently encoded channels and consequently degrades system accuracy and scalability.

To address these challenges, we demonstrate an RC device based on a nonlinear amplifying loop mirror (NALM) architecture. We leverage the NALM configuration for its proven ability to generate highly stable and nonlinear dynamics^41,42,43,44, which are essential for optical reservoir computing. These dynamics enhance the reservoir’s computational capacity by enabling complex input mapping into a high-dimensional state space, thereby improving sensitivity to extremely small signal variations^45,46. The intrinsic stability of the NALM ensures consistent classification performance, even under noisy or fluctuating input conditions.

Our device features an integrated nonlinear spiral waveguide (SW) and a semiconductor optical amplifier (SOA), both of which modulate the NALM overall nonlinear transfer function (i.e., the reservoir activation function). In contrast to previous RC implementations that rely on SOAs operating in saturation, whether in unidirectional configurations^47,48,49 or through cross-gain modulation^50,51, we set the SOA below saturation for all our experiments. Saturated SOAs exhibit plateaued gain, limiting the dynamic range and tunability of the nonlinear response. The unsaturated regime, on the other hand, allows for precise tunability of the gain and, therefore, of the reservoir activation function. Furthermore, by adjusting the intensity of the propagating field using a variable optical attenuator (VOA), we achieve additional control over the reservoir’s fading memory response, significantly enhancing both versatility and stability for diverse computational tasks.

In line with the standard single-node reservoir computing framework, we establish virtual nodes within the reservoir via time-multiplexing, with training restricted to the readout layer. However, by employing multiple synchronized continuous-wave (CW) laser sources, our device can support multi-task processing across wavelength channels. This enables true concurrency for parallel data injection, both for the same or different machine learning tasks. The latter significantly reduces hardware complexity and addresses scalability and performance bottlenecks in earlier architectures. As a result, we have achieved a throughput of up to 20 tera-operations per second (TOPS), consuming only a few femtojoules per operation, while maintaining high accuracy in both classification and regression tasks. The device setup, as depicted in Fig. 1, encompasses a figure-eight fiber cavity that enables efficient multi-processing and provides rich system tunability. Such figure-eight fiber shapes can produce a range of coherent nonlinear dynamics, both with⁴¹ and without^42,43 CW injection. By leveraging dynamic regimes and higher dimensionality enabled by virtual nodes, we accurately separate and classify continuous-valued input signals, even when they are complex and highly nonlinear.

Results

We benchmarked our device to solve linearly inseparable problems using the m-bit delayed parity task, also known as the generalized exclusive OR (XOR)¹⁶. This nonlinear task performs the XOR operation between the current bit and the m prior bits stored in the reservoir memory and classifies the resulting sequences into two distinct classes. We injected a pseudorandom binary sequence (PRBS) of 4000 bits into a single-wavelength channel using a CW source (see the setup in Fig. 1) in the input device loop. Through the encoding procedure described in the Methods section entitled “Device architecture”, we generated 161 virtual processing nodes. Consistent with the RC paradigm, training was applied exclusively to the output weights, while the internal node connection weights remained fixed. These internal weights are inherently defined by the device configuration and the specific components that were employed, such as the SOA, the integrated SW, and the variable optical attenuator (VOA) (see Supplementary Note 1). We applied ridge regression²⁷ to determine the optimal output weights required to reconstruct the target signal. The accuracy of the trained model was evaluated using standard performance metrics, such as the normalized root-mean-square error (NRMSE).

We investigated the classification performance when varying the SOA gain and the signal attenuation of the VOA by tuning their driving voltages, ${V}_{{\mathrm{SOA}}}$ and ${V}_{{\mathrm{VOA}}}$, respectively (which correspond to a specific level of amplification gain $g$ and attenuation level ${A}_{{\mathrm{VOA}}}$). Figure 2 shows the NRMSE maps ($g$ and ${A}_{{\mathrm{VOA}}}$) for 2-bit, 3-bit, and 4-bit delayed sequences in (a), (b), and (c), respectively. For the 2-bit XOR, we achieved 100% classification accuracy for the validation dataset, as shown in the dark-blue shaded region of Fig. 2a, which is defined by $g < 8.3{{\rm{dB}}}$ and ${A}_{{\mathrm{VOA}}} < 4.9 \, {{\mathrm{dB}}} \left({V}_{{\mathrm{SOA}}} \right. < 0.8 \, {{\mbox{V}}}$ and ${V}_{{{\rm{VOA}}}} > 3{{\rm{V}}}$). These values delimit operational regimes characterized by reduced nonlinear contributions from the SOA and by short fading memory controlled by the VOA.

**Fig. 2: Accuracy maps for nonlinear processing optimization.**

For the 3- and 4-bit XOR sequences, a further nonlinear contribution to the overall response (i.e., the activation function) of the reservoir is required to enhance the output separability and reduce the NRMSE¹⁶. This is achieved when setting $g > 6.1{{\rm{dB}}}$ and ${A}_{{\mathrm{VOA}}} > 0.3{\mathrm{dB}}$ (${V}_{{{\rm{SOA}}}} > 0.5{{\rm{V}}}$ and ${V}_{{\mathrm{VOA}}} > 1.5{{\rm{V}}}$), respectively, for the 3- and 4-bit cases. This enables the device to achieve accuracies of 97% and 80% (with NRMSEs of 3% and 20%), as reported in Fig. 2b and c.

Reservoir tunability enables handling ML tasks with varying degrees of data separability and temporal correlations, as evident in the analysis of the effect on the propagating input signals as a function of the two tunable parameters. When an input signal, such as an optical field, enters the device reservoir loop through the 3 dB (50:50) input beamsplitter (Fig. 1), it separates into propagating and counter-propagating field components. Their phase difference, after they propagate in opposite directions and recombine at the beamsplitter, is affected by sequences of amplification and nonlinear interactions mediated by the intensity-dependent Kerr nonlinearity in the integrated SW^52,53. During such an interaction, an increase in the amplification gain causes the SOA to contribute additional nonlinear responses, thereby enhancing classification performance when further nonlinear contributions are required⁵⁴, without altering the physical reservoir topology. This unsaturated operational regime enables us to implement a controllable mechanism mediated by tunable parameters, allowing for the exploitation of nonlinearities arising from controlled optical feedback and intensity-dependent interference within the NALM. This approach allows for precise and reconfigurable reservoir functionality. In contrast to the cross-gain saturation-based nonlinearities demonstrated in earlier works^50,51. Our architecture provides greater flexibility and control over reservoir dynamics. Generally, in time-delayed RC, a higher reservoir nonlinearity can impair the fading memory, which is determined by the system’s round-trip time^46,55. By fine-tuning the input intensity attenuation through ${V}_{{\mathrm{VOA}}}$, we can modify the strength of the interconnection among the reservoir virtual nodes. This adjustment regulates the amount of circulating intra-cavity power per round-trip, thereby compensating for the reservoir’s diminished capacity to store bits within its fading memory⁵⁶.

Multi-processing and parallel operation

We established the multi-processing capabilities of our device by carrying out two well-known ML benchmarks in parallel: the Mackey-Glass (MG) and the 10th-order Nonlinear Auto Regressive Moving Average (NARMA₁₀) time series prediction tasks. These two tasks address the nonlinearity and memory of the reservoir, challenging the trade-off that occurs between them. The chaotic nature of the MG system, with high sensitivity to initial conditions, requires a reservoir with high nonlinearity. On the other hand, NARMA₁₀ requires a reservoir with a long fading memory to efficiently retain temporal dependencies among data points across various round-trips. Therefore, by performing both tasks simultaneously, we demonstrate the universal multi-processing potential of our device in tackling both of these diverse ML tasks.

We use two wavelength channels, one at 1549.2 nm for the MG task and the second at 1549.4 nm for NARMA₁₀. Crosstalk effects were reduced by applying a 0.2 nm spectral separation between the channels. We generated 1200 data points for each time series and encoded them onto ${{{\rm{N}}}}_{{{\rm{v}}}}=161$ virtual nodes in the reservoir through dense WDM. Based on the results of the m-bit XOR task (where the highest accuracies are achieved within the dark-blue-shaded region shown in Fig. 2a), we found that the best performance can be reached with $g=10.3{{\rm{dB}}}$ (${V}_{{{\rm{SOA}}}}=1.5{{\rm{V}}}$) and ${A}_{{\mathrm{VOA}}}\approx 0\,{\mathrm{dB}}$ (${V}_{{{\rm{VOA}}}}=0.5{{\rm{V}}}$). We applied offline training and cross-validation⁵⁷ independently on the two de-multiplexed outputs to obtain the weights that minimize the NRMSE between the targeted and predicted outputs. The results in Fig. 3 show misclassification rates of just 6% for MG and 1% for NARMA10, indicating high classification accuracy in both cases.

Fig. 3: Performance results from MG and NARMA10 multi-processing. — **Fig. 3: Performance results from MG and NARMA₁₀ multi-processing.**

Computational scalability of the device

We demonstrate system scalability through both increased bit density and parallel channel encoding, enabling the scaling up of computational throughput without requiring additional physical nodes. We next applied the device to solve a real-world task: recovering a nonlinearly perturbed telecom signal. Signal distortion, attenuation, dispersion, noise, and crosstalk originating from fiber propagation can all degrade or disrupt communications. Therefore, channel recovery techniques are essential for preserving signal integrity, minimizing downtime, and ensuring quality of service in networks^29,58. This requires nonlinear, low-latency, high-speed processing capabilities. As shown by previous benchmarks, our device offers features that can meet these requirements, and its computational speed can be further enhanced by employing a dense information encoding scheme alongside multi-channel processing. The dense encoding scheme (see the Methods section entitled “Information encoding and readout”) enables us to inject multiple bits per wavelength channel into the reservoir, thereby increasing speed without adding extra computational units.

We generated a PRBS of 5 × (2¹⁵ − 1) bits and electro-optically encoded it onto a CW carrier signal. We then let the signal propagate through 60 km of standard single-mode fiber (SMF). Here, the once pristine signal experiences dispersion and nonlinear distortion, including cross-phase modulation and four-wave mixing, both of which are influenced by fiber length and signal intensity. Figure 4a and b show the time traces of the modulated field intensity using PAM^30,31 before and after propagation, respectively, along with their corresponding eye diagrams. The distorted signal is then distributed and encoded across five independent channels, spaced at 0.2 nm intervals around 1550 nm (as shown in Fig. 4b), which we process concurrently via dense WDM. The number of encoding channels is limited only by the available CW sources and electro-optical intensity modulators (IMs). The representative bit sequences before and after recovery are shown in the Supplementary Fig. 5.

**Fig. 4: Pristine vs. distorted telecom signal.**

We trained and validated the output weights for each channel across four virtual nodes using the encoding technique and offline ridge regression. The bit error rate (BER, the metric that quantifies the percentage of misclassified bits in the recovered sequence) is shown in Fig. 5 (a), illustrating the individual channels and different input power levels (P_in = 2.6 dBm, 3.8 dBm, 4.2 dBm, 4.6 dBm, 5.0 dBm, and 5.4 dBm), which are commonly used in telecommunications. We observed that the BER decreases as the input signal power increases, due to improved signal-to-noise ratio.

**Fig. 5: BERs of the five channels as a function of input power.**

This change occurs regardless of the surge in distortion due to the intensity-dependent Kerr effect on transmission, which our approach can compensate for. The lowest BER value is 2 × 10⁻⁴ at 5.4 dBm of input power, representing a 145-fold improvement compared to the preprocessed distorted signal (see Fig. 5b). The inset of Fig. 5c displays a segment of the original bit sequence alongside the corresponding distorted optical signal, measured after propagation through the 60 km fiber link (prior to recovery processing). Figure 5d shows the recovered bit sequence at an input power of 5.4 dBm, with a zoomed-in segment highlighting the bit errors (marked with red squares).

The experimental results were validated using the model derived from the generalized nonlinear Schrodinger equation⁵⁹ (GNLSE) (see Supplementary Note 1). We report the average BER over the five wavelength channels (red line) and the simulated BER values (black dots) in Fig. 5a. We observed excellent agreement between the numerical predictions and our experimental results, confirming the accuracy of our model.

We can quantify the throughput of our device in terms of MAC (multiply-and-accumulate) operations per second for parallel signal recovery. The device achieves a 2.7 Tera-MAC s^-1. From the MAC calculation, we estimate a speed of 20 Tera-operations per second (TOPS) for our device. In terms of energy efficiency, the overall power consumption of the processing unit is approximately 87 mW, leading to consumption of ~ 4.4 fJ OP^-1 (see Supplementary Note 4 for detailed calculations), which outperforms both state-of-the-art fiber-based optical neural networks and photonic chips^{5,8,35,60,61,62}. See Table 1 for a numerical comparison.

Table 1 Comparison of speed and energy consumption in photonic processing devices

Full size table

Discussion

In this work, we developed a tunable neuromorphic computing device based on an NALM configuration, thereby realizing a time-delayed RC circuit. This device was applied to the high-speed, low-latency processing of nonlinear, complex ML tasks, including classification for forecasting and regression for telecom signal recovery.

This device achieves high classification accuracy in the m-bit delayed-XOR task by optimizing reservoir parameters. It also demonstrates effective handling of parallel machine learning tasks (i.e., multitasking), as shown by its performance on the MG and NARMA₁₀ time series, with a minimum prediction accuracy of more than 94%.

The reservoir’s throughput is directly linked to its fundamental clock frequency, which is defined as the inverse of the round-trip time^63,64. In our system, this corresponds to 13.3 MHz (75 ns round-trip), which sets the device’s intrinsic throughput and, therefore, its inherent throughput. However, because the architecture supports the simultaneous processing of multiple wavelength channels, the system can scale its throughput without requiring additional physical components. Using Eq. (17) reported in the Supplementary Note 4.1, we calculated the device throughput for telecom signal recovery, obtaining a value of 20 TOPS and a minimum BER of 2 × 10⁻⁴ for the recovered signal.

Advanced digital signal processing (DSP) methods, including digital backpropagation⁶⁴, typically report BER values around 2 × 10⁻². Real-time field-programmable gate array-based coherent receivers have achieved 2.4 × 10⁻⁴ at 15 Gbaud with 16-quadrature amplitude modulation (16-QAM)⁶⁵. These approaches require thousands of multipliers and significantly higher power consumption^64,66, typically around 3 pJ per bit per channel^67,68, with latency between 100 ns and 1 μs, depending on the system clock⁶⁹. In contrast, optical signal processing methods offer a latency of 20 to 200 ps and energy consumption ranging from 0.1 to 1 pJ per bit per channel^70,71.

In terms of signal quality enhancement achieved after compensation, electronic neural networks⁷² can reach a Q-factor improvement of 2 dB with a BER of about 1.1 × 10⁻³, while photonic–electronic hybrid networks⁷³ can achieve an improvement of about 0.8 dB with a BER around 10⁻⁴.

Our device achieves a BER of 2 × 10⁻⁴ after 60 km of transmission at an input power of 5.4 dBm, corresponding to a Q-factor improvement of approximately 4.6 dB (see Fig. 5b). It operates at 4.4 fJ per operation, outperforming DSP and electronic neural-network processing as well as previously reported photonic neural-network implementations, as summarized in Table 1.

This performance enhancement stems from our reservoir architecture, which primarily relies on integrated passive and low-power active elements (see Supplementary Note 4.2). The design enables parallel processing with concurrent encoding, thus reducing peripheral overhead, eliminating redundant computational layers, and minimizing iterative losses.

Unlike conventional reservoir architectures that rely on sequential or interleaved data injection, our approach processes all wavelength channels concurrently within a single unit, thus enabling true parallelism. Combined with dense encoding, this removes the latency bottleneck and allows throughput to scale directly with the number of available channels. However, the maximum processing speed is constrained by the intrinsic carrier dynamics of the SOA and by the operational boundaries set to avoid saturation, which together define the highest achievable computational throughput. Under our operating conditions, and using the equation derived in Supplementary Note 4.1 (Eq. 21), the maximum distortion-free throughput is theoretically estimated at 24 TOPS per channel. It must be noted that while the architecture can support this level of throughput, its practical performance is lower due to the limited bandwidth of the encoding hardware.

Further scalability in processing can be achieved through dense wavelength-division multiplexing (DWDM), which provides the spectral density required for massively parallel operation. For instance, with 80 channels, as is typically achieved in standard DWDM, the architecture can reach a combined throughput of nearly two peta-operations per second (POPS) under binary modulation. By employing phase-sensitive processing and higher-order modulation formats, such as 64-quadrature-amplitude modulation (6 bits per symbol), the effective throughput can increase to several tens of POPS. Integrated multi-wavelength sources based on nonlinear processes such as four-wave mixing in microresonators enable this capacity by supplying many stable, phase-coherent channels on a single chip^{74,75,76,77,78}.

These findings open new avenues for the development of affordable, scalable neuromorphic platforms to address the growing demand for ultra-fast, energy-efficient solutions in next-generation photonic signal processing and AI applications.

Methods

Device architecture

The device’s figure-eight cavity design (see Fig. 1) can be divided into input, processing (i.e., the reservoir or unit), and output stages, all of which contribute to the overall round-trip time and thereby cause the reservoir’s memory to fade. The input loop comprises a variable optical VOA (Thorlabs, V1550A) that controls the signal power, a 10:90 coupler that monitors the system, a 50:50 coupler that injects the modulated signal, and a second coupler that splits the signals into clockwise and counterclockwise directions in the NALM configuration. The NALM loop is composed of an SOA (Inphenix C-Band IPSAD1505) and an integrated 45 cm-long SW. This waveguide is fabricated from a CMOS-compatible, highly doped silica glass^{52,53,78,79,80,81,82,83}. As such, the SW features high mode confinement due to the small waveguide profile (2 × 2 µm²), with group velocity dispersion coefficients of ${\beta }_{2}$ ∼ 10 ps² km⁻¹ and ${\beta }_{3}$ ~ 0.26 ps³ km⁻¹ at 1550 nm, as well as a large third-order nonlinear coefficient of γ = 220 W⁻¹ km⁻¹. The SW, together with the phase shift mechanism between the two counter-propagating fields, performs the overall nonlinear activation function of the reservoir^52,53. The SOA operates between the linear and nonlinear regimes, far from saturation, to avoid instabilities. These include self-oscillation and chaotic behaviors, as well as noise amplification and signal fluctuations, which would be detrimental to the current tasks. A polarization controller (PC) was used to select and optimize the transmission power of one field component, specifically either the transverse electromagnetic or magnetic field. Simultaneously, an optical bandpass filter (200 GHz) was used to limit the spectral bandwidth of the propagation fields, thereby improving the signal-to-noise ratio (SNR) and system stability by eliminating unwanted amplified spectral components. A 90:10 optical coupler is used to extract a fraction of the optical power from the loop to obtain a readout of the dynamic evolution of the node states, utilizing high-bandwidth photodiodes (PDs, Finisar XPDV21x0R series, 50 GHz bandwidth) and a digital oscilloscope (OS, Agilent DSO-x92804a, 28 GHz bandwidth). An additional OS is employed to perform the nonlinear recovery task. The overall length of the cavity is 14 m, corresponding to a round-trip time of approximately ${\tau }_{r}=75{{\rm{ns}}}$.

Information encoding and readout

We encode information (i.e., bits) in time-dependent vectors ${{{\rm{u}}}}_{{{\rm{i}}}}\left({{\rm{n}}}\right)$ by means of different arbitrary waveform generators (AWGs) (Keysight M8196A, 92 GSa s^-1, 2 channels each), each driving an equal number of electro-optical Mach-Zehnder intensity modulators (IMs) (Optilab IML-1550-40-PM-V-HER, 40 GHz, polarization-maintaining, high extinction ratio, ${V}_{\pi }$ = 3 V). All modulators were synchronized using an external clock. The IMs were excited by the time-varying voltage signal containing the data, which in turn modulated the independent laser modes (i.e., the wavelength channels) produced by a narrowband continuous multi-wavelength (CW) source (ID Photonics CoBrite DX G, 4 outputs, linewidth <100 kHz). An additional CW source (Tunics T100S-HP, single output, linewidth <5 MHz) was employed to provide the fifth channel. The injected power per channel ranged from 1 mW to 5 mW, depending on the task. The applied voltage to the IMs varied between [-${V}_{\pi }/2$, ${V}_{\pi }/2$], ensuring operation within their linear regime. When multi-processing is performed, the input channels are injected into the device via wavelength division multiplexing (WDM) using a waveshaper (WS).

The reservoir nodes are established by first sampling and holding the input signal containing data for a duration $\tau \approx {\tau }_{r}$, then multiplying the signal by a uniformly distributed, multi-level random mask $R\left(n\right)$, with values ranging between −1 and 1. The mask has a sampling period ${{\rm{\tau }}}$ and duration ${{\rm{\theta }}}$ per level, such that ${E}_{{{\rm{in}}}}\left(n\right)=u\left(n\right)\cdot R\left(n\right)$, where ${E}_{{{\rm{in}}}}\left(n\right)$ is the resulting input signal to the device. This procedure maps the signal onto a higher-dimensional space by generating a number of ‘virtual’ node states (i.e., internal variables)⁸⁴, the number of which is given by ${N}_{v}=\tau /\theta$. A slight desynchronization, $\delta \approx \theta$, was applied to the sampling period $\tau$ on the AWG input sequence to ensure that all node states were dependent on neighboring nodes¹⁷.

For dense encoding, we input more than one bit of information (${n}_{{{\rm{b}}}}$) in the reservoir per round-trip ${{{\rm{\tau }}}}_{{{\rm{r}}}}$. We apply the sample and hold procedure over the asynchronous sampling rate (i.e., the inverse of the masking rate) of duration ${{\tau }_{{{\rm{r}}}}}^{{\prime} }={\tau }_{{{\rm{r}}}}/{n}_{{{\rm{b}}}}+\delta {\prime}$ to generate $N\hbox{'}$ time-multiplexed transient states of duration ${\theta }^{{\prime} }={{\tau }_{{{\rm{r}}}}}^{{\prime} }/{N}_{v}^{{\prime} }$, for each ${{\tau }_{r}}^{{\prime} }$ time interval.

The small desynchronization $\delta {\prime}$ $\approx \theta {\prime}$ between the sampling and injection rates causes a progressive shift in the input seen by adjacent virtual nodes and by nodes across round-trips, ensuring that neighboring nodes access distinct input segments over time and maintain the data correlation required for reservoir computing processing.

The output states, ${x}_{i}\left(n\right)$, are measured from the virtual nodes at time intervals $\theta$ (or ${\theta }^{{\prime} }$) by using the PDs and OS. De-multiplexing using the WS is applied in cases of parallel computations. The average output power ranges from 100 μW to 300 μW per individual input channel.

M-bit delayed XOR task

To formally describe the m-bit delayed XOR task⁸⁵, we define the operation within a logical framework. Given the input bit sequence ${\{x(n)\}}_{m}$ with delay $n$ and the number of bits $m$, the output sequence ${\{{y}_{n}\}}_{m}$ can be described by the following equation:

$$\left\{\begin{array}{c}{y}_{n}={x}_{n}\oplus {x}_{n-1}\oplus {x}_{n-2}\oplus \ldots \oplus {x}_{n-m}{{\mbox{if}}} n\ge m\\ {y}_{n}={x}_{n}{{\rm{if}}}n < m \hfill\end{array}\right.$$

(1)

where ${x}_{n}$ and ${y}_{n}$ are the current and output bits at time ${{\rm{n}}}$, respectively.

Time series prediction tasks

The Mackey-Glass (MG) time series is defined by the following equation:

$${\dot{y}}_{{\mathrm{MG}}}\left(t\right)=\beta \frac{{y}_{{\mathrm{MG}}}\left(t-\tau \right)}{1+{y}_{{\mathrm{MG}}}^{p}\left(t-\tau \right)}-h{y}_{{\mathrm{MG}}}\left(t\right)$$

(2)

in which ${y}_{{{\rm{MG}}}}(t)$ is the output at a time $t$, $\tau$ is the delay time, $\beta$ is the growth rate, $h$ is the decay rate, and $p$ is the nonlinear parameter. Parameters were set as follows: $\tau=17,\beta=0.2,h=0.1,p=10$. To simulate MG time series behavior, ${{\rm{t}}}$ was discretized using a time step $\Delta t=1$ to produce values $x(t\equiv {t}_{k})$ at discrete time points ${t}_{k}=q\Delta {{\rm{t}}}$ for $q=0,1,2,\ldots$.

The 10th-order Nonlinear Auto Regressive Moving Average (NARMA₁₀) series follows the equation

$${y}_{N10}\left(t+1\right)=0.3y\left(t\right)+0.05y\left(t\right){\sum}_{i=0}^{9}y\left(t-i\right)+1.5u\left(t\right)x\left(t-9\right)+0.1$$

(3)

in which$\,{y}_{N10}\,\left(t\right)$ is the output at time $t$, ${{\rm{while}}}$ $u(t)$ is the input, chosen as a random sequence from a uniform distribution ${{\rm{U}}}[0,1]$. Similarly to the MG task, we applied a discretized time step $\Delta t=1$ to the NARMA₁₀ time series temporal variable $t$.

The time series were simultaneously and independently encoded onto two wavelength channels generated by the multi-output CW source. Two IMs and AWG channels drove this process. Following the previously described single-bit encoding technique, both signals were multiplexed using the WS and then injected simultaneously into the device. They were then de-multiplexed and read with the OS. Offline training (as detailed in the following sections) was performed on the two datasets collected.

Nonlinear telecom signal recovery task

The nonlinearly distorted signal, which also carries information, was obtained by propagating the corresponding pristine signal through 60 km of SMF-28, with a dispersion parameter D = 18 ps nm⁻¹ km⁻¹ and an attenuation constant α = 0.24 dB km⁻¹. An additional 1 m of highly nonlinear optical fiber (with γ = 0.78 W⁻¹km⁻¹) was added to account for the full extent of nonlinear effects over longer propagation distances, which is typical in telecommunications networks. The distorted signal, carrying 5 × (2¹⁵ – 1) bits, was sequentially split into five subsequences of 2¹⁵ – 1 bits each and then independently encoded into the five wavelength channels using five IMs and AWG channels. Each AWG channel was operated at ~6.5 GHz. All five channels were amplified to 5 mW using an erbium-doped fiber amplifier (EDFA, Keopsys, PEFA-SP-C-PM−27-B202), then multiplexed into the reservoir. The additional IM and AWG channel were used to create the temporal node transients ${\theta }^{{\prime} }$ via time multiplexing. These were implemented at a masking rate of approximately 24 GHz, creating 4 virtual nodes with a 20-level random mask, thereby enabling the encoding of 488 bits per round-trip. Offline training was performed on datasets built from the output node states of all five de-multiplexed wavelength channels, using the training procedure described in the following section.

Offline training

We employed a k-fold cross-validation approach, with k set to 5 for the MG and NARMA₁₀ tasks and to 10 for the nonlinear channel equalization task. We then split the output datasets, consisting of ${x}_{i}$ values, into equally sized k subsets. The machine learning (ML) model was trained using ridge regression on $k-1$ folds and validated on the remaining fold. This process was repeated k times, with each fold used once as the validation set. For each fold, we determined the model’s accuracy by calculating the normalized root mean square error (NRMSE) as

$${{\rm{NRMSE}}}=\sqrt{\frac{1}{{N}_{v}}{\frac{{\sum }_{i}^{{N}_{v}}\left({y{\prime} }_{i}-{y}_{i}\right)}{{\sigma }^{2}\left({y}_{i}\right)}}^{2}},$$

(4)

in which ${\sigma }^{2}$ is the sample variance, ${y{\prime} }_{i}$ is the targeted output, and ${y}_{i}$ is the weighted output: ${y}_{i}={\sum }_{1}^{{N}_{v}}{W}_{{ij}}^{{{\rm{out}}}}{x}_{j}$.

Finally, we calculated the average error across all iterations to obtain a more robust estimate of the model’s overall performance.

Simulations for nonlinear channel equalization

We employed a 10,000-pseudorandom binary sequence (PRBS) extracted from the experimentally generated distorted signal. We then sampled the signal using ultra-dense encoding, mirroring the experimental procedure previously described. We injected 488 bits per cavity round-trip over four nodes into a single channel. The device response was numerically simulated using the generalized nonlinear Schrödinger equation (GNLSE) and then implemented in MATLAB (see Supplementary Note 1).

Data availability

The data supporting the findings of this work study are available both within this article and in the Supplementary Information. The raw data generated from this study is available from the corresponding authors upon request.

Code availability

The numerical codes related to this study are available from the corresponding author upon request.

References

Hecht, J. The bandwidth bottleneck that is throttling the Internet. Nature 536, 139–142 (2016).
Article ADS CAS PubMed Google Scholar
Feldmann, J. et al. Parallel convolutional processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
Article ADS CAS PubMed Google Scholar
Chen, Y. et al. All-analog photoelectronic chip for high-speed vision tasks. Nature 623, 48–57 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Feldmann, J., Youngblood, N., Wright, C. D., Bhaskaran, H. & Pernice, W. H. P. All-optical spiking neurosynaptic networks with self-learning capabilities. Nature 569, 208–214 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).
Article ADS CAS PubMed Google Scholar
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668–673 (2014).
Article ADS CAS PubMed Google Scholar
Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nat. Photon. 15, 102–114 (2021).
Article ADS CAS Google Scholar
Xu, Z. et al. Large-scale photonic chiplet Taichi empowers 160-TOPS/W artificial general intelligence. Science 384, 202–209 (2024).
Article ADS CAS PubMed Google Scholar
Strubell, E., Ganesh, A. & McCallum, A. Energy and Policy Considerations for Modern Deep Learning Research. In Proc. AAAI Conference on Artificial Intelligence vol. 34, 13693–13696 (AAAI Press, Palo Alto, CA, USA, 2020).
Patterson, D. et al. Carbon Emissions and Large Neural Network Training. Preprint at https://doi.org/10.48550/arXiv.2104.10350 (2021).
Fu, T. et al. Optical neural networks: progress and challenges. Light Sci. Appl. 13, 263 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Xia, S. et al. Deep-learning-empowered synthetic dimension dynamics: morphing of light into topological modes. Adv. Photonics 6, 026005 (2024).
Article ADS Google Scholar
Baby, D., Van Den Broucke, A. & Verhulst, S. A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications. Nat. Mach. Intell. 3, 134–143 (2021).
Article PubMed PubMed Central Google Scholar
Dehner, C., Zahnd, G., Ntziachristos, V. & Jüstel, D. A deep neural network for real-time optoacoustic image reconstruction with adjustable speed of sound. Nat. Mach. Intell. 5, 1130–1141 (2023).
Article Google Scholar
Wang, J., Lu, S., Wang, S.-H. & Zhang, Y.-D. A review on extreme learning machine. Multimed. Tools. Appl. 81, 41611–41660 (2022).
Article Google Scholar
Fischer, B. et al. Neuromorphic computing via fission-based broadband frequency generation. Adv. Sci. 10, 2303835 (2023).
Article Google Scholar
Van der Sande, G., Brunner, D. & Soriano, M. C. Advances in photonic reservoir computing. Nanophotonics 6, 561–576 (2017).
Article Google Scholar
Rafayelyan, M., Dong, J., Tan, Y., Krzakala, F. & Gigan, S. Large-scale optical reservoir computing for spatiotemporal chaotic systems prediction. Phys. Rev. X 10, 041037 (2020).
CAS Google Scholar
Bueno, J., Brunner, D., Soriano, M. C. & Fischer, I. Conditions for reservoir computing performance using semiconductor lasers with delayed optical feedback. Opt. Express 25, 2401–2412 (2017).
Article ADS PubMed Google Scholar
Vandoorne, K., Dambre, J., Verstraeten, D., Schrauwen, B. & Bienstman, P. Parallel Reservoir Computing Using Optical Amplifiers. IEEE Trans. Neural Networks 22, 1469–1481 (2011).
Article ADS PubMed Google Scholar
Lupo, A., Picco, E., Zajnulina, M. & Massar, S. Deep photonic reservoir computer based on frequency multiplexing with fully analog connection between layers. Optica 10, 1478–1485 (2023).
Article ADS CAS Google Scholar
Paquot, Y. et al. Optoelectronic reservoir computing. Sci. Rep. 2, 287 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Guo, X. et al. Photonic implementation of the input and reservoir layers for a reservoir computing system based on a single VCSEL with two Mach-Zehnder modulators. Opt. Express. 32, 17452–17463 (2024).
Article ADS CAS PubMed Google Scholar
Vinckier, Q. et al. High-performance photonic reservoir computer based on a coherently driven passive cavity. Optica 2, 438–446 (2015).
Article ADS Google Scholar
Sorokina, M. Multi-channel optical neuromorphic processor for frequency-multiplexed signals. J. Phys. Photonics 3, 14002 (2020).
Article Google Scholar
Sorokina, M., Sergeyev, S. & Turitsyn, S. Fiber echo state network analogue for high-bandwidth dual-quadrature signal processing. Opt. Express 27, 2387–2395 (2019).
Article ADS PubMed Google Scholar
Appeltant, L. et al. Information processing using a single dynamical node as complex system. Nat. Commun. 2, 468 (2011).
Article ADS CAS PubMed Google Scholar
Larger, L. et al. High-speed photonic reservoir computing using a time-delay-based architecture: million words per second classification. Phys. Rev. X 7, 011015 (2017).
Google Scholar
Wan, Z. et al. Nonlinear equalization based on pruned artificial neural networks for 112-Gb/s SSB-PAM4 transmission over 80-km SSMF. Opt. Express 26, 10631–10642 (2018).
Article ADS PubMed Google Scholar
Argyris, A., Bueno, J. & Fischer, I. PAM-4 transmission at 1550 nm using photonic reservoir computing post-processing. IEEE Access 7, 37017–37025 (2019).
Article Google Scholar
Li, S., Ohlendorf, S. & Pachnicke, S. 100 km 56 GBd PAM-4 transmission using photonic reservoir computing. In 45th European Conference on Optical Communication (ECOC, 2019).
Gooskens, E., Sackesyn, S., Dambre, J. & Bienstman, P. Experimental results on nonlinear distortion compensation using photonic reservoir computing with a single set of weights for different wavelengths. Sci. Rep. 13, 21399 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, R.-Q. et al. Scalable wavelength-multiplexing photonic reservoir computing. APL Mach. Learn. 1, 036105 (2023).
Article CAS Google Scholar
Gooskens, E. et al. Wavelength dimension in waveguide-based photonic reservoir computing. Opt. Express 30, 15634–15647 (2022).
Article ADS PubMed Google Scholar
Xu, X. et al. 11 TOPS photonic convolutional accelerator for optical neural networks. Nature 589, 44–51 (2021).
Article ADS CAS PubMed Google Scholar
Jiang, Y., Zhang, W., Yang, F. & He, Z. Photonic convolution neural network based on interleaved time-wavelength modulation. J. Light. Technol. 39, 4592–4600 (2021).
Article ADS Google Scholar
Zhang, X. et al. 224-Gbps single-photodiode PAM-4 transmission with extended transmitter bandwidth based on optical time-and-polarization interleaving. Opt. Express, OE 28, 21155–21164 (2020).
Article ADS CAS Google Scholar
Wang, D., Nie, Y., Hu, G., Tsang, H. K. & Huang, C. Ultrafast silicon photonic reservoir computing engine delivering over 200 TOPS. Nat. Commun. 15, 10841 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Nakajima, M., Tanaka, K. & Hashimoto, T. Scalable reservoir computing on coherent linear photonic processor. Commun. Phys. 4, 20 (2021).
Article Google Scholar
Hünefeld, G. et al. Neuromorphic reservoir for nonlinear optical signal equalization. In Proc. SPIE 12880, Physics and Simulation of Optoelectronic Devices XXXII, 128800H (SPIE, 2024).
Adams, L. E., Kintzer, E. S. & Fujimoto, J. G. All-optical clock recovery using a modelocked figure eight laser with a semiconductor nonlinearity. Electron. Lett. 30, 1696–1697 (1994).
Article ADS Google Scholar
Kues, M. et al. Passively mode-locked laser with an ultra-narrow spectral width. Nat. Photon. 11, 159–162 (2017).
Article ADS CAS Google Scholar
Aadhi, A. et al. Highly reconfigurable hybrid laser based on an integrated nonlinear waveguide. Opt. Express 27, 25251 (2019).
Article ADS CAS PubMed Google Scholar
Xiong, S. et al. Investigation of stable pulse mode-locking regimes in a NALM figure-9 Er-doped fiber laser. Opt. Express 31, 514–527 (2023).
Article ADS CAS PubMed Google Scholar
Maass, W. Liquid State Machines: Motivation, Theory, and Applications. In Computability in Context 275–296 (IMPERIAL COLLEGE PRESS, https://doi.org/10.1142/9781848162778_0008 (2011).
Jaeger, H. The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. GMD Rep. 148, 13 (2001).
Duport, F., Schneider, B., Smerieri, A., Haelterman, M. & Massar, S. All-optical reservoir computing. Opt. Express 20, 22783–22795 (2012).
Article ADS PubMed Google Scholar
Duport, F., Smerieri, A., Akrout, A., Haelterman, M. & Massar, S. Fully analogue photonic reservoir computer. Sci. Rep. 6, 22381 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Vandoorne, K. et al. Toward optical signal processing using photonic reservoir computing. Opt. Express 16, 11182–11192 (2008).
Article ADS PubMed Google Scholar
Tsurugaya, T. et al. Cross-gain modulation-based photonic reservoir computing using low-power-consumption membrane SOA on Si. Opt. Express 30, 22871–22884 (2022).
Article ADS CAS PubMed Google Scholar
Tsurugaya, T. et al. Reservoir computing using on-chip XGM-based nonlinear processing by membrane SOAs on Si-MZI. J. Lightw. Technol. 42, 2859–2867 (2024).
Article ADS CAS Google Scholar
Moss, D. J., Morandotti, R., Gaeta, A. L. & Lipson, M. New CMOS-compatible platforms based on silicon nitride and Hydex for nonlinear optics. Nat. Photon. 7, 597–607 (2013).
Article ADS CAS Google Scholar
Razzari, L. et al. CMOS-compatible integrated optical hyper-parametric oscillator. Nat. Photonics 4, 41–45 (2009).
Article ADS Google Scholar
Kharraz, O. M., Atieh, A., Zamzuri, A. K. & Mahdi, M. A. Enhanced gain saturation model of non-linear semiconductor optical amplifiers. IET Optoelectron 12, 263–268 (2018).
Article Google Scholar
Dmitriev, P. et al. Analysis of time-delay photonic reservoirs for neuromorphic computing. in 2022 Photonics North (PN) 1–1 (IEEE, 2022).
Inubushi, M. & Yoshimura, K. Reservoir computing beyond memory-nonlinearity trade-off. Sci. Rep. 7, 10199 (2017).
Article ADS PubMed PubMed Central Google Scholar
Lever, J., Krzywinski, M. & Altman, N. Model selection and overfitting. Nat. Methods 13, 703–704 (2016).
Article CAS Google Scholar
Argyris, A., Bueno, J. & Fischer, I. Photonic machine learning implementation for signal recovery in optical communications. Sci. Rep. 8, 8487 (2018).
Article ADS PubMed PubMed Central Google Scholar
Agrawal, G. Nonlinear Fiber Optics (Academic Press, 2013).
Bandyopadhyay, S. et al. Single-chip photonic deep neural network with forward-only training. Nat. Photon. 18, 1335–1343 (2024).
Article ADS CAS Google Scholar
Xu, X. et al. Photonic perceptron based on a kerr microcomb for high-speed, scalable, optical neural networks. Laser Photonics Rev. 14, 2000070 (2020).
Article ADS Google Scholar
Gu, Z., Ma, Q., Gao, X., You, J. W. & Cui, T. J. Direct electromagnetic information processing with planar diffractive neural network. Sci. Adv. 10, eado3937 (2024).
Article PubMed PubMed Central Google Scholar
Hülser, T., Köster, F., Jaurigue, L. & Lüdge, K. Role of delay-times in delay-based photonic reservoir computing. Opt. Mater. Express 12, 1214–1231 (2022).
Article ADS Google Scholar
Tang, D., Wu, Z., Sun, Z., Tang, X. & Qiao, Y. Joint intra and inter-channel nonlinearity compensation based on interpretable neural network for long-haul coherent systems. Opt. Express 29, 36242–36256 (2021).
Article ADS PubMed Google Scholar
Song, J. et al. Real-time FPGA prototyping of a 15GBaud SP-16QAM coherent optical receiver with optimal interpolating for clock recovery and equalization. Opt. Express 30, 26774–26786 (2022).
Article ADS CAS PubMed Google Scholar
Inoue, T., Matsumoto, R. & Namiki, S. Learning-based digital back propagation to compensate for fiber nonlinearity considering self-phase and cross-phase modulation for wavelength-division multiplexed systems. Opt. Express 30, 14851–14872 (2022).
Article ADS CAS PubMed Google Scholar
LaCroix, M.-A. et al. 8.4 A 116Gb/s DSP-based wireline transceiver in 7nm CMOS achieving 6pJ/b at 45dB loss in PAM-4/Duo-PAM-4 and 52dB in PAM-2. In 2021 IEEE International Solid-State Circuits Conference (ISSCC) 64, 132–134 (IEEE, 2021).
Nagarajan, R. et al. Recent advances in low-power digital signal processing technologies for data center applications. J. Lightw. Technol. 42, 4222–4232 (2024).
Article ADS Google Scholar
Pillai, B. S. G. et al. End-to-end energy modeling and analysis of long-haul coherent transmission systems. J. Lightw. Technol. 32, 3093–3111 (2014).
Article ADS Google Scholar
Liu, Y. et al. Parallel wavelength-division-multiplexed signal transmission and dispersion compensation enabled by soliton microcombs and microrings. Nat. Commun. 15, 3645 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Staffoli, E., Maddinelli, G. & Pavesi, L. A silicon photonic neural network for chromatic dispersion compensation in 20 Gbps PAM4 Signal at 125 km and itsScalability up to 100 Gbps. J. Lightw. Technol. 43, 557–571 (2025).
Article ADS CAS Google Scholar
Zhang, S. et al. Field and lab experimental demonstration of nonlinear impairment compensation using neural networks. Nat. Commun. 10, 3033 (2019).
Article ADS PubMed PubMed Central Google Scholar
Huang, C. et al. A silicon photonic–electronic neural network for fibre nonlinearity compensation. Nat. Electron. 4, 837–844 (2021).
Article CAS Google Scholar
Mazoukh, C. et al. Genetic algorithm-enhanced microcomb state generation. Commun. Phys. 7, 81 (2024).
Article Google Scholar
Pasquazi, A. et al. Micro-combs: a novel generation of optical sources. Phys. Rep. 729, 1–81 (2018).
Article ADS MathSciNet CAS Google Scholar
Bai, B. et al. Microcomb-based integrated photonic processing unit. Nat. Commun. 14, 66 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Aadhi, A. et al. Mode-locked laser with multiple timescales in a microresonator-based nested cavity. APL Photonics 9, 031302 (2024).
Article ADS CAS Google Scholar
Di Lauro, L. et al. Parametric control of thermal self-pulsation in micro-cavities. Opt. Lett. 42, 3407 (2017).
Article ADS PubMed Google Scholar
Rowley, M. et al. Self-emergence of robust solitons in a microcavity. Nature 608, 303–309 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Ferrera, M. et al. Low-power continuous-wave nonlinear optics in doped silica glass integrated waveguide structures. Nat. Photon. 2, 737–740 (2008).
Article ADS CAS Google Scholar
Bao, H. et al. Laser cavity-soliton microcombs. Nat. Photon. 13, 384–389 (2019).
Article ADS CAS Google Scholar
Monika, M. et al. Quantum state processing through controllable synthetic temporal photonic lattices. Nat. Photon. 19, 95–100 (2025).
Article ADS CAS Google Scholar
Yu, H. et al. Quantum key distribution implemented with d-level time-bin entangled photons. Nat. Commun. 16, 171 (2025).
Article ADS PubMed PubMed Central Google Scholar
Jaeger, H. Long short-term memory in echo state networks: details of a simulation study. Report (2012).
Karabchevsky, A. & Choudhary, A. On-Chip Photonics: Principles, Technology and Applications (Elsevier, 2024).

Download references

Acknowledgements

Funding. This work was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) through the Alliance and Discovery Grants Schemes, by the MESI PSR-SIIRI Initiative in Quebec, by the Canada Research Chair Program, as well as the Australian Research Council Centre of Excellence in Optical Microcombs for Breakthrough Science (COMBS) CE230100006. The work of L.D.L. was also supported by the Mitacs Elevate Postdoctoral Fellowship Program. The work of E.A.V. and A.V.K. was supported by the Ministry of Science and Higher Education of the Russian Federation, project no. FSER-2025-0025. The work of M.C. was supported by the Carl-Zeiss-Stiftung through the Nexus program (project P2021-05-025 SINABSE).

Author information

These authors contributed equally: A. Aadhi, L. Di Lauro.

Authors and Affiliations

Institut National de la Recherche Scientifique - Énergie Matériaux Télécommunications, Varennes, QC, Canada
A. Aadhi, L. Di Lauro, B. Fischer, P. Dmitriev, I. Alamgir, C. Mazoukh, N. Perron, S. Vakili, M. Chemnitz, P. Roztocki & R. Morandotti
Leibniz Institute of Photonic Technology, Jena, Germany
B. Fischer & M. Chemnitz
ITMO University, St Petersburg, Russia
E. A. Viktorov & A. V. Kovalev
Huawei Technologies Canada, Markham, ON, Canada
A. Eshaghi
QXP Technology Inc, Xi’an, China
B. E. Little
City University of Hong Kong, Hong Kong, China
S. T. Chu
Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC, Australia
D. J. Moss

Authors

A. Aadhi
View author publications
Search author on:PubMed Google Scholar
L. Di Lauro
View author publications
Search author on:PubMed Google Scholar
B. Fischer
View author publications
Search author on:PubMed Google Scholar
P. Dmitriev
View author publications
Search author on:PubMed Google Scholar
I. Alamgir
View author publications
Search author on:PubMed Google Scholar
C. Mazoukh
View author publications
Search author on:PubMed Google Scholar
N. Perron
View author publications
Search author on:PubMed Google Scholar
E. A. Viktorov
View author publications
Search author on:PubMed Google Scholar
A. V. Kovalev
View author publications
Search author on:PubMed Google Scholar
A. Eshaghi
View author publications
Search author on:PubMed Google Scholar
S. Vakili
View author publications
Search author on:PubMed Google Scholar
M. Chemnitz
View author publications
Search author on:PubMed Google Scholar
P. Roztocki
View author publications
Search author on:PubMed Google Scholar
B. E. Little
View author publications
Search author on:PubMed Google Scholar
S. T. Chu
View author publications
Search author on:PubMed Google Scholar
D. J. Moss
View author publications
Search author on:PubMed Google Scholar
R. Morandotti
View author publications
Search author on:PubMed Google Scholar

Contributions

A.A. realized the experiments. L.D.L. analyzed the experimental results, developed the theory, and wrote the manuscript. A.A., B.F., and I.A. contributed to the data analysis. P.D., N.P., and C.M. contributed to numerical simulations. E.A.V. and A.V.K. supported the analysis of theoretical and experimental data. E.A., E.A.V., A.V.K., M.C., S.V., P.R., D.J.M., and R.M. participated in scientific discussions. S.T.C. designed the integrated spiral waveguide. B.E.L. fabricated the chip. R.M. supervised the project. All authors reviewed and edited the manuscript.

Corresponding authors

Correspondence to L. Di Lauro or R. Morandotti.

Ethics declarations

Competing interests

The authors L.D.L., A.A., B.F., P.D., P.R., A.E., and R.M. are inventors of a pending patent application: “System and Method for Optical Information Processing.” US18/810,064, filed on: 20/8/24; PCT/CA2022/050451, filed on: 25/3/22. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Transparent Peer Review file (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Aadhi, A., Di Lauro, L., Fischer, B. et al. Scalable photonic reservoir computing for parallel machine learning tasks. Nat Commun 17, 1225 (2026). https://doi.org/10.1038/s41467-025-67983-z

Download citation

Received: 10 March 2025
Accepted: 14 December 2025
Published: 31 December 2025
Version of record: 02 February 2026
DOI: https://doi.org/10.1038/s41467-025-67983-z