Partial coherence enhances parallelized photonic computing

Dong, Bowei; Brückerhoff-Plückelmann, Frank; Meyer, Lennart; Dijkstra, Jelle; Bente, Ivonne; Wendland, Daniel; Varri, Akhil; Aggarwal, Samarth; Farmakidis, Nikolaos; Wang, Mengyun; Yang, Guoce; Lee, June Sang; He, Yuhan; Gooskens, Emmanuel; Kwong, Dim-Lee; Bienstman, Peter; Pernice, Wolfram H. P.; Bhaskaran, Harish

doi:10.1038/s41586-024-07590-y

Download PDF

Article
Open access
Published: 31 July 2024

Partial coherence enhances parallelized photonic computing

Nature volume 632, pages 55–62 (2024)Cite this article

29k Accesses
64 Citations
108 Altmetric
Metrics details

Subjects

Abstract

Advancements in optical coherence control^1,2,3,4,5 have unlocked many cutting-edge applications, including long-haul communication, light detection and ranging (LiDAR) and optical coherence tomography^6,7,8. Prevailing wisdom suggests that using more coherent light sources leads to enhanced system performance and device functionalities^9,10,11. Our study introduces a photonic convolutional processing system that takes advantage of partially coherent light to boost computing parallelism without substantially sacrificing accuracy, potentially enabling larger-size photonic tensor cores. The reduction of the degree of coherence optimizes bandwidth use in the photonic convolutional processing system. This breakthrough challenges the traditional belief that coherence is essential or even advantageous in integrated photonic accelerators, thereby enabling the use of light sources with less rigorous feedback control and thermal-management requirements for high-throughput photonic computing. Here we demonstrate such a system in two photonic platforms for computing applications: a photonic tensor core using phase-change-material photonic memories that delivers parallel convolution operations to classify the gaits of ten patients with Parkinson’s disease with 92.2% accuracy (92.7% theoretically) and a silicon photonic tensor core with embedded electro-absorption modulators (EAMs) to facilitate 0.108 tera operations per second (TOPS) convolutional processing for classifying the Modified National Institute of Standards and Technology (MNIST) handwritten digits dataset with 92.4% accuracy (95.0% theoretically).

High-coherence parallelization in integrated photonics

Article Open access 10 September 2024

Parallel convolutional processing using an integrated photonic tensor core

Article 06 January 2021

Measuring, processing, and generating partially coherent light with self-configuring optics

Article Open access 20 September 2024

Main

Over the past century, notable progress in optical coherence control has enabled the generation of light with linewidth ranging from tens of terahertz (THz) to less than 1 kilohertz (kHz). This enhanced control has revolutionized light sources, from fluorescence¹, light-emitting diodes (LEDs)² and lasers^3,12 to distributed-feedback lasers⁴ and stabilized continuous-wave lasers^5,13, laying the foundation for numerous transformative applications, such as long-haul optical-fibre communications⁶, LiDAR⁷, optical coherence tomography^8,14 and so on. Despite the challenges in stabilizing and maintaining high optical coherence, researchers have sought to make use of the superior properties of coherent light by using partially coherent light in combination with post-processing reconstruction methods as a compromised solution^9,15,16. Taking a more direct approach, many studies have aimed to generate more coherent light from incoherent light sources^17,18, with a recent achievement in obtaining spatio-temporal coherence with an incoherent white-light source for coloured vortex-beam generation using miniaturized spiral phase plates integrated with structural colour filters¹⁰. As a result, increasing optical coherence has become a guiding principle for improving the performance and functionalities of cutting-edge optical devices and systems.

Deep learning has made a great impact on various fields^19,20,21,22, with two recent highlights being GPT-4 and Midjourney. The success of deep learning relies on training huge artificial neural networks with billions of trainable parameters, necessitating the doubling of hardware-data processing capability every 3.5 months²³. To keep up with this exponentially growing need for processing capability, photonic convolutional processing is believed to be a key to hardware-based artificial intelligence (AI) accelerators^24,25,26. Photonic processors can access a wide bandwidth of tens of THz by exploiting wavelength-division multiplexing and eliminate capacitive delay and charge/discharge energy dissipation, as photons require no potential difference to transit²⁷. Various system architectures for photonic convolutional processing have been proposed, all using coherent light sources in accordance with the guiding principle. Coherent nanophotonic circuits distribute light from a single coherent light source to the inputs of a Mach–Zehnder interferometer (MZI) array^11,28,29,30. Operating these circuits requires the precise control of numerous phase shifters to ensure the desired coherent interference in the circuit. A broadcast-and-weight protocol based on cascaded microring resonator (MRR) arrays has been demonstrated^31,32,33,34. The optical input is created by multiplexing coherent light across several wavelengths, with each wavelength being weighted by a corresponding tunable MRR of varying radii and combined in a common bus waveguide. Convolutional processing based on the broadcast-and-weight protocol requires precise control over a substantial number of MRRs, and one convolution operation on an N-dimensional vector requires N distinct coherent lights at different wavelengths. On-chip diffractive optical neural networks have been showcased, performing matrix-vector multiplication (MVM) operations within an ultra-compact footprint through coherent interference of accurately controlled diffractive light^35,36. To achieve in-memory photonic convolutional processing, which eliminates the need for data movement between the memory and photonic processors, a photonic tensor core incorporating phase-change-material photonic memories was proposed and demonstrated^37,38. A silicon nitride MRR was pumped by a coherent laser to generate a frequency comb. In a photonic tensor core consisting of N inputs and M outputs, N different wavelength components must enter their corresponding inputs to prevent measurable interference effects, which could result in unwanted intensity fluctuations. Data carried by each input coherent light at different wavelengths are weighted by the phase-change-material photonic memories and combined in a common bus.

Here we demonstrate that decreasing optical coherence can enhance photonic convolutional processing. We present a photonic convolutional processing system that takes advantage of decreased temporal coherence, hereafter referred to as a partially coherent system, to boost processing parallelism without substantially sacrificing accuracy and potentially enable large-scale photonic tensor cores. This approach eliminates the need for precise control of numerous phase shifters or MRRs and eases the requirements for stringent feedback control and thermal management by using partially coherent light sources. We showcase the broad applicability of partial coherence processing in two photonic platforms for computing applications: first, we conduct parallel convolutional processing with a 3 × 3 photonic tensor core using phase-change-material photonic memories for classifying the gaits of ten patients with Parkinson’s disease and achieve an accuracy of 92.2%; and second, we implement a high-speed 0.108 TOPS convolution processor using a 9 × 3 silicon photonic tensor core with embedded EAMs for vector encoding and weight setting, combined with on-chip photodetectors to classify the MNIST handwritten digits dataset with an accuracy of 92.4%.

Partial coherence as key to enhanced parallelism

State-of-the-art photonic tensor cores use coherent light sources, such as distributed-feedback lasers and frequency combs, for computation. A generalized unit cell to perform multiply-and-accumulate operations is shown in Fig. 1a. Light is equally split into two arms, with multiplication performed in each arm and the multiplication results summed in a common bus waveguide. The fluctuation in transmission intensity resulting from fluctuation of phase difference (Δφ) is determined by the coherence property of input light. Figure 1a illustrates the dependence of intensity fluctuation on phase difference. For a coherent light source at a fixed single frequency ${E={\rm{e}}}^{{\rm{i}}{\omega }_{0}t}$, the output intensity |E + Ee^iΔφ|² changes sinusoidally with phase difference. For an idealized incoherent light source that spans the entire frequency range, the output is unaffected by phase fluctuation. A partially coherent light source provides immunity to phase fluctuations but only makes use of a limited optical bandwidth, which makes it compatible with wavelength-division multiplexing. Partially coherent light progressively loses dependency on phase fluctuation as the phase difference increases. For small phase differences, the intensity fluctuates, resembling coherent light; at larger phase differences, the intensity remains stable, as in the case of incoherent light.

**Fig. 1: Concept of partial-coherence-enhanced parallelized photonic computing.**

A system that makes use of partially coherent light for parallelized photonic computing is proposed in Fig. 1b. The light source does not need to be a coherent light source that demands precise feedback control and thermal management. Instead, we can use a superluminescent diode (SLED)^39,40 or filtered light from a broadband light source, enabling simpler integration and less stringent circuit management. The partially coherent light is then evenly distributed to N input channels, with each channel modulated to generate the input vector (x₁ ⋯ x_N)^T. MVM is performed by the photonic tensor core with weights encoded in the photonic crossbar array. The weighting elements can be any photonic device that enables amplitude modulation, such as the phase-change-material photonic memories or EAMs used here. This partially coherent system offers much higher parallelism when compared to a coherent system. As shown in Fig. 1c, a Gaussian-shaped optical carrier can be sent to all input channels and summed in a bus waveguide, as intensity fluctuation caused by phase fluctuations is eliminated. By contrast, in a coherent system, different input channels should receive optical carriers at distinct wavelengths to avoid intensity fluctuation. Consequently, one MVM operation for input vectors of dimension N requires only one optical band when using partially coherent light but consumes N optical bands if coherent light is used. The enhancement in parallelism is thus N-fold when using partially coherent light as compared to coherent light. This also implies better scalability of the photonic tensor core. The scalability of a partially coherent system will not be limited by the spectral window of photonic components, as the input optical bandwidth does not scale with input vector dimension.

Coherence properties of light sources

Coherent and partially coherent light is generated from a coherent laser and by filtering the amplified spontaneous emission (ASE) of an erbium-doped fibre amplifier (EDFA), respectively. The wavelength spectra of the investigated light sources centred around 1,550 nm are shown in Fig. 2a, including a Gaussian-shaped coherent source with a linewidth narrower than 70 pm, a Gaussian-shaped partially coherent source with 0.8-nm bandwidth filtered by a demultiplexer (DEMUX) on ITU grid channel C34 and four non-Gaussian-shaped partially coherent sources with 2.0, 4.0, 8.0 and 16.0 nm bandwidths filtered by an optical tunable band-pass filter. All light sources are operated in continuous-wave mode. A thermo-optically controlled MZI array with increasing path differences is used to determine the coherence lengths of all lights (Supplementary Fig. 1). The concept proposed in Fig. 1a is verified in Fig. 2b, which illustrates diminishing phase sensitivity with increasing length differences. The degree of coherence, defined by the interference strength $\frac{{I}_{\max }-{I}_{\min }}{{I}_{\max }+{I}_{\min }}$, is extracted from Fig. 2b and presented in Fig. 2c. The degree of coherence of coherent light maintains a level around unity at a large length difference of 4,000 µm. By contrast, that of partially coherent light decreases notably with increasing length differences, with a generally lower degree of coherence accompanied by a wider optical bandwidth. Quantitatively, the coherence length, defined as the length difference for which the degree of coherence decreases to 0.5, is inversely proportional to the optical bandwidth (Fig. 2d), in agreement with theory⁴¹. A comparison between Gaussian-shaped and non-Gaussian-shaped partially coherent light reveals a negligible difference in the degree of coherence and coherence length (Supplementary Fig. 2).

We investigate the effect of optical bandwidth and noise of filtered ASE on optical modulation. For coherent light, the noise remains at a low level above the system noise floor and exhibits a weak dependence on the intensity received at the photodetector (Supplementary Fig. 3). Conversely, for partially coherent light, the noise increases linearly with the intensity received at the photodetector and is inversely related to optical bandwidth. This observation can be explained by the stochastic properties of an EDFA⁴², which introduce inherently elevated noise levels compared with coherent light. However, this noise can be reduced by increasing the ratio of optical bandwidth to electrical bandwidth. The linearly increasing noise leads to a saturated signal-to-noise ratio (SNR) in partially coherent light (Fig. 2e), implying that coherent light holds an advantage in high-intensity scenarios in which partially coherent light is hampered by a compromised SNR. Nonetheless, in integrated photonic circuits, the signal (intensity received at the photodetector) typically spans from 0.1 µW to 0.1 mW. In this range of interest to many applications, the SNR of partially coherent light from EDFA ASE is not substantially lower compared with that of coherent light. The viable SNR of partially coherent light is verified by measuring 2-GHz eye diagrams at 0.05-mW signal (Fig. 2f). Although the eye diagram of 0.8-nm-bandwidth C34 partially coherent light is ambiguous compared with coherent light, the clarity of the eye diagram markedly improves with an enlarged optical bandwidth, becoming clear at 4.0-nm bandwidth and beyond. At a lower modulation speed of 100 MHz, all eye diagrams are clear (Supplementary Fig. 4).

Eliminating intensity fluctuation

The elimination of intensity fluctuation caused by phase fluctuation within a single MZI has been verified in Fig. 2b. When transferring this concept from a single-device level to a system level, we must account for potential complexities and further variables that may influence the stability and reliability of the entire photonic system, requiring further verification at the system level. A photonic tensor core using phase-change photonic memories featuring the architecture proposed in Fig. 1b, hereafter referred to as photonic memory tensor core, is fabricated to perform enhanced parallelized photonic computing. As a proof of concept, the photonic memory crossbar array represents a 3 × 3 weight matrix (Fig. 3a) and the working principle is described in Supplementary Text 1. The weights are encoded in non-volatile phase-change-material photonic memories. Using the pump–probe weight-setting scheme⁴³, the non-volatile amplitude modulation enabled by controlling the crystalline state of phase-change-material photonic memories enables 4-bit operations. The maximum transmission change T_max − T_min is greater than 20% (Supplementary Fig. 6). The transmission levels T are mapped to weights w in [−1, 1] by defining $w=\frac{T-\frac{{T}_{\max }+{T}_{\min }}{2}}{\frac{{T}_{\max }-{T}_{\min }}{2}}$. In this work, the mapping is implemented by post-processing on a computer (Methods). This mapping approach can be implemented in hardware using a balanced photodetection scheme (Supplementary Text 2). On the other hand, besides changing the hardware architecture, the neural networks themselves can be modified to adapt to the non-negative nature of photonic computing systems⁴⁴.

**Fig. 3: Elimination of intensity fluctuation in a system using partially coherent light.**

Figure 3b presents the schematic of the setup used to investigate the intensity fluctuation when using coherent or partially coherent light (0.8-nm-bandwidth C34). A path difference of 1 m between adjacent inputs is introduced by incorporating 1-m-long fibre delays. This 1-m path difference is substantially longer than the measured coherence length of 550 µm in 0.8-nm-bandwidth C34 partially coherent light, which will effectively eliminate intensity fluctuations. When coherent light is split and directed to three input channels, strong intensity fluctuations are observed (Fig. 3c), resulting from phase fluctuations along the optical paths. On the contrary, when partially coherent light is used, intensity fluctuations are eliminated (Fig. 3d). This immunity of transmission intensity to phase fluctuation is the desired property offered by partially coherent light that will enable higher parallelism. Specifically, using partially coherent light, light in one optical band can be distributed to all input channels to perform MVM operations, allowing for full bandwidth use.

Parallelized convolution of gait signals from patients with Parkinson’s disease

As a proof-of-concept example to showcase the capability of partial-coherence-enhanced parallelized photonic computing, we construct a system using the photonic memory tensor core to identify patients with Parkinson’s disease by analysing their gaits. The enhanced parallelism offers a way to simultaneously monitor a large number of patients. The partially coherent light has a bandwidth of 0.8 nm, modulated at 1 kHz. As shown in Fig. 4, gait signals from patients with Parkinson’s disease originally in the form of time series are recorded. The gait signal from patient j at time i is represented by x_ij. x_ij is carried by wavelength λ_j and sent into optical channel i. Taking the gait signal from patient 1 for example, the input vector is (x₁₁, x₂₁, x₃₁)^T carried by λ₁. The 3 × 3 photonic memory crossbar array defines three kernels of dimension 3 × 1, represented by the weight matrix $W={\left[\begin{array}{ccc}{w}_{11} & {w}_{12} & {w}_{13}\\ {w}_{21} & {w}_{22} & {w}_{23}\\ {w}_{31} & {w}_{32} & {w}_{33}\end{array}\right]}^{{\rm{T}}}$. Specifically, the rows of W are set to ${\left[\begin{array}{c}1\\ 1\\ -1\end{array}\right]}^{{\rm{T}}}$, ${\left[\begin{array}{c}1\\ -1\\ 1\end{array}\right]}^{{\rm{T}}}$ and ${\left[\begin{array}{c}-1\\ 1\\ 1\end{array}\right]}^{{\rm{T}}}$, performing right-edge extraction, peak suppression and left-edge extraction, respectively. Using an extra wavelength λ₂, the system can perform convolutional processing for patient 2 in parallel. For comparison, the schematic of a coherent system to implement the same convolutional processing is described in Supplementary Text 3. Notably, in the partially coherent system, the same wavelength can enter different input waveguide channels because the intensity fluctuation is eliminated. The ability to enable the same wavelength to enter different input channels provides superior advantages compared with the conventional computing system that uses coherent light. The optical bandwidth is fully used because no further wavelength channels are required to avoid intensity fluctuation. In comparison with our partially coherent system, which uses two wavelengths, a coherent system will require six wavelengths. This advantage scales up with the desired parallelism and the size of the photonic tensor core. For a parallelism of P (that is processing gait signals from P patients in parallel) and a photonic tensor core with dimension N by M, the reduction in the required number of wavelengths is (N − 1) × P.

The convolution results obtained using the partially coherent system are shown in Fig. 4b–e and compared with the results obtained using a coherent system. For a typical gait signal, the desired features are successfully extracted in both computing systems, as shown in Fig. 4b. Theoretical convolution results obtained by a central processing unit (CPU) are presented in Supplementary Fig. 9. The convolution results of all gait signals from all ten patients are presented in Supplementary Figs. 10 and 11. Figure 4c shows the accuracy of the convolution operation. The two computing systems show similar accuracy. The respective errors follow Gaussian distributions and show close mean values and standard deviations. Using these convolution results obtained by the photonic systems, we construct a convolutional neural network (CNN) to identify patients with Parkinson’s disease (Fig. 4d). The CNN is first implemented by a CPU to test the necessity of the convolution layer. Using a convolution layer, the classification accuracy is improved from 84.4% to 92.7% (Fig. 4e). When the convolution layer is implemented by photonic systems, the classification accuracy reaches more than 92.2% in both computing systems, showing a performance close to the CPU implementation. The confusion maps of CNN classification results are presented in Supplementary Fig. 12. The evolution of CNN loss and accuracy with respect to increasing epochs are presented in Supplementary Fig. 13. The partially coherent system achieves similar performance to the coherent system, but with much fewer wavelength channels and less stringent light-source requirements.

High-speed convolution of MNIST datasets

The applicability of partially coherent systems extends beyond the photonic memory tensor core described above, which is operated at a modest modulation speed of 1 kHz for specialized applications such as gait-signal classification. The versatility of the general approach caters to any photonic weighting device using amplitude modulation and is proficient at performing high-speed convolutional processing for diverse AI tasks. This is demonstrated through a high-speed 9 × 3 silicon photonic tensor core using EAMs, equipped with an integrated input EAM array and output photodetector array (Fig. 5a,b). The chip is fabricated using IMEC’s iSiPP50G silicon photonics platform, which provides the active components at a higher integration level. Hereafter we refer to this system as a photonic EAM tensor core. Using a field-programmable gate array (FPGA)-controlled electro-optic interface to the photonic EAM tensor core, we perform convolutional processing on the MNIST handwritten digits dataset at a data-loading rate of 2 gigasamples per second (GSa s⁻¹) in each channel, using 8.0-nm-bandwidth partially coherent light. This 2 GSa s⁻¹ data-loading rate brings the total system processing speed to 0.108 TOPS considering the size of the photonic tensor core, and an estimated energy efficiency of 1 TOPS W⁻¹ (Supplementary Text 4). Supplementary Fig. 14 illustrates the configuration and data flow of the entire system, which operates analogously to the 3 × 3 photonic memory tensor core described above. Using the digit ‘0’ from the MNIST dataset as an example (Fig. 5c), the 2 GSa s⁻¹ partially coherent system effectively extracts edges using Sobel G_x and Sobel G_y filters, albeit with increased background noise. As the noise originates from the stochastic properties of the ASE light source, it can be mitigated by averaging further convolutions per sample. Quantitatively, the normalized standard deviation of the error is 0.094 without averaging (Fig. 5d), which is reduced to 0.049 by four-point averaging (Fig. 5e). When the convolution results are used as input to a CNN for classification (Fig. 5f), accuracies of 92.4% without averaging and 93.9% with four-point average are achieved, closely aligning with the theoretical accuracy of 95.0% attained from CPU-implemented convolutions. The corresponding confusion maps and evolution of loss and accuracy with respect to increasing epochs are presented in Supplementary Figs. 15 and 16. Furthermore, the convolutional processing on the MNIST fashion products dataset, executed using the same system, reveals similar performance trends, detailed in Supplementary Text 5. We wish to note that the 2 GSa s⁻¹ data-loading rate is limited by the digital-to-analogue converters (DACs) of the FPGA and not the photonic chip. The partially coherent light can provide a data-loading rate of at least 30 GSa s⁻¹ (Supplementary Fig. 21), which brings the total system processing speed to 1.62 TOPS per optical carrier. Furthermore, using an ASE optical bandwidth of 40 nm for ten optical carriers (4 nm optical bandwidth per optical carrier), the partially coherent system is expected to reach 16.2 TOPS system processing speed.

**Fig. 5: High-speed convolution of the MNIST handwritten digits dataset.**

Discussion and conclusion

We have demonstrated that decreasing optical coherence can lead to enhanced performance in photonic computing systems, challenging the conventional wisdom that a higher degree of coherence is always advantageous. By decreasing the degree of coherence, we effectively exploit the optical bandwidth to boost parallelism without substantially degrading convolution accuracy. Specifically, reducing the coherence of the input light sources enables the same wavelength to be distributed across all input channels of a photonic tensor core. This also implies better scalability of photonic tensor cores, as the input optical bandwidth does not scale with the input vector dimension and thus is not limited by the spectral window of photonic components. In a system with an N × N photonic tensor core and P × N available wavelengths, partially coherent light facilitates P × N parallel convolutional processing operations, whereas coherent light facilitates P parallel operations. The limitations of partially coherent systems are related to the intrinsically reduced SNR, which is attributed to the stochastic properties of ASE. Considering these advantages and limitations, a quantitative comparison between coherent and partially coherent systems is shown in Supplementary Text 6. This comparison suggests that, although coherent systems exhibit advantages at the small scale by delivering high SNRs and modest parallelism, partially coherent systems surpass them at larger scales by offering enhanced parallelism and comparable SNRs. Furthermore, the SNR of partially coherent light may be improved by the substitution of EDFA ASE with broadband SLEDs⁴⁵ and further optimized by coupling with saturated semiconductor optical amplifiers⁴⁶. SLEDs have high spatial coherence with the benefit of easier coupling to the waveguide, moderate optical bandwidth (a few nanometres to tens of nanometres) for partial coherence control and favourable optical power^39,40. We also note that the long delay lines required in large partially coherent systems are challenging to implement. The solutions to address this long delay line issue are discussed in detail in Supplementary Text 7. Assuming that we require the loss of the longest delay line to be below 3 dB and we use only one ASE source with an optical bandwidth of 4 nm, the system can support approximately a maximum of 59 input channels on a silicon nitride-on-silicon platform with a propagation loss of 0.4 dB cm⁻¹ (refs. ^47,48). To realize larger partially coherent systems, we can use an array of independent ASE sources working at the same wavelength, with each ASE source driving a few tens of input waveguide channels. These independent ASE sources are uncorrelated, eliminating the need for longer delay lines to overcome the coherence length of a single source. Using numerous ASE sources is still advantageous compared with using numerous lasers because each laser can only drive one input channel and independent lasers should still use different wavelengths to avoid undesired interference^49,50.

As a proof of concept, we used partially coherent light in a system featuring a 3 × 3 photonic memory tensor core to demonstrate the parallel convolution of two gait signals from patients with Parkinson’s disease. These convolution results were subsequently used for CNN classification. Comparable convolutional processing and CNN classification accuracies were achieved as compared with using coherent light, while conserving four optical bands. To illustrate the broad applicability of partially coherent systems for high-speed convolutional processing in more complex AI tasks, we demonstrated 0.108 TOPS convolutional processing on MNIST handwritten digits dataset using a 9 × 3 photonic EAM tensor core with integrated input modulators and output photodetectors. The CNN classification accuracy reaches 92.4%, slightly below the theoretical accuracy of 95.0%, yet improvable to 93.9% through four-point average. A comparative analysis with other prevailing state-of-the-art photonic computing systems is provided in Supplementary Text 8. Our partially coherent system uniquely features phase insensitivity throughout the whole system. This technological shift away from coherent light considerably alleviates system requirements by circumventing stringent light-source specifications and eliminating the need for numerous precise phase controls, MRR controls and thermal management. Our findings suggest that EDFA ASE, SLEDs or other simple light sources can be used to bolster photonic computing performance rather than diminish it. This insight has the potential to revolutionize photonic computing systems as they evolve to accommodate increasingly complex computational tasks and continue to scale up to large N and P values.

Methods

Device fabrication

MZI array

The fabrication started from a silicon-on-insulator wafer (SOITEC) with a 220-nm silicon (Si) device layer and a 2-µm buried oxide layer. A 200-nm-thick positive e-beam resist (CSAR 62) was spin-coated on a diced 1 cm × 1 cm silicon-on-insulator chip, followed by 3 min pre-bake at 150 °C. The e-beam resist was patterned by e-beam lithography (EBL; JEOL JBX-5500 50 kV) and developed in AR 600-546 for 30 s, MIBK for 15 s and IPA for 15 s in sequence. The waveguide patterns were transferred to the Si device layer (etch depth = 110 nm) by reactive ion etching (Oxford Instruments PlasmaPro) with SF₆ and CHF₃ gases, followed by O₂ plasma cleaning of CSAR. A 1-µm-thick silicon dioxide (SiO₂) was deposited by plasma-enhanced chemical vapour deposition (Oxford Instruments PlasmaPro) as the upper cladding layer to isolate waveguides from thermo-optic phase shifters. Next, a 2-µm-thick double-layer PMMA (PMMA 495 A8 and PMMA 950 A4) was spin-coated on the chip, followed by EBL patterning and development in MIBK:IPA = 1:3 for 1 min to define the heater patterns. A 200-nm-thick NiCr layer was sputtered using a magnetron sputtering system (physical vapour deposition, AJA International), followed by PMMA lift-off to form NiCr heaters. Gold pads of 100 nm thickness were fabricated using a similar process as NiCr heater fabrication, but with e-beam evaporation (Plassys MEB550S). A 3–5-nm Cr layer was deposited before gold deposition to serve as an adhesion layer. The optical image of the fabricated MZI array is shown in Supplementary Fig. 1.

Photonic memory crossbar array

The Si photonic circuit was fabricated using the foundry multi-project wafer service provided by CORNERSTONE. The detailed specifications of CORNERSTONE standard waveguide components can be found at https://cornerstone.sotonfab.co.uk/. The fabricated Si photonic circuit has a 1-µm-thick SiO₂ upper cladding. SiO₂ windows were patterned by EBL and opened by hydrogen fluoride for the following deposition of the Ge₂Sb₂Te₅ (GST)/indium tin oxide (ITO) stack. Next, GST/ITO stack windows were opened by the above-mentioned PMMA process. A 10-nm-thick/10-nm-thick GST/ITO stack was deposited on the waveguide using a magnetron sputtering system (physical vapour deposition, AJA International). The GST and ITO targets were respectively sputtered at 30 W RF power with 3 sccm Ar flow and 40 W RF power with 3 sccm Ar flow at a base pressure of 10⁻⁷ torr. The stack was then lifted off in acetone for 180 min at 50 °C. Next, the thermo-optic phase shifters were fabricated using the method described for the MZI array. Finally, the chip was annealed on a hotplate for 5 min at 250 °C to fully crystallize the GST. The fabricated photonic memory crossbar array is shown in Fig. 3a.

Photonic EAM tensor core

The photonic EAM tensor core was fabricated using the foundry multi-project wafer service provided by IMEC: iSiPP50G, with details at https://www.imeciclink.com/en/asic-fabrication/si. This platform provides the monolithic integration of passive waveguide circuits, integrated EAMs and integrated photodetectors used in the photonic EAM tensor core.

Measurement setup

Coherence property measurement

The coherent light was generated by a tunable coherent laser (Santec, TSL-550) operating at 1,550 nm. The 0.8-nm-bandwidth C34 partially coherent light was generated by filtering the ASE from an EDFA (Pritel FA-33) with a passive DEMUX module (Gezhi, DWDM-100G-DEMUX) operating at channel C34 of the ITU grid. The 2.0, 4.0, 8.0 and 16.0-nm-bandwidth partially coherent light sources were generated by filtering the same ASE with an optical tunable band-pass filter (Santec, OTF-350) operating at a centre wavelength of 1,550 nm. The spectra were measured by an optical spectrum analyser (Anritsu, MS9710C). For eye diagrams, light was modulated by a pulse generator (Agilent, 8133A) through an electro-optic modulator (Lucent 2623N) and received by a photodetector (Newport New Focus 1611) connected to an oscilloscope (Tektronix, TDS7404B).

System setup for parallel convolutional processing

The experimental setup for parallel convolutional processing on two gait signals is shown in Fig. 4a. The photonic memory crossbar array has three input channels and three output channels, representing a d_3×3 matrix consisting of three d_1×3 kernels. The input light was switchable between an EDFA (Pritel FA-33) and a tunable pump laser (Santec, TSL-550) using an optical switch (Gezhi GZ-12C-1×2-SM). The phase-change-material photonic memory in each cell of the photonic memory crossbar array was first set to the desired weight to correctly define kernels. The tunable pump laser was used in phase-change-material weight setting. The amplified pump light passed through a DEMUX module (Gezhi, DWDM-100G-DEMUX) so that different wavelengths were routed to different input channels (λ₁ = 1,550.12 nm to Ch 1, λ₂ = 1,550.92 nm to Ch 2 and λ₃ = 1,551.72 nm to Ch 3). After setting all phase-change-material weights, parallel convolution was performed using the ASE from the EDFA. The DEMUX module was used to separate two wavelengths with a spacing of 0.8 nm to two different channels (λ₁ = 1,550.12 nm and λ₂ = 1,550.92 nm). Each wavelength was split into three channels by an optical splitter (FS PLC splitter). The three channels serve as the input light to the three respective input waveguide channels of the photonic memory tensor core. Adjacent channels have a 1-m path difference, using a further 1-m-long fibre to eliminate the coherence among all three input light sources. The gait-signal data were loaded into each channel using a variable optical attenuator (VOA; Thorlabs V1550A). The VOAs were driven by a digital signal processor (DSP; NI USB-6259). The polarization of output light from the VOA was controlled by a polarization controller (Thorlabs FPC032). Different wavelengths carrying the gait signal at the same time index from different patients were then grouped by a MUX array (Gezhi, DWDM-100G-MUX) to form three inputs to the respective input channels of the photonic memory tensor core. Convolutions were performed naturally as light propagated through the photonic memory crossbar array. Each output channel of the photonic memory tensor core contained both wavelengths λ₁ and λ₂. The two wavelengths were demultiplexed to obtain the outputs and detected by a photodetector array (Newport New Focus 2011) and finally read out from the DSP.

System setup for high-speed convolutional processing

The experimental setup for high-speed convolutional processing on the MNIST datasets is shown in Supplementary Fig. 13. The whole system operating at 2 GSa s⁻¹ was controlled by a FPGA evaluation board (Xilinx, Zynq UltraScale+ RFSoC ZCU216) with a processing system unit, a programmable logic unit, 16 DACs and 16 analogue-to-digital controllers. The optical input was the 8.0-nm-bandwidth partially coherent light equally split into nine input grating couplers. The MNIST data were read by the processing system unit, stored in its DDR4 memory and accessed by the programmable logic unit to output at nine analogue-to-digital controllers that modulated optical signals through the input EAM array. The weights on the photonic EAM crossbar array were set by a low-speed DSP. The three convolutional processing outputs were received by the integrated photodetector array connected to three transimpedance amplifiers and analogue-to-digital controllers, routed back to the processing system unit and stored in DDR4 memory.

Mapping non-negative transmission to negative convolution results

The input gait signals and image data presented in this work are non-negative, that is, x ∈ [0, 1]. The kernels involve negative values, that is, w ∈ [−1, 1]. The measurable outputs from the photonic system are non-negative as a result of them being physical quantities. We need to map these non-negative outputs to convolution results in the range [−1, 1]. This is done by the following steps:

(a)
We normalize every gait signal or image data to [0, 1] using software and load these normalized data to the photonic tensor core using modulators.
(b)
We represent the input data x using the output power of the modulator by setting P = x(P_max − P_min) + P_min, in which P_max and P_min are the maximum and minimum outputs from the modulator, respectively.
(c)
We represent the weight w using the transmission level of the phase-change material or the EAM by setting $T=w\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right)+\frac{{T}_{\max }+{T}_{\min }}{2}$, in which T_max and T_min are the maximum and minimum transmission levels of the weight-setting device, respectively.
(d)
We set the input vector x to the target input data and set the kernel w to the target weights. The measured output is:
$${\sum }_{i}{P}_{i}\times {T}_{i}={\sum }_{i}\left[({P}_{\max }-{P}_{\min })\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right){x}_{i}{w}_{i}+({P}_{\max }-{P}_{\min })\frac{{T}_{\max }+{T}_{\min }}{2}{x}_{i}+{P}_{\min }\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right){w}_{i}+{P}_{\min }\frac{{T}_{\max }+{T}_{\min }}{2}\right]$$
(1)

Step (d) should be performed for every input vector x.
(e)
We set all x = 0 and all w = 0. Thus all P = P_min and all $T=\frac{{T}_{\max }+{T}_{\min }}{2}$. The measured output is:
$${\sum }_{i}{P}_{\min }\frac{{T}_{\max }+{T}_{\min }}{2}$$
(2)

Step (e) only needs to be performed once for the whole system.
(f)
We set all x = 0 and set w to the target weights. Thus all P = P_min and ${T}_{i}={w}_{i}\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right)+\frac{{T}_{\max }+{T}_{\min }}{2}$. The measured output is:
$${\sum }_{i}\left[{P}_{\min }\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right){w}_{i}+{P}_{\min }\frac{{T}_{\max }+{T}_{\min }}{2}\right]$$
(3)

Step (f) needs to be performed once for each kernel.
(g)
We set x to the target input data and set all w = 0. Thus P_i = x_i(P_max − P_min) + P_min and all $T=\frac{{T}_{\max }+{T}_{\min }}{2}$. The measured output is:
$${\sum }_{i}\left[\left({P}_{\max }-{P}_{\min }\right)\frac{{T}_{\max }+{T}_{\min }}{2}{x}_{i}+{P}_{\min }\frac{{T}_{\max }+{T}_{\min }}{2}\right]$$
(4)

Step (g) should be performed for every input vector x.
(h)
We perform post-processing on a computer using the measured output from steps (d)–(g) as:
$${\rm{Result}}=\left(1\right)-\left(3\right)-\left(4\right)+\left(2\right)=\left({P}_{\max }-{P}_{\min }\right)\left(\frac{{T}_{\max }-{T}_{\min }}{2}\right){\sum }_{i}{x}_{i}{w}_{i}$$
(5)
(i)
We normalize the results to [−1, 1] using software because all results share the same factor of ${(P}_{\max }-{P}_{\min })(\frac{{T}_{\max }-{T}_{\min }}{2})$ and x ∈ [0, 1] and w ∈ [−1, 1].

We can see that the hardware computation is doubled using this mapping approach, yet this mapping approach can be implemented without doubling by hardware implementation involving a balanced photodetection scheme (Supplementary Text 2).

Generation, convolution and output of gait signals

The properties of the original gait-signal data collected by force sensors (Ultraflex Computer Dyno Graphy, Infotronic) are described in the next section ‘CNN model; Gait-signal dataset’.

For parallel convolution of the middle three time-domain data of two gait signals, the input matrix is a d_3×2 matrix: $X=\left[\begin{array}{cc}{x}_{11} & {x}_{12}\\ {x}_{21} & {x}_{22}\\ {x}_{31} & {x}_{32}\end{array}\right]$. The jth column of X contains the middle three time-domain data of the jth gait signal (Fig. 4). The ith row of X contains the ith time-domain data of two gait signals. A DSP drove VOAs to load gait signals into the optical domain. The photonic memory tensor core was then effectively performing:

$$\begin{array}{c}{Y=W\times X=\left[\begin{array}{ccc}{w}_{11} & {w}_{12} & {w}_{13}\\ {w}_{21} & {w}_{22} & {w}_{23}\\ {w}_{31} & {w}_{32} & {w}_{33}\end{array}\right]}^{{\rm{T}}}\left[\begin{array}{cc}{x}_{11} & {x}_{12}\\ {x}_{21} & {x}_{22}\\ {x}_{31} & {x}_{32}\end{array}\right]\\ \,\,\,\,=\,\left[\begin{array}{cc}\mathop{\sum }\limits_{n=1}^{3}{{w}_{n1}x}_{n1} & \mathop{\sum }\limits_{n=1}^{3}{{w}_{n1}x}_{n2}\\ \mathop{\sum }\limits_{n=1}^{3}{{w}_{n2}x}_{n1} & \mathop{\sum }\limits_{n=1}^{3}{{w}_{n2}x}_{n2}\\ \mathop{\sum }\limits_{n=1}^{3}{{w}_{n3}x}_{n1} & \mathop{\sum }\limits_{n=1}^{3}{{w}_{n3}x}_{n3}\end{array}\right]=\left[\begin{array}{cc}{y}_{11} & {y}_{12}\\ {y}_{21} & {y}_{22}\\ {y}_{31} & {y}_{32}\end{array}\right]\end{array}$$

in which ${y}_{{ij}}={\sum }_{n=1}^{3}{{w}_{{ni}}x}_{{nj}}$ represents the convolution result of the middle three time-domain data of the jth gait signal using the ith kernel. Each row of Y was output from the respective photonic memory tensor core output channel.

CNN model

Gait-signal dataset

Gait signals from ten patients with Parkinson’s disease were taken from the ‘Gait in Parkinson’s Disease’ database in PhysioNet^51,52. This database includes the vertical ground reaction force records of individuals as they walked at their usual, self-selected pace for approximately 2 min on level ground. The corresponding clinical information of ten patients is provided in Supplementary Table 1. Fifty gait pulses were extracted from each patient, leading to a total of 500 gait pulses. Each pulse has a 1.2-s duration. The original electrocardiogram signals have a 0.01-s time resolution. Gait pulses were extracted with a time interval of 0.04 s (that is, one out of every four original data), leading to 31 data in the extracted gait pulses. The 0.04-s time interval was carefully chosen to minimize the extracted dataset while maintaining the key features from the original gait pulses. Eighty per cent of pulses were used for training and 20% were used for testing, that is, a total of 400 pulses for training and 100 pulses for testing.

MNIST dataset

The test dataset of MNIST handwritten digits and MNIST fashion products were respectively taken from https://git-disl.github.io/GTDLBench/datasets/mnist_datasets/ and https://developer.ibm.com/exchanges/data/all/fashion-mnist/. In both cases, the 10,000 test images were split into a training set with 8,000 images and a testing set with 2,000 images.

CNN architecture

The CNN architecture for the classification of the gaits dataset is shown in Fig. 4d. The input layer takes the gait signal, which is in the form of a d_31×1 1D array. The 1D array is passed to a convolution layer consisting of three d_1×3 kernels. Convolution operations were implemented with a stride of 1 and ‘valid padding’, resulting in a d_3×(31-3+1) output. The output was activated by a rectified linear unit layer and flattened to a d_87×1 vector. The flattened activated output was then fed to a fully connected layer with ten neurons. The output from the fully connected layer was converted to probabilities by a softmax layer. Finally, the classification result was obtained. The gait signals were classified into ten categories, representing ten patients with Parkinson’s disease. The convolution operations were implemented using the photonic memory tensor core. The convolution results were processed by the following CNN layers using the MATLAB R2021b Deep Learning Toolbox. Weights of the fully connected layer were trained by the Adam optimizer. A hundred epochs were used to reach the final CNN outcomes. The CNN architecture for the MNIST datasets is similar to that for the gaits dataset, as shown in Fig. 5d. We will only mention the key differences here. For the MNIST datasets, besides the trivial difference in layer dimensions, the images were convolved with ‘same padding’ implemented by the photonic EAM tensor core. We used 50 epochs to reach the final CNN outcomes.

Data availability

The data that support the findings of this study are available from the corresponding author on request. The gait dataset analysed in this study is available from the open source ‘Gait in Parkinson’s Disease’ in PhysioNet at https://doi.org/10.13026/C24H3N. The MNIST handwritten digits dataset is available at https://git-disl.github.io/GTDLBench/datasets/mnist_datasets/. The MNIST fashion products dataset is available at https://developer.ibm.com/exchanges/data/all/fashion-mnist/. Source data are provided with this paper.

Code availability

The code used in this work is available from the authors on request.

References

Stokes, G. G. On the change of refrangibility of light. Philos. Trans. R. Soc. Lond. 142, 463–562 (1852).
ADS Google Scholar
Round, H. J. A note on carborundum. Electr. World 49, 309 (1907).
Google Scholar
Maiman, T. H. Stimulated optical radiation in ruby. Nature 187, 493–494 (1960).
Article ADS Google Scholar
Nakamura, M. et al. GaAs–Ga_1−xAl_xAs double-heterostructure distributed-feedback diode lasers. Appl. Phys. Lett. 25, 487–488 (1974).
Article ADS CAS Google Scholar
Nakamura, M., Aiki, K., Umeda, J. & Yariv, A. CW operation of distributed-feedback GaAs-GaAlAs diode lasers at temperatures up to 300 K. Appl. Phys. Lett. 27, 403–405 (1975).
Article ADS CAS Google Scholar
Kikuchi, K. Digital coherent optical communication systems: fundamentals and future prospects. IEICE Electron. Express 8, 1642–1662 (2011).
Article Google Scholar
Li, N. et al. A progress review on solid-state LiDAR and nanophotonics-based LiDAR sensors. Laser Photonics Rev. 16, 2100511 (2022).
Article ADS Google Scholar
Shaipanich, T., Pahlevaninezhad, H. & Lam, S. in Interventions in Pulmonary Medicine (eds Díaz-Jimenez, J. P. & Rodriguez, A. N.) 267–279 (Springer, 2017).
Bourassin-Bouchet, C. & Couprie, M. E. Partially coherent ultrafast spectrography. Nat. Commun. 6, 6465 (2015).
Article ADS CAS PubMed Google Scholar
Wang, H. et al. Coloured vortex beams with incoherent white light illumination. Nat. Nanotechnol. 18, 264–272 (2023).
Article ADS CAS PubMed Google Scholar
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
Article ADS CAS Google Scholar
Lipsett, B. M. S. & Mandel, L. Coherence time measurements of light from ruby optical masers. Nature 199, 553–555 (1963).
Article ADS Google Scholar
Hayashi, I., Panish, M. B., Foy, P. W. & Sumski, S. Junction lasers which operate continuously at room temperature. Appl. Phys. Lett. 17, 109–111 (1970).
Article ADS CAS Google Scholar
Araki, M. et al. Optical coherence tomography in coronary atherosclerosis assessment and intervention. Nat. Rev. Cardiol. 19, 684–703 (2022).
Article PubMed PubMed Central Google Scholar
Clark, J. N., Huang, X., Harder, R. & Robinson, I. K. High-resolution three-dimensional partially coherent diffraction imaging. Nat. Commun. 3, 993 (2012).
Article ADS CAS PubMed Google Scholar
Durr, A., Kramer, R., Schwarz, D., Geiger, M. & Waldschmidt, C. Calibration-based phase coherence of incoherent and quasi-coherent 160-GHz MIMO radars. IEEE Trans. Microw. Theory Tech. 68, 2768–2778 (2020).
Article ADS Google Scholar
Peng, D. et al. Optical coherence encryption with structured random light. PhotoniX 2, 6 (2021).
Article PubMed PubMed Central Google Scholar
Liu, Y. et al. Robust far-field imaging by spatial coherence engineering. Opto-Electronic Adv. 4, 210027 (2021).
Article Google Scholar
Lecun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS PubMed Google Scholar
Assael, Y. et al. Restoring and attributing ancient texts using deep neural networks. Nature 603, 280–283 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Rao, Z. et al. Machine learning-enabled high-entropy alloy discovery. Science 378, 78–85 (2022).
Article ADS CAS PubMed Google Scholar
Dauparas, J. et al. Robust deep learning–based protein sequence design using ProteinMPNN. Science 378, 49–56 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Shastri, B. J. et al. in Encyclopedia of Complexity and Systems Science (ed. Meyers, R. A.) 1–37 (Springer, 2018).
Zhou, H. et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light Sci. Appl. 11, 30 (2022).
Article ADS PubMed PubMed Central Google Scholar
Wetzstein, G. et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39–47 (2020).
Article ADS CAS PubMed Google Scholar
Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nat. Photonics 15, 102–114 (2021).
Article ADS CAS Google Scholar
Nahmias, M. A. et al. Photonic multiply-accumulate operations for neural networks. IEEE J. Sel. Top. Quantum Electron. 26, 7701518 (2020).
Article CAS Google Scholar
Pai, S. et al. Experimentally realized in situ backpropagation for deep learning in photonic neural networks. Science 380, 398–404 (2023).
Article ADS CAS PubMed Google Scholar
Mourgias-Alexandris, G. et al. Noise-resilient and high-speed deep learning with coherent silicon photonics. Nat. Commun. 13, 5572 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, H. et al. An optical neural chip for implementing complex-valued neural network. Nat. Commun. 12, 457 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Tait, A. N., Nahmias, M. A., Shastri, B. J. & Prucnal, P. R. Broadcast and weight: an integrated network for scalable photonic spike processing. J. Light. Technol. 32, 4029–4041 (2014).
Article Google Scholar
Deng, Y. & Chu, D. Coherence properties of different light sources and their effect on the image sharpness and speckle of holographic displays. Sci. Rep. 7, 5893 (2017).
Article ADS PubMed PubMed Central Google Scholar
Huang, C. et al. A silicon photonic–electronic neural network for fibre nonlinearity compensation. Nat. Electron. 4, 837–844 (2021).
Article CAS Google Scholar
Bai, B. et al. Microcomb-based integrated photonic processing unit. Nat. Commun. 14, 66 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Yan, T. et al. All-optical graph representation learning using integrated diffractive photonic computing units. Sci. Adv. 8, eabn7630 (2022).
Article CAS PubMed PubMed Central Google Scholar
Fu, T. et al. Photonic machine learning with on-chip diffractive optics. Nat. Commun. 14, 70 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Feldmann, J. et al. Parallel convolution processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
Article ADS CAS PubMed Google Scholar
Ríos, C. et al. In-memory computing on a photonic platform. Sci. Adv. 5, eaau5759 (2019).
Article ADS PubMed PubMed Central Google Scholar
Mehta, K. et al. High-power heterogeneously integrated III-V/silicon superluminescent diode. IEEE Photonics Technol. Lett. 35, 365–368 (2023).
Article ADS CAS Google Scholar
De Groote, A. et al. Heterogeneously integrated III–V-on-silicon multibandgap superluminescent light-emitting diode with 290 nm optical bandwidth. Opt. Lett. 39, 4784–4787 (2014).
Article ADS PubMed Google Scholar
Akcay, C., Parrein, P. & Rolland, J. P. Estimation of longitudinal resolution in optical coherence imaging. Appl. Opt. 41, 5256–5262 (2002).
Article ADS PubMed Google Scholar
Valero, N. et al. High-power amplified spontaneous emission pulses with tunable coherence for efficient non-linear processes. Sci. Rep. 11, 4844 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Rios, C. et al. Integrated all-photonic non-volatile multi-level memory. Nat. Photonics 9, 725–732 (2015).
Article ADS CAS Google Scholar
Becker, M. et al. in NeurIPS 2023 Workshop: Machine Learning and the Physical Sciences (eds Nord, B. et al.) (MIT Press, 2023).
Guo, X. et al. Correlation between emission and relative intensity noise spectral profiles of an Er-doped fiber superfluorescent source. AIP Adv. 12, 055226 (2022).
Article ADS CAS Google Scholar
Zhao, M., Morthier, G. & Baets, R. Analysis and optimization of intensity noise reduction in spectrum-sliced WDM systems using a saturated semiconductor optical amplifier. IEEE Photonics Technol. Lett. 14, 390–392 (2002).
Article ADS Google Scholar
Sacher, W. D. et al. Monolithically integrated multilayer silicon nitride-on-silicon waveguide platforms for 3-D photonic circuits and devices. Proc. IEEE 106, 2232–2245 (2018).
Article CAS Google Scholar
Siew, S. Y. et al. Review of silicon photonics technology and platform development. J. Light. Technol. 39, 4374–4389 (2021).
Article ADS CAS Google Scholar
Magyar, G. & Mandel, L. Interference fringes produced by superposition of two independent maser light beams. Nature 198, 255–256 (1963).
Article ADS Google Scholar
Paul, H. Interference between independent photons. Rev. Mod. Phys. 58, 209–231 (1986).
Article ADS Google Scholar
Frenkel-Toledo, S. et al. Treadmill walking as an external pacemaker to improve gait rhythm and stability in Parkinson’s disease. Mov. Disord. 20, 1109–1114 (2005).
Article PubMed Google Scholar
Goldberger, A. L. et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation 101, e215–e220 (2000).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This research was supported by the European Union’s Horizon 2020 research and innovation programme (grant no. 101017237, PHOENICS project) and the European Union’s Innovation Council Pathfinder programme (grant no. 101046878, HYBRAIN project). This research was financed in part by UK Research and Innovation (UKRI, EP/T023899/1, EP/R001677/1 and EP/W022931/1). B.D. acknowledges financial support from Singapore A*STAR International Fellowship (AIF). We acknowledge funding support by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy EXC 2181/1 – 390900948 (the Heidelberg STRUCTURES Excellence Cluster), the Excellence Cluster 3D Matter Made to Order (EXC-2082/1—390761711) and CRC 1459 ‘Intelligent Matter’. We thank X. Li, N. Youngblood and U. Ali for help with sample fabrication.

Author information

These authors contributed equally: Bowei Dong, Frank Brückerhoff-Plückelmann

Authors and Affiliations

Department of Materials, University of Oxford, Oxford, UK
Bowei Dong, Samarth Aggarwal, Nikolaos Farmakidis, Mengyun Wang, Guoce Yang, June Sang Lee, Yuhan He & Harish Bhaskaran
Institute of Microelectronics, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Bowei Dong & Dim-Lee Kwong
Kirchhoff-Institute for Physics, Heidelberg University, Heidelberg, Germany
Frank Brückerhoff-Plückelmann, Lennart Meyer, Jelle Dijkstra & Wolfram H. P. Pernice
Center for NanoTechnology, University of Münster, Münster, Germany
Ivonne Bente, Daniel Wendland, Akhil Varri & Wolfram H. P. Pernice
Photonics Research Group, Ghent University – imec, Ghent, Belgium
Emmanuel Gooskens & Peter Bienstman

Authors

Bowei Dong
View author publications
Search author on:PubMed Google Scholar
Frank Brückerhoff-Plückelmann
View author publications
Search author on:PubMed Google Scholar
Lennart Meyer
View author publications
Search author on:PubMed Google Scholar
Jelle Dijkstra
View author publications
Search author on:PubMed Google Scholar
Ivonne Bente
View author publications
Search author on:PubMed Google Scholar
Daniel Wendland
View author publications
Search author on:PubMed Google Scholar
Akhil Varri
View author publications
Search author on:PubMed Google Scholar
Samarth Aggarwal
View author publications
Search author on:PubMed Google Scholar
Nikolaos Farmakidis
View author publications
Search author on:PubMed Google Scholar
Mengyun Wang
View author publications
Search author on:PubMed Google Scholar
Guoce Yang
View author publications
Search author on:PubMed Google Scholar
June Sang Lee
View author publications
Search author on:PubMed Google Scholar
Yuhan He
View author publications
Search author on:PubMed Google Scholar
Emmanuel Gooskens
View author publications
Search author on:PubMed Google Scholar
Dim-Lee Kwong
View author publications
Search author on:PubMed Google Scholar
Peter Bienstman
View author publications
Search author on:PubMed Google Scholar
Wolfram H. P. Pernice
View author publications
Search author on:PubMed Google Scholar
Harish Bhaskaran
View author publications
Search author on:PubMed Google Scholar

Contributions

All authors contributed to this work substantially. B.D., F.B.-P., W.H.P.P. and H.B. conceived the experiment. B.D. and F.B.-P. investigated the coherence properties of different light sources. B.D. fabricated and measured the photonic memory tensor core, with assistance from S.A., N.F., M.W., G.Y., J.S.L. and Y.H. F.B.-P fabricated and measured the photonic EAM tensor core, with assistance from L.M., J.D., I.B., D.W., A.V., E.G. and P.B. All authors discussed the data and wrote the manuscript together. W.H.P.P. and H.B. led the work.

Corresponding author

Correspondence to Harish Bhaskaran.

Ethics declarations

Competing interests

H.B. and W.H.P.P. hold shares in Salience Labs Ltd. The other authors declare no competing interests.

Peer review

Peer review information

Nature thanks Kathy Lüdge, Jiayang Wu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Source data

Source Data Fig. 2

Source Data Fig. 3

Source Data Fig. 4

Source Data Fig. 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dong, B., Brückerhoff-Plückelmann, F., Meyer, L. et al. Partial coherence enhances parallelized photonic computing. Nature 632, 55–62 (2024). https://doi.org/10.1038/s41586-024-07590-y

Download citation

Received: 16 May 2023
Accepted: 17 May 2024
Published: 31 July 2024
Issue date: 01 August 2024
DOI: https://doi.org/10.1038/s41586-024-07590-y

This article is cited by

Advanced Design for High-Performance and AI Chips
- Ying Cao
- Yuejiao Chen
- Bingang Xu
Nano-Micro Letters (2026)
Parallel optical computing capable of 100-wavelength multiplexing
- Xiao Yu
- Ziqi Wei
- Peng Xie
eLight (2025)
Self-driving laboratories, advanced immunotherapies and five more technologies to watch in 2025
- Michael Eisenstein
Nature (2025)
Spectral convolutional neural network chip for in-sensor edge computing of incoherent natural light
- Kaiyu Cui
- Shijie Rao
- Shengjin Wang
Nature Communications (2025)
Probabilistic photonic computing for AI
- Frank Brückerhoff-Plückelmann
- Anna P. Ovvyan
- Wolfram Pernice
Nature Computational Science (2025)

Comments

Commenting on this article is now closed.

Bhibuthi bhusan Patel 1 August 2024, 06:41

In this experiment of MNIST,the fluctuation in calculations(hand written and theoritical)can be cited in EAMs application.