Main

Light, as a medium for communication, has the unique ability to transmit volumes of data with minimal energy loss. This capability not only sparked the revolution of internet-based communication over fibre-optic networks but also holds the potential to greatly expand computing power beyond current capabilities. Specifically, artificial intelligence (AI) is poised to dramatically transform the computational landscape if provided with more efficient data communication between the nodes in computer networks1,2. A critical bottleneck to the full implementation of light-based communication is the conversion of electrical data from inside a computer chip to optical data. At present, the data are stored densely in these semiconductor chips in compute nodes but are sent out of the chip through centimetre-long electrical wires before finally interfacing with optical transmitters in the form of pluggable optical transceivers. The limited number of these electrical channels and their wire lengths result in data rates that are not scalable without accounting for a substantial amount of energy consumption3,4. To overcome this bottleneck, electrical channels must be condensed and converted into optical signals within a compact area.

Previously, intensive efforts have produced chip-scale transmitters and receivers (transceivers) towards this goal but have been marked by a lack of efficiency or scale. These works build on the field of integrated photonics, a technology that aggregates a multitude of optical components onto a single integrated chip. In particular, silicon is highly appealing as a material platform for integrated photonics since it can leverage the tremendous investment in the complementary metal–oxide–semiconductor (CMOS) infrastructure used to fabricate microelectronics chips5. The silicon photonics technology platform includes devices such as microresonator-based modulators6,7, filters and germanium photodiodes8 that are compact, efficient in their electrical-to-optical and optical-to-electrical conversions and scalable to many wavelength channels9,10. So far, the largest of these systems is composed of 64 channels of photonics and electronics on a single chip and achieves 240 fJ per communicated bit by the transmitter11,12,13. However, this system has receiver energy consumption above 1,000 fJ per bit and has a limited density from the lateral arrangement of photonics and electronics on the same two-dimensional chip. While this monolithic integration of CMOS transistors alongside photonic devices on the same chip may appear highly appealing14,15, this configuration ‘freezes’ transistors at a given node size and, thus, cannot benefit from the further energy efficiency, size and speed gains of moving to more advanced CMOS nodes. Alternatively, three-dimensional (3D) integration combines a more efficient, leading edge CMOS node electronic chip and a separate photonic chip to improve on these limitations. Ongoing 3D efforts have demonstrated sub-200 fJ per bit powers from transmitter16,17,18,19 and receiver front ends17,18,20, but the chip-to-chip bond spacings are either substantially larger than the devices themselves16,17,19,20 or rely on emerging hybrid bonding technology18. Furthermore, 3D-integrated transceiver demonstrations have yet to show more than eight channels16.

Here, we present a transceiver that is a large-scale 3D system, featuring 80 channels on a single chip, and is at the forefront of energy and area efficiency, consuming only 120 fJ per bit from its electro-optic front ends in a high-density platform. The transceiver achieves low energy consumption through low-capacitance 3D connections between photonics and codesigned CMOS electronic circuits. Although each channel has a relatively low data signalling rate of 10 Gb s−1, this allows the receiver electronics to operate with high sensitivity and minimal energy consumption. The large array of channels compensates for the low per-channel data rates, delivering a high aggregate data rate of 800 Gb s−1in a compact transmitter or receiver area of only 0.15 mm2 (5.3 Tb s−1 mm2). In addition, having many low-data-rate channels relaxes signal processing and time multiplexing of the low-data-rate streams native to the processor3,21. Furthermore, wavelength-division multiplexing sources for these numerous data streams are becoming available with the advent of chip-scale microcombs9,22. Our demonstration unlocks the tremendous potential of light as a high-bandwidth and energy-efficient medium for interchip communication, offering an immediate solution to the pressing challenge of AI scaling.

Results

We implement this high-density transceiver using compact photonic devices and dense, 28-nm-node codesigned electronic circuits; however, the total density ultimately depends on the 3D bond spacings. To address this, we use a high-density bonding process using copper pillar bumps. An electroplating process is used to form bumps on the photonic chip with copper pedestals capped with a layer of tin. The copper–tin bumps are then bonded to a nickel-plated electronic chip under a thermal and compressive force bringing the chips together. Figure 1a illustrates the bonding of the electronic and photonic chips. Figure 1b shows a microscope image of the photonic device array after microbumping. This dense photonic device array resides within the photonic test chip, which Fig. 1c shows with the electronic test chip and a diagram of the fibre-to-chip coupler (see Supplementary Note 1 for coupler details). We push the limits of this bonding technology by using a 15 μm spacing and 10 μm bump diameters (25 μm pitch) in an array of 2,304 bonds. This process balances two potential failure modes for such close spacing: excessive tin causing flow and electrical shorting to adjacent bonds during bonding and insufficient tin leading to brittle bonds23. We test our bonding process using cross-sectional scanning electron micrographs of the bonds, displayed in Fig. 1d, and by measuring the force needed to separate the bonded chips. The cross-section analysis reveals that tin does not flow to adjacent bonds, while the shear test demonstrates a robust 2.1 kg (114.9 MPa) force required to separate the bonded chips. The modelling and measurements show each bond has a 10 fF capacitance (Methods). Figure 1e shows the assembled transceiver, which is wire bonded to a printed circuit board and optically coupled to a fibre array (Methods). Figure 1f illustrates the material layers in the electronic chip, photonic chip and bonds and shows how fibre and wire bonds interface with the transceiver. This bonding technique provides an ideal platform to achieve the required density for chip-to-chip data communication links.

Fig. 1: 3D-integrated photonic-electronic transceiver.
figure 1

a, An illustration of the 3D-integrated photonic-electronic system combining arrays of electronic cells with arrays of photonic devices. b, A microscope image of the 80-channel photonic device arrays with an inset of two transmitter and two receiver cells. c, Microscope images of the photonic and electronic chips. The active photonic circuits occupy an area outlined in white, while the outer photonic chip area is used to fan out the optical/electrical lanes for fibre coupling and wire bonding. The blue overlay shows a four-channel transmitter and receiver waveguide path; the disk and ring overlays are not to scale. An inset shows a diagram of the fibre-to-chip edge coupler, consisting of a silicon nitride (Si3N4) inverse taper and escalator to silicon. d, A scanning electron microscope image of the bonded electronic and photonic chip cross-section. e, An image of the wire-bonded transceiver die bonded to a printed circuit board and optically coupled to a fibre array with a US dime for scale. f, A cross-sectional diagram of the electronic and photonic chips and their associated material stacks. Both chips consist of a crystalline silicon substrate, doped-silicon devices and metal interconnection layers.

The 3D-integrated chip contains an array of 80 transmitter cells and 80 receiver cells; these cells are organized into 20 waveguide buses with four wavelength channels per bus. A blue overlay in Fig. 1c outlines a single transmitter and receiver bus. Each transmitter cell has a local memory in the electronic chip that stores a pseudo-random bit sequence. A periodic clock signal triggers the electronics, and the transmitter cell electronics send out the programmed bit sequences as voltage pulses incident on the photonic modulator electrodes. These voltage pulses blue shift the microdisk resonance from a blocking to a non-blocking state and, thus, modulate an on-resonance laser line. Figure 2a illustrates the transmitter experiment, and Fig. 2b shows a schematic of the transmitter cell, while Fig. 2c shows the spectrum of the modulator bus with four microdisk resonances. After the transmitter characterization, we test the receivers, which function similarly: in each receiver cell, the wavelength channels carry signals on the photonic chip, and microrings selectively drop wavelengths onto each respective photodiode. The electronic chip then amplifies the photocurrent generated by the photodiode and writes the data into the local memory of each receiver cell, as illustrated in Fig. 3a. Figure 3b shows a schematic of the receiver cell. For performance characterization, an on-chip circuit compares the receiver memory to expected data and keeps an error count that is periodically read out of the chip. This architecture of transmitters and receivers fills the array of channels in the dense area provided by the bonding process.

Fig. 2: Transmitter characterization and performance.
figure 2

a, An illustration of the transmitter experimental test setup showing a single wavelength laser channel modulated by the transmitter and measured on an oscilloscope. b, The transmitter cell circuit schematic. c, An optical spectrum of the four-channel transmitter bus. d, A resonance shift of a representative microdisk as a function of reverse-bias voltage applied to the vertical p–n junction. e, The transmitted signal ER (power of ‘1’ bit divided by power of ‘0’ bit), IL (power of ‘1’ bit divided by power before the modulator) and normalized OMA (normalized power of ‘1’ bit minus power of ‘0’ bit) with a 1.5 V driver voltage. f, The ER and IL at maximum OMA for a range of driver voltages. g, Measured energy consumption of the transmitter array for a range of driver voltages and a \(\frac{1}{4}\)CV2 fit, where C is the capacitance charged by the driver voltage V. h, A BER measurement of the modulated signal input to a commercial receiver at 1, 1.25 and 1.5 V driver voltages; the received power is the average signal power at the commercial receiver. i, Eye diagrams for all 80 modulators on the photonic chip at 10 Gb s−1 per modulator and 1 dBm input laser power.

Fig. 3: Receiver characterization and performance.
figure 3

a, An illustration of the receiver test setup showing a laser line modulated by a commercial transmitter and received by the 3D-integrated photonic-electronic receiver. b, A receiver cell circuit schematic. c, A responsivity (light to electrical current conversion efficiency) measurement of the photodiode. d, An optical spectrum of a receiver bus. e, A BER test of a receiver cell using a commercial transmitter signal; the received power is the average signal power at the photodiode.

The transmitter cell within the 80-cell array consumes 50 fJ per bit when driving the microdisks with a 1 V swing. This power is dynamic, equal to \(\frac{1}{4}\,{CV}^{\,2}\), where C is the capacitance being charged or discharged during a bit transition, and V is the charged-to-discharged voltage24. The vertical p–n junction microdisk enables a low voltage drive by featuring a higher overlap of the p–n depletion region and the optical whispering gallery mode of the disk compared with lateral junctions7 and results in an electrical-to-optical response of 75 pm resonance shift per applied volt (Fig. 2d). We further characterize this response with the dynamic insertion loss (IL, the power of a ‘1’ bit divided by the power before the modulator) and extinction ratio (ER, the power of a ‘1’ bit divided by the power of a ‘0’ bit). Figure 2e shows these metrics, captured from the modulated signal output of a transmitter cell driven at 1.5 V. In this measurement, the laser wavelength moves into the shifting resonance and the optical modulation amplitude (OMA, the power of ‘1’ bit minus the power of ‘0’ bit) of the output signal increases, reaching a maximum at 2.5 dB IL and 4 dB ER. Figure 2f shows the ER and IL at maximum OMA for driver voltages between 1 and 1.5 V. These high ERs and low ILs per volt enable a reduced V in \(\frac{1}{4}\,{CV}^{\,2}\). The capacitance sources include the microdisk p–n junction (128 fF), bond pads (10 fF) and capacitances within the driver circuit (61 fF), combining for a total expected capacitance of 199 fF (Methods). These capacitances exhibit low values through microdisk compactness, miniaturized bonds and careful design in the 28 nm electronic chip technology. Figure 2g shows the transmitter energy consumption as all 80 modulators are transmitting data with drive voltages ranging between 1 and 1.5 V. The \(\frac{1}{4}\,{CV}^{\,2}\) model fits 198 fF total capacitance per cell using this data, aligning closely with the expected 199 fF from the independently measured and modelled devices. Next, we record eye diagrams for each of the 80 transmitters on the chip with a drive at 10 Gb s−1 per transmitter and 1 dBm laser power before the modulator (Fig. 2i). As every transmitter modulates, we measure current and voltage on the transmitter power supply for the previous energy consumption (Fig. 2g). With no optical amplification, the oscilloscope receiver is the limiting factor of the eye qualities with an 8 µW input-referred root mean square noise (denoted as input-referred noise here on). All 80 eye diagrams in the array are open and uniform, which confirms the high yield of the bonding process and validates our many-channel approach. As a further confirmation of transmitter signal quality, a bit-error ratio (BER) test with a reference receiver demonstrates error-free performance (BER <10−12) down to a receiver noise-determined power for 1-, 1.25- and 1.5-V modulator drives (Fig. 2h). In the following section, our on-chip receiver shows a dramatically reduced input-referred noise of 480 nW. The 80-channel photonic transmitter array outputs an aggregate data rate of 800 Gb s−1and occupies an area of 0.15 mm2, demonstrating an unprecedented bandwidth density of 5.3 Tb s−1 mm2.

The receiver cell consumes 70 fJ per bit when receiving a 10 Gb s−1signal at −24.85 dBm average power with a 4 × 10−10 BER. The receiver spends energy as a static biasing of the electronic amplifier. The photodiode is a vertical p-silicon, i-germanium and n-germanium diode that efficiently converts optical signals to electrical current with an efficiency of 1 A W−1(Fig. 3c). The capacitance of this photodiode is crucial since receiver noise is proportional to the amplifier input capacitance and is compensated by static biasing power (Methods and Supplementary Note 3). Minimizing this noise is critical for reducing laser power sourced into the link and improving energy efficiency. With a measured photodiode capacitance of 17 fF and pad capacitances of 10 fF, the simulated input-referred noise is 300 nW. A BER test is used to evaluate this performance; a signal from an ideal modulator is used to send a 10 Gb s−1data stream into the chip from which on-chip circuits measure errors in the received bits. On the photonic chip, a ring resonator on a four-channel bus filters the modulated signal to a photodiode. Figure 3d shows the four-channel bus spectrum. We then gradually reduce the signal power while counting errors on the electronic chip, obtaining the BER curve in Fig. 3e. The receiver exhibits a 4 × 10−10 BER at a −24.85 dBm average input optical power. This reveals a sensitivity of −22.3 dBm OMA using the 13 dB ER signal and an input-referred noise of 480 nW. We record the energy consumption of the receiver from its power supply at −24.85 dBm input optical power to be 70 fJ per bit. This result, along with the transmitter cell performance, demonstrates that both the receiver and transmitter consume less than 100 fJ per bit while supporting a massive bandwidth transfer of 800 Gb s−1from the dense array.

The transmitter and receiver cells independently demonstrate a combined 120 fJ per bit; we next connect them and validate their combined performance. Optical fibre (4 m of SMF28) connects two separate transceivers as a complete data communication link, with one transceiver functioning as a transmitter and the other as a receiver (Fig. 4a). A polarization controller is the only component between the transmitter and receiver. A shared clock synchronizes the two electronic chips, and programmable clock delays in each receiver cell align the transmitted data with the receiver sampling point. A laser diode array provides four wavelength channels at −5 dBm power per channel, which we feed into a bus on the transmitter chip. Figure 4b shows the spectrum of these laser channels. Individual transmitter cells simultaneously modulate each wavelength at 8 Gb s−1at a 1.5 V drive, resulting in open eye diagrams for each channel (Fig. 4c). The signal powers are too low for detection by the diagnostic oscilloscope receiver, so we amplify the signal before the oscilloscope and normalize the eye diagrams. However, the signals do not require amplification for the electronic-photonic receiver due to its high sensitivity. The average power per channel at each receiver photodiode is −19.5 dBm. On-chip error counters record errors in each channel in 1-min intervals, revealing a maximum recorded BER of 6 × 10−8 and a minimum count of no errors in the interval, denoted as 10−12, in Fig. 4d. This result shows that the transmitters and receivers within the 3D integration can form a complete low-power, high-bandwidth link needed for next-generation computing systems.

Fig. 4: Transmitter to receiver data communication link demonstration.
figure 4

a, An illustration of the transmitter to receiver link showing a 3D-integrated photonic-electronic transmitter modulating four laser lines and a separate photonic-electronic receiver converting the four data channels back to the electrical domain. The illustration excludes a polarization controller between the transmitter and receiver. b, A spectrum of the link laser source. c, Eye diagrams of the four channels after the transmitter. The diagrams do not include a power scale bar, as an amplifier is used before the oscilloscope, but it is not used in the optical link. d, BERs of the data channels after the receiver.

Discussion

Integrated photonic chips present a promising low-power platform to address the data transfer demands of AI computing. Here, we realized this promise by demonstrating a scaled-up array of 80 channels on a single electronic-photonic, densely 3D-integrated transceiver. This multichip module consumes minimal energy by virtue of the large number of channels, cutting-edge low-capacitance bonding technology, codesigned electronic-photonic circuits and advanced devices used. While our demonstrated system achieves record performance in terms of energy efficiency and bandwidth density, the performance can be further improved in future implementations. Although the microdisks used in this demonstration exhibit high performance, resonant modulators can be developed with lower capacitance7,25 and a higher electro-optical response7,26, both of which would decrease the dynamic power of the transmitter. Similarly, on the receiver side, lower capacitance photodiodes27,28 could reduce the power and noise of the receiver architecture. However, miniaturizing photodiodes requires considering a loss of responsivity29, presenting complex link-level trade-offs. In addition, the energy consumption of the electronic circuits can be further reduced by moving to a more advanced CMOS node. While our demonstrated bonding technology is approaching the limit of how closely spaced tin bonds can be made, further density scaling could be realized through the development of hybrid bonding18. However, after achieving the low-capacitance value of the bonds demonstrated in this work, pursuing a further reduction in bond capacitance would yield diminishing returns in terms of energy efficiency.

Reduced chip-to-fibre optical losses can improve the loss budget of our demonstrated link, and laser powers may be as low as 47 fJ per bit with a distributed feedback laser and 30 fJ per bit for a scalable, high channel count comb laser (Supplementary Note 1). Furthermore, silicon resonators are sensitive to temperature and fabrication variation, requiring thermal control circuits30,31 and, for minimized power, reduced heat leakage into the environment using methods such as a silicon substrate removal around resonators19,32,33. Detailed wafer-scale resonance variation data and approximate thermal energy contributions across a range of scenarios provide a best case thermal energy consumption of 176 fJ per bit (Supplementary Note 2). A dense wavelength-division multiplexing architecture may be used to reduce this power penalty21. Moreover, the photonic circuits are highly polarization-sensitive and require polarization-maintaining fibre or the addition of polarization management circuits29,34,35. Finally, while we demonstrate high-bandwidth density, a higher per-fibre bandwidth and photonic chip edge bandwidth density can be achieved with wavelength scaling by cascading more arrayed channels onto fewer waveguide buses9,10. This architecture can combine with chip-scale frequency combs to generate hundreds of wavelength channels9.

While the potential impact of this technology is evident for the advancement of energy-efficient AI computing, its use may extend to far reaching applications. These low-power, massively parallel optical links could enable pervasive device connectivity and transform computing by streamlining resource allocation through optically linked, disaggregated and reconfigurable computing and memory resources36,37,38,39, revolutionizing the computing landscape over the next decade and beyond.

Methods

Transceiver assembly

Separate CMOS foundries are used to fabricate the electronic and photonic chips. The photonic chips were fabricated through the American Institute for Manufacturing Integrated Photonics on a custom 300-mm silicon-on-insulator wafer. The American Institute for Manufacturing Integrated Photonics process design kit includes the microdisks, ring filters and photodiodes40. The microdisks and ring filters have quality factors of 5,800 and 2,800, respectively, and both have a free spectral range of 25.6 nm for a channel spacing of 6.4 nm. An example of the photodiode design can be found in ref. 41. The electronic chips were fabricated through Taiwan Semiconductor Manufacturing Company Limited on a shared multiproject wafer in a 28-nm CMOS process node. Both chips feature a square array of 2,304 aluminium pads at a 25-μm pitch on their top surfaces. These pads connect to lower metal layers within each chip and the devices on the silicon layer.

The photonic and electronic chips are then processed postfabrication before bonding their pad arrays together. In this step, we core the 300-mm photonic wafer to a 200-mm wafer, and a wafer-level process is used to bump its pads with electroplated layers of copper and tin. The electronic chips, received as individual 1.6 mm2 units from a shared wafer, are unsuitable for wafer-level photolithography-based processes. Alternatively, we adopt a chip-level process of electroless nickel plating, followed by an additional layer of immersion gold plating to prevent nickel oxidation. After dicing the bumped photonic wafer into 6.5 mm × 3 mm chips, a thermo-compression bond is used to connect the bumped photonic chips to the plated electronic chips.

To power and operate the transceiver, we create electrical connections to the electronic chip through the bonds to the photonic chip. The metal layers on the photonic chip wire these connections to large electrical pads on an exposed edge of the photonic chip. Wire bonds connect these pads to a printed circuit board, which connects to: (1) a microcontroller that programs the electronic chip, (2) power sources that supply the electronic chip voltage rails and (3) a radio-frequency (RF) clock generator for the 5 GHz clock of the electronic chip. This clock line is a coplanar RF waveguide on the printed circuit board with a matched impedance to 50 Ω RF cables. The optical fibres couple light to waveguide buses through silicon nitride edge couplers; these couplers are on the side of the photonic chip that is opposite the wire-bond pads. A micropositioner was used to align a standard single mode fibre v-groove array with the edge couplers, which are spaced at a 127 μm pitch. The assembly procedures of photonic wafer bumping and bonding, electronic chip plating and wire bonding are conducted at Micross AIT, CVI and Cornell University, respectively.

Capacitance models and measurements

We identify several sources of capacitance that affect energy efficiency: (1) chip pads, (2) bump parasitics, (3) the microdisk junction, (4) the photodiode junction and (5) electronic driver capacitance. Extended Data Fig. 1a depicts these capacitance sources. These capacitances are determined through electrostatic simulations, circuit model simulations and empirical measurements. Focusing initially on the bumps and photonic chip pads, an electrostatic solver (Ansys Maxwell) is used to simulate the photonic chip pad-to-substrate capacitance (4 fF) and bump-to-bump parasitic capacitance (<1 fF). Similarly, we use a circuit model simulator (Cadence Virtuoso) to determine the electronic chip pad capacitance, which we find to be 6 fF. The simulator is also used to calculate the effective capacitance of the driver. We simulate the transmitter driving a capacitor (Cload), subtract \(\frac{1}{4}{{{C}}}_{{\rm{load}}}{{{V}}}^{2}\) from the total simulated energy consumption of the transmitter, and treat this adjusted driver-only power as \(\frac{1}{4}{{{C}}}_{{\rm{driver}}}{{{V}}}^{2}\). From this simulation, the effective capacitance of the driver is calculated to be 61 fF.

Experimental methods are used determine the capacitances of the microdisk and photodiode junctions, along with a validation of the simulated pad capacitance. A vector-network analyser (Keysight P5007A) is used to record an electrical RF reflection from probed devices. We fit the magnitude and phase of the reflected wave across RF frequencies to a reflection from a lumped complex impedance. Extended Data Fig. 1b shows the imaginary impedances of the devices and their associated fitted capacitor impedances. During these measurements, an RF bias tee provides a d.c. reverse-bias voltage to the device. An electronic calibration module (Keysight N4693D) is used to calibrate out the vector-network analyser response of the RF cable up to the output of the bias tee. The bias tee connects to a 25-μm pitch RF probe (FormFactor InfinityXT), and this probe response is calibrated out of the results by de-embedding its unlanded response as an electrical open. After this calibration, measurements of the photonic pad and bumped photonic pad show capacitances of 3 and 4 fF, respectively. Subsequent measurements are used to first de-embed these pad responses and then measure the photodiode and microdisk capacitances. The measured photodiode capacitance is 17 fF.

The measured microdisk capacitance decreases with an increasing reverse bias, as anticipated from a p–n junction. Extended Data Fig. 1c shows the measured capacitance of the four microdisks across reverse biases. The energy spent per bit transition is the integral of this capacitance function weighted by the difference of supply voltage and output voltage24. A midpoint Riemann sum of the microdisk capacitances, weighted by reverse-bias voltage subtracted from 1 V, between 0 V and 1 V bias voltage, yields an effective disk junction capacitance for dynamic energy consumption of 128 fF (averaged across the four microdisks).

A summation of the pad, driver and junction capacitances results in a 199 fF transmitter capacitance. This result is in excellent agreement with the capacitance directly measured from the energy consumption of the transmitter (198 fF), validating the capacitance models and measurements.

Transmitter characterization

Each transmitter result is experimentally measured from the 3D-integrated photonic-electronic chip. An exception is the d.c. electro-optic response measurement, for which a separate photonic chip with the same modulator design is used. We apply a voltage to a probe to reverse bias the modulator at varying d.c. voltages and use an optical spectrum analyser (Keysight 8164B) to record each response. The remainder of the transmitter, receiver and link characterizations use an optical switch (Polatis 1000n 24 × 10). This switch optically connects equipment and devices under test and minimizes fibre mating cycles. This approach streamlines the measurement process and eliminates potential power discrepancies that might stem from fibre mating inconsistencies. An optical spectrum analyser (Aragon BOSA 400) is used to measure the transmitter bus spectrum. For dynamic data transmission, a microcontroller is used to write a different PRBS6 pattern into the 64-bit registers in each of the 80 transmitter cells. Next, all modulators transmit this data simultaneously as the electronic chip is clocked, and all data registers are driven out of the chip to their respective microdisks. The 64-bit pattern transmitted by each modulator repeats indefinitely as the chip clock is running. In this state, we record the eye diagrams of each modulator, dynamic characteristics of the modulators and transmitter energy consumption from the electronic driver array voltage rail. The asymmetry between upper and lower regions of the eye diagrams may be due to uneven sizing of the P-channel metal-oxide-semiconductor (PMOS) and N-channel metal-oxide-semiconductor (NMOS) transistors in the transmitter driver. A narrow linewidth tunable laser (Santec TSL-210) is used as the light source in these measurements. Laser light travels through a fibre polarization controller and then into the chip. An oscilloscope (Tektronix DSA8300 with an 80C01 Optical Sampling Module) is used to receive modulated light for dynamic characterization and eye diagrams. In the BER test, modulated light initially passes through a variable optical attenuator (VOA) before reaching a commercial receiver (Thorlabs RXM40AF). The commercial receiver converts the optical signals into electrical signals that are read by a BER tester (Anritsu MU195040A). We sweep the received optical power with the VOA and record errors from the BER tester to construct the transmitter BER curves.

Receiver characterization

An ideal modulation source and an on-chip bit-error checker circuit are used to characterize the receiver cell performance. Separately, a tunable laser (Keysight 8164B) and a d.c. electrical probe landed on photodiode pads are used to measure the photodiode responsivity; the probe applies a reverse-bias voltage and senses photocurrent from a known input laser power. Next, for the dynamic characterization, we use an ideal modulation source (Thorlabs MX35E) consisting of an internal laser and a lithium niobate Mach–Zehnder modulator. A pulsed pattern generator (PPG, Anritsu MU195020A) is used to drive the modulator with a repeating 64-bit PRBS6 pattern. The signal travels through fibre and a polarization controller before coupling into the photonic chip. Voltage is applied to a doped-silicon resistor adjacent to the ring filter to generate heat and tune the ring-filter resonance to the desired wavelength channel. The ring resonator drops the signal to a photodiode, which converts it from light into photocurrent for the electronic chip to then amplify. Timing circuits continuously write the received bits into a 64-bit long memory in a cycle. For timing, a programmable timing offset circuit in the electronic chip and a timing offset of the PPG align the incoming data to the receiver sampling point. A split clock source synchronizes the receiver chip and PPG clock frequencies. As the final step, readouts from the serial programming port display the saved received bits and confirm data reception. However, the serial port cannot update fast enough to give a BER in a short time frame. Instead, an on-chip error-counter circuit in each cell compares the received memory with preprogrammed expected bits, and if there is a discrepancy, it adds an error to an ongoing count. The readouts from the serial port display this count, and we obtain a BER curve as we sweep signal power using a VOA inside the ideal modulation source.

Link demonstration

The link demonstration combines the previously described experimental setups of the transmitter and receiver. A microcontroller sets the sent bits in a transmitter chip, and a second microcontroller reads the received bits in a separate receiver chip. Four channels of data transmit through the link at 8 Gb s−1 per channel and serial port readouts from the receiver record these data, along with an ongoing error count for each channel. A shared clock signal synchronizes the two transceivers and a programmable delay block in each receiver cell is used to delay the receiver sampling point with respect to the transmitter clock. A distributed feedback laser array (Thorlabs PRO8) is used as the four-channel optical source for the link. An arrayed waveguide grating multiplexes these different wavelengths of light from the laser array onto a single fibre. We place polarization controllers before the transmitter chip and before the receiver chip to ensure optimal coupling into the fundamental transverse electric mode of each waveguide. The fibres in the link are standard SMF28 fibre. An optical switch is used to direct light from the transmitter to an erbium-doped fibre amplifier and an oscilloscope for each eye diagram; the switch then directs light back to the 3D-integrated receiver for BER measurements without amplification. The optical losses in this link amount to 14.5 dB. These losses are from several sources: three chip-to-fibre interfaces at 3 dB each account for 9 dB, the modulation IL is 2.5 dB, a modulation penalty accounts for 1.5 dB (this is the difference between the optical power of a ‘1’ bit and the average power) and an extra 1.5 dB of power is lost through the optical switch and fibre connectors throughout the link. Here, we use 8 Gb s−1per channel to accommodate the timing circuits after our receiver. However, in the transmitter section, we show error-free operation while transmitting 10 Gb s−1data using a commercial clock and data recovery circuit. A 1.5 V drive is used due to the limitations of the receiver input current digital-to-analogue converter (DAC). Transmitter data at a 1 V drive has a lower extinction ratio and a higher average input current to the TIA, which exceeds the DAC circuit upper limit.

Electronic circuit architectures

Transmitted data starts as bits in the memory of each transmitter cell. Timing circuits running on a 5-GHz input clock (half transmitted data rate) generate memory read addresses and two-to-one multiplexer select signals. The circuits in the data path operate at a voltage supply of 1 V, except for the driver, which operates between 1 and 1.5 V. Extended Data Fig. 2a shows the driver circuit. Inside of the design, high-speed 1 V transistors in a cascode configuration prevents transistor junction breakdown from supply voltages exceeding 1 V. The main driver branch (M5–M8) has wide transistors to reduce the switching delay on modulator capacitance (Cmicrodisk = 128 fF). A capacitor (Ccoupling = 183 fF) ensures a high switching speed while the auxiliary branch (M1–M4) holds the d.c. voltage level.

The receiver circuit senses a modulated photodiode current, amplifies it to digital levels at the supply rail voltage and deserializes the signal before writing it into internal memory. Extended Data Fig. 2b shows the amplifier circuit, which uses an inverter-based transimpedance amplifier (TIA) as an initial gain stage followed by an equalizer and inverters. A programmable current DAC at the amplifier input supplies a current (IDAC) that cancels the d.c. offset of the photodiode current. The TIA stage has a high feedback resistance for a high gain (Rf = 18.6 kΩ). This resistor equates to a lower input resistance (Rin = 2.1 kΩ) from the Miller effect; however, it combines with the input capacitance (Cphotodiode = 17 fF, Cpad = 10 fF) for a low-frequency pole. As a remedy, an active inductor circuit in the subsequent equalizer stage cancels out the TIA pole (Req = 3.1 kΩ, Ceq = 33.6 fF). After the equalizer, ensuing inverters ensure the output swings between 0 and 1 V. An isolated, 1-V power supply of the receiver amplifiers mitigates supply noise.

The TIA dominates the receiver amplifier energy consumption, and its energy per bit is the static biasing power divided by the data rate. However, the TIA design introduces trade-offs between noise, bandwidth and power. Equation (1) shows how the receiver signal-to-noise ratio, SNR, relates to receiver energy per bit (E/bitRX), input signal (I) input capacitance (C) and channel bandwidth (BW). Supplementary Note 3 provides a derivation of equation (1). This relationship sets a boundary on channel data rate scaling. With constant SNR and C, the design can expand BW with an increase in the input signal. In this context, the energy per bit remains constant, with the growing BW balancing out added laser power. This could imply an indefinite data rate scaling. However, a rise in BW necessitates wider TIA transistors, which subsequently contribute to the input capacitance, C. This sequence results in a degradation of SNR at the receiver for high BWs, establishing a cap on the energy-efficient per-channel data rate. To achieve higher data rates without compromising energy, the focus should be on parallel data communication across multiple channels. Similar conclusions have been made in other studies, which advocate for parallel channels operating at moderate data rates3,21.

$${\mathrm{SNR}} \approx{\left(\frac{I}{\mathrm{BW}}\right)}^{2}\frac{E/{\mathrm{bi{t}}}_{\mathrm{RX}}}{{C}^{\,2}}.$$
(1)