Optoelectronic array of photodiodes integrated with RRAMs for energy-efficient in-sensor computing

Pan, Wen; Wang, Lai; Tang, Jianshi; Huang, Heyi; Hao, Zhibiao; Sun, Changzheng; Xiong, Bing; Wang, Jian; Han, Yanjun; Li, Hongtao; Gan, Lin; Luo, Yi

doi:10.1038/s41377-025-01743-y

Download PDF

Article
Open access
Published: 15 January 2025

Optoelectronic array of photodiodes integrated with RRAMs for energy-efficient in-sensor computing

Wen Pan¹,
Lai Wang ORCID: orcid.org/0000-0001-7262-0020^1,2,
Jianshi Tang ORCID: orcid.org/0000-0001-8369-0067^2,3,
Heyi Huang³,
Zhibiao Hao^1,2,
Changzheng Sun^1,2,
Bing Xiong ORCID: orcid.org/0000-0002-1015-4578^1,2,
Jian Wang^1,2,
Yanjun Han^1,2,
Hongtao Li^1,2,
Lin Gan^1,2 &
…
Yi Luo^1,2

Light: Science & Applications volume 14, Article number: 48 (2025) Cite this article

7329 Accesses
9 Citations
18 Altmetric
Metrics details

Subjects

Abstract

The rapid development of internet of things (IoT) urgently needs edge miniaturized computing devices with high efficiency and low-power consumption. In-sensor computing has emerged as a promising technology to enable in-situ data processing within the sensor array. Here, we report an optoelectronic array for in-sensor computing by integrating photodiodes (PDs) with resistive random-access memories (RRAMs). The PD-RRAM unit cell exhibits reconfigurable optoelectronic output and photo-responsivity by programming RRAMs into different resistance states. Furthermore, a 3 × 3 PD-RRAM array is fabricated to demonstrate optical image recognition, achieving a universal architecture with ultralow latency and low power consumption. This study highlights the great potential of the PD-RRAM optoelectronic array as an energy-efficient in-sensor computing primitive for future IoT applications.

Non-volatile optoelectronic memory based on a photosensitive dielectric

Article Open access 05 September 2023

All-inorganic perovskite quantum dot light-emitting memories

Article Open access 22 July 2021

Electrical programmable multilevel nonvolatile photonic random-access memory

Article Open access 01 August 2023

Introduction

With the widespread use of artificial intelligence (AI) and 5 G network¹, the Internet of Things (IoT) is developing rapidly. The growing data generated by the IoT has brought a burden to the data center (even though breakthroughs in in-memory computing for electrical and optical domains^2,3,4,5), which requires new technologies for terminal devices. Edge computing devices⁶ can be applied to processing visual, auditory, tactile, and other signals to provide efficient processing and reduce data transmission⁷. Emerging “sensing with computing” on-chip architectures⁸, including near- or in-sensor computing⁹, make a difference for terminal devices that operate on the IoT. Compared to traditional off-chip processing in Von Neumann architecture, image processing in front-end sensors reduces communication and computation burdens, thereby improving time and energy efficiency.

So far, a variety of architectures have been developed to implement image processing functions¹⁰ (including contrast enhancement, noise suppression, recognition, etc.). The main forms and characteristics of existing in-sensor computing are summarized in Table 1. Some studies have implemented near- or in-sensor computing through circuit designs⁷, where processing elements (PEs) and circuits are made near or within the pixel array, respectively^{11,12,13,14,15}. Also, Sony¹⁶ proposes a 3D integrated vision chip to vertically integrate the functional layers (sensor, memory, computing, communication, etc.) in space, with a processing speed reaching 1000 frames per second (fps). Apart from circuit designs, technologies involving novel material systems and advanced devices have recently been adopted. The memristor array enables parallel in-memory computing on the output from the photoelectric sensor array^{17,18,19,20,21,22}, thus simplifying PE circuits. Besides, some photonic synaptic devices made of detect-and-memorize (DAM) materials have been proposed for in-sensor computing, exhibiting multi-level conductance states and long-term plasticity (LTP) under various light conditions^{6,23,24,25,26,27,28,29,30,31,32,33,34}. In addition, optoelectronic sensors that integrate sensing, storage, and processing functions have attracted much attention due to acting as an artificial neural network (ANN) for IoT applications^{1,10,35,36,37}. In short, to better deliver the potential of the IoT, multifunctional integrated devices and low-power consumption systems are desired¹.

Table 1 Overview of the main forms and characteristics of existing in-sensor computing

Full size table

Here, we demonstrate an architecture that integrates photodiodes (PDs) with resistive random-access memories (RRAMs) to implement in-sensor computing for image recognition. Hundreds of silicon PDs integrated with RRAMs are fabricated on a chip, and the PD-RRAM unit cell exhibits multi-level photovoltaic responses as controlled by RRAMs that have non-volatile and multi-resistance state characteristics. These characteristics make the PD-RRAM cell to be a highly reliable and self-powered unit with adjustable photo-responsivity. Multiple individual cells are then wired into the PD-RRAM array, whose capability to perform multiply-accumulate computation (MAC) between optical images and weights is experimentally verified. The PD-RRAM array is further used to implement real-time letter recognition by physical networks with high accuracy. In summary, the architecture is presented to achieve pattern recognition and image pre-processing, which reduces the amount of image data from sources and improves the efficiency of the optoelectronic signal conversion. This type of architecture provides a real-time machine vision approach for the IoT, featuring high integration, ultralow latency, and low power consumption.

Results

PDs integrated with RRAMs

The conventional photovoltaic device (such as p-i-n, n-p-n, etc.) exhibits a constant photo-responsivity (R_e), where the photocurrent (I_ph) grows with the power of incident light P (i.e., I_ph = R_e × P). To implement tunable optoelectronic outputs, we design a unit consisting of a PD used for detecting light, a fixed resistance, and an RRAM used for weight storage. The designed PD-RRAM cell has a three-terminal structure, as schematically illustrated in Fig. 1a. The R_e can be electrically modulated by setting RRAM resistance values. The PD-RRAM cell has three working modes for its application, namely detecting mode, writing mode, and reading mode. For the detecting mode, terminals of the R₀ and PD are under control with the RRAM terminal floated, and the image information is output by the I_ph of the PD; for the writing mode, terminals of the R₀ and RRAM are under control with the PD terminal floated, and the RRAM is set into the target; for the reading mode, the current of the RRAM terminal (I_out) is output under illumination with three terminals grounded.

**Fig. 1: Photodiodes (PD) integrated with RRAMs.**

The schematic of the cross-sectional view is illustrated in Fig. 1b. The PD has a p-i-n structure, and a layer of HfO₂ film with the metal-insulator-metal (MIM) structure is fabricated on the top of the anode to form RRAMs. Holes for RRAMs are created by etching through a patterned SiO₂ isolation layer and terminating at the surface of the pad. The high resistivity material (such as NiCr, TiN, etc.) is used as R₀, sputtered on the oxide. Detailed fabrication steps are described in Methods and Fig. S1. The HfO₂-based RRAM is chosen because of its light insensitivity, low cost, easy preparation, stable performance, scaling down, and non-volatile properties. This type of design could be universally used for PDs of various materials (such as Si, GaAs, InP, GaSb, etc.). In this work, it’s used to fabricate PD-RRAM cells built on a silicon wafer shown in Fig. 1c. The cells are wired into a 3 × 3-pixel array where each pixel consists of a Si p-i-n detector with a diameter of 120 μm, an RRAM with an area of 200 μm² and a resistance made of NiCr alloy. A circular region is directly exposed to light from above, as a window of the PD. Contact electrodes are arranged on the front and back respectively.

According to the principles of operations, the RRAM resistance (R) changes the magnitude of the I_out for a given power of the incident light, modifying the terminal current ratio of the RRAM to PD under the short-circuit condition (i.e., weight). The R_e (that is, a ratio of the I_out to the incident light power in the reading mode) is proportional to the weights (w) set by RRAM resistance,

$$w=\frac{{R}_{0}}{R+{R}_{0}}$$

(1)

Taking R₀ as 1 kΩ, when R is changed from 1 kΩ to 100 kΩ, the weight ranges from 0.0099 to 0.5. The number of discrete weights increases with the number of RRAM resistance states. The resistance-tunability of w (theoretically) and R_e (experimentally) are shown in Fig. 1d. This dependence is measured with a fixed white LED power of 10 mW/cm², whereas R varies from 1 kΩ to 30 kΩ. The measured R_e (Fig. 1d) exhibits the expected modulation and it shows the stability of programming.

Programmable optoelectronic output

The dependence of the optoelectronic output (I_out) on the RRAM resistance (R) as well as light power (P) is further investigated. Figure 2a shows the detecting mode of the PD-RRAM cell with sweeping voltages under light/dark conditions and Fig. S2b shows the measured spectral response for different wavelength (shown in Fig. S2a). The p-i-n diodes possess a self-powered characteristic under the condition of zero bias, which is almost unaffected by a small load circuit (such as 1 kΩ). In other words, the R_e of PDs does not change with a small load circuit, at zero bias. In this architecture, the PD could be regarded as a current source that changes with light due to its characteristics. The RRAM plays the role of weight storage, with non-volatile data retention and programmable multi-resistance states. Figure 2b shows 20 consecutive direct current (D.C.) write/erase behaviors on switching characteristics of an individual HfO₂-based RRAM with a compliance current (I_CC) of 100 μA in the SET process and a stop voltage of −2.5 V in the RESET process, and a switching window of ~100 can be obtained with a low resistance state (LRS) of ~1 kΩ and a high resistance state (HRS) of ~100 kΩ. With the typical SET or RESET pulse train applied to the RRAM, the resistance switches from 1 kΩ to 100 kΩ with at least 10 resistance states (as shown in Fig. 2c, Fig. S5a, b). Some works^{38,39,40,41,42,43,44,45,46,47,48} focus on improving the analog switching behaviors of RRAMs, which is considered to be achievable in HfO₂-based RRAMs. In 2023, Rao et al.³⁸ report HfO_x-based memristors fabricated on chip achieve 2048 conductance levels, which reaches the 11-bit floating-point precision. Here we demonstrate the feasibility of this architecture, instead of optimizing the continuous tunability of RRAMs.

**Fig. 2: Reconfigurable optoelectronic output of the PD-RRAM cell.**

A system is setup to test the programmable output (I_out) of the PD-RRAM cell under white LED illumination in the reading mode, as illustrated in Figs. 2d and S6. The I_ph generated by the PD is shunted into two dividing currents (i.e., the I_out and I_o) whose magnitudes are proportional to the I_ph. The measured output for varying RRAM resistance states and light intensities is shown in Fig. 2e. As the P of LED light increases from 0 to 10 mW/cm², the I_out increases linearly with the light power that the window of the PD receives. As R ranges from 1 kΩ to 30 kΩ, the I_out increases with the decrease of R, which results from the shunt ratio of the circuit. It is according to the theoretical principles, as expected. In order to understand the dependence more intuitively, the surface fitting of the short-circuit output (I_out) in three dimensions is shown in Fig. 2f. According to the side view from left, the I_out is inversely proportional to R at the same light intensity, which means the results of Fig. 1d are consistent across light intensities.

The linear dependence of the I_out on P for any given R programmed is confirmed, and the R_e changes with R as expected. It verifies the programmable optoelectronic output characteristic of the PD-RRAM cell, which is useful for analog multiplication between P and a given R_e. The integrated PD-RRAM cell, as demonstrated above, has prospects of constructing the PD-RRAM array with in-sensor computing capability.

Transient optoelectronic response

The characteristics of the high response speed and low noise are required for the PD-RRAM cell to ensure its real-time image recognition. The test system is setup (Figs. 2d and S6) to measure the transient optoelectronic response. Figure 3a demonstrates the short-circuit output current (I_out) time sequence with the P of LED light stepped up from 0 to 5 mW/cm². The PD-RRAM cell has a fast response and responds linearly to light intensity, with a low dark current (~10⁻⁹ A) and little noise (~10⁻¹¹ A). The high signal-to-noise ratio (SNR) is crucial for the physical analog calculation of the photocurrent signals, improving the accuracy of image recognition. The fast response and low noise show the potential for real-time processing of optoelectronic image signals.

Besides, the high-speed response measurement system is setup to get the upper limit of the PD-RRAM cell, as schematically illustrated in Figs. 3b and S7a. A blue mini-LED is adopted as the light source, with the 3 dB bandwidth of ~15 MHz. The mini-LED emits optical switching signals to the PD-RRAM cell, with frequency of 1 MHz. The I_out of the PD-RRAM cell is amplified into the voltage signal (V_out) measured by the oscilloscope (Figs. 3c and S7c), which exhibits a voltage pulse sequence of 1 MHz. The rise and fall times of a pulse, measured between 10% and 90% of the full pulse amplitude, are both 30 ns under the short-circuit condition, longer than those of an individual PD by 5 ns (more details in Fig. S7b, c). The response speed of PDs degrades slightly with a load circuit (RRAM and R₀), at zero bias. As the RRAM has a read time of less than nanosecond speed⁴⁹, the response speed of the PD-RRAM cell is mainly limited by the PD response time and RC time constants. The RRAM mainly has an effect on the load R_L of PD,

$${R}_{L}=\frac{R\times {R}_{0}}{R+{R}_{0}}$$

(2)

As we know, the smaller the R_L, the faster the response of PDs. This design maintains the numerical range of total load R_L < R₀ ~ 1 kΩ, and thus keeps PDs working at a high response speed.

In-sensor computing as a classifier

The PD-RRAM array for in-sensor computing is demonstrated, performing analog MAC operations between optical images and weights. And this type of architecture can be used to implement real-time image processing, such as pattern recognition by a single-layer perceptron. As illustrated in Fig. 4a, the PD-RRAM array consists of N pixels (i.e., N cells), with pixels connected in parallel for analog computing (i.e., inference). An efficient MAC operation between optical images and weights are performed in this array under the short-circuit condition, just like mathematical matrix calculations in neural networks. In this array, the R_e and w can be converted accordingly as shown in Fig. 1d. The multiplication of the R_e and P occurs at each pixel, and N output currents generated by pixels are summed together according to Kirchhoff’s law. The total output current (I_out) is expressed as

$${I}_{out}=\mathop{\sum }\limits_{i=1}^{n}{I}_{out\_i}=\mathop{\sum }\limits_{i=1}^{n}{R}_{e\_i}\times {P}_{i}$$

(3)

class="removeElement">where R_{e_i} and P_i are the photo-responsivity and input light power at the i-th pixel, respectively. The I_out is converted to an output voltage (V_out) via a trans-impedance amplifier (TIA), and then the V_out is fed to a sigmoid activation function to generate a nonlinear output by software or circuits. When the V_out exceeds the threshold voltage (V_th), it is judged to be recognized as 1; Otherwise, it is recognized as 0.

**Fig. 4: A 3 × 3 PD-RRAM array for in-sensor computing as a classifier.**

To experimentally verify the image recognition function, a 3 × 3 PD-RRAM array (shown in Fig. 1c) is used for recognizing by real-time processing of optoelectronic signals. In the training process, a set of images of letters with 3 × 3 pixels are used in training/inference (Fig. 4b), and the target letter (i.e., ‘X’) is classified into class 1 while others are class 0. The training of this architecture is first carried out as a classifier, adopting a single-layer perceptron (fully-connected network with 9 inputs, 1 output, 1 layer) with fully positive weights. Figure 4c shows the accuracy of the classifier during training under different noise levels (σ represents the variance of image noise), and the accuracy eventually tends to 95%~100% in the back propagation (BP). It verifies the feasibility of this architecture in theory, with certain anti-noise performance. Thus, the corresponding weight matrix is obtained by training, and the RRAM resistance states can be converted accordingly with the weight matrix (as shown in Fig. 1d). Then, the weight matrix is transferred to the PD-RRAM array where RRAMs are programmed into theoretical resistance states. Here, the theoretical weights, theoretical RRAM resistance states and actual RRAM resistance states after programming are shown in Fig. 4d, and the deviations between the actually programmed weights and the theoretical ones are small.

After programming, the PD-RRAM array could directly conduct the inference once an optical image is input. To test it, the optical setup system for image recognition is shown in Figs. 4e and S8a. More specifically, LED light is spatially modulated by a digital micromirror device (DMD) and the resulting images are projected onto the 3 × 3 PD-RRAM array chip. At this point, PDs in the array receive corresponding light intensities (P_i) and generate photocurrents (I_ph), which are collected after being diverted by RRAMs as the I_out according to (3). Thus, the PD-RRAM array would conduct MAC operations, and output results of recognition by the I_out. Figure 4f shows the test results of the measured output (I_out) under the projections of different letters (a duration of 0.5 s), and it leads to the distinct I_out during the presentations of different input images. It is observed that the I_out exceeds the threshold (here is 70 nA) when an image belonging to the target letter (i.e., ‘X’) is presented (Fig. S8b), while it is always lower than the threshold when the images of other classes are presented. Moreover, the measured output currents (I_out) agree well with the theoretically calculated ones. The test results exhibit great performance for image recognition, with a fast response and high SNR (though there still exists the background dark current). The weights exhibit only slight changes after recognition, demonstrating the reliability as a classifier. Apart from that, a 10 × 10 array of the same architecture replaced with resistors are tested for high-speed recognition of digits (Fig. S8c). Projection of different images possesses a duration of 100 μs, and the recognition time is less than 2 μs (Fig. S8d). This is an optoelectronic calculation performed during the conversion of optical signals to electrical signals.

To analyze the potential of complex image recognition, the test of fingerprints with 256 × 256 pixels is carried out here (Fig. S4). The classifier has 65,536 pixels and is used to distinguish the target fingerprint, trained on fingerprint images database. The accuracy of training and test eventually tends to 95% ~100% and 85% ~90%, respectively. This provides a theoretical basis for the hardware implementation of complex recognition. To implement the 256 × 256-pixel array, it is necessary to consider the large-scale integrated scheme where the transistors should be adopted instead of R₀ to prevent the crosstalk and miniaturize the unit cell (more details in Fig. S11). As well, the CMOS technology can be used in the fabrication for CMOS compatible PDs (such as Si PD, thin film detectors, etc.), while chip bonding can be used to integrate the detector chip with the RRAM array for CMOS incompatible PDs (such as GaSb-based infrared detection).

It is confirmed that the PD-RRAM array could implement MAC operations and in-sensor computing for image recognition. And the accuracy of recognition increases with the number of pixels, that is, scaling the PD-RRAM array up would improve the performance of this architecture. Besides complex image recognition, this design can also be used as filter kernels for convolution¹⁰. Moreover, there is no electrical crosstalk between pixels here in this array, that is, the output current generated by a pixel would not flow through a neighboring pixel. It can be attributed to its three-terminal design and working under the short-circuit condition.

Robustness analysis for image recognition

After verifying the feasibility of this architecture, the robustness of the PD-RRAM array for image recognition is analyzed. As mentioned before, the cell has a dark current of ~10⁻⁹ A and a noise of ~10⁻¹¹ A, which may lower the ratio of the effective output current to the dark one. Figure 5a shows that the dark current of the I_out decreases with the increase of the RRAM resistance, which indicates that it’s mainly determined by RRAMs rather than PDs. And therefore, it is the primary goal to reduce the dark current of RRAMs whose materials and structures need to be optimized. Figure 5b shows an optimized design of RRAMs (Ta/Ti/HfO₂/SiO₂/Au), adding a layer of SiO₂ film that is fabricated by plasma-enhanced chemical vapor deposition (PECVD) and etched by reactive ion etching (RIE). This switching layer, composed of different materials, effectively reduces the dark current of RRAMs to ~10⁻¹⁰ A (@HRS) and ~10⁻⁹A (@LRS). For this architecture, reducing the dark current and switching voltage has a crucial impact on the overall system performance. In 2017, Yao et al.⁵⁰ reports a structure that balance these parameters more effectively, which suppresses the dark current while moderately reducing the switching voltage.

**Fig. 5: Robustness of the PD-RRAM array for image recognition.**

In addition, using a pair of RRAMs to represent signed weights can also enhance the robustness of systems, and the PD-RRAM array after revision is schematically illustrated in Fig. 5c. A single pixel has a pair of RRAMs and realizes signed weights with a differential pair. The MAC operations of positive weights (R⁺) and negative weights (R⁻) are performed at t₀ and t₀ + Δt (Δt < 1 μs) respectively, and the results are subtracted from each other in the peripheral circuits. The final result (I) is expressed as

$$I={I}_{+}-{I}_{-}$$

(4)

where the I⁺ and I⁻ are measured at t₀ and t₀ + Δt, respectively. The signed weights could improve the robustness and stability of this system to a certain extent, reducing the interference of noise and other factors.

Robustness of the PD-RRAM array for the image noise is first analyzed. The fabricated PDs possess a photocurrent standard deviation of less than 5% (Fig. S3a), and thus a different noise level is added in the training/inference (MNIST database shown in Fig. S3b). The classifier has 784 pixels and is used to distinguish ‘5’. Accuracy of the classifier during training in the original design converges slower than that after revision (Fig. S3c, d). As well, the weight values also influence the accuracy of the classifier. Next, robustness of the PD-RRAM array for the weight errors and failed pixels rate is analyzed. According to characteristics of this architecture, the weight fluctuation increases with the decrease of the RRAM resistance (Fig. S9a, b), and different weight standard deviations are added in the inference. Figure 5d shows that the design after revision possesses a stronger anti-noise ability than the original one (Fig. S3e, f). The malfunction of the pixels would also affect the whole performance of the PD-RRAM array. The different failed pixels rates are added in the inference, and weights of some pixels are set to 0 (or 0.5) at random. It shows that failed pixels have a significant impact, as the output is jointly contributed by each pixel (Fig. S9c,d). However, the randomness of failed pixels would also bring different effects on the recognition results. Figure 5e indicates that the design after revision possesses better robustness than the original one. From results, it can be seen that when the rate is controlled within 10%, the accuracy still remains at a high level. See Fig. S9 for more information.

Moreover, this architecture can be designed to implement multi-class recognition and each pixel consists of a PD and 2 M RRAM, used for the M-class recognition (Fig. S10a). For the reading mode, terminals of the R₀, PD, and corresponding RRAM are under the short-circuit condition, with other terminals floated. A single-layer ANN is used for multi-class recognition in the array, with 20 outputs to distinguish ‘0’–‘9’ of MNIST database (Fig. S10b, c). The accuracy of training and test eventually tends to 95% and 70% (Fig. S10d, e), respectively. In this design, higher integration degree can be achieved than that with each pixel divided into M subpixels, due to the smaller size of RRAMs. With the number of RRAMs and pixels increasing, the recognition accuracy and system robustness of this array would be improved. Nevertheless, large scaling of the PD-RRAM array still remains a mainly technological task. The SNR degradation caused by the device miniaturization needs further investigation. The PD-RRAM array in a practical large-scale array chip may be affected by crosstalk, which remains to be studied.

Discussion

In summary, the PD-RRAM array is presented for in-sensor computing to implement pattern recognition and image pre-processing. The physical network could reduce data conversion and digital signal processing, improving the efficiency of photoelectric signal conversion⁵¹. This concept provides possibilities for the IoT applications with improved speed, energy efficiency, and reliability. The PD-RRAM unit cell, with characteristics of the reconfigurable optoelectronic output and fast response, provides a device technology foundation for this architecture. The PD-RRAM array is further demonstrated, performing MAC operations between optical images and weights, and can be used to implement real-time image processing functionalities (such as classification). Due to the capability of pixel-level parallel computing, self-powered characteristics in MAC operations, and combination with machine learning, this architecture could achieve high reliability, ultralow latency, and low power consumption for inference. The stable electrical connection between PDs and RRAMs has been proven to be feasible. This work demonstrates a universal design that could be adopted for PDs of various materials, opening up a new way for the hardware implementation of real-time machine vision in different bands. However, this type of physical signal computation for sensing paradigm also has many risks and shortcomings. The SNR of optoelectronic signals will greatly affect the calculation performance (for instance, the accuracy of recognition). The noise issue still needs improvements, though we have achieved great experimental results so far. The noise problem is hard to ignore with the scaling up of the PD-RRAM array. Future work would focus on the monolithic integration and large-scaling of the PD-RRAM array with interconnects and control electronics, which can be implemented with the CMOS technology. There are still many problems to be solved in this process, including the cross-bar structure of large arrays, the uniformity and miniaturization of devices, and the test of large arrays.

Materials and methods

Device fabrication

A layer of intrinsic Si is epitaxially grown on a n-Si substrate. Next, B ions are implanted on the surface of intrinsic Si, and a p-i-n junction is formed after thermal annealing. The device patterns are defined through photolithography. Then, the p-Si is etched to form the electrical isolation of pixels by RIE in SF₆ plasma for a certain time. A 200-nm-thick SiO₂ layer is then deposited on the surface by plasma-enhanced chemical vapor deposition (PECVD), and the oxide is windowed by RIE in CF₄ plasma. Contact electrodes (Al/Au = 20/100 nm) are formed by a lift-off process using magnetron sputtering, with thermal annealing at 350 °C and N₂ atmosphere to form ohmic contacts. Again, a 100-nm-thick SiO₂ layer is deposited on the surface by PECVD, and the oxide is etched by RIE to form holes. Then, the 100-nm-thick NiCr alloy is sputtered as R₀, formed by a lift-off process. Next, an 8-nm-thick HfO₂ layer is deposited using atomic layer deposition (ALD) at 250 °C, and is etched by inductively coupled plasma in Ar plasma. At last, top electrodes of RRAMs (Ti/Ta/Au = 5/50/100 nm) are formed by a lift-off process using magnetron sputtering.

There are still significant difficulties and challenges, including: (1) The processing technology of large-scale arrays; (2) The uniformity and production yield of devices within the array; (3) The noise and dark current in the array. The large-scaling of the array is closely related to the stability and uniformity of devices. The devices in this design mainly include PDs and RRAMs, with the former possessing a consistency of 1% ~5% and the latter having relatively poor uniformity. Due to the fact that the production yield of devices has not reached 100%, the higher the number of pixels in the array, the more likely it is to have bad pixels in the array. Accordingly, the uniformity and production yield are significant factors that cannot be ignored if the array is scaled up. In order to achieve more stable and uniform devices, the material and structure of the oxide layer are crucial. Here, the hole structure is adopted for RRAMs, and the HfO₂ made by ALD could provide a more uniform and denser layer, which also lowers the noise and dark current for this array.

Measurements

The I–V characteristics are measured with Agilent/HP 4155 C. The pulse measurements are implemented by Keysight 1500 A. In the photovoltaic measurement, a LED with tunable light intensities is used as the light source. Only the windows of PDs are considered as being subjected to the illumination and used for the calculation of light power. The LED light is spatially modulated by a DMD (ViaLUX).

Experimental setup

Schematics of the experimental setup are shown in Fig. 4e. The LED light is spatially modulated by a DMD (ViaLUX). The letters are displayed with a frame frequency of ~2 fps. The resulting image is projected onto the 3 × 3 PD-RRAM array chip using lens. The PD-RRAM array chip is programmed into theoretical weight matrixes by source meters (Keysight 1500 A and Agilent 4155 C). For time-resolved measurements, the output current is recorded with Agilent 4155 C.

Data availability

Source data are provided in this paper. The data that supports the other findings of this study are available from the corresponding authors upon reasonable request.

References

Zhou, F. C. et al. Low-voltage, optoelectronic CH₃NH₃PbI_3−xCl_x memory with integrated sensing and logic operations. Adv. Funct. Mater. 28, 1800080 (2018).
Article Google Scholar
Chen, Z. G. & Segev, M. Highlighting photonics: looking into the next decade. eLight 1, 2 (2021).
Article MATH Google Scholar
Zhou, Z. C. et al. Prospects and applications of on-chip lasers. eLight 3, 1 (2023).
Article MATH Google Scholar
Fan, Z. B. et al. Integral imaging near-eye 3D display using a nanoimprint metalens array. eLight 4, 3 (2024).
Article MATH Google Scholar
Zeng, K. B. et al. Synthesized complex-frequency excitation for ultrasensitive molecular sensing. eLight 4, 1 (2024).
Article MATH Google Scholar
Sun, L. F. et al. In-sensor reservoir computing for language learning via two-dimensional memristors. Sci. Adv. 7, eabg1455 (2021).
Article ADS Google Scholar
Zhou, F. C. & Chai, Y. Near-sensor and in-sensor computing. Nat. Electron. 3, 664–671 (2020).
Article MATH Google Scholar
Yang, X. H. et al. Breaking the energy-efficiency barriers for smart sensing applications with “sensing with computing” architectures. Sci. China Inf. Sci. 66, 200409 (2023).
Article Google Scholar
Pan, W. et al. A future perspective on in-sensor computing. Engineering 14, 19–21 (2022).
Article MATH Google Scholar
Cui, B. Y. et al. Ferroelectric photosensor network: an advanced hardware solution to real-time machine vision. Nat. Commun. 13, 1707 (2022).
Article ADS MATH Google Scholar
Wu, N. J. Neuromorphic vision chips. Sci. China Inf. Sci. 61, 060421 (2018).
Article Google Scholar
Komuro, T., Kagami, S. & Ishikawa, M. A dynamically reconfigurable SIMD processor for a vision chip. IEEE J. Solid-State Circuits 39, 265–268 (2004).
Article ADS MATH Google Scholar
Jendernalik, W. et al. An analog sub-miliwatt CMOS image sensor with pixel-level convolution processing. IEEE Trans. Circuits Syst. I: Regul. Pap. 60, 279–289 (2013).
Article MathSciNet MATH Google Scholar
Shi, C. et al. A 1000 fps vision chip based on a dynamically reconfigurable hybrid architecture comprising a PE array processor and self-organizing map neural network. IEEE J. Solid-State Circuits 49, 2067–2082 (2014).
Article ADS MATH Google Scholar
Yang, L. Q. et al. A 1.2 V, 3.1% 3σ-accuracy thermal sensor analog front-end circuit in 12 nm CMOS process. J. Semicond. 42, 032401 (2021).
Article ADS Google Scholar
Yamazaki, T. et al. 4.9 A 1ms high-speed vision chip with 3D-stacked 140GOPS column-parallel PEs for spatio-temporal image processing. In: Proceedings of the 2017 IEEE International Solid-State Circuits Conference. San Francisco, CA, USA: IEEE, 82–3 (2017).
Zhang, Z. F. et al. In-sensor reservoir computing system for latent fingerprint recognition with deep ultraviolet photo-synapses and memristor array. Nat. Commun. 13, 6590 (2022).
Article ADS MATH Google Scholar
Chu, M. et al. Neuromorphic hardware system for visual pattern recognition with memristor array and CMOS neuron. IEEE Trans. Ind. Electron. 62, 2410–2419 (2015).
Article ADS MATH Google Scholar
Li, C. et al. Analogue signal and image processing with large memristor crossbars. Nat. Electron. 1, 52–59 (2018).
Article ADS MATH Google Scholar
Wang, M. et al. Gesture recognition using a bioinspired learning architecture that integrates visual data with somatosensory data from stretchable sensors. Nat. Electron. 3, 563–570 (2020).
Article MATH Google Scholar
Wang, S. et al. Networking retinomorphic sensor with memristive crossbar for brain-inspired visual perception. Natl Sci. Rev. 8, nwaa172 (2021).
Article Google Scholar
Tong, L. et al. 2D materials–based homogeneous transistor-memory architecture for neuromorphic hardware. Science 373, 1353–1358 (2021).
Article ADS MATH Google Scholar
Zhang, J. Y. et al. Recent progress in photonic synapses for neuromorphic systems. Adv. Intell. Syst. 2, 1900136 (2020).
Article MATH Google Scholar
Dai, S. L. et al. Light-stimulated synaptic devices utilizing interfacial effect of organic field-effect transistors. ACS Appl. Mater. Interfaces 10, 21472–21480 (2018).
Article Google Scholar
Gao, S. et al. An oxide schottky junction artificial optoelectronic synapse. ACS Nano 13, 2634–2642 (2019).
Article MATH Google Scholar
Hu, D. C. et al. Memristive synapses with photoelectric plasticity realized in ZnO_1-x/AlO_y heterojunction. ACS Appl. Mater. Interfaces 10, 6463–6470 (2018).
Article ADS MATH Google Scholar
Kumar, M., Abbas, S. & Kim, J. All-oxide-based highly transparent photonic synapse for neuromorphic computing. ACS Appl. Mater. Interfaces 10, 34370–34376 (2018).
Article Google Scholar
Lee, M. et al. Brain-inspired photonic neuromorphic devices using photodynamic amorphous oxide semiconductors and their persistent photoconductivity. Adv. Mater. 29, 1700951 (2017).
Article Google Scholar
Wu, J. Y. et al. Broadband MoS₂ field-effect phototransistors: ultrasensitive visible-light photoresponse and negative infrared photoresponse. Adv. Mater. 30, 1705880 (2018).
Article Google Scholar
Fu, X. et al. Graphene/MoS_2-xO_x/graphene photomemristor with tunable non-volatile responsivities for neuromorphic vision processing. Light Sci. Appl. 12, 39 (2023).
Article ADS Google Scholar
Qu, T. Y. et al. A flexible carbon nanotube sen-memory device. Adv. Mater. 32, e1907288 (2020).
Article MATH Google Scholar
Dang, B. J. et al. Physically transient optic-neural synapse for secure in-sensor computing. IEEE Electron Device Lett. 41, 1641–1644 (2020).
Article ADS MATH Google Scholar
Liu, W. Z. et al. Self-powered and broadband opto-sensor with bionic visual adaptation function based on multilayer γ-InSe flakes. Light Sci. Appl. 12, 180 (2023).
Article ADS MATH Google Scholar
Wang, S. J. et al. An organic electrochemical transistor for multi-modal sensing, memory and processing. Nat. Electron. 6, 281–291 (2023).
Article MATH Google Scholar
Lee, S. et al. Programmable black phosphorus image sensor for broadband optoelectronic edge computing. Nat. Commun. 13, 1485 (2022).
Article ADS MATH Google Scholar
Jang, H. et al. In-sensor optoelectronic computing using electrostatically doped silicon. Nat. Electron. 5, 519–525 (2022).
Article MATH Google Scholar
Mennel, L. et al. Ultrafast machine vision with 2D material neural network image sensors. Nature 579, 62–66 (2020).
Article ADS Google Scholar
Rao, M. Y. et al. Thousands of conductance levels in memristors integrated on CMOS. Nature 615, 823–829, (2023).
Article ADS MATH Google Scholar
Zhang, W. B. et al. Analog-type resistive switching devices for neuromorphic computing. Phys. Status Solidi (RRL) – Rapid Res. Lett. 13, 1900204 (2019).
Article ADS Google Scholar
Bertaud, T. et al. In-operando and non-destructive analysis of the resistive switching in the Ti/HfO₂/TiN-based system by hard x-ray photoelectron spectroscopy. Appl. Phys. Lett. 101, 143501 (2012).
Article ADS Google Scholar
Wu, W. et al. Improving analog switching in HfO_x-based resistive memory with a thermal enhanced layer. IEEE Electron Device Lett. 38, 1019–1022 (2017).
Article ADS MATH Google Scholar
Roy, S. et al. Toward a reliable synaptic simulation using Al-doped HfO₂ RRAM. ACS Appl. Mater. Interfaces 12, 10648–10656 (2020).
Article MATH Google Scholar
Wang, C. X. et al. HfO_x/AlO_y superlattice-like memristive synapse. Adv. Sci. 9, e2201446 (2022).
Article Google Scholar
Woo, J. et al. Improved synaptic behavior under identical pulses using AlO_x/HfO₂ bilayer RRAM array for neuromorphic systems. IEEE Electron Device Lett. 37, 994–997 (2016).
Article ADS MATH Google Scholar
Woo, J. et al. Linking conductive filament properties and evolution to synaptic behavior of RRAM devices for neuromorphic applications. IEEE Electron Device Lett. 38, 1220–1223 (2017).
Article ADS MATH Google Scholar
Mahata, C. et al. Resistive switching and synaptic behaviors of an HfO₂/Al₂O₃ stack on ITO for neuromorphic systems. J. Alloy. Compd. 826, 154434 (2020).
Article MATH Google Scholar
Jiang, H. et al. Sub-10 nm Ta channel responsible for superior performance of a HfO₂ memristor. Sci. Rep. 6, 28525 (2016).
Article ADS Google Scholar
Wang, Q. et al. Low-cost dual-stage offset-cancelled sense amplifier with hybrid read reference generator for improved read performance of RRAM at advanced technology nodes. J. Semicond. 42, 082401 (2021).
Article ADS Google Scholar
Zahoor, F., Azni Zulkifli, T. Z. & Khanday, F. A. Resistive Random Access Memory (RRAM): an overview of materials, switching mechanism, performance, multilevel cell (mlc) storage, modeling, and applications. Nanoscale Res. Lett. 15, 90 (2020).
Article ADS Google Scholar
Yao, P. et al. Face classification using electronic synapses. Nat. Commun. 8, 15199 (2017).
Article ADS Google Scholar
Pan, W. et al. An encoded photodetector array for high-frame-rate compressed sensing and computational imaging. IEEE Sens. J. 1–1, https://doi.org/10.1109/JSEN.2024.3506937 (2024).

Download references

Acknowledgements

All the authors gratefully acknowledge the National Key Research and Development Program (2021YFA0716400), the National Natural Science Foundation of China (62225405, 62350002, 61991443, 62127814, 62235005, and 61927811), and the Collaborative Innovation Center of Solid-State Lighting and Energy-Saving Electronics.

Author information

Authors and Affiliations

Department of Electronic Engineering, Tsinghua University, Beijing, China
Wen Pan, Lai Wang, Zhibiao Hao, Changzheng Sun, Bing Xiong, Jian Wang, Yanjun Han, Hongtao Li, Lin Gan & Yi Luo
Beijing National Research Center for Information Science and Technology (BNRist), Tsinghua University, Beijing, 100084, China
Lai Wang, Jianshi Tang, Zhibiao Hao, Changzheng Sun, Bing Xiong, Jian Wang, Yanjun Han, Hongtao Li, Lin Gan & Yi Luo
School of Integrated Circuits, Tsinghua University, Beijing, 100084, China
Jianshi Tang & Heyi Huang

Authors

Wen Pan
View author publications
Search author on:PubMed Google Scholar
Lai Wang
View author publications
Search author on:PubMed Google Scholar
Jianshi Tang
View author publications
Search author on:PubMed Google Scholar
Heyi Huang
View author publications
Search author on:PubMed Google Scholar
Zhibiao Hao
View author publications
Search author on:PubMed Google Scholar
Changzheng Sun
View author publications
Search author on:PubMed Google Scholar
Bing Xiong
View author publications
Search author on:PubMed Google Scholar
Jian Wang
View author publications
Search author on:PubMed Google Scholar
Yanjun Han
View author publications
Search author on:PubMed Google Scholar
Hongtao Li
View author publications
Search author on:PubMed Google Scholar
Lin Gan
View author publications
Search author on:PubMed Google Scholar
Yi Luo
View author publications
Search author on:PubMed Google Scholar

Contributions

L.W., W.P., J.T., and Y.L. conceived the project. L.W., Z.B., L.G., and Y.L. supervised the study. W.P., C.S., B.X., J.W., Y.H., and T.L. fabricated the PD-RRAM array chip. W.P. and H.H. captured the experimental data for the PD-RRAM array chip and processed the data. All authors participated in the writing of the paper.

Corresponding author

Correspondence to Lai Wang.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Supplementary information

41377_2025_1743_MOESM1_ESM.docx

Supplementary information for Optoelectronic array of photodiodes integrated with RRAMs for energy-efficient 1 in-sensor computing

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pan, W., Wang, L., Tang, J. et al. Optoelectronic array of photodiodes integrated with RRAMs for energy-efficient in-sensor computing. Light Sci Appl 14, 48 (2025). https://doi.org/10.1038/s41377-025-01743-y

Download citation

Received: 12 April 2024
Revised: 13 December 2024
Accepted: 02 January 2025
Published: 15 January 2025
Version of record: 15 January 2025
DOI: https://doi.org/10.1038/s41377-025-01743-y