Abstract
The scaling law of deep learning, which governs the relationship between model size and performance, has led to critical concerns regarding efficiency and sustainability. To address these challenges, this study presents a computational approach using self-organized submillimeter-long tungsten disulfide nanotube cluster as a 3D very-large-scale physical reservoir. The reservoir, with its 0D van der Waals interfaces on the order of 108, or 1.0×1010 mm-3, matches the synaptic quantity and density of the fruit fly’s brain. The reservoir demonstrates the capability to perform a wide range of tasks from monomodal challenges to multimodal endeavors such as speech-to-image and medical image generation. The photosensitive mimetic synaptic connections in the very large scale reservoir emulate the optogenetic modulation of neuron circuits in in-vivo biological systems. By integrating the principles of the scaling law, multimodal task capabilities, and mimetic optogenetic mechanisms, this research paves a path toward advanced computing architectures tailored for next-generation energy-efficient artificial intelligence.
Similar content being viewed by others
Introduction
In the dynamically evolving realm of artificial intelligence, the scaling law of deep learning has entrenched itself as a foundational concept1,2,3,4. This scalability has fueled remarkable performance advancements across diverse domains, including computer vision, natural language processing, and multimodal learning paradigms5,6,7,8,9. Empirically, the model’s accuracy in tackling complex tasks exhibits a positive correlation with its architectural scale. However, larger models suffer from high computational costs10,11, which lead to low model efficiency and pose significant challenges to environmental sustainability. These concerns have prompted the exploration of alternative computing paradigms beyond the von Neumann framework12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33, with physical reservoir computing (PRC) emerging as a transformative paradigm. Unlike conventional digital systems rooted in sequential logic, PRC fundamentally redefines computation by embedding information processing within the analog dynamics of physical substrates, marking a paradigmatic shift from abstract instruction-based operations to physics-driven computation34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63. The essence of PRC lies in its ability to transform input signals through the inherent complex dynamics to produce computationally relevant outputs, leveraging the physical system’s intrinsic computational capabilities17,46,64,65,66.
On the other hand, optogenetics, an experimental method at the intersection of genetics and optics, has dramatically transformed the field of neuroscience67,68,69,70. Optogenetics enables researchers to control and monitor the activity of neurons using light. The optogenetics approach involves the cells that express light-sensitive proteins that can be activated or inhibited by light of specific wavelengths. This allows for the precise control of cellular signaling and has been employed to probe complex biological questions regarding a variety of behaviors and physiological processes, including movement, learning, memory, and sensory processing67,68,69,70. While traditional optogenetics depends on genetic modification to express the light sensitive proteins, the essence is fundamentally about light manipulation, and the scope of this field has expanded to include non-genetic methods71,72,73,74,75,76,77,78. Therefore, the optoelectrical response of the WS2 nanotubes and their interface allow light to manipulate projections inside the WS2 nanotube cluster, translating the principles of optogenetics into the PRC paradigm.
In this context, our research introduces a convergence of very-large-scale (VLS) computational physical reservoir with the feasible control afforded by light. The 3D reservoir, composed of 0D van der Waals interfaces on the order of 108 or 1010 mm-3, not only emulates the synaptic quantity and density of fruit flies’ brains but also integrates photosensitive mimetic optogenetic connections. A key strength of the VLS PRC system is its ability to handle a diverse range of tasks. It effectively supports both monomodal and multimodal endeavors including speech-to-image, colorectal tissue image generation, and blood cell image generation, while maintaining high energy efficiency with 7.0 μW per input. The physical reservoir is designed to respond to light stimuli, effectively translating the principles of optogenetics into the PRC paradigm. By integrating the scale of PRC, the capability for monomodal/multimodal tasks, the exceptional energy efficiency and the mimetic optogenetics, this interdisciplinary research provides a perspective on how optoelectronics, neuroscience, material science, and computational science merge to shape a future pathway for the next-generation neuromorphic computing hardware platform.
Results and discussion
Scaling law and principle of optogenetics
As shown in Fig. 1a, scaled artificial neural network models have set benchmarks in tasks of computer vision, natural language processing, robotics, autonomous driving, scientific discovery, etc. By scaling up the number of parameters, deep learning models advance across various domains and enable more intelligent systems79. The parameter volume of deep learning models has increased significantly in the past few decades, from the representative LeNet (60,000 parameters)80 to GPT-4 (around 1 trillion parameters)81. The increased number of parameters enables enhanced capability to process tasks of increasing complexity, and even the emergence of unpredictable intelligence82,83,84,85, as shown in Supplementary Fig. 1. A similar pattern of the scaling law is found in the neural systems of biological organisms. Figure 1b shows the number of neurons and synapses of Caenorhabditis elegans, fruit flies, mice, orangutans, and humans. Extensive networks of neurons and synapses support more complex functions, such as abstract reasoning, problem-solving, language-using, social interaction, self-awareness, and deep emotional experiences86,87,88,89. The research on biological neural systems has led to the development of optogenetics. As shown in Fig. 1c, by implanting optical sensitive ion channel proteins, the operation of the neural system can be manipulated with an incident light beam. The exposure to light selectively modulates the active or suppressed status of ion channels, thereby altering the membrane potential of neuronal cell membranes. This precise manipulation of neural cells has the capacity to shape and direct the behavior of living organisms, transforming the level of control researchers can exert over neural networks.
a Scaling law of deep learning models with expanding parameter volume. b The quantities of neurons and synapses in various species, from Caenorhabditis elegans to humans. c Schematic diagram of optogenetics. The right panel shows three light-sensitive proteins, with only the central one irradiated. The small red dots within it represent the ion flow process. d SEM images of WS2 nanotubes, showing the hierarchical aggregation of WS2 nanotubes. Scale bars: 100 μm (left) and 25 μm (right). e Micro X-ray CT images of WS2 nanotubes. Scale bars: 100 μm (left) and 100 μm (right). f The images of the WS2 cluster on a substrate with fabricated electrodes. Scale bars: 500 μm (center), 100 μm (right).
WS2 nanotube cluster as the scaled physical reservoir
Drawing inspiration from the scaled biological neural networks and optogenetics, we propose a computational paradigm using a self-organized, submillimeter-long, photoreactive tungsten disulfide (WS2) nanotube cluster as a dynamic physical reservoir. The bottom-up synthesis progress and material characterization of the WS2 nanotube cluster are described in Method and Supplementary Figs. 2–590. The length of the nanotube is measured in the transmission electron microscope (TEM) with a maximum length of 0.5 mm and an average diameter below 50 nm. As shown by the scanning electron microscope (SEM) image in Fig. 1d and Supplementary Fig. 2, the synthesized ultra-long nanotubes aggregate into bundles, each of which is approximately hundreds of micrometers in size. These bundles coalesce into a larger cluster, resulting in a bundle-cluster hierarchical structure. Such a bi-level structure is verified by the micro X-ray computed tomography (CT) with a resolution of 5 μm. Figure 1e shows the shape and the cross-section of the cluster by the micro X-ray CT, showing the sub-structure with interspace and filled bundle blocks. The porosity of the cluster Φ is 32%. The sub-structure aligns with the hierarchical characteristics of large deep learning models and the architecture of brain networks which are adept at capturing distinct levels of features91,92,93. Figure 1f displays the WS₂ nanotube cluster on a substrate with fabricated electrodes, while the complete setup is illustrated in Supplementary Fig. 6.
The self-organized WS2 nanotube cluster provides intricate pathways through the numerous van der Waals interfaces. Here we evaluate the quantity of the van der Waals contacts by the WS2 nanotubes. The WS2 bundles are modeled using the random packing of 1D rods that form the numerous 0D interfaces, developed by Philipse94,95 and refined in this study considering the hollow morphology and flexibility of the WS2 nanotubes. The model is elaborated in Supplementary Fig. 7 and Supplementary Note 1. The random packing volume fraction φ of 14% is derived using:
where m and V are the mass and volume of the WS2 nanotube cluster, ρ is the density of WS2 (7.5 g·cm-3), D and d are the outer and inner diameter of the hollow WS2 nanotube. According to the random contact model, we have
where L’ is the equivalent length of the nanotube with rigidness, D is the diameter of the nanotube, and <c> is the average contact per rod. Note that L’ is shorter than the actual length of the nanotube considering the long flexible nanotube as the combination of shorter rigid segments with an equivalent length of L’. For the thin segment, the average contact number is 5.494,95. From Eq. (1), the approximate value of L’ is obtained, which is 2.8 μm. We can further derive the contact density as 1.0×1010 mm-3 and the total contact quantity as 1.4×108 for the WS2 cluster in Fig. 1e with a volume of 1.4×10-2 mm3 extracted from the micro X-ray CT. Comparably, the brain of a fruit fly has a synapse density of 0.7×1010 mm-3, a volume of 1.4×10-2 mm3, and a total synapse number of ~108. The abundant connectivity in the WS2 nanotube cluster resembles the fruit fly’s synapses, providing the rich complexity needed for the scaled PRC. The electrodes and peripheral circuits are deliberately positioned outside the WS₂ nanotube clusters to preserve the intrinsic density of the WS2 nanotubes’ cluster without disruption. This design ensures that the high-density van der Waals interfaces remain undisturbed while enabling electrical access to the reservoir. The electrodes, connecting gaps, and peripheral circuits in the system are constructed on the web surface and do not interfere with the number or functionality of the mimetic synapses. Importantly, the computational capability of our PRC system relies on the collective behavior of these interfaces rather than precise control of individual connections. This approach mirrors the statistical nature of biological neural networks, where global network properties dominate over single-synaptic precision. The inherent randomness and complexity within the cluster structure are the key features that enable it to mimic biological synaptic networks. This bioinspired design allows the system to maintain structural integrity while exhibiting robust computational performance.
Before utilizing the ultralong WS2 nanotube cluster with massive van der Waals 0D interfaces for PRC, we investigate the electrical and photoelectrical characteristics of the single 1D WS2 nanotube and single 0D van der Waals interfacial connection, respectively. In essence, the electrical path within the cluster, despite its complexity, is composed of the nanotubes themselves and the van der Waals contacts. The four-terminal devices of single and crossbred WS2 nanotubes were fabricated, as shown in Fig. 2a and b. The schematic and equivalent circuit diagrams of electrical measurement are shown in Fig. 2c. The conductivity of a single WS2 nanotube averages at 192 S·m-1, as shown in Fig. 2d90. The I-V curve presents high nonlinearity of the 0D van der Waals interface, as shown in Fig. 2e and f. As temperature decreases, the current exhibits a concomitant reduction, a phenomenon indicative of an energy barrier that charge carriers must surmount to traverse the 0D van der Waals interface as shown in Fig. 2g. An equivalent barrier height on the order of several hundred meV is derived from an Arrhenius analysis of how the current varies with temperature, as detailed in Supplementary Fig. 8. Therefore, the 0D van der Waals barriers provide the nonlinearity in the electrical dynamics of the self-aligned WS2 nanotube cluster, which is essential for the nonlinear projection in reservoir computing.
SEM images of (a) 1D WS2 nanotube (scale bar: 5 μm) and (b) 0D van der Waals interface (scale bar: 3 μm) devices. c, Schematic of the four-probe measurement and equivalent circuits. d I–V curves of the 1D WS2 nanotube device. Inset: schematic of the four-probe measurement. The results are identical to Ref. 90. e I–V curves of the 0D van der Waals interface device. Inset: schematic of the four-probe measurement. f The nonlinearity factor as a function of voltage. Inset: schematic of the four-probe measurement. g I-V curves of a single van der Waals interface device, with temperatures ranging from 150 K to 400 K in increments of 50 K. Inset: schematic of the four-probe measurement. h I-V curves of a single WS2 nanotube device, with laser powers of 0 mW, 0.1 mW, 1 mW, and 2 mW. Inset: schematic of the photocurrent measurement. i I-V curves of a single 0D van der Waals interface device. Inset: schematic of the photocurrent measurement. j The absorption and photocurrent spectrum of the WS2 devices. The photocurrent spectrum indicates a bandgap of about 1.8 eV. k The electrochemical impedance spectroscopy of the 3D nanotube cluster. l, Response of the WS2 nanotube cluster corresponding to different pulse streams.
We further study the optoelectrical response of the single 1D nanotube and the single 0D van der Waals interface device. Upon light illumination, the WS2 nanotube presents an intrinsic photovoltaic effect in accordance with the report by Zhang et al.96, as shown in Fig. 2h and Supplementary Fig. 9. This photovoltaic effect is programmable based on the history of the applied current and voltage, accompanied by interlayer sliding inside the nanotubes97. The programmable photovoltaic response is also observed at the single van der Waals interface device due to the 0D sliding ferroelectric effect98, as shown in Fig. 2i and Supplementary Fig. 10. The absorption spectrum of WS2 nanotubes covers most of the visible wavelength range, corresponding to a bandgap of 1.83 eV, as shown in the upper panel of Fig. 2j. This is in accordance with the photovoltaic current spectra of the 1D nanotube device and 0D van der Waals interface device, as shown in the middle and lower upper panels of Fig. 2j. The optoelectrical response of the WS2 nanotube and their 0D interface allow the intervention of light as a method for manipulating the projections inside the WS2 nanotube cluster, which imitates the optogenetic response of the neural ion channels67,68,69,70,99,100,101,102,103,104.
We further study the dynamics of the 3D nanotube cluster as the physical reservoir. The electrochemical impedance spectroscopy of the 3D nanotube cluster is shown in Fig. 2k and in Supplementary Fig. 11. The cluster shows a capacitive impedance, potentially resulting from the combination of interlayer capacitance105 and nanobattery effect106. The capacitance includes the intrinsic capacitance determined by the complex network structure of the cluster, which is formed by the geometric coupling of the nanotubes and does not change with the voltage bias, and the parasitic capacitance that is sensitive to the electrical dynamics of nanotubes, whose origins is from the nanobattery effect. The capacitive character of the 3D nanotube cluster enables temporal information storage. By applying the voltage pulse stimulation, the responding current reduces with a relaxation time of 0.06 ms, as shown in Supplementary Fig. 12. The cluster was then subjected to different input pulse streams, as shown in Fig. 2l. The read states collected with the input of a 4-bit temporal binary series can be well separated at distinguishable levels. The classification accuracy for distinct pulse signals remains unaffected by readout voltage fluctuations due to the high-dimensional projection property in the reservoir space, which ensures different temporal patterns are mapped to non-overlapping regions. Moreover, the system exhibits intrinsic tolerance to minor perturbations through its nonlinear dynamics and short-term memory properties, while the trained readout layer effectively adapts to such fluctuations by extracting the fundamental statistical features of reservoir responses. The WS₂ nanotube cluster exhibits synaptic behaviors, transitioning from short-term plasticity (STP) to long-term plasticity (LTP) under repeated voltage pulses (0.5 V, 100 ms, 200 ms intervals). As shown in Supplementary Fig. 13a-c, with increasing pulse counts (N = 2, 5, 10), the device shifts from transient STP to persistent LTP, mimicking biological synaptic potentiation. Additionally, the paired-pulse facilitation (PPF) ratio, defined as (A₂ - A₁) / A₁ × 100%, was measured. Supplementary Fig. 13d shows the PPF ratio decreasing with longer pulse intervals, consistent with biological synapses. The ability to store and process temporal information lays the foundation for its application as a computing reservoir for various tasks, as presented below.
Execution of monomodal tasks
We demonstrate the paradigm of VLS PRC based on the 3D WS2 nanotube cluster by implementing various monomodal tasks.
For the handwritten digit recognition task, the digit images from 0 to 9 were binarized and transformed into spatiotemporal sequences, represented by low (0 V) and high voltage (5 V) electrical pulses with eight input channels, as depicted in Fig. 3a. The handwritten digit recognition task was performed by feeding voltage pulses into the 3D WS₂ nanotube network, where the elicited responses were recorded as inputs for the next-level artificial neural network. This network was subsequently fine-tuned through standard backpropagation training to achieve digit classification. As shown in Fig. 3b, classification accuracies were tracked over 200 training epochs, with input data augmented by adding Gaussian noise (σ = 0.05, 0.1,…, 0.4) to evaluate system robustness. In Fig. 3c the confusion matrix was generated for the data with a noise level of σ = 0.05. Despite a gradual decrease in accuracy on the test set with increasing noise levels, the VLS PRC model retained its recognition ability, showcasing the robust image recognition capabilities of the 3D VLS PRC platform. Supplementary Fig. 14 and Supplementary Table 1 compare the performance of the computation using different sizes of the reservoir. It can be seen that the accuracy of the PRC was improved with increasing cluster size. This may be attributed to the fact that projections within a smaller reservoir occur in a lower-dimensional space, which limits the performance of the PRC.
a Schematic of handwritten digit recognition. b Classification accuracies of the handwritten digit recognition task after 200 training epochs, with noise levels of σ = 0.05 to 0.4. c Confusion matrix corresponding to the noise level of σ = 0.05. Color intensity is positively correlated with classification accuracy. d Schematic of speech recognition. Speech digits from 0 to 9 are preprocessed using the Lyon passive cochlear model. e Accuracies of speech recognition. f Confusion matrix with noise of σ = 0.05. g Schematic of biological signal recognition with the four types of ECG waveforms (normal sinus rhythm, coronary artery disease, myocardial infarction, and complete atrioventricular block). h Confusion matrix of the ECG classification task. i MSE curve for time-series forecasting task. j Predicted values (red line) compared with ground truth (black line).
To further assess the capability of the 3D VLS PRC platform in temporal information classification, we conducted a benchmark test focusing on speech recognition. As shown in Fig. 3d, the audio waveforms of isolated spoken digits (zero to nine in English), sourced from the NIST TI-46 database, were resampled at a rate of 12.5 kHz. The waveforms were preprocessed into 8 frequency channels using the Lyon cochlear model107. The Lyon passive cochlear model is elaborated in Supplementary Fig. 15 and Supplementary Note 2. The preprocessed data were then fed into the reservoir. As shown in Fig. 3e and f, we used 200 samples to form the test set and the accuracies increase with the training epochs for all noise levels and reach a satisfying recognition ability for different noise levels. This demonstrates the 3D VLS PRC’s capability to conduct speech classification task even in chaotic auditory environments.
For the biological signal recognition, electrocardiogram (ECG) signals of normal sinus and arrhythmic rhythm—coronary artery disease, myocardial infarction, and complete atrioventricular block108 were used as input signals for the reservoir in Fig. 3g. The signal was amplified and fed into the reservoir via a signal generator. The output was collected and processed with the tunable artificial neural network. The 3D VLS PRC exhibited excellent performance in the recognition task and can do the accurate classification of the normal and the arrhythmic electrocardiogram signals, as shown in Fig. 3h.
Besides the classification tasks, this work explored the prediction task of nonlinear time-series using the 3D VLS PRC. The research builds upon the nonlinear equation of autoregressive moving average with exogenous inputs established in prior studies109, aiming to evaluate the capability of the 3D VLS PRC to process dynamic input time signals and its predictive performance. The dynamic equation is given by
The primary objective of the task is to predict the value y(k), which is dependent on the current input u(k) as well as the recent outputs y(k-1) and y(k-2). The results depend on the system’s response to the current input and its short-term memory capabilities in relation to historical outputs. The input signal u(k), which is represented by the electrical voltage, was subsequently fed into the VLS PRC, while dynamic response was collected and transferred to the readout layer. By training the weights of the readout layer, the mean squared error decreased with increasing number of epochs, as shown in Fig. 3i. The ground truth time-series values and the prediction through the 3D VLS PRC exhibited a strong correlation, as illustrated in Fig. 3j.
The self-organized, submillimeter-long WS₂ nanotube cluster functions as a 3D physical reservoir for very-large-scale computation. The abundance of van der Waals interfaces (1010 mm-3) provides a high-dimensional state space for capturing input patterns. The density of the 3D WS2 nanotube cluster serves as a critical design parameter for optimizing reservoir computing performance, providing essential guidance for future neuromorphic hardware development. Even as the reservoir’s scale and complexity expand, internal information mapping occurs simultaneously. Thus, differently from neural networks implanted in very-large-scale logic circuits, here the power consumption and computation time are not sensitive to the scale. We analyzed the energy consumption of the 3D VLS PRC. The input voltage (V) pulses to the 3D WS2 nanotube cluster reservoir are set to 5 V, and the average operating current (I) is approximately 1.4 μA, attributed to the large impedance of the 3D WS2 nanotube cluster. The power consumption of each input node is P = V × I = 5 V × 1.4 μA = 7.0 μW. The power can be further lowered by decreasing the amplitude of the input voltage, for example, the input voltage of 2.5 V corresponds to an operating current of 0.22 μA and a power of 0.6 μW. For the readout layer, a digital implementation using analog-to-digital converters and an FPGA consumes approximately 81 mW in total. Additional details are provided in the Supplementary Note 3 and Supplementary Table 2. Therefore, the repeatable performance of the entire reservoir computing system has been demonstrated, with additional specific details provided in Supplementary Table 3.
Mimetic optogenetics
Optogenetics offers the second sight that uses light to modulate the organism’s activity or behavior. Here, the mimetic optogenetic behavior is demonstrated using the light-sensitive 3D VLS PRC system. The WS2 nanotube reservoir is exposed to a light beam with an intensity of 3×10-4 W·cm-2. As shown in Fig. 4a, upon illumination, the accuracy of handwritten digit recognition notably decreases, with a reduction of 32% at a noise level of σ = 0.05. The confusion matrix from the illuminated reservoir is presented in Fig. 4b. The increased confusion is caused by the photoconductive and photovoltaic effect in the 1D WS2 nanotube and the 0D van der Waal interfaces, which changes the projection inside the reservoir and disables the trained PRC model. However, this disability can be mitigated by a second-time training of the fully connected layers in the PRC model, as shown by the confusion matrix in Fig. 4c and the learning curves in Supplementary Fig. 16. Similar mimetic optogenetic change is also observed in the speech recognition task (Figs. 4d-f, Supplementary Fig. 17), the electrocardiogram signal recognition task (Figs. 4g-i) and the time-series prediction task (Figs. 4j-l). The light exposure impacts the feature vectors extracted from the WS2 nanotube-network reservoir, confuses the previously learned model, and lowers the recognition or prediction accuracies. The damaged function can be repaired by readjusting the weights in the fully connected layers through the second-time training. In prior research, Cai et al. established that electrical stimulation of neural network reservoirs significantly enhances computational accuracy through augmented neuronal interconnectivity within the reservoir architecture34. Our experimental findings reveal analogous precision enhancement in photo-stimulated reservoir computing systems, potentially stemming from the optimized 3D connectivity established within the nanotube matrix. This structural evolution in the WS₂ nanocluster configuration appears to facilitate strengthened inter-nanotube signal transduction pathways, mirroring the neural network optimization observed in bioelectrical systems. Therefore, the optoelectrical response of the WS2 nanotubes and their interface allow light to manipulate projections inside the WS2 nanotube cluster, translating the principles of optogenetics into the PRC paradigm.
a Accuracies and losses of handwritten digit classification under dark and illuminated conditions. b, c Confusion matrices (b) under illumination and (c) after second-time training with noise of σ = 0.05. d Accuracies and losses of speech recognition. e, Confusion matrix under illumination with noise of σ = 0.05. f Confusion matrix after second-time training with noise of σ = 0.05. g Accuracies of biological signal recognition under three conditions: darkness, illumination, and after second-time training. h, i Confusion matrices (h) under illumination and (i) after second-time training. j MSE function of the time-series prediction. k, l Predicted values from the VLS PRC compared with ground truth (k) under illumination and (l) after second-time training.
Execution of multimodal tasks
The development of multimodal models is regarded as an essential step towards general artificial intelligence. Current deep learning models still struggle with multimodal tasks that humans can easily accomplish. In this work, we utilize the 3D VLS PRC to demonstrate a speech-to-image multimodal generative task which exceeds previous reservoir computing systems, as shown in Supplementary Table 4. Gaussian noises are introduced to an MNIST handwritten digit image dataset over timesteps and a diffusion model is trained to denoise for the reverse process. The 3D VLS PRC device, demonstrated in this work, projects the speech input signals into higher dimensional vectors as the latent conditions to guide the generation of images via the cross-attention mechanism, as illustrated in Fig. 5a. The device performance is reflected in the algorithm through the quality of these projections, as higher nonlinearity and richer dynamics enable separation of features in the latent space, directly influencing the accuracy and robustness of the multimodal output. The loss function curves for generating digital images are shown in Fig. 5b. The generated digit images, guided by the latent vectors from the reservoir, closely resembled their original counterparts, despite a small portion of irregularities highlighted by the blue box, as shown in Fig. 5c. Detailed information about the generative denoising process is depicted in Supplementary Fig. 18. It can be seen that the latent vectors, from the projection of the reservoir, are qualified to condition the generation of the handwriting digits.
a Schematic of a speech-to-image task using the 3D VLS PRC. b Loss curves of the speech-to-image task. c–f Left: digital images generated (c) in darkness, (d) under weak illumination, (e) under intense illumination, and (f) after second-time training. The blue and red boxes label the irregular images and errors. Right: visualization of the latent vectors with reduced dimensions. g, h Blood cell images generated (g) in darkness, (h) under intense illumination. The blue and red boxes label the irregular images and errors. i, j Colorectal tissue images (i) in darkness, (j) under intense illumination. The blue and red boxes label the irregular images and errors.
We further conduct the mimetic optogenetic test on the multimodal model. In the biological system, the disorder in neural and synaptic activity causes cognitive dissonance and intellectual dysfunction. In this work, we used a 532 nm laser with adjustable intensity to modulate reservoir projections, mimicking neural and synaptic activity dynamics in biological intelligence systems. As the 3D VLS reservoir was exposed to weak illumination (1.6×10-4 W·cm-2), there was a noticeable degeneration of the output image, as depicted by the blue and red boxes in Fig. 5d and Supplementary Fig. 19. These annotations, generated by an ANN-based detection algorithm110, classify errors as either critical (red boxes) or partial defects (blue boxes). These results are consistent with our manual observations. Interestingly, we note that the generated images for “7” look more like the shape of “5”. Despite the general lack of explainability of neuromorphic computing, the latent space from the reservoir provides interpretability to a certain extent that the projected vectors of “7” merge with that of “5”, as indicated in the right panel of Fig. 5d. Furthermore, increasing the light intensity (9.3×10-4 W·cm-2) resulted in a more significant degeneration, in which case, the model even fails to generate the shape of a digit or generates a digit image with a wrong label (for example, image “8” is generated with an input speech of “0”), as shown in Fig. 5e, Supplementary Fig. 20, Supplementary Movie 1 and Supplementary Movie 2. The optogenetic perturbation, by modulating the reservoir states through light illumination, further demonstrates the device’s functional role. Using a second-time training with adjustments of weights, the multimodal diffusion model regains the ability of speech-to-image generation, as shown in Fig. 5f and Supplementary Fig. 21. Interestingly, the second-time training enhances the accuracies of the 3D VLS PRC under high-illumination conditions. The enhanced performance can be attributed to the changes in functional connectivity, which enables the dynamic reshaping of the reservoir34. We further extended the 3D VLS PRC multimodal generative paradigm to biomedical domains. The 3D WS₂ nanotube reservoir successfully performed colorectal tissue image and blood cell image generation tasks in Fig.5g-j and Supplementary Figs. 22-25, demonstrating its capability to process complex medical imaging data. The system maintained robust performance despite the increased complexity of the biomedical features, while still exhibiting the characteristic optogenetic modulation under light illumination. This expansion to clinically significant imaging modalities underscores the system’s potential for medical applications in healthcare settings.
This study introduces a computational framework that leverages a self-organized 3D physical reservoir and verifies the feasibility and potential of this concept. The reservoir utilizes the abundant 0D van der Waals interfaces within a WS2 nanotube cluster as mimetic synaptic connections, resembling the neural system of a fruit fly (with a synaptic quantity of ~108 and synaptic density of ~1010 mm-3). Such a high-density synaptic connection enables the system to handle complex sequential tasks and endows it with a high degree of parallel computing capability. The 3D VLS PRC exhibits outstanding performance in both monomodal tasks, such as handwritten digit recognition, speech recognition, biological signal recognition, time-series prediction, and multimodal tasks like speech-to-image generation, highlighting its potential for complex applications. By exploiting the photoelectrical response of the 1D semiconducting nanotubes and their 0D interfaces, the study demonstrates a mimetic optogenetic behavior, enabling the manipulation of the VLS PRC dynamics and projections through light stimulation. This work investigates the principles of the scaling law, the capability to perform multimodal tasks, and the mimetic optogenetic mechanisms in the VLS physical reservoir, providing an innovative perspective on how computer science, nanotechnology, optoelectronics, and neuroscience merge to shape the next-generation artificial intelligence paradigm. Looking ahead, the development of all-hardware neuromorphic systems represents a critical frontier for future research. Future efforts will focus on bridging the gap between software-based training and hardware implementations, leveraging the unique properties of WS₂ nanotube cluster and their optoelectronic responses to create more robust and efficient neuromorphic computing platforms.
Methods
Materials
WS2 nanotubes were synthesized through two steps: firstly, the ultralong W5O14 nano whiskers were obtained by the high-temperature annealing of HxWO3 in a sealed quartz ampoule environment. These nanowhiskers were then used as precursors and subjected to a sulfidation process at 845 °C for 6 h in a flowing mixture of H2S and H2 gases, ultimately yielding almost pure ultralong (0.1 − 0.5 mm) WS2 nanotubes. The synthesis process is highly controlled and scalable, with a yield approaching 100%. The nanotubes exhibit exceptional stability, retaining their structural and functional properties for over 25 years, making them ideal for long-term applications in neuromorphic computing. The final WS2 nanotube crystals consist of a complex, bottom-up randomly stacked network of WS2 nanotubes, providing the material foundation for this research. The nanotube sample was then transferred for X-ray diffraction, SEM, TEM, micro X-ray CT, Raman spectroscopy, and absorption spectroscopy characterization.
Devices
WS2 nanotubes were dispersed on a silicon substrate coated with 500 nm of silicon dioxide. The dispersion density of WS2 on the silicon wafer was examined using an optical microscope, which allowed for the selection of suitable single nanotubes and their crossbars. Subsequently, a thin film of PMMA photoresist was deposited via spin-coating, followed by thermal treatment to remove residual solvent. Electron beam lithography was then employed to pattern the electrode geometry. After the development of the pattern, Cr (20 nm)/Au (120 nm) metal layers were deposited as the electrode via electron beam evaporation. The lift-off process was done by dissolving the PMMA in acetone. Subsequently, electrical and optoelectrical measurement of the device were conducted with a semiconductor characterization system with a probe station. The incident lights were introduced by lasers with wavelengths of 488, 532, 633 nm and a supercontinuum laser with a range between 655 nm to 1,000 nm.
Physical reservoir computing framework
The WS2 nanotube cluster was integrated into a reservoir computing framework. A microcontroller or DG1032Z signal generator was used to transmit digital or analog signals (handwritten digits, spoken digits, bio-signals, time-series signals) to the WS2 nanotube cluster in the form of electrical voltage via gold wire or tungsten probes. The reservoir layer received the temporal electrical stimuli and mapped them into a high-dimensional computational space. The output voltage signals were collected by an oscilloscope and fed into a layer of fully connected artificial neural networks, which delivered the final results for classification and prediction.
Dataset
The handwriting images were from the MNIST dataset, which originates from the National Institute of Standards and Technology. The 28×28 pixel grayscale images were downscaled to 8×8 binary images that can be represented by the binary high and low voltages. All images are labeled with the corresponding digit they represent, ranging from 0 to 9. The speech recognition data were from the TI-46 dataset, which consists of read utterances including the numbers 0 to 9. The sound was collected using an Electro-Voice RE-16 Dynamic Cardioid microphone at a 12.5 kHz sample rate with 12-bit quantization. The waveforms of the speeches were fed into 8 frequency channels using the Lyon cochlear model, and then into the reservoir. Biological signals were sampled from the built-in electrocardiogram waveforms of the signal generator (one normal rhythm and three arrhythmias) for the task of arrhythmia detection. The input for time-series prediction was derived from the random discontinuous waveforms built into the signal generator.
Monomodal task training
For classification tasks such as handwritten digit recognition, speech recognition, and biological signal recognition, the feature vector from the reservoir serves as the input to the readout layer. Using the states of the reservoir at the final timestep as input x, and the output\(\,\hat{y}\) was obtained through the softmax layer.
where \({w}_{{ij}}\) refers to the synaptic weight between the reservoir and the softmax layer, b is the bias.
The cross-entropy loss function was used to measure the difference between the read-out prediction \({\hat{y}}_{j}\) and the desired target \({y}_{j}\)
where n is the total number of samples.
Linear regression algorithm was used to train the readout function for nonlinear time-series prediction task. Assume the state of the reservoir is x, which is fed to the readout network as the input. The cost function is defined as
where n is the total number of samples, \({y}^{(i)}\) is the desired value of the input \({x}^{(i)}\), w refers to the weight matrix.
Gradient descent was used to find the parameters that correspond to the minimum value of the cost function J.
Multimodal task training
In the time-series signal forecasting task, the input signal feature vector generated by the VLS PRC is utilized through the cross-attention mechanism as a conditional vector to guide the process from Gaussian noises to the target images. The equation for adding Gaussian noise\(\,\epsilon\) to an original image x0 is given by
where xt refers to the generated noisy image, βs refers to the noise variance added at time step s.
The mean squared error (MSE) loss function was used to measure the difference between the predicted noise and the actual noise. The MSE loss is defined as
where n refers to the total number of samples, \({\epsilon }_{i}\) represents the actual noise, and \({\hat{\epsilon }}_{i}\) represents the predicted noise.
Data availability
The authors declare that the data supporting the findings of this study are available within the paper and its Supplementary Information files.
References
Bahri, Y., Dyer, E., Kaplan, J., Lee, J. & Sharma, U. Explaining neural scaling laws. Proc. Natl. Acad. Sci. USA 121, e2311878121 (2024).
Schulz, M.-A. et al. Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets. Nat. Commun. 11, 4238 (2020).
Meir, Y. et al. Power-law scaling to assist with key challenges in artificial intelligence. Sci. Rep. 10, 19628 (2020).
Dermitzakis, K., Carbajal, J. P. & Marden, J. H. Scaling Laws in Robotics. Procedia Computer Sci. 7, 250–252 (2011).
Wang, X. et al. Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey. Mach. Intell. Res. 20, 447–482 (2023).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017).
Hirschberg, J. & Manning, C. D. Advances in natural language processing. Science 349, 261–266 (2015).
Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 61, 85–117 (2015).
Jabeen, S. et al. A review on methods and applications in multimodal deep learning. ACM Trans. Multimed. Comput., Commun. Appl. 19, 1–41 (2023).
Nilashi, M. et al. Measuring sustainability through ecological sustainability and human sustainability: A machine learning approach. J. Clean. Prod. 240, 118162 (2019).
Hino, M., Benami, E. & Brooks, N. Machine learning for environmental monitoring. Nat. Sustainability 1, 583–588 (2018).
Wang, Y. et al. An in-memory computing architecture based on two-dimensional semiconductors for multiply-accumulate operations. Nat. Commun. 12, 3347 (2021).
Bao, H. et al. Toward memristive in-memory computing: principles and applications. Front. Optoelectron. 15, 23 (2022).
Van De Burgt, Y. et al. A non-volatile organic electrochemical device as a low-voltage artificial synapse for neuromorphic computing. Nat. Mater. 16, 414–418 (2017).
Kimura, H. et al. Complementary ferroelectric-capacitor logic for low-power logic-in-memory VLSI. IEEE J. Solid-State Circuits 39, 919–926 (2004).
Jia, H. Y., Valavi, H., Tang, Y. Q., Zhang, J. T. & Verma, N. A Programmable Heterogeneous Microprocessor Based on Bit-Scalable In-Memory Computing. IEEE J. Solid-State Circuits 55, 2609–2621 (2020).
Sun, Y. et al. Experimental demonstration of a skyrmion-enhanced strain-mediated physical reservoir computing system. Nat. Commun. 14, 3434 (2023).
Mennel, L. et al. Ultrafast machine vision with 2D material neural network image sensors. Nature 579, 62–66 (2020).
Yang, Z. et al. A vision chip with complementary pathways for open-world sensing. Nature 629, 1027–1033 (2024).
Pei, J. et al. Towards artificial general intelligence with hybrid Tianjic chip architecture. Nature 572, 106–111 (2019).
Liao, F. et al. Bioinspired in-sensor visual adaptation for accurate perception. Nat. Electron. 5, 84–91 (2022).
Zhou, F. et al. Optoelectronic resistive random access memory for neuromorphic vision sensors. Nat. Nanotechnol. 14, 776–782 (2019).
Zhong, Y. et al. A memristor-based analogue reservoir computing system for real-time and power-efficient signal processing. Nat. Electron. 5, 672–681 (2022).
Liang, X., Luo, Y., Pei, Y., Wang, M. & Liu, C. Multimode transistors and neural networks based on ion-dynamic capacitance. Nat. Electron. 5, 859–869 (2022).
Larger, L. et al. Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing. Opt. Express 20, 3241–3249 (2012).
Elbedwehy, A. N., El-Mohandes, A. M., Elnakib, A. & Abou-Elsoud, M. E. FPGA-based reservoir computing system for ECG denoising. Microprocessors Microsyst. 91, 104549 (2022).
Bai, K. & Yi, Y. DFR: An energy-efficient analog delay feedback reservoir computing system for brain-inspired computing. ACM J. Emerg. Technol. Comput. Syst. 14, 1–22 (2018).
Lin, C., Liang, Y. & Yi, Y. Fpga-based reservoir computing with optimized reservoir node architecture. 2022 23rd International Symposium on Quality Electronic Design (ISQED), Santa Clara, CA, USA, 2022, pp. 1-6.
Schürmann, F., Meier, K. & Schemmel, J. Edge of chaos computation in mixed-mode vlsi-a hard liquid. Advances in Neural Information Processing Systems 17 (2004).
Duport, F., Schneider, B., Smerieri, A., Haelterman, M. & Massar, S. All-optical reservoir computing. Opt. Express 20, 22783–22795 (2012).
Brunner, D. & Fischer, I. Reconfigurable semiconductor laser networks based on diffractive coupling. Opt. Lett. 40, 3854–3857 (2015).
Rowlands, G. E. et al. Reservoir computing with superconducting electronics. Preprint at https://doi.org/10.48550/arXiv.2103.02522 (2021).
Ouyang, B. et al. Bioinspired in-sensor spectral adaptation for perceiving spectrally distinctive features. Nat. Electron. 7, 705–713 (2024).
Cai, H. W. et al. Brain organoid reservoir computing for artificial intelligence. Nat. Electron. 6, 1032–1039 (2023).
Cucchi, M. et al. Reservoir computing with biocompatible organic electrochemical networks for brain-inspired biosignal classification. Science Advances 7 (2021).
Gao, C. et al. Toward grouped-reservoir computing: organic neuromorphic vertical transistor with distributed reservoir states for efficient recognition and prediction. Nat. Commun. 15, 740 (2024).
Kong, L.-W., Brewer, G. A. & Lai, Y.-C. Reservoir-computing based associative memory and itinerancy for complex dynamical attractors. Nat. Commun. 15, 4840 (2024).
Lian, M. et al. Towards mixed physical node reservoir computing: light-emitting synaptic reservoir system with dual photoelectric output. Light.: Sci. Appl. 13, 179 (2024).
Lin, Z., Lu, Z., Di, Z. & Tang, Y. Learning noise-induced transitions by multi-scaling reservoir computing. Nat. Commun. 15, 6584 (2024).
Moon, J. et al. Temporal data classification and forecasting using a memristor-based reservoir computing system. Nat. Electron. 2, 480–487 (2019).
Baltussen, M. G., de Jong, T. J., Duez, Q., Robinson, W. E. & Huck, W. T. S. Chemical reservoir computation in a self-organizing reaction network. Nature 631, 549–555 (2024).
Wu, X. S. et al. Wearable in-sensor reservoir computing using optoelectronic polymers with through-space charge-transport characteristics for multi-task learning. Nat. Commun. 14, 468 (2023).
Milano, G. et al. In materia reservoir computing with a fully memristive architecture based on self-organizing nanowire networks. Nat. Mater. 21, 195–202 (2022).
Yao, P. et al. Fully hardware-implemented memristor convolutional neural network. Nature 577, 641–646 (2020).
Yan, M. et al. Emerging opportunities and challenges for the future of reservoir computing. Nat. Commun. 15, 2056 (2024).
Liang, X. et al. Physical reservoir computing with emerging electronics. Nat. Electron. 7, 193–206 (2024).
Du, C. et al. Reservoir computing using dynamic memristors for temporal information processing. Nat. Commun. 8, 2204 (2017).
Vandoorne, K. et al. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun. 5, 3541 (2014).
Torrejon, J. et al. Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431 (2017).
Zhong, Y. et al. Dynamic memristor-based reservoir computing for high-efficiency temporal signal processing. Nat. Commun. 12, 408 (2021).
Sun, L. et al. In-sensor reservoir computing for language learning via two-dimensional memristors. Sci. Adv. 7, eabg1455 (2021).
Govia, L., Ribeill, G., Rowlands, G., Krovi, H. & Ohki, T. Quantum reservoir computing with a single nonlinear oscillator. Phys. Rev. Res. 3, 013077 (2021).
Marinella, M. J. & Agarwal, S. Efficient reservoir computing with memristors. Nat. Electron. 2, 437–438 (2019).
Liang, X. et al. Rotating neurons for all-analog implementation of cyclic reservoir computing. Nat. Commun. 13, 1549 (2022).
Chen, Z. et al. All-ferroelectric implementation of reservoir computing. Nat. Commun. 14, 3585 (2023).
Jang, Y. H. et al. Time-varying data processing with nonvolatile memristor-based temporal kernel. Nat. Commun. 12, 5727 (2021).
Hochstetter, J. et al. Avalanches and edge-of-chaos learning in neuromorphic nanowire networks. Nat. Commun. 12, 4008 (2021).
Tanaka, H. et al. A molecular neuromorphic network device consisting of single-walled carbon nanotubes complexed with polyoxometalate. Nat. Commun. 9, 2693 (2018).
Maass, W., Natschläger, T. & Markram, H. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Comput. 14, 2531–2560 (2002).
Jaeger, H. The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. Bonn., Ger.: Ger. Natl. Res. Cent. Inf. Technol. GMD Tech. Rep. 148, 13 (2001).
Chen, R. et al. Thin-film transistor for temporal self-adaptive reservoir computing with closed-loop architecture. Sci. Adv. 10, eadl1299 (2024).
Gaurav, A. et al. Reservoir computing for temporal data classification using a dynamic solid electrolyte ZnO thin film transistor. Front. Electron. 3, 869013 (2022).
Lao, J. et al. Ultralow-power machine vision with self-powered sensor reservoir. Adv. Sci. 9, 2106092 (2022).
Tanaka, G. et al. Recent advances in physical reservoir computing: A review. Neural Netw. 115, 100–123 (2019).
Qi, Z. et al. Physical reservoir computing based on nanoscale materials and devices. Adv. Funct. Mater. 33, 2306149 (2023).
Love, J. et al. Spatial analysis of physical reservoir computers. Phys. Rev. Appl. 20, 044057 (2023).
Newman, J. P. et al. Optogenetic feedback control of neural activity. Elife 4, e07192 (2015).
Szobota, S. & Isacoff, E. Y. Optical control of neuronal activity. Annu. Rev. Biophysics 39, 329–348 (2010).
Ciabatti, E., Gonzalez-Rueda, A., Mariotti, L., Morgese, F. & Tripodi, M. Life-Long Genetic and Functional Access to Neural Circuits Using Self-Inactivating Rabies Virus. Cell 170, 382–392.e314 (2017).
Park, S. et al. Optogenetic control of nerve growth. Sci. Rep. 5, 9669 (2015).
Carvalho-de-Souza, J. L. et al. Photosensitivity of neurons enabled by cell-targeted gold nanoparticles. Neuron 86, 207–217 (2015).
Du, Z. et al. Photoacoustic: A Versatile Nongenetic Method for High-Precision Neuromodulation. Acc. Chem. Res. 57, 1595–1607 (2024).
Jiang, Y. et al. Nongenetic optical neuromodulation with silicon-based materials. Nat. Protoc. 14, 1339–1376 (2019).
Lu, X., Shen, Y. & Campbell, R. E. Engineering photosensory modules of non-opsin-based optogenetic actuators. Int. J. Mol. Sci. 21, 6522 (2020).
Ma, L. et al. Multifunctional bioactive Nd-Ca-Si glasses for fluorescence thermometry, photothermal therapy, and burn tissue repair. Sci. Adv. 6, eabb1311 (2020).
Spiltoir, J. I. & Tucker, C. L. Photodimerization systems for regulating protein–protein interactions with light. Curr. Opin. Struct. Biol. 57, 1–8 (2019).
Tian, B. Nongenetic neural control with light. Science 365, 457–457 (2019).
Wang, M. et al. Nongenetic optical modulation of neural stem cell proliferation and neuronal/glial differentiation. Biomaterials 225, 119539 (2019).
Chen, S. et al. Artificial organic afferent nerves enable closed-loop tactile feedback for intelligent robot. Nat. Commun. 15, 7056 (2024).
Lecun, Y., Bottou, L., Bengio, Y. & Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
Baktash, J. A. & Dawodi, M. J. a. p. a. Gpt-4: A review on advancements and opportunities in natural language processing. arXiv preprint arXiv:2305.03195 (2023).
Thirunavukarasu, A. J. et al. Large language models in medicine. Nat. Med. 29, 1930–1940 (2023).
Chang, Y. et al. A survey on evaluation of large language models. ACM Trans. Intell. Syst. Technol. 15, 1–45 (2024).
Zhao, H. et al. Explainability for large language models: A survey. ACM Trans. Intell. Syst. Technol. 15, 1–38 (2024).
Singhal, K. et al. Large language models encode clinical knowledge. Nature 620, 172–180 (2023).
Herculano-Houzel, S. et al. The elephant brain in numbers. Front. Neuroanat. 8, 46 (2014).
Herculano-Houzel, S. Not all brains are made the same: new views on brain scaling in evolution. Brain, Behav. Evolution 78, 22–36 (2011).
White, J. G., Southgate, E., Thomson, J. N. & Brenner, S. The structure of the nervous-system of the nematode Caenorhabditis elegans. Philos. Trans. R. Soc. Lond. Ser. B, Biol. Sci. 314, 1–340 (1986).
Zhu, F. et al. Architecture of the mouse brain synaptome. Neuron 99, 781–799. e710 (2018).
Kundrát, V. C. et al. Submillimeter-Long WS2 Nanotubes: The Pathway to Inorganic Buckypaper. Nano Lett. 23, 10259–10266 (2023).
Hilgetag, C. C. & Goulas, A. ‘Hierarchy’ in the organization of brain networks. Philos. Trans. R. Soc. Lond. Ser. B, Biol. Sci. 375, 20190319 (2020).
Lecun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Dong, S., Wang, P. & Abbas, K. A survey on deep learning and its applications. Computer Sci. Rev. 40, 100379 (2021).
Freeman, J. O. et al. Random packing of rods in small containers. Granul. Matter 21, 1–11 (2019).
Philipse, A. P. The random contact equation and its implications for (colloidal) rods in packings, suspensions, and anisotropic Powders. Langmuir 12, 1127–1133 (1996).
Zhang, Y. et al. Enhanced intrinsic photovoltaic effect in tungsten disulfide nanotubes. Nature 570, 349–353 (2019).
Sun, Y. et al. Mesoscopic sliding ferroelectricity enabled photovoltaic random access memory for material-level artificial vision system. Nat. Commun. 13, 5391 (2022).
Niu, Y. et al. 0D van der Waals interfacial ferroelectricity. Nat. Commun. 14, 5578 (2023).
Shemesh, O. A. et al. Temporally precise single-cell-resolution optogenetics. Nat. Neurosci. 20, 1796–1806 (2017).
Hamada, H. T. et al. Optogenetic activation of dorsal raphe serotonin neurons induces brain-wide activation. Nat. Commun. 15, 4152 (2024).
Baratta, M. et al. Optogenetic control of genetically-targeted pyramidal neuron activity in prefrontal cortex. Nature Precedings, 1-1 (2012).
Liu, K. et al. An optoelectronic synapse based on α-In2Se3 with controllable temporal dynamics for multimode and multiscale reservoir computing. Nat. Electron. 5, 761–773 (2022).
Druet, V. et al. A single n-type semiconducting polymer-based photo-electrochemical transistor. Nat. Commun. 14, 5481 (2023).
Yuan, C., Xu, K. X., Huang, Y. T., Xu, J. J. & Zhao, W. W. An aquatic autonomic nervous system. Adv. Mater. 36, 2407654 (2024).
Tu, Y.-J., Delmerico, S. & McDaniel, J. G. Inner layer capacitance of organic electrolytes from constant voltage molecular dynamics. J. Phys. Chem. C. 124, 2907–2922 (2020).
Valov, I. et al. Nanobatteries in redox-based resistive switches require extension of memristor theory. Nat. Commun. 4, 1771 (2013).
Lyon, R. F. Speech Recognition Experiments with a Cochlear Model. Can. Acoust. 14, 10–11 (1986).
Moody, G. B. & Mark, R. G. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 20, 45–50 (2001).
Chetouani, Y. A non-linear auto-regressive moving average with exogenous input non-linear modelling and fault detection using the cumulative sum (Page-Hinkley) test: application to a reactor. Int. J. Computer Appl. Technol. 32, 187–193 (2008).
Keras Team. (n.d). Image classification from scratch example. Retrieved from https://github.com/keras-team/keras-io/blob/master/examples/vision/image_classification_from_scratch.py.
Acknowledgements
This work is supported by Natural Science Foundation of China (Grant No. 62274011) (Y.G.), (Grant No. 12301547) (W.L.) and Innovation Plan of Beijing Institute of Technology (Y.G.). We thank Dr. Ayala Cohen and Prof. Zhiwei Wang for useful discussions, and Analysis & Testing Center, BIT for help with the experimental. R.T. acknowledges the support of the Estate of Manfred Hecht and the Estate of Diane Recanati. He also acknowledges the Perlman Family Foundation.
Author information
Authors and Affiliations
Contributions
Conceptualization: YG. Methodology: XH, ZQ, YG. Material: AZ, VK, RT. Discussion: XH, ZQ, VK, HL, ZL, XG, PM, WZ, SH, RL, HW, PW, AZ, WL, RT, YG. Funding acquisition: WL, RT, YG. Supervision: YG. Experimental: XH, ZQ, VK, HL, ZL, XG, PM, WZ, SH, RL, HW, PW, AZ, WL, RT, YG. Writing: XH, RT, WL, YG.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Weiguo Huang and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Han, X., Qi, Z., Kundrat, V. et al. Very-large-scale mimetic optogenetic synapses for physical reservoir computing. Nat Commun 17, 1514 (2026). https://doi.org/10.1038/s41467-025-68229-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-68229-8







