Introduction

Two-dimensional (2D) materials exhibit a plethora of nonlinear optical responses1,2,3, such as saturable absorption4,5, stimulated emission6,7, and high harmonic generation8,9,10,11,12,13. Especially, due to the Coulomb interactions confined in the atom-thin plane, the nonlinear responses of 2D materials are sensitive to external modulations1,2,3,8,9,10,11,12,13,14,15. This is desired for optical neuromorphic computing16,17,18,19,20,21, enabling the characteristics of intrinsic tunability, ultrafast, power-efficient, and parallelism6,8,10,20,22,23,24. The modulation strategies are mainly conducted in guided space by integrating 2D materials onto waveguides15,25,26,27, ring resonators28,29, and fibers30. The embedding of 2D materials in free-space computing systems can provide new flexibilities to design neuromorphic functions14,22,23,31,32,33. However, the modulation performance in free space is limited8,9,10,11,12,13,14,34, because of the limited light-matter interaction length along the out-of-plane direction of 2D materials8,10,11,34. In addition, the requirement of reconfigurable encoding procedures for different functions in free-space computing systems is another important challenge for stable and effective tunability22,23,24,31,32,33.

Fabricating electrode array onto 2D materials and applying electric fields can modulate nonlinear signals by changing the oscillating strength of excitons or breaking the crystal inversion symmetry8,9; the electronic system shows high tunability but also sacrifices the speed to ~ ns8,9,12,13. Free-space photonic structures, including nanocavities and metamaterials, can modulate signals at high speed and efficiency10,11,15,34,35. However, the spectral bandwidth and modulation depth are restricted by their resonant nature, limiting the tunability range to fit with neural dynamics10,14,34. Except for the above contradiction, the complex fabrication and large-scale integration are also challenging for the coupling of 2D materials with external structures. Therefore, using bare 2D materials independently is envisioned as a promising route to develop new free-space optical computing systems. Recently, related devices, such as spatial light modulators36 and meta-lenses37 have been demonstrated with purely patterned 2D materials and have realized neural networks. However, the tunability of the system is still limited by the fixed geometric structures38. To support computing functions with high performance and tunability, directly encoding optical excitation fields to intrinsically excite various nonlinear responses rather than rely on geometric structures is a solution4,8,10. However, compared to the above strategies, the limited signal-to-noise ratio (SNR) is a congenital restriction. Therefore, building a free-space optical neuromorphic computing system based on 2D materials remains a challenge (Fig. 1a).

Fig. 1: Free-space optical neuromorphic computing concept based on a pure 2D MoS2 array.
figure 1

a Contradiction between performance and tunability of the system for different strategies. The proposed system shows an improved modulation performance and simultaneously ensures the tunability of the system. b Schematic of the computing concept based on a pure MoS2 flake array. Within the pump-probe-control setup (bottom), the relative transmittance (ΔT) is effectively enhanced and modulated, assisting the design of computing functions. c Schematic of the computing mechanism. The synergistic transition mainly involves five specific bands: the VB maximum and CB minimum near the K points, the parallel VB and CB between the K and Γ points, and the high-energy nested CBs near the Γ point. The ΔT with the control pulse off is dominated by TPA at the K points. The ΔT with the control pulse on is dominated by SESA between the K and Γ points. d Simplified schematic of the computing system. e Schematic of the computing methodology. Computing functionalities can be realized by modulating the pulse on/off state, pulse power, pulse delay, and encoding the flake thickness. f Schematic of optical ANN. The ΔT is a function of control power and control time delay. The control power in each cell is the input signal, and the control time delay is encoded into weight. The cell thicknesses are identical. g Schematic of digital processing functions. Optical AND, OR, NAND, NOR gates, and DAC are realized by encoding the control pulse power and delay into input bits and choosing appropriate cell thickness as input ports.

In this work, we proposed a new free-space optical neuromorphic computing concept to conduct signal processing and computing within a bare 2D molybdenum disulfide (MoS2) array. This system shows an improved modulation performance with intrinsic fast speed, low energy consumption, and enhanced SNR of > 16 dB while simultaneously ensuring tunability through the synergistic encodings of the 2D cells and the excitation pulses. The key mechanism is the intrinsic transition from two-photon absorption (TPA)39 to synergistic excited states absorption (SESA)40, revealed from the modulated relative transmittance (ΔT) of MoS2 by a pump-probe-control scheme. After synergistic excitation, the ΔT is enhanced because of the much higher density of states (DOS) for the final excited states40. In addition, the ΔT shows strong thickness-dependence, due to the generation of ionized electron-hole pairs assisted by Coulomb screening. The relaxation of ΔT is prolonged, mainly attributed to the thermalization of ionized electron-hole pairs40,41. Then, the control pulse delay, pulse power, and flake thickness can encode different computing functions to demonstrate the modulated performance and tunability. Here, an artificial neural network (ANN) was developed by encoding the control time delay into 20 weight levels. The optical ANN showed a maximum operating time of ~ 5.47 ps in one cell, and the operation energy consumption for one multiplication calculation was between ~ 5.12 − 9.12 pJ. After training, the experimental testing agreed with the simulated results. In addition, digital processing, such as optical logic gates and a digital-to-analog converter (DAC) was demonstrated to show the tunability and adaptability to different functions. The performance modulation in MoS2 is stable and reliable. The pulse spot size determines the equivalent cell size, showing the potential for a higher cell density. More importantly, the methodology and concept apply to various 2D materials. Therefore, this free-space optical neuromorphic computing concept based on pure 2D materials can provide a new approach to program neuromorphic applications.

Results

Free-space computing concept based on bare 2D MoS2 flake array

The system concept is based on a pure MoS2 flake array without electrode arrays, complex nanocavities, or meta-structures, therefore simultaneously avoiding the complex fabrication and restrictions of these structures (Fig. 1b). To improve the modulation of nonlinear responses based on the synergistic transitions, we designed a pump-probe-control experimental setup4 (Supplementary Fig. 1a) based on the band structure of MoS2 (Fig. 1c). In theory, the A and B excitons near the band edge are residing at the K points of the Brillouin zone40. The C peak is determined by transitions between K and Γ points, distributed in a circular region around the Γ point where the conduction band (CB) and valence band (VB) are nearly parallel40. This particular band alignment generates larger joint DOS for much higher absorption near the C peak, and more substantial many-body effects can occur under external excitation40. Here, A and B excitons are in the same state by neglecting the spin-orbit coupling. Therefore, the synergistic excitation mechanism mainly involves five specific bands in the momentum space: the VB maximum and CB minimum near the K points, the parallel VB-CB between the K and Γ points, and the high-energy nested CBs near the Γ point. Then we can determine the pump pulse (1040 nm, 220 fs), probe pulse (902 nm, 100 fs), and control pulse (451 nm, 100 fs) to modulate the ΔT of MoS2. The superimposed light field exhibited an effective full width at half maximum of ~ 156 nm. The pump (control) time delay is defined as the excitation time difference of the pump (control) pulse relative to the probe pulse, modulated mechanically by two time-delay lines. Because the pump and probe photon energies were smaller than the bandgap, the pump-probe response without the control pulse is attributed to TPA. The control pulse photon energy is near the C peak to effectively promote transitions in the parallel VB-CB bands. The synergistic excitation under the control pulse can generate excited states in the high-energy nested CBs, where the much higher DOS can enhance the ΔT, therefore, intrinsically improving the modulation capability. Within this mechanism, we can build a free-space computing system (Fig. 1d), in which the output power of each pulse is controlled within the laser source, the pulse on/off state and delay are controlled within the light path, and the MoS2 cell is selected by modulating the focusing location. As a result, the ΔT responses, as a function of control pulse state, pulse power, pulse delay, and flake thickness, can be used to encode different computing functions (Fig. 1e), as will be discussed later. Because our mechanism and function encoding scheme are mainly determined by the absorption in MoS2 rather than the geometry of the MoS2 array, here inverse design is not needed. In the experimental setups (Supplementary Fig. 1b), the ΔT responses were measured by a photodiode, and we used the original data for the following discussions, with the signal unit in counts per second (cts/s).

Modulation performance

The ΔT responses were characterized to reveal the key mechanism and the modulation performance. The experimental setups are summarized in the methods. It should be noted that the nonlinear characteristics are sample-independent. The on/off state of the control pulse was switched in situ to directly enhance the ΔT response. The ΔT was near zero (pump absorption ~ 1%) when the control pulse was off, while an enhanced ΔT (pump absorption ~ 15%) was detected when the control pulse was on (Fig. 2a, sample A), indicating the synergistic excitation nature different from saturable absorption1,4. The on/off state of the control pulse can realize an intensity modulator with an enhanced SNR of > 16 dB, which is determined by the higher DOS in the high-energy nested CBs. With the increase of the control power, the ΔT dynamics under scanned pump time delay were enhanced monotonically (Fig. 2b, Supplementary Fig. 2, Sample B, ΔT mappings in Supplementary Figs. 3 and 4), indicating the enhancement is mainly attributed to the control pulse. The non-shifted peak and identical relaxation lifetime reveal that the control power did not alter the final excited states. Quantitatively, the ΔT showed a linear relationship with power for all three pulses (Fig. 2c, sample C, ΔT mappings in Supplementary Figs. 5 to 7). The signal followed \(\Delta T\propto {P}_{{{\rm{Pump}}}}\times {P}_{{{\rm{Probe}}}}\times {P}_{{{\rm{Ctl}}}}\), matching the TPA nature when the control pulse was off, and excluding the stimulated emission mechanism when the control pulse was on. This linear relationship with the pulse power can encode power as input and ΔT as output for computing. It should be noted that higher pulse power can lead to saturable absorption. The pulse power should be limited to keep the above linear relationship and ensure accuracy based on the proposed computing methods. Besides, due to the synergistic effects, the ΔT dynamics under scanned control time delay showed a prolonged relaxation to over ~ 5 ps (Fig. 2d), which can be feasible for pulse delay-encoded applications. This prolonged relaxation is also revealed from ΔT mappings in Supplementary Figs. 8 and 9.

Fig. 2: Modulation performance.
figure 2

a The ΔT enhancement (pump absorption ~ 15%) is controlled by the on/off state of the control pulse. The ΔT distribution is extracted along the dashed lines in the inset mappings. Scale bars, 5 μm. b ΔT dynamics vs. pump time delay under different control powers in sample B. The shaded area shows the standard deviation of ΔT acquisition. c The linear dependence of ΔT with power for three pulses in sample C. The solid line shows the linear fitted results, and the shaded area shows the standard deviation of ΔT acquisition. d The red line shows the fast decay with the pump time delay when the control pulse was off. The blue line shows the prolonged decay with the control time delay when the control pulse was on. The shaded area shows the standard deviation of ΔT acquisition. e The thickness distribution for eight selected regions. f, g The thickness-ΔT correlation at a pump time delay of 82 fs, with the control pulse off (f) and on (g). The right panels show the ΔT distribution in each region.

The ΔT response under scanned pump time delay was analyzed for regions with different thicknesses (Sample D, regions marked in Supplementary Figs. 10 and 11). Regardless of the thickness, the ΔT peak was blue shifted to the same pump time delay of 48 fs when the control pulse was on, indicating all regions share the same excitation path (Supplementary Fig. 12). After summarizing the thickness distributions for all regions in Fig. 2e (atomic force microscopy (AFM) image in Supplementary Fig. 13), we focused on the ΔT-thickness correlation at a pump time delay of 82 fs. The ΔT showed weak thickness-dependence without the control pulse (Fig. 2f and Supplementary Fig. 14) due to the limited changes in DOS for TPA42. With the control pulse included, the ΔT decreased monotonically with the increase of thickness when the thickness was below ~70 nm, while the ΔT enhancement was quenched for thicker locations over ~ 70 nm (Fig. 2g and Supplementary Fig. 15), showing the screening of the control pulse. The enhanced ΔT is attributed to the much higher DOS for SESA40. As the thickness increases, the high DOS near the nested CBs contributes to stronger Coulomb screening41,43. This screening effect makes more hot carriers generate ionized electron-hole pairs and relax through intraband scattering, leading to the observed lower ΔT41,42. Over the thickness limit of ~ 70 nm, the intraband scattering dominates and leads to the observed quenching effect. The thermalization of ionized electron-hole pairs can contribute to the prolonged relaxation lifetime41,43 (detailed explanation in Supplementary Note 1). It should be noted that computing functions in this study are encoded with cell thickness below ~ 70 nm, where SESA dominates. A similar thickness dependence is revealed in the gain distribution \(G={\Delta T}_{{{{\rm{control}}}} \, {{{\rm{on}}}}}/{\Delta T}_{{{{\rm{control}}}} \, {{{\rm{off}}}}}\) (Supplementary Fig. 16). Hence, the improved modulation performance relies on enhanced excitation and prolonged relaxation. This strong thickness dependence is also evidence that the nonlinear process in MoS2 is dominated by the interaction with the in-plane electric field component of the light field44.

Optical ANN in the system

Then, this system can realize an optical ANN, and using bare 2D materials for ANN was rarely estimated. The optical system performed input encoding, weight encoding, vector-matrix multiplication, and output detection. The weight updating values during backpropagation, and output summation and activation were calculated with a digital computer. Next, we focus on the performance of the optical system. Here, the output ΔT for each cell was a function of control power and control time delay, where the control power functioned as the input signal, and the control time delay marked as the signal delay Δτ was encoded as the weight (Fig. 1f). Therefore, the output ΔT quantifies the calculation of \({\mbox{input}}\times {\mbox{weight}}\) in each MoS2 cell. We used 9 cells with the same thickness for ANN computing (optical image in Supplementary Fig. 13h), so the output ΔT in each cell can be added directly. The pump time delay was zero, and pump-probe powers were 2 mW as fixed parameters during computing. ΔT responses were measured for each cell in sequence. Under different PCtl from 1 to 8 mW, the Δτ was continuously modulated from 5.47 to 0 ps in the first half cycle, and then increased to 5.47 ps in the second half cycle, to monitor the ΔT responses in 8 cycles (Fig. 3a and Supplementary Fig. 17a). This continuous Δτ was set by mechanically moving the delay line on the light path of the control pulse. A larger difference in light path length leads to a larger Δτ. From the results, the response is stable from cycle to cycle, and the symmetric characteristic is beneficial for weight updates. Besides, MoS2 is stable at ambient conditions, the laser power is below the saturation threshold and induces no damage to the material, and the light-matter interaction shows an enhanced SNR, thus, the ANN computing performance can be reliable in the long term. Under fixed PCtl of 8 mW, we switched the Δτ between 0 ps and 5.47 ps periodically and measured the ΔT responses in 50 cycles, the system can maintain stable endurance performance (Supplementary Fig. 17b). Because the ΔT relaxation is generally fitted with exponential equations of Δτ 45, the logarithm of ΔT was applied to encode the normalized weight (\({{\mathrm{ln}}}(\Delta T)\propto w\)), leading to an approximately linear relationship between weight and Δτ. Based on the experimental results in Fig. 3a, the Δτ in the range of 0–5.47 ps was encoded into 20 weight levels (Fig. 3b), which also determined the maximum operation time of ~ 5.47 ps in one cell. For the potentiation and depression process, multilevel ΔT states were measured for the encoded Δτ values under different PCtl, in which adjacent ΔT states can be distinguished clearly (Fig. 3c, d and Supplementary Fig. 17c–j).

Fig. 3: Optical ANN in the system.
figure 3

a The stable and symmetric ΔT potentiation and depression in 8 cycles under different control powers. In each cycle, the control time delay was continuously decreased from 5.47 to 0 ps in the first half cycle and then increased to 5.47 ps in the second half cycle. b The approximately linear relationship between weight and Δτ. Based on the experimental results in (a), we encoded Δτ in the range of 0 to 5.47 ps into 20 weight levels. c, d The multilevel ΔT states for the potentiation (c) and depression (d) process under different control powers. The inset shows the minimum 5 ΔT states under \({P}_{{Ctl}}=1 \, {{{\rm{mW}}}}\), indicating adjacent ΔT states can be distinguished. e The training accuracy and cost in 30 epochs for a single layer ANN to classify a custom training dataset. f The experimental and simulated output vector components for the custom testing dataset. g The average output vector components for the true label H, U, S, and T in both experiment and simulation. h The testing confusion matrix in experiment and simulation for the custom testing dataset. i The training accuracy and cost in 100 epochs for a single layer ANN to classify the MNIST training dataset. j The testing confusion matrix in simulation for the MNIST testing dataset.

Based on the encoded multilevel ΔT states, a single-layer ANN with 36 synapses was applied for pattern recognition, with SoftMax as the activation function and cross-entropy as the cost function (detailed ANN setup can be found in Supplementary Note 2)18. A custom training dataset consisted of 3 × 3 patterns for letters H, U, S, and T, encoded by PCtl for each pixel (Supplementary Fig. 18). To evaluate the noise robustness, the Gaussian noise with a mean value of zero and standard deviation of 0.2 mW is included in the training dataset. Each pattern was transformed into a 1 × 9 input vector, and after calculating the vector-by-matrix multiplication in the pure MoS2 array, the output of the network was a 1 × 4 vector \(\left[\begin{array}{cc}{\Delta T}_{1} & \begin{array}{ccc}{\Delta T}_{2} & {\Delta T}_{3} & {\Delta T}_{4}\end{array}\end{array}\right]\), and the maximum component in the output vector could acquire the recognition result. In the off-chip training process, the initial Δτ components were set to zero, which were then updated to modulate the output vector in 30 epochs (Supplementary Fig. 19), and the corresponding weights were summarized in Supplementary Fig. 20. The average output vector components for the true label H, U, S, and T in each epoch were calculated in Supplementary Fig. 21a. By comparing the true labels and the predictions (Supplementary Fig. 21b), the training accuracy reached 100%, and the cost decreased to 0.03 (Fig. 3e). To evaluate the recognition performance, predictions for a testing dataset were carried out in both experiments and simulations. Four components in the output vector were summarized in Fig. 3f for the 100 testing patterns, and the average output vector for the true label H, U, S, and T was calculated in Fig. 3g, showing the experiment results were in accordance with the simulation (testing details in Supplementary Fig. 22). The testing accuracy was 100% and 92% in simulation and experiment, respectively. The confusion matrix for the testing results in the experiment and simulation is shown in Fig. 3h. We emphasize that the signal-detecting error exhibited less influence on the experimental testing accuracy since the SNR is large enough to be distinguished by our system. In addition, the off-chip trained weight values were used for the experimental testing. It is normal there exists a deviation from actual on-chip trained weight values, and this is also a reason for the lower testing accuracy in the experiment. It should be noted that the testing procedure was directly carried out based on the detected ΔT signals, which included an intrinsic noise level of ~ 5% and the experimental results can indicate the noise robustness. Besides, we also used the MNIST dataset to further evaluate the applicability of our system for optical neuromorphic computing in simulation (details in Supplementary Note 3). We built a one-layer neural network following the system setup. The training accuracy and cost in 100 epochs are shown in Fig. 3i. After testing, the testing accuracy is 87.07%, and the testing confusion matrix is shown in Fig. 3j. Therefore, we conclude that the system can nicely perform classification within an optical ANN. Based on the experimental setup, the operation energy consumption for one multiplication calculation with the three pulses after ΔT accumulation was between ~ 5.12−9.12 pJ. The performance is compared to technologies implementing electronic or photonic schemes in Supplementary Table 1.

Digital processing in the system

Through the synergistic encodings of the 2D cells and the excitation pulses, digital processing functions were designed by implementing the ΔT-thickness correlation directly. The correlation ensures a higher SNR at relatively weak illumination, which is beyond the reach of the conventional pump-probe setup. In addition, etching the thickness of 2D flakes is simpler compared to the fabrication of external electronic circuits or photonic structures. As schematically shown in Fig. 1g, the control pulse power and control time delay encode the input bits, and by choosing cells with appropriate thickness as input ports, the output ΔT is summarized externally to encode different functions. The pump time delay was fixed at 0. Here we chose 6 cells in a patterned MoS2 array with different thicknesses (optical image in Supplementary Fig. 13g). The thickness dependence for each cell was measured at first, under the pump and control time delays of zero and the pump-probe powers of \({P}_{{{\rm{Pump}}}}={P}_{{{\rm{Probe}}}}=2\, {{{\rm{mW}}}}\). When the control power was increased from 0 mW to 7 mW, the ΔT mappings of each cell were summarized in Fig. 4a, showing the highest SNR of 129.4. The average ΔT under increasing control power was calculated in Fig. 4b, with the gradient \(\frac{d\Delta T}{{\mbox{d}}{P}_{{Ctl}}}\) determined by the cell thickness. Four logic functions (AND, OR, NAND, and NOR) were designed with different input encoding schemes and selected cells (Fig. 4d), and the truth table is summarized in Fig. 4c. The output threshold was encoded at 60 cts/s. When the control pulse delay was fixed at 0 ps, and the input bit 1 (bit 0) was encoded by a control pulse power of 3 mW (0 mW), AND (OR) logic can be achieved by using cell 3 and 4 (cell 5 and 6). The output ΔT of cells 3 and 4 are both below the threshold, and the summed ΔT is above the threshold, encoding, AND logic (Fig. 4e). The output ΔT of cells 5 and 6 are both above the threshold, therefore, encoding OR logic (Fig. 4e). When the control pulse power was fixed at 3 mW, and the input bit 1 (bit 0) was encoded by a control pulse delay of 4 ps (0 ps), NAND (NOR) logic can be achieved by using cell 5 and 6 (cell 3 and 4). A larger delay suppresses the output ΔT, therefore, the summed ΔT of cells 5 and 6 is above the threshold when at least one input bit is 0, achieving NAND logic (Fig. 4e). The summed ΔT of cells 3 and 4 is above the threshold when both input bits are 0, achieving NOR logic (Fig. 4e). In addition, based on the same encoding scheme of AND/OR logic, a 4-bit optical DAC was demonstrated by choosing cells 1, 3, 4, and 6 as the input ports, and the summed ΔT output directly converted each 4-bit digit into an analog value, as shown in Fig. 4f.

Fig. 4: Digital processing in the system.
figure 4

a ΔT mappings under different control powers in 6 cells with different thicknesses. Scale bars, 100 nm. b The average ΔT under different control powers in 6 cells, the gradient \(\frac{d\varDelta T}{{\mbox{d}}{P}_{{Ctl}}}\) is determined by the cell thickness. The control pulse delay was 0 for (a) and (b). c Truth table of logic functions. d The input signal encodings and selected input cells for different logic functions. For AND and OR logic, the control pulse delay is 0, and input bits are encoded by the control pulse power. For NAND and NOR logic, the control pulse power is 0, and input bits are encoded by the control pulse delay. e The output ΔT of the system can achieve different logic functions based on the setup scheme. The red dashed line indicates the defined ΔT threshold. f Based on the same encoding scheme of AND and OR logic, the output ΔT in cells 1, 3, 4, and 6 can form an optical 4-bit DAC.

Discussion

In the free-space optical neuromorphic computing concept, the synergistic transition is the key to function designs, and this requires sample-independent ΔT modulation, which is evaluated in more samples (Supplementary Figs. 2326). The results also indicate that the computing mechanism and setup are reproducible and reliable. Except for the general results for digital processing and neuromorphic computing, special phenomena at the flake edge and wrinkles are also discussed in Supplementary Figs. 27, 28, and Supplementary Note 4, 5, and a deeper understanding of ΔT responses in MoS2 will enrich new computing mechanisms.

The free-space optical neuromorphic computing system based on a pure 2D cell array enables computing functions with fast speed, low power consumption, large SNR, and simplified architecture, relieving the restrictions of peripheral electronic circuits, complex nanocavities, and meta-structures. In addition, the synergistic encodings of the 2D cells and the excitation pulses also show higher tunability to program different computing functions. Besides, the excited states in 2D materials can be transferred into electronic signals in situ, assisting the matching between optical and electronic modules for further integration in free space. As a physics-aware neural network platform, free-space optical neuromorphic computing in 2D cell arrays is feasible for analog signal processing in the real world. For example, combined with the excellent photon detection characteristics of 2D materials, this platform can function as a smart sensor to detect and pre-process optical information simultaneously.

For array fabrication, thickness-controllable assembly of 2D films can be realized through bottom-up CVD growth46 and top-down etching47. Recent progress in the robotic assembly also opens a new way to fabricate an array of 2D flakes48. In the free-space neuromorphic computing system with controllable cell thicknesses, a large-scale array can be fabricated by patterning large-area 2D film directly. The unit size can be narrowed down to the laser spot size4. Thus, a higher cell integration density can be achieved. The large-scale high-density 2D flake array can be inserted on the light path of free-space optical systems directly for computing. For functionalization, vector-by-matrix multiplication was carried out by scanning the laser focusing spot over the whole array, this setup compromises the intrinsic fast speed of our concept, but the pump-probe-control process is still fast4. The solution is integrating an array of laser sources to modulate and read out signals simultaneously, then we can fully apply the fast speed and high parallelism. Except for multilayer MoS2, the intrinsic nonlinear responses in many 2D materials are advantageous1,2,3, and more efforts are needed to develop new free-space applications in 2D materials. This work focuses on multilayer MoS2 because the band structure of MoS2 can match the laser source parameters to realize the best modulation performance. By matching the laser photon energy with the band structure of the selected 2D material, other 2D materials are also applicable in our system setup under the same modulation methodology and the same mechanism. Although integrated devices on waveguides, ring resonators, and fibers are not the topic in this work, the key mechanism of synergistic transitions and the compatibility of 2D materials with these structures will also enrich optical neuromorphic applications. For example, a 2D flake array can be capped above a waveguide array, this structure functions as an array of semiconductor optical amplifiers (SOA). The three pulses as input signals can synergistically modulate the gain at each SOA node, and then computation can be realized with the amplified signals on waveguides. Based on the discussions in data processing and computing applications, the proposed concept can enlighten the integration of optical communication and neuromorphic computing systems.

Methods

1. Sample preparation

Bulk MoS2 crystals (hq-Graphene) were mechanically exfoliated with Nitto tape, and few-layer samples were attached to a SiO2/Si 300 nm/500 μm wafer. We selected MoS2 samples with different thicknesses under an optical microscope. Then the selected samples were attached to PVA (Aladin) films at 45 °C and transferred onto glass slides at 70 °C. The PVA film was removed in deionized water for ~ 30 min. The AFM mapping for sample D was acquired by the Digital TNSTRUments/Veeco Dimension 3000 system under tapping mode. MoS2 arrays were fabricated by etching exfoliated MoS2 flakes with Ar/SF6 (20 sccm/20 sccm) plasma directly. Because the exfoliated MoS2 flake generally has nonuniform thicknesses in different locations, each cell in the array can have different thickness after patterning. We chose cells with appropriate thicknesses for measurements.

2. Pump-probe-control strategy

The schematic of the pump-probe-control system is shown in Supplementary Fig. 1a. The dual-output femtosecond laser source (InSight DeepSee, Spectra-Physics, Newport) was operated at the frequency of 80 MHz. The pump (1040 nm, 220 fs) pulse was modulated by an acousto-optic modulator (AOM, 1205C-1, Isomet) at a frequency of 2.23 MHz. A time-delay line (TD) after AOM was used to modulate the pump time delay. Both the pump and probe (902 nm, 100 fs) pulses were combined by a dichroic mirror (DMSP1000, Thorlabs) collinearly, and filtered by a polarization-maintaining single-mode optical fiber (PM-SM-F) (LPC-08-1060-6/125-P-3.8-20AC-40-3AF, OZ Optics) within a length of 0.3 m to keep the transverse Gaussian distribution. The control pulse was frequency-doubled by the probe pulse through a beta-barium borate crystal (BBOC) (SHG@900 nm, Union Optic). The control pulse was filtered by a 25 μm pinhole (H) and a quarter-wave plate (QWP). The three pulses with matched Gaussian distribution were controlled and scanned by a galvanometer scanning lens (SL) and tube lens (TL). The objectives (OBs) with high N.A. (N.A. 1.49, UAPON 100XOTIRF, and N. A. 1.2 UPLSAPO 60XW, Olympus) were used to focus the pulses on the MoS2 samples. The pump pulse signal was transmitted through an oil condenser (OC) (N.A. 1.4, U-ACC, Olympus) and three bandpass filters (BPF) (ZET800/200 bp, ET980sp-2P8, Chroma Technology). Then the transmitted photons of the probe beam were detected by a Si PIN photodiode (S3994-01, Hamamatsu) at the frequency of 2.23 MHz. We used the original data for all discussions, with the signal unit in counts per second (cts/s). A high-speed lock-in amplifier (HF2 LI, Zurich Instruments) was implemented to demodulate the pump-probe signal for sensitive heterodyne detection. The intensity mapping results were acquired by an acquisition card (PCI 6251, National Instrument) with a frequency of 1.25 MHz.

Before carrying out measurements within the MoS2 array, the whole system was calibrated to synchronize the three incident beams. The illumination time was synchronized by calibrating the initial location of the TDs. Then we can acquire the accurate pump time delay and control time delay. The spatial location of each beam was calibrated by dichroic mirrors, then we can accurately synchronize the coupling of the three beams at the same focusing spot.

Here, we selected the pump and probe wavelengths based on the band structure of MoS2. The pump and probe photon energies were below the bandgap (\({E}_{{{\rm{pump}}}} < {E}_{{{\rm{gap}}}},\,{E}_{{{\rm{probe}}}} < {E}_{{{\rm{gap}}}}\)), thus, using a single pulse alone cannot modulate the relative transmittance (\(\Delta T\)) signals. The \({E}_{{{\rm{pump}}}}+{E}_{{{\rm{probe}}}}\) was selected to excite carriers near the band edge, thus the \(\Delta T\) modulation was relatively low. The control photon energy was much higher than the band gap (\({E}_{{{\rm{ctl}}}} > {E}_{{{\rm{gap}}}}\)) and near the high-energy nested conduction bands to assist exciting carriers in a high density-of-states (near C peak in Fig. 1b). Therefore, the \(\Delta T\) signals can be significantly enhanced and modulated due to the synergistic excitation mechanism. Compared to photonic structures, no resonant excitation is required. This computing setup and function encoding schemes are mainly determined by the modulation of absorption in MoS2, rather than the geometry of the MoS2 array. Therefore, our system can realize various computing by simply changing the laser pulse setup. This is different from the design of meta-structures or other optical structures, whose function is related to the geometry structure and inverse design is necessary. Here, inverse design is not needed for our scheme.

3. Setups of ΔT modulation

Four modulation methods of control pulse state, pulse power, pulse delay, and flake thickness, are proposed to encode different functions. The experimental setup is schematically explained in Supplementary Fig. 1b. For control pulse state switching, a baffler is used on the light path of the control pulse to control the transmission (on state) or blockage (off state). The pulse power is modulated by controlling the laser source output. The delay time is modulated by mechanically moving the delay line on the light path of the control pulse. For thickness control, the focusing spot of the laser pulse is turned to select a different cell with a different thickness. Here, we used the exfoliated MoS2 flake with nonuniform thickness for array patterning. Therefore, each cell in the array has a different thickness. Appropriate cells are selected in the same means to encode computing functions.

The key mechanism of synergistic transitions from two-photon absorption (TPA) to synergistic excited states absorption (SESA) is characterized in Fig. 2. Figure 2a shows a large \(\Delta T\) enhancement by controlling the on/off state of the control pulse. In this setup, the pump and probe power is \({P}_{{{\rm{Pump}}}}={P}_{{{\rm{Probe}}}}=5\, {{{\rm{mW}}}}\), the on-state power of the control pulse is \({P}_{{{\rm{Ctl}}}}=5.5\, {{{\rm{mW}}}}\), the off-state power of the control pulse is zero, and the pump and control time delay are zero. Figure 2b shows the \(\Delta T\) vs. pump time delay under different control power to reveal the signal enhancement is mainly attributed to the control pulse. In this setup, the pump and probe power are \({P}_{{{\rm{Pump}}}}={P}_{{{\rm{Probe}}}}=4\, {{{\rm{mW}}}}\), the control power is increased from 0 mW to 8 mW, the control time delay is zero. Figure 2c shows the linearly enhanced \(\Delta T\) vs. pulse power for three pulses. In this setup, the pump and control time delay is zero, and only one pulse power is monotonically increased from 0 mW to 7 mW, while the other two pulse powers are fixed at 2 mW. Figure 2d shows the \(\Delta T\) vs. pulse delay to reveal the prolonged relaxation. In this setup, the pump, probe, and control power are \({P}_{{{\rm{Pump}}}}={P}_{{{\rm{Probe}}}}={P}_{{{\rm{Ctl}}}}=2\, {{{\rm{mW}}}}\). When the control pulse is off, the red dots show the fast decay of \(\Delta T\) vs. pump time delay, here the control time delay is zero. When the control pulse is on, the blue dots show the prolonged relaxation of \(\Delta T\) vs. control time delay, here the pump time delay is zero. Under synergistic excitation, we also noted the response started at an earlier negative control time delay of ~ − 534 fs. Figure 2f, g shows the ΔT vs. thickness under a pump time delay of 82 fs. In this setup, the pump, probe, and control power are \({P}_{{{\rm{Pump}}}}={P}_{{{\rm{Probe}}}}=2\, {{{\rm{mW}}}},\,{P}_{{{\rm{Ctl}}}}=5.5\, {{{\rm{mW}}}}\), and the control time delay is zero.