Voltage-controlled magnetoelectric devices for neuromorphic diffusion process

Cheng, Yang; Shu, Qingyuan; Lee, Albert; He, Haoran; Zhu, Ivy; Chen, Minzhang; Chen, Renhe; Wang, Zirui; Zhang, Hantao; Wang, Chih-Yao; Yang, Shan-Yi; Hsin, Yu-Chen; Shih, Cheng-Yi; Lee, Hsin-Han; Cheng, Ran; Wang, Kang L.

doi:10.1038/s41467-025-58932-x

Download PDF

Article
Open access
Published: 30 May 2025

Voltage-controlled magnetoelectric devices for neuromorphic diffusion process

Yang Cheng¹,
Qingyuan Shu¹,
Albert Lee¹,
Haoran He¹,
Ivy Zhu²,
Minzhang Chen¹,
Renhe Chen³,
Zirui Wang⁴,
Hantao Zhang ORCID: orcid.org/0000-0001-6281-6708⁵,
Chih-Yao Wang⁶,
Shan-Yi Yang⁶,
Yu-Chen Hsin⁶,
Cheng-Yi Shih⁶,
Hsin-Han Lee⁶,
Ran Cheng ORCID: orcid.org/0000-0003-0166-2172^4,5 &
…
Kang L. Wang ORCID: orcid.org/0000-0002-9363-1279¹

Nature Communications volume 16, Article number: 5022 (2025) Cite this article

5301 Accesses
10 Citations
Metrics details

Subjects

Abstract

Neuromorphic diffusion models have become one of the major breakthroughs in the field of generative artificial intelligence. Unlike discriminative models that have been well developed to tackle classification or regression tasks, diffusion models aim at creating content based upon contexts learned. However, the more complex algorithms of these models result in high computational costs using today’s technologies. Here, we develop a spintronic voltage-controlled magnetoelectric memory hardware for the neuromorphic diffusion process. The in-memory computing capability of our spintronic devices goes beyond current Von Neumann architecture, where memory and computing units are separated. Together with the non-volatility of magnetic memory, we can achieve high-speed and low-cost computing, which is desirable for the increasing scale of generative models in the current era. We experimentally demonstrate that the hardware-based true random diffusion process can be implemented for image generation and achieve comparable image quality to software-based training as measured by the Fréchet inception distance (FID) score, achieving ~10³ better energy-per-bit-per-area over traditional hardware.

Noise resilient leaky integrate-and-fire neurons based on multi-domain spintronic devices

Article Open access 19 May 2022

Neuromorphic scaling advantages for energy-efficient random walk computations

Article 14 February 2022

Thermally-robust spatiotemporal parallel reservoir computing by frequency filtering in frustrated magnets

Article Open access 10 October 2023

Introduction

Diffusion processes are ubiquitous. In human brains, the membrane potential of neurons is affected by stochastic noise^1,2,3, which can be described by the Langevin equation in neuro field theory (Fig. 1a). Inspired by Langevin dynamics, denoising diffusion probabilistic models (DDPM, or diffusion model)⁴ were proposed and have become one of the major breakthroughs in deep learning over the last few years. One crucial aspect of DDPM is the construction of a diffusion process, which involves applying Gaussian noise sequentially to the data until it reaches an isotropic Gaussian distribution (Fig. 1b). While the diffusion model has demonstrated significant potential in applications such as image generation, data recovery, and inpainting, it encounters constraints related to computing speed and energy efficiency. Like other advanced models such as ChatGPT, the parameter space of today’s deep learning algorithms has drastically increased from million to trillion⁵ to tackle more and more complex demands. Such large-scale brain-inspired neuromorphic computing cannot be efficiently implemented using the conventional Von Neumann architecture computers^6,7, where storage and computing units are separated, leading to tremendous energy and latency costs in shuttling data back and forth.

Spintronic devices offer a natural solution to the limitations of current computing hardware in supporting neuromorphic computing algorithms. The bits “1” and “0” are represented by the spin-up and spin-down states in magnetic materials that can be integrated with the complementary metal-oxide-semiconductor (CMOS) back-end-of-line (BEOL) process with high-volume production tools. The non-volatility of the magnetic material allows for low energy consumption in compact data storage. The manipulation of the two spin states via an electric field or current provides the capability of in-memory computing that goes beyond the conventional Von Neumann computational paradigm. This is particularly ideal for neuromorphic computing, which takes inspiration from the human brain. Some spintronic devices have been proposed as artificial synaptic coupling amongst neurons, nonlinear activation functions, and reservoir layers^{8,9,10,11,12,13}. Simple tasks such as pattern and vowel recognition using a few-layer fully connected neural network or recurrent neural network (RNN) have been demonstrated^{14,15,16,17,18,19,20,21,22}. Compared to the above discriminative models that only draw boundaries by dividing the data space into different classes, generative models that aim at understanding how data are embedded into the space are more difficult to learn but more important for advanced artificial intelligence^23,24. However, using energy-efficient spintronic devices to tackle the more complex generative tasks has not yet been achieved.

In this article, we report the use of a CMOS-integrated voltage-controlled magnetoelectric random access memory (MeRAM, or VC-MRAM) for the diffusion process. We demonstrate that the switching probability of spin states can be tuned by changing the voltage pulse width and magnitude, via the voltage-controlled magnetic anisotropy (VCMA) effect in our on-chip fabricated magnetic tunneling junctions (MTJ). By sequentially applying a pulse train to an MTJ, the spin state will be updated accordingly and form a Markov chain. Combining multiple such MTJs to an MeRAM array with assigned integer and fraction bit-widths, we can achieve a desirable complex diffusion process. We show that the highly energy efficient MeRAM-based stochastic diffusion process can be successfully implemented into DDPM for image generation. The step-by-step evolution of the MeRAM readout emulates the change of image pixel value under a Markov process in the diffusion model. The quality of generated images matches that of a software-trained model measured by FID score²⁵.

The building block of data storage for MeRAM is an MTJ where it has two magnetic layers and an MgO insertion in between²⁶, as shown in Fig. 1c. The magnetization state of the reference layer (m_ref) is fixed, whereas that of the free layer (m_free) can be manipulated by the gate voltage applied across the MgO barrier through the VCMA effect. The physical origin of the VCMA effect relates to the modulation of the carrier density at the interface or the electric-field-induced changes of the orbital magnetic moment^27,28,29. Figure 1d illustrates how the voltage pulse width and magnitude affect the switching probability of the free layer. Suppose that the initial state is where m_free is in parallel with m_ref (P state or bit 0) and an in-plane magnetic field is applied, under a small gate voltage V_g (V_g < V_c), the VCMA effect does not cancel the perpendicular anisotropy (PMA) in the free layer. This makes the equilibrium position of m_free close to the P state. Therefore, there is only a small chance that the m_free can be switched to be antiparallel to m_ref (AP state or bit 1) due to thermal fluctuations after turning off the pulse. When V_g is large enough to compensate PMA (V_g > V_c), the equilibrium position of m_free is in plane along the external field direction. The m_free undergoes a damped oscillation towards in-plane^26,30. Then the switching probability also oscillates depending on the relative position of m_free when V_g is off. At the first 1 ns, the switching probability increases to approach a near 100% deterministic switching. However, when the pulse is long enough to make m_free in-plane, the switching rate would stay around 50% as pure thermal random switching (See Methods for the details of the simulation). Given the deterministic switching at a short pulse width and a 50% switching rate for a long pulse width, MeRAM has been demonstrated as a promising candidate for high energy efficient non-volatile memory (NVM) as well as a true random number generator in probabilistic computing^{31,32,33,34,35}. However, its sequential stochastic generation capability has not been explored. In this work, combining the low cost of NVM with its tunable switching rate, we achieve an in-memory Markov process using MeRAM³⁶. As shown in Fig. 1e, when a pulse train with different gate voltage and width for each pulse is applied, the state of MTJ continuously changes and only depends on the previous state and the applied pulse, forming a stochastic diffusion process.

VC-MTJ properties

We first characterize our MeRAM device consisting of a single MTJ. On-chip MTJs with a diameter of 100 nm are fabricated with the full stack on an 8” wafer shown in Fig. 2a (See Methods for details of fabrication and structure characterization). As the reference layer is pined by a synthetic antiferromagnet layer, the VCMA effect at MgO/CoFeB free layer interface modifies the PMA energy density (K_u). (See Supplementary Note 3 for the details about measurements of K_u in the free layer). A VCMA coefficient of 40 fJ/Vm is extracted by a linear fit of K_u to the electric field^37,38. The read out of MTJ states is made through measuring the tunnel magnetoresistance (TMR) between the bit line and word line. Figure 2b shows the out-of-plane field dependence TMR measurement of our MTJ device. The external field switches the free layer between P and AP states, where the AP state has a high magnetoresistance due to the mismatch of majority and minority spin channels³⁹. Our measurement shows a 220% on/off ratio (${R}_{{AP}}/{R}_{P}$) between the two states. To demonstrate the tunability of the switching rate, we perform electric-field-induced switching measurements. Figure 2c shows the obtained switching probability using voltage pulses with various lengths and amplitudes. A pulse train is applied to the device and the MTJ states are recorded using a fast oscilloscope, as shown in Fig. 2d, e (See Methods for the experiment setup). Then the switching probability can be extracted by counting the number of AP- > P and P- > AP states. When V_g is 2.1 V which is below the critical voltage V_c of 2.4 V, a low switching probability is observed. When V_g = V_c, the switching probability saturates at 50% as an indication of thermal fluctuation induced stochastic switching (Details of the switching profile at 0.4 ns and 2 ns are shown in Fig. 2d, e). When V_g is 2.7 V which is above V_c, voltage-induced precessional motion of magnetization leads to the damped oscillation of switching probability, which eventually ends up as 50%. This is consistent with our numerical simulation as shown in Fig. 1d.

**Fig. 2: Characterization of voltage-controlled MTJ.**

MeRAM-based Gaussian noise generation

For practical use, we need a MeRAM unit consisting of multiple MTJs, with each pulse train updating the states of the MTJ as a Markov chain. Figure 3a shows a MeRAM unit with eight MTJs connected by the bit line. There are two integer bit-width and six fraction bit-width with assigned digit values. The read out of MeRAM by the post processing unit gives ${A}_{i}=\mathop{\sum }\limits_{{bit}=1}^{8}{2}^{({bit}-7)}\cdot {{MTJ}}_{i}\,({bit})$, where A_i ranges from 0 to $\frac{255}{64}$. The distribution of A_i can be obtained by

$$\left(\begin{array}{c}{P}_{{A}_{i}=0}\\ .\\ .\\ .\\ {P}_{{A}_{i}=\frac{255}{64}}\end{array}\right)=M\left(\begin{array}{c}{P}_{{A}_{i-1}=0}\\ .\\ .\\ .\\ {P}_{{A}_{i-1}=\frac{255}{64}}\end{array}\right)$$

(1)

$$M=\left(\begin{array}{c}1-{{P}_{P\to {AP}}}^{(1)}\\ {{P}_{P\to {AP}}}^{(1)}\end{array}\begin{array}{c}{{P}_{{AP}\to P}}^{(1)}\\ 1-{{P}_{{AP}\to P}}^{(1)}\end{array}\right)\bigotimes ...\bigotimes \left(\begin{array}{c}1-{{P}_{P\to {AP}}}^{(8)}\\ {{P}_{P\to {AP}}}^{(8)}\end{array}\begin{array}{c}{{P}_{{AP}\to P}}^{(8)}\\ 1-{{P}_{{AP}\to P}}^{(8)}\end{array}\right)$$

(2)

**Fig. 3: Illustration of MeRAM array for diffusion process.**

M is the Markov matrix defined by the Kronecker product of eight single transition matrices for each individual MTJ. Further, we need to calculate the distribution of $\varepsilon$, where $\varepsilon (i)={A}_{i}-{A}_{i-1}$ generated by our MeRAM to match the desired neuromorphic diffusion process. In the diffusion model, $\varepsilon$ is taken as the noise added to the data, which ideally follows a Gaussian distribution (See Supplementary Note 4 for derivation). Working backwards from the target distribution, we can extract the voltage pulse widths and magnitudes on each MTJ in the MeRAM unit. In our experiments, 2.4 V gate voltage is applied with a 0.4 ns pulse width to the leading MTJ (Most Significant Bit, or MSB) while 2 ns pulse for the remaining MTJs. Figure 3b shows the distribution of sampled 10,000 and 40,000 $\varepsilon$ values from the pulse trains. With a larger sample size, the distribution of $\varepsilon$ gets closer to a standard Gaussian as expected. As a proof-of-concept, the 8-bit MeRAM array allows the sampling of standard Gaussian noise in a range of 4σ. More bits or arrays can increase the range and granularity of the generated noise (See Supplementary Note 4 for details). Compared with a conventional CMOS-based pseudo random number generator which needs extra bias generators to achieve tunable switching probability, our voltage-controlled MTJ saves 80% of energy and has ~10³ higher figure of merit (FOM) (See Supplementary Note 5), in addition to having true randomness.

Image generation tasks in CMOS-integrated MeRAM array

With the demonstrated capability of generating Gaussian noise using our VC-MTJ devices, we implement it in DDPM for our image generation task. We first illustrate with a simple letter pattern learning and generation task using our CMOS-integrated MeRAM array with 80 $\times$ 80 VC-MTJs. This marks the first time a MeRAM device has been integrated with the 180 nm CMOS process⁴⁰. As shown in Fig.3c, DDPM contains two parts, a forward diffusion process and a reverse diffusion process. In the forward diffusion process, Gaussian noises are added to the training data (pixel data ${{\bf{x}}}_{{\boldsymbol{0}}}$) in T steps sequentially, with the variance increasing with each step t. When T is large enough, ${{\bf{x}}}_{{\boldsymbol{T}}}$ is subject to a Gaussian distribution. In the reverse diffusion process, starting from pure Gaussian noise ${{\bf{x}}}_{{\boldsymbol{T}}}$, pixel distribution ${{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}}$ is learned by comparing the known posterior distribution$q({{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}}|{{\bf{x}}}_{{\boldsymbol{t}}})$ to the predicted distribution ${p}_{\theta }({{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}}|{{\bf{x}}}_{{\boldsymbol{t}}})$. Using the variational inference method, the Kullback–Leibler divergence (KL-divergence) of $q\left({{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}},|,{{\bf{x}}}_{{\boldsymbol{t}}}\right)$ and ${p}_{\theta }({{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}}|{{\bf{x}}}_{{\boldsymbol{t}}})$ simplifies to the prediction of the noise distribution from ${{\bf{x}}}_{{\boldsymbol{t}}}$ to ${{\bf{x}}}_{{\boldsymbol{t}}-{\boldsymbol{1}}}$. This can be achieved by introducing a convolutional neural network-based U-Net. We repeat the above process in one epoch and perform multiple iterations (epochs) to improve the performance of the diffusion model. To generate images, we use the trained noise in the reverse diffusion process to sample a new denoised image ${{\bf{x}}}_{{\boldsymbol{0}}}^{\prime}$ step by step from ${{\bf{x}}}_{{\boldsymbol{T}}}$. In our experiment, we choose to use a total step T of 1000. In our MeRAM unit, we utilize a set of 40,000 random numbers sampled from our device as the noise dataset, applying it in both the training phase (Forward Diffusion) and the generation phase (Reverse Diffusion). This involves changing the states of VC-MTJs by applying voltage to the MeRAM array. In the images, each pixel corresponds to one VC-MTJ, with dark blue and light blue representing a P state and an AP state, respectively. Here, x_t is the coordinate of P states, and the trajectories of P states (change from x_t to x_t+1) follow a Gaussian diffusion process⁴¹. We have separately trained the array on different letter patterns such as “U”, “C”, “L”, and “A”, and subsequently generated new patterns using the trained models. The results align with our expectations, showcasing the effective pattern generation capabilities of our device.

Performance of MeRAM-based generative diffusion model

To evaluate the performance of our MeRAM-based generative diffusion model, we utilize the CelebFaces Attributes (CelebA-HQ) dataset⁴² with 64 × 64 resolution images for training. We follow the original DDPM setup, where noise is added to each pixel value (x_t) to learn the connections prior to image generation. Figure 4a illustrates the image generation process across different training epochs. Notably, after 100 epochs, both the MeRAM-based and the fully software-based diffusion models produce high-quality images. We quantify image quality using the FID score, where lower scores indicate better quality. We compare the images generated using the software-based and our MeRAM-based diffusion process across various numbers of training epochs (See Supplementary Information for details of DDPM implementation). As shown in Fig. 4b, despite the slight deviations of the MeRAM-generated noise from a strictly Gaussian distribution, the improvements in image quality with an increasing number of epochs occur at the same rate for both the software-based and MeRAM-based diffusion models, but with much lower energy consumption.

**Fig. 4: Performance of imaging generation task using MeRAM based diffusion model.**

Discussion

In conclusion, we have demonstrated the use of voltage-controlled MeRAM for the neuromorphic diffusion process. Notably, we have successfully accomplished high-quality image generation using an integrated CMOS MeRAM chip for the first time. Our work goes beyond traditional discriminative models, implementing MeRAM to advance the state-of-the-art generative diffusion models. In the context of DDPM, the precise and controlled generation of noise is a central component. By enhancing noise generation efficiency using MeRAM, we are able to reduce the iterative burden typically associated with these models, leading to a much more streamlined and energy-efficient process. Furthermore, MeRAM overcomes the limitations of conventional STT-based MRAM, which faces higher energy consumption, limited endurance, and reduced speed due to incubation delay⁴³. Additionally, while STT and SOT-MRAM requires a low energy barrier to function as a probabilistic bit (p-bit)—compromising retention time—VCMA-based MTJ devices in offer the unique flexibility to serve as both p-bits and memory cells, making it particularly suitable for large-scale neuromorphic probabilistic computing applications^44,45. We believe our spintronics hardware could overcome one of the biggest challenges in the current era of neuromorphic computing—the gap between the increasing complexity of algorithms and the computing platform that remains in von Neumann architecture.

Methods

Fabrication and characterization of MTJ

Magnetic multilayer stacks are grown on an 8-inch CMOS backend wafer by sputtering, followed by annealing at 360 °C for 20 min, which is compatible with standard CMOS backend processing. MTJ is defined by E-beam lithography and its diameter is 100 nm. The standard MTJ fabrication process is performed on the whole wafer. Transmission electron microscopy and scanning electron microscope are used to characterize the growth and fabrication of the MTJ stack, as shown in Supplementary Fig. 1. For the switching probability measurements, we use a GMW 5201 Projected Field Electromagnet driven by a Kepco Bipolar Operational Power Supply to apply an external in-plane magnetic field 390 Oe. A Keithley 2636A source meter is used for (a) triggering a Tektronix PSPL10050A Programmable Pulse Generator to generate voltage pulses over the MTJ device; (b) applying a small constant voltage across the MTJ for resistance readout. This voltage is applied across a constant series resistance that serves as a voltage divider and is injected to the MTJ through the DC input of a Bias Tee. To collect the data, an Agilent MSO7014B oscilloscope is connected across the MTJ. The switching behavior is reflected by the voltage fluctuation recorded by the oscilloscope.

Simulation of switching probability under different voltage

Macrospin simulations are performed to obtain the switching probability in the thermal activation regime, namely, when the voltage is slightly below the critical voltage that removes the barrier completely. In the simulation, we numerically integrate the Landau-Lifshitz-Gilbert equation using Matlab

$$\dot{{\boldsymbol{m}}}=-{\gamma }_{0}{\boldsymbol{m}}\times ({{\boldsymbol{H}}}_{{eff}}+{\boldsymbol{h}})+\alpha {\boldsymbol{m}}\times \dot{{\boldsymbol{m}}}$$

(3)

where the effective field ${{\boldsymbol{H}}}_{{eff}}=-{\partial }_{{\boldsymbol{m}}}w/{{\mu }_{0}M}_{s}$ is obtained from the gradient of the energy density profile $w$ containing the PMA energy and Zeeman energy

$$w={K}_{{eff}}(V)\left(1-{m}_{z}^{2}\right)-{{\mu }_{0}M}_{s}{H}_{x}{m}_{x}$$

(4)

The effective anisotropy ${K}_{{eff}}\left(V\right)$ is controlled by the applied voltage across MgO via the VCMA effect. For sufficiently large ${K}_{{eff}}$, the energy landscape has two energy minima corresponding to the two PMA states. To investigate the switching probability, we include a white noise as the thermal random field ${\boldsymbol{h}}$. Following Brown’s derivation⁴⁶,

$$\left\langle {h}_{i}(t)\right\rangle=0,\,\left\langle {h}_{i}\left(t\right){h}_{j}(t+\tau )\right\rangle=\mu {\delta }_{{ij}}\delta (\tau )$$

(5)

where $\mu=\frac{2{k}_{B}T\alpha }{{\gamma }_{0}{\mu }_{0}{M}_{s}V}$ is used to satisfy the thermal equilibrium condition. Numerically, we follow Scholz’s approach and use the 2nd order Heun’s method to integrate the dynamics⁴⁷. Adopting this approach ensures a good balance between numerical stability and complexity.

During the simulation, we initialize the spin in the absence of an external voltage so that the state will evolve from one of the two strong PMA states. Then, we turn on a voltage pulse with sharp edges. The PMA energy term is instantly modified, thereby affecting the dynamics through the effective field. The switching event is determined if the state is trapped in the other PMA state a few nano-second after the voltage is turned off. The switching probability ((P_AP->P + P_P->AP)/2) is obtained by counting the number of the switching events among 150 trials.

Data availability

All data needed to evaluate the conclusions in the paper are available within the article and its Supplementary Information files. All data generated during the current study are available from the corresponding author upon request.

References

Coombes, S., beim Graben, P., Potthast, R. & Wright, J. Neural Fields: Theory and Applications (Springer, 2014).
Bressloff, P. C. & Webber, M. A. Front propagation in stochastic neural fields. SIAM J. Appl. Dyn. Syst. 11, 708–740 (2012).
Article MathSciNet Google Scholar
Gerstner, W. & Kistler, W. M. Spiking Neuron Models: Single Neurons, Populations, Plasticity (Cambridge University Press, 2002).
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 33, 6840–6851 (2020).
Google Scholar
Brown, T. et al. Language models are few-shot learners. Adv. neural Inf. Process. Syst. 33, 1877–1901 (2020).
Google Scholar
Von Neumann, J. First draft of a report on the EDVAC. IEEE Ann. Hist. Comput. 15, 27–75 (1993).
Article MathSciNet Google Scholar
Big data needs a hardware revolution. Nature 554, 145–146, https://doi.org/10.1038/d41586-018-01683-1 (2018).
Grollier, J. et al. Neuromorphic spintronics. Nat. Electron. 3, 360–370 (2020).
Article Google Scholar
Sharad, M., Augustine, C., Panagopoulos, G. & Roy, K. Spin-based neuron model with domain-wall magnets as synapse. IEEE Trans. Nanotechnol. 11, 843–853 (2012).
Article ADS Google Scholar
Siddiqui, S. A. et al. Magnetic domain wall based synaptic and activation function generator for neuromorphic accelerators. Nano Lett. 20, 1033–1040 (2019).
Article ADS Google Scholar
Pinna, D., Bourianoff, G. & Everschor-Sitte, K. Reservoir computing with random skyrmion textures. Phys. Rev. Appl. 14, 054020 (2020).
Article ADS CAS Google Scholar
Raymenants, E. et al. Chain of magnetic tunnel junctions as a spintronic memristor. J. Appl. Phys. 124, 152116 (2018).
Kumar, A. et al. Multistate compound magnetic tunnel junction synapses for digital recognition. ACS Appl. Mater. Interfaces 16, 10335–10343 (2024).
Article CAS PubMed Google Scholar
Song, K. M. et al. Skyrmion-based artificial synapses for neuromorphic computing. Nat. Electron. 3, 148–155 (2020).
Article Google Scholar
Romera, M. et al. Vowel recognition with four coupled spin-torque nano-oscillators. Nature 563, 230–234 (2018).
Article ADS CAS PubMed Google Scholar
Torrejon, J. et al. Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431 (2017).
Article CAS PubMed PubMed Central Google Scholar
Yokouchi, T. et al. Pattern recognition with neuromorphic computing using magnetic field–induced dynamics of skyrmions. Sci. Adv. 8, eabq5652 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Raab, K. et al. Brownian reservoir computing realized using geometrically confined skyrmion dynamics. Nat. Commun. 13, 6982 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, S. et al. Bayesian neural networks using magnetic tunnel junction-based probabilistic in-memory computing. Front. Nanotechnol. 4, 1021943 (2022).
Article Google Scholar
Zhang, D. et al. All spin artificial neural networks based on compound spintronic synapse and neuron. IEEE Trans. Biomed. Circuits Syst. 10, 828–836 (2016).
Article PubMed Google Scholar
Rzeszut, P. et al. Multi-state MRAM cells for hardware neuromorphic computing. Sci. Rep. 12, 7178 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Yang, D. et al. Spin-orbit torque manipulation of sub-terahertz magnons in antiferromagnetic α-Fe2O3. Nat. Commun. 15, 4046 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
OpenAI. Generative Models, https://openai.com/blog/generative-models/ (2016).
Jebara, T. Machine Learning: Discriminative and Generative Vol. 755 (Springer Science & Business Media, 2012).
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B. & Hochreiter, S. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems, Vol. 30 (2017).
Amiri, P. K. et al. Electric-field-controlled magnetoelectric RAM: progress, challenges, and scaling. IEEE Trans. Magn. 51, 1–7 (2015).
Article Google Scholar
Miwa, S. et al. Voltage controlled interfacial magnetism through platinum orbits. Nat. Commun. 8, 15848 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kawabe, T. et al. Electric-field-induced changes of magnetic moments and magnetocrystalline anisotropy in ultrathin cobalt films. Phys. Rev. B 96, 220412 (2017).
Article ADS Google Scholar
Duan, C.-G. et al. Surface magnetoelectric effect in ferromagnetic metal films. Phys. Rev. Lett. 101, 137201 (2008).
Article ADS PubMed Google Scholar
Grezes, C. et al. Write error rate and read disturbance in electric-field-controlled magnetic random-access memory. IEEE Magn. Lett. 8, 1–5 (2016).
Article Google Scholar
Yang, J. et al. in ESSDERC 2021-IEEE 51st European Solid-State Device Research Conference (ESSDERC). 115-118 (IEEE, 2021).
Alzate, J. G. et al. In Proc. International Electron Devices Meeting 29.25. 21-29.25. 24 (IEEE, 2012).
Raimondo, E. et al. In Proc. IEEE 24th International Conference on Nanotechnology (NANO) 326–330 (IEEE, 2024).
Liu, S. et al. Random bitstream generation using voltage-controlled magnetic anisotropy and spin orbit torque magnetic tunnel junctions. IEEE J. Explor. Solid-State Comput. Devices Circuits 8, 194–202 (2022).
Article ADS Google Scholar
Shao, Y. et al. Probabilistic computing with voltage-controlled dynamics in magnetic tunnel junctions. Nanotechnology 34, 495203 (2023).
Article CAS Google Scholar
Norris, J. R. Markov Chains (Cambridge University Press, 1998).
Shiota, Y. et al. Quantitative evaluation of voltage-induced magnetic anisotropy change by magnetoresistance measurement. Appl. Phys. Express 4, 043005 (2011).
Article ADS Google Scholar
Nozaki, T. et al. Large voltage-induced changes in the perpendicular magnetic anisotropy of an MgO-based tunnel junction with an ultrathin Fe layer. Phys. Rev. Appl. 5, 044006 (2016).
Article ADS Google Scholar
Meservey, R. & Tedrow, P. Spin-polarized electron tunneling. Phys. Rep. 238, 173–243 (1994).
Article ADS Google Scholar
Suhail, H. et al. In Proc. International Electron Devices Meeting (IEDM) 1–4 (IEEE, 2023).
Sohl-Dickstein, J., Weiss, E., Maheswaranathan, N. & Ganguli, S. In Proc. International Conference on Machine Learning 2256–2265 (PMLR, 2015).
Liu, Z., Luo, P., Wang, X. & Tang, X. In Proc. IEEE International Conference on Computer Vision 3730–3738 (IEEE, 2015).
Jacob, V. K. et al. A nonvolatile compute-in-memory macro using voltage-controlled MRAM and in-situ magnetic-to-digital converter. In Proc. IEEE Journal on Exploratory Solid-State Computational Devices and Circuits (IEEE, 2023).
Li, X. et al. Restricted Boltzmann machines implemented by spin–orbit torque magnetic tunnel junctions. Nano Lett. 24, 5420–5428 (2024).
Article ADS CAS PubMed Google Scholar
Ren, R. et al. Initialization-free and magnetic field-free spin–orbit p-bits with backhopping-like magnetization switching for probabilistic applications. Nano Lett. 24, 10072–10080 (2024).
Article CAS PubMed Google Scholar
Brown, W. F. Jr Thermal fluctuations of a single-domain particle. Phys. Rev. 130, 1677 (1963).
Article ADS Google Scholar
Scholz, W., Schrefl, T. & Fidler, J. Micromagnetic simulation of thermally activated switching in fine particles. J. Magn. Magn. Mater. 233, 296–304 (2001).
Article ADS CAS Google Scholar

Download references

Acknowledgements

The authors in University of California, Los Angeles acknowledge the support from the National Science Foundation (NSF) Award No. 1810163 and No. 2427172; This work at University of California, Riverside is supported by the Air Force Office of Scientific Research under Grant No. FA9550-19-1-0307.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
Yang Cheng, Qingyuan Shu, Albert Lee, Haoran He, Minzhang Chen & Kang L. Wang
Department of Physics, The Ohio State University, Columbus, OH, USA
Ivy Zhu
Department of Electrical and Computer Engineering, University of California, San Diego, CA, USA
Renhe Chen
Department of Electrical and Computer Engineering, University of California, Riverside, CA, USA
Zirui Wang & Ran Cheng
Department of Physics and Astronomy, University of California, Riverside, CA, USA
Hantao Zhang & Ran Cheng
Industrial Technology Research Institute, Taipei, Taiwan, ROC
Chih-Yao Wang, Shan-Yi Yang, Yu-Chen Hsin, Cheng-Yi Shih & Hsin-Han Lee

Authors

Yang Cheng
View author publications
Search author on:PubMed Google Scholar
Qingyuan Shu
View author publications
Search author on:PubMed Google Scholar
Albert Lee
View author publications
Search author on:PubMed Google Scholar
Haoran He
View author publications
Search author on:PubMed Google Scholar
Ivy Zhu
View author publications
Search author on:PubMed Google Scholar
Minzhang Chen
View author publications
Search author on:PubMed Google Scholar
Renhe Chen
View author publications
Search author on:PubMed Google Scholar
Zirui Wang
View author publications
Search author on:PubMed Google Scholar
Hantao Zhang
View author publications
Search author on:PubMed Google Scholar
Chih-Yao Wang
View author publications
Search author on:PubMed Google Scholar
Shan-Yi Yang
View author publications
Search author on:PubMed Google Scholar
Yu-Chen Hsin
View author publications
Search author on:PubMed Google Scholar
Cheng-Yi Shih
View author publications
Search author on:PubMed Google Scholar
Hsin-Han Lee
View author publications
Search author on:PubMed Google Scholar
Ran Cheng
View author publications
Search author on:PubMed Google Scholar
Kang L. Wang
View author publications
Search author on:PubMed Google Scholar

Contributions

Y.C. designed, planned and initiated the study. Q.S., H.H., and Y.C. performed the voltage-controlled switching probability measurement. A.L., R.H.C. and Z.W. performed circuit implementation simulations. I.Z. performed the DDPM training. C.Y.W., S.Y.Y., Y.C.H., C.Y.S. and H.H.L. grew and fabricated devices. H.Z. and R.C. contributed to the theoretical modeling of Markov process. Y.C. and K.L.W. supervised the project. Y.C., Q.S., A.L., H.H., I.Z., M.C., R.H.C., Z.W., and K.L.W. drafted the manuscript. All authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to Yang Cheng or Kang L. Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Peer Review File (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cheng, Y., Shu, Q., Lee, A. et al. Voltage-controlled magnetoelectric devices for neuromorphic diffusion process. Nat Commun 16, 5022 (2025). https://doi.org/10.1038/s41467-025-58932-x

Download citation

Received: 25 August 2024
Accepted: 03 April 2025
Published: 30 May 2025
Version of record: 30 May 2025
DOI: https://doi.org/10.1038/s41467-025-58932-x