Physical reservoir computing1,2,3,4,5,6,7,8,9 is an information-processing scheme that utilizes the nonlinear response of many-body systems as a computational resource. It enables the recognition and/or prediction of time-series data, such as human voice and video, and is therefore applicable to several purposes including natural language processing. Physical reservoir computing has been examined in various physical systems such as soft matter10,11, quantum matter12, and optical systems13,14,15,16,17. In the field of spintronics, it has been examined in various magnetic structures, such as spin-torque oscillators18,19,20,21,22,23,24, magnetic skyrmions25,26,27,28, voltage-controlled memories29, and spin waves30,31,32,33,34,35. Each system has interesting features. For example, spin-torque oscillators enable all-electrical manipulation of nanometer-scale devices and are thus suited to edge computing. The slow propagation speed of spin waves might contribute to a relatively long memory functionality.

Among such ferromagnet-based reservoirs, artificial spin ice (ASI) is another candidate for physical reservoir computing. ASI is a many-body system consisting of ferromagnets and exhibits frustration due to magnetic interactions such as the local exchange interaction and/or the dipole interaction36,37,38,39,40,41. The frustration, i.e., the non-uniqueness of the ground state, could potentially be exploited to distinguish complex time-dependent data. Physical reservoir computing with ASI has recently been proposed in both numerical42 and experimental43,44 studies, where the magnetic states were read out by spin-wave propagation or magnetic force microscopy images. In these ASIs, an external magnetic field serves as the input signal for computation. Recently, on the other hand, a nanometer-scale honeycomb ASI consisting of magnetic tunnel junctions (MTJs)45 was experimentally fabricated46. This MTJ-based ASI offers the feasibility of an all-electrically controlled ASI and is thus suitable for several applications such as edge computing, although so far only the reading of the magnetic states of the MTJs has been done electrically, while their manipulation is still carried out by applying an external magnetic field45. However, recognition tasks with such an MTJ-based ASI have not yet been examined.

In this work, we perform numerical simulations of the Landau-Lifshitz-Gilbert (LLG) equation for an MTJ-based honeycomb ASI and evaluate its computational ability. First, the saturation of the magnetization under an applied external magnetic field is evaluated. The saturation curve shows non-monotonic behavior at a certain magnetic field strength, which originates from the dipole interactions between the MTJs. Second, the short-term memory and parity-check capacities are evaluated as figures of merit of the computational ability. Here, the external magnetic field is used as the input signal, and its strength and applied angle are varied widely. A drastic reduction of these capacities is observed near the magnetic field strength giving the non-monotonic behavior of the magnetization saturation. This implies that the dipole interaction between the MTJs leads to a loss of echo state property. To examine this consideration, the Lyapunov exponent is also evaluated, which clarifies the presence or absence of echo state property. It is revealed that the boundary of the small-capacity region corresponds to the edge of echo state property. These results provide a procedure to achieve high computational ability in ASI: the edge of echo state property can be estimated from the saturation magnetization curve, and large memory capacities are obtained outside the edge.

Fig. 1

(a) Schematic illustration of ASI consisting of 72 MTJ cells. The input signal is an external magnetic field with strength \(H_\textrm{appl}\) and angle \(\varphi _{H}\) measured from the x axis. The symbol \(\textrm{bi}\) represents the value of the binary inputs used in the evaluation of the memory capacities. (b) Schematic illustration of one hexagon. The radii along the long and short axes are denoted as a and b, while the distance from two MTJs to the crossing point of their long axes is denoted as d. A schematic illustration of the local coordinate XYZ is also shown, where the X axis is parallel to the long axis of an MTJ. (c) Magnetization alignments for the cases of \(\varphi _{H}=0^{\circ }\) and \(60^{\circ }\).

System description

Description of artificial spin ice

In Fig. 1a, we show a schematic view of the ASI consisting of elliptical MTJs aligned in the xy plane45. The number of MTJs is \(N=72\), following the experiment46. In this work, an external magnetic field with strength \(H_\textrm{appl}\), applied in the direction with angle \(\varphi _{H}\) measured from the x axis, is used as the input for a recognition task of time-dependent data; see also the following sections. Although a previous simulation assumed the use of the spin-transfer torque effect47,48 for electrical manipulation of the magnetization states in an ASI, such manipulation is difficult to achieve because it requires a thin ferromagnetic layer in an MTJ, whereas the dipole interaction between MTJs becomes strong only when the volume of an MTJ is large; this dilemma should be solved in the future. The detection of the magnetization state via the magnetoresistance effect, on the other hand, can be experimentally achieved46; therefore, in principle, we can electrically detect the magnetization direction of each MTJ independently. Figure 1b shows a schematic view of one hexagon in the ASI. The radii of the elliptical plane along the long and short axes are denoted as a and b, while the distance from two MTJs to the crossing point of their long axes is denoted as d. In the following calculation, we assume that the radii of the ith (\(i=1,2,\cdots ,N\)) MTJ are randomly distributed around their designed values, a and b, as \(a_{i}=a\left( 1+\sigma \xi _{ai}\right)\) and \(b_{i}=b\left( 1+\sigma \xi _{bi}\right)\), where \(\xi _{ai}\) and \(\xi _{bi}\) are uniformly distributed random values in the range \(-1< \xi _{ai},\xi _{bi} < 1\) and the dimensionless parameter \(\sigma\) determines the magnitude of the randomness of the MTJs' size. This is because a dispersion of the MTJs' size is unavoidable in experiments45. The thickness of the MTJs is assumed to be common. The values of the parameters are summarized in Methods for the numerical method solving the LLG equation. We also introduce a local coordinate \(XYZ_{i}\) for the later discussion, where the \(X_{i}\) and \(Y_{i}\) axes are parallel to the long and short axes of the ith MTJ, respectively. In the following, we refer to the xyz and XYZ coordinates as the global (or experimental) and local coordinates, respectively. In the experiment46, the magnetization components of the MTJs along the x direction (in the global coordinate) are measured. The local coordinate is, however, sometimes convenient to capture the dynamical behavior of the magnetization, to define demagnetization coefficients, and so on. Therefore, we use both coordinates, depending on the situation.

Since a honeycomb structure is invariant under a \(60^{\circ }\) rotation around the perpendicular axis, one might expect the present ASI to have the same symmetry. This is, however, not true, for two reasons. First, we introduce randomness in the MTJs' size to reflect the experimental dispersion45, as mentioned above. Second, the presence of the magnetization vector breaks the structural symmetry. To clarify this point, we show the initial magnetization alignments for the cases of \(\varphi _{H}=0^{\circ }\) and \(60^{\circ }\); see Fig. 1c. Here, the magnetization directions of the MTJs are indicated by arrows. We note that the magnetization directions are common to both cases. One can, however, notice that the magnetization alignment for \(\varphi _{H}=0^{\circ }\) does not become identical to that for \(\varphi _{H}=60^{\circ }\). This is because we use a common initial condition of the magnetization alignment in the xy plane for the two cases. Owing to these facts, the results shown below will differ between a certain \(\varphi _{H}\) and \(\varphi _{H}+60^{\circ }\), although the difference may be small.

LLG equation

The magnetization dynamics are evaluated by solving the LLG equation. We denote the unit vector pointing in the magnetization direction of the ith MTJ as \(\textbf{m}_{i}\). The LLG equation for \(\textbf{m}_{i}\) is given by

$$\begin{aligned} \frac{d \textbf{m}_{i}}{dt} = -\gamma \textbf{m}_{i} \times \left( \textbf{H}_\textrm{appl} + \textbf{H}_{\textrm{shape},i} + \textbf{H}_{\textrm{dip},i} \right) + \alpha \textbf{m}_{i} \times \frac{d\textbf{m}_{i}}{dt}, \end{aligned}$$
(1)

where the values of the gyromagnetic ratio \(\gamma\) and the Gilbert damping constant \(\alpha\) are assumed to be common to all MTJs and are \(1.764\times 10^{7}\) rad/(Oe s) and 0.01, respectively. The external magnetic field \(\textbf{H}_\textrm{appl}\) is also applied commonly to all MTJs. The shape magnetic anisotropy field is

$$\begin{aligned} \textbf{H}_{\textrm{shape},i} = -4\pi M \sum _{U=X,Y,Z} N_{i,U} m_{i,U} \textbf{e}_{i,U}, \end{aligned}$$
(2)

where \(\textbf{e}_{i,U}\) (\(U=X,Y,Z\)) is the unit vector defining the local coordinate \(XYZ_{i}\) mentioned above. Apparently, the local coordinate is useful for expressing the shape demagnetization field. The demagnetization coefficient \(N_{i,U}\) along the U direction is estimated numerically, according to the formula in Ref.49, from the radii, \(a_{i}\) and \(b_{i}\), and the thickness of the ith MTJ. The value of the saturation magnetization M is 1500 emu/c.c. For an ideal MTJ (\(a_{i}=a\) and \(b_{i}=b\)), \(N_{X}=0.040...\), \(N_{Y}=0.154...\), and \(N_{Z}=0.804...\), or equivalently, \(4\pi MN_{X}\simeq 766\), \(4\pi MN_{Y}\simeq 2919\), and \(4\pi MN_{Z}\simeq 15165\) Oe. The dipole field \(\textbf{H}_{\textrm{dip},i}\) is defined as

$$\begin{aligned} \textbf{H}_{\textrm{dip},i} = \sum _{j (\ne i) = 1}^{N} \textbf{H}_{\textrm{dip},ij}, \end{aligned}$$
(3)

where \(\textbf{H}_{\textrm{dip},ij}\) is the stray magnetic field from the jth MTJ. We note that the magnitude of \(\textbf{H}_{\textrm{dip},ij}\) is at most on the order of 100 Oe for the present parameters; see also Methods for the numerical method solving the LLG equation. The initial conditions of the magnetization are the alignments shown in Fig. 1c, where each magnetization is assumed to be parallel to the long axis of its MTJ.
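As a concrete illustration of the field terms entering Eq. (1), the following is a minimal sketch of the shape anisotropy field of Eq. (2) in the local coordinate, using the ideal-MTJ demagnetization coefficients quoted above (Python is used here for readability; the actual simulation was written in Fortran):

```python
import numpy as np

M = 1500.0                                  # saturation magnetization (emu/c.c.)
N_demag = np.array([0.040, 0.154, 0.804])   # (N_X, N_Y, N_Z) of the ideal MTJ (truncated)

def shape_field_local(m_local):
    """Shape anisotropy field of Eq. (2) in the local XYZ coordinate.
    m_local: array (m_X, m_Y, m_Z) of a single MTJ."""
    return -4.0 * np.pi * M * N_demag * m_local

# for m along the long (X) axis, only the X component is finite:
print(shape_field_local(np.array([1.0, 0.0, 0.0])))
# about -754 Oe along X; the 766 Oe quoted in the text uses more digits of N_X
```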

Fig. 2

Time evolution of the magnetization of the 10th MTJ (see Fig. 1a) in the (a) global (experimental) and (b) local coordinate in the presence of an external magnetic field with \(H_\textrm{appl}=1.0\) kOe and \(\varphi _{H}=0^{\circ }\). Saturated values of \(m_{x}\) and \(m_{X}\) for various \(H_\textrm{appl}\) are summarized in (c) and (d). In (c), the magnitude \(H_\textrm{appl}\) of the external magnetic field is swept, while in (d) the initial magnetization state is reset for each \(H_\textrm{appl}\). The insets in (c) and (d) show the saturated values of \(m_{X}\) of the 3rd and 11th MTJs (see Fig. 1a).

Before moving to physical reservoir computing, we investigate the role of the dipole interaction by solving Eq. (1) for various \(H_\textrm{appl}\). Figure 2a and b show the time evolution of the magnetization of the 10th MTJ (see Fig. 1a) in the (a) global (experimental) and (b) local coordinate. Recall that the magnetization is assumed to be parallel to the long-axis direction of the MTJ at the initial time; therefore, \(m_{X}\), the X component of the magnetization in the local coordinate, satisfies \(|m_{X}|=1\) in the initial state. The magnitude \(H_\textrm{appl}\) and the angle \(\varphi _{H}\) of the external magnetic field are 1.0 kOe and \(0^{\circ }\), respectively. As shown in the figure, the magnetization direction saturates after a few nanoseconds. We repeat similar calculations for various magnitudes \(H_\textrm{appl}\) with a fixed angle \(\varphi _{H}=0^{\circ }\). Figure 2c and d summarize the saturated values of \(m_{x}\) [the x component of the magnetization in the global (experimental) coordinate] and \(m_{X}\) [the X component in the local coordinate]. In Fig. 2c, \(H_\textrm{appl}\) is swept from small to large values. In this case, we evaluate the saturated value of the magnetization under a certain value of \(H_\textrm{appl}\), and then, using this saturated value as a new initial condition, evaluate the saturated value under a slightly larger \(H_\textrm{appl}\). In Fig. 2d, on the other hand, the initial state of the magnetization is reset to the original state, i.e., parallel to the long-axis direction, for every \(H_\textrm{appl}\). Figure 2c may be suitable for comparison with experimental works, where, for example, a magnetization curve is often measured with a swept magnetic field45. Figure 2d, on the other hand, may be useful for capturing the differences of the saturated magnetization under a common initial condition. In both Fig. 2c and d, \(m_{x}\) approaches \(+1\) because the external magnetic field points in the x direction (\(\varphi _{H}=0^{\circ }\)). Note that these figures indicate the appearance of non-monotonic behavior near \(H_\textrm{appl}\simeq 800\) Oe. We confirmed that such non-monotonic behavior also appears in the other MTJs; see the insets of Fig. 2c and d, where the values of \(m_{X}\) of the 3rd and 11th MTJs are shown (see Fig. 1a). Such non-monotonic behavior does not appear when the dipole interaction is absent, in which case each MTJ reacts independently to the external magnetic field. Accordingly, the non-monotonic behavior of the magnetization saturation along the x direction originates from the dipole interaction between the MTJs. It is, unfortunately, difficult to derive an analytical formula for the magnetic field corresponding to the non-monotonic saturation because of the complexity of the magnetic energy, which depends on both the shape magnetic anisotropy field and the dipole interaction; see Methods for the numerical method solving the LLG equation. Although an analytical estimation is difficult, this field strength can be estimated even in experiments by measuring the saturation magnetization curve, similarly to Fig. 2c and d. As we will see below, a drastic change of the memory function of the ASI appears around this field magnitude.
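The two protocols used for Fig. 2c and d can be summarized by the following sketch, where `relax(m, H)` stands for a hypothetical routine that integrates Eq. (1) under a fixed field of strength `H` (with \(\varphi _{H}=0^{\circ }\)) until the magnetizations saturate, and `m_init` is the initial alignment of Fig. 1c; the field range and step are assumptions for illustration:

```python
import numpy as np

H_values = np.linspace(0.0, 2500.0, 251)       # field strengths in Oe (assumed grid)

m_sweep, m_reset = [], []
m = m_init.copy()
for H in H_values:
    m = relax(m, H)                            # sweep protocol (Fig. 2c): the previous
    m_sweep.append(m.copy())                   # saturated state is the new initial condition
    m_reset.append(relax(m_init.copy(), H))    # reset protocol (Fig. 2d): common initial state
```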

Results

Evaluation of memory capacities

The short-term memory and parity-check capacities, denoted as \(C_\textrm{STM}\) and \(C_\textrm{PC}\) in the following, respectively, are evaluated by applying a random binary input \(\textrm{bi}_{\ell }=0\) or 1 (\(\ell =1,2,\cdots\)) to the ASI as the external magnetic field,

$$\begin{aligned} \textbf{H}_\textrm{appl} = H_\textrm{appl} \left( 2 \textrm{bi}_{\ell } - 1 \right) \left( \cos \varphi _{H} \textbf{e}_{x} + \sin \varphi _{H} \textbf{e}_{y} \right) , \end{aligned}$$
(4)

i.e., \(\textbf{H}_\textrm{appl}=-H_\textrm{appl}(\cos \varphi _{H}\textbf{e}_{x}+\sin \varphi _{H}\textbf{e}_{y})\) for \(\textrm{bi}_{\ell }=0\) and \(\textbf{H}_\textrm{appl}=H_\textrm{appl}(\cos \varphi _{H}\textbf{e}_{x}+\sin \varphi _{H}\textbf{e}_{y})\) for \(\textrm{bi}_{\ell }=1\) (see also Fig. 1a). The random input \(\textrm{bi}_{\ell }\) is held constant during a pulse width \(t_\textrm{p}\). While the details of the evaluation methods of the memory capacities are summarized in Methods, we briefly recall that the short-term memory and parity-check capacities quantify the number of target data a reservoir can recognize. The target data for the evaluation of the short-term memory capacity is

$$\begin{aligned} z_{\ell ,D}^\textrm{STM} = \textrm{bi}_{\ell -D}, \end{aligned}$$
(5)

i.e., the target data is the past input signal itself. In other words, the short-term memory capacity characterizes the number of past input data a reservoir can recognize as is. The integer \(D(=0,1,2,\cdots )\) is called the delay and characterizes the order of the past input data. In this work, the short-term memory capacity is defined from the memory function for \(D=1,2,\cdots\); see also Methods for the evaluation method of the memory capacities. Note also that the memory capacities are the sums of component-wise capacities, which quantify the reproducibility of the target data for a given D; see the same Methods. The target data for the evaluation of the parity-check capacity is given by

$$\begin{aligned} z_{\ell ,D}^\textrm{PC} = \sum _{m=0}^{D} \textrm{bi}_{\ell -D+m}\ \ \ \ (\textrm{mod}\ 2). \end{aligned}$$
(6)

The parity-check capacity is one type of nonlinear memory capacity. For simplicity, we refer to the short-term memory and parity-check capacities collectively as memory capacities when it is unnecessary to distinguish them. The values of the parameters (for example, the number of training data) for the evaluation of the short-term memory and parity-check capacities are summarized in Methods. In the following, we show these memory capacities for various values of the parameters.
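For concreteness, a minimal sketch of how the input field of Eq. (4) and the target sequences of Eqs. (5) and (6) can be generated from a random binary sequence (the Mersenne Twister used in the paper is also NumPy's legacy generator; the seed is an assumption):

```python
import numpy as np

rng = np.random.RandomState(1)          # Mersenne Twister; the seed is arbitrary
bi = rng.randint(0, 2, size=1000)       # random binary inputs bi_l

def applied_field(bi_l, H_appl, phi_H_deg):
    """Eq. (4): bi_l = 0 gives -H_appl, bi_l = 1 gives +H_appl along phi_H."""
    phi = np.deg2rad(phi_H_deg)
    return H_appl * (2 * bi_l - 1) * np.array([np.cos(phi), np.sin(phi), 0.0])

def targets(bi, D):
    """STM and PC targets z_{l,D} for l = D, ..., len(bi)-1 [Eqs. (5) and (6)]."""
    ell = np.arange(D, len(bi))
    z_stm = bi[ell - D]                                           # Eq. (5)
    z_pc = np.array([bi[l - D:l + 1].sum() % 2 for l in ell])     # Eq. (6)
    return z_stm, z_pc
```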

Fig. 3

(a) An example of the binary inputs, and \(m_{x}\) of the MTJs for the conditions corresponding to (b) a small \(C_\textrm{STM}(\simeq 0.02)\) and (c) a large \(C_\textrm{STM}(\simeq 5.69)\) for the inputs in (a). Explicitly, \(H_\textrm{appl}\) and \(\varphi _{H}\) are (b) \(H_\textrm{appl}=1.0\) kOe and \(\varphi _{H}=25^{\circ }\) and (c) \(H_\textrm{appl}=2.1\) kOe and \(\varphi _{H}=25^{\circ }\).

Figure 3a shows an example of the input binary data for a pulse width of 5 ns, while Fig. 3b and c show the values of \(m_{x}\) of the N MTJs used for the evaluation of the memory capacities, where (b) \(H_\textrm{appl}=1.0\) kOe and \(\varphi _{H}=25^{\circ }\) and (c) \(H_\textrm{appl}=2.1\) kOe and \(\varphi _{H}=25^{\circ }\). As mentioned below, the short-term memory capacity corresponding to Fig. 3b is small, while the magnetic field condition of Fig. 3c gives the maximum value of the short-term memory capacity. In Fig. 3b, the magnetizations can take various states because the magnetic field strength is relatively small. Note that some of the magnetizations show non-identical behavior with respect to the same series of inputs. For example, Fig. 3a contains two common series: the random binary input changes from 0 to 1 between \(t=10.015\) \(\mu\)s and \(t=10.020\) \(\mu\)s and between \(t=10.025\) \(\mu\)s and \(t=10.030\) \(\mu\)s. Similarly, the input changes from 1 to 0 between \(t=10.020\) \(\mu\)s and \(t=10.025\) \(\mu\)s and between \(t=10.040\) \(\mu\)s and \(t=10.045\) \(\mu\)s. The output data are, however, different for these series of inputs. For example, all the magnetizations deviate from \(m_{x}=-1.0\) when the input changes from 0 to 1 between \(t=10.015\) \(\mu\)s and \(t=10.020\) \(\mu\)s. Some magnetizations, however, remain near \(m_{x}=-1.0\) when the input changes from 0 to 1 between \(t=10.025\) \(\mu\)s and \(t=10.030\) \(\mu\)s. Such a different response to the same series of input data leads to a failure of learning and results in a small component-wise capacity for a small delay D. It may, on the other hand, contribute to the component-wise capacity for a large D; see also Methods for the evaluation of the memory capacities. We should note that the value of \(H_\textrm{appl}\) in Fig. 3b is close to the magnetic field strength at which the saturation curve of the magnetization becomes non-monotonic (\(H_\textrm{appl}\simeq 800\) Oe); see Fig. 2c and d. This implies that the dipole interaction between the MTJs leads to the non-identical response of the magnetizations to the same series of input data. In contrast, in Fig. 3c, the magnetizations show almost the same response to the same series of input data. Hence, this case is expected to recognize the target data for small delays D and to result in relatively large memory capacities.

Figure 4a and b summarize the dependence of the short-term memory and parity-check capacities on the magnetic field strength \(H_\textrm{appl}\) and the angle \(\varphi _{H}\). The values of the memory capacities change drastically with the values of the parameters. For example, the short-term memory capacity is only 0.02 for the condition of Fig. 3b (\(H_\textrm{appl}=1.0\) kOe and \(\varphi _{H}=25^{\circ }\)), while it is maximized to 5.69 for the condition of Fig. 3c (\(H_\textrm{appl}=2.1\) kOe and \(\varphi _{H}=25^{\circ }\)). In particular, both capacities become small for magnetic field strengths near \(H_\textrm{appl}=800\) Oe. Recall again that this strength is close to the value at which the non-monotonic saturation behavior is observed due to the dipole interaction; see Fig. 2c and d. Large capacities appear near this small-capacity condition, i.e., there is an edge between small and large capacities. In the next section, we investigate its origin by evaluating the Lyapunov exponent.

Fig. 4

(a) Short-term memory capacity, (b) parity-check capacity, and (c) Lyapunov exponent for various magnetic field strengths \(H_\textrm{appl}\) and angles \(\varphi _{H}\).

Evaluation of Lyapunov exponent and estimation of echo state property

We evaluate the Lyapunov exponent \(\varLambda\) to investigate the origin of the edge between small and large memory capacities in Fig. 4a and b. The evaluation method of the Lyapunov exponent is summarized in Methods. First, let us explain the motivation for evaluating the Lyapunov exponent.

Physical reservoir computing works well when the output data from the physical reservoir is solely determined by the input data and is independent of the initial state of the reservoir50. In other words, the physical reservoir should produce the same output for the same input. This property is called echo state property. Whether a physical reservoir has the echo state property depends on the system parameters and the input characteristics (pulse strength, width, and so on). The computational ability of a physical reservoir is often maximized near the edge of echo state property, where the edge is a boundary in the system parameters and/or input characteristics at which the echo state property is lost or recovered. The edge of echo state property is sometimes identical to the edge of chaos, which is the boundary between chaotic and ordered dynamics in the physical reservoir. It should be noted, however, that these edges are not necessarily the same. In addition, while the edge of chaos has often been regarded as an ideal state for recurrent neural networks51, it was recently pointed out by Jaeger in the Foreword of Ref.9 that the edge of echo state property, not the edge of chaos, is the suitable state; this implies that the edge of chaos is a suitable state for computation only when it overlaps with the edge of echo state property. In addition, Jaeger points out that even this condition is limited, i.e., some tasks might be solved well at the edge of echo state property, but the edge of echo state property is not always the best state for computation. Based on this argument, the relationship between the short-term memory capacity, echo state property, and chaos was recently investigated for physical reservoir computing using spin-torque oscillators52. Here, we extend this previous work to ASI and study the role of echo state property in the memory capacities. For this purpose, the Lyapunov exponent is evaluated, for the following reason.

The Lyapunov exponent \(\varLambda\) is the expansion rate of two dynamical responses with infinitesimally different initial conditions. When the Lyapunov exponent is negative, the two solutions saturate to the same state. When the Lyapunov exponent is zero, on the other hand, the difference between the two solutions remains constant. In a ferromagnetic system, for example, magnetization relaxation (or switching) to a fixed stable state corresponds to dynamics with \(\varLambda <0\). On the other hand, the auto-oscillation of the magnetization in a spin-torque oscillator corresponds to dynamics with \(\varLambda =0\) because, for example, if two oscillators have slightly different initial phases, the phase difference remains constant due to the periodicity of the dynamics; in this case, the dynamical response depends on the initial phase. Recalling that the echo state property means that the dynamical response is independent of the initial state, the edge of echo state property can be defined as the boundary between negative and zero Lyapunov exponents. Therefore, the evaluation of the Lyapunov exponent clarifies the relationship between the echo state property and the memory capacities.

Figure 4c shows the Lyapunov exponent of the ASI for various \(H_\textrm{appl}\) and \(\varphi _{H}\). Comparing it with Fig. 4a and b, we notice that small memory capacities appear when the Lyapunov exponent is non-negative. Therefore, we conclude that the boundary between the small and large memory capacities is the edge of echo state property, which supports the argument by Jaeger9 mentioned above. This is also consistent with the result shown in Fig. 3. When the output data shows different responses to the same series (01 or 10) of input data, as in Fig. 3b, the echo state property for \(D=1\) is lost, and as a result, the short-term memory capacity is small. Summarizing these results, the dipole interaction between the MTJs results in the non-monotonic saturation of the magnetizations in the ASI at a certain strength of the external magnetic field, and the memory capacities become small near this field strength due to the loss of echo state property.

At the end of this section, we give some comments on the positive Lyapunov exponent in Fig. 4c. The existence of a positive Lyapunov exponent is sometimes regarded as indicating chaotic dynamics53,54,55. We, however, do not consider that the positive Lyapunov exponent in Fig. 4c indicates the presence of chaotic behavior in the ASI. This is because the magnetizations in the MTJs tend to saturate to certain directions under the applied external magnetic field, as shown in Fig. 2a and b, while chaos is sustained dynamics whose dynamical trajectory does not saturate. The positive Lyapunov exponent arises from the fact that there are several magnetization alignments corresponding to local minima of the magnetic energy, and which minimum the system settles into depends on various factors including the initial state of the magnetizations. This point may be explained differently in terms of attractors in nonlinear science53,54,55. An attractor is a region of phase space toward which a system evolves; thus, if the dynamical state is initially located inside an attractor, the system remains in it. A local minimum of the energy landscape is a typical attractor, called a point attractor, because a system falls into the minimum due to energy dissipation (relaxation). There are also various other kinds of attractors, such as periodic and chaotic (strange) attractors53,54,55. In the present system, the magnetization dynamics shown in Figs. 2 and 3 indicate that there are several point attractors corresponding to the local minima of the magnetic energy, and the ASI saturates to one of them. A positive Lyapunov exponent appears when two systems with slightly different initial conditions fall into different point attractors; in this case, although the distance between the two systems is expanded, the dynamics cannot be regarded as chaos. Therefore, we do not consider that chaotic dynamics exists in this work; accordingly, the edge of chaos is also absent. In addition, the fact that the region of positive Lyapunov exponent in Fig. 4c is small and does not coincide with the boundary between the small and large memory capacities again confirms that the edge of echo state property, not the edge of chaos, determines the boundary.

Conclusion

In summary, the magnetization dynamics in an ASI and its memory capacities for physical reservoir computing were investigated by numerical simulation of the LLG equation. A non-monotonic saturation of the magnetization with respect to an applied external magnetic field was observed, which originated from the dipole interaction between the MTJs. Both the short-term memory and parity-check capacities became quite small for magnetic field strengths around which this non-monotonic behavior appears. By evaluating the Lyapunov exponent, as well as investigating the temporal magnetization dynamics, it was found that the small memory capacities were due to the loss of echo state property. In other words, the boundary between the small and large memory capacities corresponds to the edge of echo state property.

Methods

Numerical method solving the LLG equation

We apply the fourth-order Runge-Kutta method to the LLG equation with a time increment of \(\Delta t=1.0\) ps. In this work, we use \(a=200\) nm, \(b=75\) nm, and \(d=20\) nm, while the thickness of the MTJs is assumed to be common to all of them and is 20 nm45. The Mersenne Twister56 for Fortran was used to generate the random numbers, such as \(\xi _{ai}\), \(\xi _{bi}\), and \(\textrm{bi}_{\ell }\), used in this work. For example, \(\xi _{a1}\) and \(\xi _{b1}\) are \(-0.326...\) and \(-0.569...\). Using these random numbers, both the shape magnetic anisotropy and dipole fields of the MTJs become slightly random around their designed values, where the designed values correspond to all MTJs having the common radii a and b. Recall that the dispersion of the MTJs' size is characterized by the dimensionless parameter \(\sigma\). Assuming that the central positions of the MTJs are located as in an ideal honeycomb structure, \(\sigma\) should satisfy \(\sigma \le d/a\) to avoid an overlap of MTJs. In this work, we use \(\sigma =0.02\), i.e., the maximum deviation of \(a_{i}\) and \(b_{i}\) from the designed values is 2%.
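A minimal sketch of this randomization (Python with NumPy's legacy Mersenne Twister generator as a stand-in for the Fortran implementation; the seed is an assumption, so the quoted values of \(\xi _{a1}\) and \(\xi _{b1}\) are not reproduced):

```python
import numpy as np

N, sigma = 72, 0.02        # number of MTJs and size dispersion (sigma <= d/a)
a, b = 200.0, 75.0         # designed long- and short-axis radii (nm)

rng = np.random.RandomState(12345)      # Mersenne Twister; seed is arbitrary
xi_a = rng.uniform(-1.0, 1.0, N)        # -1 < xi_a < 1
xi_b = rng.uniform(-1.0, 1.0, N)        # -1 < xi_b < 1
a_i = a * (1.0 + sigma * xi_a)          # radii deviate from the design by at most 2%
b_i = b * (1.0 + sigma * xi_b)
```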

Recall that the dipole field acting on the ith MTJ is the sum of the stray magnetic fields from the other MTJs. The stray magnetic field \(\textbf{H}_{\textrm{dip},ij}\) from the jth MTJ is numerically estimated by the method developed in Ref.57, where the value of the stray magnetic field at the center of the ith MTJ is used as \(\textbf{H}_{\textrm{dip},ij}\). In general, the stray magnetic field can be expressed as

$$\begin{aligned} \textbf{H}_{\textrm{dip},ij} = M \begin{pmatrix} I_{i,j,1,1} & I_{i,j,1,2} & I_{i,j,1,3} \\ I_{i,j,2,1} & I_{i,j,2,2} & I_{i,j,2,3} \\ I_{i,j,3,1} & I_{i,j,3,2} & I_{i,j,3,3} \end{pmatrix} \begin{pmatrix} m_{j,x} \\ m_{j,y} \\ m_{j,z} \end{pmatrix}, \end{aligned}$$
(7)

where we use the global coordinate for convenience. The values of the matrix elements \(I_{i,j,p,q}\) (\(p,q=1,2,3\)) depend on the size (\(a_{j}\) and \(b_{j}\)) of the jth MTJ and the relative position of the ith and jth MTJs57. For example, using \(\xi _{a1}\) and \(\xi _{b1}\) mentioned above and \(M=1500\) emu/c.c., the stray magnetic field from the 1st (\(j=1\)) MTJ at the position of the 7th (\(i=7\)) MTJ is estimated to be \(MI_{7,1,1,1}\simeq 51\), \(MI_{7,1,1,2}\simeq 37\), \(MI_{7,1,2,1}\simeq 81\), \(MI_{7,1,2,2}\simeq 7\), and \(MI_{7,1,3,3}\simeq -34\) Oe (the other matrix elements are zero). Recall that these two MTJs are an example of nearest-neighbor MTJs (see Fig. 1a). Therefore, we mentioned in the main text that the stray magnetic field \(\textbf{H}_{\textrm{dip},ij}\) from one MTJ is at most on the order of 100 Oe. Note that an MTJ has four nearest-neighbor MTJs (except MTJs around the edge of the ASI), and thus the strength of the dipole field \(\textbf{H}_{\textrm{dip},i}\) can be about four times larger than that of \(\textbf{H}_{\textrm{dip},ij}\) at maximum. We notice that the non-monotonic behavior of the saturation curves observed in Fig. 2c and d relates to the randomness of the MTJ size. In the absence of the randomness (\(\sigma =0\)), the non-monotonic behavior disappears. We consider that this comes from a cancellation of the stray magnetic fields. For example, when we focus on the 3rd MTJ in Fig. 1a, only the x component of the stray magnetic fields generated by the surrounding MTJs remains finite, while the y component becomes exactly zero due to the cancellation when the randomness is absent. The remaining x component of the total stray magnetic field merely enhances the applied magnetic field and does not lead to any non-monotonic behavior in the saturation curve. When the randomness is finite, on the other hand, the total y component becomes finite and gives rise to the non-monotonic behavior.
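Given the tensors \(I_{i,j,p,q}\) precomputed by the method of Ref.57 (not reproduced here), the dipole field of Eqs. (3) and (7) reduces to a single tensor contraction. A sketch, under the assumption that the diagonal blocks \(I_{i,i}\) are set to zero so that the \(j=i\) term drops out of the sum:

```python
import numpy as np

def dipole_fields(I, m, M):
    """H_dip[i] = M * sum_{j != i} I[i, j] @ m[j]  [Eqs. (3) and (7)].
    I: (N, N, 3, 3) tensors with I[i, i] = 0; m: (N, 3) unit magnetization
    vectors in the global coordinate; M: saturation magnetization."""
    return M * np.einsum('ijpq,jq->ip', I, m)
```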

The magnetic energy density is the sum of the Zeeman energy, the shape magnetic anisotropy energy, and the dipole interaction energy, which is modeled as

$$\begin{aligned} E = -\sum _{i=1}^{N} M \textbf{m}_{i} \cdot \textbf{H}_\textrm{appl} + 2\pi M^{2} \sum _{i=1}^{N} \sum _{U=X,Y,Z} N_{i,U} m_{i,U}^{2} - \sum _{i=1}^{N} M \textbf{m}_{i} \cdot \textbf{H}_{\textrm{dip},i}. \end{aligned}$$
(8)

For several reasons, such as the randomness of the shape magnetic anisotropy and dipole fields and the finite size of the ASI, it is difficult to analytically estimate the magnetic field strength around which the saturation magnetization curve shows the non-monotonic behavior seen in Fig. 2c and d.

Evaluation method of memory capacities

The evaluation of the short-term memory and parity-check capacities consists of two steps: we first determine the weights by learning (training) and then evaluate the capacities using these weights1,6,12,20,50,51. In the following, we summarize these procedures.

The first procedure is learning (training). A series of pulse input signals \(r_{\ell }\) (\(\ell =1,2,\cdots ,N_\textrm{L}\)) is injected into the physical reservoir, where \(N_\textrm{L}\) is the number of input signals used for the determination of the weights. The suffix \(\ell\) distinguishes the order of the input signals. A binary input signal \(\textrm{bi}_{\ell }\) is used in the present work, as in Ref.20. Another choice of input signal is a uniformly distributed random signal (\(0 \le r_{\ell } \le 1\) or \(-1 \le r_{\ell } \le 1\))50. The target data \(z_{\ell ,D}\) are defined from the input signal \(r_{\ell }\). The delay \(D(=0,1,2,\cdots )\) specifies how far in the past the input data used to define the target data lie. Since physical reservoir computing aims to recognize past input signals from the present output signal, it is necessary to introduce the delay D to distinguish the order of the past input signals. The target data for the short-term memory and parity-check capacities are given in Eqs. (5) and (6). After defining the target data, the weight \(w_{D,i}\) is estimated so as to minimize

$$\begin{aligned} \sum _{\ell =1}^{N_\textrm{L}} \left( \sum _{i=1}^{N+1} u_{\ell ,i} w_{D,i} - z_{\ell ,D} \right) ^{2} \end{aligned}$$
(9)

where \(u_{\ell ,i}\) is the output data from the ith node (MTJ in the present work) with respect to the \(\ell\)th input signal. In this work, \(m_{i,x}\) at the time just before switching the input pulse is used as \(u_{\ell ,i}\), i.e., when the \(\ell\)th input is injected during the time interval \(t_{\ell } < t \le t_{\ell }+t_\textrm{p}\), \(m_{i,x}\) at \(t=t_{\ell }+t_\textrm{p}\) is used as \(u_{\ell ,i}\). The \((N+1)\)th output, \(u_{\ell ,N+1}=1\), is the bias term. The weight is obtained via the Moore-Penrose pseudoinverse of the matrix \(u_{\ell ,i}\). Note that the value of the weight \(w_{D,i}\) depends on the target data; therefore, strictly speaking, it would be necessary to add an index to \(w_{D,i}\) to distinguish the weights for the evaluation of the short-term memory and parity-check capacities. For simplicity, however, we omit such an index because the following procedure is adopted for both evaluations. Recall that the target data \(z_{\ell ,D}\) for the short-term memory task is the random binary input \(\textrm{bi}_{\ell -D}\), an example of which is shown in Fig. 3a. Also, the output data \(u_{\ell ,i}\) is \(m_{i,x}\), as mentioned above, and examples for the 72 MTJs are shown in Fig. 3b and c.
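In matrix form, minimizing Eq. (9) is a standard linear least-squares problem, so the weight follows directly from the pseudoinverse; a minimal sketch:

```python
import numpy as np

def train_weights(U, z):
    """Return w_{D,i} minimizing Eq. (9).
    U: (N_L, N+1) matrix of outputs u_{l,i}, whose last column is the bias
    u_{l,N+1} = 1; z: (N_L,) target data z_{l,D}."""
    return np.linalg.pinv(U) @ z

# U is assembled from m_{i,x} sampled at the end of each input pulse, e.g.
# U = np.hstack([m_x_samples, np.ones((N_L, 1))])
```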

Fig. 5

Examples of target data and system output for the evaluation of the short-term memory capacity with the delay of (a) \(D=0\) and (b) \(D=6\). The values of the parameters are identical to those used in Fig. 3c. (c) Component-wise capacity for the evaluation of the short-term memory capacity.

Next, we evaluate the memory capacities by injecting a different series of input signals \(r_{n}^{\prime }\) (\(n=1,2,\cdots ,N_\textrm{E}\)), where the prime is added to distinguish quantities from those used in learning. The number of input signals, \(N_\textrm{E}\), is not necessarily the same as that used in learning (\(N_\textrm{L}\)). From the output data \(u_{n,i}^{\prime }\) in response to \(r_{n}^{\prime }\) and the weight determined by learning, the system output is defined as

$$\begin{aligned} y_{n,D}^{\prime } = \sum _{i=1}^{N+1} u_{n,i}^{\prime } w_{D,i}. \end{aligned}$$
(10)

If the learning is done well, the system output reproduces the target data \(z_{n,D}^{\prime }\) defined from \(r_{n}^{\prime }\). In Fig. 5, we show examples of the target data (red dots) and the system output (black line) for the evaluation of the short-term memory capacity (\(z_{\ell ,D}^\textrm{STM}=\textrm{bi}_{\ell -D}\) and \(u_{n,i}^{\prime }=m_{i,x}\)) with the delays of (a) \(D=0\) and (b) \(D=6\). As shown, the target data with the small delay (\(D=0\)) is easily reproduced because the output data includes the past information within a short time. For the large delay (\(D=6\)), however, the reproducibility becomes low because the relaxation dynamics of the magnetization erases the past information before the present time. The reproducibility is quantified by the correlation coefficient,

$$\begin{aligned} \textrm{Cor}(D) = \frac{\sum _{n=1}^{N_\textrm{E}} \left( z_{n,D}^{\prime } - \langle z_{n,D}^{\prime } \rangle \right) \left( y_{n,D}^{\prime } - \langle y_{n,D}^{\prime } \rangle \right) }{\sqrt{ \sum _{n=1}^{N_\textrm{E}} \left( z_{n,D}^{\prime } - \langle z_{n,D}^{\prime } \rangle \right) ^{2} \sum _{n=1}^{N_\textrm{E}} \left( y_{n,D}^{\prime } - \langle y_{n,D}^{\prime } \rangle \right) ^{2} }}, \end{aligned}$$
(11)

where the symbol \(\langle \cdots \rangle\) means an average. The component-wise capacity for the target data \(z_{n,D}^{\prime }\) is defined as

$$\begin{aligned} C(z_{n,D}^{\prime }) = \left[ \textrm{Cor}(D) \right] ^{2}. \end{aligned}$$
(12)

Figure 5c shows examples of the component-wise capacities for \(D=0,1,2,\cdots ,20\), where examples of the target data and the system output in Eq. (11) are shown in Fig. 5a and b, as mentioned above. The component-wise capacity is unity when the system output completely reproduces the target data, while it approaches zero when the system output differs greatly from the target data. The component-wise capacity is defined for each target data \(z_{n,D}^{\prime }\) (\(D=0,1,2,\cdots\)). The memory capacity is defined as

$$\begin{aligned} C = \sum _{D=1}^{D_\textrm{max}} C(z_{n,D}^{\prime }), \end{aligned}$$
(13)

where \(D_\textrm{max}\) is the maximum delay. Note that we use the definition of the memory capacity in Ref.20, where the component-wise capacities from \(D=1\) are used for the evaluation. In some papers, however, the component-wise capacity for \(D=0\) is included in the definition12. The component-wise capacity often becomes small for a large delay D, except in some cases23, and thus the value of the memory capacity saturates when \(D_\textrm{max}\) is set to a sufficiently large value. According to the definition of the component-wise capacity, for example, it is small in the case of Fig. 3b, where, as mentioned in the main text, some magnetizations show different responses to the same series of input data, 01 or 10; in this case, the component-wise capacity even for \(D=1\) becomes small. In fact, the short-term memory capacity in this case is much smaller than 1, indicating that the ASI cannot recognize even the input data injected only one step before. At the same time, however, we should note that a different response to two-digit input series may contribute to the component-wise capacity for a large \(D(\ge 2)\) because such a different response may reflect a sufficiently old input signal.
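A sketch of the evaluation step, Eqs. (10)-(13); thanks to the washout inputs preceding the evaluation inputs (see below), the target \(z_{n,D}^{\prime }\) is defined for all n, so all arrays are assumed to have length \(N_\textrm{E}\):

```python
import numpy as np

def memory_capacity(U_eval, targets_by_D, weights_by_D, D_max=20):
    """Eqs. (10)-(13). U_eval: (N_E, N+1) evaluation outputs u'_{n,i} with a
    bias column; targets_by_D[D]: (N_E,) target z'_{n,D}; weights_by_D[D]:
    weights w_{D,i} determined in the learning step."""
    C = 0.0
    for D in range(1, D_max + 1):                     # D = 0 excluded, as in Ref. 20
        y = U_eval @ weights_by_D[D]                  # system output, Eq. (10)
        cor = np.corrcoef(targets_by_D[D], y)[0, 1]   # correlation coefficient, Eq. (11)
        C += cor ** 2                                 # component-wise capacity, Eq. (12)
    return C                                          # memory capacity, Eq. (13)
```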

Fig. 6

(a) Short-term memory and (b) parity-check capacities for various magnetic field strengths \(H_\textrm{appl}\) and angles \(\varphi _{H}\), where the precision of the system output \(m_{x}\) is reduced by the method described in the text.

In the present work, we first solve the LLG equation with a fixed magnetic field from \(t=0\) to \(t=100\) ns, where the magnetic field is given by Eq. (4) with \(\textrm{bi}\) fixed to 1. This procedure saturates the ASI into a stable state under a constant magnetic field. Then, 300 random binary inputs are injected for washout, \(N_\textrm{L}=1000\) random binary inputs are injected for learning, 300 random binary inputs are injected for washout again, and \(N_\textrm{E}=1000\) random binary inputs are injected for the evaluation of the memory capacities.
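The resulting input schedule can be assembled, for example, as follows (each entry is held for \(t_\textrm{p}=5\) ns; the seed is an assumption):

```python
import numpy as np

rng = np.random.RandomState(2024)       # Mersenne Twister; seed is arbitrary
washout1 = rng.randint(0, 2, 300)       # washout before learning
train    = rng.randint(0, 2, 1000)      # N_L learning inputs
washout2 = rng.randint(0, 2, 300)       # washout before evaluation
evaluate = rng.randint(0, 2, 1000)      # N_E evaluation inputs
schedule = np.concatenate([washout1, train, washout2, evaluate])
# this sequence is preceded by 100 ns of constant field [Eq. (4) with bi = 1]
```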

In the main text, the memory capacities are evaluated by solving the LLG equation with a 64-bit Fortran code. Therefore, the output data are obtained with high precision. In such a case, for example, although the magnetization is almost saturated at \(t=5\) ns in Fig. 2a, a small change of \(m_{x}\) may still occur during the relaxation process (recall that the pulse width of the input signal is \(t_\textrm{p}=5\) ns). The learning and evaluation of the memory capacities might be affected by such a small change of \(m_{x}\). The precision of experimental measurements46 is, however, often low compared with that of the numerical evaluation. Therefore, we also evaluate the short-term memory and parity-check capacities using the same output data as in Fig. 4, except that the output data \(m_{i,x}\) is reduced to three digits; for example, if \(m_{i,x}=0.12345\cdots\), then \(m_{i,x}=0.12300...\) is used as the output. Figure 6a and b summarize the short-term memory and parity-check capacities evaluated with this reduced precision. The color scale is the same as that used in Fig. 4a and b. We confirm that, although the values of the memory capacities become small, several characteristics, such as the change of the memory capacities near the magnetic field strength of 800 Oe, show common tendencies between Figs. 4 and 6. Given these results, we believe that the results obtained by the numerical simulation in this work can be tested by future experiments.
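The precision reduction described above amounts to truncating \(m_{i,x}\) after the third decimal digit, e.g.:

```python
import numpy as np

def reduce_precision(m_x, digits=3):
    """Truncate toward zero after `digits` decimals: 0.12345 -> 0.123."""
    factor = 10.0 ** digits
    return np.trunc(np.asarray(m_x) * factor) / factor
```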

Evaluation method of Lyapunov exponent

The Lyapunov exponent is evaluated by the Shimada-Nagashima method58. In this method, the maximum Lyapunov exponent is obtained as the average of temporal Lyapunov exponents, which are estimated from the instantaneous expansion rate of two infinitesimally separated magnetization states along the most expanded direction. We introduce the zenith and azimuth angles of the kth MTJ (\(k=1,2,\cdots ,N\)), \(\theta _{k}(t)\) and \(\varphi _{k}(t)\), through \(\textbf{m}_{k}=(\sin \theta _{k}\cos \varphi _{k},\sin \theta _{k}\sin \varphi _{k},\cos \theta _{k})\) and evaluate their expansion rate59,60,61.

Let us denote the initial time to evaluate the Lyapunov exponent as \(t_\textrm{init}\). We also denote the kth magnetization at \(t=t_\textrm{init}\) as \(\textbf{m}_{k}(t_\textrm{init})\). At \(t=t_\textrm{init}\), we introduce \(\textbf{m}_{k}^{(1)}(t_\textrm{init})=[\sin \theta _{k}^{(1)}\cos \varphi _{k}^{(1)},\sin \theta _{k}^{(1)}\sin \varphi _{k}^{(1)},\cos \theta _{k}^{(1)}]\), where the distance between \(\textbf{m}_{k}(t_\textrm{init})\) and \(\textbf{m}_{k}^{(1)}(t_\textrm{init})\) is infinitesimally small. For simplicity, we denote this distance as

$$\begin{aligned} \mathscr {D}_{k}^{(1)}[\textbf{m}_{k}(t_\textrm{init}),\textbf{m}_{k}^{(1)}(t_\textrm{init})] = \sqrt{ \left[ \theta _{k}(t_\textrm{init}) - \theta _{k}^{(1)}(t_\textrm{init}) \right] ^{2} + \left[ \varphi _{k}(t_\textrm{init}) - \varphi _{k}^{(1)}(t_\textrm{init}) \right] ^{2} }. \end{aligned}$$
(14)

Since \(\textbf{m}_{k}^{(1)}\) is introduced over all the MTJs, i.e., \(k=1,2,\cdots ,N\), we define the total distance as

$$\begin{aligned} \mathscr {D}^{(1)}(t_\textrm{init}) = \sum _{k=1}^{N} \mathscr {D}_{k}^{(1)}[\textbf{m}_{k}(t_\textrm{init}),\textbf{m}_{k}^{(1)}(t_\textrm{init})]. \end{aligned}$$
(15)

The value of \(\mathscr {D}^{(1)}(t_\textrm{init})\) is fixed to a small value, denoted as \(\varepsilon\), which is \(1.0\times 10^{-5}\) in this work. The values of \(\theta _{k}^{(1)}(t_\textrm{init})\) and \(\varphi _{k}^{(1)}(t_\textrm{init})\) are arbitrary, provided the condition \(\mathscr {D}^{(1)}(t_\textrm{init})=\varepsilon\) is kept.

The time evolution of \(\textbf{m}_{k}(t_\textrm{init})\) and \(\textbf{m}_{k}^{(1)}(t_\textrm{init})\) to \(\textbf{m}_{k}(t_\textrm{init}+\Delta t)\) and \(\textbf{m}_{k}^{(1)}(t_\textrm{init}+\Delta t)\) is obtained by solving the LLG equation. Note that their distance,

$$\begin{aligned} \mathscr {D}_{k}^{(1)}[\textbf{m}_{k}(t_\textrm{init}+\Delta t),\textbf{m}_{k}^{(1)}(t_\textrm{init}+\Delta t)] = \sqrt{ \left[ \theta _{k}(t_\textrm{init}+\Delta t) - \theta _{k}^{(1)}(t_\textrm{init}+\Delta t) \right] ^{2} + \left[ \varphi _{k}(t_\textrm{init}+\Delta t) - \varphi _{k}^{(1)}(t_\textrm{init}+\Delta t) \right] ^{2} }, \end{aligned}$$
(16)

is generally different from \(\mathscr {D}_{k}^{(1)}[\textbf{m}_{k}(t_\textrm{init}),\textbf{m}_{k}^{(1)}(t_\textrm{init})]\). Therefore, introducing

$$\begin{aligned} \mathscr {D}^{(1)}(t_\textrm{init}+\Delta t) = \sum _{k=1}^{N} \mathscr {D}_{k}^{(1)}[\textbf{m}_{k}(t_\textrm{init}+\Delta t),\textbf{m}_{k}^{(1)}(t_\textrm{init}+\Delta t)], \end{aligned}$$
(17)

the expansion rate of the magnetizations from \(t=t_\textrm{init}\) to \(t=t_\textrm{init}+\Delta t\) is given by \(\mathscr {D}^{(1)}(t_\textrm{init}+\Delta t)/\varepsilon\). The temporal Lyapunov exponent at \(t=t_\textrm{init}+\Delta t\) is then defined as

$$\begin{aligned} \varLambda ^{(1)} = \frac{1}{\Delta t} \ln \frac{\mathscr {D}^{(1)}(t_\textrm{init}+\Delta t)}{\varepsilon }. \end{aligned}$$
(18)

While the initial perturbations, \(\theta _{k}^{(1)}(t_\textrm{init})-\theta _{k}(t_\textrm{init})\) and \(\varphi _{k}^{(1)}(t_\textrm{init})-\varphi _{k}(t_\textrm{init})\), are arbitrary, the perturbation from \(t=t_\textrm{init}+\Delta t\) onward should be chosen so that \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]-\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t]\) (\(n=2,3,\cdots\)) points in the most expanded direction. This is the key idea of the Shimada-Nagashima method58: even if the initial perturbation is arbitrary, the distance between two solutions of the equation of motion naturally aligns with the most expanded direction. In the present case, at \(t=t_\textrm{init}+(n-1)\Delta t\) (\(n=2,3,\cdots\)), we define \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]\) by introducing the zenith and azimuth angles as

$$\begin{aligned} \theta _{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t] = \theta _{k}[t_\textrm{init}+(n-1)\Delta t] + \varepsilon \frac{\theta _{k}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]-\theta _{k}[t_\textrm{init}+(n-1)\Delta t]}{\mathscr {D}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]}, \end{aligned}$$
(19)
$$\begin{aligned} \varphi _{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t] = \varphi _{k}[t_\textrm{init}+(n-1)\Delta t] + \varepsilon \frac{\varphi _{k}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]-\varphi _{k}[t_\textrm{init}+(n-1)\Delta t]}{\mathscr {D}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]}, \end{aligned}$$
(20)

where \(\mathscr {D}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]\) is defined as

$$\begin{aligned} \begin{aligned} \mathscr {D}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]&= \sum _{k=1}^{N} \mathscr {D}_{k}^{(n-1)}\{\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t],\textbf{m}_{k}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t]\}, \\&= \sum _{k=1}^{N} \sqrt{ \left\{ \theta _{k}[t_\textrm{init}+(n-1)\Delta t] - \theta _{k}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t] \right\} ^{2} + \left\{ \varphi _{k}[t_\textrm{init}+(n-1)\Delta t] - \varphi _{k}^{(n-1)}[t_\textrm{init}+(n-1)\Delta t] \right\} ^{2} }. \end{aligned} \end{aligned}$$
(21)

Then, \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]-\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t]\) points to the most expanded direction. Note that, according to the definition, \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]\) satisfies

$$\begin{aligned} \begin{aligned} \mathscr {D}^{(n)}[t_\textrm{init}+(n-1)\Delta t]&= \sum _{k=1}^{N} \mathscr {D}_{k}^{(n)}\{\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t],\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]\} \\&= \sum _{k=1}^{N} \sqrt{ \left\{ \theta _{k}[t_\textrm{init}+(n-1)\Delta t] - \theta _{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t] \right\} ^{2} + \left\{ \varphi _{k}[t_\textrm{init}+(n-1)\Delta t] - \varphi _{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t] \right\} ^{2} } \\&= \varepsilon , \end{aligned} \end{aligned}$$
(22)

i.e., the sum of the distances between \(\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t]\) and \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]\) is \(\varepsilon\). Then, we evaluate the time evolution of \(\textbf{m}_{k}[t_\textrm{init}+(n-1)\Delta t]\) and \(\textbf{m}_{k}^{(n)}[t_\textrm{init}+(n-1)\Delta t]\) to \(\textbf{m}_{k}(t_\textrm{init}+n\Delta t)\) and \(\textbf{m}_{k}^{(n)}(t_\textrm{init}+n\Delta t)\) and define their distance as

$$\begin{aligned} \mathscr {D}_{k}^{(n)}[\textbf{m}_{k}(t_\textrm{init}+n\Delta t),\textbf{m}_{k}^{(n)}(t_\textrm{init}+n\Delta t)] = \sqrt{ \left[ \theta _{k}(t_\textrm{init}+n\Delta t) - \theta _{k}^{(n)}(t_\textrm{init}+n\Delta t) \right] ^{2} + \left[ \varphi _{k}(t_\textrm{init}+n\Delta t) - \varphi _{k}^{(n)}(t_\textrm{init}+n\Delta t) \right] ^{2} }. \end{aligned}$$
(23)

Using their sum, \(\mathscr {D}^{(n)}(t_\textrm{init}+n\Delta t)=\sum _{k=1}^{N}\mathscr {D}_{k}^{(n)}[\textbf{m}_{k}(t_\textrm{init}+n\Delta t),\textbf{m}_{k}^{(n)}(t_\textrm{init}+n\Delta t)]\), the temporal Lyapunov exponent at \(t=t_\textrm{init}+n\Delta t\) is defined as

$$\begin{aligned} \varLambda ^{(n)} = \frac{1}{\Delta t} \ln \frac{\mathscr {D}^{(n)}(t_\textrm{init}+n\Delta t)}{\varepsilon }. \end{aligned}$$
(24)

Repeating these procedures, the maximum Lyapunov exponent is obtained as the long-time average of the temporal Lyapunov exponents,

$$\begin{aligned} \varLambda = \lim _{\mathscr {N}\rightarrow \infty } \frac{1}{\mathscr {N}} \sum _{i=1}^{\mathscr {N}} \varLambda ^{(i)}. \end{aligned}$$
(25)

In this work, we start evaluating the temporal Lyapunov exponent when the first washout inputs are injected. Since there are two sets of 300 washout inputs, 1000 training inputs, and 1000 test inputs, and the pulse width is 5.0 ns, as mentioned above, the number of averaged temporal Lyapunov exponents is \(1.3\times 10^{7}\) (recall that \(\Delta t=1.0\) ps).
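A compact sketch of the whole procedure, Eqs. (14)-(25), is given below; `llg_step` stands for a hypothetical routine that advances the angles \((\theta _{k},\varphi _{k})\) of all MTJs by one time step \(\Delta t\) under the input field, and the exponent is returned in units of 1/s:

```python
import numpy as np

def max_lyapunov(angles0, llg_step, n_steps, eps=1.0e-5, dt=1.0e-12):
    """Shimada-Nagashima estimate of the maximum Lyapunov exponent.
    angles0: (N, 2) array of (theta_k, phi_k); llg_step(angles) advances all
    angles by dt (hypothetical LLG integrator); eps: fixed distance of Eq. (15)."""
    rng = np.random.RandomState(0)                      # arbitrary initial perturbation
    pert = rng.uniform(-1.0, 1.0, angles0.shape)
    pert *= eps / np.linalg.norm(pert, axis=1).sum()    # enforce D^(1) = eps, Eq. (15)
    a, b = angles0.copy(), angles0 + pert
    lam_sum = 0.0
    for _ in range(n_steps):
        a, b = llg_step(a), llg_step(b)                 # evolve both copies by dt
        d = np.linalg.norm(b - a, axis=1).sum()         # total distance, Eqs. (16), (17)
        lam_sum += np.log(d / eps) / dt                 # temporal exponent, Eq. (18)
        b = a + (b - a) * (eps / d)                     # rescale along the most expanded
    return lam_sum / n_steps                            # direction, Eqs. (19)-(22); Eq. (25)
```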