Modeling nonlinear oscillator networks using physics-informed hybrid reservoir computing

Shannon, Andrew; Houghton, Conor; Barton, David A. W.; Homer, Martin

doi:10.1038/s41598-025-03957-x

Download PDF

Article
Open access
Published: 02 July 2025

Modeling nonlinear oscillator networks using physics-informed hybrid reservoir computing

Scientific Reports volume 15, Article number: 22497 (2025) Cite this article

924 Accesses
1 Citations
Metrics details

Subjects

Abstract

Surrogate modeling of non-linear oscillator networks remains challenging due to discrepancies between simplified analytical models and real-world complexity. To bridge this gap, we investigate hybrid reservoir computing, combining reservoir computing with “expert” analytical models. Simulating the absence of an exact model, we first test the surrogate models with parameter errors in their expert model. Second, in a residual physics task, we assess the performance when the expert model lacks key non-linear coupling terms present in an extended ground-truth model. We focus on short-term forecasting across diverse dynamical regimes, evaluating the use of these surrogates for control applications. We show that hybrid reservoir computers generally outperform standard reservoir computers and exhibit greater robustness to parameter tuning. This advantage is less pronounced in the residual physics task. Notably, unlike standard reservoir computers, the performance of the hybrid does not degrade when crossing an observed spectral radius threshold. Furthermore, there is good performance for dynamical regimes not accessible to the expert model, demonstrating the contribution of the reservoir.

Novel nondelay-based reservoir computing with a single micromechanical nonlinear resonator for high-efficiency information processing

Article Open access 20 October 2021

A novel approach to minimal reservoir computing

Article Open access 10 August 2023

Efficient optimisation of physical reservoir computers using only a delayed input

Article Open access 18 January 2025

Introduction

Networks of oscillators appear widely across engineering, and in both the physical and biological sciences. When the networks are non-linear their dynamical behavior can be complex, displaying synchronization, chaos, and traveling waves^1,2,3. Creating predictive dynamical models is important in areas such as surrogate modeling of non-linear oscillator networks (NLONs) for smart electrical grid optimization^4,5,6, biological computing⁷, synthetic biology^8,9 and the diagnosis and treatment of neurological disorders such as epilepsy¹⁰ and Parkinson’s disease^11,12,13,14. These surrogate models have two broad applications: parameter inference and control. Parameter inference can help identify critical states and associated parameter values within a system, ideally also exposing underlying mechanistic processes. Control using surrogate models aims to exploit system knowledge to improve control performance. Methods such as model predictive control¹⁵ and model-based reinforcement learning¹⁶ are examples.

In this paper, we focus on hybrid reservoir computing (RC), a specific form of physics-informed machine learning (PIML). In particular, with a view to probing its viability for control applications, we investigate how well hybrid RC performs surrogate modeling of NLONs.

Real NLONs are often high dimensional, partially observable, noisy, and involve complex intra-network interactions. As such, it can be extremely difficult to create accurate surrogate models. The classical approach to this task is direct physics-based modeling, where mechanisms are pre-ordained and parameters fit to data. Machine learning (ML), or data-driven modeling, is an alternative approach which uses fully parameterized models and with parameters updated using learning algorithms. These two contrasting methods confer distinct benefits and drawbacks.

Physics-based models are physically accurate within the bounds of the assumptions made in their construction. Similarly, within the domain of their training data, data-driven models perform well. Failure when predicting out-of-domain is therefore common to the two approaches. ML models can be updated online using new data to adapt to new situations, although this is in itself a challenging problem¹⁷. Physics-based models are inherently interpretable: each term generally has some understood physical meaning, or represents some physical laws or constraints. On the other hand, ML discards this prior knowledge in favor of complete parameterisation from observed data. When data-driven models have many parameters, they require large amounts of data. Physics-based models tend to have few parameters that need fitting to data, and therefore generally require less data. Both methods can be computationally expensive as physics-based models rely on complex numerical schemes, and data-driven models often need to use extensive training algorithms for parameter updates.

PIML is a recently-formulated approach for surrogate modeling and prediction applications¹⁸. It combines both physics-based and data-driven methods, attempting to make use of the best features of each. Its goal is to obtain physically constrained, robust, and interpretable models that capture both expert knowledge of dynamical processes and the information that can be extracted from data obtained from sensing and recording devices. PIML models promise to be more data efficient as they do not require all of the dynamics to be learned from scratch, and they may also facilitate adaptivity through the use of machine learning. PIML-based control for NLONs may thus result in more robust, efficient and accurate controllers that are adaptable and generalisable.

For PIML-based modeling of dynamical systems, it is natural to consider ML components with a time component or sequential nature. For instance, recurrent neural networks (RNNs) and their variants (LSTMs¹⁹, GRUs²⁰) are a common choice^21,22,23. However, RC^24,25 is a particularly promising alternative, due to its simple training procedure. The simplicity of an RC may also offer a unique benefit for PIML parameter inference; while a large RNN can learn, or over learn, a complete model of the data without any input from the physics-based component, an RC has a limited capacity which may force the model to use the physics-based component, making it more interpretable. The small number of parameters used by an RC may also further enhance the low-data requirement conferred by the use of system knowledge. RCs are a restricted form of RNN, where learning only takes place in an external readout layer. Sequential data is passed into the reservoir via a fixed, random input weight matrix. An update rule then acts as a discrete non-linear map upon the internal state stored within the activations of the reservoir nodes. The result is a high-dimensional non-linear filter of the incoming data with a fading memory of past states. Scaling the weights of the input matrix and internal connectivity controls the extent to which past-state information is maintained in the hidden state and how much influence is exerted by the input data. To compute an output from the reservoir’s internal state, an output weight matrix is trained, often using regularized linear regression. The readout layer may be trained to perform n-step-ahead prediction. When configured to predict the next step in a sequence, the reservoir may be run autoregressively with its output fed back in as the next input instance and thus used for time-series forecasting. This is the format we are considering here: using RCs for the prediction of dynamical system trajectories.

The main advantage of RCs over more complex RNNs is their ease of initialization and simplicity of training. Good time-series forecasting performance can be achieved using only linear regression, even when chaotic dynamics are present²⁶, and issues such as the vanishing gradient problem are avoided. Since they require only general non-linear high-dimensional filtering of inputs and a fading memory of past inputs, RCs can also be constructed in a wide range of physical substrates^27,28.

Recently, a PIML variant of an echo state network RC was proposed²⁹. In hybrid RC (Fig. 1), the prediction from a standard RC is augmented by a single-step integration of an expert ordinary differential equation (ODE) model of the system being predicted. The next step prediction of the ODE model is passed into the reservoir alongside the current state. It is also passed around the reservoir to be considered by the output weight matrix on its own merit, during training and inference. The output weight matrix, still the only trained component, thus aims to combine the augmented reservoir state with the ODE model prediction to most accurately predict the next state. This approach allows the reservoir to compensate for errors in the expert ODE model, and has been shown to result in superior performance when compared to models that use only one of the two components, that is, a standard RC or an expert ODE model.

In particular, the hybrid RC was used to predict the dynamics of the Lorenz and Kuramoto-Sivashinsky systems when incorporating a model of each system with parameter error. With the correct model structure, the hybrid RC was shown to perform well, better than either a standard RC or ODE model in isolation. The hybrid RC also maintained good performance when, under particular parameter settings, the standard RC and ODE model performed poorly²⁹.

To investigate the potential of hybrid RCs for the novel example of NLON prediction and control—where the ground-truth dynamics is more complex, or the expert model is further from the true system than in previous work²⁹—we evaluated their performance on two tasks: parameter error and residual physics.

Parameter error, the first, simpler, task tests how well a hybrid RC predicts the trajectory of a network of standard Kuramoto oscillators, when the parameters in the hybrid RC model do not correctly match the parameters of the ground truth model. This follows the previous evaluation but with the Kuramoto oscillator network replacing the Lorenz system²⁹. The test is run across a range of hyperparameters to assess performance robustness to tuning, and across three qualitatively different dynamical regimes.
Residual physics, our second task, is more challenging. We measure the short term prediction performance of the hybrid RC when the hybrid RC uses a simpler model than the ground truth. This is intended to mimic real-world examples where the complex interactions of an oscillating system are unknown. In these examples a simpler, approximate, model is often used, but even small non-linear terms can quickly make predictions inaccurate. Our interest is in whether the reservoir component of the hybrid RC can compensate for the over-simplification of the model.

In our implementation, the residual physics is an additional higher harmonic in the coupling term for the ground truth Kuramoto-like system: we give the hybrid RC the standard Kuramoto model without this addition. This bi-harmonic Kuramoto model³⁰ produces behaviors not accessible to the original Kuramoto model. For example, when clustering of the oscillators around a phase occurs in the standard Kuramoto model, there is only one cluster; with an extra harmonic term this need not be true. The residual physics task aims to replicate realistic control scenarios with incomplete knowledge of the system structure. This is an interesting challenge for the hybrid RC since exact knowledge of the ground truth non-linearities has previously been identified as being crucial for Kuramoto oscillator network attractor reconstruction when using the Next Generation Reservoir Computer^31,32. We run this test across a range of hyper parameters, with four qualitatively different dynamical regimes, using the results to inform a demonstrative grid-search optimization process simulating the development of a surrogate model for control applications. Overall, we are not seeking the best method for modeling NLONs^33,34, instead, we look to understand if the benefit of the hybrid approach seen in ²⁹ carries over to application to NLONs and the more realistic residual physics task.

Methods

We present first an overview of the standard and hybrid RCs used in this study, followed by a shared procedure for ground truth data generation and testing in each task. We then describe the specific initialization and training methods, followed by details of the parameter error and residual physics tasks. The details of the hybrid RC and parameter error task largely follow the approach in ²⁹. They can therefore be skipped if the reader is familiar with this prior work. The only exception is our addition of the “Phase component transformation” to deal with the phase variables. Our aim is to test hybrid reservoir computing as an approach, rather than to tune a model to a particular application, so we have used as a baseline the parameter settings previously used in ²⁹; this isolates the consequence of using the new ground truth system and the new residual physics task from the effect of parameter tuning.

Reservoir computing

We use the echo state network (ESN)²⁴ formulation of RC throughout this work (Fig. 2). An ESN comprises $D_r$ nodes with internal states at time t denoted $\mathbf {r_t}\in \mathbb {R}^{D_r\times 1}$. The nodes are coupled together via a weight matrix ${\bf{A}} \in \mathbb {R}^{D_r\times D_r}$. Two weight matrices, ${\bf{B}} \in \mathbb {R}^{D_r\times D_u}$ and ${\bf{C}} \in \mathbb {R}^{D_u\times D_r}$, corresponding to the input weights and the output weights respectively complete the architecture. Input data, ${\bf{u}}_t\in \mathbb {R}^{D_u\times 1}$, is fed into the network via the input weight matrix, ${\bf{B}}$. An update rule operates upon the hidden state and input data to produce the next internal state. For an ESN without a leak term, as used here, the update rule is

$$\begin{aligned} {\bf{r}}_{t+1}=\tanh \left( {\bf{A}}{\bf{r}}_{t}+{\bf{B}}{\bf{u}}_{t}\right) . \end{aligned}$$

(1)

Only the output weight matrix, ${\bf{C}}$, is trained, generally via regularized linear regression to produce an output that can take many forms depending on the task. Here, we are focused on next-step prediction for dynamical-systems forecasting, so the output matrix is trained to estimate the next state of the ground truth trajectory. An internal state history matrix ${\bf{R}}=[{\bf{r}}_0,{\bf{r}}_1,\dots ,{\bf{r}}_{T}]\in \mathbb {R}^{D_r\times n_T}$ is formed by passing a training trajectory, ${\bf{u}}_T\in \mathbb {R}^{D_u\times n_T}$, into the reservoir in a feed-forward fashion (Fig. 2 training mode). The linear-regression step optimizes the weights in ${\bf{C}}$ to best form a map between ${\bf{R}}$ and the corresponding next step states of ${\bf{u}}_T$.

To produce a forecast from a given initial condition, an ESN must be run autoregressively, using its output as the next input (Fig 2 prediction mode). This requires first that the internal reservoir state is synchronized to the start of the trajectory it is forecasting. This warm-up phase, with the RC run in feed-forward mode, tends to be quite short, and requires that the reservoir satisfies the echo state property³⁵.

A non-linear transformation of the reservoir internal states, ${\bf{r}}_t$, may be applied before the output computation to enrich the representations or break symmetries in the input data³⁶.

Hybrid reservoir computing

A hybrid RC²⁹ (Fig. 3) is constructed out of a standard echo state network with the addition of an ODE model of the target system (the ‘expert ODE model’, Fig. 3, blue triangle). An auxiliary next-step prediction $\tilde{{\bf{u}}}_{t+1}$ is computed from the present state ${\bf{u}}_t$ by a numerical integrator. The integrator’s prediction is then concatenated with the present state, $[\tilde{{\bf{u}}}_{t+1};{\bf{u}}_{t}]$, before being passed into the reservoir as usual via the input weight matrix. The integrator’s prediction, $\tilde{{\bf{u}}}_{t+1}$, is also passed around the reservoir and concatenated with the reservoir internal states after the internal update, $[\tilde{{\bf{u}}}_{t+1};{\bf{r}}_{t+1}]$. Because of the concatenation of the usual data instances with the predictions of the ODE model, the size of the input and output weight matrices must change accordingly: ${\bf{B}}\in \mathbb {R}^{D_r\times 2D_u}$, and ${\bf{C}}\in \mathbb {R}^{D_u\times (D_r+D_u)}$. If a non-linear transformation is applied, it is applied to the reservoir internal states, ${\bf{r}}_{t+1}$, only, not the augmented state vector as a whole. The dual use of the integrator’s prediction $\tilde{{\bf{u}}}_{t+1}$ is present to maintain the same approach as ²⁹, despite it obscuring where the benefit of the hybrid approach originates.

The training procedure is the same as for the standard reservoir, except that the internal-state history matrix includes the auxiliary next-state predictions, $\tilde{{\bf{u}}}_{t+1}$, and therefore ${\bf{C}}$ is being optimized to map from the augmented state, $[\tilde{{\bf{u}}}_{t+1};{\bf{r}}_{t+1}]$, to the next state, ${\bf{u}}_{t+1}$. Trajectory forecasting again requires a warm-up phase, and proceeds in the same fashion as the standard ESN method with the addition of the computation of the expert ODE model’s auxiliary next state predictions.

Shared test procedure

We used a shared method across each task in this study. In each task, there is a distinct ground truth dynamical system that we are trying to predict. To produce the training, warm-up, and test data for each task, we evolved the ground truth system for a long time and segmented the resulting trajectory. The training, warm-up and test trajectory lengths were chosen according to previous work presenting the hybrid RC architecture²⁹ to enable comparison with the performance achieved there. Figure 4 details the lengths of each stage, and shows schematically the organization of the stages. The long trajectory was divided such that the start is the training span. After a gap, 20 disjoint warm-up/test spans follow. In every test, regardless of the performance evaluation metric, we initialized 40 instantiations of the reservoir (standard or hybrid) and trained them on the appropriate training span. Each test then consisted of synchronizing the reservoirs to the initial condition of the test span by feeding in the corresponding warm-up span, and then running the reservoirs autoregressively to produce forecasts of the same length as the test span. As a control, 40 instantiations of the hybrid RC’s expert ODE model (each with sampled parameter error) were produced and integrated with initial conditions matching the test trajectories.

Model initialization and training

Standard reservoirs

To initialize the standard and hybrid RCs we used the same method as in previous work²⁹. In particular, the internal connectivity matrix of each reservoir, ${\bf{A}}$, was constructed as an Erdős–Rényi random graph with mean degree $\left<d\right>$, corresponding to an edge probability of $\left<d\right> /D_r$. The weights were initialized uniformly randomly between $-1$ and 1, and then scaled such that the spectral radius was set to a desired value. The spectral radius of a matrix is the magnitude of its largest eigenvalue. The input weight matrix, ${\bf{B}}$, was initialized with a single non-zero element per row, corresponding to one connection per reservoir node. Each of these connections was assigned uniformly randomly to one of the input dimensions. The weight of each connection was drawn from a uniform distribution on a range set by the input scaling hyper-parameter: for all $i\in \{1,\dots ,D_r\}$,

$$\begin{aligned} j&\sim \text {Uniform}\{1,\dots ,D_u\},\\b_{ij}&\sim \text {Uniform}(-\text {input scaling}, \text {input scaling}),\\b_{ik}&=0\;\text {for}\;k\not = j. \end{aligned}$$

(2)

The spectral radius and input scaling were set to 0.4 and 0.15 respectively as baseline values as in previous work²⁹. Although restricting the input to one per node may not result in optimum performance due to a lack of mixing at the present step, this was done to match the approach in ²⁹ to allow easy comparison of the results.

We used the non-linear transformation g, applied to the reservoir internal states, ${\bf{r}}_t$, that was used previously²⁹. The output of the reservoir, ${\bf{u}}_{t+1}$, was thus computed as

$$\begin{aligned} {\bf{u}}_{t+1}={\bf{C}}g\left( {\bf{r}}_{t+1}\right) , \end{aligned}$$

(3)

with the non-linear transformation $g:\mathbb {R}^{D_r}\mapsto \mathbb {R}^{D_r}$ defined such that its i^t^h-component, $g_i$, is given for all $i\in \{1,\ldots ,D_r\}$ by

$$\begin{aligned} g_i({\bf{r}}) = {\left\{ \begin{array}{ll} r_i, & i=\operatorname {odd},\\ r_i^2, & i=\operatorname {even}. \end{array}\right. } \end{aligned}$$

(4)

The output weight matrix, ${\bf{C}}$, was trained via regularized linear regression to perform next step prediction. An internal state history was formed from the reservoir node activations that were obtained upon processing a training trajectory. g was first applied column-wise to the state history matrix ${\bf{R}}=[{\bf{r}}_0,{\bf{r}}_1,\dots ,{\bf{r}}_T]$ such that $\hat{{\bf{R}}}=g_{col}({\bf{R}})=[g({\bf{r}}_0),g({\bf{r}}_1),\dots ,g({\bf{r}}_T)]$. The output weight matrix, ${\bf{C}}$, was then computed using the ridge regression equation

$$\begin{aligned} {\bf{C}}=\left( \hat{{\bf{R}}}^{T}\hat{{\bf{R}}}+\beta {\bf{I}}\right) ^{-1}\hat{{\bf{R}}}^{T}{\bf{U}}^+, \end{aligned}$$

(5)

where ${\bf{U}}^+\in \mathbb {R}^{D_u\times n_T}$ is the target state matrix, comprising the states one step ahead of the training trajectory states, and $\beta$ is the regularization parameter. ${\bf{X}}^T$ indicates the transpose of ${\bf{X}}$ and ${\bf{X}}^{-1}$ the inverse. ${\bf{I}}$ is the $D_r \times D_r$ identity matrix.

Hybrid Reservoirs

For the hybrid RC, the internal reservoir weight matrix, ${\bf{A}}$, was constructed in the same way as for the standard reservoir. The initialization of the input weight matrix, ${\bf{B}}$, differed from the standard reservoir implementation. A hyper-parameter, the Knowledge Ratio, was used to determine the fraction of connections to the expert ODE model output states. For instance, with a knowledge ratio of 0.3, $30\%$ of the connections are from states within the expert ODE model output, and $70\%$ are from the incoming data. This was achieved by connecting each reservoir node to either the expert model or the input data, with probability $KR\in [0,1]$ of connection to the expert model. Again, only one connection per reservoir node to the input states/ODE output was formed and the weights were drawn from a uniform distribution on a range set by the input scaling parameter, so, for all $i\in \{1,\dots ,D_r\}$ we set

$$\begin{aligned} j&\sim {\left\{ \begin{array}{ll} \text {Uniform} \left\{ 1,\dots ,D_u \right\} \text {, with probability }KR,\\ \text {Uniform} \left\{ D_u+1,\dots ,2D_u \right\} \text {, with probability }1-KR. \end{array}\right. }\\b_{ij}&\sim \text {Uniform}(-\text {input scaling}, \text {input scaling}),\\b_{ik}&=0\;\text {for}\;k\not = j. \end{aligned}$$

(6)

A knowledge ratio of 0.5 was used as a baseline. A sweep from 0.1 to 0.9 knowledge ratio on the parameter error task is presented in the supplementary information (Supplementary Fig. S6).

The output weight matrix, ${\bf{C}}$, was again trained using regularized linear regression. However, in the hybrid case, the augmented state history matrix $\tilde{{\bf{R}}}\in \mathbb {R}^{(D_u+D_r)\times n_T}$ is formed by concatenation of the auxiliary next step predictions with the reservoir node activations, $\tilde{{\bf{R}}}=[\tilde{{\bf{U}}};{\bf{R}}]$. $\tilde{{\bf{U}}}\in \mathbb {R}^{D_u\times n_T}$ contains the auxiliary next step predictions of the expert ODE model corresponding to each instance of the training data. Each column of $\tilde{\bf{R}}$ is thus $[\tilde{{\bf{u}}}_{t+1};{\bf{r}}_{t+1}]$. The activations ${\bf{r}}_{t+1}$ are, in turn, produced by processing the training data state, ${\bf{u}}_t$, and the corresponding auxiliary next step prediction, $\tilde{{\bf{u}}}_{t+1}$. Only the reservoir internal states, ${\bf{r}}_{t+1}$, within $\tilde{{\bf{R}}}$ were transformed by g prior to computation of the output matrix, ${\bf{C}}$, or any output states during prediction.

Phase component transformation

Dynamical systems models of NLONs are often composed of phase variables, one for each oscillator in the network. As phase is bounded within $[-\pi ,\pi ]$, representing the angle around the unit circle, a phase variable trajectory can contain discontinuous jumps. To avoid this, we project the phase variables onto the x and y axes, forming continuous x and y phase components, which are then processed by the reservoir. To conserve the magnitude of the phase variables upon each iteration of the reservoir (standard or hybrid), we projected the phase components to phases, and back to components again after every step (Figs. 2 and 3, phase component magnitude normalization).

Parameter error task

In the parameter error task, we were aiming to evaluate the performance of the hybrid RC when its expert model differs from the ground truth only through errors in its parameters. This was the case considered previously²⁹. We conducted this test to find out if the hybrid RC could provide a forecasting performance improvement when predicting NLONs, as previously demonstrated for the Lorenz and Kuramoto-Sivashinsky systems. We carried out a parameter sweep over a range of hyperparameter settings to test the robustness of any improvement over the standard RC.

Ground truth

The Kuramoto model is a canonical model of a system of coupled non-linear oscillators³⁷. We use it as the ground truth in the parameter error task, and as the expert ODE model of the hybrid RC throughout this study. In particular, we used an all-to-all coupled network of Kuramoto oscillators. This model can demonstrate synchronous, asynchronous, and chaotic behavior. The fixed reference frame form of the model is

$$\begin{aligned} \frac{{{\,\textrm{d}\,}}\theta _i}{{{\,\textrm{d}\,}}t}=\omega _i+\frac{K}{N}\sum ^{N}_{j=1}\sin \left( \theta _j-\theta _i\right) , \end{aligned}$$

(7)

where $\theta _i(t)$ are the individual oscillator phases, $\omega _i$ are the oscillator natural frequencies, K is the coupling strength, and N is the number of oscillators.

The different dynamical states of Eq. (7) are achieved by varying the global coupling strength K and the natural frequencies $\omega _i$. For large enough K, and a tight enough distribution of $\omega _i$, the oscillators will synchronize, or demonstrate phase locking. When the coupling strength is low, or the frequencies far apart, the model will display asynchronous behavior³⁷.

We explored three qualitatively distinct dynamical regimes of the Kuramoto model in the parameter error task. They were: a fully synchronized regime, an asynchronous regime, and a multi-frequency regime that has timescale separation between the oscillators but otherwise demonstrates phase locking (Fig. 5).

To obtain each regime, the natural frequencies and coupling strengths were varied, with the natural frequencies being drawn from a uniform distribution between $-1$ and 1. For the multi-frequency regime, the high frequency oscillator’s natural frequency was set to $z (3.0 + w)$ where w is a uniformly distributed random number between 0 and 1 and z is equal to $-1$ or $+1$ with equal probability. Three realizations of each regime were produced to use as ground truth in the shared test procedure. The parameters for each regime are detailed in Table 1.

Table 1 Standard Kuramoto model parameters for each dynamical regime.

Full size table

As the Kuramoto model was also being used within the hybrid RC, and therefore needed to be integrated upon every iteration, we used a transformed version of the system of equations to coincide with the x, y phase component projection that the reservoirs were using. Each of the oscillator phase variables $\theta _i$ was transformed into phase components, x$_i=\cos \left( \theta _i\right)$ and $y_i=\sin \left( \theta _i\right)$. Under this transformation, the Kuramoto model is described by a pair of ODEs for each oscillator, one for each phase component:

$$\begin{aligned} \frac{{{\,\textrm{d}\,}}x_i}{{{\,\textrm{d}\,}}t}&=-\omega _iy_i-\frac{Ky_i}{N}\sum ^{N}_{j=1}\left( y_jx_i-x_jy_i\right) ,\\\frac{{{\,\textrm{d}\,}}y_i}{{{\,\textrm{d}\,}}t}&=\omega _ix_i+\frac{Kx_i}{N}\sum ^{N}_{j=1}\left( y_jx_i-x_jy_i\right) . \end{aligned}$$

(8)

Models

The standard and hybrid RC’s were initialized with a set of baseline parameters and whichever parameter was being varied was set according to the parameter sweep being conducted (Table 2). For the parameter error task, the hybrid RC’s expert ODE model, and therefore the control ODE model, was the same as the ground truth Kuramoto model. To introduce the parameter error, we added multiplicative error to the coupling strength and to each of the natural frequencies:

$$\begin{aligned} p \rightarrow (1 + \xi )p, \end{aligned}$$

(9)

with p standing in for the parameters and $\xi$ sampled independently for each parameter from a normal distribution, $\text{Normal}(0,\sigma ^2_K)$ for coupling strengths and $\text{Normal}(0,\sigma ^2_\omega )$ for natural frequencies. The standard deviations $\sigma _K,\sigma _\omega$ were set according to the relevant parameter sweep (baseline 0.05, other settings are explored in the supplementary information, Supplementary Figs. S1 to S4). The errors were sampled independently for each instantiation.

Evaluation metric

To evaluate the quality of the trajectory predictions in the parameter error task, we used the normalized mean square error (NMSE)

$$\begin{aligned} \text {NMSE}\left( t\right) =\frac{\Vert {\bf{u}}\left( t\right) -{\bf{u}}^*\left( t\right) \Vert }{\left<\Vert {\bf{u}}\left( t\right) \Vert ^2\right>^\frac{1}{2}}, \end{aligned}$$

(10)

where ${\bf{u}}^*(t)$ is the prediction and ${\bf{u}}(t)$ the ground truth. This metric was used in previous work²⁹ to evaluate the valid time metric. For each test, the mean NMSE across the entire trajectory was computed, capturing the overall, long-term, agreement between the forecast and the ground truth.

Parameter sweep

A range of hyper parameters was investigated in the parameter error task (Table 2). Only the effects of the input scaling and spectral radius parameters are shown here; other results are presented in the supplementary information (Supplementary Figs. S1 to S7). To ensure that a wide range of model instantiations was tested, a random seed dependent on the parameter sweep index was used in the initialization.

Table 2 Parameter baselines and ranges for the parameter error task. The parameter sweep modifies individual parameters whilst holding the rest at the baseline setting.

Full size table

Residual physics task

In the residual physics task, we evaluated the ability of the hybrid RC to forecast trajectories of a ground truth whose dynamical equations are different from the expert ODE model. To do this we kept the expert ODE model the same as in the parameter error task, i.e., the standard Kuramoto oscillator network model, and used an extension with additional coupling terms to produce a ground truth with new dynamical regimes not accessible to the standard Kuramoto model. In this task, we again evaluated the performance of the standard and hybrid RCs with parameter sweeps, and compared the performance of both RCs to the hybrid RC’s expert ODE model alone. Subsequently, we conducted a grid search to investigate the potential of an optimized hybrid RC in comparison to an optimized standard RC.

Ground truth

We used an extended, bi-harmonic version of the Kuramoto oscillator network model that includes an extra harmonic in the nonlinear coupling term³⁰, given by

$$\begin{aligned} \frac{{{\,\textrm{d}\,}}\theta _i}{{{\,\textrm{d}\,}}t}=\omega _i+\frac{K}{N}\sum ^{N}_{j=1}\left[ \sin \left( \theta _j-\theta _i+\gamma _1\right) +a\sin \left( 2(\theta _j-\theta _i)+\gamma _2\right) \right] . \end{aligned}$$

(11)

This introduces a structural difference between the ground truth and the hybrid reservoir’s model; the extra harmonic allows for more complex dynamical regimes, including heteroclinic cycles and self-consistent partial synchrony³⁰. The regimes can be elicited by appropriate choice of the coupling phase shifts $\gamma _1$ and $\gamma _2$ and the scaling of the second harmonic a.

As with the parameter-error test, we used a set of qualitatively different dynamical regimes to evaluate the performance of the hybrid RC on this task. Four cases were chosen: synchronous, where oscillators are phase locked into one or more clusters, asynchronous, where no phase locking occurs, heteroclinic cycles, where partially synchronized clusters exist and individual oscillators travel between them sporadically, and self-consistent partial synchrony, where a consistent but rotating phase distribution is maintained without full synchronization (Fig. 6). For brevity, the self-consistent partial synchrony regime will be referred to as ‘partial synchrony’ in what follows. We were particularly interested in the heteroclinic cycles and partial synchrony regimes. Since the standard Kuramoto model does not demonstrate these behaviors, the hybrid reservoir must learn the structural difference between its expert ODE model and the ground truth. For this task, we sampled the natural frequencies $\omega _i$ from a Lorentzian distribution, $\omega _i\sim \text {Cauchy}(\mu , \Delta \omega )$, where $\mu$ is the center of the distribution, and $\Delta \omega$ is the width. The parameters for each regime³⁰ are shown in Table 3.

Table 3 Bi-harmonic Kuramoto model parameters for each dynamical regime. Varying the first harmonic phase shift, $\gamma _1$, produces the four different behaviors.

Full size table

Models

The standard and hybrid RCs were again initialized using the baseline parameters, and whichever parameters were being varied were set appropriately depending on the parameter sweep or grid search parameter set (Table 4, Fig. 7). For the residual physics task the standard Kuramoto model is used as the hybrid RC’s expert ODE model, and therefore the control ODE model. As such, neither model was the same as the ground truth system. We introduced parameter error to the coupling strength and to each of the natural frequencies in the same fashion as in the parameter error task.

Evaluation metric

For the residual physics task, we focus on the application of hybrid RCs to control, and therefore used the valid-time metric to assess short-term prediction quality in both the parameter sweep and the grid search. The valid time $t^*$ is the time during which the predicted trajectory has a NMSE less than some threshold $\epsilon$ in NMSE

$$\begin{aligned} t^*=\text {max}\left\{ t:\text {NMSE}(\tau )\le \epsilon , \forall \tau \le t\right\} . \end{aligned}$$

(12)

Throughout this task, we used $\epsilon =0.4$. This metric only reports the short term accuracy, and therefore does not provide information on the long term prediction quality of the models. For our application focus, where short time horizons are often all that is required, this provides a sufficient view of the performance.

Parameter sweep

We used the same method for the residual physics task that we did previously, for the parameter error task. However, here we varied a different set of parameters. Specifically, we varied the spectral radius, input scaling, regularization strength, and reservoir size (Table 4).

Table 4 Parameter baselines and ranges for the residual physics task. The parameter sweep modifies individual parameters whilst holding the rest at the baseline setting.

Full size table

Grid search

To explore the potential of ‘optimally’ tuned standard and hybrid RCs, a grid search was conducted in the residual physics task. We consider here only parameter tuning of the hybrid and standard RCs to maintain a consistent architecture and approach to ²⁹. Prior works have explored optimization of the particular combination of model-based and data-driven components in the hybrid RC³⁸. High and low parameter settings for the input scaling, spectral radius, and regularization strength were chosen from the results of the parameter sweeps. In particular, these were selected to approximately range the optimum region identified for each parameter, across both the standard and hybrid RC results. A set of all of the combinations of these parameters was produced, labeled from A to H, at each vertex of a cube in parameter space (Fig. 7). For each parameter combination, we followed the shared evaluation method as for both of the parameter sweeps. 40 instantiations of standard and hybrid reservoirs were produced and then trained and tested on ground truth data from three of the four regimes in the residual physics task. These were the synchronous, heteroclinic cycles, and partial synchrony regimes.

Results

In this section we demonstrate the capability of the hybrid RC to predict NLONs in various circumstances. We first present results from the parameter error task. Across three qualitatively different dynamical regimes, we compare the mean performance of multiple standard and hybrid RCs. As a control, we test the hybrid RC’s expert model, the base ODE model, which is also subject to parameter error. We show parameter sweeps of spectral radius and input scaling. Second, we give the results of the residual physics task, where the hybrid RC is given a reduced form of the ground truth bi-harmonic Kuramoto model. Parameter sweeps are presented for four dynamical regimes. First, we test synchronous and asynchronous regimes. These were present in the standard Kuramoto model, however the bi-harmonic synchronous regime can support multiple clusters. The other two regimes are heteroclinic cycles, and partial synchrony; both are inaccessible to the standard Kuramoto model. We test the effects of spectral radius, input scaling, regularization strength and reservoir size. Short-term prediction quality is evaluated to investigate viability for control applications. Finally, the results of a grid search optimization process across three parameters are presented, illustrating the potential of a tuned hybrid RC.

Parameter error

Our initial aim was to assess whether hybrid RCs are a viable architecture for surrogate modeling of NLONs by testing the long-term prediction quality across a range of parameters. We find that the performance of the hybrid RC is consistently better than the base ODE model or the standard RC. The hybrid RC achieves low error even when both of its constituent parts, the base ODE model and the standard RC, do not. This matches what was previously reported for the Lorenz system²⁹. Despite some variation due to parameter tuning, the hybrid RC often achieves near zero mean NMSE, particularly for the synchronous and multi-frequency regimes (Fig. 8a, d, c, f).

In the synchronous regime (Fig. 8a) there is an immediate departure from zero mean NMSE as the spectral radius crosses 1.0 for the standard RC. Notably, the performance of the hybrid RC also suffers here, becoming more variable; however, perhaps surprisingly, it recovers as the spectral radius is increased further. For the asynchronous (Fig. 8b) and multi-frequency regimes (Fig. 8c), the standard RC and ODE models perform equally poorly, and again a slight increase in the error variance for a spectral radius greater than 1.0 can be seen for both the standard and hybrid RC.

Changing the input scaling has little effect when predicting the synchronous (Fig. 8d) regime. However, for the asynchronous (Fig. 8e) and multi-frequency (Fig. 8f) regimes, increasing the input scaling improves the performance of the standard RC. Interestingly, the performance of the hybrid RC decreases with increasing input scaling for the asynchronous regime, but it still maintains an advantage over the standard RC throughout. In the multi-frequency regime, the performance of the hybrid RC is roughly constant, and the standard RC eventually matches it at an input scaling greater than 1.7. The standard RC performance is generally more variable still, however some standard RCs achieve zero mean NMSE under this parameter setting.

Residual physics

Parameter sweep

As in the parameter-error task we find that the hybrid RC generally performs better than the standard RC (Fig. 9), although this is more variable than in the parameter-error task and the performance of the two is more similar. This is true even on the out of domain heteroclinic cycles and partial synchrony regimes not accessible by the expert model alone. In the asynchronous case, the standard RC outperforms the hybrid RC, except for cases with spectral radii above 1.0. The base ODE model achieves higher valid times than the standard or hybrid RC across the partial synchrony regime sweeps (Fig. 9d, h, i, p), and occasionally for the asynchronous (Fig. 9b, f, j, n) and heteroclinic cycles regimes (Fig. 9c, g, k, o). It is clear however, that the long term prediction of the base ODE model is not accurate as it fails to capture the qualitative character of these regimes (Supplementary Figs. S11, S12, S13).

The spectral radius again has a strong effect on the performance of the standard and hybrid RC; furthermore, the effect is different for the two (Fig. 9a–d). Most notable is the divergence in behavior at and above a spectral radius of 1.0. There is a sharp decline in performance for the standard RC on the synchronous regime similar to that observed in the parameter error task (Fig. 8a). The standard RC achieves the maximum $250~\text{s}$ valid time with a spectral radius below 1.0 but this falls to $75~\text {s}$ for a spectral radius of 2.0. As in the parameter error task, the hybrid RC begins to fail at a spectral radius of 1.0 but then recovers as the spectral radius is increased further (a). On all the other regimes, (b–d), the hybrid RC shows the same recovery, this time after a steady decrease in valid time as the spectral radius increases to 1.0, followed by an increase back to near its maximum valid time. On these regimes optimum performance is reached for the standard and hybrid RC’s with a spectral radius of 0.0. The input scaling parameter (Fig. 9e–h) only affects the more complex dynamical regimes (f—asynchronous, g—heteroclinic cycles, h—partial synchrony), with maximum valid time for the standard and hybrid RC at minimum input scaling.

The regularization parameter (Fig. 9i–l) has a strong effect on the valid time of the standard and hybrid RC. High regularization in the synchronous regime greatly reduces the performance of the standard RC. In the asynchronous, heteroclinic cycles, and partial synchrony dynamical regimes, sufficiently low regularization appears to cause failure of the standard and hybrid RC. In all cases, the optimal regularization strength range is much broader for the hybrid RC than the standard RC.

Beyond requiring at least 100 nodes, the reservoir size (Fig 9m–p) has little effect on the valid time on any of the regimes. The hybrid RC benefits slightly from a larger reservoir (350 nodes) when predicting the heteroclinic cycles regime but it still outperforms the standard RC across the full range of reservoir sizes explored.

Grid search

In the final part of this study we investigate how easy it is to optimize hybrid RCs. We evaluated the performance of hybrid and standard RCs using the valid time metric in a grid search of the hyper parameters that were shown to most strongly affect performance (Fig. 9). We ran tests on three of the dynamical regimes (synchronous, heteroclinic cycles and partial synchrony), evaluating the hybrid RC and standard RC with parameter settings defined by the corners of a cube in parameter space, labeled A–G (Fig. 7).

As identified in the parameter sweep tests, the synchronous regime is the easiest to predict (Fig. 10, top row). The hybrid RC achieves near perfect valid times ($250~\text {s}$) across all parameter settings in the grid search. A high spectral radius (C, D, G, H) slightly reduces the performance of some hybrid reservoirs, but the majority still reach $250~\text {s}$ valid time. In contrast, the standard RC is strongly affected by a high spectral radius as observed earlier in the parameter sweeps, with many reservoirs achieving less than $50~\text {s}$ valid time (C, D, G, H). High regularization strength (B, F) also reduces the standard RC performance, but not as significantly as high spectral radius. Input scaling has little effect on the performance here as the chosen settings are only slightly above and below the optimum value found during the parameter sweeps for both the hybrid RC and standard RC (0.1). Optimal performance on the synchronous regime is reached with a low setting of all three hyper-parameters, with all standard and hybrid RCs reaching $250~\text {s}$ valid time (A, E).

In the heteroclinic cycles regime (Fig. 10, middle row), the hybrid RCs consistently outperform the standard RCs. The performance is poor however, with valid times on the order of 1.0 to $2.0~\text {s}$ (10 to 20 steps). The individual hybrid RCs reach higher valid times than the standard RCs in all cases except parameter setting C, with low regularization strength, high spectral radius, and low input scaling. A high spectral radius (C, D, G, H) significantly reduces the performance of the standard RC. The hybrid RC is less affected by this, aside from a noticeable increase in variance when regularization is high (B–D, and F–H). The maximum valid time reached by the hybrid RC is $2.085~\text {s}$, and for the standard RC it is $1.435~\text {s}$. This is a $45.8\%$ improvement for the ‘optimal’ hybrid RC over the ‘optimal’ standard RC.

The performance of the standard and hybrid RC is least favorable on the partial synchrony regime (Fig. 10, bottom row), achieving valid times only up to $1.0~\text {s}$ (10 steps). There is still a clear performance difference between the two RC models however. In their best parameter setting (A—low spectral radius, input scaling, and regularization), the performance of the two models is similar, but the hybrid RC has less variance across reservoir instantiations. A high spectral radius again results in poor performance for the standard RC, with many standard RCs achieving $0~\text {s}$ valid time (C, D, G, H). Comparing the statistics of the valid time across parameter settings, the mean hybrid RC performance is fairly stable and the performance of individual reservoirs tightly grouped, whereas the mean performance of the standard RC is at or near $0~\text {s}$ for all but parameter settings A, E, and F. The maximum valid time reached by the best hybrid RC is $0.920~\text {s}$, and for the best standard RC it is $0.925~\text {s}$. This is not an improvement over the standard RC, however the hybrid RC approaches this far more consistently than the standard RC.

Discussion

While the performance advantage of hybrid RC over standard RC and ODE prediction is known from previous work²⁹, we have extended this in two ways. First, we reproduced this result but applied to the important example of an oscillating system. Second, we introduced a new task, the residual physics task, to mimic situations where the interaction between oscillators is not fully known. This was a more stringent test than the parameter error task, however we still observe higher performance from the hybrid RC than the standard RC, although, using the valid time metric, the base ODE does better in some cases. Unlike the clear advantage seen in ²⁹, the benefit of the hybrid approach is thus more nuanced here. As soon as the ground truth regime becomes complex, the performance of the reservoirs is substantially degraded, dropping from $250~\text {s}$ valid time, to around $1~\text {s}$, which is low even compared to the valid times achieved on the chaotic Lorenz system ($\sim 10~\text {s}$²⁹), although these predictions may still be useful.

Notably, although both the standard and hybrid RC had low valid times on the heteroclinic cycles and partial synchrony regimes, the hybrid RC consistently outperformed the standard RC. This suggests that the hybrid RC captures some aspects of the residual physics, with, for example, a relative maximum performance gain of $45.8\%$ on the heteroclinic cycles regime. Although the valid times are low for these regimes, they may be sufficient for control applications with short time horizon requirements.

There are clear limitations of the valid time metric for any application requiring long term, qualitative prediction accuracy, such as attractor reconstruction tasks. This is evident in the qualitative differences between the base ODE and the hybrid RC’s trajectory predictions for asynchronous, heteroclinic cycles, and partial synchrony regimes (Supplementary Figs. S11, S12, and S13), which do not align with their relative performance with valid time. The ODE quickly falls into a synchronous regime, predicting synchronized, slow trajectories at odds with the ground truth for the asynchronous, heteroclinic cycles, or partial synchrony regimes. The addition of the reservoir in the hybrid RC changes this, giving oscillations that much better capture the gross features of the ground truth behavior. In the heteroclinic cycles regime, the hybrid RC more accurately recreates the underlying frequency of the main cluster, and produces oscillators that briefly leave this attracting group. In the partial synchrony regime, the hybrid RC again more closely reproduces the underlying oscillation frequency of the distribution. However it struggles to capture some complex features of these regimes, such as oscillator departure timing in the heteroclinic cycles regime, and the distributional behavior of the partial synchrony regime. When they fail, both the standard and hybrid RC can produce noisy, high-frequency oscillations, or steady state trajectories (Supplementary Fig. S16). As such, the robustness of the hybrid RC may be useful in safety critical applications, where oscillation decay or growth could be catastrophic.

Over-fitting is a risk for any data-based modeling but is a particular hazard when using models for control. The standard RC requires careful tuning of the regularization strength to avoid this whereas the hybrid RC does not, even when using an expert model missing substantial components. This is particularly true for the heteroclinic cycles and partial synchrony regimes, where the performance of the hybrid RC is good across a broader range of regularization strengths, above and below the optimum value required for the standard RC. The output matrix of the hybrid RC balances attention between the reservoir activity and the expert model’s predictions. By providing extra states for weight assignment containing some correct features of the ground truth, the hybrid RC reduces training data influence and over-fitting risk. This is a delicate, task dependent, balance, as evidenced by Fig. 9b, f, j, n, where the standard RC outperforms the hybrid RC, but the hybrid RC performance increases with increasing regularization strength. Inspection of the predicted trajectories for this regime (Supplementary Fig. S11), indicates the hybrid RC may be biased too far towards the synchronous behavior of its expert model with the baseline regularization setting.

We have shown that the scaling of the reservoir weights has a strong effect on prediction quality for the standard RC; the hybrid RC is also affected, but in a different and less pronounced way. The spectral radius, controlling the scale of the internal connectivity matrix, is particularly influential. This is a well-documented phenomenon for a standard RC^39,40. The scaling parameters are tied to prediction quality through their impact on the RC’s Lyapunov exponents, and conditional Lyapunov exponents (CLEs). These are features of dynamical systems that determine the structure of phase space, and any constituent attractors. Primarily, they are critical for the satisfaction of the echo state property, a fundamental property for the function of an RC that imbues a reservoir with a fading memory and capacity for generalized signal-induced synchronization^25,41. Heuristically, it has been suggested that a spectral radius below one is required to have the echo state property, however this is neither necessary nor sufficient³⁵. The spectral radius is however correlated with the magnitude of the RC’s CLEs⁴², and we observe failure of the standard RC for spectral radii greater than one in accordance with this.

The hybrid RC does not respond in the same way. Lyapunov exponents and CLEs are also key for successful attractor reconstruction^42,43. Although our performance metric is reporting short-term prediction accuracy and not the long-term ergodic property accuracy, the attractor climate, we suggest that the hybrid RC’s divergent response to increasing spectral radius could be a result of a different correspondence between spectral radius and the CLEs leading to successful attractor reconstruction where the standard RC has failed.

For optimum performance, a spectral radius of zero, corresponding to no memory of past states and no coupling between reservoir nodes, is most effective. This is unsurprising as the ground truth system is first order, and all state information is available to the reservoirs; all the information required for the next step can be obtained from the current step alone.

The scaling parameters have also been implicated in the connection between RC and delay embedding^31,44. The high performance of both standard and hybrid RCs at low spectral radius suggests that they do not require a long memory for the tasks considered here. To investigate whether the reservoirs might benefit from richer representations of fewer past states instead of long memory, we tested reservoirs that completed multiple internal updates per step. These “multi-step” reservoirs are similar to the “drifting-state” reservoirs in ⁴⁵ except only the final internal state is used for output computation, after multiple non-linear updates have been applied to form a more complex representation. We found no consistent effect on performance across various dynamical system forecasting tasks (Supplementary Fig. S17).

The behavior of the standard and hybrid RC in response to parameter changes is similar across the three non-synchronous regimes. A further test conducted with a slower version of the asynchronous regime resulted in near-zero valid times for both models across all parameters (Supplementary Fig. S18). We believe this was due to the slow, highly correlated, dynamics (Supplementary Fig. S15) creating a training span in a distinct region of phase space from the test spans (Supplementary Fig. S19).

The improved performance of the hybrid RC over the standard RC comes at the cost of increased computational complexity. Further work must evaluate, on a case-by-case basis, whether the performance improvement justifies the cost, and whether the computational cost alone rules out chosen control applications. Extending the hybrid RC architecture to include recent developments in reservoir computing^46,47,48 may help to reduce this cost.

Data availability

The data that support the findings of this study are available in the Research Data Repository of the University of Bristol, https://data.bris.ac.uk/data/, at https://doi.org/10.5523/bris.3ho1s7lix2qvi2wm24fa7nwamu.

Code availability

The code used to generate and process the results of this study are available under an MIT licence on Github (https://github.com/AJS50/Hybrid_RC_for_NLONS_paper_code) as well as in the Research Data Repository of the University of Bristol, https://data.bris.ac.uk/data/, at https://doi.org/10.5523/bris.3ho1s7lix2qvi2wm24fa7nwamu.

References

Bick, C., Panaggio, M. J. & Martens, E. A. Chaos in Kuramoto oscillator networks. Chaos 28, 071102 (2018).
Article ADS MathSciNet PubMed Google Scholar
Dörfler, F. & Bullo, F. Synchronization in complex networks of phase oscillators: A survey. Automatica 50, 1539–1564 (2014).
Article MathSciNet Google Scholar
Iatsenko, D., Petkoski, S., McClintock, P. & Stefanovska, A. Stationary and traveling wave states of the Kuramoto model with an arbitrary distribution of frequencies and coupling strengths. Phys. Rev. Lett. 110, 064101 (2013).
Article ADS CAS PubMed Google Scholar
Dörfler, F., Chertkov, M. & Bullo, F. Synchronization in complex oscillator networks and smart grids. Proc. Natl. Acad. Sci. 110, 2005–2010 (2013).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Skardal, P. S. & Arenas, A. Control of coupled oscillator networks with application to microgrid technologies. Sci. Adv. 1, e1500339 (2015).
Article ADS PubMed PubMed Central Google Scholar
Liu, T., Song, Y., Zhu, L. & Hill, D. J. Stability and control of power grids. Annu. Rev. Control Robot. Auton. Syst. 5, 689–716 (2022).
Article Google Scholar
Ren, X. et al. Cardiac muscle cell-based coupled oscillator network for collective computing. Adv. Intell. Syst. 3, 2000253 (2021).
Article ADS Google Scholar
Elowitz, M. B. & Leibler, S. A synthetic oscillatory network of transcriptional regulators. Nature 403, 335–338 (2000).
Article ADS CAS PubMed Google Scholar
Goldbeter, A. et al. From simple to complex oscillatory behavior in metabolic and genetic control networks. Chaos 11, 247–260 (2001).
Article ADS CAS PubMed Google Scholar
Tsakalis, K., Chakravarthy, N. & Iasemidis, L. Control of epileptic seizures: Models of chaotic oscillator networks. In Proceedings of the 44th IEEE Conference on Decision and Control. 2975–2981 (IEEE, 2005).
Lozano, A. M. et al. Deep brain stimulation: Current challenges and future directions. Nat. Rev. Neurol. 15, 148–160 (2019).
Article PubMed PubMed Central Google Scholar
Holt, A. B. & Netoff, T. I. Origins and suppression of oscillations in a computational model of Parkinson’s disease. J. Comput. Neurosci. 37, 505–521 (2014).
Article MathSciNet PubMed PubMed Central Google Scholar
Arlotti, M., Rosa, M., Marceglia, S., Barbieri, S. & Priori, A. The adaptive deep brain stimulation challenge. Parkinson. Relat. Disord. 28, 12–17 (2016).
Article Google Scholar
Santaniello, S., Fiengo, G., Glielmo, L. & Grill, W. M. Closed-loop control of deep brain stimulation: A simulation study. IEEE Trans. Neural Syst. Rehabil. Eng. 19, 15–24 (2010).
Article PubMed Google Scholar
Schwenzer, M., Ay, M., Bergs, T. & Abel, D. Review on model predictive control: An engineering perspective. Int. J. Adv. Manuf. Technol. 117, 1327–1349 (2021).
Article Google Scholar
Moerland, T. M., Broekens, J., Plaat, A. & Jonker, C. M. Model-based reinforcement learning: A survey. Found. Trends Mach. Learn. 16, 1–118 (2023).
Article Google Scholar
Göring, N., Hess, F., Brenner, M., Monfared, Z. & Durstewitz, D. Out-of-domain generalization in dynamical systems reconstruction. In Proceedings of the 41st International Conference on Machine Learning, ICML’24 (JMLR.org, 2024).
Karniadakis, G. E. et al. Physics-informed machine learning. Nat. Rev. Phys. 3, 422–440 (2021).
Article Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Cho, K., van Merriënboer, B., Bahdanau, D. & Bengio, Y. On the properties of neural machine translation: Encoder–decoder approaches. In Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation. 103–111 (Association for Computational Linguistics, 2014).
Vermaak, J. & Botha, E. Recurrent neural networks for short-term load forecasting. IEEE Trans. Power Syst. 13, 126–132 (1998).
Article ADS Google Scholar
Schäfer, A. M. & Zimmermann, H. G. Recurrent neural networks are universal approximators. Int. J. Neural Syst. 17, 253–263 (2007).
Article PubMed Google Scholar
Gajamannage, K., Jayathilake, D. I., Park, Y. & Bollt, E. M. Recurrent neural networks for dynamical systems: Applications to ordinary differential equations, collective motion, and hydrological modeling. Chaos 33, 013109 (2023).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Jaeger, H. The “echo state” approach to analysing and training recurrent neural networks. In GMD Technical Report 148 (German National Research Center for Information Technology, 2001).
Maass, W., Natschläger, T. & Markram, H. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Comput. 14, 2531–2560 (2002).
Article PubMed Google Scholar
Bollt, E. On explaining the surprising success of reservoir computing forecaster of chaos? The universal machine learning dynamical system with contrast to VAR and DMD. Chaos 31, 013108 (2021).
Article ADS MathSciNet PubMed Google Scholar
Yu, Z. et al. Tapered whisker reservoir computing for real-time terrain identification-based navigation. Sci. Rep. 13, 5213 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Hauser, H. Physical reservoir computing in robotics. In Reservoir Computing: Theory, Physical Implementations, and Applications (Nakajima, K. & Fischer, I. Eds.) . 169–190 (Springer Singapore, 2021).
Pathak, J. et al. Hybrid forecasting of chaotic processes: Using machine learning in conjunction with a knowledge-based model. Chaos 28, 041101 (2018).
Article ADS MathSciNet PubMed Google Scholar
Clusella, P., Politi, A. & Rosenblum, M. A minimal model of self-consistent partial synchrony. New J. Phys. 18, 093037 (2016).
Article ADS Google Scholar
Gauthier, D. J., Bollt, E., Griffith, A. & Barbosa, W. A. Next generation reservoir computing. Nat. Commun. 12, 1–8 (2021).
Article Google Scholar
Zhang, Y. & Cornelius, S. P. Catch-22s of reservoir computing. Phys. Rev. Res. 5, 033213 (2023).
Article CAS Google Scholar
Srinivasan, K. et al. Parallel machine learning for forecasting the dynamics of complex networks. Phys. Rev. Lett. 128, 164101 (2022).
Article ADS MathSciNet CAS PubMed Google Scholar
Li, X. et al. Higher-order granger reservoir computing: Simultaneously achieving scalable complex structures inference and accurate dynamics prediction. Nat. Commun. 15, 2506 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Yildiz, I. B., Jaeger, H. & Kiebel, S. J. Re-visiting the echo state property. Neural Netw. 35, 1–9 (2012).
Article PubMed Google Scholar
Lu, Z. et al. Reservoir observers: Model-free inference of unmeasured variables in chaotic systems. Chaos 27, 041102 (2017).
Article ADS PubMed Google Scholar
Strogatz, S. H. From Kuramoto to Crawford: Exploring the onset of synchronization in populations of coupled oscillators. Phys. D Nonlinear Phenom. 143, 1–20 (2000).
Article ADS MathSciNet Google Scholar
Duncan, D. & Räth, C. Optimizing the combination of data-driven and model-based elements in hybrid reservoir computing. Chaos 33, 103109 (2023).
Article ADS MathSciNet PubMed Google Scholar
Venayagamoorthy, G. K. & Shishir, B. Effects of spectral radius and settling time in the performance of echo state networks. Neural Netw. 22, 861–863 (2009).
Article PubMed Google Scholar
Jiang, J. & Lai, Y.-C. Model-free prediction of spatiotemporal dynamical systems with recurrent neural networks: Role of network spectral radius. Phys. Rev. Res. 1, 033056 (2019).
Article CAS Google Scholar
Pecora, L. M. & Carroll, T. L. Synchronization of chaotic systems. Chaos 25, 097611 (2015).
Article ADS MathSciNet PubMed Google Scholar
Hart, J. D. Attractor reconstruction with reservoir computers: The effect of the reservoir’s conditional Lyapunov exponents on faithful attractor reconstruction. Chaos 34, 043123 (2024).
Article ADS MathSciNet PubMed Google Scholar
Pathak, J., Lu, Z., Hunt, B. R., Girvan, M. & Ott, E. Using machine learning to replicate chaotic attractors and calculate Lyapunov exponents from data. Chaos 27, 121102 (2017).
Article ADS MathSciNet PubMed Google Scholar
Duan, X.-Y. et al. Embedding theory of reservoir computing and reducing reservoir network using time delays. Phys. Rev. Res. 5, L022041 (2023).
Article CAS Google Scholar
Sakemi, Y., Morino, K., Leleu, T. & Aihara, K. Model-size reduction for reservoir computing by concatenating internal states through time. Sci. Rep. 10, 21794 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Jaurigue, L., Robertson, E., Wolters, J. & Lüdge, K. Reservoir computing with delayed input for fast and easy optimisation. Entropy 23, 1560 (2021).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Köster, F., Patel, D., Wikner, A., Jaurigue, L. & Lüdge, K. Data-informed reservoir computing for efficient time-series prediction. Chaos 33, 073109 (2023).
Article ADS MathSciNet PubMed Google Scholar
Ma, H., Prosperino, D. & Räth, C. A novel approach to minimal reservoir computing. Sci. Rep. 13, 12970 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

AS is funded by a UK Research and Innovation grant (EP/S022937/1). This work was carried out using the computational facilities of the Advanced Computing Research Centre, University of Bristol - http://www.bris.ac.uk/acrc/.

Author information

Conor Houghton, David Barton and Martin Homer contributed equally to this work.

Authors and Affiliations

School of Computer Science, University of Bristol, Bristol, BS8 1UB, UK
Andrew Shannon
School of Engineering Mathematics and Technology, University of Bristol, Bristol, BS8 1TW, UK
Conor Houghton, David A. W. Barton & Martin Homer

Authors

Andrew Shannon
View author publications
Search author on:PubMed Google Scholar
Conor Houghton
View author publications
Search author on:PubMed Google Scholar
David A. W. Barton
View author publications
Search author on:PubMed Google Scholar
Martin Homer
View author publications
Search author on:PubMed Google Scholar

Contributions

AS, CH, DB and MH: conceptualization, methodology, writing—review & editing. CH, DB and MH: project management. AS: investigation, formal analysis, visualization, writing—original draft preparation.

Corresponding author

Correspondence to Andrew Shannon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shannon, A., Houghton, C., Barton, D. et al. Modeling nonlinear oscillator networks using physics-informed hybrid reservoir computing. Sci Rep 15, 22497 (2025). https://doi.org/10.1038/s41598-025-03957-x

Download citation

Received: 25 February 2025
Accepted: 23 May 2025
Published: 02 July 2025
DOI: https://doi.org/10.1038/s41598-025-03957-x

Subjects

Abstract

Similar content being viewed by others

Novel nondelay-based reservoir computing with a single micromechanical nonlinear resonator for high-efficiency information processing

A novel approach to minimal reservoir computing

Efficient optimisation of physical reservoir computers using only a delayed input

Introduction

Methods

Reservoir computing

Hybrid reservoir computing

Shared test procedure

Model initialization and training

Standard reservoirs

Hybrid Reservoirs

Phase component transformation

Parameter error task

Ground truth

Models

Evaluation metric

Parameter sweep

Residual physics task

Ground truth

Models

Evaluation metric

Parameter sweep

Grid search

Results

Parameter error

Residual physics

Parameter sweep

Grid search

Discussion

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links