A deep learning approach to real-time Markov modeling of ion channel gating

Oikonomou, Efthymios; Juli, Yannick; Kolan, Rajkumar Reddy; Kern, Linda; Gruber, Thomas; Alzheimer, Christian; Krauss, Patrick; Maier, Andreas; Huth, Tobias

doi:10.1038/s42004-024-01369-y

Download PDF

Article
Open access
Published: 30 November 2024

A deep learning approach to real-time Markov modeling of ion channel gating

Communications Chemistry volume 7, Article number: 280 (2024) Cite this article

4593 Accesses
1 Citations
11 Altmetric
Metrics details

Subjects

Abstract

The patch-clamp technique allows us to eavesdrop the gating behavior of individual ion channels with unprecedented temporal resolution. The signals arise from conformational changes of the channel protein as it makes rapid transitions between conducting and non-conducting states. However, unambiguous analysis of single-channel datasets is challenging given the inadvertently low signal-to-noise ratio as well as signal distortions caused by low-pass filtering. Ion channel kinetics are typically described using hidden Markov models (HMM), which allow conclusions on the inner workings of the protein. In this study, we present a Deep Learning approach for extracting models from single-channel recordings. Two-dimensional dwell-time histograms are computed from the idealized time series and are subsequently analyzed by two neural networks, that have been trained on simulated datasets, to determine the topology and the transition rates of the HMM. We show that this method is robust regarding noise and gating events beyond the corner frequency of the low-pass filter. In addition, we propose a method to evaluate the goodness of a predicted model by re-simulating the prediction. Finally, we tested the algorithm with data recorded on a patch-clamp setup. In principle, it meets the requirements for model extraction during an ongoing recording session in real-time.

Single shot detection of alterations across multiple ionic currents from assimilation of cell membrane dynamics

Article Open access 12 March 2024

Artificial neural network model for predicting changes in ion channel conductance based on cardiac action potential shapes generated via simulation

Article Open access 09 April 2021

Bottom-up design of Ca²⁺ channels from defined selectivity filter geometry

Article 22 October 2025

Introduction

Since the groundbreaking work of Hodgkin and Huxley¹, who established a stunningly prescient model on how voltage-gated membrane conductance for Na⁺ and K⁺ shapes the trajectory of an action potential, understanding the gating behavior of the predicted ion channels has become a major scientific endeavor arguably culminating in the advent of the patch-clamp technique². In its single-channel configuration, this method enables direct recordings of discrete movements of channel protein moieties associated with rapid transitions between conducting (open) and non-conducting (closed) states, thereby giving unparalleled insights into gating mechanisms at high temporal resolution³. Typically, hidden Markov models (HMM)⁴ are used to describe the underlying kinetics of ion channel gating^5,6,7,8, which can be employed to deduce structure-function relationships.

One approach, among many, to infer the underlying HMM from a single-channel patch-clamp recording is the two-dimensional dwell-time histogram (2D-histogram) analysis⁹. It builds on the idealization of single-channel time series, that is, estimating in which conducting state (open or closed) the ion channel is at any given time. From the idealized time series, the durations of neighboring open and closed intervals (dwell times) are paired as tuples and are accumulated in the 2D-histogram. It has been shown that 2D-histograms contain all the necessary information to infer the underlying HMM of the recorded ion channel⁸. A major issue with approaches that encompass the idealization of the single-channel recording is their general susceptibility to noise and artifacts introduced by the limited bandwidth of the recording setup. Despite a number of advancements that have been made to improve the quality of the recordings^10,11, noise is still the major limitation for analysis. Therefore, a low-pass filter is usually employed, which has the drawback of further reducing the bandwidth. In the end, the interplay of noise and bandwidth compromises the idealization process and especially the detection of fast gating events. Considering these effects, the temporal resolution was partially extended¹². As an alternative, the direct analysis of single-channel recordings without idealization has also been explored^13,14,15,16.

A considerable improvement to 2D-histogram analysis came with the introduction of simulations of single-channel time series^17,18, where errors made during idealization of the recorded time series occur similarly in the simulated one. Therefore, errors partially cancel out in an iterative process of comparing both histograms to deduce the underlying HMM. Thereby, the modeling process becomes very robust regarding noise and fast gating. An improved version¹⁹ was later used to model the interaction of chloramine-T with the neuronal ion channel Nav1.2a²⁰. In our recent study, we have demonstrated the superior performance of Markov modeling 2D-histograms using simulations²¹. High-Performance Computing (HPC) enabled us to manage the tremendous computational requirements of this approach and make the fit very accurate. With HPC, it is now possible to explore the minimum signal-to-noise ratio (SNR), recording bandwidth, and number of recorded gating events required for successful modeling. Nevertheless, the modeling process is limited by the available HPC resources and requires a certain amount of hands-on time to configure the algorithm. In this study, we present the solution to the latter problems by developing deep neural networks (NNs) using time-series simulations. We provide data showing that the artificial intelligence (AI) approach can compete with the previous 2D-Fit. In principle, after training the networks, this approach will enable online Markov modeling of single-channel patch-clamp recordings.

Methods

Simulating 2D-histograms

The process of transitioning between open and closed states of ion channels is termed gating. It is assumed to be an ideal process with the channels exclusively being either in a conducting or a non-conducting state. Transitions between states are considered to be instant on the relevant time scale. Experimentally recorded time series of single ion channels are distorted by noise and the transitions are affected by the low-pass filter of the setup as well as by the recording bandwidth of the amplifier. For the simulation of time series, a HMM is designated by the user. It is defined by a topology, which encompasses a number of open and closed states with given connections, and rates governing the transition between the states in a stochastic process^14,21. First, an ideal time series is created with the HMM, meaning that samples are assigned to either an open or closed state for each sampling interval. According to the given conductances of the open and closed states, the current amplitudes are assigned. Then, a step response emulating the effects of the low-pass filter is applied to each transition between the current levels. Finally, to obtain a signal with the desired SNR, noise with an appropriate amplitude is added to the time series. The SNR is defined as

$${{\rm{SNR}}}=\frac{I}{\sigma }$$

(1)

with I being the current amplitude (difference between the open and closed current level) and σ the standard deviation of the noise. In this study, we did not account for open channel noise²² and assume the same σ for both the open and closed states. Unless otherwise stated, an SNR = 5 was used throughout the manuscript. The application of the step response and noise generation are described below.

In order to compute the two-dimensional dwell-time histograms (2D-histograms) used for training the NNs, the simulated time series are idealized using the higher-order Hinkley detector (HOHD)^23,24. The HOHD takes the current amplitudes of the open and closed levels, as well as the SNR, as input. It computes higher-order integrals to derive a score, which is compared to an SNR-dependent threshold for event detection. Note, that the initial ideal time series generated with the HMM might deviate significantly from the idealized time series after application of the HOHD due to effects imposed by noise and filtering. Dwell-times of neighboring open and closed events are combined in tuples and assembled in a logarithmically binned 2D-histogram having a resolution of 60 × 60 bins with 10 bins per decade and ranging from 10 µs to 10 s. Since the time series used in this study are sufficiently long (more than 1 million samples), we assume detailed balance (microscopic reversibility), enabling us to use both: open to closed and closed to open dwell-time pairs in our 2D-histograms^25,26,27. The resulting datasets (Table 1) are stored as NumPy arrays²⁸.

Table 1 Training datasets

Full size table

The datasets were simulated on the Erlangen National High Performance Computing Center (NHR@FAU) parallel cluster “Fritz“, with each computing node containing two Intel Xeon Platinum 8360Y “Ice Lake” processors (36 cores per chip) running at a base frequency of 2.4 GHz, 54 MB shared L3 cache per chip and 256 GB of DDR4 RAM. The time consumption for generating training datasets is stated in Table 2.

Table 2 Time benchmarks for simulation of training data, training, inference, and model estimation with the 2D-Fit

Full size table

Application of the step response

In the simulation process, the rectangular gating events of the ideal time series are replaced with the step response function emulating the effect of the low-pass filter and recording bandwidth of the recording system. The 2D-Fit implements two options for the step response. The first is the digital step response that was generated using a 4-pole low-pass Bessel filter function with the corner frequency set to 10 kHz from the Python library SciPy²⁹. The second is the experimental step response that was recorded on the patch-clamp setup, similar to ref. ³⁰. We recorded 1000 step responses and computed their ensemble average. The step responses were recorded at 100 kHz, with the gain set to 100 mV/pA, and the low-pass filter corner frequency set to 10 kHz. The resulting step response is 45 samples long.

Generation of noise

After applying the step response to the ideal time series, a noise time series is superimposed. As for the step response, the 2D-Fit has two options for noise simulation. The first uses white noise that is subsequently filtered with a 4-pole digital low-pass Bessel filter at 10 kHz using SciPy²⁹ and finally scaled to the desired SNR. For the second option, in order to reduce the mismatch between the digitally simulated data and real experimental recordings, noise is generated from a given power spectrum, as previously described³¹. The authors proposed to add a random phase to each point of a power spectrum to compute a randomized noise series using the inverse Fourier transformation. We implemented this feature into the 2D-Fit, using the Cooley-Tukey algorithm of the fast Fourier transform (FFT) and inverse fast Fourier transform (IFFT), which is an efficient implementation of the Fourier and inverse Fourier transformation³². The C++ source code for the Cooley-Tukey algorithm was kindly provided by Anda Ouyang (https://github.com/AndaOuyang/FFT). In total, more than 10 h of noise was recorded for the computation of the spectra, using a patch-clamp setup with an Axopatch 200B amplifier (Molecular Devices). Two datasets were acquired: One using the patch resistance of an Axon cell model (10 GΩ, Molecular Devices) and another with the bath resistance (10 MΩ). Additionally, to acquire a smooth power spectrum, we split the noise recordings of each set into 63 segments, computed the power spectrum of each segment, and then formed the ensemble average. With the underlying power spectrum, we are able to simulate noise time series with a length of 10 M samples. Finally, we acquired a noise power spectrum from a patch-clamp recording of a real cell. The recording took place at room temperature using a patch-clamp setup consisting of a CV 203BU headstage, an Axopatch 200B amplifier, and an Axon Digidata 1550B digitizer (all instruments from Molecular Devices). The time series were recorded using pCLAMP v11.2 (Molecular Devices). Borosilicate glass pipettes with filament (Science Products) were pulled on a DMZ-Universal Puller (Zeitz-Instruments) with a tip resistance of 21 MΩ. The patch-clamp data was collected from a HEK 293 T cell using the voltage-clamp configuration with near-physiological sodium and potassium ion gradients. After establishing a Gigaseal, the patch was excised, and the pipette was moved just beneath the surface of the bath solution. The time series was recorded at −90 mV, close to the equilibrium potential of potassium, at a sampling frequency of 100 kHz with the output gain set to 100 mV/pA and the built-in low-pass filter set to 10 kHz.

Deep NN architectures, training, and evaluation

The NNs were trained on the datasets listed in Table 1. These datasets contain a number of 2D-histograms generated from different HMMs with randomly assigned transition rates drawn from a logarithmic distribution. The datasets were split into training, validation, and test data. During training, the validation data was used for monitoring performance. The test data, which was not used during training, was exclusively used for generating the figure plots. It has to be mentioned that, as for any Deep Learning approach, the NNs can only make predictions about objects that lie within the parameter space spanned by the training dataset. Nevertheless, using our pipeline, it is possible to train NNs covering the parameters space that one defines.

Two main architectures were implemented using TensorFlow 2.7.0 (DOI: 10.5281/zenodo.4724125) as illustrated in Fig. 1: One for the determination of the topology (Fig. 1A) and one for estimation of the rates (Fig. 1B). Both architectures are adapted versions of the Inception-Resnet-V2 as originally proposed by ref. ³³. The architectures combine the technique of residual connections³⁴ to allow for training of deep architectures, and inception architectures³⁵. No batch normalization³⁶ was used, since a drop in predictive performance was observed when enabled. Additionally, the number of filters in every convolutional layer was reduced by a factor of 4, the final Global-Average-Pooling-Layer was replaced with a Flatten-Layer and Global-Max-Pooling-Layer for the regression and classification tasks, respectively. All layers were initialized using a uniform Glorot initialization³⁷ and all biases were initialized with zeros. We omitted the “stem” module, since it reduces the size of our 60 × 60 histograms too much, compared to the 299 × 299 sized images used in ref. ³³. Furthermore, no dropout or any other forms of regularization were used. For the topology classification NN (Fig. 1A), the final layer was a dense layer with 18 output nodes, corresponding to the 18 linear five-state topologies to be classified, and a “softmax” activation. For the rates estimation architecture (Fig. 1B), the final layer was replaced with a dense layer consisting of 8 output nodes, which corresponds to the number of rates in the five-state topologies to be estimated, followed by a linear activation. Finally, the Reduction-B module was replaced with a module that increases channel size without pooling (Channel-Increase, Fig. 1C).

**Fig. 1: Illustration of the neural network architectures used for Markov modeling.**

Training for all tasks was conducted using the Adam optimizer³⁸ with a starting learning rate of (1e-3) for the first epoch and all other parameters as proposed previously³⁸. For the regression and classification tasks, a global batch size of 1024 and 4096 were used, respectively. Unless mentioned otherwise, the learning rate was reduced by a factor of 0.1 after 8 epochs with no improvement in the validation loss, and an early stopping criterion was applied to terminate training after 12 epochs with no improvement in the validation loss. The selected loss functions were the categorical cross entropy and log-cosh for the classification and regression tasks, respectively.

As stated above, the bin width of the 2D-histograms was scaled logarithmically³⁹. Furthermore, for training, bin occupancy was transformed according to:

$${a'}_{ij}=\left\{\begin{array}{cc}2{log }_{10}({a}_{ij}), & {a}_{ij} \, > \, 0\\ 0, \hfill & {a}_{ij}=0\end{array}\right.$$

(2)

with a'_ij and a_ij being the bin occupancy of the initial and rescaled 2D-histograms, respectively, of the bin with coordinates (i,j). This provides predictions of comparable performance to the canonical square root transformation⁴⁰, while facilitating a faster convergence during training, likely due to compression of the occupancy of the 2D-histograms to a smaller range. Since the range of the rate constants k_ij that are to be estimated spans multiple decades, from 100 s⁻¹ to 1 Ms⁻¹, the labels for the regression task were also log-transformed for numerical stability of the training process.

The NNs were trained on the NHR@FAU parallel cluster “Alex“, with each node containing eight NVIDIA A100 (40 GB HBM2 @ 1555 GB/s; HGX board with NVLink). Multiple GPUs were used in parallel with distributed training and data parallelism, using the Tensorflow function tf.distribute.MirroredStrategy() with NcclAllReduce(). Time consumption for training the NNs and inference is stated in Table 2. The source code in its current stage (work in progress) is available at Zenodo (DOI: 10.5281/zenodo.12750594). Instructions on how to reproduce the data of this work and how to use the code for setting up experiments can be found in the Supplementary Methods.

Rearrangement of label arrays to facilitate training for symmetric topologies

The label arrays store the ground truth for the rate estimation, which is used by the NNs during training. For symmetric topologies, for example, the linear COCOC topology (which can be read forward and backward), there are two ways to define a model given a label array. For example, if the eight transition rates are mapped to the indexes of the label array as [k₁₂,k₂₁,k₂₃,k₃₂,k₃₄,k₄₃,k₄₅,k₅₄], then they can be rearranged in reverse like [k₅₄,k₄₅,k₄₃,k₃₄,k₃₂,k₂₃,k₂₁,k₁₂] and still define the same HMM. Without a unique definition for each model, the networks have difficulties to train properly. Therefore, it was enforced that k₁₂ > k₅₄, and in the case of k₁₂ < k₅₄ the array was rearranged as stated above.

Metrics for evaluating the performance of predicting the topology

The recall and precision are common metrics used to evaluate the performance of classification NNs. Let $D=\left\{{\mathrm{1,2}},...,18\right\}$ be the set of all indices of the topologies (classes). Then, ${r}_{i}=\frac{{n}_{{ii}}}{{K}_{i}}$ and ${p}_{i}=\frac{{n}_{{ii}}}{{L}_{i}}$ are the recall and precision of class with index $i\in D$, respectively, ${n}_{{ii}}$ the number of examples in the test dataset with ground truth $i$ classified as $i$ (correct classifications), ${K}_{i}$ the total number of examples of class $i$ in the test dataset, and ${L}_{i}$ the number of examples in the test dataset classified as $i$ by the NN. For evaluating the number of misclassifications, the analogous False Negative Rate (FNR) and False Discovery Rate (FDR) were used. They are defined as ${\text{FNR}}_{{ij}}=\frac{{n}_{{ij}}}{{K}_{i}}$ and ${\text{FDR}}_{{ij}}=\frac{{n}_{{ij}}}{{L}_{i}}$, respectively, with ${\text{FNR}}_{{ij}}$,${\text{FDR}}_{{ij}}$ being the FNR and FDR of the misclassification case that a class with index $i$ is classified as class $j\in {D\backslash }\{i\}$, and ${n}_{{ij}}$ being the number of examples in the test dataset with ground truth $i$ classified as $j$. The recall and FNR scores are related to the performance of the network from the developers’ perspective. The user (experimentalist) would rather be working with the precision and FDR scores since, in this case, they are equal to the posterior probability $P({y|x})$, with $x$ being the prediction of the NN and $y$ the ground truth, which indicates the probability of the class $y$ being the correct prediction given that the NN predicted class $x$.

Metric for evaluating the performance of estimating the rates

In our previous work,²¹ the mean absolute percentage error (MAPE) score was used to assess the predictive performance of the algorithm related to the rate constants estimation. In this study, we use the root absolute error (RAE) score instead

$${{\rm{RAE}}}=\sqrt{\left|{\log }_{10}(k_{Pr})-{\log }_{10}(k_{GT})\right|}$$

(3)

with k_Pr being the prediction and k_GT the ground truth. A graphical comparison of the two scores is depicted in Fig. 1D.

Computation of 2D-difference-histograms

When employing the proposed algorithm on real experimentally recorded data, the ground truth is unknown. Thus, the quality of the results cannot be evaluated directly. Nevertheless, an estimation of the predictions of the NNs without knowing the ground truth can be done using the difference histogram (2D_Diff) that is computed with the experimental histogram (2D_GT) and a histogram simulated using the prediction of the NNs (2D_Pr). The 2D-histograms 2D_GT and 2D_Pr are scaled in the same way as the training data using Eq. 2, and then 2D_Diff is computed according to the formula:

$${z}_{{ij}}=\sqrt{{x}_{{ij}}^{2}-{y}_{{ij}}^{2}},\, {x}_{{ij}}^{2}-{y}_{{ij}}^{2}\ge 0$$

(4)

and

$${z}_{{ij}}=-\sqrt{{y}_{{ij}}^{2}-{x}_{{ij}}^{2}},\, {x}_{{ij}}^{2}-{y}_{{ij}}^{2}\, < \, 0$$

with ${z}_{{ij}},{y}_{{ij}},{x}_{{ij}}$ being the bin occupancies of 2D_Diff, 2D_GT, and 2D_Pr, respectively, and $i,j$ the bin coordinates. We found that this representation allows for good visualization of differences in 2D_GT and 2D_Pr. Practically, the errors of bins with small occupancies are suppressed, and those with a high occupancy are enhanced.

Computation of the goodness of the predicted Markov model and uncertainty quantification of the corresponding transition rates

Based on Eq. 4, the normalized volume deviation ${V}_{D}$ between two 2D-histograms can be calculated

$${V}_{{{\rm{D}}}}({{\bf{S}}},{{\bf{M}}})=\frac{{\sum}_{i,j}\sqrt{\left|{s}_{{ij}}^{2}-{m}_{{ij}}^{2}\right|}}{{\sum}_{i,j}{s}_{{ij}}+{\sum}_{i,j}{m}_{{ij}}}$$

(5)

with ${V}_{{{\rm{D}}}}({{\bf{S}}},{{\bf{M}}})$ being the volume deviation between 2D-histograms ${{\bf{S}}}$ and ${{\bf{M}}}$, ${s}_{{ij}}$,${m}_{{ij}}$ the bin occupancy of ${{\bf{S}}}$ and ${{\bf{M}}}$, respectively, and $i,j$ the bin coordinates. Due to the normalization, ${V}_{{{\rm{D}}}}({{\bf{S}}},{{\bf{M}}})$ is constrained to the range [0,1], being ${V}_{{{\rm{D}}}}({{\bf{S}}},{{\bf{M}}})=0$ in case of a perfect match and ${V}_{{{\rm{D}}}}({{\bf{S}}},{{\bf{M}}})=1$ in case of no overlap.

However, each simulation as well as the experimental recordings encompass a certain variability due to their stochastic nature, the degree of which is dependent on the underlying HMM⁴¹. To quantify the goodness of a prediction and take into account the stochastic variability, $N$ time series are simulated using the predicted HMM and the 2D-histogram is calculated for each, obtaining $N$ 2D_Pr-histograms (${{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N}$). Then, the mean volume deviation ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N})$ between the 2D_GT-histogram ${{\bf{G}}}$ and each of the $N$ 2D_Pr-histograms ${{{\bf{H}}}}_{n}$, is calculated as

$${\bar{V}}_{{{\rm{D}}}}\left({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N}\right)=\frac{{\sum}_{n}{V}_{{{\rm{D}}}}\left({{\bf{G}}},{{{\bf{H}}}}_{n}\right)}{N}$$

(6)

with $n\in \left\{{{\mathrm{1,2}}},...,N\right\}$. Furthermore, to estimate the stochastic variability of the predicted model, the mean reference deviation ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N})$ between all $N$ 2D_Pr-histograms ${{{\bf{H}}}}_{n}$ is defined as

$${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N})=\frac{{\sum }_{n=1}^{N}{\sum }_{m=n+1}^{N}{V}_{{{\rm{D}}}}\left({{{\bf{H}}}}_{n},{{{\bf{H}}}}_{m}\right)}{N(N-1)/2}$$

(7)

with $n,m\in \left\{{{\mathrm{1,2}}},...,N\right\}$. The ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N})$ score can serve as a reference for ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{N})$ to estimate the quality of a prediction.

Finally, to obtain an estimate for the uncertainty quantification of the transition rates of the predicted model, the $N$ simulated 2D_Pr-histograms ${{{\bf{H}}}}_{n}$ are fed to the rates estimation NN to be re-predicted. This way, the parameter space around the initial prediction is explored, and a distribution, quantifying the uncertainty, for each transition rate can be obtained. Alternatively, scores based on the maximum likelihood score could also be defined^17,18,41.

Note that all of the above calculations are independent of the ground truth model and can be obtained from the 2D-histogram of an experimental time series and the predicted HMM.

Simulation of time series with the patch-clamp setup

One goal of this study was to evaluate the robustness of the NNs when dealing with experimental patch-clamp data. Time series were generated with the patch-clamp setup, as roughly outlined before⁴², to encompass the filter, bandwidth, and noise characteristics of “real” data. First, using a COCOC topology, ideal time series were simulated using the algorithm proposed before¹⁴. The ideal time series was then used as a command protocol for a voltage-clamp recording on an Axopatch 200B amplifier with a Digidata 1550B digitizer and Clampex 11.2 software (all from Molecular Devices). After compensation of capacitive artifacts, time series were recorded with a CV 203BU head stage connected to the bath resistance (10 MΩ) of a Patch-1U Model Cell (Molecular Devices). Higher resistances (Patch-configuration) could not be used for this purpose since significant capacitive artifacts would be introduced upon each voltage change in the command protocol. For analyzing time series of real cells, the cell-attached spectrum (Fig. 8B) should be used. Time series were recorded at a sampling frequency of 100 kHz, a gain of 100 mV/pA, and the low-pass filter of the amplifier set to 10 kHz. Capacitive feedback was enabled. Two sets of 100 time series each 10 s long were recorded and stored in a binary file format. Finally, 2D-histograms were computed using the HOHD²⁴ as implemented in the 2D-Fit²¹ and saved as NumPy files. The simulated training datasets have a current amplitude of 2000 arbitrary units (AU) with the baseline set at 22,000 AU, the open level at 20,000 AU and are idealized using these values. On the other hand, the semi-synthetic datasets were recorded at ~33,000 AU and ~44,000 AU (with small variations depending on the given SNR) for the baseline and open level, respectively, which were used for the idealization.

2D-Fit

Simulation of time series and generation of 2D-histograms was performed using an improved version of the 2D-Fit program²¹. Modifications are related to noise generation and the step response function as stated above. For the employed HPC resources, see above and Table 2. The 2D-Fit algorithm was also used for evaluating the performance of the NNs. All settings were as described previously²¹. The source code in its current stage (work in progress) is available at Zenodo (DOI: 10.5281/zenodo.12750594). Instructions on how to reproduce the data of this work and use the code for setting up experiments can be found in the Supplementary Methods.

Computation of inference benchmarks

The inference time was estimated in isolated environments created using Docker images provided by Intel (intel-optimized-tensorflow: 2.13-idp-base) and NVIDIA (nvcr.io/nvidia/tensorflow: 23.10-tf2-py3). These images are optimized for model inference on CPU and GPU, respectively. The results are summarized in Table 2. For inference of the single 2D-histograms the trained model was called directly with training set to False, while the 10,000 2D-histograms were fed to the trained NN in batches of 64 using the predict() function of the tensorflow.keras.Model class.

Data analysis

The analysis and visualization of data were performed using Origin PRO2023 (OriginLab Corp).

Results

Previously, we demonstrated the power of modeling single-channel patch-clamp recordings with two-dimensional dwell-time histograms (2D-histograms) using simulations (2D-Fit)²¹. 2D-histograms are an elegant way of squeezing rather large time series with varying lengths into a small and fixed-size data structure, which contains all necessary information to derive the underlying HMM⁸. For their computation, the time series are idealized using a jump detector, resulting in a train of consecutive “dwell-times” in either the open or closed state. The idealization has the additional advantage that noise, artifacts, and a drifting baseline can be handled during preprocessing. Then, neighboring open and closed dwell-times are paired and accumulated in the 2D-histogram. In our previous study, the underlying HMM was estimated in an iterative process. Single-channel time series were simulated using HMMs, and their transition rates were adjusted until the experimental and simulated 2D-histograms matched²¹. Different topologies had to be explored to find the overall best-corresponding model. The enormous computational effort was handled by utilizing high-performance computing (HPC).

In this study, using the simulation routine of the 2D-Fit, sets of time series covering the desired parameter space were simulated, and 2D-histograms were computed to serve as training data (Fig. 2). Thereby, deducing the HMM from patch-clamp time series effectively becomes a task of image classification and analysis. Figure 2 illustrates the steps involved in extracting the kinetic scheme of a given experimental time series using NNs. After the idealization of the recorded time series, the resulting 2D-histogram (experimental 2D-histogram) is fed into a two-stage analysis. In the first stage, the topology of the HMM is estimated with the topology-NN. The second stage consists of a set of NNs each trained on a single topology from the first stage. The experimental 2D-histogram is fed to the specific NN of the second stage corresponding to the estimated topology in order to predict the rates. We used modified versions of the Inception-Res-Net-V2 architecture³³ as illustrated in (Fig. 1A–C) for the NNs of both stages.

**Fig. 2: Flow chart of the proposed algorithm.**

NNs and simulation of training data

The training data for the NNs, a set of 2D-histograms, is derived from simulated time series. A smooth 2D-histogram that has a high bin occupancy with a low relative variation over neighboring bins is desirable for successful training. The stochastic variation per bin follows a Poisson distribution. Therefore, the relative errors ultimately depend on the number of gating events in the time series. For the experimental patch-clamp time series, the number of recorded events is limited by the gating behavior of the ion channel and the lifetime of the Gigaseal, ranging from minutes to tens of minutes. Additionally, the applied low-pass filter, in combination with the sampling rate, imposes a limit on obtained events per recording time. For the training data, the number of simulated events is limited by computational constraints such as available HPC resources. In our previous study, we analyzed the length of the time series for successful modeling with 2D-histograms²¹. Given typical gating behavior in the range of 10 s⁻¹ to 100 ks⁻¹, a length of at least 1 M samples (10 s at a sampling frequency of 100 kHz) was required to obtain meaningful results. Therefore, in this study, we decided to use time series consisting of 10 M samples (100 s simulated time), with the number of events in each varying strongly ranging from a few hundred up to more than 300,000, for training the NNs. The 2D-histograms computed from the data have a resolution of 60 × 60 with logarithmically scaled axes for both the open and closed dwell times, ranging from 10 µs to 10 s. The bin occupancy is scaled according to Eq. 2 to balance fast and slow rates generating a different amount of events in the time series. In our previous study, we used the mean absolute percentage error (MAPE) for evaluating fit results and comparing fit performance²¹. However, given a ratio of the predicted rate divided by the ground truth, the MAPE severely penalizes ratios above 1 and is not sensitive to small ratios below 10⁻¹. In contrast, RAE behaves symmetrically regarding the ratio and does not lose sensitivity at low values. Therefore, in this study, we used the RAE (Eq. 3) instead, which we found is a better representation of the error (illustrated in Fig. 1D).

Topology estimation of the underlying hidden Markov model

As detailed above, the first step in modeling the kinetics of ion channels is determining the topology of the underlying HMM. It has been shown that 2D-histograms contain all necessary information to infer the underlying HMM⁸. For the topology estimation, we simulated a training dataset encompassing all linear five-state topologies (Fig. 3A), comprising eight rates k_ij each. Figure 3A depicts the 18 topologies, grouped according to the number of open (O) and closed (C) states, respectively. Opposing topologies in both columns become identical when open and closed states are interchanged. Furthermore, the topologies are grouped according to their interconductance rank (R = 1, R = 2), which is defined as the number of independent C-O transitions in a topology⁴³. For each topology, time series were simulated using transition rates drawn from a logarithmic distribution from within the range 100 s⁻¹ to 100 ks⁻¹. Using these time series, 2D-histograms were generated to train the modified Inception-Resnet-V2 architecture³³ (Fig. 1A). First, we investigated how the accuracy varies in relation to the size of the training dataset. Training with each set was repeated three times. The obtained average accuracy is depicted in Fig. 3B. A considerable gain in accuracy with an increasing number of training samples was observed. For the largest dataset of 10⁷ 2D-histograms, an accuracy of ~44% was obtained (Fig. 3B), reflecting predominantly the inherent ambiguity of Markov modeling. Due to computational constraints, we did not simulate larger sets of training data. The resulting confusion matrices visualize the training results (Fig. 3C, D). The confusion matrix in Fig. 3C shows the recall (diagonal) and FNR, while Fig. 3D shows the precision (diagonal) and false discovery rate (FDR). The precision and FDR are especially useful for the experimentalist since, for any prediction of the NN, a probability distribution is obtained, constraining the set of likely topologies. Confusion, as indicated by the matrices, mainly occurred for certain topologies, within the same rank. As expected, opposing topologies from the left and right column (Fig. 3A) have near identical accuracy. In summary, the NNs were able to distinguish between different topologies. Nevertheless, there were substantial confusions between certain topologies, which we address in the discussion.

**Fig. 3: Topology estimation of Markov models using neural networks.**

Estimating the rates of the linear COCOC and CCCOO topologies

After identifying the most likely topology of the underlying HMM of the time series, the rates that govern the transition between its states have to be estimated. For this task, individual NNs are used for each topology. Out of the 18 topologies (Fig. 3A), two were chosen to be analyzed in further detail by determining their underlying rates. The linear COCOC and CCCOO topologies represent two variants with rank 2 and rank 1, respectively. These topologies encompass eight rates k_ij each. The results of the best and worst predicted rates according to the error score (RAE, Eq. 3) are shown in Fig. 4A–D. For the COCOC topology, the best-predicted rate k₅₄ shows a very good correlation with the ground truth (Fig. 4A). Due to the symmetry of the topology, the label array had to be rearranged by enforcing k₁₂ > k₅₄, as described in the methods section, to facilitate training. Therefore, values close to the maximum of the parameter space are less likely for k₅₄. The rate k₂₁ with the worst prediction still shows good correlation for this topology (Fig. 4C). The rates of the CCCOO topology should be more difficult to predict, since intraconductance transitions (C to C and O to O) do not produce observable events. Indeed, the result was less accurate compared to the COCOC models. Still, the C-O transition was predicted with a good correlation, albeit with several outliers at slower rates (Fig. 4B). However, the rate k₂₁ connecting the distant C states is not predicted well with a considerable number of uncorrelated data points (Fig. 4D).

**Fig. 4: Transition rates estimation of COCOC and CCCOO models using neural networks.**

To display the outcome of all predictions for the transition rates k_ij in a single graph for each topology, we introduced a different presentation for the results. Predicted rates k_ij are displayed as cumulative distributions of the error score (RAE). Rates connecting the same states have been paired and are visualized as the boundaries of the hatched areas. As a reference, the cumulative distribution of error scores that were computed with randomly drawn rates is displayed as pink dotted lines (Fig. 5A, B). With this representation, the disparity in predictive performance between the individual rates and the far better performance of the COCOC models in contrast to CCCOO becomes obvious (Fig. 5A, B). For the CCCOO topology the best-predicted rates were k₃₄ and k₄₃ (Fig. 5B), being the “gateway” states⁴³ that facilitate the only C-O connection of the topology. Importantly, the distance of the error scores to the randomly drawn rates is evident nevertheless.

**Fig. 5: Analysis of the transition rates estimation for COCOC and CCCOO models.**

Simulations of single-channel time series, as well as gating of real ion channels are stochastic processes. Even with the same topology and identical rates, the resulting 2D-histograms vary slightly. Here we address the impact of this stochastic variation on the performance of the NNs. For both topologies (Fig. 5A, B) the predictions of the transition rates were ranked according to their respective error score (RAE). Five models were selected from the 100th, 75th, 50th, 25th, and 0 percentiles. Using the ground truth for these models, 1000 2D-histograms were simulated each and the rates were predicted with the NNs. The error scores (RAE) of the predictions are visualized (Fig. 5C, D blue dots). For both topologies, the stochastic simulation process considerably influences the outcome of the NNs' prediction. While the prediction for the model at the 100th percentile is very robust, the variability increased for the other models at lower percentiles. For comparison, we analyzed the same models using the previously developed 2D-histogram fit with simulations (2D-Fit)²¹. Because of the considerable computational resources required we were only able to analyze a limited set of four time series of each model (Fig. 5C, D orange dots). Interestingly, the performance of both algorithms varies for the different models.

We anticipate, as stated above, that the smoothness of the 2D-histogram, depending on the number of events, has a fundamental impact on the quality of the predictions. To address this, we explored the performance of the NNs related to the number of detected events in the 2D-histograms. The ranked error scores (RAE) of the rates’ predictions were plotted together with the number of detected events in the respective 2D-histograms (Fig. 5E, F). Whereas for the COCOC no relevant correlation can be observed (Fig. 5E), for the CCCOO an inverse correlation between the number of detected events and the RAE score exists (Fig. 5F). Since the CCCOO topology has only a single C-O transition, it generates on average fewer events than the COCOC topology (Fig. 5E vs. 5F). Nevertheless, given the substantial variance in the number of detected events it is obviously not the only factor determining predictive performance. In order to estimate how much the predictive performance improves with increasing number of events, we trained another NN of the same architecture using dataset No. 4 (Table 1), which contains 2D-histograms whose underlying time series have a length of 100 million samples (roughly 17 min of simulated recording time). The results of the predictions on the test dataset are illustrated in Fig. 5F as the dashed blue line. As expected the error scores (RAE) improved compared to the NN for 10 million samples.

Overall, it can be stated that the rates of models underlying a linear five-state topology can be estimated using the Deep Learning approach. Predictive performance varies strongly across topologies and between rates. In addition, the rates of transitions that are farther away from C-O links are predicted worse than those that are closer. Given the same model, the stochastic variability of the simulations lead to significant variations in predictive performance. Finally, predictive performance may increase on average with the number of detected events depending on the topology.

High noise and fast gating

Given the microscopic currents of ion channels, which are in the magnitude of fA to pA, patch-clamp recordings are always endowed with a significant amount of noise. To improve the SNR, a low-pass filter has to be applied, causing a distortion of the signal by imposing an effective limitation of the bandwidth. Transitions at rates higher than the corner frequency of the filter (fast gating) are particularly affected. The distortion manifests as an apparent reduction of the current amplitude. Hence, idealization of the time series may become inaccurate. In our recent publication, we have demonstrated that rates could still be extracted on a noisy background and beyond the corner frequency of the low-pass filter²¹. The basic idea is that errors made in the idealization occur similarly in the experimental and simulated time series and cancel out to a certain degree. Capitalizing on the same principle, we now probed the performance of NNs when employed on data with high noise and fast gating. We separately examine the effect of a high-noise background and fast gating. In both cases, the analysis is structured in the same way and is divided into two parts. First, we illustrate how the quality of single predictions can be assessed, and then we address the overall performance of the NNs.

For the SNR analysis, two NNs of the regression architecture (Fig. 1A) were trained with datasets No. 2 and No. 5 (Table 1) with an SNR = 5 (low noise) and an SNR = 2 (high noise), respectively. The models in the test dataset for the SNR = 2 were ranked according to the RAE of their predictions and the top-ranked, one selected from below the 50th percentile, and the lowest ranked were chosen. For each model, one time series was simulated using the respective ground truth labels. A small excerpt from each time series is depicted in Fig. 6A–C (Ground Truth). Similarly, three-time series were simulated with the predicted rates (Prediction). For each time series, the corresponding 2D-historgram (2D_GT and 2D_Pr) is displayed accompanied by the 2D-difference-histogram (2D_Diff) and the current distributions. For all three time series, the appearance of the predicted time series, the general shape of the 2D-histogram, as well as the current amplitude distributions are almost indistinguishable from the ground truth. However, the 2D_Diff-histograms appear to be very sensitive to the quality of the prediction, mirroring the rank of the chosen model ranging from a perfect match (top-ranked, Fig. 6A), a fair match (Fig. 6B) to a considerable mismatch (lowest ranked, Fig. 6C). Of note, stochastic variations in the simulation of the ground truth, in the simulation of the predictions, and in prediction errors can contribute to an imperfect match. Furthermore, some models with different rates might exist that have very similar (almost indistinguishable) kinetics, leading to the NN predicting one of these alternative versions. This would result in a good match of the 2D_Pr and 2D_GT but substantial deviations from the ground truth. To address this issue, to quantify the goodness of the predicted model, and to obtain an uncertainty quantification of the predicted rates, we introduced the volume deviation score and the re-prediction of the rates (see methods). First, we re-simulated the predicted HMM 100 times to account for the randomness imposed by the stochastic simulation process and noise (SNR = 2). From the re-simulated time series, we yield 100 2D_Pr-histograms $({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ for comparison with the ground truth 2D_GT-histogram, using the ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ score (Eq. 6), which is displayed in the 2D_GT-histograms (Figs. 6A–C and 7A–C). Theoretically, a value of ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0$ would indicate an exact match and a value of ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=1$ non-overlapping histograms. However, due to the underlying stochastic processes, values close to zero cannot be achieved. This stochastic mismatch is model-specific⁴¹. Therefore, to estimate a reference by which to gauge the volume deviation, the volume reference deviation ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ (Eq. 7) is computed using the re-simulated 2D_Pr-histograms and is indicated in Figs. 6 and 7A–C in the 2D_Pr-histograms. Additionally, by re-predicting solutions from the 100 re-simulated 2D_Pr-histograms with the NNs, we explore the parameter space around the initial prediction and obtain the corresponding error scape of the rate constants (Fig. 6D–F). However, this only yields meaningful results if the mismatch between 2D_GT and 2D_Pr is sufficiently small, which is the case if ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ is small and does not deviate strongly from ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$.

**Fig. 6: Evaluation of the predicted transition rates of time series with a low signal-to-noise ratio.**

**Fig. 7: Transition rates estimation for COCOC models, including fast gating rates.**

To exemplify how such an analysis could look like, we iterate through the presented model predictions. Given the 2D_Diff -histogram, the model in Fig. 6A represents an almost perfect match. Indeed, the deviation between the 2D_GT-histogram and the 2D_Pr-histograms is minimal (${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.125$ vs. ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.115$), indicating that the prediction represents a very good solution. Next, the obtained distribution of rates k_ij from the re-predictions (Fig. 6D) can be inspected. The observed scatter of the k_ij is in the bounds of what can be expected from the variations caused by the stochastic nature of the simulation and the ion channel gating, as well as the low SNR = 2. Hence, this finding is compatible with the assumption that a unique model solution has been found. The model in Fig. 6B displays a slight degree of deviation between the 2D_GT and 2D_Pr histograms as indicated by the scores ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.238$ vs. ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.192$). Importantly, ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.192$ is considerably larger compared to the model in Fig. 6A, where ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.115$, suggesting that the ground truth model is stronger affected by the stochastic gating process and noise. The comparably large scatter of the re-predicted transition rates k_ij (Fig. 6E) supports this notion, and additionally indicates that the predicted set of k_ij is unlikely to be unique. Finally, the predicted model in Fig. 6C displays almost no overlap (${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.867$ vs. ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0.244$). Therefore, this model solution can be rejected.

Next, we analyze the performance of the NNs on the entire test set, using the RAE, which requires the ground truth. As in the preceding section, the results are displayed in the form of cumulative distributions (Fig. 6D–G). Each graph displays the results for both SNRs of a pair of rates that link the same states. Similarly to the previous section, the label array of the COCOC topology has been rearranged to facilitate the training of the NNs (see methods). As expected, the accuracy drops when reducing the SNR = 5 to SNR = 2, and the loss of accuracy is comparable for all k_ij.

Now, the analysis of the results for fast gating follows. The regression architecture was trained using dataset No. 6 (Table 1), encompassing a five-state COCOC topology. Specifically, the slow rates k₁₂ to k₄₃ covered the range of 100 s⁻¹ to 10 ks⁻¹, while the fast rates k₄₅ and k₅₄ range from 10 ks⁻¹ to 1 Ms⁻¹, which solely contain frequencies that are above the corner frequency of the low-pass filter (10 kHz). For for illustrating the single predictions (Fig. 7A–C) we recapitulated the structure of presentation as described above (Fig. 6) for the SNR-analysis. A detailed description of how to estimate the quality of the predicted models is given above. The fast gating events are visible as episodes of flickering between the open and closed states or, in case of the model in Fig. 7B, as an apparent deviation from the open level. Qualitatively, we obtained similar results as for the noise analysis (Fig. 6A–C). Interestingly, for the lowest ranked model, rare episodes of fast gating (Fig. 7C, red arrow) were not captured by the networks, whereas “baseline gating” looks very similar.

In terms of network performance, the resulting prediction of the trained NNs on the entire test dataset is illustrated in Fig. 7D–G. The two slow rates (k₃₂ and k₃₄), having the worst predictions overall, are displayed and demonstrate a very good correlation with the ground truth (Fig. 7D, E). For the fast rates k₄₅ and k₅₄ the correlation is equally good (Fig. 7F, G). Importantly, even with a slightly worse accuracy beyond 300 ks⁻¹ the information that those rates are very fast can still be retrieved. In conclusion, we demonstrated that NNs are capable of extracting rates on a noisy background (SNR = 2) and are not restricted to rates below the corner frequency of the low-pass filter. To demonstrate the capabilities of the algorithm, a comparison with an analytical approach is given in the Supplementary Results (Supplementary Fig. 1).

Performance of the NNs on data obtained with a patch-clamp setup

NNs can learn minuscule nuances that exist in training data, which could be important for high accuracy but could also be a confounding factor. Therefore, realistic simulations of patch-clamp data for training are critical to the performance of the NNs. By applying the 2D-histogram transformation, the dimensionality of the patch-clamp time series is reduced by rearranging a list of 1D-dwell times into a 2D array and removing the correlation of adjacent pairs. Therefore, the representation becomes more abstract and could reduce possible confounding details. Nevertheless, to minimize the mismatch between the simulated time series used for training and the experimental time series obtained via the patch-clamp setup, two significant improvements were made. First, the step response of the simulated data was optimized by recording multiple steps with the patch-clamp amplifier and computing their ensemble average, similarly to³⁰. When comparing the recorded step response with the default simulated 4-pole low-pass Bessel filter, deviations between the two are readily visible (Fig. 8A). The recorded step response has a steeper slope and fewer oscillations. We replaced the default Bessel filter response with the experimental one and used it in the simulation process. The second improvement was to match the noise spectra of the experimental time series with the simulated one. The noise was recorded with the amplifier of the patch-clamp setup, and the power spectrum was computed. Simulated noise was then generated from the power spectrum using an algorithm proposed previously³¹. We recorded the noise from three different sources: a 10 MΩ resistor (bath) a 10 GΩ resistor (patch) from a model cell, and a real cell in the cell-attached configuration (Fig. 8B). The power spectrum of all sources is distinctly different from the default artificial Gaussian white noise filtered with a 4-pole low-pass Bessel filter used in our previous study²¹. Using the power spectrum of the recorded time series, we were able to simulate noise with an indistinguishable power spectrum, including stray noise (red arrows, Fig. 8B), in the case of the real cell.

**Fig. 8: Transition rates estimation for data generated with the patch-clamp setup.**

In this section, the robustness of NNs when applied to experimentally recorded single-channel patch-clamp data is analyzed. Since the ground truth of real recordings is unknown, semi-synthetic test datasets were generated. One feasible approach for accomplishing that is presented in ref. ⁴², where the patch-clamp setup is used to emulate real single-channel recordings. Using this method, we recorded single-channel data by executing waveform protocols with ideal time series on the patch-clamp setup. With this approach, we encompassed the noise spectrum and the filter response of the setup in the time series. However, in contrast to a real experimental time series, we know the ground truth. In total, two datasets consisting of 100 time series each with a length of 1 million samples (10 s) were acquired. The amplitude of the ideal time series was adjusted to obtain an SNR ≈ 6 and SNR ≈ 8, respectively (Table 1 datasets No. 15,16). After idealization, the resulting 2D-histograms were fed into NNs. The NNs were trained on fully synthetically simulated datasets generated using different combinations of the default step response, default noise, experimental step response, and experimental noise. For the time series with an SNR ≈ 6, the NN trained on data between an SNR = 4 and SNR = 6 was generally able to generate meaningful results (Fig. 8C). Using the recorded step response and noise generated with the power spectrum combined to simulate the training data, did provide a significantly better result compared to the default, even reaching the performance of the respective simulated test dataset (orange colored line).

With an SNR = 8, the idealization of the time series for generating 2D-histograms is only negligibly affected by noise. Therefore, as expected, the type of noise did not affect the predictive performance of the NNs (Fig. 8D, trained with data between an SNR = 8 and SNR = 10). In contrast, using the experimental step response made a considerable difference.

In conclusion, it was demonstrated that the time series obtained with the patch-clamp setup can be successfully modeled. After accounting for the step response and the specific noise spectrum, the predictions did considerably improve to a level matching purely simulated data.

Discussion

In this study, we demonstrated that NNs are capable of identifying the HMM that governs the gating kinetics of an ion channel. By capitalizing on the recent advancements made in massively simulating single-channel patch-clamp time series²¹, datasets consisting of two-dimensional dwell-time histograms (2D-histograms) were simulated and used for training NNs. With this Deep Learning approach, it is possible to identify the most likely topologies and estimate transition rates in a high-noise scenario, down to SNR = 2, as well as beyond the corner frequency of the low-pass filter. In principle, the trained NNs could be employed during an ongoing single-channel patch-clamp recording to obtain the kinetic model in real-time (see Table 2 for inference time).

The state of the art for topology identification is fitting the data to multiple topologies and then selecting the one that delivers the best results according to the fit score (e.g., log-likelihood)^44,45. For analytical algorithms^{12,13,46,47,48}, the computational burden this encompasses does not pose a problem, especially considering the computational power available today and given that they can be computed efficiently. However, when examining recordings with low SNR, which is related to a small ion channel conductance and fast gating behavior, the limits of analytical algorithms are approached⁴⁹. The impact of low SNR, low-pass filtering, and consequently reduced recording bandwidth are not easily analytically resolved. That is where iterative simulation-based approaches excel^{17,18,19,20,21}, since errors made in the idealization process, such as missed events and false alarms, cancel out by comparing simulations and experimental data. Unfortunately, this powerful method comes with a major drawback: substantial computational requirements. With the Deep Learning approach, we overcame the last hurdle. The simulations used to train the NNs need to be computed only once. Thereafter, the inference time for predicting the models is negligible (Table 2).

In our tests, the topology estimation NN reached an accuracy of ~44%. At first glance, this might seem unimpressive. However, a closer look at the confusion matrix (Fig. 3C, D) reveals that statistics limit confusion to only a few topologies. More specifically, the confusion matrix in Fig. 3D illustrates the precision and FDR scores, which equal the probability that any of the topologies is the correct one given a certain prediction of the NN. A group of the most probable topologies can be selected for further analysis and the transition rates can be estimated for each of them with the corresponding transition rates estimation NNs.

A common issue of data-driven approaches is evaluating the quality of an obtained result when the ground truth is unknown, which in our case consists of the topology and the rates of the HMM. Luckily, for single-channel recordings, this can be done rather easily by simulating the predicted model and comparing it to the experimental data. In Figs. 6A–C and 7A–C, we exemplify how such a comparison could look like. By visual inspection of the kinetics, the experimenter can already reject a result if there are obvious discrepancies between the two time series. Next, the current distributions of the time series can be overlaid, indicating if a match in the state occupancies has been achieved. This is especially helpful when examining ion channels with fast gating since the resulting skew in the current distributions contains increasingly more information as the transition rates rise beyond the corner frequency of the low-pass filter⁵⁰. If this approach is extended to encompass non-stationary data, the ensemble of the re-simulated single-channel currents should match the time-dependent state occupancy ²⁰. Finally, the most powerful tool for judging the goodness of an estimated solution is the 2D-histogram, since it comprises a complete visual representation of the unknown HMM⁸. Consequently, the 2D-difference histograms (2D_Diff) proved to be very sensitive to prediction errors (Figs. 6A–C and 7A–C).

When evaluating the quality of a prediction, the concept of equivalent topologies has to be considered^43,51,52,53. Models of different topologies exist that produce the exact same kinetic and are, therefore, indistinguishable. To this date, there is no analytical method to determine all topologies in a class of equivalent topologies⁴³. By applying an analysis such as the one presented here, it is possible to end up with a solution that approximates the experimental data very well, meaning it has the same kinetic, but does not match the ground truth. If the deviation of the ground truth and predicted 2D-histograms (2D_GT and 2D_Pr) are equally small for different topologies, the models could be structurally or practically equivalent.

As a quantifiable score of the goodness of an estimated model, i.e., the match of 2D_GT and 2D_Pr, we introduce the mean volume deviation ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ (Eq. 6) and mean volume reference ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ (Eq. 7) (Figs. 6A–C and 7A–C). The scores are calculated by simulating a set of 100 time series using the predicted model and computing the 2D-histogram for each, incorporating the stochastic variability of the simulation process. A complete match ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})=0$ cannot be achieved given the stochastic behavior of HMMs and the simulated time series. Therefore, for reference, ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ is computed, which gives the model-dependent stochastic variation of 2D_Pr. In Figs. 6A–C and 7A–C, the score ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ is in agreement with the deviation observed by visual inspection of the 2D_Diff-histograms.

Furthermore, Hines and colleagues⁵⁴ demonstrated the importance of estimating the scatter of the transition rates in addition to the goodness of the estimated model. They obtained the scatter by exploring the parameter space using Bayesian inference. It does not only provide confidence intervals for the rates, in addition, it indicates possible non-identifiability of transition rates associated with non-unique models if the scatter is not strictly confined⁵⁴. Advancing this idea, a fit was introduced for Markov modeling of whole-cell patch-clamp data combined with fluorescence data⁵⁵. This leads to the question of how to implement uncertainty quantification of the rates k_ij with our Deep Learning approach.

We found a solution that exploits the inherent capabilities of the algorithm, namely simulation and model prediction. The 100 2D-histograms simulated with the initial prediction used for computing ${\bar{V}}_{{{\rm{D}}}}({{\bf{G}}},{{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ and ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ were used to make re-predictions, such that the parameter space surrounding the initial prediction is explored. This way, confidence intervals for the transition rates can be obtained (Figs. 6D–F and 7D–F). In addition, the Deep Learning approach can support the experimentalist by pointing out potential model non-identifiability if the confidence intervals for the transition rates are not confined. However, if the initial prediction is of poor quality, which can be estimated a priori using the volume deviation scores, the confidence intervals are of limited value (for an interpretation of the scores see the Supplementary Results and Supplementary Fig. 2). An alternative method to address this issue could be to compute ${\bar{V}}_{{{\rm{R}}}}({{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100})$ using segmented experimental data, forming 2D-histograms for each segment to replace the 100 2D-histograms (${{{\bf{H}}}}_{1},\ldots ,{{{\bf{H}}}}_{100}$) of the predicted solutions. The drawback of this approach is that, unlike simulated data, the experimental time series has a limited duration.

Given the data-driven nature of our approach, an increase in the accuracy of the predicted models should be observed with more information available. This can be done in two ways. One is increasing the length of the underlying experimental time series, and the other is increasing the size of the training dataset, i.e. the number and length of simulated time series. When examining the former (Fig. 5F) we found that, as expected, the error score (RAE) was reduced insinuating that longer recordings will lead to more accurate predictions. The effect of increasing the amount of training data was examined in Fig. 3B, where the gain in accuracy is illustrated. Moreover, this elucidates the importance of using simulated time series as a basis for training the NNs, given that the overall simulated time of the dataset equates to years of recording time. This brings up the question if the computational constraints regarding simulation time (Table 2) could be alleviated. One possible approach to this is presented by ref. ⁵⁶, where the authors use a generative adversarial network (GAN) to simulate a single-channel patch-clamp time series. Using a graphic accelerator, this algorithm is very efficient and could potentially generate massive amounts of data in a very short time. However, there are two reasons this method is not suitable for our approach. First, the GAN uses samples of real experimental time series as input. Hence, the ground truth of the outputted synthetic data is still unknown. Second, it is uncertain if the data simulated with the GAN is similar enough to real experimental data such that the NNs will not become confounded.

Thus, not only the amount of data is paramount but also its quality. To ensure that the simulated data is as similar as possible to real single-channel patch-clamp data, we derived an experimentally recorded step response, similarly to³⁰. Additionally, we recorded noise using the patch-clamp setup, computed the power spectrum, and used the algorithm proposed by ref. ³¹ to generate a randomized noise series. At this point, the question arose on how to test the Deep Learning approach reliably without having access to sufficient amounts of labeled experimental data. The answer was by emulating the activity of an ion channel using the patch-clamp setup. To that end, we generated semi-synthetic single-channel patch-clamp data, similarly to ref. ⁴². To demonstrate the impact of the improvements (noise spectrum and step response) to the realism of the simulated data, we generated combinations of datasets where we substituted the experimentally derived step response and noise with their analytically computed counterparts and used them to train different NNs. The importance of the improvements to the simulation routine is visualized in Fig. 8, where it is apparent that the quality of the simulated data has achieved a sufficient similarity, to the experimentally recorded data. Furthermore, we speculate that the transformation of the time series into 2D-histograms further reduced any confounding details still present in the simulated data. Unfortunately, open-channel noise²² could not be included in the semi-synthetic time series generated with the patch-clamp setup due to technical reasons. If implemented in the simulation in a future version, we assume that the additional information contained in the time series could improve model prediction.

What are the requirements for setting up a time series analysis with this approach? To obtain optimal results, noise has to be recorded and an experimental step-response has to be acquired from the recording system. The derived noise spectrum and the step-response are then used by the simulation. A selection of Markov model topologies and the boundaries of the corresponding transition rates have to be defined, and a set of times series is simulated for each topology. Finally, the NNs are trained on the 2D-histograms derived from the simulated time series. Simulation and training are computationally demanding and are best carried out on an HPC cluster. However, the simulated data can be reused for training new NNs and the number of topologies could be expanded. The trained NNs can also be reused or shared with other groups if a similar recording system is utilized. While all these steps can be performed using the software provided online (see methods) and consulting this manuscript and the Supplementary Methods, a user-friendly frontend with a graphical user interface (GUI) would be desirable in the future.

For real-time prediction, the algorithm has to be integrated into the recording software. Idealization of a time series, generation of the 2D-histogram, and inference using the NNs (Table 2) are computationally lightweight and could be running in the background of a standard office computer. Computation of the error scores according to Figs. 6A–F and 7A–F can be computed at the end of a recording without significant delay.

In some experiments, different recordings need to be integrated into a single Markov model, e.g. using voltage steps to model potential dependence or using varying compound concentrations to model the dose dependence. The variation of the transition rates under these conditions could aid in reducing the unidentifiable space of the models. In these cases, a joint fit is often used and could be implemented in a future iteration of the proposed Deep Learning approach. How can this be implemented? Importantly, the algorithm is, in principle, capable of analyzing non-stationary data^17,20. Two different approaches are proposed here. For each recorded time series with varying conditions, an individual model can be estimated initially, and transition rate dependencies can be derived post hoc by using a separate fit. The advantage is additional assumptions necessary for a joint fit, such as constraining all recordings to a single topology, do not interfere with the initial model estimations. The disadvantage could be a potential loss of modeling power. One possible way of implementing the equivalent of a joint fit would be to train an NN to receive a set of multiple 2D-histograms as input (e.g., one for each recording condition). A further constraint could be to couple specific transition rates of the HMMs used to simulate the time series such as to account for membrane potential or compound concentration. In this case, the disadvantage would be the additional effort to set up the model simulation and modify the architecture of the NNs.

In conclusion, due to its power regarding low SNR and fast gating, the ability to provide scores for the goodness of an obtained solution, and the ability to assist the experimenter with topology identification, we believe this to facilitate a rapid evaluation of single-channel patch-clamp recordings. Finally, the proposed Deep Learning approach could be extended to become applicable to data of other domains that follow Markovian behavior.

Data availability

All setfiles to generate the training data of Table 1 and the corresponding trained NNs used in the manuscript are available at Zenodo (https://doi.org/10.5281/zenodo.12750594).

Code availability

The source code in its current stage is available at Zenodo (https://doi.org/10.5281/zenodo.12750594).

References

Hodgkin, A. L. & Huxley, A. F. A quantitative description of membrane current and its application to conduction and excitation in nerve. J. Physiol. 117, 500–544 (1952).
Article PubMed PubMed Central CAS Google Scholar
Hamill, O. P., Marty, A., Neher, E., Sakmann, B. & Sigworth, F. J. Improved patch-clamp techniques for high-resolution current recording from cells and cell-free membrane patches. Pflug. Arch. 391, 85–100 (1981).
Article CAS Google Scholar
Neher, E. Single-channel recording. https://doi.org/10.1007/978-1-4419-1229-9 (Springer US, 1995).
Rabiner, L. & Juang, B. An introduction to hidden Markov models. IEEE ASSP Mag. 3, 4–16 (1986).
Article Google Scholar
Colquhoun, D. & Hawkes, A. G. Relaxation and fluctuations of membrane currents that flow through drug-operated channels. Proc. R. Soc. Lond. Ser. B Biol. Sci. 199, 231–262 (1977).
CAS Google Scholar
Colquhoun, D. & Hawkes, A. G. On the stochastic properties of single ion channels. Proc. R. Soc. Lond. Ser. B Biol. Sci. 211, 205–235 (1981).
CAS Google Scholar
Colquhoun, D. & Hawkes, A. G. On the stochastic properties of bursts of single ion channel openings and of clusters of bursts. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 300, 1–59 (1982).
Article PubMed CAS Google Scholar
Fredkin, D. R. & Rice, J. A. Aggregated markov processes and channel gating kinetics. J. Res. Natl Bur. Stand. 90, 517–520 (1985).
Article CAS Google Scholar
Qin, F. & Li, L. Model-based fitting of single-channel dwell-time distributions. Biophys. J. 87, 1657–1671 (2004).
Article PubMed PubMed Central CAS Google Scholar
Levis, R. A. & Rae, J. L. The use of quartz patch pipettes for low noise single channel recording. Biophys. J. 65, 1666–1677 (1993).
Article PubMed PubMed Central CAS Google Scholar
Parzefall, F., Wilhelm, R., Heckmann, M. & Dudel, J. Single channel currents at six microsecond resolution elicited by acetylcholine in mouse myoballs. J. Physiol. 512, 181–188 (1998).
Article PubMed PubMed Central CAS Google Scholar
Qin, F., Auerbach, A. & Sachs, F. Estimating single-channel kinetic parameters from idealized patch-clamp data containing missed events. Biophys. J. 70, 264–280 (1996).
Article PubMed PubMed Central CAS Google Scholar
Fredkin, D. R. & Rice, J. A. Maximum likelihood estimation and identification directly from single-channel recordings. Proc. Biol. Sci. 249, 125–132 (1992).
Article PubMed CAS Google Scholar
Albertsen, A. & Hansen, U. P. Estimation of kinetic rate constants from multi-channel recordings by a direct fit of the time series. Biophys. J. 67, 1393–1403 (1994).
Article PubMed PubMed Central CAS Google Scholar
Qin, F., Auerbach, A. & Sachs, F. A direct optimization approach to hidden Markov modeling for single channel kinetics. Biophys. J. 79, 1915–1927 (2000).
Article PubMed PubMed Central CAS Google Scholar
Venkataramanan, L. & Sigworth, F. J. Applying hidden markov models to the analysis of single ion channel activity. Biophys. J. 82, 1930–1942 (2002).
Article PubMed PubMed Central CAS Google Scholar
Magleby, K. L. & Weiss, D. S. Estimating kinetic parameters for single channels with simulation. A general method that resolves the missed event problem and accounts for noise. Biophys. J. 58, 1411–1426 (1990).
Article PubMed PubMed Central CAS Google Scholar
Magleby, K. L. & Weiss, D. S. Identifying kinetic gating mechanisms for ion channels by using two-dimensional distributions of simulated dwell times. Proc. Biol. Sci. 241, 220–228 (1990).
Article PubMed CAS Google Scholar
Huth, T., Schroeder, I. & Hansen, U.-P. The power of two-dimensional dwell-time analysis for model discrimination, temporal resolution, multichannel analysis and level detection. J. Membr. Biol. 214, 19–32 (2006).
Article PubMed CAS Google Scholar
Huth, T., Schmidtmayer, J., Alzheimer, C. & Hansen, U.-P. Four-mode gating model of fast inactivation of sodium channel Nav1.2a. Pflug. Arch. 457, 103–119 (2008).
Article CAS Google Scholar
Oikonomou, E. et al. 2D-dwell-time analysis with simulations of ion-channel gating using high-performance computing. Biophys. J. 122, 1287–1300 (2023).
Article PubMed PubMed Central CAS Google Scholar
Sigworth, F. J. Open channel noise. I. Noise in acetylcholine receptor currents suggests conformational fluctuations. Biophys. J. 47, 709–720 (1985).
Article PubMed PubMed Central CAS Google Scholar
Hinkley, D. V. Inference about the change-point from cumulative sum tests. Biometrika 58, 509–523 (1971).
Article Google Scholar
Schultze, R. & Draber, S. A nonlinear filter algorithm for the detection of jumps in patch-clamp data. J. Membr. Biol. 132, 41–52 (1993).
Article PubMed CAS Google Scholar
Song, L. & Magleby, K. L. Testing for microscopic reversibility in the gating of maxi K+ channels using two-dimensional dwell-time distributions. Biophys. J. 67, 91–104 (1994).
Article PubMed PubMed Central CAS Google Scholar
Rothberg, B. S. & Magleby, K. L. Testing for detailed balance (microscopic reversibility in ion channel gating. Biophys. J. 80, 3025–3026 (2001).
Colquhoun, D., Dowsland, K. A., Beato, M. & Plested, A. J. R. How to impose microscopic reversibility in complex reaction mechanisms. Biophys. J. 86, 3510–3518 (2004).
Article PubMed PubMed Central CAS Google Scholar
Harris et al. Array programming with NumPy. Nature 585, 357–362 (2020).
Article PubMed PubMed Central CAS Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article PubMed PubMed Central CAS Google Scholar
Mukhtasimova, N., DaCosta, C. J. B. & Sine, S. M. Improved resolution of single channel dwell times reveals mechanisms of binding, priming, and gating in muscle AChR. J. Gen. Physiol. 148, 43–63 (2016).
Article PubMed PubMed Central CAS Google Scholar
Carrettoni, M. & Cremonesi, O. Generation of noise time series with arbitrary power spectrum. Comput. Phys. Commun. 181, 1982–1985 (2010).
Article CAS Google Scholar
Cooley, J. W. & Tukey, J. W. An algorithm for the machine calculation of complex fourier series. Math. Comput. 19, 297 (1965).
Article Google Scholar
Szegedy, C., Ioffe, S., Vanhoucke, V. & Alemi, A. Inception-v4, inception-ResNet and the impact of residual connections on learning. CoRR abs/1602.0 (2016).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. CoRR abs/1512.0, (2015).
Szegedy, C. et al. Going deeper with convolutions. CoRR abs/1409.4, (2014).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. CoRR, (2015).
Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (eds. Teh, Y. W. & Titterington, M.) 9, 249–256 (PMLR, 2010).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. https://arxiv.org/abs/1412.6980 (2014).
Blatz, A. L. & Magleby, K. L. Quantitative description of three modes of activity of fast chloride channels from rat skeletal muscle. J. Physiol. 378, 141–174 (1986).
Article PubMed PubMed Central CAS Google Scholar
Sigworth, F. J. & Sine, S. M. Data transformations for improved display and fitting of single-channel dwell time histograms. Biophys. J. 52, 1047–1054 (1987).
Article PubMed PubMed Central CAS Google Scholar
Rothberg, B. S. & Magleby, K. L. Kinetic structure of large-conductance Ca2+-activated K+ channels suggests that the gating includes transitions through intermediate or secondary states. A mechanism for flickers. J. Gen. Physiol. 111, 751–780 (1998).
Article PubMed PubMed Central CAS Google Scholar
Celik, N. et al. Deep-Channel uses deep neural networks to detect single-molecule events from patch-clamp data. Commun. Biol. 3, 3 (2020).
Article PubMed PubMed Central Google Scholar
Bruno, W. J., Yang, J. & Pearson, J. E. Using independent open-to-closed transitions to simplify aggregated Markov models of ion channel gating kinetics. Proc. Natl Acad. Sci. 102, 6326–6331 (2005).
Article PubMed PubMed Central CAS Google Scholar
Horn, R. Statistical methods for model discrimination. Applications to gating kinetics and permeation of the acetylcholine receptor channel. Biophys. J. 51, 255–263 (1987).
Article PubMed PubMed Central CAS Google Scholar
Ball, F. G. & Sansom, M. S. Ion-channel gating mechanisms: model identification and parameter estimation from single channel recordings. Proc. R. Soc. Lond. Ser. B Biol. Sci. 236, 385–416 (1989).
CAS Google Scholar
Horn, R. & Lange, K. Estimating kinetic constants from single channel data. Biophys. J. 43, 207–223 (1983).
Article PubMed PubMed Central CAS Google Scholar
Colquhoun, D. & Hawkes, A. G. The principles of the stochastic interpretation of ion-channel mechanisms. in single-channel recording 397–482. https://doi.org/10.1007/978-1-4419-1229-9_18 (Springer US, 1995).
Qin, F., Auerbach, A. & Sachs, F. Hidden Markov modeling for single channel kinetics with filtering and correlated noise. Biophys. J. 79, 1928–1944 (2000).
Article PubMed PubMed Central CAS Google Scholar
Qin, F. Principles of single-channel kinetic analysis. Methods Mol. Biol. 1183, 371–399 (2014).
Article PubMed Google Scholar
Schröder, I., Harlfinger, P., Huth, T. & Hansen, U. P. A subsequent fit of time series and amplitude histogram of patch-clamp records reveals rate constants up to 1 per microsecond. J. Membr. Biol. 203, 83–99 (2005).
Article PubMed Google Scholar
Kienker, P. Equivalence of aggregated Markov models of ion-channel gating. Proc. R. Soc. Lond. Ser. B Biol. Sci. 236, 269–309 (1989).
CAS Google Scholar
Wagner, M., Michalek, S. & Timmer, J. Estimating transition rates in aggregated Markov models of ion channel gating with loops and with nearly equal dwell times. Proc. R. Soc. Lond. Ser. B Biol. Sci. 266, 1919–1926 (1999).
Article Google Scholar
Wagner, M. & Timmer, J. Model selection in non-nested hidden Markov models for ion channel gating. J. Theor. Biol. 208, 439–450 (2001).
Article PubMed CAS Google Scholar
Hines, K. E., Middendorf, T. R. & Aldrich, R. W. Determination of parameter identifiability in nonlinear biophysical models: a Bayesian approach. J. Gen. Physiol. 143, 401–416 (2014).
Article PubMed PubMed Central CAS Google Scholar
Münch, J. L., Paul, F., Schmauder, R. & Benndorf, K. Bayesian inference of kinetic schemes for ion channels by Kalman filtering. Elife 11, e62714 (2022).
Ball, S. T. M. et al. DeepGANnel: synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks. PLoS One 17, e0267452 (2022).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This study was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation, HU 2358/1-1) to TH. The authors gratefully acknowledge the scientific support and HPC resources provided by the Erlangen National High Performance Computing Center (NHR@FAU) of the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) under the NHR project b109dc. NHR funding is provided by federal and Bavarian state authorities. NHR@FAU hardware is partially funded by the German Research Foundation (DFG)—440719683. The present work was performed in (partial) fulfillment of the requirements for obtaining the degree Dr. rer. biol. hum.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institut für Physiologie und Pathophysiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Efthymios Oikonomou, Yannick Juli, Rajkumar Reddy Kolan, Linda Kern, Christian Alzheimer & Tobias Huth
Erlangen National High Performance Computing Center, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Thomas Gruber
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Patrick Krauss & Andreas Maier

Authors

Efthymios Oikonomou
View author publications
Search author on:PubMed Google Scholar
Yannick Juli
View author publications
Search author on:PubMed Google Scholar
Rajkumar Reddy Kolan
View author publications
Search author on:PubMed Google Scholar
Linda Kern
View author publications
Search author on:PubMed Google Scholar
Thomas Gruber
View author publications
Search author on:PubMed Google Scholar
Christian Alzheimer
View author publications
Search author on:PubMed Google Scholar
Patrick Krauss
View author publications
Search author on:PubMed Google Scholar
Andreas Maier
View author publications
Search author on:PubMed Google Scholar
Tobias Huth
View author publications
Search author on:PubMed Google Scholar

Contributions

E.O.: designed research, performed research, analyzed data, and wrote the paper. Y.J. and R.R.K.: performed research and analyzed data. L.K.: performed research. T.G.: contributed analytic tools. C.A., P.K., and A.M.: designed research. T.H.: designed research, analyzed data, and wrote the paper.

Corresponding author

Correspondence to Tobias Huth.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Chemistry thanks Marcel Goldschen-Ohm and the other, anonymous, reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oikonomou, E., Juli, Y., Kolan, R.R. et al. A deep learning approach to real-time Markov modeling of ion channel gating. Commun Chem 7, 280 (2024). https://doi.org/10.1038/s42004-024-01369-y

Download citation

Received: 16 February 2024
Accepted: 18 November 2024
Published: 30 November 2024
Version of record: 30 November 2024
DOI: https://doi.org/10.1038/s42004-024-01369-y