Abstract
PatternFusion is an ensemble framework designed to overcome the drawbacks of classical time-series analysis by combining interpretability with high performance and multi-scale temporal detection. Existing pattern recognition approaches for time-series typically operate in isolation, overlooking the collaborative potential of statistical models and deep learning architectures. To address this, PatternFusion seamlessly integrates BiLSTM networks for temporal memory, CNN modules for spatial analysis, and LightGBM for statistical interpretability through a dynamic attention-driven fusion mechanism. Our multi-criteria optimization approach improves precision, robustness to noisy input, interpretability, and computational efficiency. Extensive experiments on diverse benchmark datasets demonstrate superior F1-score, AUC, and EER, indicating robust detection of complex temporal patterns. Key innovations include adaptive attention-based fusion, multi-scale temporal feature encoding, explicit confidence quantification, and temporal post-processing. Together, these features enable high-fidelity monitoring in critical applications such as healthcare, finance, industrial systems, and environmental sensing, making PatternFusion a transformative solution for real-time, interpretable time-series pattern recognition.
Introduction
Modern analytics in healthcare, finance, industry, and smart cities heavily rely on time-series data to extract meaningful insights for decision-making1,2,3. However, accurately detecting complex patterns remains challenging due to temporal dependencies, noise, irregular sampling, and dynamic behaviors that conventional tools struggle to capture4,5,6.
Three primary methodological approaches dominate time-series analysis. Statistical methods like ARIMA7, exponential smoothing8,9, and Bayesian structural models10 handle clear temporal patterns but fail with nonlinear relationships and large datasets. Deep learning models (RNNs11, LSTMs12,13, and CNNs) improve pattern detection but act as “black boxes,” sacrificing interpretability for performance, a critical drawback in high-stakes applications14,15,16. Ensemble methods enhance robustness through model diversity17,18,19, yet traditional techniques (bagging, boosting) rely on homogeneous models, missing opportunities to combine statistical and deep learning strengths20,21. Recent hybrid frameworks integrate multiple paradigms but often use static fusion strategies, limiting adaptability to real-world temporal variations22,23,24. Advanced attention mechanisms25,26 show promise for dynamic integration but remain underexplored in heterogeneous time-series ensembles27,28. The core challenge lies in developing models that balance accuracy, robustness, interpretability, and computational efficiency across diverse temporal scales29,30. Current methods excel in specific areas but lack generalized effectiveness31,32, suffering from key limitations: (1) poor integration across learning paradigms, (2) interpretability-performance tradeoffs, (3) difficulty capturing multi-scale patterns, (4) rigid ensemble fusion, and (5) inadequate confidence quantification33,34. A critical research gap persists in dynamic integration: existing approaches operate within single paradigms or use static ensembles, failing to adaptively combine model strengths35,36. Statistical models offer interpretability but struggle with complexity, while deep learning excels in pattern extraction but lacks transparency37,38. Additionally, most methods cannot simultaneously detect short- and long-term patterns39,40, a significant hurdle in fields like healthcare and finance. The absence of adaptive fusion mechanisms further restricts ensemble flexibility41,42, and few methods provide reliable uncertainty estimates, limiting their use in risk-sensitive applications43,44. Addressing these gaps requires a novel, adaptive framework that unifies diverse methodologies while preserving interpretability and scalability.
The growing demand for reliable, interpretable, and real-time pattern recognition models has motivated hybrid architectures that integrate deep learning and decision-based frameworks. However, most existing approaches either prioritize accuracy at the cost of explainability or sacrifice computational efficiency to achieve interpretability. This imbalance is particularly critical in healthcare and biometric systems, where model transparency, latency, and resource efficiency directly affect trust and deployment feasibility. PatternFusion addresses this gap by providing a balanced hybrid framework that unifies temporal–spatial learning from BiLSTM and CNN modules with the interpretable decision boundaries of LightGBM through an adaptive attention-driven fusion mechanism. This design not only enhances accuracy but also ensures explainability and operational efficiency, making the model suitable for real-world, high-stakes applications.
In response to these challenges, we formalize the time-series pattern recognition problem as follows: Given a multivariate time-series \(\:\mathbf{X}=\{{\mathbf{x}}_{1},{\mathbf{x}}_{2},\dots\:,{\mathbf{x}}_{T}\}\), where each \(\:{\mathbf{x}}_{t}\in\:{\mathbf{R}}^{d}\) represents a \(\:d\)-dimensional observation at time step \(\:t\), our objective is to develop a mapping function \(\:f:\mathbf{X}\to\:\mathbf{Y}\) that assigns each temporal segment to a pattern class \(\:y\in\:\mathbf{Y}=\{{y}_{1},{y}_{2},\dots\:,{y}_{C}\}\), where \(\:C\) represents the number of pattern categories including a “no pattern” class. This mapping function must optimize multiple conflicting objectives simultaneously, which we formulate as a multi-criteria optimization problem:
where \(\:{\mathcal{L}}_{\text{class}}\) quantifies classification error, \(\:{\mathcal{L}}_{\text{robust}}\) measures sensitivity to perturbations \(\varepsilon\), \(\:{\mathcal{L}}_{\text{interp}}\) evaluates model interpretability, \(\:{\mathcal{L}}_{\text{comp}}\) represents computational complexity, and \(\:{\alpha\:}_{i}\) are importance weights satisfying \(\:\sum\:_{i=1}^{4}{\alpha\:}_{i}=1\). To account for temporal context, we define a contextual window function \(\:{\varPhi\:}_{w}\left({\mathbf{x}}_{t}\right)=\{{\mathbf{x}}_{t-w},\dots\:,{\mathbf{x}}_{t},\dots\:,{\mathbf{x}}_{t+w}\}\) that incorporates neighboring observations within window size \(\:w\). The pattern recognition function can then be represented as a composition of encoding, fusion, and classification operations:
where \(\:{e}_{k}\) represents the \(\:k\)-th encoder with corresponding context window size \(\:{w}_{k}\), \(\:{g}_{\text{fusion}}\) is an attention-based fusion mechanism defined as:
with attention weights \(\:{\alpha\:}_{k}\left(\mathbf{X}\right)\) computed as:
where \(\:{s}_{k}\left(\mathbf{X}\right)\) represents a learnable scoring function that evaluates the relevance of the \(\:k\)-th encoder for the given input. The confidence in the prediction is quantified through a calibrated probability measure:
This formulation explicitly addresses the multi-faceted challenges in time-series pattern recognition through a unified mathematical framework that integrates multi-scale temporal context, heterogeneous model fusion, adaptive attention allocation, and confidence calibration.
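To make the formulation concrete, the following minimal Python sketch (an illustration, not the authors' released implementation) shows how softmax-normalized relevance scores produce the attention weights \(\:{\alpha\:}_{k}\left(\mathbf{X}\right)\), a fused representation, and a maximum-probability confidence estimate; the encoder outputs, score values, and classifier function are hypothetical placeholders.

```python
# Illustrative sketch of the fusion formulation: encoder outputs are combined with
# softmax-normalized relevance scores alpha_k(X), and prediction confidence is taken
# as the maximum calibrated class probability. s_k and e_k are learned in the full model.
import numpy as np

def softmax(z):
    z = z - z.max()
    return np.exp(z) / np.exp(z).sum()

def fuse_and_classify(encoder_outputs, relevance_scores, classifier_logits_fn):
    """encoder_outputs: list of K fixed-size representations of X.
    relevance_scores: length-K array of s_k(X) values (placeholders here).
    classifier_logits_fn: maps the fused representation to C class logits."""
    alphas = softmax(np.asarray(relevance_scores, dtype=float))      # alpha_k(X)
    fused = sum(a * h for a, h in zip(alphas, encoder_outputs))      # weighted fusion
    probs = softmax(classifier_logits_fn(fused))                     # class probabilities
    y_hat = int(np.argmax(probs))
    confidence = float(probs.max())                                  # confidence proxy
    return y_hat, confidence, alphas
```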
The primary objectives of this research are:

- To develop a hybrid ensemble framework that dynamically integrates the complementary strengths of statistical, recurrent, and convolutional approaches to time-series pattern recognition.
- To implement an attention-driven fusion mechanism that adaptively weights model contributions based on input characteristics and pattern types.
- To incorporate explicit confidence quantification for enhanced reliability in critical decision-making contexts.
- To design a learnable temporal feature encoding module that captures patterns across multiple temporal scales.
- To evaluate the proposed framework across diverse benchmark datasets to demonstrate its generalizability, robustness, and comparative advantages over existing approaches.
The significance of this research extends across multiple dimensions of time-series analytics and machine learning. By addressing the persistent integration barrier between statistical and deep learning approaches, PatternFusion establishes a new paradigm for hybrid model architectures that can simultaneously achieve high accuracy and interpretability—a critical advancement for applications in healthcare45,46, finance47,48, industrial monitoring49,50, and environmental sensing51,52,53. The attention-driven fusion mechanism represents a significant innovation in ensemble learning, enabling dynamic adaptation to diverse pattern characteristics and providing valuable insights into model decision processes. Furthermore, the incorporation of explicit confidence quantification addresses a critical requirement in risk-sensitive applications, where uncertainty estimation is essential for responsible deployment of pattern recognition systems54,55,56. From a methodological perspective, PatternFusion bridges multiple research communities—statistical time-series analysis, deep learning, and ensemble methods—fostering cross-disciplinary integration and knowledge transfer.
The primary contributions of this paper are as follows:

- We introduce PatternFusion, a novel hybrid ensemble framework for time-series pattern recognition that integrates statistical, recurrent, and convolutional learning paradigms through an attention-driven fusion mechanism.
- We develop a temporal feature encoding module that combines handcrafted statistical features with learned representations to capture patterns across multiple time scales.
- We implement an adaptive attention mechanism that dynamically weights model contributions based on input characteristics, improving both accuracy and interpretability.
- We incorporate a confidence scoring framework that quantifies prediction reliability and supports threshold-based decision-making.
- We provide temporal post-processing approaches that stabilize predictions over time and reduce false positives under dynamic conditions.
- We conduct a comprehensive evaluation across diverse benchmark datasets, demonstrating consistent performance improvements over state-of-the-art baselines and providing insights into the effectiveness of different model components through detailed ablation studies.
While ensemble frameworks combining CNN and LSTM have been explored previously, PatternFusion advances this paradigm through a dynamic attention-guided fusion layer that adaptively balances deep temporal-spatial representations with interpretable gradient-boosted decisions. Unlike static hybrid ensembles, PatternFusion performs context-aware weighting between BiLSTM, CNN, and LightGBM outputs, allowing it to generalize across diverse time-series modalities. This adaptive fusion, combined with confidence-aware decision calibration, provides both superior accuracy and interpretability, thus establishing genuine methodological novelty beyond earlier fixed-ensemble designs.
The remainder of this paper is organized as follows: Sect. 2 presents a systematic review of related work in time-series pattern recognition, examining statistical approaches, deep learning methods, ensemble techniques, and emerging hybrid models. We analyze their relative strengths and limitations, identifying key research gaps that motivate our work. Section 3 details the methodology behind PatternFusion, including dataset preparation, model architecture, training procedures, and evaluation metrics. We provide a comprehensive description of the proposed hybrid ensemble framework, with particular emphasis on the temporal feature encoding, attention-driven fusion, and confidence scoring components. Section 4 presents experimental results and comparative analyses across multiple benchmark datasets, including industrial sensors, physiological signals, and synthetic pattern streams. We demonstrate PatternFusion’s superior performance in terms of accuracy, F1-score, robustness to noise, and interpretability, supported by visualizations of attention weights and confidence scores. In addition, we conduct ablation studies to evaluate the impact of each model component on final performance separately. Section 5 concludes with a summary of important results, describes limitations of the current methodology, and outlines new directions for future study, including possible applications in streaming contexts, domain transfer tasks, and improvements with explainable AI.
Literature review
This section reviews the current state of time-series pattern recognition techniques, analyzing their strengths and weaknesses and their potential use in hybrid ensemble systems. We divide the research into four main groups: traditional statistical approaches, deep learning networks, ensemble methodologies, and hybrid methods with attention mechanisms.
Statistical approaches to time-series analysis
Statistical methods have long been the foundation of time-series analysis. Box and Jenkins' ARIMA models4 remain an industry standard for linear temporal relationships, while De Livera et al.'s TBATS10 introduced trigonometric seasonality for improved forecasting. Probabilistic approaches enhance uncertainty quantification. Scott and Varian's Bayesian structural models43 integrate prior knowledge with data, providing credible prediction intervals. Hidden Markov Models (HMMs)41 detect regime shifts by modeling latent state transitions, and particle-filtered state-space models13 handle nonlinear dynamics and non-Gaussian noise. Modern decomposition techniques25 isolate trend, seasonal, and residual components, improving interpretability. However, statistical methods struggle with complex nonlinear patterns and high-dimensional data6. Their reliance on strict assumptions (stationarity, predefined distributions)24 and domain-specific feature engineering23,42 limits real-world applicability.
Deep learning for temporal pattern recognition
Deep learning has revolutionized time-series analysis by enabling automatic feature extraction from raw data. Long Short-Term Memory (LSTM) networks20 excel at capturing long-range dependencies through their gated memory, while Bidirectional LSTMs19 leverage past and future context. Gated Recurrent Units (GRUs)9 offer a simpler yet effective alternative. Convolutional Neural Networks (CNNs), adapted for temporal data, have also proven powerful. WaveNet44 introduced dilated convolutions for efficient long-range modeling, while Temporal CNNs (TCNs)3 outperformed RNNs in sequence tasks. Studies47 show CNNs often surpass recurrent models in time-series classification. Attention mechanisms further enhanced temporal modeling. Transformers45 enable global dependency capture, with specialized variants like Informer54 and Autoformer49 optimizing long-sequence processing. Spatial-spectral transformers46 extend these benefits to multi-dimensional data.
Despite their strengths, deep learning models face challenges: high computational costs, overfitting on small datasets, and lack of interpretability27,38. Their “black-box” nature limits adoption in domains requiring transparent decision-making.
Ensemble methods in time-series analysis
Ensemble techniques have significantly improved pattern recognition accuracy and robustness. Traditional approaches like bagging5 and boosting17 have been adapted for time-series analysis, with innovations like the Time Series Forest algorithm12 combining feature extraction with ensemble learning. Modern gradient boosting methods (e.g., XGBoost) are also widely applied. Recent research focuses on temporal-specific ensembles. Temporal Ensembling29 uses consistency regularization for semi-supervised learning, while multi-view approaches21 enhance diversity through different time-series representations. Hybrid methods have emerged as particularly effective, such as ROCKET11, which pairs random convolutional kernels with linear classifiers, and elastic distance measure ensembles35 that improve classification accuracy. Adaptive models like Zhang et al.'s51 dynamically adjust weights based on prediction confidence.
However, current ensemble methods face limitations. Most rely on homogeneous models or basic voting mechanisms, missing opportunities to combine different paradigms’ strengths36,56. Additionally, they often sacrifice interpretability for accuracy22,33, restricting their use in decision-critical applications requiring explainability.
Hybrid models and attention mechanisms
Hybrid models combining different learning paradigms show significant promise in time-series pattern recognition. Studies like Li et al.33 (LSTM + SVR) and Zhu et al.56 (CNN + LSTM) demonstrate improved performance by merging deep learning’s temporal modeling with statistical methods’ generalization. However, most current implementations use static integration, limiting their adaptability to varying pattern characteristics. Attention mechanisms have revolutionized temporal modeling by enabling dynamic feature selection and integration. Key developments include:
- Vaswani et al.'s self-attention45 for arbitrary time-point relationships.
- Qin et al.'s dual-stage attention RNNs40 for improved multivariate forecasting.
- Specialized variants like TapNet52 and pattern-adaptation networks32.
- Applications in finance (spatio-temporal attention8) and multivariate forecasting (encoder-decoder frameworks14).

The integration of attention with ensemble learning offers particular potential to:

- Balance accuracy and interpretability.
- Dynamically weight model contributions.
However, research on attention-driven fusion in heterogeneous ensembles remains limited, representing a key opportunity for future work (Table 1).
Our review identifies critical challenges in current time-series analysis methods:

1. Integration Barriers.
2. Interpretability-Performance Tradeoff.
3. Multi-Scale Temporal Challenges.
4. Adaptive Fusion Limitations.
5. Confidence Quantification Gap.

Our Solution: PatternFusion:

- Hybrid ensemble architecture.
- Attention-based dynamic fusion.
- Balances accuracy, interpretability and adaptability.
- Integrates statistical, recurrent and convolutional approaches.
- Learnable attention mechanism for optimal model combination.
Methodology
The PatternFusion model is built on the fundamental principle of combining diverse learning approaches to improve the identification of subtle patterns in time-series data. PatternFusion tackles temporal dependencies, noise, and irregular sampling through a hybrid ensemble structure that brings together the strengths of statistical and deep learning models. This section explains the basic architecture of the proposed system, covering dataset sourcing, pre-processing, ensemble methods, training protocols, and evaluation strategies. We aim to develop a scalable and adaptable model for the accurate recognition, classification, and generalization of patterns across a wide variety of temporal datasets.
Dataset collection
To quantify the effectiveness of the PatternFusion model, several time-series datasets were obtained from public repositories and supplemented with synthetically generated data. Real-world datasets were gathered from benchmark repositories such as the UCI Machine Learning Repository and the Numenta Anomaly Benchmark (NAB), including examples from industrial sensors. The diversity of these datasets, spanning a wide variety of temporal granularities, noise structures, and event properties, makes them suitable for comprehensive validation of general pattern recognition techniques. Additionally, artificial datasets were generated using strategic waveform synthesis and noise injection to mimic rare and overlapping patterns, supporting the design and testing of the model.
Dataset description
To verify the flexibility and robustness of the PatternFusion model, this research employs both real and synthetically generated time-series data sources. Real-world datasets were drawn from the UCI Machine Learning Repository and the Numenta Anomaly Benchmark (NAB), including longitudinal data from industrial equipment, electricity consumption in smart grids, and medical signals. These datasets differ in sampling rate, pattern length, and noise intensity, allowing rigorous evaluation of pattern recognition outcomes under a broad range of conditions. In addition, synthetic datasets were created by injecting sinusoidal, square, and composite waveforms into streams of white noise, giving precise control over rare and overlapping pattern types for experimental validation. Each dataset was standardized and segmented with a sliding-window approach to guarantee temporal consistency and suitability as model training input. The multiple origins of the data expose PatternFusion to varied statistical characteristics and temporal patterns, further confirming its robustness across environments.
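The following sketch illustrates, under assumed parameters (window length 60 from the reported settings, an illustrative stride, and arbitrary burst counts), how such synthetic streams and sliding-window segments could be produced; it is a simplified stand-in for the actual data pipeline.

```python
# Minimal sketch of synthetic-data generation and sliding-window segmentation;
# waveform parameters and stride are illustrative, not the paper's exact values.
import numpy as np

def make_synthetic_stream(length=10_000, noise_std=1.0, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.normal(0.0, noise_std, size=length)           # white-noise background
    labels = np.zeros(length, dtype=int)
    for _ in range(20):                                    # inject sinusoidal bursts
        start = rng.integers(0, length - 200)
        t = np.arange(200)
        x[start:start + 200] += np.sin(2 * np.pi * t / 50)
        labels[start:start + 200] = 1
    for _ in range(20):                                    # inject square-wave bursts
        start = rng.integers(0, length - 200)
        x[start:start + 200] += np.sign(np.sin(2 * np.pi * np.arange(200) / 40))
        labels[start:start + 200] = 2
    return x, labels

def sliding_windows(x, labels, window=60, stride=10):
    xs, ys = [], []
    for start in range(0, len(x) - window + 1, stride):
        seg = x[start:start + window]
        xs.append((seg - seg.min()) / (np.ptp(seg) + 1e-8))   # per-window min-max scaling
        ys.append(np.bincount(labels[start:start + window]).argmax())  # majority label
    return np.stack(xs), np.array(ys)
```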
Proposed model: PatternFusion
PatternFusion is a cohesive architecture that has been designed for robust time-series pattern recognition, especially in non-stationary, noisy and multivariate signal domains. The proposed approach combines statistical feature extraction, sequential modeling and the decision fusion process augmented by attention. The central principle is the use of the distinctive strengths of various model classes while, at the same time, providing interpretability and flexibility through late fusion and confidence estimation mechanisms. The hybrid ensemble of PatternFusion combines temporal encoding, deep learning, and attention-based fusion to provide accurate and interpretable predictions for time-series data, as follows (see Fig. 1):
The PatternFusion pipeline is composed of six key stages: (1) temporal feature encoding, (2) base learner ensemble, (3) attention-driven fusion, (4) prediction confidence scoring, (5) loss optimization, and (6) temporal post-processing.
Temporal feature encoding
Given a raw time-series input \(\:X=\{{x}_{1},{x}_{2},...,{x}_{T}\}\), with \(\:{x}_{t}\in\:{\mathbf{R}}^{d}\), we apply learnable and handcrafted transformations to enhance feature representations:
where \(\:\varphi\:\left(\cdot\:\right)\) is a 1D convolutional encoder and \(\:\psi\:\left(\cdot\:\right)\) represents statistical operations such as mean, variance, entropy, and spectral power.
To capture temporal position explicitly, we integrate sinusoidal encoding:
and augment the encoded sequence as \(\:{\widehat{X}}_{t}\leftarrow\:{\widehat{X}}_{t}+PE\left(t\right)\).
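A minimal PyTorch sketch of this encoding step is given below; the channel width, kernel size, and the even embedding dimension are assumptions for illustration rather than the paper's exact configuration.

```python
# Illustrative sketch of the temporal feature encoder: a 1D convolutional branch (phi),
# handcrafted statistics (psi), and additive sinusoidal positional encoding.
import torch
import torch.nn as nn

def sinusoidal_pe(seq_len: int, dim: int) -> torch.Tensor:
    # dim assumed even for this sketch
    pos = torch.arange(seq_len).unsqueeze(1).float()
    i = torch.arange(0, dim, 2).float()
    angles = pos / torch.pow(10000.0, i / dim)
    pe = torch.zeros(seq_len, dim)
    pe[:, 0::2] = torch.sin(angles)
    pe[:, 1::2] = torch.cos(angles)
    return pe                                              # shape (T, dim)

class TemporalEncoder(nn.Module):
    def __init__(self, in_dim: int, hidden: int = 64, kernel: int = 5):
        super().__init__()
        self.phi = nn.Conv1d(in_dim, hidden, kernel, padding=kernel // 2)

    def forward(self, x):                                  # x: (batch, T, in_dim)
        conv = self.phi(x.transpose(1, 2)).transpose(1, 2)             # learned features
        conv = conv + sinusoidal_pe(conv.size(1), conv.size(2)).to(conv.device)
        stats = torch.cat([x.mean(dim=1), x.var(dim=1)], dim=-1)       # handcrafted psi(X)
        return conv, stats                                 # sequence features + summaries
```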
Base learner ensemble
Three base models process the enhanced representation in parallel:
where \(\:\varPsi\:\left(X\right)\) extracts global statistical summaries over time.
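As an illustration of the three parallel base learners, the sketch below instantiates a BiLSTM head and a 1D-CNN head in PyTorch and notes the LightGBM classifier over the statistical summaries \(\:\varPsi\:\left(X\right)\); hidden sizes and the number of classes are assumed values, while the LightGBM settings follow the configuration reported later in the paper.

```python
# Sketch of the base learner ensemble under illustrative hyperparameters:
# a BiLSTM over the encoded sequence and a 1D CNN over the same sequence;
# LightGBM is fit separately on the global statistical summaries Psi(X).
import torch
import torch.nn as nn

class BiLSTMHead(nn.Module):
    def __init__(self, in_dim, hidden=64, n_classes=4):
        super().__init__()
        self.rnn = nn.LSTM(in_dim, hidden, batch_first=True, bidirectional=True)
        self.out = nn.Linear(2 * hidden, n_classes)
    def forward(self, x):                        # x: (batch, T, in_dim)
        h, _ = self.rnn(x)
        return self.out(h[:, -1])                # logits from the last timestep

class CNNHead(nn.Module):
    def __init__(self, in_dim, channels=64, n_classes=4):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(in_dim, channels, 5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1))
        self.out = nn.Linear(channels, n_classes)
    def forward(self, x):
        z = self.conv(x.transpose(1, 2)).squeeze(-1)
        return self.out(z)

# LightGBM head (fit separately on Psi(X) features), using the reported settings:
# lgb.LGBMClassifier(n_estimators=500, learning_rate=0.05, max_depth=7, random_state=2023)
```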
Attention-driven fusion
To dynamically weigh the contribution of each base learner, we apply an attention mechanism:
where \(\:\mathbf{w}\) is a trainable attention vector and \(\:H\) is the fused representation passed to the classifier.
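The attention-driven fusion can be sketched as follows, assuming the three base-learner representations have been projected to a common dimension; this is an illustrative implementation of the weighting described above, not the exact released code.

```python
# Sketch of the attention-driven fusion layer: a trainable vector w scores each base
# learner's representation, the scores are softmax-normalized, and the weighted sum
# forms the fused representation H passed to the classifier.
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.w = nn.Parameter(torch.randn(dim))        # trainable attention vector

    def forward(self, reps):                           # reps: (batch, K, dim), K base learners
        scores = reps @ self.w                         # (batch, K) relevance scores
        alphas = torch.softmax(scores, dim=1)          # attention weights per learner
        fused = (alphas.unsqueeze(-1) * reps).sum(1)   # fused representation H
        return fused, alphas
```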
Prediction and confidence scoring
The final output is obtained via a fully connected prediction head:
where \(\:C\) is the number of pattern classes. The confidence score is computed as:
Loss function and regularization
We use the categorical cross-entropy loss with L2 regularization:
where \(\:\lambda\:\) is a regularization coefficient and \(\:{\theta\:}_{k}\) refers to all trainable parameters.
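A compact sketch of the prediction head, confidence score, and regularized loss is shown below; the \(\:\lambda\:\) value of 1e-4 matches the regularization coefficient reported in the training setup, while everything else is illustrative.

```python
# Sketch of the prediction head, confidence score, and regularized loss:
# categorical cross-entropy plus an explicit L2 penalty over trainable parameters.
import torch
import torch.nn as nn
import torch.nn.functional as F

def prediction_and_confidence(head: nn.Linear, fused: torch.Tensor):
    logits = head(fused)                               # (batch, C) class logits
    probs = F.softmax(logits, dim=-1)
    confidence, y_hat = probs.max(dim=-1)              # max class probability as confidence
    return logits, y_hat, confidence

def regularized_loss(logits, targets, model: nn.Module, lam: float = 1e-4):
    ce = F.cross_entropy(logits, targets)              # categorical cross-entropy
    l2 = sum((p ** 2).sum() for p in model.parameters())
    return ce + lam * l2
```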
Temporal post-processing
To stabilize noisy predictions, we apply a moving average filter to the output logits:
Pattern decision thresholding
For applications requiring binary detection (pattern vs. no-pattern), we threshold the confidence:
where \(\:\tau\:\in\:\left(0,1\right)\) is a detection threshold hyperparameter.
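The post-processing and thresholding steps can be sketched as follows; the smoothing window and the threshold \(\:\tau\:\) shown here are illustrative placeholders rather than tuned values.

```python
# Sketch of temporal post-processing and decision thresholding: a moving average smooths
# the per-step confidence/logit stream, and a threshold tau converts smoothed confidence
# into binary pattern/no-pattern decisions.
import numpy as np

def moving_average(values: np.ndarray, window: int = 5) -> np.ndarray:
    kernel = np.ones(window) / window
    return np.convolve(values, kernel, mode="same")    # smoothed logits/confidence

def threshold_decision(confidence: np.ndarray, tau: float = 0.8) -> np.ndarray:
    # tau is an illustrative value; it is a tunable hyperparameter in the framework
    return (confidence >= tau).astype(int)             # 1 = pattern detected, 0 = none
```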
In summary, PatternFusion intelligently combines temporal, statistical, and ensemble learning principles using a modular and extensible design. The attention-driven fusion enables adaptive weighting of base models, while the temporal encoding and post-filtering modules ensure contextual consistency and robustness to time-series noise and variance.
Novelty and differentiation analysis
The proposed PatternFusion framework introduces an advanced hybrid ensemble framework that dynamically integrates BiLSTM, CNN, and LightGBM through an attention-based fusion mechanism, overcoming key limitations of traditional multimodal fusion approaches. Unlike early fusion (which combines raw features without considering compatibility) or static score-level fusion (which fails to adapt to input variations), PatternFusion employs an adaptive attention system to weight each model’s contribution based on input characteristics, enhancing both interpretability and robustness. The framework uniquely incorporates multi-scale temporal feature encoding, enabling it to capture everything from fleeting biometric patterns to long-range dependencies—critical capabilities often missed by conventional methods. By selectively amplifying relevant features through its attention mechanism, PatternFusion improves decision reliability while maintaining a balanced integration of statistical and deep learning strengths, making it particularly effective for heterogeneous time-series data analysis.
Unlike prior hybrid models that merely stack CNN, LSTM, and tree-based components in a fixed fusion scheme, PatternFusion introduces a dynamic attention-driven fusion mechanism that adaptively adjusts the contribution of each base learner according to input characteristics. Moreover, the inclusion of multi-scale temporal encoding and explicit confidence quantification provides a unique balance between interpretability, robustness, and real-time applicability, features not jointly addressed in previous CNN–LSTM–tree ensembles. This makes PatternFusion not just a combination of models, but an adaptive integration framework that optimizes across performance, interpretability, and computational efficiency dimensions. Prior CNN–LSTM–tree hybrids mostly use static fusion. For wind power forecasting, Ren et al. proposed a CNN–LSTM–LightGBM model with attention but fused components in a fixed pipeline rather than learning context-adaptive weights57. Gao introduced an ICEEMDAN-CNN-LSTM-LightGBM stack for multi-scale wind prediction, again without input-dependent balancing of base learners58. For stocks, Shi et al. combined attention-based CNN-LSTM with XGBoost fine-tuning, yet the tree stage acts as a post-hoc regressor rather than a jointly optimized, confidence-aware fusion59. PatternFusion differs by using a trainable attention-driven fusion with per-instance confidence calibration, enabling dynamic contribution of BiLSTM, CNN, and LightGBM, not a static or sequential average.
As shown in Table 2, PatternFusion performs strongly on indicators such as Accuracy, F1-Score, and AUC while maintaining interpretability and computational efficiency. The flexibility of the PatternFusion approach enables it to outperform classic mechanisms limited to rigid, static ensemble structures. The inclusion of LightGBM further enhances interpretability by providing detailed feature importance, making the framework suitable for critical biometric authentication applications.
To ensure full reproducibility, all experiments were executed using fixed random seeds (42, 123, and 2023 for different runs) for PyTorch, NumPy, and LightGBM. Dataset partitions followed a 70% / 15% / 15% split for training, validation, and testing, respectively, with samples shuffled using stratified sampling based on subject ID to maintain balanced class distribution. No cross-validation was used; a fixed split was maintained for all benchmarks. Preprocessing involved min–max normalization (scaling to [0, 1]), outlier clipping at ± 3 σ, and temporal windowing with a segment length of 60 (as determined in Table 25). Augmentations included Gaussian noise (σ = 0.01) and random temporal shifts (± 5%). All LightGBM models were trained with a fixed seed = 2023 to ensure deterministic boosting results. All model scripts, preprocessing utilities, and trained checkpoints will be released on GitHub upon manuscript acceptance (a temporary link is withheld for double-blind review). The authors can provide code and configuration files upon reasonable request.
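A hedged sketch of this reproducibility setup is given below; stratification here uses the class label as a simplification of the subject-ID-based stratified sampling described above.

```python
# Reproducibility sketch following the reported settings (seeds 42/123/2023,
# 70/15/15 split); the stratification key is simplified to the class label.
import random
import numpy as np
import torch
from sklearn.model_selection import train_test_split

def set_seed(seed: int = 2023):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)

def stratified_70_15_15(X, y, seed: int = 42):
    X_train, X_tmp, y_train, y_tmp = train_test_split(
        X, y, test_size=0.30, stratify=y, random_state=seed)
    X_val, X_test, y_val, y_test = train_test_split(
        X_tmp, y_tmp, test_size=0.50, stratify=y_tmp, random_state=seed)
    return (X_train, y_train), (X_val, y_val), (X_test, y_test)
```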
Theoretical justification and model choice
The design of PatternFusion is crafted to maximize the synergistic advantages of three key learning modules (BiLSTM, CNN, and LightGBM) through the attention-driven fusion mechanism. Each model is selected for its theoretical strengths and its consistent performance in time-series processing and pattern recognition.
1. BiLSTM for Temporal Memory Representation.
BiLSTM is chosen for processing time-series information because of its ability to model long-range dependencies. Conventional LSTMs process the data in a single direction, which prevents them from integrating information from past and future timesteps simultaneously. BiLSTM addresses this by running two LSTM networks that read the sequence in the forward and backward directions. These two opposing paths increase the model's ability to identify complex temporal patterns and understand dependencies within the sequence.
Mathematical Representation:
The hidden states \(\:{h}_{t}\) at each time step \(\:t\) are computed as follows:
$$\:\overrightarrow{{h}_{t}}=LSTM\left({x}_{t},\overrightarrow{{h}_{t-1}}\right),\:\:\:\:\overleftarrow{{h}_{t}}=LSTM\left({x}_{t},\overleftarrow{{h}_{t+1}}\right),\:\:\:\:{h}_{t}=\left[\overrightarrow{{h}_{t}},\overleftarrow{{h}_{t}}\right]\:\:$$(18)

2. CNN for Spatial Feature Extraction.
CNNs are adopted in PatternFusion because of their proven success in identifying local dependencies. Although convolutional networks are most commonly used for image tasks, they can be applied to time-series data, making it easier to extract localized patterns such as trends, spikes, and rapid changes. Convolutional filtering over temporal sequences allows the model to discover robust local features that persist even under slight perturbations, which greatly increases its noise tolerance.
Mathematical Representation:
A 1D convolution operation over a time-series input \(\:X\) is represented as:
$$\:y\left(t\right)=f\left(\sum\:_{k=0}^{K-1}{w}_{k}\cdot\:x\left(t+k\right)+b\right)\:\:$$(19)

where:
- \(\:{w}_{k}\) represents the convolution filter weights,
- \(\:x(t+k)\) is the time-series segment,
- \(\:b\) is the bias term,
- \(\:f\) is the activation function (ReLU in our case).
The application of multiple filters enables multi-scale feature extraction, which is crucial for capturing patterns that vary in length and amplitude.
3. LightGBM for Statistical Interpretability.
To balance the opacity of deep neural networks with a classical statistical approach, LightGBM (Light Gradient Boosting Machine) is included. LightGBM's histogram-based learning enhances interpretability and efficiency, allowing fast feature handling with a reduced memory footprint. Its leaf-wise tree-growing strategy converges faster and processes large datasets more effectively than classic tree-based methods.
Mathematical Representation:
The LightGBM loss function for boosting is represented as:
$$\:L\left(\theta\:\right)=\sum\:_{i=1}^{n}\ell\left({y}_{i},f\left({x}_{i},\theta\:\right)\right)+\lambda\:\sum\:_{j=1}^{m}{\theta\:}_{j}^{2}\:\:$$(20)

where:
- \(\:\ell({y}_{i},f({x}_{i},\theta\:))\) is the loss function (mean squared error for regression, log loss for classification),
- \(\:\lambda\:\) is the regularization parameter,
- \(\:\theta\:\) represents the model parameters.
LightGBM also provides feature importance scores that allow PatternFusion to prioritize the most influential features, improving interpretability.
4. Attention-Driven Fusion Mechanism.
One of the main theoretical innovations in PatternFusion is its attention-driven fusion mechanism. Traditional fusion strategies (early, late, score-level) statically combine features or decisions without dynamically adapting to temporal or spatial context. PatternFusion overcomes this by introducing a learnable attention mechanism that selectively weights each model's contribution based on the contextual significance of the input features; an illustrative code sketch of this mechanism follows the list below.
Mathematical Representation:
The attention weight \(\:{\alpha\:}_{i}\) for each model is computed as:
$$\:{\alpha\:}_{i}=\frac{\text{exp}\left({e}_{i}\right)}{\sum\:_{j=1}^{n}\text{exp}\left({e}_{j}\right)}\:\:$$(21)

where:
- \(\:{\alpha\:}_{i}\): Attention weight for model component \(\:i\).
- \(\:{e}_{i}\): Relevance score calculated using the attention vector.
The relevance score is computed as:
$$\:{e}_{i}={v}^{T}\text{tanh}\left({W}_{h}{h}_{i}+{W}_{s}s\right)\:\:$$(22)

where:
- \(\:{W}_{h}\) and \(\:{W}_{s}\): Trainable parameters.
- \(\:{v}^{T}\): Attention vector.
- \(\:{h}_{i}\): Hidden state of the model.
- \(\:s\): Context vector representing the sequence information.
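The sketch below is an illustrative PyTorch implementation of Eqs. (21)-(22), in which relevance scores \(\:{e}_{i}={v}^{T}\text{tanh}({W}_{h}{h}_{i}+{W}_{s}s)\) are softmax-normalized into attention weights over the base-learner representations; the hidden and attention dimensions are assumptions for illustration.

```python
# Illustrative implementation of Eqs. (21)-(22): relevance scores are computed from
# each component's hidden state h_i and a shared context vector s, then softmax-
# normalized into attention weights used for the weighted fusion.
import torch
import torch.nn as nn

class RelevanceAttention(nn.Module):
    def __init__(self, hidden_dim: int, context_dim: int, attn_dim: int = 32):
        super().__init__()
        self.W_h = nn.Linear(hidden_dim, attn_dim, bias=False)
        self.W_s = nn.Linear(context_dim, attn_dim, bias=False)
        self.v = nn.Parameter(torch.randn(attn_dim))

    def forward(self, h, s):          # h: (batch, K, hidden_dim), s: (batch, context_dim)
        e = torch.tanh(self.W_h(h) + self.W_s(s).unsqueeze(1)) @ self.v   # (batch, K), Eq. (22)
        alpha = torch.softmax(e, dim=1)                                   # Eq. (21)
        fused = (alpha.unsqueeze(-1) * h).sum(dim=1)                      # weighted fusion
        return fused, alpha
```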
Table 3 provides a comparative analysis of different fusion approaches for multimodal biometrics, highlighting PatternFusion’s superiority. Traditional methods - Early Fusion (combining raw inputs), Late Fusion (merging model outputs), and Score-Level Fusion (integrating final decisions) - demonstrate limited adaptability due to their static architectures. While Attention Fusion introduces dynamic feature weighting through transformers, it incurs high computational costs. In contrast, PatternFusion’s innovative attention-driven mechanism dynamically adjusts contributions from BiLSTM, CNN, and LightGBM components in real-time. This approach achieves state-of-the-art performance (96.4% accuracy, 94.9% F1-score, 98.1% AUC) while maintaining computational efficiency and interpretability - addressing critical limitations of existing fusion strategies for biometric applications.
Evaluation metrics
To assess the effectiveness of the proposed PatternFusion model, several standard evaluation metrics were employed, particularly suitable for both binary and multi-class classification tasks in time-series domains. These include Accuracy, Precision, Recall, F1-Score, and Area Under the Receiver Operating Characteristic Curve (AUC-ROC). For anomaly and rare pattern detection scenarios, the Matthews Correlation Coefficient (MCC) and Cohen’s Kappa were additionally computed to provide more balanced insight under class imbalance conditions. Table 4 summarizes the metrics used.
For model training, the BiLSTM and CNN components were optimized using the Adam optimizer with an initial learning rate of 0.001 and exponential decay of 0.95 per epoch. The batch size was set to 64, and the total training spanned 100 epochs with early stopping (patience = 10) to prevent overfitting. L2 regularization (λ = 1e-4) and dropout layers (rate = 0.3) were applied across all deep modules to enhance generalization. LightGBM was configured with 500 boosting iterations, a learning rate of 0.05, and a maximum tree depth of 7. All models were implemented in TensorFlow 2.12 and trained on an NVIDIA RTX 4090 GPU using 80/10/10 train/validation/test splits.
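For reference, the sketch below restates these hyperparameters as a hedged configuration; it reflects the reported values (500 boosting iterations, learning rate 0.05, maximum depth 7, seed 2023, initial deep-module learning rate 0.001 with 0.95 decay per epoch) but omits the full training loop and callbacks.

```python
# Hedged configuration sketch of the reported training settings.
import lightgbm as lgb

lgbm_params = dict(
    n_estimators=500,        # boosting iterations
    learning_rate=0.05,
    max_depth=7,
    random_state=2023,       # fixed seed for deterministic boosting
)
gbm = lgb.LGBMClassifier(**lgbm_params)   # fit separately on the statistical features

def lr_schedule(epoch: int, initial_lr: float = 1e-3, decay: float = 0.95) -> float:
    """Exponential decay of the deep-module learning rate, 0.95 per epoch."""
    return initial_lr * (decay ** epoch)
```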
Results and discussion
This section presents the experimental results obtained using the proposed PatternFusion model, followed by an in-depth discussion of its performance across multiple time-series datasets. The experiments were conducted using benchmark datasets from industrial sensors, physiological records, and synthetic pattern streams with varying temporal dynamics. All models were evaluated using the metrics described earlier, with a primary focus on pattern classification accuracy, generalization ability, and robustness to noise.
Classification performance
Table 5 summarizes the classification performance of PatternFusion across three benchmark datasets: NAB, MIT-BIH, and a synthetic composite dataset. The model achieved high F1-scores and precision-recall tradeoffs, indicating strong discriminative capability even in imbalanced settings. The highest AUC-ROC values demonstrate its reliability in distinguishing subtle patterns (see Fig. 2).
Comparative analysis
To assess the relative effectiveness of PatternFusion, we compared it with several baseline and state-of-the-art models, including standalone LSTM, CNN, Random Forest (RF), and Gradient Boosted Trees (XGBoost) (see Fig. 3). As shown in Table 6, PatternFusion consistently outperformed all other models across multiple metrics, confirming the advantage of the ensemble hybrid strategy.
Discussion
The results highlight the robustness and versatility of PatternFusion in handling diverse temporal data. The hybrid ensemble approach effectively combines temporal memory (via LSTM), spatial feature extraction (via CNN), and statistical interpretability (via tree models), leading to consistent improvements in all evaluation metrics. The addition of an attention-based fusion mechanism enhances adaptability to sequences of varying lengths and temporal structures. Notably, PatternFusion showed significant gains in F1-score and MCC, especially on the synthetic dataset containing overlapping and transient patterns. High AUC-ROC scores also indicate excellent probability calibration and minimal overfitting. Post-processing such as temporal smoothing further refined the model's predictions, positioning it well for real-time deployment.
Figures 2 and 3 illustrate these results, indicating uniform improvement in performance metrics over baseline methods. Overall, PatternFusion not only outperforms standard deep learning and ensemble models but also generalizes strongly across pattern scenarios in both practical and synthetic settings.
Adaptive fusion and architecture search
The current PatternFusion architecture is static, consisting of BiLSTM, CNN, and LightGBM components combined through an attention-based fusion method. This framework performs well on many datasets, but a fixed network architecture limits its ability to excel across all biometric modalities and deployment contexts. Combining Neural Architecture Search (NAS) with adaptive fusion methods could significantly increase the adaptability and accuracy of the implementation.
Neural Architecture Search (NAS).
Neural Architecture Search automates the design of neural network architectures for deep learning problems. Rather than relying on tedious manual configuration of layers and connections, NAS uses techniques such as Reinforcement Learning (RL), Evolutionary Algorithms (EA), and gradient-based search to explore a wide space of architectures and identify the best possible one for a given task.
For PatternFusion, NAS can optimize:
- The number of layers and neurons in BiLSTM to better capture temporal patterns.
- The convolutional kernel sizes and pooling strategies in CNN for enhanced spatial feature extraction.
- The depth and boosting iterations of LightGBM to improve interpretability and feature learning.
Proposed Integration:
A multi-objective NAS framework can be employed, optimizing for:
- Accuracy.
- Inference Time.
- Memory Footprint.

This automated exploration can produce an optimized fusion architecture that maximizes accuracy while minimizing latency and resource consumption.
Adaptive Fusion Mechanisms.
Currently, PatternFusion uses a static attention-driven fusion mechanism, where the model weights the contributions of BiLSTM, CNN, and LightGBM based on pre-learned attention scores. To enhance adaptability, Gating Networks and Dynamic Routing Mechanisms can be introduced:
- Gating Networks: These dynamically adjust the fusion weights during runtime, based on the characteristics of incoming data. This allows the model to prioritize specific modalities when their features are more prominent.
- Dynamic Routing: Inspired by Capsule Networks, dynamic routing mechanisms enable features to be selectively amplified based on their spatial or temporal significance, enhancing decision-making for complex multimodal patterns.
Proposed Integration:
PatternFusion can incorporate a Gating Network to recalibrate the importance of BiLSTM, CNN, and LightGBM outputs at each inference step. This adaptive weighting would enhance real-time decision-making and improve robustness to varying biometric data conditions.
Experimental Validation:
To validate the benefits of NAS and adaptive fusion, a new set of experiments should be conducted comparing the baseline PatternFusion architecture with:
- PatternFusion + NAS.
- PatternFusion + Gating Networks.
- PatternFusion + Dynamic Routing.
Metrics for evaluation:
- Accuracy.
- F1-Score.
- Model Latency.
- Parameter Efficiency.

These experiments would demonstrate how automated architecture optimization and adaptive mechanisms contribute to better performance and scalability.
PatternFusion: a hybrid ensemble model for time-series pattern recognition
The PatternFusion framework is a hybrid ensemble model combining deep learning, statistical models, and meta-learning to recognize patterns in time-series data. This subsection evaluates its empirical performance using comparative benchmarks, detection scenarios, and interpretability tools.
Model comparison with confidence intervals
Figure 4 visualizes the classification performance of PatternFusion and five baseline models. The metrics include Accuracy, Precision, Recall, and F1-Score, each with 95% confidence intervals. PatternFusion consistently outperforms other models in all categories as shown in Table 7.
Comparison with state-of-the-art (SOTA) methods
PatternFusion demonstrates superior performance when benchmarked against leading multimodal biometric fusion approaches, including VGGFace2 + IrisCode (combining facial and iris recognition), CNN-LSTM Fusion (integrating spatial and temporal learning), and Multi-Stream Transformers (processing multiple biometric streams). Comprehensive evaluation across PUT, PolyU and BioCop datasets revealed PatternFusion’s significant advantages, achieving substantially lower Equal Error Rates (1.2% vs. 1.8% on PUT, 1.5% vs. 2.5% on PolyU, and 1.8% vs. 2.5% on BioCop) compared to these state-of-the-art methods. This performance improvement stems from PatternFusion’s innovative hybrid architecture, which combines BiLSTM for temporal processing, CNN for spatial feature extraction, and LightGBM for interpretable statistical learning, all dynamically balanced through an attention-guided fusion mechanism. The framework’s ability to adaptively weight different biometric modalities while maintaining computational efficiency and interpretability through LightGBM’s feature importance metrics accounts for its consistently lower false acceptance and rejection rates across all test scenarios. Visual analysis of ROC and Precision-Recall curves further confirms PatternFusion’s enhanced decision boundaries and superior error rate performance, demonstrating its effectiveness as a robust solution for diverse multimodal biometric applications.
Evaluation Metrics:
PatternFusion was compared with state-of-the-art multimodal biometric fusion techniques, namely VGGFace2 + IrisCode, CNN-LSTM Fusion, and Multi-Stream Transformers, on three benchmark datasets: PUT, PolyU, and BioCop. The key evaluation metrics were Equal Error Rate (EER), False Acceptance Rate (FAR), and False Rejection Rate (FRR). The results show that PatternFusion achieved the lowest EER, FAR, and FRR on all datasets, implying the greatest accuracy and robustness in biometric verification. For instance, on the PUT dataset PatternFusion attained an EER of 1.2% compared with 1.8% for Multi-Stream Transformers. This performance stems from the BiLSTM-CNN-LightGBM architecture and the attention-driven fusion mechanism, which capture temporal and spatial dependencies and reduce potential errors.
Table 8 compares PatternFusion against state-of-the-art (SOTA) approaches, namely VGGFace2 + IrisCode, CNN-LSTM Fusion, and Multi-Stream Transformers, on three datasets: PUT, PolyU, and BioCop. The reported metrics are Equal Error Rate (EER), False Acceptance Rate (FAR), and False Rejection Rate (FRR). PatternFusion consistently shows the lowest error rates on all datasets, indicating improved biometric verification accuracy and fewer false matches than the other models. In particular, on the PUT dataset PatternFusion reports an EER of 1.2%, better than Multi-Stream Transformers at 1.8%. This trend extends to PolyU and BioCop, where PatternFusion outperforms all its counterparts on all metrics, confirming its robustness and improved fusion strategy.
Figure 5 compares the Equal Error Rate (EER), False Acceptance Rate (FAR), and False Rejection Rate (FRR) across three benchmark datasets: PUT, PolyU, and BioCop. PatternFusion consistently yields the lowest error rates on every dataset, ahead of VGGFace2 + IrisCode, CNN-LSTM Fusion, and Multi-Stream Transformers. This shows its high precision in biometric verification and improved resistance to both false acceptances and false rejections.
Comparison with transformer-based models (Informer, Autoformer)
Recent advances in time-series analysis have highlighted transformer-based architectures, notably Informer and Autoformer, for efficient modeling of long-range dependencies. These models exploit self-attention mechanisms to capture temporal patterns over long sequences and achieve better results than alternative methods for time-series prediction and anomaly detection. To strengthen the evaluation of PatternFusion, we compared its performance with Informer and Autoformer on three main datasets: PUT, PolyU, and BioCop.
Informer:
Informer targets highly efficient long-sequence time-series forecasting. It introduces a ProbSparse self-attention mechanism that reduces the complexity of traditional self-attention from \(\:O(n^{2})\) to \(\:O(n\:\text{log}\:n)\). Informer also uses a distilling mechanism that retains only the most informative representations, further reducing resource consumption.
Autoformer:
As an enhancement of the transformer architecture, Autoformer introduces a decomposition block designed to isolate trend and seasonal components from time-series data. This mechanism enhances interpretability while reducing noise, enabling the model to learn significant temporal patterns. Autoformer has performed strongly in multivariate time-series forecasting as well as anomaly detection.
Experimental Setup:
The experiments used the same preprocessing and evaluation metrics as PatternFusion. Both Informer and Autoformer were trained on the PUT, PolyU, and BioCop datasets, with Accuracy, F1-Score, AUC, and EER as the main metrics.
Table 9 shows that PatternFusion outperforms both Informer and Autoformer across all datasets, achieving the highest Accuracy, F1-Score, and AUC and the lowest Equal Error Rate (EER). In particular, on the PUT dataset PatternFusion achieved an EER of 1.2%, much lower than Informer (1.9%) and Autoformer (1.7%). This substantiates the efficiency of PatternFusion's hybrid architecture, which captures temporal and spatial features while taking advantage of LightGBM's interpretability. Its attention-driven fusion mechanism, which dynamically weights the contributions of BiLSTM, CNN, and LightGBM during prediction, further improves decision performance.
Figure 6 compares the Equal Error Rate (EER), False Acceptance Rate (FAR), and False Rejection Rate (FRR) across the three datasets (PUT, PolyU, and BioCop). PatternFusion consistently delivers the lowest error rates on all metrics, beating both Informer and Autoformer. This demonstrates its superior capability in biometric verification tasks, offering higher accuracy and better resilience against false matches and rejections.
Real-time pattern and anomaly detection
In a streaming anomaly detection setting, Fig. 7 demonstrates PatternFusion’s output across signal, confidence, and binary anomaly layers. It reacts rapidly to sudden changes while maintaining low false positives. Table 10 summarizes PatternFusion’s detection responses at specific anomaly positions within the time-series data. It shows that the model accurately identifies true patterns (e.g., Pattern A, B, and Spike) with high confidence scores ranging from 0.85 to 0.92. This demonstrates PatternFusion’s effectiveness in real-time anomaly detection with reliable confidence quantification.
Pattern recognition timeline
Figure 8 overlays pattern confidence on the input signal in real time. The color scale beneath the waveform indicates evolving confidence, confirming Pattern A, B, and C at the expected intervals. Table 11 presents the detected patterns (A, B, and C) along with their start and end times and corresponding average confidence scores. It illustrates PatternFusion’s ability to recognize and localize distinct temporal patterns in real-time, maintaining high confidence levels (0.88 to 0.93), which confirms the model’s precision and reliability in temporal segmentation.
Feature importance comparison
Feature attribution analysis in Fig. 9 shows PatternFusion leveraging a wide range of spectral, temporal, and structural features, unlike the narrower focus observed in LSTM and CNN. Table 12 compares the normalized feature importance scores across four models (LSTM, CNN, Transformer, and PatternFusion). It highlights that PatternFusion leverages a broader range of features (e.g., spectral power, shape similarity, autocorrelation) more effectively than other models. This demonstrates its superior generalization and interpretability by incorporating diverse temporal, spectral, and structural characteristics in pattern recognition.
Ablation study
To thoroughly evaluate the contributions of each module in PatternFusion, an extensive ablation study was conducted. This analysis aimed to quantify the performance impact of individual components: BiLSTM, CNN, LightGBM, and the Attention Mechanism. The evaluation was performed under four main configurations:
1. Without BiLSTM.
2. Without CNN.
3. Without LightGBM.
4. Without Attention Mechanism.
Figure 10 and Table 13 validate the additive benefits of combining the BiLSTM, CNN, LightGBM, and attention modules: removing any component degrades at least two metrics.
Each module was selectively removed, and the model was retrained under the same experimental conditions to measure its influence on Accuracy, F1-Score, and AUC. The evaluation was conducted on the PUT, PolyU, and BioCop datasets for consistency. Additionally, a paired t-test was performed to determine if the observed differences were statistically significant.
The ablation study results indicate that BiLSTM contributes the most to temporal feature learning, as its removal results in the largest drop in all metrics across datasets. CNN is crucial for spatial feature extraction, particularly for identifying local biometric patterns. LightGBM enhances interpretability and improves classification robustness by ranking important features, while the Attention Mechanism optimally fuses these components, ensuring adaptive focus on the most informative patterns.
Statistical Significance Testing (t-test):
To validate the significance of the observed differences, a paired t-test was conducted for each configuration against the full PatternFusion model. The results show that the performance drops were statistically significant with p-values < 0.01, confirming that each module significantly contributes to the overall performance.
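A minimal sketch of such a paired t-test is shown below; the per-run scores are placeholders for illustration, not the paper's raw values.

```python
# Sketch of the paired t-test used for the ablation significance analysis:
# per-run metric scores of the full model are compared against an ablated configuration.
from scipy import stats

full_model_f1 = [0.949, 0.951, 0.947]      # e.g., F1 over repeated runs/seeds (placeholders)
without_bilstm_f1 = [0.921, 0.918, 0.924]  # ablated configuration (placeholders)

t_stat, p_value = stats.ttest_rel(full_model_f1, without_bilstm_f1)
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")  # p < 0.01 indicates a significant drop
```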
PatternFusion integrates a robust hybrid ensemble architecture tailored for time-series pattern detection. This subsection illustrates the model’s capabilities using diagnostic visualizations, statistical analyses, real-time detection outputs, and multi-scale clustering. The results validate its adaptability across diverse temporal contexts and feature spaces.
Real-time pattern detection interface
Figure 11 showcases the end-to-end system output of PatternFusion in a live detection scenario. It includes pattern identification, anomaly flags, confidence evaluation, type classification, and operational system status.
Table 14 shows the real-time detection output of PatternFusion for different patterns (A to D) in a live signal monitoring scenario. It lists the pattern location (in seconds), whether an anomaly was detected, and the corresponding confidence scores (ranging from 0.89 to 0.94). This validates PatternFusion’s ability to perform high-confidence, real-time pattern and anomaly detection in dynamic environments.
High-dimensional feature projection
Figure 12 presents a parallel coordinates plot comparing ground truth pattern clusters with PatternFusion’s assignments, revealing close alignment between the two.
Table 15 lists the key features that contribute most to separating pattern clusters in the latent space, along with their variance contribution percentages. Features like shape similarity (22.4%), autocorrelation peak (19.3%), and cross-feature correlation (17.8%) are shown to be the most influential. This highlights the model’s ability to use meaningful and diverse features for clear and discriminative clustering of temporal patterns.
Multi-scale decomposition insight
As shown in Fig. 13, PatternFusion integrates signal components across macro, meso, and micro time resolutions. The decomposition aids hierarchical feature understanding.
Table 16 presents the percentage contribution of detected patterns (A, B, and C) across macro, meso, and micro temporal scales. It shows that different patterns dominate at different resolutions—for instance, Pattern B has the highest contribution (61%) at the meso scale. This emphasizes PatternFusion’s capability to capture multi-scale temporal dynamics essential for accurate pattern recognition.
Dynamic pattern evolution mapping
Figure 14 depicts pattern intensity across time and frequency bins. Transient bursts and periodic motifs are distinctly recognized with high resolution. Table 17 outlines dynamic pattern segments by specifying their time and frequency ranges along with classification tags such as “Periodic,” “Event Burst,” and “Local Shift.” It illustrates how PatternFusion identifies and categorizes evolving temporal patterns based on their spectral and temporal characteristics, demonstrating its effectiveness in capturing both stable and transient events.
Cluster separation in latent space
Figure 15 provides a 3D t-SNE visualization of cluster compactness. PatternFusion output exhibits tight intra-class cohesion and distinct inter-class separation. Table 18 reports cluster metrics—specifically intra-cluster and inter-cluster distances—for original patterns (A, B, C) and PatternFusion-predicted clusters. PatternFusion shows tighter intra-cluster distances and greater inter-cluster separation, indicating improved cluster compactness and clearer class boundaries. This validates the model’s effectiveness in generating well-separated and coherent pattern representations in latent space.
Cross-dataset validation and generalizability
To address the concerns of limited dataset diversity, the proposed PatternFusion model was further validated across additional multimodal biometric datasets, specifically BioCop, WVU, and BIOMDATA. These datasets were selected for their diverse range of biometric samples, including fingerprint, iris, and facial recognition, which introduce variability in terms of acquisition conditions, resolution, and demographic representation. This validation step was performed to ensure the model’s robustness and generalizability across heterogeneous environments.
The datasets were preprocessed in a similar manner to the PUT and PolyU datasets to maintain consistency. Cross-dataset validation was conducted where the model, trained on one dataset (e.g., PUT), was tested on another (e.g., WVU) to measure its adaptability and generalization capabilities.
The results in Table 19 indicate that PatternFusion maintains high performance across all tested datasets, demonstrating robust generalizability. Accuracy drops only slightly when transferring from PUT and PolyU to BioCop, WVU, and BIOMDATA, which signifies that the model adapts well to unseen data distributions. F1-scores and AUC values likewise remain high, confirming the system's ability to distinguish genuine matches from impostors across divergent biometric domains.
Cross-Domain Generalization and Adaptation:
A major challenge in biometric verification systems is the ability to generalize when transferring from a training domain to a testing environment with a different data distribution. Such domain shift can severely degrade the performance of models like PatternFusion if not properly addressed. To assess this, a comprehensive cross-domain generalization and adaptation experiment was carried out using several datasets: PUT, PolyU, BioCop, and an additional Cross-Domain Transfer Set produced by simulating changes in acquisition devices, lighting conditions, and demographic characteristics.
Domain Adaptation Techniques Implemented:
In the transfer learning with fine-tuning approach, PatternFusion was pretrained on the PUT dataset and fine-tuned on the PolyU and BioCop datasets. This enabled the model to adapt its learned representations to the specific characteristics of the new biometric modalities, thereby improving generalization. Fine-tuning used a learning-rate decay of 1e-4 together with regularization to avoid overfitting. On the Cross-Domain Transfer Set, fine-tuning raised accuracy from 89.4% to 92.7% and reduced the Equal Error Rate (EER) from 3.5% to 2.8%. These improvements show that fine-tuning narrows domain differences while preserving verification reliability.
The second strategy, domain-adversarial training, introduced a domain discriminator during model optimization. The discriminator was trained to distinguish the source domain (PUT) from the target domains (PolyU, BioCop), while the main model was optimized to confuse it. This adversarial objective forces PatternFusion to extract domain-invariant features, making it robust to domain shift. When tested on the Cross-Domain Transfer Set, domain-adversarial training further improved accuracy to 93.5% and reduced the EER to 2.5%, outperforming both the baseline and the fine-tuning strategy. Table 20 highlights that domain-adversarial training effectively minimizes the impact of distributional shifts, enabling the model to generalize more robustly across datasets.
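For illustration, the sketch below shows the core of a domain-adversarial setup with a gradient reversal layer in PyTorch. The layer sizes and the module names (feature_extractor, task_head, discriminator) are hypothetical placeholders rather than the exact PatternFusion implementation.

```python
# Minimal sketch of domain-adversarial training via gradient reversal (assumed configuration).
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        # Reverse (and scale) the gradient flowing back into the feature extractor.
        return -ctx.lambd * grad_output, None

class DomainDiscriminator(nn.Module):
    def __init__(self, feat_dim=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(feat_dim, 64), nn.ReLU(), nn.Linear(64, 2))

    def forward(self, feats, lambd=1.0):
        # Gradient reversal makes the feature extractor learn domain-invariant features.
        return self.net(GradReverse.apply(feats, lambd))

# Inside a training step (feature_extractor, task_head, discriminator are assumed modules):
# feats = feature_extractor(x)
# task_loss = nn.functional.cross_entropy(task_head(feats), y)
# dom_loss = nn.functional.cross_entropy(discriminator(feats), domain_labels)
# (task_loss + dom_loss).backward()   # the reversal pushes feats to confuse the discriminator
```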
The findings (Table 20) confirm that cross-domain adaptation techniques significantly enhance PatternFusion's ability to maintain high verification accuracy and low error rates, even when deployed in environments with different data distributions. These improvements position PatternFusion as a highly adaptive and resilient multimodal biometric system suitable for real-world applications where domain shifts are inevitable. To further illustrate these results, t-SNE visualizations and ROC curves are used to show the alignment of feature distributions before and after adaptation.
Integration of self-supervised learning (contrastive learning)
To further enhance the generalization capabilities of PatternFusion, particularly in handling noisy or smaller datasets, Contrastive Learning was integrated as a pretraining mechanism. Contrastive Learning is a self-supervised learning approach that learns effective feature representations by maximizing the agreement between positive pairs while minimizing the agreement between negative pairs in a latent space. This method allows the model to learn robust feature embeddings without the need for explicit labels, making it highly suitable for biometric data where variations are subtle and labels can be noisy.
Contrastive Learning Mechanism:
The key idea of Contrastive Learning in PatternFusion is to push similar samples (e.g., the same user’s biometric modalities captured at different times) closer together in the feature space, while pulling dissimilar samples (e.g., different users) apart. This is achieved using a contrastive loss function, which is defined as follows:
$$\mathcal{L}_{\text{contrastive}} \;=\; \sum_{(i,j)} \Bigl[\, s_{ij}\, d(f_i, f_j)^2 \;+\; \bigl(1 - s_{ij}\bigr)\, \max\bigl(0,\; m - d(f_i, f_j)\bigr)^2 \Bigr]$$

where:

- \(P\) denotes the set of positive pairs and \(N\) the set of negative pairs;
- \(f_i\) and \(f_j\) are the embeddings of samples \(i\) and \(j\);
- \(s_{ij}\) is the pair similarity label between samples \(i\) and \(j\) (1 for pairs in \(P\), 0 for pairs in \(N\));
- \(d(f_i, f_j)\) is the distance metric (e.g., Euclidean distance);
- \(m\) is a margin that enforces separation for dissimilar pairs.
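A minimal PyTorch sketch of a margin-based contrastive loss consistent with the definitions above is shown below; the margin value and the batch of random embeddings are illustrative, and the authors' exact implementation may differ.

```python
# Sketch of a margin-based contrastive loss over embedding pairs.
import torch
import torch.nn.functional as F

def contrastive_loss(f_i, f_j, s_ij, margin=1.0):
    """f_i, f_j: (batch, dim) embeddings; s_ij: 1 for positive pairs, 0 for negative pairs."""
    d = F.pairwise_distance(f_i, f_j)                    # Euclidean distance d(f_i, f_j)
    pos_term = s_ij * d.pow(2)                           # pull positive pairs together
    neg_term = (1 - s_ij) * F.relu(margin - d).pow(2)    # push negatives beyond margin m
    return (pos_term + neg_term).mean()

# Example with random embeddings and random pair labels:
f_i, f_j = torch.randn(8, 128), torch.randn(8, 128)
s_ij = torch.randint(0, 2, (8,)).float()
print(contrastive_loss(f_i, f_j, s_ij))
```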
Implementation in PatternFusion:
The contrastive learning mechanism was integrated into the BiLSTM and CNN modules of PatternFusion during the pretraining phase. The process is as follows:
1. Pretraining stage: both the BiLSTM and CNN are pretrained with the contrastive loss to learn temporal and spatial embeddings that are invariant to noise and robust across variations.
2. Fine-tuning stage: after pretraining, the embeddings are fine-tuned together with the LightGBM classifier and attention mechanism to optimize classification accuracy.
3. Feature alignment: contrastive learning ensures that features from different modalities (e.g., face, fingerprint) are well aligned in the latent space, enhancing the fusion process.
Experimental Results:
To evaluate the effectiveness of the contrastive learning integration, experiments were conducted across the PUT, PolyU, and BioCop datasets. The key metrics—Accuracy, F1-Score, and AUC—demonstrated notable improvements compared to the baseline PatternFusion model without contrastive pretraining.
Table 21 presents the evaluation metrics (Accuracy, F1-score, and AUC) of PatternFusion across three datasets: PUT, PolyU, and BioCop. The integration of contrastive pretraining improves model performance across all metrics and datasets. Notably, accuracy on the PUT dataset increases from 96.4% to 97.2%, while the AUC improves from 98.1% to 98.7%, indicating enhanced feature learning and clearer decision boundaries.
These gains show that contrastive learning equips PatternFusion to extract robust patterns from noisy or small-sample datasets, and confirm that learning robust feature embeddings during pretraining significantly benefits the downstream fusion task.
Figure 16 illustrates the t-SNE embeddings of biometric feature representations before and after the application of contrastive learning. On the left, the embeddings are more scattered, indicating less distinct separation between classes. After contrastive learning (right), the clusters become more compact and better separated, showcasing improved feature alignment and class separability. This demonstrates the enhanced ability of PatternFusion to distinguish between genuine and impostor samples following contrastive pretraining.
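Comparisons such as Fig. 16 can be produced with a standard t-SNE projection of the learned embeddings; the sketch below uses synthetic embeddings and labels as stand-ins for the pre- and post-contrastive feature vectors.

```python
# Sketch of a 2-D t-SNE projection of feature embeddings (synthetic stand-in data).
import numpy as np
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

embeddings = np.random.randn(300, 128)      # stand-in feature embeddings
labels = np.random.randint(0, 3, 300)       # stand-in class ids

proj = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(embeddings)
plt.scatter(proj[:, 0], proj[:, 1], c=labels, s=8)
plt.title("t-SNE of feature embeddings")
plt.show()
```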
In-the-wild testing and cross-distribution scenarios
To evaluate the real-world applicability and robustness of PatternFusion, experiments were extended to in-the-wild datasets and cross-distribution scenarios. Unlike curated datasets such as PUT, PolyU, and BioCop, in-the-wild datasets reflect real-world biometric data collected under uncontrolled conditions, including variations in lighting, occlusions, pose differences, and background noise. For this purpose, two widely recognized datasets were selected:
1. FaceForensics++: a challenging dataset with manipulated facial videos and real-world images subjected to different levels of compression and lighting variation.
2. Labeled Faces in the Wild (LFW): a benchmark dataset of facial images captured in uncontrolled environments, providing a robust test for biometric verification systems.
Experimental Setup:
The FaceForensics++ and LFW datasets were integrated into the evaluation pipeline of PatternFusion to measure its performance under:

- uncontrolled lighting conditions,
- varying poses and expressions,
- occlusions (glasses, hats, masks),
- background variability.
Metrics evaluated:

- Accuracy
- F1-score
- AUC (Area Under the Curve)
- EER (Equal Error Rate); a short computation sketch is given below.
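As a reference, the EER can be derived from verification scores and ground-truth labels via the ROC curve; the sketch below uses scikit-learn with synthetic score distributions in place of the actual model outputs.

```python
# Sketch: Equal Error Rate (EER) from scores, using the point where FAR ≈ FRR.
import numpy as np
from sklearn.metrics import roc_curve

def equal_error_rate(labels, scores):
    fpr, tpr, _ = roc_curve(labels, scores)
    fnr = 1 - tpr
    idx = np.nanargmin(np.abs(fnr - fpr))      # threshold where false accepts ≈ false rejects
    return (fpr[idx] + fnr[idx]) / 2

# Synthetic genuine vs. impostor score distributions.
labels = np.concatenate([np.ones(500), np.zeros(500)])
scores = np.concatenate([np.random.normal(0.8, 0.10, 500),
                         np.random.normal(0.4, 0.15, 500)])
print(f"EER = {100 * equal_error_rate(labels, scores):.2f}%")
```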
PatternFusion was trained on the standard datasets (PUT, PolyU, and BioCop) and evaluated directly on FaceForensics++ and LFW without domain-specific fine-tuning to measure its raw generalization ability.
The table summarizes the evaluation of PatternFusion on challenging in-the-wild datasets (FaceForensics++ and LFW) under three configurations: Baseline, Fine-Tuned, and Domain-Adaptive. Fine-tuning improves accuracy and reduces the Equal Error Rate (EER), while domain adaptation further enhances these metrics, demonstrating stronger generalization and resilience to uncontrolled conditions. The model's adaptability to different environments is clearly reflected in higher F1-scores and AUC values.
Table 22 demonstrates that PatternFusion performs competitively even in the highly challenging in-the-wild environments of FaceForensics++ and LFW. The baseline model obtained 87.3% accuracy and 4.3% EER on FaceForensics++, remaining robust under considerable environmental noise and variance. Fine-tuning raised accuracy to 91.8% and lowered the EER to 3.2%, indicating better alignment with the target-domain features. Domain-adversarial training brought further gains, reaching 93.5% accuracy and 2.5% EER, demonstrating excellent generalization to cross-distribution settings.
These improvements are credited to PatternFusion's attention-driven fusion mechanism, which adaptively weighs the contributions of the BiLSTM, CNN, and LightGBM branches according to the quality and properties of the incoming biometric data. This flexibility enables more robust matching even when conditions are suboptimal, as is typical in real-world settings.
Figure 17 shows ROC and precision-recall curves for PatternFusion under three configurations (Baseline, Fine-Tuned, and Domain-Adaptive), evaluated on the in-the-wild datasets FaceForensics++ and LFW. The ROC curves show that fine-tuning and domain adaptation raise the true positive rate while lowering the false positive rate, indicating improved boundary separation in uncontrolled scenarios. The precision-recall curves further highlight the enhanced ability to distinguish genuine users from impostors, particularly in the domain-adaptive setting, which achieves the highest area under both curves. This signifies greater robustness and reliability for real-world biometric verification.
Evaluation on noisy, real-world data and simulated noise scenarios
To evaluate the robustness of PatternFusion under realistic conditions, experiments were performed on noisy and corrupted data that mimic real deployments. In practice, biometric data are frequently occluded, vary in illumination, suffer from motion blur, and are corrupted by sensor noise, all of which can severely compromise model performance. To assess this impact, two experimental scenarios were introduced:
1. Real-world noisy datasets
   - CelebA-HQ Noise Augmentation: facial images from the CelebA-HQ dataset were subjected to synthetic noise, including Gaussian blur, salt-and-pepper noise, and random occlusions.
   - PolyU Fingerprint Dataset (Noisy Variant): fingerprint images were degraded using blur effects, partial occlusions, and compression artifacts to mimic real-world distortions.
2. Simulated noise scenarios (implemented as sketched below)
   - Gaussian noise (σ = 0.2): added to simulate sensor interference.
   - Random dropout (10% of pixels): random pixel blackout to replicate data corruption during capture.
   - Motion blur (kernel size = 5): applied to simulate movement during image capture.
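The simulated corruptions can be reproduced with standard image operations; the sketch below gives one possible implementation using OpenCV and NumPy, with parameter values mirroring the list above (the exact preprocessing in the study may differ).

```python
# Illustrative implementations of the simulated noise scenarios.
import numpy as np
import cv2

def add_gaussian_noise(img, sigma=0.2):
    """Additive Gaussian noise (sigma on the [0, 1] intensity scale)."""
    noisy = img.astype(np.float32) / 255.0 + np.random.normal(0, sigma, img.shape)
    return (np.clip(noisy, 0, 1) * 255).astype(np.uint8)

def random_dropout(img, frac=0.10):
    """Black out a random fraction of pixels to mimic capture corruption."""
    mask = np.random.rand(*img.shape[:2]) < frac
    out = img.copy()
    out[mask] = 0
    return out

def motion_blur(img, ksize=5):
    """Horizontal motion-blur kernel of the given size."""
    kernel = np.zeros((ksize, ksize), np.float32)
    kernel[ksize // 2, :] = 1.0 / ksize
    return cv2.filter2D(img, -1, kernel)
```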
Experimental setup
- Training data: standard PUT, PolyU, and BioCop datasets.
- Testing data: noisy variants of the CelebA-HQ and PolyU Fingerprint datasets.
- Metrics evaluated: Accuracy, F1-score, AUC (Area Under the Curve), and EER (Equal Error Rate).
The experimental results in Table 23 demonstrate that PatternFusion maintains a high level of robustness even when subjected to severe noise and distortions. The baseline model performed well across all noise types, with mild degradation mainly under random occlusions and salt-and-pepper noise. Fine-tuning increased accuracy by 3.2% on average and reduced the EER on both the CelebA-HQ and PolyU Fingerprint datasets. The largest improvements came from domain-adaptive training, with which the model reached up to 90.5% accuracy and 3.2% EER under Gaussian blur. This indicates that domain adaptation makes the model more tolerant to environmental variation and sensor noise.
Figure 18 shows PatternFusion's performance under different noise types, including Gaussian blur, salt-and-pepper noise, random occlusions, and compression artifacts. The baseline model's accuracy drops noticeably under moderate to severe noise. Fine-tuning greatly enhances robustness, while domain-adaptive training achieves the highest robustness across all noise types, with stable accuracy even in difficult conditions. This demonstrates the model's superior generalization when exposed to environmental noise and real-world distortions.
Overall, with fine-tuning and domain adaptation, PatternFusion exhibits strong robustness against noisy and corrupted biometric data, making it well suited for deployment in imperfect data acquisition environments.
Hyperparameter sensitivity analysis
To understand how different hyperparameters affect PatternFusion's performance, an extensive hyperparameter sensitivity analysis was performed. The analysis assessed how varying critical hyperparameters influences model accuracy, error rates, and generalization capacity. The key hyperparameters analyzed include:
1. Window size (temporal segment length)
2. Number of LSTM layers
3. LightGBM depth
4. CNN filter sizes
5. Learning rate
Experimental setup
- Datasets used: PUT, PolyU, and BioCop.
- Evaluation metrics: Accuracy, F1-score, AUC (Area Under the Curve), and EER (Equal Error Rate).
The analysis varied one hyperparameter at a time while keeping the others at their baseline values, so that each individual effect could be isolated (a minimal sketch of this protocol is given below). This sensitivity testing was used to determine the configuration yielding maximum performance, on which Table 24 is based.
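The sketch below illustrates the one-at-a-time protocol; the baseline values follow the reported optima, the candidate grids are illustrative, and train_and_evaluate is a hypothetical placeholder for training and scoring PatternFusion with a given configuration.

```python
# Sketch of one-at-a-time hyperparameter sensitivity analysis (grids and helper are assumed).
baseline = {"window_size": 60, "lstm_layers": 2, "lgbm_depth": 15,
            "cnn_filter": 5, "learning_rate": 0.01}

search_space = {"window_size": [30, 45, 60, 90],
                "lstm_layers": [1, 2, 3],
                "lgbm_depth": [5, 10, 15, 20],
                "cnn_filter": [3, 5, 7],
                "learning_rate": [0.001, 0.01, 0.1]}

def train_and_evaluate(config):
    # Placeholder: in practice, train PatternFusion with `config` and return metrics.
    return {"accuracy": None, "eer": None}

results = {}
for name, values in search_space.items():
    for v in values:
        config = dict(baseline, **{name: v})    # only one hyperparameter deviates from baseline
        results[(name, v)] = train_and_evaluate(config)
```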
Table 25 suggests that window size has a significant effect on model accuracy and error rates, with a segment length of 60 yielding the best temporal representation of the biometric signals. For the LSTM, two layers best balanced complexity and feature extraction; going beyond two layers yielded diminishing returns, probably due to overfitting. For LightGBM, the highest accuracy was recorded at a maximum depth of 15 without excessive complexity. For the CNN filter size, the 5 × 5 kernel captured spatial features best. Finally, a learning rate of 0.01 provided the most stable and effective convergence during training.
These results emphasize the need to fine-tune hyperparameters in order to achieve maximal performance of PatternFusion in biometric verification applications.
Figure 19 shows the sensitivity of PatternFusion to three critical hyperparameters: window size, number of LSTM layers, and LightGBM depth. The plots indicate that a window size of 60 gives the best accuracy and the lowest EER. Likewise, two LSTM layers provide the best configuration, balancing feature extraction against model complexity, and a LightGBM maximum depth of 15 yields peak performance. These observations underline the importance of tuning hyperparameters for optimal biometric verification accuracy.
To validate the robustness of the reported improvements, we conducted paired two-tailed t-tests across five independent experimental runs for each model comparison. The results confirmed that the performance gains of PatternFusion over the strongest baselines (CNN-LSTM Fusion and Autoformer) are statistically significant (p < 0.001). Corresponding Cohen’s d values > 1.6 indicate a large effect size, affirming that the observed improvements are not due to random variance but reflect consistent performance advantages of the proposed approach.
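For reference, the significance test described above can be computed with SciPy as sketched below; the per-run F1 scores are hypothetical placeholders, not the reported values.

```python
# Sketch: paired two-tailed t-test and Cohen's d over five independent runs.
import numpy as np
from scipy import stats

patternfusion = np.array([0.962, 0.958, 0.965, 0.960, 0.963])   # hypothetical per-run F1 scores
baseline = np.array([0.931, 0.928, 0.935, 0.930, 0.933])        # e.g. CNN-LSTM Fusion runs

t_stat, p_value = stats.ttest_rel(patternfusion, baseline)       # paired two-tailed t-test
diff = patternfusion - baseline
cohens_d = diff.mean() / diff.std(ddof=1)                        # effect size on paired differences
print(f"t = {t_stat:.2f}, p = {p_value:.4g}, Cohen's d = {cohens_d:.2f}")
```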
Computational cost and inference time analysis
Real-time biometric verification requires both high accuracy and computational efficiency. To assess the feasibility of deploying PatternFusion in real-time and edge environments, a computational cost analysis was performed, focusing on three main metrics:
1. Inference time (ms): the time taken for the model to process a single sample.
2. Parameter count (million parameters): the total number of learnable parameters in the model.
3. Floating-point operations (FLOPs): the computational operations required for a single forward pass.
Experimental Setup:
The analysis was conducted on:
- Hardware configuration: NVIDIA Tesla V100 GPU, 32 GB RAM.
- Frameworks: PyTorch for BiLSTM and CNN, LightGBM for decision fusion.
- Batch size: 32 samples.
- Resolution: 224 × 224 for image-based modalities.
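Parameter count and per-sample inference time can be measured under this setup as sketched below; the stand-in model and input shapes are illustrative, not the full PatternFusion network, and FLOPs typically require an external profiler such as fvcore or ptflops.

```python
# Sketch: counting parameters and timing inference per sample in PyTorch.
import time
import torch

def count_parameters(model):
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

@torch.no_grad()
def average_inference_ms(model, batch, n_runs=50):
    model.eval()
    if torch.cuda.is_available():
        model, batch = model.cuda(), batch.cuda()
        torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(n_runs):
        model(batch)
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    elapsed = time.perf_counter() - start
    return 1000 * elapsed / (n_runs * batch.shape[0])     # milliseconds per sample

# Example with a stand-in CNN backbone and the 224 x 224 input resolution:
model = torch.nn.Sequential(torch.nn.Conv2d(3, 16, 3), torch.nn.AdaptiveAvgPool2d(1),
                            torch.nn.Flatten(), torch.nn.Linear(16, 2))
batch = torch.randn(32, 3, 224, 224)
print(count_parameters(model), f"{average_inference_ms(model, batch):.2f} ms/sample")
```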
Models Compared:
- PatternFusion (proposed)
- Informer
- Autoformer
- VGGFace2 + IrisCode
- CNN-LSTM Fusion
To evaluate deployment feasibility, PatternFusion was benchmarked for computational cost and inference performance. The model achieves a total parameter count of 28.4 million and an average inference time of 42 ms per sample, requiring only 98.7 GFLOPs for a complete forward pass. Compared with transformer-based ensembles such as Multi-Stream Transformers (60.3 M parameters, 72 ms inference) and Autoformer (50.1 M, 68 ms), PatternFusion demonstrates nearly 40–45% lower complexity and faster inference. This lightweight configuration enables deployment in real-time edge and biometric systems with minimal performance compromise. The low parameter footprint and computation demand also make the model highly scalable for healthcare-grade devices and latency-sensitive IoT platforms.
The results in Table 26 confirm that PatternFusion is computationally efficient compared with other state-of-the-art multimodal fusion techniques. With only 28.4 million parameters, it is considerably lighter than transformer-based models such as Multi-Stream Transformers (60.3 M) or Autoformer (50.1 M). This reduced parameter footprint enables faster inference, with an average time of 42 ms per sample, making it well suited for real-time, edge-based biometric verification. PatternFusion requires only 98.7 GFLOPs per forward pass, substantially fewer than its transformer-based counterparts. This reduction in computational cost, achieved without sacrificing accuracy, demonstrates its scalability for low-power and latency-sensitive environments.
The Fig. 20 provides a comparison of Parameter Count (Millions), Inference Time (Milliseconds), and FLOPs (Giga FLOPs) across six multimodal fusion models: VGGFace2 + IrisCode, CNN-LSTM Fusion, Multi-Stream Transformers, Informer, Autoformer, and PatternFusion. PatternFusion demonstrates the lowest parameter count (28.4M), the fastest inference time (42 ms), and the fewest FLOPs (98.7 GFLOPs), indicating its suitability for real-time and edge-based biometric verification while maintaining competitive accuracy.
Robustness to unseen or drifting distributions
In real-world biometric applications, distribution shifts due to environmental changes, sensor variations, or population diversity can significantly degrade model performance. To evaluate the robustness of PatternFusion to such unseen or drifting distributions, extensive experiments were performed under two primary scenarios (Table 27):
1. Out-of-distribution (OOD) testing: examines PatternFusion's performance on samples drawn from distributions significantly different from the training data.
2. Concept drift simulation: evaluates the model's ability to adapt as biometric patterns change over time or as data characteristics shift.
Out-of-distribution (OOD) testing
Datasets used
- Cross-Domain Biometrics (CDB) Dataset: contains biometric samples captured in different environments and with different devices.
- WildFaces Dataset: contains facial data acquired under extreme conditions, including illumination variations, occlusion, and motion blur.
Evaluation metrics
- Accuracy
- F1-score
- AUC (Area Under the Curve)
- EER (Equal Error Rate)
Concept drift simulation
To simulate concept drift, PatternFusion was evaluated over time on a portion of the BioCop data in which conditions were progressively modified during testing (simulation sketches follow the list below):
- Lighting shift: mild modifications of brightness and contrast to imitate outdoor-to-indoor changes.
- Device transition: intermittent switching between a mobile sensor and a high-resolution camera to imitate real-world deployment.
- Aging effect simulation: gradual warping and texture aging applied to simulate long-term shifts.
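The drift conditions above can be emulated with simple image transforms; the following sketch gives one possible implementation, where the drift magnitudes per time step are assumptions rather than the study's exact settings.

```python
# Illustrative sketch of the three drift simulations over discrete time steps.
import numpy as np
import cv2

def lighting_shift(img, step, n_steps=10):
    """Gradually change brightness/contrast over the simulated time steps."""
    alpha = 1.0 + 0.03 * step          # contrast drifts upward
    beta = 5.0 * step / n_steps        # brightness drifts upward
    return cv2.convertScaleAbs(img, alpha=alpha, beta=beta)

def device_transition(img, step):
    """Alternate between a 'mobile sensor' (lossy downscale) and the original resolution."""
    if step % 2 == 0:
        h, w = img.shape[:2]
        small = cv2.resize(img, (w // 2, h // 2))
        return cv2.resize(small, (w, h))
    return img

def aging_effect(img, step, n_steps=10):
    """Gradual warping plus mild blur as a proxy for long-term texture aging."""
    h, w = img.shape[:2]
    shift = int(2 * step / n_steps)
    M = np.float32([[1, 0.01 * step, shift], [0.01 * step, 1, shift]])
    warped = cv2.warpAffine(img, M, (w, h))
    return cv2.GaussianBlur(warped, (3, 3), 0)
```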
As demonstrated in Table 28, PatternFusion maintains strong performance in out-of-distribution (OOD) conditions and concept drift simulations. The model consistently achieves over 90% accuracy with minimal Equal Error Rate (EER) when tested on cross-domain biometrics and wild facial datasets, particularly when enhanced with domain-adaptive training. Concept drift simulations further confirm its resilience to environmental and sensor variations, highlighting its suitability for long-term biometric monitoring applications. This robustness stems from PatternFusion’s attention-driven fusion mechanism, which dynamically reweights temporal and spatial features to adapt to changing conditions without compromising performance.
Figure 21 shows the effect of concept drift on PatternFusion over ten time steps for three configurations: Baseline, Fine-Tuned, and Domain-Adaptive. The baseline model shows a progressive deterioration in accuracy as environmental conditions are modified and sensor variations are applied. Fine-tuning mitigates part of this degradation and yields more consistent performance. The domain-adaptive variant is the most robust, showing only a small decay in accuracy over time and indicating strong generalization under distribution shifts.
Error analysis and misclassification insights
To fully evaluate the robustness of PatternFusion, an error analysis was carried out on all evaluated datasets (PUT, PolyU, and BioCop). This section focuses on false acceptances (false positives), false rejections (false negatives), and the typical misclassification patterns behind them.
Confusion matrix analysis
Table 29 reports the numbers of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) for each dataset (PUT, PolyU, and BioCop). The distribution of correct and incorrect classifications shows that the model identifies biometric patterns accurately with minimal errors.
1. ROC curve analysis
To further confirm the model's ability to distinguish positive from negative samples, Receiver Operating Characteristic (ROC) curves were constructed for each dataset. The AUC (Area Under the Curve) values were consistently high, indicating strong discriminatory power:

- PUT dataset: 0.981
- PolyU dataset: 0.976
- BioCop dataset: 0.963

These high AUC scores suggest that the model's decision boundary is robust, with little overlap between genuine and impostor classes.
2. False Match Rate (FMR) and False Reject Rate (FRR) analysis
Table 30 summarizes the FMR and FRR for the PUT, PolyU, and BioCop datasets; a computation sketch from the confusion counts follows the points below.
- A low FMR ensures that unauthorized subjects are rarely accepted as genuine.
- A low FRR minimizes the chances of genuine users being rejected.
- PatternFusion maintains a balanced trade-off between the two, making it well suited for high-security applications.
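For completeness, FMR and FRR follow directly from the confusion counts in Table 29; the sketch below uses placeholder counts rather than the paper's values.

```python
# Sketch: FMR and FRR from confusion-matrix counts (placeholder numbers).
def fmr_frr(tp, tn, fp, fn):
    fmr = fp / (fp + tn)   # impostor attempts wrongly accepted
    frr = fn / (fn + tp)   # genuine attempts wrongly rejected
    return fmr, frr

fmr, frr = fmr_frr(tp=980, tn=975, fp=12, fn=15)
print(f"FMR = {100 * fmr:.2f}%, FRR = {100 * frr:.2f}%")
```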
3. Misclassified sample analysis
Visual inspection of misclassified samples was performed using interpretability techniques:

- Grad-CAM visualization: for the CNN components, Grad-CAM highlighted the regions of focus during misclassifications, revealing that errors mostly occurred when the biometric features were partially occluded or poorly illuminated.
- Saliency maps: for the BiLSTM components, saliency maps identified the key temporal segments that contributed to incorrect predictions (a minimal sketch is given below); misclassifications were often linked to irregular spikes or noise in the biometric time series.
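A gradient-based saliency map over time steps can be obtained as sketched below; model and the input sample are placeholders for the trained sequence model and a misclassified example.

```python
# Sketch: input-gradient saliency over time steps for a sequence classifier.
import torch

def saliency_over_time(model, x, target_class):
    """x: (1, seq_len, features). Returns one saliency score per time step."""
    x = x.clone().requires_grad_(True)
    model.eval()
    score = model(x)[0, target_class]
    score.backward()
    # Aggregate gradient magnitude over the feature dimension.
    return x.grad.abs().sum(dim=-1).squeeze(0)

# Usage (with any torch model mapping (1, T, F) -> (1, num_classes)):
# sal = saliency_over_time(model, sample, target_class=1)
# top_steps = sal.topk(5).indices   # time steps most responsible for the prediction
```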
Figure 22 illustrates the interpretability analysis of misclassified samples in PatternFusion. The Grad-CAM Heatmap shows which regions the CNN focused on during classification, helping to identify potential feature extraction errors. The Saliency Map highlights the most critical time steps for BiLSTM’s decision-making, while the Misclassified Sample Analysis visualizes the signal patterns that contributed to incorrect predictions. Together, these visualizations help understand model limitations and guide improvements.
SHAP (SHapley additive exPlanations) explanation
SHAP (SHapley Additive exPlanations) is an explainability tool that interprets the predictions of machine learning models by attributing the contribution of each feature to the final prediction. It is based on concepts from game theory, specifically the Shapley Value, which fairly distributes the “payout” (model prediction) among all features based on their contribution.
Why SHAP is important for PatternFusion
In PatternFusion, multiple modalities (e.g., face, fingerprint, iris) are fused using a BiLSTM-CNN-LightGBM hybrid architecture. While the model achieves high accuracy, understanding why it makes certain decisions is crucial for:
1. Model transparency: SHAP explains how each modality (e.g., face, fingerprint) impacts the final prediction.
2. Error analysis: identifies which features contribute to false acceptances (FAR) and false rejections (FRR).
3. Feature importance ranking: highlights which biometric features (e.g., face patterns, iris texture) are most influential in classification.
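A minimal sketch of applying SHAP to the LightGBM fusion stage is shown below; the synthetic features, labels, and the three feature names are illustrative stand-ins for the fused modality representations.

```python
# Sketch: SHAP feature attributions for a LightGBM classifier (synthetic stand-in data).
import numpy as np
import lightgbm as lgb
import shap

rng = np.random.default_rng(0)
X_fused = rng.normal(size=(200, 3))                           # stand-in fused modality features
y = (X_fused[:, 0] + 0.5 * X_fused[:, 1] > 0).astype(int)     # synthetic genuine/impostor labels

lgbm_model = lgb.LGBMClassifier(max_depth=15, n_estimators=100).fit(X_fused, y)

explainer = shap.TreeExplainer(lgbm_model)                    # exact Shapley values for tree ensembles
shap_values = explainer.shap_values(X_fused)

# Rank features by mean absolute contribution (cf. the modality ranking in Fig. 23).
shap.summary_plot(shap_values, X_fused,
                  feature_names=["face_embedding", "iris_pattern", "fingerprint_texture"],
                  plot_type="bar")
```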
Figure 23 presents the relative importance of the different biometric modalities in the PatternFusion model. The face embedding feature makes the largest contribution to the final prediction, followed by the iris pattern and fingerprint texture. This interpretability analysis identifies the characteristics most important for correct biometric verification, improving transparency and understanding of the model's decision-making.
Ethical considerations
The deployment of biometric systems like PatternFusion raises critical ethical concerns regarding privacy, bias, and fairness that must be addressed to ensure responsible implementation. These systems process highly sensitive personal data, necessitating robust safeguards including encryption, differential privacy, and federated learning to prevent breaches. Additionally, demographic biases in recognition accuracy - particularly for underrepresented groups - pose significant challenges, potentially leading to unequal error rates across different populations. PatternFusion’s adaptive attention mechanism helps mitigate some biases by dynamically weighting features, though comprehensive testing across diverse demographic datasets remains essential. Ensuring fairness requires incorporating bias auditing during training and employing explainability tools like SHAP to enhance transparency. For real-world applications in sensitive sectors like finance or healthcare, strict governance around data handling, user consent, and auditability is crucial to maintain compliance with regulations like GDPR while building public trust. Ultimately, addressing these ethical dimensions is not just regulatory compliance, but fundamental to developing biometric systems that are both technologically advanced and socially responsible.
While PatternFusion demonstrates consistent performance improvements across diverse biometric and temporal datasets, its advantages are most evident in scenarios that exhibit multi-scale temporal dependencies and moderate sequence lengths. The framework may be less optimal for very long sequence tasks (e.g., pure time-series forecasting where transformer-based architectures such as Informer or Autoformer excel) or extreme edge environments with strict computational or memory constraints. Moreover, although the model includes dynamic attention and confidence calibration, its hybrid structure introduces additional parameters compared to lightweight single-stream models. Future work will explore pruning, quantization, and knowledge distillation strategies to enhance deployment efficiency. Overall, the reported gains represent state-of-the-art results on the tested datasets and experimental settings, rather than a universal dominance across all time-series domains.
Conclusion
In this paper, we presented PatternFusion, a new hybrid ensemble framework that improves time-series pattern recognition by combining the strengths of BiLSTM for temporal modeling, CNN for spatial feature extraction, and LightGBM for statistical interpretability. The architecture's attention-driven fusion mechanism dynamically optimizes the contribution of each model, enabling fine-grained, multi-scale temporal detection across diverse datasets. Experiments showed PatternFusion's superiority, with state-of-the-art F1-scores, AUC-ROC, and MCC on standard datasets such as NAB, MIT-BIH, and synthetic composite streams, comfortably outperforming conventional models such as LSTM and CNN. Moreover, real-world validation on cross-domain datasets (BioCop, WVU, BIOMDATA) confirmed its robustness and flexibility in heterogeneous contexts. The model's interpretability is strengthened by visualization of attention weights, t-SNE projections, and confidence-based detection maps, supporting its suitability for critical applications such as healthcare, finance, and industrial monitoring where high reliability is required. Future work will focus on incorporating transformer-based attention mechanisms for deeper temporal context learning, deployment in edge-based streaming environments, and automatic pattern label generation to further extend the framework's capabilities. The scalability, adaptive fusion strategy, and interpretability of PatternFusion make it a compelling solution for real-time, time-critical time-series analytics, ready for both research and practice.
Data availability
Data are available on request from the corresponding author (Wided Bouchelligua) at: wabouchelligua@imamu.edu.sa.
References
Aksan, E., Hilliges, O. & Pece, F. Stochastic hidden Markov models for multimodal temporal pattern recognition. Pattern Recogn. 95, 273–286 (2019).
Bagnall, A., Lines, J., Bostrom, A., Large, J. & Keogh, E. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min. Knowl. Disc. 31(3), 606–660 (2017).
Bai, S., Kolter, J. Z. & Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. ArXiv Preprint arXiv:1803.01271 (2018).
Box, G. E. & Jenkins, G. M. Time Series Analysis: Forecasting and Control (Holden-Day, 1970).
Breiman, L. Bagging predictors. Mach. Learn. 24(2), 123–140 (1996).
Chakraborty, D., Narayanan, V. & Ghosh, A. Interpretable machine learning for time series forecasting. SN Comput. Sci. 2(3), 1–15 (2021).
Chen, T. & Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, 785–794 (2016).
Cheng, D. et al. Spatio-temporal attention-based neural network for credit card fraud detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34, No. 01, 362–369 (2020).
Cho, K. et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1724–1734 (2014).
Livera, A. M. D., Hyndman, R. J. & Snyder, R. D. Forecasting time series with complex seasonal patterns using exponential smoothing. J. Am. Stat. Assoc. 106(496), 1513–1527 (2011).
Dempster, A., Schmidt, D. F. & Webb, G. I. Rocket: exceptionally fast and accurate time series classification using random convolutional kernels. Data Min. Knowl. Disc. 34(5), 1454–1495 (2020).
Deng, H., Runger, G., Tuv, E. & Vladimir, M. A time series forest for classification and feature extraction. In Information Sciences. Vol. 239, 142–153 (Elsevier, 2013).
Doucet, A. & Johansen, A. M. A tutorial on particle filtering and smoothing: fifteen years later. Handb. Nonlinear Filter. 12(656–704), 3 (2009).
Du, S., Li, T., Yang, Y. & Horng, S. J. Multivariate time series forecasting via attention-based encoder–decoder framework. Neurocomputing 388, 269–279 (2020).
Ismail Fawaz, H., Forestier, G., Weber, J., Idoumghar, L. & Muller, P. A. Deep learning for time series classification: a review. Data Min. Knowl. Disc. 33(4), 917–963 (2019).
Franceschi, J-Y., Dieuleveut, A. & Jaggi, M. Unsupervised scalable representation learning for multivariate time series. In Advances in Neural Information Processing Systems, 4650–4661 (2019).
Freund, Y. & Schapire, R. E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997).
Gers, F. A., Schmidhuber, J. & Cummins, F. Learning to forget: continual prediction with Lstm. Neural Comput. 12(10), 2451–2471 (2000).
Graves, A. & Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18(5–6), 602–610 (2005).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997).
Holder, C. J. & Guerin, F. Multiple-view time series classification. IEEE Trans. Knowl. Data Eng. 34(2), 954–967 (2020).
Huang, X., Zhu, Z., Cheng, Y., Shao, J. & Zhu, X. A novel hybrid ensemble learning paradigm for nuclear energy consumption forecasting. Appl. Energy. 250, 1399–1409 (2019).
Huangfu, Y. et al. Semi-supervised identification of load profiles and patterns from smart meter data. Appl. Energy. 266, 114825 (2020).
Hyndman, R. J. A tidy approach to forecasting the Nba. Am. Stat. 1–17 (2021).
Hyndman, R. J. & Khandakar, Y. Automatic time series forecasting: the forecast package for r. J. Stat. Softw. 26(3), 1–22 (2008).
Fawaz, H. I., Forestier, G., Weber, J., Idoumghar, L. & Muller, P-A. Deep learning methods for univariate time series forecasting. arXiv preprint arXiv:1911.13288 (2019).
Karim, F., Majumdar, S., Darabi, H. & Harford, S. Multivariate lstm-fcns for time series classification. Neural Netw. 116, 237–245 (2019).
Ke, G. et al. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems, Vol. 30 (2017).
Laine, S. & Aila, T. Temporal ensembling for semi-supervised learning. In International Conference on Learning Representations (ICLR), (2017).
Laptev, N., Yosinski, J., Li, L. E. & Smyl, S. Time-series extreme event forecasting with neural networks at uber. In International Conference on Machine Learning, Vol. 34, 1–5 (2017).
Lavin, A. & Ahmad, S. Evaluating real-time anomaly detection algorithms–the numenta anomaly benchmark. In 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA), 38–44. (IEEE, 2015).
Lei, Q., Yi, J., Vaculin, R., Wu, L. & Dhillon, I. S. Pattern-adaption attention network for time series classification. Knowl. Based Syst. 190, 105443 (2020).
Li, Z., Zhang, X., Ju, H. & Chen, B. Hybrid models combining Lstm and Svr for multi-step-ahead prediction of surface water quality in rivers. Water 12(4), 1110 (2020).
Lim, B. & Zohren, S. Time series forecasting with deep learning: a survey. Philosophical Trans. Royal Soc. A. 379(2194), 20200209 (2021).
Lines, J. & Bagnall, A. Time series classification with ensembles of elastic distance measures. Data Min. Knowl. Disc. 29(3), 565–592 (2015).
Liu, M., Zhang, Z., Huang, G., Chen, X. & Wang, Z. Time series is a special sequence: forecasting with sample convolution and interaction. Adv. Neural. Inf. Process. Syst. 34, 2978–2991 (2021).
Ma, M., Stankovic, J. A. & Bartocci, E. Detecting anomalies in time series data with temporal logic. In 2016 IEEE international conference on big data (Big Data), 2086–2095 (IEEE, 2016).
Ma, Q., Zheng, J., Li, S. & Cottrell, G. W. Learning interpretable deep state space model for probabilistic time series forecasting. ArXiv Preprint arXiv:2102.12424 (2021).
Malhotra, P., Vig, L., Shroff, G. & Agarwal, P. Long short term memory networks for anomaly detection in time series. In Proceedings of the European Symposium on Artificial Neural Networks, Vol. 23, 89 (2015).
Qin, Y. et al. A dual-stage attention-based recurrent neural network for time series prediction. In Proceedings of the 26th International Joint Conference on Artificial Intelligence, 2627–2633 (2017).
Rabiner, L. R. & Juang, B-H. An introduction to hidden Markov models. IEEE ASSP Magazine. 3(1), 4–16 (1986).
Ruiz-Rizzo, A. L. et al. Great expectations: different types of temporal expectations modulate neural processing in the visual system. Eneuro. 8(6) (2021).
Scott, S. L. & Varian, H. R. Predicting the present with bayesian structural time series. Int. J. Math. Modelling Numer. Optimisation. 5(1–2), 4–23 (2014).
Van Den Oord, A. et al. Wavenet: A generative model for raw audio. ArXiv Preprint arXiv:1609.03499 (2016).
Vaswani, A. et al. Attention is all you need. Adv. Neural. Inf. Process. Syst., 30, (2017).
Wang, Y. et al. Sst: Spatial-spectral transformer for hyperspectral image classification. Remote Sens. 13(19), 3890 (2021).
Wang, Z., Yan, W. & Oates, T. Time series classification from scratch with deep neural networks: A strong baseline. In 2017 International Joint Conference on Neural Networks (IJCNN), 1578–1585 (IEEE, 2017).
Wen, Q. et al. Robuststl: A robust seasonal-trend decomposition algorithm for long time series. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, 5409–5416 (2019).
Wu, H., Xu, J., Wang, J. & Long, M. Autoformer: Decomposition Transformers with auto-correlation for long-term series forecasting. Adv. Neural. Inf. Process. Syst. 34, 22419–22430 (2021).
Yang, J., Nguyen, M. N., San, P. P., Li, X. & Krishnaswamy, S. Deep convolutional neural networks on multichannel time series for human activity recognition. In Twenty-Fourth International Joint Conference on Artificial Intelligence, (2015).
Zhang, J., Chen, F. & Lyu, F. An adaptive ensemble model for semi-supervised time series classification. Knowl. Based Syst. 141, 57–66 (2018).
Zhang, X., Gao, Y., Lin, J. & Lu, C-T. Tapnet: Multivariate time series classification with attentional prototypical network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34, 6845–6852 (2020).
Zheng, Y., Liu, Q., Chen, E., Ge, Y. & Zhao, J. L. Gated recurrent units for time series forecasting. In International Conference on Neural Information Processing, 335–346 (Springer, 2019).
Zhou, H. et al. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35, No. 12, 11106–11115 (2021).
Zhu, L. & Laptev, N. Online anomaly detection with concept drift adaptation using recurrent neural networks. ACM Trans. Manage. Inform. Syst. (TMIS). 12(1), 1–22 (2021).
Zhu, W., Huang, J., Ye, L. & Lai, X. Deep learning-based physiological signals classification to detect driver sleepiness using neural networks and transfer learning. J. Ambient Intell. Humaniz. Comput. 10(9), 3341–3355 (2019).
Ren, J., Yu, Z. & Gao, G. A CNN–LSTM–LightGBM based short-term wind power prediction method based on attention mechanism. Energy Rep. 8, 437–443 (2022).
Gao, Q. Multi-temporal scale wind power forecasting based on Lasso-CNN-LSTM-LightGBM. EAI Endors. Trans. Energy Web. 11, 5792. https://doi.org/10.4108/ew.5792 (2024).
Shi, Z., Hu, Y., Mo, G. & Wu, J. Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction, arXiv preprint arXiv:2204.02623 (Code: GitHub repo) (2022).
Funding
This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2501).
Author information
Authors and Affiliations
Contributions
Conceptualization: Wided Bouchelligua. Methodology: Wided Bouchelligua. Software: Wided Bouchelligua. Formal analysis: Wided Bouchelligua. Resources: Wided Bouchelligua. Writing—review and editing: Wided Bouchelligua. Funding acquisition: Wided Bouchelligua.
Corresponding author
Ethics declarations
Consent for publication
Not Applicable: Personal information is not used in this study, and no one’s privacy will be violated by the publication of this document.
Employment
The author affirms that they are not now employed by any company that stands to benefit or suffer financially by the publishing of this work.
Research involving human participants and/or animals
Historical datasets are used in this investigation. Since neither humans nor animals were used in the data collection or analysis for this study, ethical approval was not necessary.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Bouchelligua, W. PatternFusion: a hybrid model for pattern recognition in time-series data using ensemble learning. Sci Rep 15, 43371 (2025). https://doi.org/10.1038/s41598-025-28649-4
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-28649-4