Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Deep residual networks with convolutional feature extraction for short-term load forecasting
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 27 January 2026

Deep residual networks with convolutional feature extraction for short-term load forecasting

  • Junchen Liu1,
  • Faisul Arif Ahmad1,
  • Khairulmizam Samsudin1,
  • Fazirulhisyam Hashim1 &
  • …
  • Mohd Zainal Abidin Ab Kadir2 

Scientific Reports , Article number:  (2026) Cite this article

  • 366 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

Conventional deep learning models struggle with balancing feature extraction and long-term temporal representation in Short-Term Load Forecasting (STLF). This study proposes a Convolutional Neural Network–Embedded Deep Residual Network (CNN-Embedded DRN) designed to enhance early-stage feature extraction and generalization capability across diverse climatic conditions. The objectives of this study are to integrate Convolutional Neural Network (CNN)-based local feature extraction into the DRN framework for capturing fine-grained temporal and spatial load patterns, to employ residual learning for mitigating gradient degradation and improving network stability, to evaluate the model’s predictive performance against baseline and ablation models across two datasets representing temperate (ISO-NE) and tropical (Malaysia) climates, and to validate its statistical significance and seasonal robustness through bootstrap analysis and multi-seasonal evaluation. The results demonstrate that the proposed CNN-Embedded DRN consistently outperforms all comparative models, achieving the lowest Mean Absolute Percentage Error (MAPE) values of 1.5303% and 5.0566% on the ISO-NE and Malaysia datasets, respectively. The inclusion of residual network (ResNet) and CNN-Embedded ResNet as ablation experiments confirms that CNN-based local feature extraction effectively complements residual learning, while bootstrap analysis verifies that the observed improvements are statistically significant. The proposed model provides a reliable and generalizable framework for STLF, offering improved accuracy, robustness, and adaptability under varying climatic and demand conditions. Future research will focus on extending this framework toward multi-regional and multi-scale forecasting, incorporating attention mechanisms for enhanced long-term dependency modeling, and exploring adaptive hybrid residual architectures for real-time energy management applications.

Similar content being viewed by others

Forecasting large-scale circulation regimes using deformable convolutional neural networks and global spatiotemporal climate data

Article Open access 19 May 2022

Anthropogenic fingerprints in daily precipitation revealed by deep learning

Article Open access 30 August 2023

Seasonal forecasting of the hourly electricity demand applying machine and deep learning algorithms impact analysis of different factors

Article Open access 18 March 2025

Data availability

The datasets generated and/or analysed during the current study are not publicly available due to licensing and institutional restrictions, but are available from the corresponding author upon reasonable request.

Abbreviations

1D CNN:

One-dimensional convolutional neural network

Adam:

Adaptive moment estimation

ANN:

Artificial neural network

BiGRU:

Bidirectional gated recurrent unit

BiLSTM:

Bidirectional long short-term memory

CNN:

Convolutional neural network

Conv1D:

One-dimensional convolutional layer

CRN:

Convolutional residual network

DNN:

Deep neural network

DRN:

Deep residual network

ELM:

Extreme learning machines

FC:

Fully connected

GAP:

Global average pooling

GAP1D:

One-dimensional global average pooling

GRU:

Gated recurrent unit

ISO-NE:

New England independent system operator

LF:

Load forecasting

LSTM:

Long short-term memory

MAE:

Mean absolute error

MAPE:

Mean absolute percentage error

MSE:

Mean square error

MW:

Megawatt

MTLF:

Medium-term load forecasting

NMSE:

Normalized mean square error

N-HiTS:

Neural Hierarchical interpolation for time series forecasting

Pooling1D:

One-dimensional pooling layer

R:

Correlation coefficient

R2 :

Coefficient of determination

RBF:

Radial basis function

ReLU:

Rectified linear unit

ResNet:

Residual network

ResNetPlus:

Modified ResNet structure

RNN:

Recurrent neural network

SELU:

Scaled exponential linear unit

STLF:

Short-term load forecasting

SVR:

Support vector regression

VSTLF:

Very short-term load forecasting

References

  1. Ahmad F A, Liu J, Hashim F & Samsudin K. Short-term load forecasting utilizing a combination model: a brief review. Int. J. Technol. 15, 121–129 (2024). https://doi.org/10.14716/ijtech.v15i1.5543

  2. Liu, J., Ahmad, F. A., Samsudin, K., Hashim, F. & Ab Kadir, M. Z. A. Performance evaluation of activation functions in deep residual networks for short-term load forecasting. IEEE Access 13, 78618–78633. https://doi.org/10.1109/ACCESS.2025.3565798 (2025).

    Google Scholar 

  3. Koponen, P., Ikäheimo, J., Koskela, J., Brester, C. & Niska, H. Assessing and comparing short term load forecasting performance. Energies 13, 2054. https://doi.org/10.3390/en13082054 (2020).

    Google Scholar 

  4. Ceperic, E., Ceperic, V. & Baric, A. A strategy for short-term load forecasting by support vector regression machines. IEEE Trans. Power Syst. 28, 4356–4364. https://doi.org/10.1109/TPWRS.2013.2269803 (2013).

    Google Scholar 

  5. Hippert, H. S., Pedreira, C. E. & Souza, R. C. Neural networks for short-term load forecasting: A review and evaluation. IEEE Trans. Power Syst. 16, 44–55. https://doi.org/10.1109/59.910780 (2001).

    Google Scholar 

  6. Kuster, C., Rezgui, Y. & Mourshed, M. Electrical load forecasting models: A critical systematic review. Sustain. Cities Soc. 35, 257–270. https://doi.org/10.1016/j.scs.2017.08.009 (2017).

    Google Scholar 

  7. Cecati, C., Kolbusz, J., Różycki, P., Siano, P. & Wilamowski, B. M. A novel RBF training algorithm for short-term electric load forecasting and comparative studies. IEEE Trans. Ind. Electron. 62, 6519–6529. https://doi.org/10.1109/TIE.2015.2424399 (2015).

    Google Scholar 

  8. Chen, Y. et al. Short-term load forecasting: similar day-based wavelet neural networks. IEEE Trans. Power Syst. 25, 322–330. https://doi.org/10.1109/TPWRS.2009.2030426 (2009).

    Google Scholar 

  9. Zhao Y, Luh P B, Bomgardner C & Beerel G H. Short-term load forecasting: multi-level wavelet neural networks with holiday corrections. Proc. IEEE Power Energy Soc. Gen. Meet. 1–7 (2009). https://doi.org/10.1109/PES.2009.5275304

  10. Eren, Y. & Küçükdemiral, İ. A comprehensive review on deep learning approaches for short-term load forecasting. Renew. Sustain. Energy Rev. 189, 114031. https://doi.org/10.1016/j.rser.2023.114031 (2024).

    Google Scholar 

  11. Li L, Ota K & Dong M. Everything is image: CNN-based short-term electrical load forecasting for smart grid. Proc. 14th Int. Symp. Pervasive Syst. Algorithms Netw. 344–351 (2017). https://doi.org/10.1109/ISPAN-FCST-ISCC.2017.78

  12. Jurado, M., Samper, M. & Rosés, R. An improved encoder-decoder-based CNN model for probabilistic short-term load and PV forecasting. Electr. Power Syst. Res. 217, 109153. https://doi.org/10.1016/j.epsr.2023.109153 (2023).

    Google Scholar 

  13. Narayan A & Hipel K W. Long short term memory networks for short-term electric load forecasting. Proc. IEEE Int. Conf. Syst. Man Cybern. 2573–2578 (2017). https://doi.org/10.1109/SMC.2017.8123012

  14. Bento, P., Pombo, J., Mariano, S. & Calado, M. R. Short-term load forecasting using optimized LSTM networks via improved bat algorithm. Proc. Int. Conf. Intell. Syst. https://doi.org/10.1109/IS.2018.8710498 (2018).

    Google Scholar 

  15. Kwon, B. S., Park, R. J. & Song, K. B. Short-term load forecasting based on deep neural networks using LSTM layer. J. Electr. Eng. Technol. 15, 1501–1509. https://doi.org/10.1007/s42835-020-00424-7 (2020).

    Google Scholar 

  16. Tang, X., Dai, Y., Liu, Q., Dang, X. & Xu, J. Application of bidirectional recurrent neural network combined with deep belief network in short-term load forecasting. IEEE Access 7, 160660–160670. https://doi.org/10.1109/ACCESS.2019.2950957 (2019).

    Google Scholar 

  17. Ran, P., Dong, K., Liu, X. & Wang, J. Short-term load forecasting based on CEEMDAN and transformer. Electr. Power Syst. Res. 214, 108885. https://doi.org/10.1016/j.epsr.2022.108885 (2023).

    Google Scholar 

  18. Jiang B, Liu Y, Geng H, Zeng H & Ding J. A transformer based method with wide attention range for enhanced short-term load forecasting. Proc. 4th Int. Conf. Smart Power Internet Energy Syst. 1684–1690 (2022).

  19. Li, S., Zhang, W. & Wang, P. TS2ARCformer: A multi-dimensional time series forecasting framework for short-term load prediction. Energies 16, 5825. https://doi.org/10.3390/en16155825 (2023).

    Google Scholar 

  20. Chen, K. et al. Short-term load forecasting with deep residual networks. IEEE Trans. Smart Grid 10, 3943–3952. https://doi.org/10.1109/TSG.2018.2844307 (2018).

    Google Scholar 

  21. Tian, Y., Yu, S., Wen, M., Zhang, K. & Chen, Y. Short-term load forecasting scheme based on improved deep residual network and LSTM. Proc. CIRED Berlin Workshop CIRED 2020, 117–120. https://doi.org/10.1049/oap-cired.2021.0257 (2020).

    Google Scholar 

  22. Li, H., Zhang, P. & Li, C. Short-term load forecasting for distribution substations based on residual neutral networks and long short-term memory neutral networks with attention mechanism. J. Phys. Conf. Ser. 2030, 012087. https://doi.org/10.1088/1742-6596/2030/1/012087 (2021).

    Google Scholar 

  23. Sheng, Z., Wang, H., Chen, G., Zhou, B. & Sun, J. Convolutional residual network to short-term load forecasting. Appl. Intell. 51, 2485–2499. https://doi.org/10.1007/s10489-020-01932-9 (2021).

    Google Scholar 

  24. Sheng, Z., An, Z., Wang, H., Chen, G. & Tian, K. Residual LSTM based short-term load forecasting. Appl. Soft Comput. 144, 110461. https://doi.org/10.1016/j.asoc.2023.110461 (2023).

    Google Scholar 

  25. Ding, A., Liu, T. & Zou, X. Integration of ensemble GoogLeNet and modified deep residual networks for short-term load forecasting. Electronics 10, 2455. https://doi.org/10.3390/electronics10202455 (2021).

    Google Scholar 

  26. Ullah, K. et al. Short-term load forecasting: A comprehensive review and simulation study with CNN-LSTM hybrids approach. IEEE Access https://doi.org/10.1109/ACCESS.2024.3440631 (2024).

    Google Scholar 

  27. Hua, Q. et al. A short-term power load forecasting method using CNN-GRU with an attention mechanism. Energies 18, 106. https://doi.org/10.3390/en18010106 (2024).

    Google Scholar 

  28. He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. https://doi.org/10.48550/arXiv.1512.03385 (2016).

    Google Scholar 

  29. Challu, C. et al. N-HiTS: Neural hierarchical interpolation for time series forecasting. Proc. AAAI Conf. Artif. Intell. 37, 6989–6997. https://doi.org/10.1609/aaai.v37i6.25854 (2023).

    Google Scholar 

  30. Zhang, Q., Li, C., Su, F. & Li, Y. Spatiotemporal residual graph attention network for traffic flow forecasting. IEEE Internet Things J. 10, 11518–11532. https://doi.org/10.1109/JIOT.2023.3243122 (2023).

    Google Scholar 

  31. Bao, Y. X., Cao, Y., Shen, Q. Q. & Shi, Q. Global–local spatial–temporal residual correlation network for urban traffic status prediction. Comput. Intell. Neurosci. 2022, 7344522. https://doi.org/10.1155/2022/7344522 (2022).

    Google Scholar 

  32. Ashebir, S. & Kim, S. Energy demand forecasting using temporal variational residual network. Forecasting 7(3), 42. https://doi.org/10.3390/forecast7030042 (2025).

    Google Scholar 

  33. Zhang, J., Chen, F., Cui, Z., Guo, Y. & Zhu, Y. Deep learning architecture for short-term passenger flow forecasting in urban rail transit. IEEE Trans. Intell. Transp. Syst. 22(11), 7004–7014. https://doi.org/10.1109/TITS.2020.3000761 (2020).

    Google Scholar 

  34. Yamashita, R., Nishio, M., Do, R. K. G. & Togashi, K. Convolutional neural networks: An overview and application in radiology. Insights Imaging 9, 611–629. https://doi.org/10.1007/s13244-018-0639-9 (2018).

    Google Scholar 

  35. Kiranyaz, S. et al. 1D convolutional neural networks and applications: A survey. Mech. Syst. Signal Process. 151, 107398. https://doi.org/10.1016/j.ymssp.2020.107398 (2021).

    Google Scholar 

  36. Shi, H., Xu, M. & Li, R. Deep learning for household load forecasting—a novel pooling deep RNN. IEEE Trans. Smart Grid 9, 5271–5280. https://doi.org/10.1109/TSG.2017.2686012 (2017).

    Google Scholar 

  37. Huang G et al. Snapshot ensembles: train 1, get m for free. Preprint at https://arxiv.org/abs/1704.00109 (2017). https://doi.org/10.48550/arXiv.1704.00109

  38. Khoshkangini, R., Tajgardan, M., Lundström, J., Rabbani, M. & Tegnered, D. A snapshot-stacked ensemble and optimization approach for vehicle breakdown prediction. Sensors 23, 5621. https://doi.org/10.3390/s23125621 (2023).

    Google Scholar 

  39. Kingma D P & Ba J. Adam: a method for stochastic optimization. Preprint at https://arxiv.org/abs/1412.6980 (2014). https://doi.org/10.48550/arXiv.1412.6980

  40. Johnston, M. G. & Faulkner, C. A bootstrap approach is a superior statistical method for the comparison of non-normal data with differing variances. New Phytol. 230, 23–26. https://doi.org/10.1111/nph.17159 (2021).

    Google Scholar 

Download references

Funding

The authors declare that no external funding was received for this study.

Author information

Authors and Affiliations

  1. Department of Computer and Communication Systems Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400, Serdang, Selangor, Malaysia

    Junchen Liu, Faisul Arif Ahmad, Khairulmizam Samsudin & Fazirulhisyam Hashim

  2. Advanced Lightning, Power and Energy Research Centre (ALPER), Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400, Serdang, Selangor, Malaysia

    Mohd Zainal Abidin Ab Kadir

Authors
  1. Junchen Liu
    View author publications

    Search author on:PubMed Google Scholar

  2. Faisul Arif Ahmad
    View author publications

    Search author on:PubMed Google Scholar

  3. Khairulmizam Samsudin
    View author publications

    Search author on:PubMed Google Scholar

  4. Fazirulhisyam Hashim
    View author publications

    Search author on:PubMed Google Scholar

  5. Mohd Zainal Abidin Ab Kadir
    View author publications

    Search author on:PubMed Google Scholar

Contributions

J.L. and F.A.A. contributed to the conceptualization of the study. J.L. developed the methodology and conducted the investigation with F.A.A. J.L. prepared the original draft. J.L., F.A.A., K.S., F.H., and M.Z.A.A.K. contributed to the review and editing of the manuscript. F.A.A., K.S., F.H., and M.Z.A.A.K. provided supervision. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Faisul Arif Ahmad.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

1. ISO-NE dataset:

https://www.iso-ne.com/isoexpress/web/reports/load-and-demand

2 Malaysia dataset:

https://www.gso.org.my/SystemData/SystemDemand.aspx

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Ahmad, F.A., Samsudin, K. et al. Deep residual networks with convolutional feature extraction for short-term load forecasting. Sci Rep (2026). https://doi.org/10.1038/s41598-026-35410-y

Download citation

  • Received: 04 July 2025

  • Accepted: 06 January 2026

  • Published: 27 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-35410-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • CNN
  • DRN
  • DNN
  • STLF
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics