Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Primacy of feature engineering over architectural complexity for intermittent demand forecasting
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 06 January 2026

Primacy of feature engineering over architectural complexity for intermittent demand forecasting

  • B. Sendhil Nathan1,2,
  • P. M. Aravinth1,
  • B. Veera Siva Reddy1,
  • C. Chandrasekhara Sastry1,
  • Sachin Salunkhe3 &
  • …
  • Robert Cep4 

Scientific Reports , Article number:  (2026) Cite this article

  • 661 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

Intermittent demand forecasting remains a fundamental challenge in large-scale supply chains due to extreme demand sparsity, irregular occurrence patterns, and highly variable demand magnitudes. While recent studies have increasingly adopted complex multi-stage model architectures to address these challenges, the role of statistically grounded feature engineering has received comparatively less attention. This study proposes the Smoothed Hybrid Occurrence-Size (SHOS) framework, which generates adaptive, series-specific estimates of demand occurrence probability and conditional demand size using sparsity-aware exponential smoothing. These estimates are incorporated as features into supervised machine learning models trained on large-scale, zero-padded panel data. The proposed approach is evaluated on an automotive aftermarket dataset comprising approximately 1.4 million monthly observations across 56,000 spare-part time series, using an 11-fold rolling-window cross-validation protocol. Empirical results demonstrate that SHOS-enhanced models achieve substantial performance improvements over baseline feature sets, reducing mean absolute error (MAE) by approximately 50% and weighted mean absolute percentage error (WMAPE) by over 40% in highly intermittent demand segments. Notably, despite their increased architectural complexity, two-stage hurdle-based models do not outperform the proposed single-stage SHOS-enhanced framework. Formal statistical testing using the Wilcoxon signed-rank test confirms that the performance advantage of the single-stage SHOS model is consistent and statistically significant across all validation folds (p < 0.001). These findings reveal an unexpected but practically important insight: robust, statistically informed feature engineering can be more effective than increased model complexity for intermittent demand forecasting. The results highlight the value of simple, interpretable, and computationally efficient forecasting frameworks for large-scale operational deployment, while motivating future validation across additional application domains.

Similar content being viewed by others

Research on bearing fault diagnosis based on machine learning and SHAP interpretability analysis

Article Open access 21 November 2025

Explainable dual LSTM-autoencoders with exogenous features for anomaly detection and supply chain forecasting

Article Open access 27 November 2025

Advancing real-time validation of automotive software systems via continuous integration and intelligent failure analysis

Article Open access 25 September 2025

Data availability

All the data and material used in this study is available in the manuscript, and further details if required, the corresponding author will provide the same, through proper requisition.

References

  1. Nikolopoulos, K. We need to talk about intermittent demand forecasting. Eur. J. Oper. Res. 291, 549–559. https://doi.org/10.1016/j.ejor.2019.12.046 (2021).

    Google Scholar 

  2. Kourentzes, N. & Athanasopoulos, G. Elucidate structure in intermittent demand series. Eur. J. Oper. Res. 288, 141–152. https://doi.org/10.1016/j.ejor.2020.05.046 (2021).

    Google Scholar 

  3. Pinçe, Ç., Turrini, L. & Meissner, J. Intermittent demand forecasting for spare parts: A critical review. Omega (United Kingdom). 105, 102513. https://doi.org/10.1016/j.omega.2021.102513 (2021).

    Google Scholar 

  4. J. D. Croston. Forecasting and stock control for intermittent demands. Oper. Res. Q. 23, 289–303 (1972).

  5. Syntetos, A. A. & Boylan, J. E. The accuracy of intermittent demand estimates. Int. J. Forecast. 21, 303–314. https://doi.org/10.1016/j.ijforecast.2004.10.001 (2005).

    Google Scholar 

  6. Teunter, R. H., Syntetos, A. A. & Babai, M. Z. Intermittent demand: Linking forecasting to inventory obsolescence. Eur. J. Oper. Res. 214 (3), 606–615. https://doi.org/10.1016/j.ejor.2011.05.018 (2011).

    Google Scholar 

  7. Ke, G. et al. T.-Y. Liu. LightGBM: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems. Vol. 30. 3146–3154 (Curran Associates, Inc., 2017).

  8. Chen, T. C. Guestrin. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 785–794. https://doi.org/10.1145/2939672.2939785 (ACM, 2016).

  9. Babai, M. Z., Arampatzis, M., Hasni, M., Lolli, F. & Tsadiras, A. On the use of machine learning in supply chain management: A systematic review. IMA J. Manag. Math. 36, 21–49. https://doi.org/10.1093/imaman/dpae029 (2025).

    Google Scholar 

  10. Gutierrez, R. S., Solis, A. O. & Mukhopadhyay, S. Lumpy demand forecasting using neural networks. Int. J. Prod. Econ. 111, 409–420. https://doi.org/10.1016/j.ijpe.2007.01.007 (2008).

    Google Scholar 

  11. Makridakis, S., Spiliotis, E. & Assimakopoulos, V. The M4 competition: Results, findings, conclusion and way forward. Int. J. Forecast. 34, 802–808. https://doi.org/10.1016/j.ijforecast.2018.06.001 (2018).

    Google Scholar 

  12. Makridakis, S. et al. The M5 competition: Background, organization, and implementation. Int. J. Forecast. 38, 1325–1336. https://doi.org/10.1016/j.ijforecast.2021.07.007 (2022).

  13. Syntetos, A. A. & Boylan, J. E. On the bias of intermittent demand estimates. Int. J. Prod. Econ. 71, 457–466. https://doi.org/10.1016/S0925-5273(00)00143-2 (2001).

    Google Scholar 

  14. Eaves, A. H. C. & Kingsman, B. G. Forecasting for the ordering and stock-holding of spare parts. J. Oper. Res. Soc. 55, 431–437. https://doi.org/10.1057/palgrave.jors.2601697 (2004).

  15. Teunter, R. H., Syntetos, A. A. & Babai, M. Z. Intermittent demand: Linking forecasting to inventory obsolescence. Eur. J. Oper. Res. 214, 606–615. https://doi.org/10.1016/j.ejor.2011.05.018 (2011).

    Google Scholar 

  16. Zhang, G., Patuwo, B. E. & Hu, M. Y. Forecasting with artificial neural networks: The state of the art. Int. J. Forecast. 14 (1), 35–62. https://doi.org/10.1016/S0169-2070(97)00044-7 (1998).

    Google Scholar 

  17. Smyl, S. A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting. Int. J. Forecast. 36, 75–85. https://doi.org/10.1016/j.ijforecast.2019.03.017 (2020).

    Google Scholar 

  18. Li, L., Kang, Y., Petropoulos, F. & Li, F. Feature-based intermittent demand forecast combinations: Accuracy and inventory implications. Int. J. Prod. Res. 61, 7557–7572. https://doi.org/10.1080/00207543.2022.2153941 (2023).

    Google Scholar 

  19. Ma, T., Lin, Y., Zhou, X. & Zhang, M. Grading evaluation of goaf stability based on entropy and normal cloud model. Adv. Civil Eng. 2022 (1), 9600909. https://doi.org/10.1155/2022/9600909 (2022).

    Google Scholar 

  20. Xu, B. et al. Study on the prediction of the uniaxial compressive strength of rock based on the SSA-XGBoost model. Sustainability 15 (6), 5201. https://doi.org/10.3390/su15065201 (2023).

    Google Scholar 

  21. Cheng, Y. et al. Hybrid data-driven model and Shapley additive explanations for peak dilation angle of rock discontinuities. Mater. Today Commun. 40, 110194. https://doi.org/10.1016/j.mtcomm.2024.110194 (2024).

    Google Scholar 

  22. Ma, T. et al. Elastic modulus prediction for high-temperature treated rock using multi-step hybrid ensemble model combined with coronavirus herd immunity optimizer. Measurement 240, 115596. https://doi.org/10.1016/j.measurement.2024.115596 (2025).

    Google Scholar 

  23. Ma, T. et al. Physics-informed neural networks for capturing the true relationships between parameters to predict the dynamic triaxial strength of rocks in cold environments. Measurement 118900. https://doi.org/10.1016/j.measurement.2025.118900 (2025).

  24. Shen, L. et al. A new CNN-GRU deep learning framework optimized by CHIO for precise prediction of debris flow velocity. Stoch. Environ. Res. Risk Assess. 1–21. https://doi.org/10.1007/s00477-025-02973-7 (2025).

  25. Ma, T. et al. Hybrid empirical-data-driven neural network for predicting air-entry value in unsaturated soils. Math. Geosci. 1–30. https://doi.org/10.1007/s11004-025-10230-4 (2025).

  26. Liu, H. et al. Deep learning in rockburst intensity level prediction: performance evaluation and comparison of the NGO-CNN-BiGRU-attention model. Appl. Sci. 14 (13), 5719. https://doi.org/10.3390/app14135719 (2024).

    Google Scholar 

  27. Wang, Y., Ma, T., Shen, L., Wang, X. & Luo, R. Prediction of thermal conductivity of natural rock materials using LLE-transformer-lightGBM model for geothermal energy applications. Energy Rep. 13, 2516–2530. https://doi.org/10.1016/j.egyr.2025.02.003 (2025).

    Google Scholar 

  28. Xie, S., Lin, H., Ma, T., Peng, K. & Sun, Z. Prediction of joint roughness coefficient via hybrid machine learning model combined with principal components analysis. J. Rock Mech. Geotech. Eng. 17 (4), 2291–2306. https://doi.org/10.1016/j.jrmge.2024.05.059 (2025).

    Google Scholar 

  29. Nathan, B. S., Reddy, S., Sastry, B. V., Krishnaiah, C. C., Eswaramoorthy, K. V. & J., & Innovative framework for effective service parts management in the automotive industry. Front. Mech. Eng. 10, 1361688. https://doi.org/10.3389/fmech.2024.1361688 (2024).

    Google Scholar 

  30. B, S. N. et al. A machine learning framework for long-term forecasting of spare part demand in end-of-life product scenarios. Sci. Rep. https://doi.org/10.1038/s41598-025-31171-2 (2025).

    Google Scholar 

  31. Bobbili, V. S. R. et al. Physics-informed neural networks for predicting high-strain-rate energy absorption in additively manufactured lattice materials. Progress Additive Manuf. 1–27. https://doi.org/10.1007/s40964-025-01460-3 (2025).

  32. Reddy, B. V. S. et al. Machine learning approaches for predicting mechanical properties in additive manufactured lattice structures. Mater. Today Commun. 40, 109937. https://doi.org/10.1016/j.mtcomm.2024.109937 (2024).

    Google Scholar 

  33. Reddy, B. V. S. et al. Performance evaluation of machine learning techniques in surface roughness prediction for 3D printed micro-lattice structures. J. Manuf. Process. 137, 320–341. https://doi.org/10.1016/j.jmapro.2025.01.082 (2025).

    Google Scholar 

  34. Lichman, M. P. Smyth. Prediction of sparse user-item consumption rates with zero-inflated poisson regression In The Web Conference - Proceedings of the World Wide Web Conference, WWW 2018. 719–728. https://doi.org/10.1145/3178876.3186153 (ACM, 2018).

  35. Wallström, P. & Segerstedt, A. Evaluation of forecasting error measurements and techniques for intermittent demand. Int. J. Prod. Econ. 128, 625–636. https://doi.org/10.1016/j.ijpe.2010.07.013 (2010).

    Google Scholar 

  36. Cameron, A. C. & Trivedi, P. K. Regression Analysis of Count Data (Cambridge University Press, 2012).

  37. Kourentzes, N. On intermittent demand model optimisation and selection. Int. J. Prod. Econ. 156, 180–190. https://doi.org/10.1016/j.ijpe.2014.06.007 (2014).

    Google Scholar 

Download references

Funding

This work was co-funded by the European Union under the REFRESH - Research Excellence for Region Sustainability and High-tech Industries project (Project No. CZ.10.03.01/00/22_003/0000048) via the Operational Programme Just Transition. This article was also supported by the Students Grant Competition SP2024/087, Specific Research of Sustainable Manufacturing Technologies, financed by the Ministry of Education, Youth and Sports (MEYS), Czech Republic, and the Faculty of Mechanical Engineering, VŠB-Technical University of Ostrava.

Author information

Authors and Affiliations

  1. Department of Mechanical Engineering, Indian Institute of Information Technology Design and Manufacturing Kurnool (IIITDM Kurnool), Kurnool, Andhra Pradesh, 518008, India

    B. Sendhil Nathan, P. M. Aravinth, B. Veera Siva Reddy & C. Chandrasekhara Sastry

  2. Ford Global Technology & Business Center, Ford Motor Private Limited, Chennai, Tamil Nadu, 600119, India

    B. Sendhil Nathan

  3. Department of Mechanical Engineering, Gazi University, Ankara, Turkey

    Sachin Salunkhe

  4. Department of Machining, Assembly and Engineering Metrology, Faculty of Mechanical Engineering, VSB-Technical University of Ostrava, 70800, Ostrava, Czech Republic

    Robert Cep

Authors
  1. B. Sendhil Nathan
    View author publications

    Search author on:PubMed Google Scholar

  2. P. M. Aravinth
    View author publications

    Search author on:PubMed Google Scholar

  3. B. Veera Siva Reddy
    View author publications

    Search author on:PubMed Google Scholar

  4. C. Chandrasekhara Sastry
    View author publications

    Search author on:PubMed Google Scholar

  5. Sachin Salunkhe
    View author publications

    Search author on:PubMed Google Scholar

  6. Robert Cep
    View author publications

    Search author on:PubMed Google Scholar

Contributions

S. N. B.: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (equal); Methodology (equal); Writing-original draft (lead); Writing-review & editing (equal). (A) P. M.: Conceptualization (equal); Methodology (equal); Formal analysis (lead); Investigation (equal); Writing & editing (equal). (B) V. S. R.: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (equal); Methodology (equal); Writing-original draft (lead); Writing & editing (equal). (C) C. S.: Conceptualization (equal); Methodology (equal); Formal analysis (lead); Funding acquisition (lead); Supervision (lead); Investigation (equal); Writing & editing (lead). S. S.: Funding acquisition (lead); Investigation (equal); Writing & editing (equal). R. C.: Funding acquisition (lead); Investigation (equal); Writing & editing (equal).

Corresponding authors

Correspondence to B. Veera Siva Reddy or C. Chandrasekhara Sastry.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nathan, B.S., Aravinth, P.M., Reddy, B.V.S. et al. Primacy of feature engineering over architectural complexity for intermittent demand forecasting. Sci Rep (2026). https://doi.org/10.1038/s41598-026-35197-y

Download citation

  • Received: 12 December 2025

  • Accepted: 02 January 2026

  • Published: 06 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-35197-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Feature engineering
  • Intermittent demand forecasting
  • Machine learning for supply chains
  • Sparse time series
  • statistical-ML hybrid models
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics