Abstract
Employee attrition poses significant challenges to organizations, impacting productivity, morale, and financial stability. Predicting attrition and understanding its underlying drivers are critical for implementing effective retention strategies. In this study, we propose a comprehensive framework that utilizes advanced machine learning techniques to predict employee attrition and job change likelihood. The framework integrates robust preprocessing pipelines, state-of-the-art predictive models, and explainability tools such as SHAP (SHapley Additive exPlanations) to ensure transparency and fairness in HR analytics. By addressing key challenges such as class imbalance, feature selection, and model interpretability, our approach provides actionable insights for proactive talent management. We evaluate the framework on multiple datasets (including the IBM HR Analytics Employee Attrition & Performance dataset and the HR Analytics: Job Change of Data Scientists dataset), achieving near-optimal performance metrics across diverse scenarios. Notably, the Adaptive Boosting (AB) and Histogram Gradient Boosting (HGB) models demonstrate superior performance, with high Precision, Recall, F1-score, and Accuracy. Global and local interpretability analyses using SHAP visualizations reveal critical predictors of attrition, such as OverTime, JobLevel, and JobSatisfaction, enabling targeted interventions. The results underscore the framework’s adaptability, scalability, and potential for real-time deployment in organizational settings. This study contributes to advancing HR analytics by bridging gaps in predictive accuracy, interpretability, and generalizability; offering practical solutions for mitigating employee turnover and safeguarding human capital investments.
Data availability
The current study utilized four datasets: 1. The IBM HR Analytics Employee Attrition & Performance dataset available at: https://www.kaggle.com/datasets/pavansubhasht/ibm-hr-analytics-attrition-dataset 2. The “HR Analytics: Job Change of Data Scientists” dataset available at: https://www.kaggle.com/datasets/arashnic/hr-analytics-job-change-of-data-scientists 3. The HR Dataset v14 is a synthetic dataset available at: https://rpubs.com/rhuebner/hrd_cb_v14. 4. The Attrition Rate of a Company dataset available at: https://www.kaggle.com/datasets/anujachintyabiswas/attrition-rate-of-a-company
References
Vignesh, S.G., Sarojini, V., Vetrivel, S. Employee attrition and employee retention-challenges & suggestions. In International Conference On Economic Transformation with Inclusive Growth (2018).
Holtom, B. C., Mitchell, T. R., Lee, T. W. & Eberly, M. B. 5 turnover and retention research: A glance at the past, a closer review of the present, and a venture into the future. Acad. Manag. Ann. 2(1), 231–274 (2008).
Goswami, B. K. & Jha, S. Attrition issues and retention challenges of employees. Int. J. Sci. Eng. Res. 3(4), 1–6 (2012).
White, M. F. True employee turnover costs: A qualitative case study. PhD thesis, Northcentral University (2017).
O’Connell, M., Kung, M.-C. The cost of employee turnover. Ind. Manag. 49(1) (2007).
Wale-Oshinowo, B. A. & Majekodunmi, S. A. Workforce attrition and sustainable retention strategies in micro, small and medium-sized enterprises: Trends and insights from the literature. J. Manag. Sci. 61(9), 177–194 (2024).
Hajra, H., Jayalakshmi, G. Evaluating the impact of employee attrition on organizational performance through mis analytics. In Multidisciplinary Approaches to AI, Data, and Innovation for a Smarter World 87–110 (IGI Global Scientific Publishing, 2025).
Huang, V., Cheng, E. P. Job openings and hires decline in 2023 as the labor market cools. Monthly Labor Review (2024).
Coates, K.R. Strategies for reducing employee turnover costs. PhD thesis, Walden University (2024).
Jackson, S. E. & Schuler, R. S. Understanding human resource management in the context of organizations and their environments. Ann. Rev. Psychol. 46(1), 237–264 (1995).
Coelho, R. Reasons for employees to leave an organization (2024).
Peterson, S. L. Toward a theoretical model of employee turnover: A human resource development perspective. Hum. Resour. Dev. Rev. 3(3), 209–227 (2004).
MacDonald, W. The impact of job demands and workload on stress and fatigue. Aust. Psychol. 38(2), 102–117 (2003).
Qureshi, I., Jamil, R., Iftikhar, M., Arif, S., Lodhi, S., Naseem, I., Zaman, K. Job stress, workload, environment and employees turnover intentions: Destiny or choice. Arch. Sci.(Sciences Des Archives) 65(8) (2012).
Agyeman, C. M. & Ponniah, V. Employee demographic characteristics and their effects on turnover and retention in MSMEs. Int. J. Recent Adv. Organ. Behav. Decis. Sci. 1(1), 12–29 (2014).
Shaffer, M.A. Expatriate turnover: An investigation of the decision process and an analysis of the impact and nature of spouse adjustment. The University of Texas at Arlington (1994).
Okon, R., Odionu, C. S. & Bristol-Alagbariya, B. Integrating data-driven analytics into human resource management to improve decision-making and organizational effectiveness. IRE J. 8(6), 574 (2024).
Majam, T. & Jarbandhan, D. B. Data driven human resource management in the fourth industrial revolution (4ir). Africa’s Pub. Serv. Deliv. Perform. Rev. 10(1), 588 (2022).
Elugbaju, W. K., Okeke, N. I. & Alabi, O. A. Human resource analytics as a strategic tool for workforce planning and succession management. Int. J. Eng. Res. Dev. 20(11), 744–756 (2024).
Pala, S. K. Use and applications of data analytics in human resource management and talent acquisition. Int. J. Enhanc. Res. Manag. Comput. Appl. (2024).
Paul, Z. I. et al. Reshaping the future of HR: Human resource analytics and talent management. Bull. Bus. Econ. (BBE) 13(2), 332–340 (2024).
Dreichuk, M. & Sytnyk, Y. HR analytics as a risk monitoring tool in personnel management systems. Econ. Educ. 10(1), 22–28 (2025).
Kiran, P. R., Chaubey, A. & Shastri, R. K. Role of HR analytics and attrition on organisational performance: A literature review leveraging the SCM-TBFO framework. Benchmark. Int. J. 31(9), 3102–3129 (2024).
Srivastava, A. K., Patnaik, D. Data-driven insights and predictive modelling for employee attrition: A comprehensive analysis using statistical and machine learning techniques. J. Comput. Anal. Appl. 34(1) (2025)
Rajagopal, N. K., Anand, M. & Mohanty, S. Exploring machine learning applications in human resources management: A comprehensive review. Innov. Intell. Digit. Technol. Towards Increased Effic. 2, 303–313 (2025).
Sharma, R., Jain, A., Manwal, M., et al. Enhancing human resource management through deep learning: A predictive analytics approach to employee retention success. In 2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS) 1–4 (IEEE, 2024).
Izharuddin, M. Job satisfaction, and working environment: Mediating role of work-life balance. Manag. Stud. Entrep. J. 5(1), 1144–1149 (2024).
Choi, Y. A prediction of work-life balance using machine learning. Asia Pacific J. Inf. Syst. 34(1), 209–225 (2024).
Sajidha, S., Vijayanand, D., Rajagopal, M., Sambasivam, D., Nisha, V. Decoding work-life balance conundrum: A meta-learning prediction approach. In Creating AI Synergy Through Business Technology Transformation 313–336 (IGI Global, 2025).
Sun, Y. & Jung, H. Machine learning (ml) modeling, IoT, and optimizing organizational operations through integrated strategies: The role of technology and human resource management. Sustainability 16(16), 6751 (2024).
Setiawan, I., Suprihanto, S., Nugraha, A., Hutahaean, J. Hr analytics: Employee attrition analysis using logistic regression. In Iop Conference Series: Materials Science and Engineering, vol. 830, p. 032001 (IOP Publishing, 2020).
Krishna, S. & Sidharth, S. HR analytics: Employee attrition analysis using random forest. Int. J. Perform. Eng. 18(4), 275 (2022).
Nagpal, P., Pawar, A., SH, M. Predicting employee attrition through HR analytics: A machine learning approach. In 2024 4th International Conference on Innovative Practices in Technology and Management (ICIPTM), Noida, India, pp. 1–4 (2024).
Gazi, M. S. et al. Employee attrition prediction in the USA: A machine learning approach for HR analytics and talent retention strategies. J. Bus. Manag. Stud. 6(3), 47–59 (2024).
Arqawi, S., Rumman, M., Zitawi, E., Abunasser, B. & Abu-Naser, S. Predicting employee attrition and performance using deep learning. J. Theor. Appl. Inf. Technol. 100(21), 6526–6536 (2022).
Maloku, F., Maloku, B. Analyzing IBM HR data: Employee attrition and performance insights. J. Eng. Appl. Sci. Technol. SRC/JEAST-382. 268, 2–10. 10.47363/JEAST/2024 (6) (2024)
Nimmagadda, S., Lakshmi, R. J., Kishan, M. S. R., Veeram, N. HR analytics for predicting employee attrition with logistic regression. J. Inf. Educ. Res. (2024)
Poornappriya, T., Gopinath, R. Employee attrition in human resource using machine learning techniques. Webology 18(6) (2021).
Yahia, N. B., Hlel, J. & Colomo-Palacios, R. From big data to deep data to support people analytics for employee attrition prediction. IEEE Access 9, 60447–60458. https://doi.org/10.1109/ACCESS.2021.3074559 (2021).
Xin, J. L. J. & Mahadi, N. Hr analytics for data-driven employee attrition management. J. Organ. Behav. Perform. 29(3), 45–65 (2024).
Das, R. C., Devi, A. Conceptualizing the importance of HR analytics in attrition reduction. Int. Res. J. Adv. Sci. Hub 2(Special Issue ICAMET 10S), 40–48 (2020).
Cabello-Solorzano, K., Araujo, I., Peña, M., Correia, L., J. Tallón-Ballesteros, A. The impact of data normalization on the accuracy of machine learning algorithms: A comparative analysis. In International Conference on Soft Computing Models in Industrial and Environmental Applications, 344–353 (Springer, 2023).
Shehab, N., Badawy, M. & Ali, H. A. Toward feature selection in big data preprocessing based on hybrid cloud-based model. J. Supercomput. 78(3), 3226–3265 (2022).
Jeon, H. & Oh, S. Hybrid-recursive feature elimination for efficient feature selection. Appl. Sci. 10(9), 3211 (2020).
Awad, M. & Fraihat, S. Recursive feature elimination with cross-validation with decision tree: Feature selection method for machine learning-based intrusion detection systems. J. Sensor Actuator Netw. 12(5), 67 (2023).
Watanabe, S. Tree-structured parzen estimator: Understanding its algorithm components and their roles for better empirical performance. arXiv preprint arXiv:2304.11127 (2023)
Pedro, F. A review of data mining, big data analytics, and machine learning approaches. J. Comput. Nat. Sci 3, 169–181 (2023).
Naidu, G., Zuva, T., Sibanda, E. M. A review of evaluation metrics in machine learning algorithms. In Computer Science On-line Conference 15–25 (Springer, 2023).
Dey, R., Mathur, R. Ensemble learning method using stacking with base learner, a comparison. In International Conference on Data Analytics and Insights 159–169 (Springer, 2023).
Acknowledgements
During the preparation of this manuscript, the authors used ChatGPT (OpenAI, GPT-5, 2025 version) for text refinement, including grammar, sentence structure, and clarity improvements. The authors reviewed and edited the output and take full responsibility for the content of this publication. No part of the research design, data analysis, results interpretation, or scientific conclusions relied on AI-generated content.
Funding
No funding was received.
Author information
Authors and Affiliations
Contributions
Conceptualization, Maytha AL-Ali, Majed Alwateer, and Shatha Abed Alsaedi; Data curation, Shatha Abed Alsaedi and Hossam Magdy Balaha; Formal analysis, Maytha AL-Ali, Mahmoud Badawy, and Mostafa A. Elhosseini; Investigation, Majed Alwateer, Shatha Abed Alsaedi and Hossam Magdy Balaha; Methodology, Shatha Abed Alsaedi, Hossam Magdy Balaha and Mahmoud Badawy; Software, Shatha Abed Alsaedi and Hossam Magdy Balaha; Supervision, Mahmoud Badawy and Mostafa A. Elhosseini; Validation, Maytha AL-Ali and Majed Alwateer; Visualization, Shatha Abed Alsaedi and Hossam Magdy Balaha; Writing - original draft, Maytha AL-Ali, Majed Alwateer, Shatha Abed Alsaedi, and Hossam Magdy Balaha; Writing - review & editing, Mahmoud Badawy and Mostafa A. Elhosseini.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
AL-Ali, M., Alwateer, M., Alsaedi, S.A. et al. Integrating machine learning and explainable AI for employee attrition prediction in HR analytics. Sci Rep (2026). https://doi.org/10.1038/s41598-026-36424-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-36424-2