Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
A machine learning based framework for predictive school management using student and faculty analytics
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 04 April 2026

A machine learning based framework for predictive school management using student and faculty analytics

  • Ming Yang1,
  • Zhe Li2 &
  • Shaoyan Liu3Ā 

Scientific Reports , ArticleĀ number:Ā  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Computational biology and bioinformatics
  • Mathematics and computing

Abstract

New technologies in education have created a huge amount of data that, when used effectively, can have a major impact on the functioning of an institution and the academic achievement of students. Nevertheless, all existing predictive models are still disconnected and do not integrate historical trends, student-faculty relationships, and trend patterns into a coherent decision-making system. The paper describes an integrated machine learning system that integrates several synergistic AI technologies: (1) deep learning systems (LSTM, GRU, CNN, and Transformers) to model academic growth over time; (2) comprehensible gradient boosting ensembles (XGBoost, LightGBM, and CatBoost) to understandably infer and analyze structured data. (3) graph convolutional networks (GCNs) to encode academic relationships between students, professors, and courses; and (4) data-centric oriented approaches (multitasking, transfer, and federated learning). The framework is tested on two UCI benchmark datasets (n = 649) with fully isolated holdout sets using strict nested cross-validation to prevent data leakage. The framework yields 99.6% and 97.5% predictive accuracy (5.6% and 6.3% improvement over the top baselines) and high recall (99.4% and 96.7%) in classifying at-risk students. Each component has been shown to contribute fully in ablation studies, and the hybrid framework has been shown to outperform state-of-the-art transformed table models (TabTransformer, FT-Transformer, and SAINT) (99.6% vs. 97.2% for the best transformer). Robustness analysis with feature noise and missing data (> 96% accuracy with 20% missing data) demonstrates excellent regression. Fairness assessment indicates that gender and age bias are very small, and mitigation strategies (reweighting, adversarial debiasing) bring the parental education gap down to 0.1%. Cross-domain experiments (mathematics/Portuguese) show a performance loss of -2.3%, indicating internal generalizability, but cross-institutional validation remains to be performed. This framework provides educators with interpretable, actionable insights into evidence-based interventions, demonstrating that for accurate, fair, and robust predictive educational analytics, multi-paradigm AI integration is essential and comprehensive.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  1. Ationu, H. Predicting student performance using machine learning: a data-driven approach with consideration of special needs students. no April. https://doi.org/10.13140/RG.2.2.32112.98569 (2025).

    Google ScholarĀ 

  2. Hakkal, S. & Lahcen, A. A. XGBoost To Enhance Learner Performance Prediction. Comput. Educ. Artif. Intell. 7, 100254. https://doi.org/10.1016/j.caeai.2024.100254 (2024).

    Google ScholarĀ 

  3. Altabrawee, H., Ali, O. A. J. & Ajmi, S. Q. Predicting Students’ Performance Using Machine Learning Techniques. J. Univ. BABYLON. Pure Appl. Sci. 27 (1), 194–205. https://doi.org/10.29196/jubpas.v27i1.2108 (2023).

    Google ScholarĀ 

  4. Adefemi, K. O., Mutanga, M. B. & Jugoo, V. Hybrid Deep Learning Models for Predicting Student Academic Performance. Math. Comput. Appl. 30 (3), 10–20. https://doi.org/10.3390/mca30030059 (2025).

    Google ScholarĀ 

  5. Alnasyan, B., Basheri, M. & Alassafi, M. The power of Deep Learning techniques for predicting student performance in Virtual Learning Environments: A systematic literature review. Comput. Educ. Artif. Intell. 6, 100231. https://doi.org/10.1016/j.caeai.2024.100231 (2024).

    Google ScholarĀ 

  6. Liu, Y. et al. Predicting Student Performance Using Clickstream Data and Machine Learning. Educ. Sci. 13 (1). https://doi.org/10.3390/educsci13010017 (2023).

  7. Arya, M. et al. A CNN-LSTM-based deep learning model for early prediction of student’s performance. Int. J. Smart Sens. Intell. Syst. 17 (1), 1–10. https://doi.org/10.2478/ijssis-2024-0036 (2024).

    Google ScholarĀ 

  8. Abatal, A. et al. A Comprehensive Evaluation of Machine Learning Techniques for Forecasting Student Academic Success. J. Electron. Electromed Eng. Med. Inf. 7 (1), 1–12. https://doi.org/10.35882/jeeemi.v7i1.489 (2025).

    Google ScholarĀ 

  9. Tang, L. Comparison the Performances for Distributed Machine Learning: Evidence from XGboost and DNN. Appl. Comput. Eng. 103 (1), 209–215. https://doi.org/10.54254/2755-2721/103/20241196 (2024).

    Google ScholarĀ 

  10. Jiang, W. Deep Learning-Based Prediction of Student Performance in Physics Education Using Multimodal Data. Proc. 2025 Int. Conf. Big Data Informatiz Educ. ICBDIE 2025. 119–124. https://doi.org/10.1145/3729605.3729627 (2025).

  11. Gurcan, F. Enhancing breast cancer prediction through stacking ensemble and deep learning integration. PeerJ Comput. Sci. 11 https://doi.org/10.7717/PEERJ-CS.2461 (2025).

  12. Laribi, N. et al. Ensemble deep learning of CNN vs vision transformers for brain lesion classification on MRI images, CEUR Workshop Proc., vol. 3892, pp. 203–219, (2024).

  13. Sudhamathy, G. & Valliammal, N. The Bayesian CNN-LSTM classification model to predict and evaluate learner’s performance. Int. J. Appl. Sci. Eng. 20 (4). https://doi.org/10.6703/IJASE.202312_20(4).007 (2023).

  14. Elrahman, A. A., Soliman, T. H. A., Taloba, A. I. & Farghally, M. F. A Predictive Model for Student Performance in Classrooms using Student Interactions with an eTextbook. Inf. Sci. Lett. 12 (1), 9–12. https://doi.org/10.18576/isl/120102 (2023).

    Google ScholarĀ 

  15. Wu, X., Yu, Z., Zhang, C. & Zhiheng, Z. Research on MOOC dropout prediction by combining CNN-BiGRU and GCN, vol. 13486, no. Cvaa p. 109, 2025, (2024). https://doi.org/10.1117/12.3055872

  16. Kumar, P. Predictive modeling for injury prevention in athletes using artificial intelligence. Int. J. Physiol. Sport Phys. Educ. 6 (2), 17–20. https://doi.org/10.33545/26647710.2024.v6.i2a.76 (2024).

    Google ScholarĀ 

  17. Sun, Q. et al. Machine learning-based assessment of diabetes risk: Machine learning-based assessment of diabetes risk: Q. Appl. Intell., 55, 2, 1–13, doi: https://doi.org/10.1007/s10489-024-05912-1. (2025).

    Google ScholarĀ 

  18. Selvaraj, J., Jerith, G. G., Karthikeyan, R. & Senthil, K. EAI Endorsed Transactions Assessment of CatBoost for Diabetes Prevention in Comparison to XGBoost: AI model capable of predicting the onset of diabetes, 11, pp. 1–8, https://doi.org/10.4108/eetiot.5880

  19. Chella, A., Pirrone, R., Sorbello, R. & Jóhannsdóttir, K. R. Advances in Digital Science. Adv. Intell. Syst. Comput. 1352, no. https://doi.org/10.1007/978-3-030-71782-7 (March, 2024).

  20. Nadar, N. Enhancing student performance prediction through stream analysis dataset using modified XGBoost algorithm. Int. J. Inf. Technol. Secur. 15 (2), 75–86. https://doi.org/10.59035/knug1085 (2023).

    Google ScholarĀ 

  21. Herath, D., Dinuwan, C., Ihalagedara, C. & Ambegoda, T. Enhancing Educational Outcomes Through AI Powered Learning Strategy Recommendation System. Int. J. Adv. Comput. Sci. Appl. 15 (10), 739–748. https://doi.org/10.14569/IJACSA.2024.0151075 (2024).

    Google ScholarĀ 

  22. Borna, M. R., Saadat, H., Hojjati, A. T. & Akbari, E. Analyzing click data with AI: implications for student performance prediction and learning assessment. Front. Educ. 9, no. https://doi.org/10.3389/feduc.2024.1421479 (December, 2024).

  23. Balayet Hossain, M. et al. Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI. IEEE Access. 13, 79609–79622. https://doi.org/10.1109/ACCESS.2025.3562577 (2025).

    Google ScholarĀ 

  24. Dritsas, E. & Trigka, M. Application of Deep Learning for Heart Attack Prediction with Explainable Artificial Intelligence. Computers 13 (10). https://doi.org/10.3390/computers13100244 (2024).

  25. Silva., A. P. C. and Using Data Mining to Predict Secondary School Student Performance, Proc. 5th Futur. Bus. Technol. Conf., no. 978-9077381-39–7, pp. 5–12, [Online]. Available: UCI Repository - Student Performance Data (2008).

  26. Hasan, R., Palaniappan, S., Mahmood, S., Abbas, A. & Sarker, K. U. Dataset of students’ performance using student information system, moodle and the mobile application ā€˜edify’. Data 6 (11), 1–10. https://doi.org/10.3390/data6110110 (2021).

    Google ScholarĀ 

  27. Li, X. & Li, S. Transformer Help CNN See Better: A Lightweight Hybrid Apple Disease Identification Model Based on Transformers, (2022).

  28. Khoshkroodi, A., Parvini Sani, H. & Aajami, M. Stacking Ensemble-Based Machine Learning Model for Predicting Deterioration Components of Steel W-Section Beams. Buildings 14 (1). https://doi.org/10.3390/buildings14010240 (2024).

  29. Emima, D. I. G. A. A. Integrative Ensemble Learning Algorithm for Predicting Students ’, pp. 72–84, (2025).

  30. Wang, Z. et al. Model for prediction of oxygen required in BOF steelmaking. Ironmak. Steelmak. 39 (3), 228–233. https://doi.org/10.1179/1743281211Y.0000000085 (2023).

    Google ScholarĀ 

  31. Oyucu, S., Ersƶz, B., Sağıroğlu, Ş., Aksƶz, A. & BiƧer, E. Optimizing Lithium-Ion Battery Performance: Integrating Machine Learning and Explainable AI for Enhanced Energy Management. Sustain 16 (11). https://doi.org/10.3390/su16114755 (2024).

  32. Kipf, T. N. & Welling, M. SEMI-SUPERVISED CLASSIFICATION WITH GRAPH CONVOLUTIONAL NETWORKS. 3 , pp. 1–14, (2022).

  33. Cao, L., Shen, Z. & Xu, S. Efficient forest fire detection based on an improved YOLO model. Vis. Intell. 2 (1). https://doi.org/10.1007/s44267-024-00053-y (2024).

  34. Radford, A. et al. Learning Transferable Visual Models From Natural Language Supervision. Proc. Mach. Learn. Res. 139, 8748–8763 (2021).

    Google ScholarĀ 

  35. Brendan McMahan, H. et al. y Communication-efficient learning of deep networks from decentralized data, Proc. 20th Int. Conf. Artif. Intell. Stat. AISTATS 2017, 54, (2023).

  36. Zafar, W. et al. Enhanced TumorNet: Leveraging YOLOv8s and U-Net for Superior Brain Tumor. Results Eng. 102994. https://doi.org/10.1016/j.rineng.2024.102994 (2024).

Download references

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

  1. Department of Science and Education, Aviation General Hospital, No. 3, Beiyuan Road, Chaoyang District, Beijing, 100012, China

    Ming Yang

  2. Discipline Inspection Office, Capital Medical University Affiliated Beijing Hospital of Traditional Chinese, Beijing, 100010, China

    Zhe Li

  3. Department of Educational Teaching Quality Construction, Chinese Research Academy of Traditional Chinese Medicine College, Beijing, 100700, China

    Shaoyan Liu

Authors
  1. Ming Yang
    View author publications

    Search author on:PubMedĀ Google Scholar

  2. Zhe Li
    View author publications

    Search author on:PubMedĀ Google Scholar

  3. Shaoyan Liu
    View author publications

    Search author on:PubMedĀ Google Scholar

Contributions

M.Y conceived the study, designed the methodology, and supervised the overall research. Z.L performed the experiments, data analysis, and visualization. S.L contributed to data collection, literature review, and result validation. M.Y drafted the initial manuscript, and all authors reviewed, edited, and approved the final version of the manuscript for publication.

Corresponding author

Correspondence to Ming Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, M., Li, Z. & Liu, S. A machine learning based framework for predictive school management using student and faculty analytics. Sci Rep (2026). https://doi.org/10.1038/s41598-026-47278-z

Download citation

  • Received: 31 October 2025

  • Accepted: 31 March 2026

  • Published: 04 April 2026

  • DOI: https://doi.org/10.1038/s41598-026-47278-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Predictive Educational Analytics
  • Hybrid Deep Learning
  • Graph Neural Networks (GNNs)
  • Ensemble Methods
  • Student At-Risk Identification
  • Intelligent School Management Systems
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics