Abstract
With a growing number of institutions involved in the global education market, it has become increasingly challenging to verify the authenticity of academic documents and to match them between universities. The paper introduces a deep Blockchain-based system to verify, transfer and match certificates, transcripts, and study plans. ARABBERTV2, pre-trained large language model (LLM) is used in this study to extract high-quality semantic representations from academic transcripts, which are then processed through dimensionality reduction and classification stages to detect equivalences and mismatches. To further enhance privacy and collaboration without sharing raw transcripts, Federated Learning (FL), is used to locally fine-tune a shared model under blockchain-coordinated aggregation. The evaluation was conducted on 661 collected study plans from Saudi universities, with 629 processed PDF documents used for training and testing. After dimensionality reduction to 41 principal components, the proposed model achieved 98.13% classification accuracy and a Kappa statistic of 0.9784. Integrating Federated Learning further improved performance, increasing accuracy from 93.5% (baseline) to 95.6%, and AUC-ROC from 0.947 to 0.972, while reducing inter-university performance variance. The findings demonstrate the efficiency of the proposed model and its importance in building such public framework in academic environments. The findings also demonstrate that Saudi universities primary plans differ from each other. The results recognize the contribution of the deep LLM characteristics to the production of perceptive categorization conclusions.
Similar content being viewed by others
Data availability
The complete dataset, preprocessing steps, feature extraction process, federated learning simulation, and blockchain coordination are publicly accessible to ensure reproducibility at: https://github.com/SamiMnasri/Data-Study-plans-pdfs-classified-by-university-college-major-_.
References
Ayub Khan, A. et al. Educational blockchain: A secure degree attestation and verification traceability architecture for Higher Education Commission. Appl. Sci. (Basel) 11, 10917. https://doi.org/10.3390/app112210917 (2021).
Alrowaily, M. A. et al. Modeling and analysis of proof-based strategies for distributed consensus in blockchain-based peer-to-peer networks. Sustainability 15(2), 1478. https://doi.org/10.3390/su15021478 (2023).
Alam, A. Platform utilising blockchain technology for elearning and online education for open sharing of academic proficiency and progress records. In Smart Data Intelligence. Algorithms for Intelligent Systems (eds Asokan, R. et al.) (Springer, 2022). https://doi.org/10.1007/978-981-19-3311-0_26.
Mnasri, S. & Habbash, M. Study of the influence of Arabic mother tongue on the English language using a hybrid artificial intelligence method. Interact. Learn. Environ. 31(9), 5568–5581. https://doi.org/10.1080/10494820.2021.2012809 (2023).
Laddha, S. et al. COVID-19 diagnosis and classification using radiological imaging and deep learning techniques: A comparative study. Diagnostics (Basel) 12(8), 1880. https://doi.org/10.3390/diagnostics12081880 (2022).
Han, M. et al. (eds). A novel blockchain-based education records verification solution. in proceedings of the 19th annual sig conference on information technology education (SIGITE ‘18). Association for Computing Machinery, New York, NY, USA, 178–183. (2018). https://doi.org/10.1145/3241815.3241870
Caldarelli, G. & Ellul, J. Trusted academic transcripts on the blockchain: A systematic literature review. Appl. Sci. (Basel) 11(4), 1842. https://doi.org/10.3390/app11041842 (2021).
Satybaldy, A., Subedi, A. & Nowostawski, M. A framework for online document verification using self-sovereign identity technology. Sensors (Basel) 22, 8408. https://doi.org/10.3390/s22218408 (2022).
Paull, M. et al. Matching expectations for successful university student volunteering, Education + Training, 59(2), 122–134. Emerald Publishing Limited, (2017). https://doi.org/10.1108/ET-03-2016-0052
Nipaporn Chanamarn, K. & Tamee Enhancing Efficient Study Plan for Student with Machine Learning Techniques, International Journal of Modern Education and Computer Science(IJMECS),9,3,1–9, (2017). https://doi.org/10.5815/ijmecs.2017.03.01
Hu, S. et al. BlockDL: Privacy-Preserving and Crowd-Sourced Deep Learning Through Blockchain, 2021 IEEE Symposium on Computers and Communications (ISCC), Athens, Greece, 1–7, (2021). https://doi.org/10.1109/ISCC53001.2021.9631423
Gupta, R., Patel, M. M. & Shukla, A. Sudeep Tanwar. Deep learning-based malicious smart contract detection scheme for internet of things environment. Comput. Electr. Eng. 97 https://doi.org/10.1016/j.compeleceng.2021.107583 (2022).
Shafay, M. et al. Blockchain for deep learning: Review and open challenges. Cluster Comput. 26, 197–221. https://doi.org/10.1007/s10586-022-03582-7 (2023).
Behera, M. R., Upadhyay, S. & Shetty, S. Sep. Federated Learning using Smart Contracts on Blockchain, based on Reward Driven Approach. arXiv:2107.10243v2 [cs.CR], (2021).
Alsaadi, A. H. & Bamasoud, D. M. Blockchain technology in education system a survey examining potential uses of Blockchain in Saudi Arabia education. Int. J. Adv. Comput. Sci. Appl. 12(5), 730–739. https://doi.org/10.14569/IJACSA.2021.0120585 (2021).
Alangari, S. et al. Developing a blockchain-based digitally secured model for the educational sector in Saudi Arabia toward digital transformation. PeerJ Comput. Sci. 8, e1120. https://doi.org/10.7717/peerj-cs.1120 (2022).
Alshahrani, M., Beloff, N. & White, M. Towards a Blockchain-based Smart Certification System for Higher Education: An Empirical Study. Int. J. Comput. Digit. Syst. 11 (1). https://doi.org/10.12785/ijcds/110145 (2021).
Li, H. & Han, D. EduRSS: A Blockchain-Based educational records secure storage and sharing scheme. IEEE Access 7, 179273–179289. https://doi.org/10.1109/ACCESS.2019.2956157 (2019).
Turkanović, M., Hölbl, M., Košič, K., Heričko, M. & Kamišalić, A. EduCTX: A Blockchain-Based higher education credit platform. IEEE Access 6, 5112–5127. https://doi.org/10.1109/ACCESS.2018.2789929 (2018).
Aamna Tariq, H. B. & Haq Syed Taha Ali. Cerberus: A Blockchain-Based Accreditation and Degree Verification System. Cryptography and Security (cs.CR) arXiv:1912.06812 [cs.CR]. https://doi.org/10.48550/arXiv.1912.06812
Kathole, A. B. et al. Secure federated cloud storage protection strategy using hybrid heuristic attribute-based encryption with permissioned Blockchain. IEEE Access 12, 117154–117169. https://doi.org/10.1109/ACCESS.2024.3447829 (2024).
Kathole, A. B. et al. Electronic health records protection strategy by using blockchain approach. Multimed. Tools Appl. 83, 86883–86894. https://doi.org/10.1007/s11042-024-19772-x (2024).
Kathole, A. B. et al. A novel approach to IoT security for intrusion detection system using ensemble network and heuristic-assisted feature fusion. J. Discrete Math. Sci. Cryptogr. 27(7), 2207–2217. https://doi.org/10.47974/JDMSC-2092 (2024).
Kathole, A., Patil, S., Jadhav, D., Pathak, H. & Mirge, A. S. Development of student intent-based educational chatbot system with adaptive and attentive DTCN on symmetric convolution approach. MethodsX 15 https://doi.org/10.1016/j.mex.2025.103542 (2025).
Patil, S. D., Kathole, A. B., Kumbhare, S., Vhatkar, K. & Kimbahune, V. V. A blockchain-based approach to ensuring the security of electronic data. Int. J. Intell. Syst. Appl. Eng. 12(11), 649–55 (2024).
Kumbhare, S., Kathole, A. B. & Shinde, S. Federated learning aided breast cancer detection with intelligent heuristic-based deep learning framework. Biomed. Signal Process. Control https://doi.org/10.1016/j.bspc.2023.105080 (2023).
Arndt, T. & Guercio, A. Blockchain-based transcripts for mobile higher-education. Int. J. Inf. Educ. Technol. 10(2), 84–89. https://doi.org/10.18178/ijiet.2020.10.2.1344 (2020).
Wissam Antoun, F. & Baly and Hazem Hajj. AraBERT: Transformer-based Model for Arabic Language Understanding. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, May 2020, 9–15, Marseille, France. European Language Resource Association, ISBN = 979-10-95546-51-1. (2020).
Zeroual, I., Goldhahn, D., Eckart, T., Lakhouaja, A. & OSIAN. : Open source international Arabic news corpus - preparation and integration into the CLARIN infrastructure. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, pages 175–182, Florence, Italy, August. Association for Computational Linguistics. (2019).
Funding
The authors extend their appreciation to the Deanship of Scientific Research at University of Tabuk for funding this work through Research 0278-1443-S.
Author information
Authors and Affiliations
Contributions
Conceptualization (M. Alghamdi, I. Alkhazi,); Data curation (S. Mnasri, A. Hassanat, M. Alrowaily); Formal analysis (A. Hassanat, S. Mnasri, L. Zhenzhong,); Funding acquisition (M. Alghamdi, I. Alkhazi); Methodology (A. Hassanat, S. Mnasri, M. Alrowaily,); Project administration (M. Alghamdi, S. Mnasri); Resources (S. Mnasri, M. Arbab, N. Bloui); Software (A. Hassanat, S. Mnasri); Validation (L. Zhenzhong, M. Alghamdi,); Writing original draft (S. Mnasri, M. Arbab);
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval and informed consent statements
Ethics and informed consent statements approved.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Alghamdi, M., Mnasri, S., Hassanat, A. et al. Federated deep blockchain-based system for secure verification of academic transcripts and matching study plans in Saudi universities. Sci Rep (2026). https://doi.org/10.1038/s41598-026-43328-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-43328-8


