Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
A hybrid actor–critic and BERT framework for intelligent course recommendation in IoT-aware e-learning systems
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 23 February 2026

A hybrid actor–critic and BERT framework for intelligent course recommendation in IoT-aware e-learning systems

  • Xia Chunqin1,2 &
  • Wu Peixi3 

Scientific Reports , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

The necessity for recommendation models that can capture both semantic information and device-mediated learner interactions has increased due to the rapid growth of IoT-aware e-learning environments. IoT-enhanced learning in this context refers to intelligent learning platforms that continuously create and log heterogeneous interaction data, including session dynamics, access patterns across linked devices, and engagement behaviors. This work introduces a coherent hybrid framework that combines an Actor–Critic reinforcement learning agent optimized using Proximal Policy Optimization (PPO) with BERT-based semantic encoding. By combining textual content with context-aware interaction logs gathered from intelligent learning platforms, the method creates richer learner representations. While a Mahalanobis distance module offers correlation-aware similarity cues to enhance resilience under sparse and high-dimensional data, these representations allow the Actor–Critic agent to constantly improve its recommendation policy. The usefulness of the suggested framework for IoT-aware intelligent e-learning systems is demonstrated by experiments conducted on three public MOOC datasets, which show steady improvements over robust baselines.

Data availability

The datasets utilized and/or examined in the current investigation are accessible from the corresponding author upon reasonable request.

References

  1. Hoi, S. C., Sahoo, D., Lu, J. & Zhao, P. Online learning: A comprehensive survey. Neurocomputing 459, 249–289 (2021).

    Google Scholar 

  2. Madni, S. H. H. et al. Factors influencing the adoption of IoT for E-learning in higher educational institutes in developing countries. Front. Psychol. 13, 915596 (2022).

    Google Scholar 

  3. Kapoor, K. edX courses dataset 2021 [Dataset]. Kaggle. (2021). https://www.kaggle.com/datasets/khusheekapoor/edx-courses-dataset-2021

  4. Tahir, S., Hafeez, Y., Abbas, M. A., Nawaz, A. & Hamid, B. Smart learning objects retrieval for E-Learning with contextual recommendation based on collaborative filtering. Educ. Inform. Technol. 27 (6), 8631–8668 (2022).

    Google Scholar 

  5. Torkashvand, A., Jameii, S. M. & Reza, A. Deep learning-based collaborative filtering recommender systems: a comprehensive and systematic review. Neural Comput. Appl. 35 (35), 24783–24827 (2023).

    Google Scholar 

  6. Huang, C. Q. et al. XKT: toward explainable knowledge tracing model with cognitive learning theories for questions of multiple knowledge concepts. IEEE Trans. Knowl. Data Eng. 36 (11), 7308–7325 (2024).

    Google Scholar 

  7. Embarak, O. An adaptive paradigm for smart education systems in smart cities using the internet of behaviour (IoB) and explainable artificial intelligence (XAI). In 2022 8th International Conference on Information Technology Trends (ITT) (pp. 74–79). IEEE. (2022).

  8. Maatuk, A. M., Elberkawi, E. K., Aljawarneh, S., Rashaideh, H. & Alharbi, H. The COVID-19 pandemic and E-learning: challenges and opportunities from the perspective of students and instructors. J. Comput. High. Educ. 34 (1), 21–38 (2022).

    Google Scholar 

  9. Zeng, F., Tang, R. & Wang, Y. User personalized recommendation algorithm based on GRU network model in social networks. Mobile information systems, 2022(1), 1487586. (2022).

  10. Huang, F., Bei, Y., Yang, Z., Jiang, J., Chen, H., Shen, Q., … Yu, P. S. (2025, March).Large language model simulator for cold-start recommendation. In Proceedings of the Eighteenth ACM International Conference on Web Search and Data Mining (pp. 261–270).

  11. Nikolakopoulos, A. N., Ning, X., Desrosiers, C. & Karypis, G. Trust your neighbors: A comprehensive survey of neighborhood-based methods for recommender systems. Recommender Syst. Handb., 39–89. (2021).

  12. Roy, D. & Dutta, M. A systematic review and research perspective on recommender systems. J. Big Data. 9 (1), 59 (2022).

    Google Scholar 

  13. Wang, Y., Ma, W., Zhang, M., Liu, Y. & Ma, S. A survey on the fairness of recommender systems. ACM Trans. Inform. Syst. 41 (3), 1–43 (2023).

    Google Scholar 

  14. Romero, C. (ed, S.) Educational data mining and learning analytics: An updated survey. Wiley interdisciplinary reviews: Data Min. Knowl. discovery 10 3 e1355 (2020).

    Google Scholar 

  15. Xu, H., Zhao, N., Xu, N., Niu, B. & Zhao, X. Reinforcement learning-based dynamic event-triggered prescribed performance control for nonlinear systems with input delay. Int. J. Syst. Sci., 1–16. (2025).

  16. Perifanis, V. & Efraimidis, P. S. Federated neural collaborative filtering. Knowl. Based Syst. 242, 108441 (2022).

    Google Scholar 

  17. Thorat, S. A., Ashwini, G. & Seema, M. Survey on collaborative and content-based recommendation systems. In 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT) (pp. 1541–1548). IEEE. (2023).

  18. Bai, T., Wen, J. R., Zhang, J. & Zhao, W. X. A neural collaborative filtering model with interaction-based neighborhood. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (pp. 1979–1982). (2017).

  19. Zhao, X. et al. Deep reinforcement learning for list-wise recommendations. arXiv preprint arXiv :180100209. (2017).

  20. Alshareet, O. & Awasthi, A. Collaborative filtering in the age of AI: foundations, innovations, and emerging trends. Computing 107 (11), 1–56 (2025).

    Google Scholar 

  21. Biswas, P. K. & Liu, S. A hybrid recommender system for recommending smartphones to prospective customers. Expert Syst. Appl. 208, 118058 (2022).

    Google Scholar 

  22. Yuan, H. & Hernandez, A. A. User cold start problem in recommendation systems: A systematic review. IEEE access. 11, 136958–136977 (2023).

    Google Scholar 

  23. Lu, J., Wu, D., Mao, M., Wang, W. & Zhang, G. Recommender system application developments: a survey. Decis. Support Syst. 74, 12–32 (2015).

    Google Scholar 

  24. Xiao, J. et al. Attentional factorization machines: Learning the weight of feature interactions via attention networks. arXiv preprint arXiv :170804617. (2017).

  25. Guo, H., Tang, R., Ye, Y., Li, Z. & He, X. DeepFM: a factorization-machine based neural network for CTR prediction. arXiv preprint arXiv :170304247. (2023).

  26. Chen, X., Yao, L., McAuley, J., Zhou, G. & Wang, X. Deep reinforcement learning in recommender systems: A survey and new perspectives. Knowl. Based Syst. 264, 110335 (2023).

    Google Scholar 

  27. Chen, M. et al. Off-policy actor-critic for recommender systems. In Proceedings of the 16th ACM Conference on Recommender Systems (pp. 338–349). (2022).

  28. Wang, Y., Ma, D., Ma, J. & Jin, Q. HGCR: A heterogeneous graph-enhanced interactive course recommendation scheme for online learning. IEEE Trans. Learn. Technol. 17, 364–374 (2023).

    Google Scholar 

  29. Sun, F. et al. BERT4Rec: Sequential recommendation with bidirectional encoder representations from transformer. In Proceedings of the 28th ACM international conference on information and knowledge management (pp. 1441–1450). (2019), November.

  30. Deng, W., Zhu, P., Chen, H., Yuan, T. & Wu, J. Knowledge-aware sequence modelling with deep learning for online course recommendation. Inf. Process. Manag. 60 (4), 103377 (2023).

    Google Scholar 

  31. Chen, J. et al. SR-HetGNN: session-based recommendation with heterogeneous graph neural network. Knowl. Inf. Syst. 66 (2), 1111–1134 (2024).

    Google Scholar 

  32. Chinnadurai, J., Karthik, A., Ramesh, J. V. N., Banerjee, S., Rajlakshmi, P. V., Rao,K. V., … Rajaram, A. (2024). Enhancing online education recommendations through clustering-driven deep learning. Biomedical Signal Processing and Control, 97, 106669.

  33. Chen, X., Wang, X., Wang, Y., Liu, D. & Zhang, W. Leveraging deep learning and graph analysis for enhanced course recommendations in online education. Sci. Rep. 15 (1), 18623 (2025).

    Google Scholar 

  34. Sattar, A. & Bacciu, D. Graph neural network for context-aware recommendation. Neural Process. Lett. 55 (5), 5357–5376 (2023).

    Google Scholar 

  35. Pulido-Gaytan, L. B., Tchernykh, A., Cortés-Mendoza, J. M., Babenko, M. & Radchenko, G. A survey on privacy-preserving machine learning with fully homomorphic encryption. In Latin American High Performance Computing Conference (pp. 115–129). Cham: Springer International Publishing. (2020).

  36. THU-KEG. MOOCCube dataset [Dataset]. GitHub. (2020). https://github.com/THU-KEG/MOOCCubeX

  37. NTHU MOOCs dataset. [Dataset]. GitHub. (2021). https://github.com/lingm5038-ctrl/NTHU-MOOCs-Data

Download references

Funding

The authors did not obtain any financial assistance for this study.

Author information

Authors and Affiliations

  1. Engineering Experimental Teaching Department, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China

    Xia Chunqin

  2. National Experimental Teaching Demonstration Center for Electronic Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China

    Xia Chunqin

  3. Shanghai Huiyi Information Technology Co., Ltd, Pudong New Area, Shanghai, 200120, China

    Wu Peixi

Authors
  1. Xia Chunqin
    View author publications

    Search author on:PubMed Google Scholar

  2. Wu Peixi
    View author publications

    Search author on:PubMed Google Scholar

Contributions

All authors contributed to the conception and design of the study. Xia Chunqin and Wu Peixi conducted data collection, simulation, and analysis.

Corresponding author

Correspondence to Xia Chunqin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Not relevant.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chunqin, X., Peixi, W. A hybrid actor–critic and BERT framework for intelligent course recommendation in IoT-aware e-learning systems. Sci Rep (2026). https://doi.org/10.1038/s41598-026-40952-2

Download citation

  • Received: 14 December 2025

  • Accepted: 17 February 2026

  • Published: 23 February 2026

  • DOI: https://doi.org/10.1038/s41598-026-40952-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Recommender systems
  • Deep reinforcement learning
  • BERT-based feature extraction
  • Mahalanobis distance
  • Personalized learning
  • IoT-enabled education
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics