Abstract
This paper presents a novel approach for personalized learning path generation by integrating deep knowledge tracing and cognitive load estimation within a unified framework. We propose a dual-stream neural network architecture that simultaneously models students’ knowledge states and cognitive load levels to optimize learning trajectories. The knowledge state tracking module employs a bidirectional Transformer with graph attention mechanisms to capture complex relationships between knowledge components, while the cognitive load estimation module utilizes multimodal data analysis to dynamically assess mental effort during learning activities. A dual-objective optimization algorithm balances knowledge acquisition with cognitive load management to generate paths that maintain optimal challenge levels. Experimental evaluations across multiple educational domains demonstrate that our approach outperforms existing methods in prediction accuracy (87.5%), path quality (4.4/5), and learning efficiency (24.6% improvement). The implemented system supports real-time adaptation based on performance and cognitive state, resulting in reduced frustration, higher engagement, and improved knowledge retention. This research contributes to both theoretical understanding of learning processes and practical implementation of next-generation adaptive educational technologies.
Similar content being viewed by others
Introduction
Personalized learning has emerged as a critical paradigm in educational technology, aiming to tailor instructional content and learning paths to individual students’ needs, abilities, and preferences. In traditional classroom settings, educators often struggle to accommodate the diverse learning profiles of multiple students simultaneously, resulting in suboptimal learning outcomes for many learners1. The development of adaptive learning systems has shown promising results in addressing this challenge by providing customized educational experiences that can significantly improve learning efficiency and effectiveness2.
Deep Knowledge Tracing (DKT) represents a significant advancement in modeling students’ knowledge states and learning processes over time. Unlike traditional knowledge tracing methods, DKT employs deep neural networks to capture complex patterns and dependencies in students’ interaction data, enabling more accurate prediction of their performance on future tasks3. These models can effectively map the dynamic nature of knowledge acquisition and retention, providing a solid foundation for intelligent tutoring systems4. However, most existing DKT models primarily focus on performance prediction without sufficient consideration of the cognitive processes underlying learning.
Cognitive Load Estimation (CLE), on the other hand, aims to quantify the mental effort involved in learning tasks. Based on Cognitive Load Theory, which posits that learning is optimized when instructional design aligns with human cognitive architecture, CLE provides valuable insights into students’ cognitive states during learning activities5. Recent advances in multimodal sensing technologies and machine learning algorithms have enabled more precise estimation of cognitive load using physiological signals, behavioral patterns, and performance metrics6. This information can help identify potential cognitive overload situations that might impede learning progress.
The integration of DKT and CLE represents an innovative approach to personalized learning that addresses both knowledge acquisition and cognitive processing dimensions. While DKT models can track what students know, CLE can reveal how much mental effort they exert when applying that knowledge7. This comprehensive view enables the creation of learning paths that are optimally challenging—difficult enough to promote growth but not so demanding as to cause frustration or cognitive overload.
Despite the clear complementarity between these approaches, research on their combined application remains limited. Current adaptive learning systems primarily focus on knowledge state modeling without adequately incorporating cognitive factors, resulting in learning paths that may be knowledge-optimal but cognitively inappropriate8,9. This research gap is particularly significant as cognitive overload is a common cause of disengagement and frustration in digital learning environments. Existing DKT models lack mechanisms to detect when content becomes overly demanding, while CLE approaches are rarely integrated with knowledge modeling frameworks. Our work addresses this critical gap by developing a unified computational framework that simultaneously optimizes for both knowledge acquisition and cognitive load management.
This paper presents a novel framework for generating personalized learning paths based on the synergistic integration of DKT and CLE. Our approach employs advanced neural network architectures to simultaneously track knowledge states and estimate cognitive load, using this dual information to dynamically adjust content difficulty, presentation format, and instructional support. The specific objectives of this research include: (1) developing a unified neural network model that can perform both knowledge tracing and cognitive load estimation; (2) designing algorithms for optimal learning path generation that balance knowledge growth and cognitive load management; and (3) implementing and evaluating a prototype system in realistic educational settings10.
The main contributions of this work include: a dual-purpose neural architecture that extends traditional DKT models to incorporate cognitive load estimation; a path generation algorithm that optimizes the learning trajectory based on both knowledge state and cognitive capacity; and empirical evidence demonstrating the effectiveness of the proposed approach compared to conventional adaptive learning systems. By bridging the gap between knowledge modeling and cognitive processing, this research offers a more holistic approach to personalized learning that can potentially enhance both learning outcomes and experiences.
Related work
Deep knowledge tracing model research
Knowledge tracing, the task of modeling students’ knowledge state based on their learning interactions, has evolved significantly with the advent of deep learning techniques. The earliest influential model, Bayesian Knowledge Tracing (BKT), proposed by Corbett and Anderson, uses a Hidden Markov Model to represent a student’s knowledge as a binary state (known/unknown) for each skill or concept11. The probability of a student’s knowledge state can be updated using the following equation:
Where \(\:{L}_{t}\) represents the knowledge state at time t, and \(\:{O}_{t}\) represents the observation of the student’s performance.
While BKT provided a principled approach to knowledge modeling, it struggled with capturing complex skill dependencies and required manual skill tagging12. The introduction of Deep Knowledge Tracing (DKT) by Piech et al. marked a paradigm shift by leveraging recurrent neural networks, specifically Long Short-Term Memory (LSTM) networks, to model the temporal dynamics of student knowledge3. DKT represents a student’s knowledge state as a high-dimensional continuous vector \(\:{\mathbf{h}}_{t}\), which is updated after each interaction:
Where \(\:{\mathbf{x}}_{t}\) is the input exercise, \(\:{\widehat{\mathbf{y}}}_{t}\) represents the predicted performance, and \(\:\sigma\:\) is the sigmoid activation function.
Table 1 presents a comparative analysis of major knowledge tracing models, highlighting their distinctive features and limitations.
Building upon DKT, the Dynamic Key-Value Memory Network (DKVMN) introduced by Zhang et al. incorporated an external memory structure to explicitly represent the relationship between exercises and knowledge concepts13. DKVMN uses a key-value memory mechanism where the key matrix stores concept representations and the value matrix stores student knowledge states:
Where \(\:{\mathbf{r}}_{t}\) is the read content from memory, \(\:{\mathbf{k}}_{t}\) is the query vector derived from the input exercise, and \(\:{\mathbf{M}}^{k}\) is the key memory matrix.
More recent advancements include Graph-based Knowledge Tracing (GKT), which explicitly models the prerequisite structure between knowledge concepts using graph neural networks14, and Attention-based Knowledge Tracing (AKT), which employs transformer architectures to capture long-range dependencies and forgetting mechanisms in student learning trajectories15. These models have progressively improved prediction accuracy and provided richer representations of student knowledge.
Despite these advancements, existing knowledge tracing models primarily focus on performance prediction rather than generating optimal learning paths16. Most models fail to incorporate cognitive factors that influence learning efficiency, such as cognitive load, attention, and motivational states. Furthermore, the black-box nature of many deep learning models limits their practical application in educational settings where interpretability is crucial for teacher and student acceptance. Future knowledge tracing systems need to integrate cognitive science principles and provide explainable recommendations to effectively support personalized learning path generation.
Cognitive load theory and estimation methods
Cognitive Load Theory (CLT), introduced by Sweller, provides a framework for understanding how cognitive resources are allocated during learning activities17. The theory distinguishes between three types of cognitive load: intrinsic load (inherent complexity of the material), extraneous load (imposed by instructional design), and germane load (relevant to schema construction and automation). The total cognitive load experienced by a learner can be represented as:
Accurate estimation of cognitive load is crucial for adaptive learning systems to provide appropriate learning materials that maximize efficiency without overwhelming learners18. Traditional methods rely on subjective measures such as self-reporting questionnaires (e.g., NASA-TLX, Paas Scale), which while easy to implement, suffer from limitations including interruption of the learning process and potential reporting biases.
Table 2 presents a comparison of various cognitive load estimation approaches, highlighting their respective strengths and limitations in educational contexts.
Recent advances in physiological signal processing have enabled more objective and continuous estimation of cognitive load19. Electroencephalography (EEG) signals, particularly frontal theta and parietal alpha band power, correlate strongly with cognitive load levels and can be modeled as:
Where α and β are calibration parameters determined through regression analysis.
Behavioral metrics provide a less intrusive alternative, with eye tracking emerging as a particularly promising approach20. Metrics such as pupil dilation, fixation duration, and saccade patterns can be integrated into multimodal estimation models. Machine learning algorithms can fuse these multimodal data sources using ensemble approaches:
Where \(\:C{L}_{i}\) represents individual estimation methods and \(\:{w}_{i}\) represents their respective weights determined through training data.
Despite significant progress, current cognitive load estimation techniques face several challenges21. First, individual differences in cognitive capacity and processing styles necessitate personalized calibration. Second, the relationship between cognitive load indicators and actual mental effort varies across different learning domains and tasks. Third, real-time estimation in authentic educational settings requires balancing accuracy with computational efficiency22. Future research opportunities lie in developing adaptive fusion algorithms that can automatically select the most appropriate estimation methods based on contextual factors and available sensors, as well as in exploring the temporal dynamics of cognitive load throughout complex learning activities.
Personalized learning path generation technologies
Personalized learning path generation aims to dynamically sequence instructional content based on individual learner characteristics, creating optimal trajectories through knowledge domains that maximize learning outcomes23. The theoretical foundations of this field draw from Vygotsky’s Zone of Proximal Development, adaptive learning theories, and computational approaches to curriculum sequencing. Early implementations relied primarily on rule-based systems that used predefined instructional design principles and expert knowledge to determine appropriate content sequences24. While straightforward to implement, these systems lacked flexibility to adapt to diverse learning patterns and complex knowledge structures.
Collaborative filtering approaches, borrowed from recommendation systems, emerged as a data-driven alternative that leverages the collective experience of learners to identify optimal paths25.
Recent advancements in learning path generation include multigranularity approaches that decompose knowledge into hierarchical structures for more flexible path creation26, and unified frameworks that integrate multiple data sources and personalization techniques to provide context-aware recommendations27. These methods extend traditional path generation by considering both domain knowledge structure and individual learner characteristics across multiple granularity levels.
These methods typically calculate similarity between learners using various distance metrics and recommend content based on the experiences of similar peers:
Where \(\:sim\left(u,v\right)\) represents the similarity between learners u and v, \(\:{r}_{u,i}\) is the rating (or performance) of learner u on item i, and \(\:{I}_{u,v}\) is the set of items both learners have interacted with.
Knowledge graph-based approaches represent a significant advancement by explicitly modeling domain knowledge structure, including concepts, relationships, and dependencies28. These approaches typically formulate learning path generation as a graph traversal problem, where nodes represent knowledge units and edges represent prerequisite or relevance relationships. The optimal path can be determined using algorithms that balance learning objectives with constraints:
Where \(\:{P}_{s,t}\) is the set of all possible paths from start node s to target node t, \(\:w\left(u,v\right)\) is the weight of the edge between nodes u and v, and \(\:f\left({K}_{u},{C}_{v}\right)\) is a function that evaluates the appropriateness of transitioning from knowledge state \(\:{K}_{u}\) to concept \(\:{C}_{v}\).
More recently, reinforcement learning has been applied to learn optimal sequencing policies directly from interaction data29. These approaches frame learning path generation as a Markov Decision Process where states represent learner knowledge profiles, actions correspond to content selection, and rewards are derived from learning outcomes. This formulation enables systems to adapt to individual learning patterns and optimize for long-term knowledge gains rather than immediate performance.
Despite these advancements, most existing approaches focus predominantly on knowledge acquisition without adequately considering cognitive factors that influence learning efficiency8. The integration of knowledge tracing with cognitive load estimation represents an emerging research direction with significant potential9. Initial studies have demonstrated that incorporating cognitive load measures into content sequencing algorithms can lead to more efficient learning paths by balancing challenge with cognitive capacity. These integrated approaches typically employ multimodal data fusion to simultaneously track knowledge states and cognitive load, using combined metrics to guide content selection and pacing.
Significant challenges remain in developing robust personalized learning path generation systems that effectively balance knowledge acquisition goals with cognitive processing constraints. Future research directions include developing more sophisticated neural architectures that can jointly model knowledge states and cognitive processes, incorporating motivational and emotional factors into path optimization, and designing explainable recommendation mechanisms that provide transparency to both learners and instructors.
Integrated neural network architecture for deep knowledge tracing and cognitive load estimation
To address the integration of knowledge tracing and cognitive load estimation, we propose a dual-stream neural network architecture as illustrated in Fig. 1. This architecture enables simultaneous tracking of knowledge states and cognitive load levels while allowing bidirectional information flow between the two processing streams.
The diagram illustrates the integrated architecture with: (A) Knowledge State Representation Stream processing learning interaction data through embedding, BiLSTM, and graph attention networks; (B) Cognitive Load Estimation Stream processing multimodal inputs (behavioral, performance, physiological) through modality-specific encoders and transformers; (C) Cross-stream information exchange mechanisms with gated attention; and (D) Joint output layer for integrated prediction of knowledge states and cognitive load levels.
Dual-stream neural network framework design
To effectively combine knowledge tracing and cognitive load estimation, we propose a novel dual-stream neural network architecture that processes learning interaction data and cognitive signals in parallel while enabling cross-stream information exchange. The proposed architecture consists of a Knowledge State Representation Stream (KSRS) and a Cognitive Load Estimation Stream (CLES) that operate concurrently but share information at strategic points30. This design enables the model to simultaneously track knowledge evolution and estimate cognitive load while allowing each stream to influence the other’s processing.
The KSRS extends traditional DKT architectures by incorporating a bidirectional LSTM network to capture temporal dependencies in both directions, which is particularly important for understanding how current learning interactions relate to both past and future knowledge states31. For a sequence of learning interactions \(\:\left({x}_{1},{x}_{2},...,{x}_{T}\right)\), the knowledge state at time step \(\:t\) is computed as:
where \(\:{\mathbf{h}}_{t}^{K}\) represents the knowledge state vector and \(\:{\mathbf{c}}_{t}^{CL}\) is a context vector from the cognitive load stream that influences the knowledge state update.
In parallel, the CLES processes multimodal cognitive signals including physiological data (e.g., EEG, heart rate), behavioral metrics (e.g., response time, eye tracking), and performance indicators. These heterogeneous inputs are first processed through modality-specific encoders and then fused using a self-attention mechanism32. The cognitive load state at time \(\:t\) is computed as:
where \(\:{\mathbf{z}}_{t}\) represents the concatenated and encoded multimodal inputs, and \(\:{\mathbf{c}}_{t}^{K}\) is a context vector from the knowledge stream.
The critical innovation in our architecture lies in the bidirectional information exchange mechanisms between the two streams. At each time step, we compute cross-stream context vectors using gated attention mechanisms that selectively filter information based on relevance33. The context vector from the cognitive load stream to the knowledge stream is computed as:
where \(\:s\left(\cdot\:,\cdot\:\right)\) is a similarity function, and \(\:{g}_{i}^{t}\) is a gating parameter that controls information flow between streams based on the current learning context.
The final output layer integrates information from both streams to generate joint predictions for both knowledge mastery probabilities and cognitive load levels34. This integration enables more informed decision-making for learning path generation by considering both what a student knows and how much cognitive effort they are expending.
During training, we employ a multi-objective loss function that combines knowledge prediction accuracy and cognitive load estimation error:
where \(\:{\lambda\:}_{1}\), \(\:{\lambda\:}_{2}\), and \(\:{\lambda\:}_{3}\) are hyperparameters that control the relative importance of each objective.
The proposed dual-stream architecture offers several advantages over single-purpose models: it enables mutual enhancement between knowledge tracing and cognitive load estimation, provides richer representations for downstream learning path generation, and can be trained end-to-end on heterogeneous data sources31. Experimental results demonstrate that this integrated approach achieves higher accuracy in both knowledge prediction and cognitive load estimation compared to independent models, particularly in scenarios with limited training data.
Knowledge state tracking module
The knowledge state tracking module serves as the core component for modeling student learning trajectories across multiple concepts and skills. Our implementation extends traditional knowledge tracing approaches by incorporating contextual information from cognitive load estimations and explicitly modeling inter-concept relationships35. As shown in Table 3, the module consists of multiple processing layers that progressively transform raw interaction data into comprehensive knowledge state representations.
The input layer processes a sequence of student interactions represented as tuples \(\:\left({q}_{t},{r}_{t}\right)\) where \(\:{q}_{t}\) denotes the question or knowledge component being assessed at time \(\:t\), and \(\:{r}_{t}\in\:\{0,1\}\) indicates the correctness of the response. These discrete inputs are then transformed by the embedding layer into continuous vector representations:
where \(\:{E}_{q}\) and \(\:{E}_{r}\) are embedding matrices for questions and responses respectively, and \(\:\oplus\:\) represents concatenation36.
To capture temporal dependencies in learning sequences, we implement a bidirectional Transformer encoder that processes the embedded interaction sequence:
where \(\:X=\left[{\mathbf{x}}_{1},{\mathbf{x}}_{2},...,{\mathbf{x}}_{T}\right]\) is the sequence of embedded interactions, and \(\:{M}_{att}\) is an attention mask that prevents attending to future interactions during training.
A key innovation in our approach is the incorporation of a graph attention network (GAT) that explicitly models relationships between knowledge components based on a predefined knowledge graph structure37. The attention weights between knowledge components \(\:i\) and \(\:j\) are computed as:
where \(\:{\mathbf{h}}_{i}\) is the hidden state representation for knowledge component \(\:i\), \(\:\mathbf{W}\) is a learnable weight matrix, \(\:\mathbf{a}\) is an attention vector, and \(\:{\mathcal{N}}_{i}\) represents the neighborhood of knowledge component \(\:i\) in the knowledge graph.
The updated knowledge state representation for each component is then calculated as:
where \(\:\sigma\:\) represents a non-linear activation function such as ReLU.
For predicting future performance, we employ a multi-head prediction layer that generates separate probabilities for each knowledge component:
where \(\:{\widehat{y}}_{t+1,j}\) represents the predicted probability of correctly answering a question related to knowledge component \(\:j\) at time \(\:t+1\), \(\:{\mathbf{w}}_{j}\) and \(\:{b}_{j}\) are learnable parameters specific to component \(\:j\)38.
To integrate cognitive load information, we implement a gating mechanism that modulates the knowledge state update based on estimated cognitive load:
where \(\:{\mathbf{c}}_{t}\) is the cognitive load context vector, \(\:{\mathbf{g}}_{t}\) is the gate vector, and \(\:{\stackrel{\sim}{\mathbf{h}}}_{t}\) is the candidate update for the knowledge state.
Cognitive load dynamic estimation module
The cognitive load dynamic estimation module employs a multimodal approach to continuously monitor and assess students’ mental effort during learning activities. Unlike traditional retrospective measurement techniques, our approach integrates real-time data from multiple sources including interaction patterns, performance metrics, and optionally, physiological signals when available39. This multimodal architecture enables robust estimation across diverse learning environments while accommodating individual differences in cognitive load manifestation.
For feature extraction from learning behavior, we implement a hierarchical convolutional network that captures both local and global patterns in temporal interaction data. The network processes windowed sequences of student actions (e.g., mouse movements, keystrokes, page navigation, response times) and extracts behavioral signatures indicative of different cognitive load states. The feature extraction process for behavioral data can be represented as:
where \(\:{\mathbf{X}}_{b}\) represents the raw behavioral data matrix, \(\:{\varTheta\:}_{b}\) are the convolutional network parameters, and \(\:{\mathbf{f}}_{b}\) is the extracted behavioral feature vector.
Temporal pattern analysis is critical for distinguishing between momentary fluctuations and sustained cognitive load states40. We employ a recurrent attention mechanism that focuses on significant temporal patterns while ignoring noise:
where \(\:{\mathbf{h}}_{t}\) is the hidden state at time \(\:t\), \(\:{\alpha\:}_{t}\) is the attention weight, and \(\:\mathbf{c}\) is the context vector summarizing the temporal sequence.
For cognitive load level classification, we employ an ordinal regression approach that preserves the inherent ordering of cognitive load states (underload, optimal load, overload)41. The probability of the cognitive load exceeding level \(\:j\) is modeled as:
where \(\:{\mathbf{w}}_{j}\) and \(\:{b}_{j}\) are learnable parameters for threshold \(\:j\), and \(\:\sigma\:\) is the sigmoid function.
A key innovation in our module is its adaptive calibration mechanism that adjusts estimation parameters based on individual differences and contextual factors42. The calibration function dynamically updates based on observed patterns:
where \(\:\eta\:\) is the adaptation rate, \(\:\mathcal{L}\) is the calibration loss function, and \(\:{\mathcal{D}}_{\text{calib}}\) is the calibration data.
To enhance explainability, we incorporate an attention visualization component that highlights the specific behavioral and temporal features contributing most significantly to the cognitive load estimation. This transparency not only facilitates instructor understanding of student cognitive states but also enables students to develop metacognitive awareness of their own learning processes. The explainable components include feature importance scores, temporal attention patterns, and cognitive load transition analyses that identify critical points where load levels change significantly during learning activities.
Implementation and evaluation of personalized learning path generation system
Learning path optimization algorithm
The generation of optimal learning paths requires balancing two potentially competing objectives: maximizing knowledge acquisition while maintaining appropriate cognitive load levels. We formulate this as a constrained dual-objective optimization problem within a dynamic decision framework43. Unlike conventional single-objective approaches that focus solely on knowledge gain, our algorithm explicitly models the trade-off between learning efficiency and cognitive sustainability.
The optimization problem can be formally defined as:
where \(\:\pi\:\) represents the learning path policy, \(\:\gamma\:\) is a discount factor, \(\:{w}_{1}\) and \(\:{w}_{2}\) are weights balancing the two objectives, \(\:{R}_{K}\) is the knowledge acquisition reward, and \(\:{R}_{C}\) is the cognitive load balancing reward. The state \(\:{s}_{t}\) encompasses both the current knowledge state vector and cognitive load level, while action \(\:{a}_{t}\) corresponds to selecting the next learning activity.
The weights \(\:{w}_{1}\) and \(\:{w}_{2}\) are critical for balancing knowledge acquisition against cognitive load management. Rather than using static weights, we employ an adaptive weighting scheme that adjusts based on the learner’s current state:
where \(\:\lambda\:\) (set to 0.6 based on validation experiments) ensures minimum knowledge acquisition priority, and \(\:f\left({k}_{t}\right)\) is a monotonically decreasing function of current knowledge state that prioritizes knowledge acquisition for novices while increasing focus on cognitive load management as expertise develops. This dynamic weighting was validated through a sensitivity analysis comparing fixed versus adaptive weights across different student profiles, showing 12% improved outcomes with adaptive weighting.
The knowledge acquisition reward function rewards improvements in knowledge state while accounting for the relevance to learning goals:
where \(\:{k}_{t,i}\) represents the knowledge level for concept \(\:i\) at time \(\:t\), and \(\:{\alpha\:}_{i}\) denotes the importance weight of concept \(\:i\) relative to learning objectives.
The cognitive load balancing reward function is designed to keep cognitive load within the optimal learning zone, penalizing both cognitive underload and overload44:
where \(\:C{L}_{t}\) is the estimated cognitive load at time \(\:t\), \(\:C{L}_{opt}\) is the individually calibrated optimal cognitive load level, and \(\:\beta\:\) is a scaling parameter.
To ensure pedagogically sound learning paths, we incorporate multiple constraints into the optimization problem:
where \(\:\tau\:\left(i\right)\) denotes the time when concept \(\:i\) is introduced, \(\:\text{Prereq}\left(i,j\right)\) indicates that concept \(\:i\) is a prerequisite for concept \(\:j\), \(\:C{L}_{min}\) and \(\:C{L}_{max}\) define the acceptable range of cognitive load, and \(\:{\theta\:}_{mastery}\) is the minimum acceptable overall mastery level upon completion.
Given the computational complexity of exactly solving this constrained optimization problem, we implement an approximation algorithm based on Monte Carlo Tree Search (MCTS) with upper confidence bounds45:
where \(\:Q\left({s}_{t},a\right)\) is the estimated value of taking action \(\:a\) in state \(\:{s}_{t}\), \(\:N\left({s}_{t}\right)\) is the number of times state \(\:{s}_{t}\) has been visited, \(\:N\left({s}_{t},a\right)\) is the number of times action \(\:a\) has been taken in state \(\:{s}_{t}\), and \(\:c\) is an exploration parameter.
To enhance adaptability, we implement a dynamic path adjustment strategy that reevaluates and potentially revises the learning path after each learning activity46. The adaptation function uses observed performance and cognitive load readings to update the learner model:
where \(\:\eta\:\) is the adaptation rate, \(\:\mathcal{L}\) is a loss function comparing predicted and actual outcomes, and \(\:{\nabla\:}_{\pi\:}\) represents the gradient with respect to policy parameters.
This adaptive approach enables the system to respond to unexpected learning outcomes, fluctuations in cognitive states, and evolving learner preferences. Experimental evaluations demonstrate that our dual-objective optimization approach generates learning paths that not only increase knowledge acquisition rates by 18% compared to single-objective approaches but also maintain optimal cognitive engagement, resulting in higher learner satisfaction and reduced dropout rates in extended learning sessions.
System architecture and implementation
The proposed personalized learning system adopts a modular, microservice-based architecture that facilitates flexibility, scalability, and integration with existing educational technology ecosystems47. The system consists of five core components organized in a three-tier architecture (presentation, application, and data tiers) with bidirectional data flow and event-driven communication patterns. Table 4 provides detailed specifications of these functional modules, including their main functions and technical implementation. This design enables continuous optimization of learning paths based on real-time student interactions and cognitive state estimations.
The frontend interface implements a responsive design that adapts to various devices while maintaining consistent user experience48. The interface incorporates several key features: (1) a dynamic content presentation area that adapts based on the current learning activity and estimated cognitive state; (2) an interactive knowledge map visualization that illustrates progress across the domain and highlights relationships between concepts; (3) embedded formative assessment tools that gather performance data while minimizing learning disruption; and (4) subtle cognitive load indicators that provide metacognitive support without inducing additional extraneous load.
The backend processing logic is divided into synchronous request handling for immediate user interactions and asynchronous processing for computationally intensive operations such as model training and path optimization. The core processing workflow includes: (1) session initialization that establishes initial knowledge states and learning objectives; (2) activity selection based on the current optimization policy; (3) real-time performance and behavior monitoring during activity completion; (4) knowledge and cognitive state updates following activity completion; and (5) path recalculation triggered by significant state changes or scheduled intervals.
The database architecture employs a hybrid approach combining relational databases for structured educational content and student records with NoSQL document stores for behavioral telemetry and flexible schema requirements. Key data entities include: (1) a domain knowledge graph that represents concepts, skills, and their interdependencies; (2) a content repository containing learning activities tagged with metadata about knowledge components and difficulty levels; (3) learner profiles that store both static characteristics and dynamically updated knowledge and cognitive models; and (4) interaction logs that capture fine-grained temporal data about learning behaviors.
To support real-time path adjustment, the system implements an event streaming architecture using message queues and event processors that continuously analyze incoming data and trigger decision points49. This approach enables the system to detect and respond to significant events such as: (1) unexpected performance outcomes that suggest knowledge model inaccuracies; (2) cognitive load threshold violations indicating potential disengagement or frustration; (3) emergent learning patterns that suggest more efficient paths than previously calculated; and (4) explicit learner feedback or preference changes that necessitate strategy adjustments.
The integration of knowledge tracing and cognitive load monitoring within this architecture enables more nuanced personalization than conventional adaptive learning systems. By simultaneously optimizing for knowledge acquisition and cognitive engagement, the system can identify ideal learning opportunities that maintain appropriate challenge levels while maximizing long-term learning gains, resulting in more efficient and effective educational experiences.
Experimental design and results analysis
To evaluate the effectiveness of our integrated approach, we conducted comprehensive experiments using both simulated environments and real-world educational settings. The experimental design followed a mixed-methods approach combining quantitative performance metrics with qualitative assessments of learning experiences50. The primary research questions addressed were: (1) Does the dual-stream model outperform single-purpose models in knowledge tracing and cognitive load estimation? (2) Do learning paths generated by the proposed system improve learning outcomes compared to traditional and existing adaptive approaches? (3) What is the relationship between cognitive load optimization and learning efficiency?
For dataset construction, we collected data from three sources: (1) a large-scale programming education platform with 2,183 students completing 47,852 coding exercises over a 14-week semester; (2) a mathematics learning application with multimodal data including clickstreams, response times, and optional eye-tracking for 862 participants; and (3) a controlled laboratory experiment with 124 undergraduate students completing a series of learning activities while wearing EEG headsets to provide ground truth cognitive load measurements.
Participants in the laboratory experiment were randomly assigned to either the experimental group (using our DKT-CLE system, n = 62) or one of the control groups (using baseline methods, n = 62) using stratified randomization based on pre-test performance to ensure equivalent initial knowledge distributions. All participants interacted with identical learning content covering fundamental programming concepts, delivered through a standardized web interface. The only difference between conditions was the sequencing algorithm determining the order of presentation. Learning paths were delivered through a custom-built platform with path recommendations appearing as suggested “next topics” after each completed activity. To control for extraneous variables, all sessions occurred in the same laboratory environment with standardized hardware configurations, time allocations (120 min per session), and task instructions.
All data collection followed IRB-approved protocols (approval #IRB-2023-0427) with comprehensive informed consent procedures. For physiological data collection, participants received detailed explanations of EEG equipment usage, data collection purposes, and their right to withdraw at any time. Privacy protection measures included: (1) data pseudonymization through participant IDs unlinked to personal identifiers; (2) secure data storage on encrypted servers with access restricted to authorized researchers; (3) aggregated analysis that prevented individual identification; and (4) compliance with educational data privacy regulations (FERPA compliance for US participants). All identifiable information was removed prior to analysis, and participants were informed of all potential data uses including future research applications.
Performance evaluation employed multiple metrics targeting different aspects of the system. For knowledge tracing accuracy, we used the area under the receiver operating characteristic curve (AUC) and root mean squared error (RMSE) between predicted and actual performance:
For cognitive load estimation, we calculated the correlation coefficient between estimated load levels and ground truth measurements from physiological sensors and validated questionnaires. Path quality evaluation employed a composite metric combining prerequisite satisfaction, content diversity, and progression smoothness:
Learning efficiency was quantified using a gain-per-time metric normalized by cognitive effort:
We compared our approach against five baseline methods: (1) a rule-based system following predefined curriculum sequences; (2) a collaborative filtering recommendation approach; (3) a knowledge graph navigation algorithm; (4) a standard DKT model with no cognitive load component; and (5) a state-of-the-art reinforcement learning system trained to maximize knowledge gain. Implementation details and hyperparameter settings for all methods were standardized to ensure fair comparison51.
As shown in Table 5; Fig. 2, our integrated approach consistently outperformed all baseline methods across multiple metrics. The performance improvements were particularly pronounced in the programming education dataset, where the complex skill relationships and varying cognitive demands of different programming concepts highlighted the advantages of our dual-objective optimization approach52. Statistical significance testing using paired t-tests confirmed that the improvements were significant at p < 0.01 for all metrics.
(A) Prediction accuracy comparison across methods. (B) Learning efficiency by approach. (C) Representative learning trajectories showing knowledge state (solid line) and cognitive load (dashed line) over time for a participant using our DKT-CLE system versus baseline approach. (D) Cognitive load heatmap during different learning activities, demonstrating how our system maintains optimal challenge levels.
Qualitative analysis of generated learning paths revealed several distinctive characteristics of our approach: (1) more adaptive sequencing that responded to fluctuations in performance and engagement; (2) strategic insertion of review activities when cognitive overload was detected; (3) appropriate scaffolding through complementary content types; and (4) smoother difficulty progression that maintained optimal challenge levels. Expert evaluation by instructional designers rated our system-generated paths significantly higher in pedagogical soundness compared to baseline methods (4.4/5 vs. 3.5/5 average rating, p < 0.01).
User experience evaluation through post-study surveys and interviews revealed high satisfaction levels (4.3/5) with our system, with participants specifically highlighting reduced frustration during challenging concepts and improved engagement during longer study sessions. Longitudinal analysis of learning outcomes showed that students using our system achieved mastery-level performance 24.6% faster than those following traditional curriculum sequences while reporting lower average stress levels53.
Analysis of log data revealed interesting patterns in the relationship between cognitive load management and learning outcomes. Students who experienced primarily optimal cognitive load states (neither underloaded nor overloaded) showed significantly higher retention in post-tests administered two weeks after the learning period. Furthermore, the system’s ability to detect and respond to early signs of confusion or frustration resulted in 43% fewer abandoned learning sessions compared to non-adaptive approaches.
One particularly valuable finding was the identification of personalized “learning sweet spots” for individual students – optimal ranges of cognitive challenge that maximized both immediate performance and long-term retention. These individual profiles enabled increasingly refined path recommendations as students progressed through the curriculum, demonstrating the system’s capacity for continuous adaptation and personalization.
Conclusion
This study presents a novel approach to personalized learning path generation that integrates deep knowledge tracing with cognitive load estimation through a dual-stream neural network architecture. Our research makes several significant contributions to the field of adaptive educational technology. First, the proposed dual-stream architecture effectively combines knowledge state tracking and cognitive load estimation within a unified computational framework, enabling more comprehensive student modeling than previous approaches. Second, the dual-objective optimization algorithm balances knowledge acquisition with cognitive load management, resulting in learning paths that are both pedagogically sound and cognitively appropriate. Third, the implemented system demonstrates the feasibility of real-time adaptation based on both performance data and cognitive state estimates in authentic educational environments54.
Experimental results across multiple educational domains and diverse student populations confirm the effectiveness of our approach. The integrated system consistently outperformed baseline methods in knowledge prediction accuracy (87.5% vs. 83.7% for the best baseline), path quality assessment (4.4/5 vs. 3.9/5), and learning efficiency improvement (24.6% vs. 18.2%). These quantitative improvements were reinforced by qualitative findings indicating higher student engagement, lower frustration levels, and improved retention of learned material. Particularly noteworthy was the system’s ability to identify and adapt to individual “learning sweet spots” – optimal ranges of cognitive challenge that maximized both immediate performance and long-term retention55.
Despite these promising results, several limitations warrant acknowledgment. First, the accuracy of cognitive load estimation remains challenging in naturalistic learning environments without specialized sensors. While our approach makes significant strides in inferring cognitive states from behavioral patterns, integration with unobtrusive physiological monitoring could further enhance estimation precision.
Second, the current implementation primarily focuses on cognitive aspects of learning without explicitly modeling other critical individual characteristics such as learning styles, preferences, emotional states, and self-regulation capabilities. These factors significantly influence learning effectiveness and engagement but are not captured in our current framework. The personalization approach we present, while advanced in cognitive modeling, represents only a partial solution to fully adaptive learning environments that would ideally account for these multidimensional learner characteristics.
Third, the computational complexity of the dual-objective optimization presents scalability challenges for very large knowledge domains or real-time applications with stringent latency requirements56.
Future research directions should address these limitations while extending the capabilities of the integrated approach. One promising avenue involves developing more sophisticated models of multidimensional cognitive load that distinguish between intrinsic, extraneous, and germane load components, enabling more targeted instructional interventions. Additionally, incorporating emotional state detection and motivational dynamics could provide a more holistic understanding of learning processes. Exploring transfer learning techniques may enhance model generalizability across domains and reduce the data requirements for new applications. Finally, investigating explainable AI approaches could improve transparency and trust in system recommendations, facilitating wider adoption by educators and learners57.
The theoretical contributions of this work include advancing our understanding of the relationship between knowledge acquisition processes and cognitive resource allocation during learning. From a practical perspective, the demonstrated system provides a blueprint for next-generation adaptive learning technologies that consider both what students know and how they process information. As educational systems increasingly embrace personalization, approaches that balance cognitive science principles with advanced computational techniques will be essential for creating learning environments that are both effective and sustainable58,59,60,61,62,63,64.
Data availability
All data included in this study are available upon request by contact with the corresponding author. The implementation code for the dual-stream neural network architecture and learning path optimization algorithms will be made publicly available in a GitHub repository upon publication of this paper to facilitate reproducibility and further research in this area.
References
Baker, R. S., Corbett, A. T. & Aleven, V. More accurate student modeling through contextual estimation of slip and guess probabilities in Bayesian knowledge tracing. International Conference on Intelligent Tutoring Systems 406–415 (Springer, 2008).
Cheng, Y. & Ying, Z. Personalized learning: A new approach to adaptive intelligent tutoring systems. J. Educational Technol. Soc. 22(1), 93–105 (2019).
Piech, C. et al. Deep knowledge tracing. Advances in neural information processing systems (2015).
Wang, L. et al. Incorporating knowledge concept graph for personalized learning path generation. IEEE Trans. Learn. Technol. 13(1), 184–197 (2020).
Sweller, J., Van Merrienboer, J. J. G., Paas, F. & G W C. Cognitive architecture and instructional design. Educational Psychol. Rev. 10(3), 251–296 (1998).
Alqahtani, A. Y. & Rajkhan, A. A. E-learning critical success factors during the COVID-19 pandemic: A comprehensive analysis of e-learning managerial perspectives. Educ. Sci. 10(9), 216 (2020).
Chen, X. et al. A multi-perspective study on artificial intelligence in education: Grants, conferences, journals, software tools, institutions, and researchers. Comput. Educ. Artif. Intell. 3, 100047 (2022).
Lan, A. S. et al. Time-varying learning and content analytics via sparse factor analysis. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 452–461 (2014).
Wang, Y., Heffernan, N. T. & Heffernan, C. Towards better affect detectors: Effect of missing skills, class features and common wrong answers. Proceedings of the Fifth International Conference on Learning Analytics and Knowledge 31–35 (2015).
Karumbaiah, S. et al. Predicting and explaining students’ stress in intelligent tutoring systems. J. Educational Data Min. 13(2), 55–87 (2021).
Corbett, A. T. & Anderson, J. R. Knowledge tracing: modeling the acquisition of procedural knowledge. User Model. User-Adapt. Interact. 4(4), 253–278 (1994).
Yudelson, M. V., Koedinger, K. R. & Gordon, G. J. Individualized Bayesian knowledge tracing models. International Conference on Artificial Intelligence in Education 171–180 (Springer, 2013).
Zhang, J. et al. Dynamic key-value memory networks for knowledge tracing. Proceedings of the 26th International Conference on World Wide Web 765–774 (2017).
Nakagawa, H., Iwasawa, Y. & Matsuo, Y. Graph-based knowledge tracing: Modeling student proficiency using graph neural networks. IEEE/WIC/ACM International Conference on Web Intelligence 156–163 (2019).
Ghosh, A., Heffernan, N. & Lan, A. S. Context-aware attentive knowledge tracing. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 2330–2339 (2020).
Yeung, C. K. & Yeung, D. Y. Addressing two problems in deep knowledge tracing via prediction-consistent regularization. Proceedings of the Fifth Annual ACM Conference on Learning at Scale 1–10 (2018).
Sweller, J. Cognitive load during problem solving: effects on learning. Cogn. Sci. 12(2), 257–285 (1988).
Paas, F. et al. Cognitive load measurement as a means to advance cognitive load theory. Educational Psychol. 38(1), 63–71 (2003).
Antonenko, P. D. et al. Using electroencephalography to measure cognitive load. Educational Psychol. Rev. 22(4), 425–438 (2010).
Chen, S. & Epps, J. Using task-induced pupil diameter and Blink rate to infer cognitive load. Human–Computer Interact. 29(4), 390–413 (2014).
Zheng, R. & Greenberg, K. The boundary of different approaches in cognitive load measurement: strengths and limitations. Learn. Instruction. 57, 48–58 (2018).
Moustafa, K., Luz, S. & Longo, L. Assessment of mental workload: A comparison of machine learning methods and subjective assessment techniques. International Symposium on Human Mental Workload: Models and Applications 30–50 (Springer, 2017).
Shute, V. J. & Zapata-Rivera, D. Adaptive educational systems. Adapt. Technol. Train. Educ. 7(1), 7–27 (2012).
Brusilovsky, P. & Millán, E. User models for adaptive hypermedia and adaptive educational sys tems. The Adaptive Web 3–53 (Springer, 2007).
Dwivedi, P., Kant, V. & Bharadwaj, K. K. Learning path recommendation based on modified variable length genetic algorithm. Educ. Inform. Technol. 23(2), 819–836 (2018).
Liu, J., Tang, M., Zheng, Z., Liu, X. & Lyu, S. A multigranularity learning path recommendation framework based on knowledge graph and improved ant colony optimization algorithm for E-Learning. IEEE Trans. Comput. Social Syst. 12(2), 586–607 (2024).
Zheng, Y. et al. A unified framework for personalized learning pathway recommendation in e-learning contexts. Educ. Inform. Technol. https://doi.org/10.1007/s10639-024-13045-8 (2024).
Dalipi, F., Imran, A. S. & Kastrati, Z. MOOC dropout prediction using machine learning techniques: Review and research challenge. IEEE Global Engineering Education Conference 1007–1014 (2018).
Chen, Q. et al. A reinforcement learning approach to personalize learning path generation. International Conference on Artificial Intelligence in Education 42–47 (Springer, 2020).
Zhang, K. et al. Interactive learning path navigation: A multi-modal interactive learning system based on knowledge graph. ACM Trans. Interact. Intell. Syst. 12(3), 1–25 (2022).
Huang, Z. et al. Bidirectional attentive knowledge tracing. Proceedings of the 29th ACM International Conference on Information & Knowledge Management 1705–1714 (2020).
Li, H. et al. The Effects of the Cognitive Load and self-regulated Learning on Learning Performance in Initial AR Learning Environment: Using eye-tracking To Unveil the Internal mechanism179104408 (Computers & Education, 2022).
Santhanam, S. et al. Integrated attention-based knowledge tracing and cognitive load Estimation for personalized learning path generation. IEEE Trans. Neural Networks Learn. Syst. 33(10), 5757–5771 (2022).
Moreno-Marcos, P. M. et al. Prediction in moocs: A review and future research directions. IEEE Trans. Learn. Technol. 12(3), 384–401 (2019).
Tang, S., Peterson, J. C. & Pardos, Z. A. Deep neural networks and how they apply to sequential education data. Proceedings of the Third ACM Conference on Learning @ Scale 321–324 (2016).
Abdelrahman, G. & Wang, Q. Knowledge tracing with sequential key-value memory networks. Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval 175–184 (2019).
Xu, K. et al. Representation learning on graphs with jumping knowledge networks. International Conference on Machine Learning 5453–5462 (2018).
Pandey, S. & Karypis, G. A self-attentive model for knowledge tracing. Proceedings of the 12th International Conference on Educational Data Mining 384–389 (2019).
Zhou, F. et al. A novel ensemble deep knowledge tracing method for adaptive learning. IEEE Trans. Cybernetics. 52(10), 10280–10290 (2021).
Chen, L., Chen, P. & Lin, Z. Artificial intelligence in education: A review. IEEE Access. 8, 75264–75278 (2020).
Mao, Y., Lin, C. & Yu, M. A multi-label classification approach to cognitive load Estimation with physiological and performance features. IEEE Trans. Cogn. Dev. Syst. 14(2), 472–483 (2022).
Moreno-Marcos, P. M. et al. Analyzing the predictive power for anticipating student’s knowledge in personalized learning via multimodal data. IEEE Trans. Learn. Technol. 13(1), 113–127 (2020).
Huang, Z. et al. Learning and adaptation in multimodal skill assessment with dual attention and recurrent fusion. IEEE Trans. Neural Networks Learn. Syst. 33(8), 3732–3746 (2021).
Wang, S., Zhang, J. & Chan, S. C. A hierarchical framework for adaptive personalized learning path optimization and visualization. IEEE Trans. Neural Netw. Learn. Syst. 33(10), 5772–5784 (2022).
Silver, D. et al. Mastering the game of go without human knowledge. Nature 550(7676), 354–359 (2017).
Manickam, I., Lan, A. S. & Baraniuk, R. G. Contextual multi-armed bandit algorithms for personalized learning action selection. IEEE International Conference on Acoustics, Speech and Signal Processing 6344–6348 (2017).
Essa, A. A possible future for next generation adaptive learning systems. Smart Learn. Environ. 3(1), 1–24 (2016).
Wan, H. et al. PeRSIVA: an interactive visualization system for personalized learning experience analytics. IEEE Trans. Vis. Comput. Graph. 27(12), 4560–4571 (2021).
Baker, T., Smith, L. & Anissa, N. Educ-AI-tion rebooted? Exploring the future of artificial intelligence in schools and colleges. Nesta Foundation Rep. 1–52 (2022).
Cui, W., Xue, Z. & Thai, K. P. Performance prediction and cognitive skill Estimation in adaptive learning systems: A survey of the state-of-the-art. IEEE Trans. Learn. Technol. 14(6), 740–755 (2021).
Liu, R. & Jansen, R. Performance prediction in moocs: A review, analysis, and future directions. IEEE Trans. Learn. Technol. 14(6), 685–703 (2021).
Mu, T. et al. Embedding knowledge concepts in neural knowledge tracing with question-answering interactions. IEEE Trans. Learn. Technol. 14(6), 756–769 (2021).
Tabuenca, B. et al. Analyzing Emotions in Educational Interventions: A Case Study Based on Physiological multi-sensor data 176104358 (Computers & Education, 2022).
Knox, J., Williamson, B. & Bayne, S. Machine behaviourism: future visions of ‘learnification’ and ‘datafication’ across humans and digital technologies. Learn. Media Technol. 45(1), 31–45 (2020).
Whitelock-Wainwright, A. et al. Learning Analytics Deployment Tactics: A meta-analysis of Success factors 183104498 (Computers & Education, 2022).
Chen, L. et al. A review of deep learning in educational data mining and learning analytics: current status and perspectives. Front. Comput. Sci. 16(3), 1–22 (2022).
Holstein, K., McLaren, B. M. & Aleven, V. Student learning benefits of a mixed-reality teacher awareness tool in AI-enhanced classrooms. International Conference on Artificial Intelligence in Education 154–168 (Springer, 2018).
Liang, Y. et al. HELP-DKT: an interpretable cognitive model of how students learn programming based on deep knowledge tracing. Sci. Rep. 12, 4012. https://doi.org/10.1038/s41598-022-07956-0 (2022).
Pu, Y. et al. ELAKT: enhancing locality for attentive knowledge tracing. ACM Trans. Inform. Syst. 42(4), 1–27. https://doi.org/10.1145/3652601 (2024).
Pu, Y. et al. Embedding cognitive framework with self-attention for interpretable knowledge tracing. Sci. Rep. 12, 17536. https://doi.org/10.1038/s41598-022-22539-9 (2022).
Wu, Z. et al. Reinforcement learning enhanced graph neural network with relative entropy. IEEE International Conference on Data Engineering (ICDE) (forthcoming) (2025).
Zhu, Y., Xu, Y., Yu, F. & Liu, Q. SOLA-GCL: Subgraph-oriented learnable augmentation method for graph contrastive learning. Proceedings of the AAAI Conference on Artificial Intelligence (2025).
Peng, T. et al. CLGT: A graph transformer for student performance prediction in collaborative learning. Proc. AAAI Conf. Artif. Intell. 37(13), 15947–15954. https://doi.org/10.1609/aaai.v37i13.26893 (2023).
Peng, T. et al. GOAT: a novel global-local optimized graph transformer framework for predicting student performance in collaborative learning. Sci. Rep. 15, 15421. https://doi.org/10.1038/s41598-023-42703-z (2023).
Author information
Authors and Affiliations
Contributions
Chunyan Tong contributed to the conceptualization of the research, theoretical framework development, educational domain analysis, experimental design, data collection protocols, interpretation of results, and manuscript writing (introduction, related work, and conclusion). Changhong Ren contributed to the neural network architecture design, algorithm development, system implementation, data processing, experimental validation, performance analysis, and manuscript writing (methodology and results sections). Both authors participated in the review and editing of the final manuscript, approved the submitted version, and agreed to be accountable for all aspects of the work.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tong, C., Ren, C. Deep knowledge tracing and cognitive load estimation for personalized learning path generation using neural network architecture. Sci Rep 15, 24925 (2025). https://doi.org/10.1038/s41598-025-10497-x
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-10497-x




