An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning

With the rapid development of information technology, personalized education has become a key direction for improving the quality of online learning and optimizing individualized learning paths. However, accurately recommending appropriate courses and exercises for diverse learners remains a signifi...

Full description

Saved in:
Bibliographic Details
Main Authors: Ran Zhang, Xiaoping Wu, Xude Zhang, Li Xu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11025824/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850130436913627136
author Ran Zhang
Xiaoping Wu
Xude Zhang
Li Xu
author_facet Ran Zhang
Xiaoping Wu
Xude Zhang
Li Xu
author_sort Ran Zhang
collection DOAJ
description With the rapid development of information technology, personalized education has become a key direction for improving the quality of online learning and optimizing individualized learning paths. However, accurately recommending appropriate courses and exercises for diverse learners remains a significant challenge. Existing recommendation methods often struggle with effectively modeling learner interests, addressing the cold-start problem, and dynamically adapting recommendation strategies to meet personalized needs. To address these limitations, this paper proposes RL-TBTNet, a novel teaching optimization recommendation framework that integrates a bidirectional Transformer, BERT, and reinforcement learning (DQN). The model first vectorizes user behavior data, learning content, and knowledge base information. Transformer layers are employed for feature encoding, while BERT extracts deep semantic representations to form individualized feature vectors. These features are then fused via Transformer-based processing to predict optimal learning content. In addition, a DQN-based reinforcement learning module models dynamic shifts in user interests, enabling adaptive refinement of learning trajectories over time. Experimental evaluations on public datasets show that RL-TBTNet outperforms existing Transformer-based methods such as BST in terms of key metrics like HR and NDCG, particularly excelling in cold-start scenarios. Ablation studies further confirm the effectiveness of semantic enhancement through BERT and reinforcement-driven optimization. These results demonstrate the framework’s potential as a robust and adaptive solution for personalized educational content recommendation, offering both practical value and theoretical insights for the development of intelligent education systems.
format Article
id doaj-art-0034fdbf3cb24b3e8f0bbf29ff5a22ec
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-0034fdbf3cb24b3e8f0bbf29ff5a22ec2025-08-20T02:32:42ZengIEEEIEEE Access2169-35362025-01-011310006410007310.1109/ACCESS.2025.357687811025824An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement LearningRan Zhang0https://orcid.org/0009-0006-2086-0712Xiaoping Wu1Xude Zhang2Li Xu3College of Microelectronics and Artificial Intelligence, Kaili University, Kaili, ChinaCollege of Microelectronics and Artificial Intelligence, Kaili University, Kaili, ChinaCollege of Microelectronics and Artificial Intelligence, Kaili University, Kaili, ChinaCollege of Microelectronics and Artificial Intelligence, Kaili University, Kaili, ChinaWith the rapid development of information technology, personalized education has become a key direction for improving the quality of online learning and optimizing individualized learning paths. However, accurately recommending appropriate courses and exercises for diverse learners remains a significant challenge. Existing recommendation methods often struggle with effectively modeling learner interests, addressing the cold-start problem, and dynamically adapting recommendation strategies to meet personalized needs. To address these limitations, this paper proposes RL-TBTNet, a novel teaching optimization recommendation framework that integrates a bidirectional Transformer, BERT, and reinforcement learning (DQN). The model first vectorizes user behavior data, learning content, and knowledge base information. Transformer layers are employed for feature encoding, while BERT extracts deep semantic representations to form individualized feature vectors. These features are then fused via Transformer-based processing to predict optimal learning content. In addition, a DQN-based reinforcement learning module models dynamic shifts in user interests, enabling adaptive refinement of learning trajectories over time. Experimental evaluations on public datasets show that RL-TBTNet outperforms existing Transformer-based methods such as BST in terms of key metrics like HR and NDCG, particularly excelling in cold-start scenarios. Ablation studies further confirm the effectiveness of semantic enhancement through BERT and reinforcement-driven optimization. These results demonstrate the framework’s potential as a robust and adaptive solution for personalized educational content recommendation, offering both practical value and theoretical insights for the development of intelligent education systems.https://ieeexplore.ieee.org/document/11025824/Deep learningtransformerteaching optimizationlearning pushBERT
spellingShingle Ran Zhang
Xiaoping Wu
Xude Zhang
Li Xu
An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
IEEE Access
Deep learning
transformer
teaching optimization
learning push
BERT
title An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
title_full An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
title_fullStr An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
title_full_unstemmed An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
title_short An Instructional Optimization Method Based on Bidirectional Transformer and Reinforcement Learning
title_sort instructional optimization method based on bidirectional transformer and reinforcement learning
topic Deep learning
transformer
teaching optimization
learning push
BERT
url https://ieeexplore.ieee.org/document/11025824/
work_keys_str_mv AT ranzhang aninstructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT xiaopingwu aninstructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT xudezhang aninstructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT lixu aninstructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT ranzhang instructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT xiaopingwu instructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT xudezhang instructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning
AT lixu instructionaloptimizationmethodbasedonbidirectionaltransformerandreinforcementlearning