ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10

To effectively integrate the research on learning engagement with teaching practices and accurately assess and analyze students’ learning behavior participation in the classroom to improve teaching quality, this paper proposes an improved YOLOv10 algorithm model, ACM-YOLOv10, targeting th...

Full description

Saved in:
Bibliographic Details
Main Authors: Beichen Qin, Haoyan Hu, Shaowen Du
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11127087/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849223083599069184
author Beichen Qin
Haoyan Hu
Shaowen Du
author_facet Beichen Qin
Haoyan Hu
Shaowen Du
author_sort Beichen Qin
collection DOAJ
description To effectively integrate the research on learning engagement with teaching practices and accurately assess and analyze students’ learning behavior participation in the classroom to improve teaching quality, this paper proposes an improved YOLOv10 algorithm model, ACM-YOLOv10, targeting the issues of insufficient detection precision, missed detection, false detection, and slow speed of traditional recognition algorithms in classroom behavior detection under multi-scale scenarios and occluded targets. The model is designed with an Asymmetric Depthwise Separable Convolution (ADSConv) module to replace the traditional convolutional layers. This module, with its lightweight design, optimizes convolution operations to reduce computational complexity and parameter quantity, thereby accelerating the model’s inference speed. Additionally, the Spatial and Channel Reconstruction Convolution (C2f_SCConv) module is embedded in the backbone and neck networks, combining the characteristics of SCConv to reduce redundant features of CNN through spatial and channel reconstruction units, which effectively enhances the model’s ability to learn local features, particularly suitable for fine-grained feature extraction in complex backgrounds. Finally, the redesigned spatial Pyramid Pool Fast Learning Separable Kernel Attention (SPPF_LSKA) module is introduced into the backbone network, improving the multi-scale feature fusion method to more effectively handle features of different scales and enhancing the model’s sensitivity and detection capability for minority class behaviors. Experimental results demonstrate that the improved ACM-YOLOv10 model achieves mAP of 82.4% for six behavior categories on the SCB-Dataset3, which is a 5.3% improvement compared to the baseline model, outperforming other mainstream detection models and meeting the practical requirements for Student Behaviors (SB) detection. Additionally, generalization experiments conducted on another SB dataset confirm that the improved algorithm model possesses good generalization performance.
format Article
id doaj-art-0278309d097d4cdeba586dfd0040cc4a
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-0278309d097d4cdeba586dfd0040cc4a2025-08-25T23:12:29ZengIEEEIEEE Access2169-35362025-01-011314486314487710.1109/ACCESS.2025.359968611127087ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10Beichen Qin0https://orcid.org/0009-0009-0794-4841Haoyan Hu1Shaowen Du2School of Energy and Power Engineering, Nanjing Institute of Technology, Nanjing, Jiangsu, ChinaEngineering Training Center, College of Applied Technology, Nanjing, Jiangsu, ChinaSchool of Traffic Engineering, Nanjing Institute of Technology, Nanjing, Jiangsu, ChinaTo effectively integrate the research on learning engagement with teaching practices and accurately assess and analyze students’ learning behavior participation in the classroom to improve teaching quality, this paper proposes an improved YOLOv10 algorithm model, ACM-YOLOv10, targeting the issues of insufficient detection precision, missed detection, false detection, and slow speed of traditional recognition algorithms in classroom behavior detection under multi-scale scenarios and occluded targets. The model is designed with an Asymmetric Depthwise Separable Convolution (ADSConv) module to replace the traditional convolutional layers. This module, with its lightweight design, optimizes convolution operations to reduce computational complexity and parameter quantity, thereby accelerating the model’s inference speed. Additionally, the Spatial and Channel Reconstruction Convolution (C2f_SCConv) module is embedded in the backbone and neck networks, combining the characteristics of SCConv to reduce redundant features of CNN through spatial and channel reconstruction units, which effectively enhances the model’s ability to learn local features, particularly suitable for fine-grained feature extraction in complex backgrounds. Finally, the redesigned spatial Pyramid Pool Fast Learning Separable Kernel Attention (SPPF_LSKA) module is introduced into the backbone network, improving the multi-scale feature fusion method to more effectively handle features of different scales and enhancing the model’s sensitivity and detection capability for minority class behaviors. Experimental results demonstrate that the improved ACM-YOLOv10 model achieves mAP of 82.4% for six behavior categories on the SCB-Dataset3, which is a 5.3% improvement compared to the baseline model, outperforming other mainstream detection models and meeting the practical requirements for Student Behaviors (SB) detection. Additionally, generalization experiments conducted on another SB dataset confirm that the improved algorithm model possesses good generalization performance.https://ieeexplore.ieee.org/document/11127087/Student behaviorYOLOv10asymmetric depthwise separable convolutionspatial and channel reconstruction convolution
spellingShingle Beichen Qin
Haoyan Hu
Shaowen Du
ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
IEEE Access
Student behavior
YOLOv10
asymmetric depthwise separable convolution
spatial and channel reconstruction convolution
title ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
title_full ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
title_fullStr ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
title_full_unstemmed ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
title_short ACM-YOLOv10: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv10
title_sort acm yolov10 research on classroom learning behavior recognition algorithm based on improved yolov10
topic Student behavior
YOLOv10
asymmetric depthwise separable convolution
spatial and channel reconstruction convolution
url https://ieeexplore.ieee.org/document/11127087/
work_keys_str_mv AT beichenqin acmyolov10researchonclassroomlearningbehaviorrecognitionalgorithmbasedonimprovedyolov10
AT haoyanhu acmyolov10researchonclassroomlearningbehaviorrecognitionalgorithmbasedonimprovedyolov10
AT shaowendu acmyolov10researchonclassroomlearningbehaviorrecognitionalgorithmbasedonimprovedyolov10