Creating a Novel Attention-Enhanced Framework for Video-Based Action Quality Assessment

Action Quality Assessment (AQA)—the task of evaluating how well human actions are performed—is essential in domains such as sports and medicine. Existing AQA methods typically rely on score regression following feature extraction but often neglect the ambiguity inherent in extracted features. In thi...

Full description

Saved in:
Bibliographic Details
Main Authors: Wenhui Gong, Wei Li, Huosheng Hu, Zhijun Song, Zhiqiang Zeng, Jinhua Sun, Yuping Song
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Sci
Subjects:
Online Access:https://www.mdpi.com/2413-4155/7/2/54
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Action Quality Assessment (AQA)—the task of evaluating how well human actions are performed—is essential in domains such as sports and medicine. Existing AQA methods typically rely on score regression following feature extraction but often neglect the ambiguity inherent in extracted features. In this work, we introduce a novel AQA framework that incorporates a modified attention module to better capture relevant information. Our approach segments video data into clips, extracts features using the I3D network, and applies attention mechanisms to highlight salient features while suppressing irrelevant ones. To assess feature quality, we employ score distribution regression and propose an uncertainty-aware score distribution learning strategy that models features as Gaussian distributions. We further leverage Variational Autoencoders (VAEs) to capture complex latent representations and quantify uncertainty. Extensive experiments on the MTL-AQA and JIGSAWS datasets demonstrate the effectiveness and robustness of our proposed method.
ISSN:2413-4155