OM-VST: A video action recognition model based on optimized downsampling module combined with multi-scale feature fusion.

Video classification, as an essential task in computer vision, aims to identify and label video content using computer technology automatically. However, the current mainstream video classification models face two significant challenges in practical applications: first, the classification accuracy i...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xiaozhong Geng, Cheng Chen, Ping Yu, Baijin Liu, Weixin Hu, Qipeng Liang, Xintong Zhang
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2025-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0318884
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!

OM-VST: A video action recognition model based on optimized downsampling module combined with multi-scale feature fusion.

Similar Items