Hidden Markov Model-Based Video Recognition for Sports

In this paper, we conduct an in-depth study and analysis of sports video recognition by improved hidden Markov model. The feature module is a complex gesture recognition module based on hidden Markov model gesture features, which applies the hidden Markov model features to gesture recognition and pe...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhiyuan Wang, Chongyuan Bi, Songhui You, Junjie Yao
Format:	Article
Language:	English
Published:	Wiley 2021-01-01
Series:	Advances in Mathematical Physics
Online Access:	http://dx.doi.org/10.1155/2021/5183088
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832551642076020736
author	Zhiyuan Wang Chongyuan Bi Songhui You Junjie Yao
author_facet	Zhiyuan Wang Chongyuan Bi Songhui You Junjie Yao
author_sort	Zhiyuan Wang
collection	DOAJ
description	In this paper, we conduct an in-depth study and analysis of sports video recognition by improved hidden Markov model. The feature module is a complex gesture recognition module based on hidden Markov model gesture features, which applies the hidden Markov model features to gesture recognition and performs the recognition of complex gestures made by combining simple gestures based on simple gesture recognition. The combination of the two modules forms the overall technology of this paper, which can be applied to many scenarios, including some special scenarios with high-security levels that require real-time feedback and some public indoor scenarios, which can achieve different prevention and services for different age groups. With the increase of the depth of the feature extraction network, the experimental effect is enhanced; however, the two-dimensional convolutional neural network loses temporal information when extracting features, so the three-dimensional convolutional network is used in this paper to extract features from the video in time and space. Multiple binary classifications of the extracted features are performed to achieve the goal of multilabel classification. A multistream residual neural network is used to extract features from video data of three modalities, and the extracted feature vectors are fed into the attention mechanism network, then, the more critical information for video recognition is selected from a large amount of spatiotemporal information, further learning the temporal dependencies existing between consecutive video frames, and finally fusing the multistream network outputs to obtain the final prediction category. By training and optimizing the model in an end-to-end manner, recognition accuracies of 92.7% and 64.4% are achieved on the dataset, respectively.
format	Article
id	doaj-art-e6dcef294c284d6f813b0ff095e6b8d1
institution	Kabale University
issn	1687-9139
language	English
publishDate	2021-01-01
publisher	Wiley
record_format	Article
series	Advances in Mathematical Physics
spelling	doaj-art-e6dcef294c284d6f813b0ff095e6b8d12025-02-03T06:01:00ZengWileyAdvances in Mathematical Physics1687-91392021-01-01202110.1155/2021/5183088Hidden Markov Model-Based Video Recognition for SportsZhiyuan Wang0Chongyuan Bi1Songhui You2Junjie Yao3International College of FootballInternational College of FootballInternational College of FootballSchool of Computer Science and TechnologyIn this paper, we conduct an in-depth study and analysis of sports video recognition by improved hidden Markov model. The feature module is a complex gesture recognition module based on hidden Markov model gesture features, which applies the hidden Markov model features to gesture recognition and performs the recognition of complex gestures made by combining simple gestures based on simple gesture recognition. The combination of the two modules forms the overall technology of this paper, which can be applied to many scenarios, including some special scenarios with high-security levels that require real-time feedback and some public indoor scenarios, which can achieve different prevention and services for different age groups. With the increase of the depth of the feature extraction network, the experimental effect is enhanced; however, the two-dimensional convolutional neural network loses temporal information when extracting features, so the three-dimensional convolutional network is used in this paper to extract features from the video in time and space. Multiple binary classifications of the extracted features are performed to achieve the goal of multilabel classification. A multistream residual neural network is used to extract features from video data of three modalities, and the extracted feature vectors are fed into the attention mechanism network, then, the more critical information for video recognition is selected from a large amount of spatiotemporal information, further learning the temporal dependencies existing between consecutive video frames, and finally fusing the multistream network outputs to obtain the final prediction category. By training and optimizing the model in an end-to-end manner, recognition accuracies of 92.7% and 64.4% are achieved on the dataset, respectively.http://dx.doi.org/10.1155/2021/5183088
spellingShingle	Zhiyuan Wang Chongyuan Bi Songhui You Junjie Yao Hidden Markov Model-Based Video Recognition for Sports Advances in Mathematical Physics
title	Hidden Markov Model-Based Video Recognition for Sports
title_full	Hidden Markov Model-Based Video Recognition for Sports
title_fullStr	Hidden Markov Model-Based Video Recognition for Sports
title_full_unstemmed	Hidden Markov Model-Based Video Recognition for Sports
title_short	Hidden Markov Model-Based Video Recognition for Sports
title_sort	hidden markov model based video recognition for sports
url	http://dx.doi.org/10.1155/2021/5183088
work_keys_str_mv	AT zhiyuanwang hiddenmarkovmodelbasedvideorecognitionforsports AT chongyuanbi hiddenmarkovmodelbasedvideorecognitionforsports AT songhuiyou hiddenmarkovmodelbasedvideorecognitionforsports AT junjieyao hiddenmarkovmodelbasedvideorecognitionforsports

Hidden Markov Model-Based Video Recognition for Sports

Similar Items