Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset

Bibliographic Details
Main Authors: Xuhui Song, Linyuan Fan
Format: Article
Language: English
Published: Wiley 2021-01-01
Series: Complexity
Online Access: http://dx.doi.org/10.1155/2021/6697697
_version_ 1832568508167225344
author Xuhui Song
Linyuan Fan
author_facet Xuhui Song
Linyuan Fan
author_sort Xuhui Song
collection DOAJ
description In traditional 3D reconstruction methods, predicting the 3D structure of an object from a single view is a very difficult task. This research discusses human pose recognition and estimation based on a 3D multiview basketball sports dataset. The convolutional neural network framework used in this research is VGG11, and the basketball dataset ImageNet is used for pretraining. Only some modules of the VGG11 network are used: for different feature fusion methods, different modules of VGG11 serve as the feature extraction network. For efficient computation, the multilayer perceptron in the network model is implemented as a one-dimensional convolutional network. The input is a randomly sampled point set; after one perceptron layer, the network outputs a feature set of n × 16. This feature set is then sent to two network branches: one continues to use the perceptron method to generate a feature set of n × 1024, and the other extracts the local features of the points. After the RGB basketball sports picture passes through the semantic segmentation network, a picture containing the target object is obtained and input to the constructed feature fusion network model. After feature extraction is performed separately on the RGB image and the depth image, the RGB feature, the local feature of the point cloud, and the global feature are concatenated and fused to form a feature vector of N × 1152. This vector feeds three network branches, which predict the object position, rotation, and confidence, respectively. In these branches, feature dimensionality reduction is realized by one-dimensional convolution, and the activation function is ReLU. After removing the feature mapping module, the accuracy of VC-CNN_v1 dropped by 0.33% and the accuracy of VC-CNN_v2 dropped by 0.55%. The results show that adding the feature mapping module improves the recognition performance of the network to a certain extent.
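The shape bookkeeping in the description above can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. It only shows that a shared per-point linear layer (equivalent to a one-dimensional convolution with kernel size 1) maps n points to n × 16 and then n × 1024, and that concatenating RGB, local, and global features can yield 1152 channels per point. Several details are assumptions not stated in the record: the 3-channel xyz input, the 64 + 64 + 1024 split of the 1152 channels, the max-pooling used to obtain the global feature, and the random stand-in for the VGG11 RGB features. All function and constant names are hypothetical.

```python
import random

# Hypothetical channel split: 64 (RGB) + 64 (local) + 1024 (global) = 1152.
# The record gives only the total of 1152, not how it is divided.
RGB_DIM = 64
LOCAL_DIM = 64
GLOBAL_DIM = 1024


def init_layer(c_in, c_out, rng):
    """Create small random weights (c_out rows of c_in values) and zero biases."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(c_in)] for _ in range(c_out)]
    bias = [0.0] * c_out
    return weights, bias


def pointwise_linear(points, weights, bias):
    """Apply one shared linear layer followed by ReLU to every point.

    This is what a 1D convolution with kernel size 1 computes along the
    point axis: the same weights are reused for each of the n points.
    """
    out = []
    for p in points:
        row = []
        for w, b in zip(weights, bias):
            s = b + sum(wi * pi for wi, pi in zip(w, p))
            row.append(max(0.0, s))  # ReLU
        out.append(row)
    return out


def fuse(rgb_feat, local_feat, per_point_global):
    """Concatenate RGB, local, and global features for every point.

    Assumption: the global feature is the channel-wise maximum over all
    points, repeated for each point (a PointNet-style choice).
    """
    global_vec = [max(col) for col in zip(*per_point_global)]
    return [rgb_feat[i] + local_feat[i] + global_vec for i in range(len(local_feat))]


def demo(n=4, seed=0):
    rng = random.Random(seed)
    # Assumption: each sampled point carries xyz coordinates only.
    points = [[rng.random() for _ in range(3)] for _ in range(n)]

    w1, b1 = init_layer(3, 16, rng)
    base = pointwise_linear(points, w1, b1)            # n x 16

    wg, bg = init_layer(16, GLOBAL_DIM, rng)
    per_point_1024 = pointwise_linear(base, wg, bg)    # n x 1024

    wl, bl = init_layer(16, LOCAL_DIM, rng)
    local = pointwise_linear(base, wl, bl)             # n x 64 (assumed width)

    # Random stand-in for per-point RGB features from a VGG11-based extractor.
    rgb = [[rng.random() for _ in range(RGB_DIM)] for _ in range(n)]

    fused = fuse(rgb, local, per_point_1024)           # n x 1152
    return base, per_point_1024, fused
```

Calling `demo()` returns the three intermediate feature sets, so their row and column counts can be checked against the n × 16, n × 1024, and N × 1152 sizes quoted in the description. The three prediction branches (position, rotation, confidence) are left out of the sketch.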
format Article
id doaj-art-7f81f1e41a0b47bd8d5a13efa0e731f9
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-7f81f1e41a0b47bd8d5a13efa0e731f9
2025-02-03T00:58:58Z
eng
Wiley
Complexity
1076-2787
1099-0526
2021-01-01
2021
10.1155/2021/6697697
6697697
Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
Xuhui Song (0)
Linyuan Fan (1)
(0) Department of Sports, Capital University of Economics and Business, Beijing 100070, China
(1) School of Statistics, Capital University of Economics and Business, Beijing 100070, China
http://dx.doi.org/10.1155/2021/6697697
spellingShingle Xuhui Song
Linyuan Fan
Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
Complexity
title Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
title_full Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
title_fullStr Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
title_full_unstemmed Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
title_short Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset
title_sort human posture recognition and estimation method based on 3d multiview basketball sports dataset
url http://dx.doi.org/10.1155/2021/6697697
work_keys_str_mv AT xuhuisong humanposturerecognitionandestimationmethodbasedon3dmultiviewbasketballsportsdataset
AT linyuanfan humanposturerecognitionandestimationmethodbasedon3dmultiviewbasketballsportsdataset