Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism

Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This st...

Full description

Saved in:

Bibliographic Details
Main Author:	Xiaolan Wang
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-01-01
Series:	Scientific Reports
Subjects:	Variable resolution Convolutional Neural Network Depth video Video coding efficiency Hyperautomation mechanism
Online Access:	https://doi.org/10.1038/s41598-025-85602-1
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841544818918948864
author	Xiaolan Wang
author_facet	Xiaolan Wang
author_sort	Xiaolan Wang
collection	DOAJ
description	Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This study focuses on deep video encoding and proposes an efficient encoding method that integrates the Convolutional Neural Network (CNN) with a hyperautomation mechanism. First, an overview of the principles underlying CNNs and the concept of hyperautomation is presented, and the application of CNNs in the intra-frame prediction module of video encoding is explored. By incorporating the hyperautomation mechanism, this study emphasizes the potential of Artificial Intelligence to enhance encoding efficiency. Next, a CNN-based method for variable-resolution intra-frame prediction of depth video is proposed. This method utilizes a multi-level feature fusion network to reconstruct coding units. The effectiveness of the proposed variable-resolution coding technique is then evaluated by comparing its performance against the original method on the high-efficiency video coding (HEVC) test platform. The results demonstrate that, compared to the original test platform method (HTM-16.2), the proposed method achieves an average Bjøntegaard delta bit rate (BDBR) savings of 8.12% across all tested video sequences. This indicates a significant improvement in coding efficiency. Furthermore, the viewpoint BDBR loss of the variable-resolution coding method is only 0.15%, which falls within an acceptable margin of error. This suggests that the method is both stable and reliable in viewpoint coding, and it performs well across a broad range of quantization parameter settings. Additionally, compared to other encoding methods, the proposed approach exhibits superior peak signal-to-noise ratio, structural similarity index, and perceptual quality metrics. This study introduces a novel and efficient approach to 3D video compression, and the integration of CNNs with hyperautomation provides valuable insights for future innovations in video encoding.
format	Article
id	doaj-art-6680f2f4ecff4da494737b3f884f3b8b
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-01-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-6680f2f4ecff4da494737b3f884f3b8b2025-01-12T12:17:01ZengNature PortfolioScientific Reports2045-23222025-01-0115112010.1038/s41598-025-85602-1Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanismXiaolan Wang0The Higher Educational Key Laboratory for Flexible Manufacturing Equipment Integration of Fujian Province (Xiamen Institute of Technology)Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This study focuses on deep video encoding and proposes an efficient encoding method that integrates the Convolutional Neural Network (CNN) with a hyperautomation mechanism. First, an overview of the principles underlying CNNs and the concept of hyperautomation is presented, and the application of CNNs in the intra-frame prediction module of video encoding is explored. By incorporating the hyperautomation mechanism, this study emphasizes the potential of Artificial Intelligence to enhance encoding efficiency. Next, a CNN-based method for variable-resolution intra-frame prediction of depth video is proposed. This method utilizes a multi-level feature fusion network to reconstruct coding units. The effectiveness of the proposed variable-resolution coding technique is then evaluated by comparing its performance against the original method on the high-efficiency video coding (HEVC) test platform. The results demonstrate that, compared to the original test platform method (HTM-16.2), the proposed method achieves an average Bjøntegaard delta bit rate (BDBR) savings of 8.12% across all tested video sequences. This indicates a significant improvement in coding efficiency. Furthermore, the viewpoint BDBR loss of the variable-resolution coding method is only 0.15%, which falls within an acceptable margin of error. This suggests that the method is both stable and reliable in viewpoint coding, and it performs well across a broad range of quantization parameter settings. Additionally, compared to other encoding methods, the proposed approach exhibits superior peak signal-to-noise ratio, structural similarity index, and perceptual quality metrics. This study introduces a novel and efficient approach to 3D video compression, and the integration of CNNs with hyperautomation provides valuable insights for future innovations in video encoding.https://doi.org/10.1038/s41598-025-85602-1Variable resolutionConvolutional Neural NetworkDepth videoVideo coding efficiencyHyperautomation mechanism
spellingShingle	Xiaolan Wang Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism Scientific Reports Variable resolution Convolutional Neural Network Depth video Video coding efficiency Hyperautomation mechanism
title	Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
title_full	Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
title_fullStr	Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
title_full_unstemmed	Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
title_short	Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
title_sort	strategies for enhancing deep video encoding efficiency using the convolutional neural network in a hyperautomation mechanism
topic	Variable resolution Convolutional Neural Network Depth video Video coding efficiency Hyperautomation mechanism
url	https://doi.org/10.1038/s41598-025-85602-1
work_keys_str_mv	AT xiaolanwang strategiesforenhancingdeepvideoencodingefficiencyusingtheconvolutionalneuralnetworkinahyperautomationmechanism

Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism

Similar Items