Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism
Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This st...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-01-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-85602-1 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850118666901782528 |
|---|---|
| author | Xiaolan Wang |
| author_facet | Xiaolan Wang |
| author_sort | Xiaolan Wang |
| collection | DOAJ |
| description | Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This study focuses on deep video encoding and proposes an efficient encoding method that integrates the Convolutional Neural Network (CNN) with a hyperautomation mechanism. First, an overview of the principles underlying CNNs and the concept of hyperautomation is presented, and the application of CNNs in the intra-frame prediction module of video encoding is explored. By incorporating the hyperautomation mechanism, this study emphasizes the potential of Artificial Intelligence to enhance encoding efficiency. Next, a CNN-based method for variable-resolution intra-frame prediction of depth video is proposed. This method utilizes a multi-level feature fusion network to reconstruct coding units. The effectiveness of the proposed variable-resolution coding technique is then evaluated by comparing its performance against the original method on the high-efficiency video coding (HEVC) test platform. The results demonstrate that, compared to the original test platform method (HTM-16.2), the proposed method achieves an average Bjøntegaard delta bit rate (BDBR) savings of 8.12% across all tested video sequences. This indicates a significant improvement in coding efficiency. Furthermore, the viewpoint BDBR loss of the variable-resolution coding method is only 0.15%, which falls within an acceptable margin of error. This suggests that the method is both stable and reliable in viewpoint coding, and it performs well across a broad range of quantization parameter settings. Additionally, compared to other encoding methods, the proposed approach exhibits superior peak signal-to-noise ratio, structural similarity index, and perceptual quality metrics. This study introduces a novel and efficient approach to 3D video compression, and the integration of CNNs with hyperautomation provides valuable insights for future innovations in video encoding. |
| format | Article |
| id | doaj-art-6680f2f4ecff4da494737b3f884f3b8b |
| institution | OA Journals |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-6680f2f4ecff4da494737b3f884f3b8b2025-08-20T02:35:49ZengNature PortfolioScientific Reports2045-23222025-01-0115112010.1038/s41598-025-85602-1Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanismXiaolan Wang0The Higher Educational Key Laboratory for Flexible Manufacturing Equipment Integration of Fujian Province (Xiamen Institute of Technology)Abstract With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This study focuses on deep video encoding and proposes an efficient encoding method that integrates the Convolutional Neural Network (CNN) with a hyperautomation mechanism. First, an overview of the principles underlying CNNs and the concept of hyperautomation is presented, and the application of CNNs in the intra-frame prediction module of video encoding is explored. By incorporating the hyperautomation mechanism, this study emphasizes the potential of Artificial Intelligence to enhance encoding efficiency. Next, a CNN-based method for variable-resolution intra-frame prediction of depth video is proposed. This method utilizes a multi-level feature fusion network to reconstruct coding units. The effectiveness of the proposed variable-resolution coding technique is then evaluated by comparing its performance against the original method on the high-efficiency video coding (HEVC) test platform. The results demonstrate that, compared to the original test platform method (HTM-16.2), the proposed method achieves an average Bjøntegaard delta bit rate (BDBR) savings of 8.12% across all tested video sequences. This indicates a significant improvement in coding efficiency. Furthermore, the viewpoint BDBR loss of the variable-resolution coding method is only 0.15%, which falls within an acceptable margin of error. This suggests that the method is both stable and reliable in viewpoint coding, and it performs well across a broad range of quantization parameter settings. Additionally, compared to other encoding methods, the proposed approach exhibits superior peak signal-to-noise ratio, structural similarity index, and perceptual quality metrics. This study introduces a novel and efficient approach to 3D video compression, and the integration of CNNs with hyperautomation provides valuable insights for future innovations in video encoding.https://doi.org/10.1038/s41598-025-85602-1Variable resolutionConvolutional Neural NetworkDepth videoVideo coding efficiencyHyperautomation mechanism |
| spellingShingle | Xiaolan Wang Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism Scientific Reports Variable resolution Convolutional Neural Network Depth video Video coding efficiency Hyperautomation mechanism |
| title | Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism |
| title_full | Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism |
| title_fullStr | Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism |
| title_full_unstemmed | Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism |
| title_short | Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism |
| title_sort | strategies for enhancing deep video encoding efficiency using the convolutional neural network in a hyperautomation mechanism |
| topic | Variable resolution Convolutional Neural Network Depth video Video coding efficiency Hyperautomation mechanism |
| url | https://doi.org/10.1038/s41598-025-85602-1 |
| work_keys_str_mv | AT xiaolanwang strategiesforenhancingdeepvideoencodingefficiencyusingtheconvolutionalneuralnetworkinahyperautomationmechanism |