A new deep learning-based fast transcoding for internet of things applications
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-05-01 |
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-99533-4 |
| Summary: | To achieve low-power video communication in the Internet of Things, this study presents a new deep learning-based fast transcoding algorithm from distributed video coding (DVC) to high efficiency video coding (HEVC). The proposed method accelerates transcoding by minimizing HEVC encoding complexity. Specifically, it models the selection of coding unit (CU) partitions and of prediction unit (PU) partition modes as classification tasks. To address these tasks, a novel lightweight deep learning network acts as the classifier in a top-down transcoding strategy. The proposed transcoding algorithm operates at both the CU and PU levels. At the CU level, it reduces HEVC encoding complexity by predicting CU partitions. At the PU level, predicting PU partition modes for non-split CUs further streamlines the encoding process. Experimental results show that the proposed CU-level transcoding reduces encoding complexity by 45.69%, with a 1.33% average Bjøntegaard delta bit-rate (BD-BR) increase. At the PU level, the transcoding achieves a greater complexity reduction, averaging 60.97%, with a 2.16% average BD-BR increase. These results indicate that the algorithm balances computational cost against compression performance. The proposed method provides a promising low-power video coding scheme for resource-constrained terminals in both upstream and downstream video communication scenarios. |
| ISSN: | 2045-2322 |
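
The top-down strategy described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function `transcode_ctu`, the callables `should_split` and `pu_mode`, and the toy rules below are hypothetical stand-ins for the paper's lightweight network, whose inputs and architecture are not given in this record. The sketch only assumes the standard HEVC quadtree, in which a 64x64 coding tree unit is split recursively into square CUs down to 8x8.

```python
from typing import Callable, List, Tuple

# Hypothetical classifier interfaces (assumptions, not taken from the paper):
#   should_split(x, y, size) -> True if the CU at (x, y) should be split in four
#   pu_mode(x, y, size)      -> PU partition mode label for a non-split CU
SplitFn = Callable[[int, int, int], bool]
ModeFn = Callable[[int, int, int], str]
Leaf = Tuple[int, int, int, str]


def transcode_ctu(x: int, y: int, size: int,
                  should_split: SplitFn, pu_mode: ModeFn,
                  min_size: int = 8) -> List[Leaf]:
    """Walk the CU quadtree top-down and return the non-split CUs.

    Each leaf is (x, y, size, pu_mode). A classifier decision replaces the
    exhaustive rate-distortion search an encoder would otherwise perform.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves: List[Leaf] = []
        for dy in (0, half):          # visit the four sub-CUs in raster order
            for dx in (0, half):
                leaves.extend(transcode_ctu(x + dx, y + dy, half,
                                            should_split, pu_mode, min_size))
        return leaves
    # Non-split CU: predict its PU partition mode instead of testing all modes.
    return [(x, y, size, pu_mode(x, y, size))]


def toy_split(x: int, y: int, size: int) -> bool:
    # Placeholder rule: split the 64x64 CTU, then only its top-left 32x32 CU.
    return size == 64 or (size == 32 and x == 0 and y == 0)


def toy_mode(x: int, y: int, size: int) -> str:
    # Placeholder rule: always predict the unpartitioned 2Nx2N mode.
    return "2Nx2N"


leaves = transcode_ctu(0, 0, 64, toy_split, toy_mode)
```

In a real transcoder, the two callables would be backed by a trained model, and the returned partition would be handed to the HEVC encoder so that it can skip the corresponding mode search.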