Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach

Video coding has been greatly growing in the recent four decades with a number of standards such as H.264/AVC, HEVC or VVC which have opened a new era of multimedia applications such as video conference, video surveillance, broadcasting, and E-learning. However, to achieve high compression performan...

Full description

Saved in:
Bibliographic Details
Main Authors: Tien Vu Huu, Huy Bui Quoc, Minh Do Ngoc, Xiem HoangVan
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11016678/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850131629083721728
author Tien Vu Huu
Huy Bui Quoc
Minh Do Ngoc
Xiem HoangVan
author_facet Tien Vu Huu
Huy Bui Quoc
Minh Do Ngoc
Xiem HoangVan
author_sort Tien Vu Huu
collection DOAJ
description Video coding has been greatly growing in the recent four decades with a number of standards such as H.264/AVC, HEVC or VVC which have opened a new era of multimedia applications such as video conference, video surveillance, broadcasting, and E-learning. However, to achieve high compression performance, video coding standard like HEVC typically suffers from the quality fluctuation which may directly affect the quality of experiences (QoE). To cope with this problem, we propose a novel video coding framework targeting two purposes, the consistent quality and the compression efficiency. First, to achieve the perceptual quality of video frame, we propose to use the VMAF (video multi-methods assessment fusion) metric as the distortion measurement instead of PSNR or SSIM in literatures due to its high correlation with the human perceptual system. Second, we propose a novel VMAF-QP (quantization parameter) neural network that learns how QP can be adjusted to achieve a certain VMAF grade. Following the estimated QP, we propose a RDO process to guarantee the compression efficiency of the proposed codec while maintaining the decoded video with quality consistency. In this process, the Rate-QP and Distortion-QP models are introduced and optimized through a number of statistical analyses. Finally, we integrate the proposed consistent quality video coding (CQVC) framework into the HEVC standard. Experimental results reveal that the proposed video coding framework achieves a small quality fluctuation while still guarantying the high compression efficiency, notably 4.5% Bitrate reduction when compared to the standard HEVC.
format Article
id doaj-art-95e95d99a95b4e14b28ba3ca16f4d9bf
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-95e95d99a95b4e14b28ba3ca16f4d9bf2025-08-20T02:32:22ZengIEEEIEEE Access2169-35362025-01-0113943369435310.1109/ACCESS.2025.357430311016678Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction ApproachTien Vu Huu0https://orcid.org/0000-0003-2736-2194Huy Bui Quoc1https://orcid.org/0009-0002-2453-8609Minh Do Ngoc2Xiem HoangVan3https://orcid.org/0000-0002-7524-6529Faculty of Multimedia, Posts and Telecommunications Institute of Technology, Hanoi, VietnamFaculty of Multimedia, Posts and Telecommunications Institute of Technology, Hanoi, VietnamFaculty of Electronics and Telecommunications, University of Engineering and Technology—Vietnam National University, Hanoi, VietnamFaculty of Electronics and Telecommunications, University of Engineering and Technology—Vietnam National University, Hanoi, VietnamVideo coding has been greatly growing in the recent four decades with a number of standards such as H.264/AVC, HEVC or VVC which have opened a new era of multimedia applications such as video conference, video surveillance, broadcasting, and E-learning. However, to achieve high compression performance, video coding standard like HEVC typically suffers from the quality fluctuation which may directly affect the quality of experiences (QoE). To cope with this problem, we propose a novel video coding framework targeting two purposes, the consistent quality and the compression efficiency. First, to achieve the perceptual quality of video frame, we propose to use the VMAF (video multi-methods assessment fusion) metric as the distortion measurement instead of PSNR or SSIM in literatures due to its high correlation with the human perceptual system. Second, we propose a novel VMAF-QP (quantization parameter) neural network that learns how QP can be adjusted to achieve a certain VMAF grade. Following the estimated QP, we propose a RDO process to guarantee the compression efficiency of the proposed codec while maintaining the decoded video with quality consistency. In this process, the Rate-QP and Distortion-QP models are introduced and optimized through a number of statistical analyses. Finally, we integrate the proposed consistent quality video coding (CQVC) framework into the HEVC standard. Experimental results reveal that the proposed video coding framework achieves a small quality fluctuation while still guarantying the high compression efficiency, notably 4.5% Bitrate reduction when compared to the standard HEVC.https://ieeexplore.ieee.org/document/11016678/VMAFneural networkvideo coding standards
spellingShingle Tien Vu Huu
Huy Bui Quoc
Minh Do Ngoc
Xiem HoangVan
Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
IEEE Access
VMAF
neural network
video coding standards
title Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
title_full Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
title_fullStr Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
title_full_unstemmed Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
title_short Consistent Quality in Video Coding: A Perceptual RDO and Neural Network-Based QP Prediction Approach
title_sort consistent quality in video coding a perceptual rdo and neural network based qp prediction approach
topic VMAF
neural network
video coding standards
url https://ieeexplore.ieee.org/document/11016678/
work_keys_str_mv AT tienvuhuu consistentqualityinvideocodingaperceptualrdoandneuralnetworkbasedqppredictionapproach
AT huybuiquoc consistentqualityinvideocodingaperceptualrdoandneuralnetworkbasedqppredictionapproach
AT minhdongoc consistentqualityinvideocodingaperceptualrdoandneuralnetworkbasedqppredictionapproach
AT xiemhoangvan consistentqualityinvideocodingaperceptualrdoandneuralnetworkbasedqppredictionapproach