Automatic Quality Assessment of Speech-Driven Synthesized Gestures

The automatic synthesis of realistic gestures has the ability to change the fields of animation, avatars, and communication agents. Although speech-driven synthetic gesture generation methods have been proposed and optimized, the evaluation system of synthetic gestures is still lacking. The current...

Full description

Saved in:
Bibliographic Details
Main Author: Zhiyuan He
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:International Journal of Computer Games Technology
Online Access:http://dx.doi.org/10.1155/2022/1828293
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849696001371144192
author Zhiyuan He
author_facet Zhiyuan He
author_sort Zhiyuan He
collection DOAJ
description The automatic synthesis of realistic gestures has the ability to change the fields of animation, avatars, and communication agents. Although speech-driven synthetic gesture generation methods have been proposed and optimized, the evaluation system of synthetic gestures is still lacking. The current evaluation method still needs manual participation, but it is inefficient in the industry of synthetic gestures and has the interference of human factors. So we need a model that can construct an automatic and objective quantitative quality assessment of the synthesized gesture video. We noticed that recurrent neural networks (RNN) have advantages in modeling advanced spatiotemporal feature sequences, which are very suitable for use in the processing of synthetic gesture video data. Therefore, to build an automatic quality assessment system, we propose in our work a model based on Bi-LSTM and make a little adjustment on the attention mechanism in it. Also, the evaluation method is proposed and experiments are designed to prove that the improved model of the algorithm can complete the quantitative evaluation of synthetic gestures. At the same time, in terms of performance, the model has an improvement of about 20% compared to before the algorithm adjustment.
format Article
id doaj-art-d0bca2c261834296a19f00ae5c57bdf1
institution DOAJ
issn 1687-7055
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series International Journal of Computer Games Technology
spelling doaj-art-d0bca2c261834296a19f00ae5c57bdf12025-08-20T03:19:35ZengWileyInternational Journal of Computer Games Technology1687-70552022-01-01202210.1155/2022/1828293Automatic Quality Assessment of Speech-Driven Synthesized GesturesZhiyuan He0The University of EdinburghThe automatic synthesis of realistic gestures has the ability to change the fields of animation, avatars, and communication agents. Although speech-driven synthetic gesture generation methods have been proposed and optimized, the evaluation system of synthetic gestures is still lacking. The current evaluation method still needs manual participation, but it is inefficient in the industry of synthetic gestures and has the interference of human factors. So we need a model that can construct an automatic and objective quantitative quality assessment of the synthesized gesture video. We noticed that recurrent neural networks (RNN) have advantages in modeling advanced spatiotemporal feature sequences, which are very suitable for use in the processing of synthetic gesture video data. Therefore, to build an automatic quality assessment system, we propose in our work a model based on Bi-LSTM and make a little adjustment on the attention mechanism in it. Also, the evaluation method is proposed and experiments are designed to prove that the improved model of the algorithm can complete the quantitative evaluation of synthetic gestures. At the same time, in terms of performance, the model has an improvement of about 20% compared to before the algorithm adjustment.http://dx.doi.org/10.1155/2022/1828293
spellingShingle Zhiyuan He
Automatic Quality Assessment of Speech-Driven Synthesized Gestures
International Journal of Computer Games Technology
title Automatic Quality Assessment of Speech-Driven Synthesized Gestures
title_full Automatic Quality Assessment of Speech-Driven Synthesized Gestures
title_fullStr Automatic Quality Assessment of Speech-Driven Synthesized Gestures
title_full_unstemmed Automatic Quality Assessment of Speech-Driven Synthesized Gestures
title_short Automatic Quality Assessment of Speech-Driven Synthesized Gestures
title_sort automatic quality assessment of speech driven synthesized gestures
url http://dx.doi.org/10.1155/2022/1828293
work_keys_str_mv AT zhiyuanhe automaticqualityassessmentofspeechdrivensynthesizedgestures