Voice Quality Modelling for Expressive Speech Synthesis

This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expre...

Full description

Saved in:

Bibliographic Details
Main Authors:	Carlos Monzo, Ignasi Iriondo, Joan Claudi Socoró
Format:	Article
Language:	English
Published:	Wiley 2014-01-01
Series:	The Scientific World Journal
Online Access:	http://dx.doi.org/10.1155/2014/627189
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832551452274327552
author	Carlos Monzo Ignasi Iriondo Joan Claudi Socoró
author_facet	Carlos Monzo Ignasi Iriondo Joan Claudi Socoró
author_sort	Carlos Monzo
collection	DOAJ
description	This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.
format	Article
id	doaj-art-34a5816fdf984c498c93c4f73edb37b9
institution	Kabale University
issn	2356-6140 1537-744X
language	English
publishDate	2014-01-01
publisher	Wiley
record_format	Article
series	The Scientific World Journal
spelling	doaj-art-34a5816fdf984c498c93c4f73edb37b92025-02-03T06:01:19ZengWileyThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/627189627189Voice Quality Modelling for Expressive Speech SynthesisCarlos Monzo0Ignasi Iriondo1Joan Claudi Socoró2Computer Science, Multimedia and Telecommunication Studies, Universitat Oberta de Catalunya (UOC), Rambla del Poblenou 156, 08018 Barcelona, SpainGrup de Recerca en Tecnologies Mèdia (GTM), Universitat Ramon Llull, La Salle, Quatre Camins 2, 08022 Barcelona, SpainGrup de Recerca en Tecnologies Mèdia (GTM), Universitat Ramon Llull, La Salle, Quatre Camins 2, 08022 Barcelona, SpainThis paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.http://dx.doi.org/10.1155/2014/627189
spellingShingle	Carlos Monzo Ignasi Iriondo Joan Claudi Socoró Voice Quality Modelling for Expressive Speech Synthesis The Scientific World Journal
title	Voice Quality Modelling for Expressive Speech Synthesis
title_full	Voice Quality Modelling for Expressive Speech Synthesis
title_fullStr	Voice Quality Modelling for Expressive Speech Synthesis
title_full_unstemmed	Voice Quality Modelling for Expressive Speech Synthesis
title_short	Voice Quality Modelling for Expressive Speech Synthesis
title_sort	voice quality modelling for expressive speech synthesis
url	http://dx.doi.org/10.1155/2014/627189
work_keys_str_mv	AT carlosmonzo voicequalitymodellingforexpressivespeechsynthesis AT ignasiiriondo voicequalitymodellingforexpressivespeechsynthesis AT joanclaudisocoro voicequalitymodellingforexpressivespeechsynthesis

Voice Quality Modelling for Expressive Speech Synthesis

Similar Items