Voice Quality Modelling for Expressive Speech Synthesis
This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expre...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2014-01-01
|
Series: | The Scientific World Journal |
Online Access: | http://dx.doi.org/10.1155/2014/627189 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832551452274327552 |
---|---|
author | Carlos Monzo Ignasi Iriondo Joan Claudi Socoró |
author_facet | Carlos Monzo Ignasi Iriondo Joan Claudi Socoró |
author_sort | Carlos Monzo |
collection | DOAJ |
description | This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics. |
format | Article |
id | doaj-art-34a5816fdf984c498c93c4f73edb37b9 |
institution | Kabale University |
issn | 2356-6140 1537-744X |
language | English |
publishDate | 2014-01-01 |
publisher | Wiley |
record_format | Article |
series | The Scientific World Journal |
spelling | doaj-art-34a5816fdf984c498c93c4f73edb37b92025-02-03T06:01:19ZengWileyThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/627189627189Voice Quality Modelling for Expressive Speech SynthesisCarlos Monzo0Ignasi Iriondo1Joan Claudi Socoró2Computer Science, Multimedia and Telecommunication Studies, Universitat Oberta de Catalunya (UOC), Rambla del Poblenou 156, 08018 Barcelona, SpainGrup de Recerca en Tecnologies Mèdia (GTM), Universitat Ramon Llull, La Salle, Quatre Camins 2, 08022 Barcelona, SpainGrup de Recerca en Tecnologies Mèdia (GTM), Universitat Ramon Llull, La Salle, Quatre Camins 2, 08022 Barcelona, SpainThis paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.http://dx.doi.org/10.1155/2014/627189 |
spellingShingle | Carlos Monzo Ignasi Iriondo Joan Claudi Socoró Voice Quality Modelling for Expressive Speech Synthesis The Scientific World Journal |
title | Voice Quality Modelling for Expressive Speech Synthesis |
title_full | Voice Quality Modelling for Expressive Speech Synthesis |
title_fullStr | Voice Quality Modelling for Expressive Speech Synthesis |
title_full_unstemmed | Voice Quality Modelling for Expressive Speech Synthesis |
title_short | Voice Quality Modelling for Expressive Speech Synthesis |
title_sort | voice quality modelling for expressive speech synthesis |
url | http://dx.doi.org/10.1155/2014/627189 |
work_keys_str_mv | AT carlosmonzo voicequalitymodellingforexpressivespeechsynthesis AT ignasiiriondo voicequalitymodellingforexpressivespeechsynthesis AT joanclaudisocoro voicequalitymodellingforexpressivespeechsynthesis |