Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training

The most modern speech synthesis systems are based on the corpus-based method. The corpus-based method, unlike previously popular compilation method, uses natural speech database that does not consist of separate specially selected elements of compilation, but represents the corpus of phonograms of...

Full description

Saved in:

Bibliographic Details
Main Author:	S. I. Lysy
Format:	Article
Language:	Russian
Published:	National Academy of Sciences of Belarus, the United Institute of Informatics Problems 2019-03-01
Series:	Informatika
Subjects:	phonetic minimization the belarusian language speech synthesis corpus-based method text corpus
Online Access:	https://inf.grid.by/jour/article/view/748
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849771478729359360
author	S. I. Lysy
author_facet	S. I. Lysy
author_sort	S. I. Lysy
collection	DOAJ
description	The most modern speech synthesis systems are based on the corpus-based method. The corpus-based method, unlike previously popular compilation method, uses natural speech database that does not consist of separate specially selected elements of compilation, but represents the corpus of phonograms of natural speech. Large amounts of text and corresponding audio information, which represents a significant challenge for so-called under-resourced languages, which include Belarusian, are required to achieve high-quality synthesized speech in this approach. In this case, a common approach is to use phonetic minimization, special selection of texts, when the amount of text corpus is maximally reduced, but at the same time phonetic fullness is preserved. The article discusses the information about the nature and the functioning the corpus-based method of sound signal generation in speech synthesis systems, provides a detailed overview of the approaches to the formation of text and speech corpuses, required for speech generation by the corpus-based method. The second half of the work is devoted to the description of the elaborated algorithm of the text corpus phonetic minimization in Belarusian language, as well as technical and linguistic resources used to implement it. A description of the developed software prototype as well as a description of the series of experiments on phonetic minimization are given to demonstrate the efficiency of the algorithm.
format	Article
id	doaj-art-030ed5b6871a425fb010468a4c52efc4
institution	DOAJ
issn	1816-0301
language	Russian
publishDate	2019-03-01
publisher	National Academy of Sciences of Belarus, the United Institute of Informatics Problems
record_format	Article
series	Informatika
spelling	doaj-art-030ed5b6871a425fb010468a4c52efc42025-08-20T03:02:37ZrusNational Academy of Sciences of Belarus, the United Institute of Informatics ProblemsInformatika1816-03012019-03-011617585731Phonetic minimization of the text corpus in Belarusian for the speech synthesis system trainingS. I. Lysy0The United Institute of Informatics Problems of the National Academyof Sciences of Belarus, MinskThe most modern speech synthesis systems are based on the corpus-based method. The corpus-based method, unlike previously popular compilation method, uses natural speech database that does not consist of separate specially selected elements of compilation, but represents the corpus of phonograms of natural speech. Large amounts of text and corresponding audio information, which represents a significant challenge for so-called under-resourced languages, which include Belarusian, are required to achieve high-quality synthesized speech in this approach. In this case, a common approach is to use phonetic minimization, special selection of texts, when the amount of text corpus is maximally reduced, but at the same time phonetic fullness is preserved. The article discusses the information about the nature and the functioning the corpus-based method of sound signal generation in speech synthesis systems, provides a detailed overview of the approaches to the formation of text and speech corpuses, required for speech generation by the corpus-based method. The second half of the work is devoted to the description of the elaborated algorithm of the text corpus phonetic minimization in Belarusian language, as well as technical and linguistic resources used to implement it. A description of the developed software prototype as well as a description of the series of experiments on phonetic minimization are given to demonstrate the efficiency of the algorithm.https://inf.grid.by/jour/article/view/748phonetic minimizationthe belarusian languagespeech synthesiscorpus-based methodtext corpus
spellingShingle	S. I. Lysy Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training Informatika phonetic minimization the belarusian language speech synthesis corpus-based method text corpus
title	Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training
title_full	Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training
title_fullStr	Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training
title_full_unstemmed	Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training
title_short	Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training
title_sort	phonetic minimization of the text corpus in belarusian for the speech synthesis system training
topic	phonetic minimization the belarusian language speech synthesis corpus-based method text corpus
url	https://inf.grid.by/jour/article/view/748
work_keys_str_mv	AT silysy phoneticminimizationofthetextcorpusinbelarusianforthespeechsynthesissystemtraining

Phonetic minimization of the text corpus in Belarusian for the speech synthesis system training

Similar Items