Indonesian Voice Cloning Text-to-Speech System With Vall-E-Based Model and Speech Enhancement

In recent years, Text-to-Speech (TTS) technology has advanced, with research focusing on multi-speaker TTS capable of voice cloning. In 2023, Wang et al. introduced Vall-E, a Transformer-based neural codec language model, achieving state-of-the-art results in voice cloning. However, limited research...

Bibliographic Details
Main Authors: Hizkia Raditya Pratama Roosadi, Rizki Rivai Ginanjar, Dessi Puji Lestari
Format: Article
Language: English
Published: IEEE 2024-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10806715/