V2Coder: A Non-Autoregressive Vocoder Based on Hierarchical Variational Autoencoders
This paper introduces V2Coder, a non-autoregressive vocoder based on hierarchical variational autoencoders (VAEs). The hierarchical VAE with hierarchically extended prior and approximate posterior distributions is highly expressive for modeling stochastic components of speech waveforms. V2Coder lear...
Saved in:
| Main Authors: | Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11014058/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Perceptual consequences of change in vocoded speech parameters for various reverberation conditions
by: Szymon Drgas, et al.
Published: (2014-01-01) -
Perception of vocoded speech in domestic dogs
by: Amritha Mallikarjun, et al.
Published: (2024-04-01) -
Neural adaptations to temporal cues degradation in early blind: insights from envelope and fine structure vocoding
by: Hyo Jung Choi, et al.
Published: (2025-05-01) -
A Bitrate-Scalable Variational Recurrent Mel-Spectrogram Coder for Real-Time Resynthesis-Based Speech Coding
by: Benjamin Stahl, et al.
Published: (2024-01-01) -
Preventing Posterior Collapse with DVAE for Text Modeling
by: Tianbao Song, et al.
Published: (2025-04-01)