Advanced Identification of Prosodic Boundaries, Speakers, and Accents Through Multi-Task Audio Pre-Processing and Speech Language Models
In recent years, the advances in deep neural networks (DNNs) and large language models (LLMs) have led to major breakthroughs and new levels of performance in Natural Language Processing (NLP), including tasks related to speech processing. Based on these new trends, new models such as Whisper and Wa...
Saved in:
| Main Authors: | Francisco Javier Lima Florido, Gloria Corpas Pastor |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-03-01
|
| Series: | Computers |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2073-431X/14/3/102 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Voice disguise and foreign accent: Prosodic aspects of English produced by Bra-zilian Portuguese speakers
by: Leônidas Silva Jr., et al.
Published: (2023-11-01) -
The Neglected Group: Cognitive Discourse Markers as Signposts of Prosodic Unit Boundaries
by: Simona Majhenič, et al.
Published: (2025-06-01) -
w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training
by: Orlem Lima Dos Santos, et al.
Published: (2024-01-01) -
Transformer-based language-independent gender recognition in noisy audio environments
by: Or Haim Anidjar, et al.
Published: (2025-04-01) -
Advancing Spanish Speech Emotion Recognition: A Comprehensive Benchmark of Pre-Trained Models
by: Alex Mares, et al.
Published: (2025-04-01)