Advanced Identification of Prosodic Boundaries, Speakers, and Accents Through Multi-Task Audio Pre-Processing and Speech Language Models

In recent years, advances in deep neural networks (DNNs) and large language models (LLMs) have led to major breakthroughs and new levels of performance in Natural Language Processing (NLP), including speech processing tasks. Building on these trends, new models such as Whisper and Wa...

Bibliographic Details
Main Authors: Francisco Javier Lima Florido, Gloria Corpas Pastor
Format: Article
Language: English
Published: MDPI AG 2025-03-01
Series: Computers
Online Access: https://www.mdpi.com/2073-431X/14/3/102
