A speech recognition method with enhanced transformer decoder
Abstract Addressing the issue that the Transformer decoder struggles to capture local features for monotonic alignment in speech recognition, and simultaneously incorporating language model cold fusion training into the decoder, an enhanced decoder-based speech recognition model is investigated. The...
Saved in:
| Main Authors: | Hengbo Hu, Tong Niu, Zhenhua He |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-02-01
|
| Series: | EURASIP Journal on Audio, Speech, and Music Processing |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s13636-025-00394-6 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Comparison of Linear and Nonlinear Methods for Decoding Selective Attention to Speech From Ear-EEG Recordings
by: Mike D. Thornton, et al.
Published: (2025-01-01) -
Enhanced Conformer-Based Speech Recognition via Model Fusion and Adaptive Decoding with Dynamic Rescoring
by: Junhao Geng, et al.
Published: (2024-12-01) -
Integrating Pose Features and Cross-Relationship Learning for Human–Object Interaction Detection
by: Lang Wu, et al.
Published: (2025-03-01) -
SDNet: Sandwich Decoder Network for Waterbody Segmentation in Remote Sensing Imagery
by: Hao Ni, et al.
Published: (2025-01-01) -
Enhancing Emotion Recognition in Speech Based on Self-Supervised Learning: Cross-Attention Fusion of Acoustic and Semantic Features
by: Bashar M. Deeb, et al.
Published: (2025-01-01)