Parameter-efficient adaptation with multi-channel adversarial training for far-field speech recognition
Abstract Despite notable advancements in automatic speech recognition (ASR) technologies, issues such as background noise, reverberation, and speaker distance still degrade the performance of far-field speech recognition (FSR). Although large-scale pre-trained models have shown promise, their adapta...
Saved in:
| Main Authors: | Tong Niu, Yaqi Chen, Dan Qu, Hengbo Hu, ChengRan Liu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-04-01
|
| Series: | EURASIP Journal on Audio, Speech, and Music Processing |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s13636-025-00406-5 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Enhancing Far-Field Speech Recognition with Mixer: A Novel Data Augmentation Approach
by: Tong Niu, et al.
Published: (2025-04-01) -
Neural network models for whisper to normal speech conversion
by: Cézar Yamamura, et al.
Published: (2025-03-01) -
A Protection Scheme With Speech Processing Against Audio Adversarial Examples
by: Yuya Tarutani, et al.
Published: (2024-01-01) -
Collaborative AI Dysarthric Speech Recognition System With Data Augmentation Using Generative Adversarial Neural Network
by: Yibo He, et al.
Published: (2025-01-01) -
Formant frequency estimations of whispered speech in Chinese
by: Gang LV, et al.
Published: (2009-01-01)