Parameter-efficient adaptation with multi-channel adversarial training for far-field speech recognition

Abstract: Despite notable advancements in automatic speech recognition (ASR) technologies, issues such as background noise, reverberation, and speaker distance still degrade the performance of far-field speech recognition (FSR). Although large-scale pre-trained models have shown promise, their adapta...

Bibliographic Details
Main Authors:	Tong Niu, Yaqi Chen, Dan Qu, Hengbo Hu, ChengRan Liu
Format:	Article
Language:	English
Published:	SpringerOpen 2025-04-01
Series:	EURASIP Journal on Audio, Speech, and Music Processing
Subjects:	Far-field speech recognition; Prefix tuning; Adversarial training; Whisper
Online Access:	https://doi.org/10.1186/s13636-025-00406-5

Similar Items

Enhancing Far-Field Speech Recognition with Mixer: A Novel Data Augmentation Approach
by: Tong Niu, et al.
Published: (2025-04-01)

Neural network models for whisper to normal speech conversion
by: Cézar Yamamura, et al.
Published: (2025-03-01)

A Protection Scheme With Speech Processing Against Audio Adversarial Examples
by: Yuya Tarutani, et al.
Published: (2024-01-01)

Collaborative AI Dysarthric Speech Recognition System With Data Augmentation Using Generative Adversarial Neural Network
by: Yibo He, et al.
Published: (2025-01-01)

Formant frequency estimations of whispered speech in Chinese
by: Gang LV, et al.
Published: (2009-01-01)

Application of Teager Energy Operator on Linear and Mel Scales for Whispered Speech Recognition
by: Branko R MARKOVIĆ, et al.
Published: (2018-01-01)

Mandarin speech reconstruction from surface electromyography based on generative adversarial networks
by: Fengji Li, et al.
Published: (2025-06-01)

Whisper Automatic Speech Recognition and GPT Large Language Models as Best Practice for Assessing Communication Progress in Autism Spectrum Disorder
by: Naela Fauzul Muna, et al.
Published: (2025-04-01)

Thermo–Optic Tuning of Integrated Polymethyl Methacrylate Sphere Whispering Gallery Mode Resonator
by: Leilei Shi, et al.
Published: (2016-01-01)

Multi-Stage Audio-Visual Fusion for Dysarthric Speech Recognition With Pre-Trained Models
by: Chongchong Yu, et al.
Published: (2023-01-01)

TFDense-GAN: a generative adversarial network for single-channel speech enhancement
by: Haoxiang Chen, et al.
Published: (2025-03-01)

Resilience of Named Entity Recognition models against adversarial attacks
by: Paweł Walkowiak
Published: (2025-07-01)

All-Optical Tuning Based on Magnetic Fluid-Filled Microcapillary Resonators Inserted with Half-Cone Fiber
by: Minggang Chai, et al.
Published: (2025-03-01)

CAs-Net: A Channel-Aware Speech Network for Uyghur Speech Recognition
by: Jiang Zhang, et al.
Published: (2025-06-01)

Assessing Costa Rican children speech recognition by humans and machines
by: Maribel Morales-Rodríguez, et al.
Published: (2022-11-01)

Improving Speech Recognition Rate through Analysis Parameters
by: Eringis Deividas, et al.
Published: (2014-05-01)

Development of speech material for an Armenian speech recognition threshold test
by: Sona Sargsyan, et al.
Published: (2021-09-01)

Ghost in the Radio: An Audio Adversarial Attack Using Environmental Noise Through Radio
by: Hyeongjun Choi, et al.
Published: (2024-01-01)

Using casual speech phonology in synthetic speech
by: Linda SHOCKEY
Published: (2014-04-01)

Prefix Tuning Using Residual Reparameterization
by: Youngjun Jung, et al.
Published: (2025-01-01)

Fault Recognition Method and Application Based on Generative Adversarial Network
by: Shuiliang Luo, et al.
Published: (2025-06-01)

SVIT‐SSR: A sEMG‐based vision transformer approach for silent speech recognition
by: Zhao Li, et al.
Published: (2024-11-01)

Speech Emotion Recognition: Humans vs Machines
by: S. Werner, et al.
Published: (2019-12-01)

Recent advancements in automatic disordered speech recognition: A survey paper
by: Nada Gohider, et al.
Published: (2024-12-01)

P-GELU: A Novel Activation Function to Optimize Whisper for Darija Speech Translation
by: Maria Labied, et al.
Published: (2025-01-01)

Enhancing Speech Recognition in Adverse Listening Environments: The Impact of Brief Musical Training on Older Adults
by: Akhila R. NANDAKUMAR, et al.
Published: (2023-12-01)

Speech Recognition in an Enclosure with a Long Reverberation Time
by: Jedrzej KOCINSKI, et al.
Published: (2016-02-01)

Bridging language gaps: The role of NLP and speech recognition in oral English instruction
by: Parul Dubey, et al.
Published: (2025-06-01)

Speech Emotion Recognition under White Noise
by: Chengwei HUANG, et al.
Published: (2013-12-01)

Multilingual Prediction of Cognitive Impairment with Large Language Models and Speech Analysis
by: Felix Agbavor, et al.
Published: (2024-12-01)

Assessment of L2 Spanish pronunciation accuracy via Automatic Speech Recognition
by: Albina Sarymsakova, et al.
Published: (2025-07-01)

Enhancing Robustness Against Adversarial Attacks in Multimodal Emotion Recognition With Spiking Transformers
by: Guoming Chen, et al.
Published: (2025-01-01)

Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01)

Multichannel speech enhancement for automatic speech recognition: a literature review
by: Zubair Zaland, et al.
Published: (2025-03-01)

Speech Emotion Recognition Based on Voice Fundamental Frequency
by: Teodora DIMITROVA-GREKOW, et al.
Published: (2019-04-01)

Project Euphonia: advancing inclusive speech recognition through expanded data collection and evaluation
by: Alicia Martin, et al.
Published: (2025-06-01)

Skeleton-Based Data Augmentation for Sign Language Recognition Using Adversarial Learning
by: Yuriya Nakamura, et al.
Published: (2025-01-01)

Neural signals, machine learning, and the future of inner speech recognition
by: Adiba Tabassum Chowdhury, et al.
Published: (2025-07-01)

Adversarial detection based on feature invariant in license plate recognition systems
by: ZHU Xiaoyu, et al.
Published: (2024-12-01)

Refining maritime Automatic Speech Recognition by leveraging synthetic speech
by: Christoph Martius, et al.
Published: (2024-12-01)