The analysis of transformer end-to-end model in Real-time interactive scene based on speech recognition technology
Abstract In order to face the uncertainty and semantic complexity of speech signals in real-time interactive scenes and achieve more efficient and accurate speech recognition results, this study proposes a Dynamic Adaptive Transformer for Real-Time Speech Recognition (DATR-SR) model. The study is ba...
Saved in:
| Main Authors: | Ping Li, Can Yang, Lei Mao |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-05-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-02904-0 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
End-to-end feature fusion for jointly optimized speech enhancement and automatic speech recognition
by: Mohamed Medani, et al.
Published: (2025-07-01) -
End-to-end scene text detection and recognition algorithm based on Transformer decoders
by: Jinzhi ZHENG, et al.
Published: (2023-05-01) -
End-to-end scene text detection and recognition algorithm based on Transformer decoders
by: Jinzhi ZHENG, et al.
Published: (2023-05-01) -
End-to-End Mandarin Speech Reconstruction Based on Ultrasound Tongue Images Using Deep Learning
by: Fengji Li, et al.
Published: (2025-01-01) -
End-to-End Learning Framework Incorporating Image Reconstruction and Recognition Models
by: Ren Shibata, et al.
Published: (2025-01-01)