End-to-end audiovisual speech recognition based on attention fusion of SDBN and BLSTM
An end-to-end audiovisual speech recognition algorithm was proposed.In algorithm,a sparse DBN was constructed by introducing mixed l<sub>1/2</sub>norm and l<sub>1</sub>norm into Deep Belief Network with bottleneck structure to extract the spars...
Saved in:
Main Authors: | Yiming WANG, Ken CHEN, Aihaiti ABUDUSALAMU |
---|---|
Format: | Article |
Language: | zho |
Published: |
Beijing Xintong Media Co., Ltd
2019-12-01
|
Series: | Dianxin kexue |
Subjects: | |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019290/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
A review on speech recognition approaches and challenges for Portuguese: exploring the feasibility of fine-tuning large-scale end-to-end models
by: Yan Li, et al.
Published: (2025-01-01) -
An End-To-End Speech Recognition Model for the North Shaanxi Dialect: Design and Evaluation
by: Yi Qin, et al.
Published: (2025-01-01) -
End-to-End Mandarin Speech Reconstruction Based on Ultrasound Tongue Images Using Deep Learning
by: Fengji Li, et al.
Published: (2025-01-01) -
End-to-end scene text detection and recognition algorithm based on Transformer decoders
by: Jinzhi ZHENG, et al.
Published: (2023-05-01) -
Spoof speech detection based on context information and attention feature
by: Jia CHEN, et al.
Published: (2023-02-01)