SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition
Visual Speech Recognition (VSR), commonly referred to as automated lip-reading, is an emerging technology that interprets speech by visually analyzing lip movements. A challenge in VSR where visually distinct words produce similar lip movements is known as homopheme problem. Visemes are the basic vi...
Saved in:
| Main Authors: | Nicole Yah Yie Ha, Lee-Yeng Ong, Meng-Chew Leow |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Ital Publication
2024-12-01
|
| Series: | Emerging Science Journal |
| Subjects: | |
| Online Access: | https://ijournalse.org/index.php/ESJ/article/view/2670 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Sleeping and Eating Behavior Recognition of Horses Based on an Improved SlowFast Network
by: Yanhong Liu, et al.
Published: (2024-12-01) -
EML-SlowFast: A behavior recognition model for lion-head goose
by: Jinwei Wang, et al.
Published: (2025-08-01) -
Comparative Analysis of Fine-Tuning I3D and SlowFast Networks for Action Recognition in Surveillance Videos
by: T. Gopalakrishnan, et al.
Published: (2024-01-01) -
JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition
by: Chang Sun, et al.
Published: (2024-01-01) -
Analysis for speech and esthetics in sixty consecutive patients with cleft lip and palate
by: Mahantesh S Shiraganvi, et al.
Published: (2011-10-01)