SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

Visual Speech Recognition (VSR), commonly referred to as automated lip-reading, is an emerging technology that interprets speech by visually analyzing lip movements. A challenge in VSR where visually distinct words produce similar lip movements is known as homopheme problem. Visemes are the basic vi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Nicole Yah Yie Ha, Lee-Yeng Ong, Meng-Chew Leow
Format:	Article
Language:	English
Published:	Ital Publication 2024-12-01
Series:	Emerging Science Journal
Subjects:	visual speech recognition temporal convolutional network lip reading in wild slowfast network homophemes.
Online Access:	https://ijournalse.org/index.php/ESJ/article/view/2670
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Sleeping and Eating Behavior Recognition of Horses Based on an Improved SlowFast Network
by: Yanhong Liu, et al.
Published: (2024-12-01)

EML-SlowFast: A behavior recognition model for lion-head goose
by: Jinwei Wang, et al.
Published: (2025-08-01)

Comparative Analysis of Fine-Tuning I3D and SlowFast Networks for Action Recognition in Surveillance Videos
by: T. Gopalakrishnan, et al.
Published: (2024-01-01)

JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition
by: Chang Sun, et al.
Published: (2024-01-01)

Analysis for speech and esthetics in sixty consecutive patients with cleft lip and palate
by: Mahantesh S Shiraganvi, et al.
Published: (2011-10-01)

LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
by: Md. Tanvir Rahman Sahed, et al.
Published: (2025-02-01)

Deep Transfer Learning for Lip Reading Based on NASNetMobile Pretrained Model in Wild Dataset
by: Ashwaq Waleed Abdul Ameer, et al.
Published: (2025-01-01)

SPEECH DISORDERS AND DIFFICULTIES WITH READING AND WRITING
by: Beata Wołosiuk
Published: (2019-07-01)

Improving speaker-independent visual language identification using deep neural networks with training batch augmentation
by: Jacob L. Newman
Published: (2025-06-01)

Planning community-based intervention for speech for children with cleft lip and palate from rural South India: A needs assessment
by: Subramaniyan Balasubramaniyan, et al.
Published: (2017-09-01)

Silent speech recognition using visual cascading fusion of tongue-lip movements based on pre-trained and fine-tuned model
by: Chongchong Yu, et al.
Published: (2025-04-01)

Comparative Evaluation of Functional and Aesthetic Outcomes in Reconstruction of Commissure of Mouth with Radial Forearm Free Flap with and without Palmaris Longus Tendon in Patients being Operated for Squamous Cell Carcinoma of Buccal Mucosa and Lip: A Research Protocol
by: Imran Saleem Solanki, et al.
Published: (2025-04-01)

Application of new acoustic parameters in ANN-aided pathological speech diagnosis
by: Joanna SZALENIEC, et al.
Published: (2014-04-01)

Inner Speech and Speed Reading: An Analysis of Written Texts Internalization
by: Francy Lorena García Cobo, et al.
Published: (2024-09-01)

Software Package for Special Teaching and Testing of Children with Hearing and Speech Impairments
by: Stolyarova E.I., et al.
Published: (2022-04-01)

Neural network models for whisper to normal speech conversion
by: Cézar Yamamura, et al.
Published: (2025-03-01)

The application of Kohonen and Multilayer Perceptron Networks in the speech nonfluency analysis
by: Izabela Szczurowska, et al.
Published: (2014-01-01)

Synchronous Analysis of Speech Production and Lips Movement to Detect Parkinson’s Disease Using Deep Learning Methods
by: Cristian David Ríos-Urrego, et al.
Published: (2024-12-01)

Lip-Reading Classification of Turkish Digits Using Ensemble Learning Architecture Based on 3DCNN
by: Ali Erbey, et al.
Published: (2025-01-01)

A Novel Approach for Visual Speech Recognition Using the Partition-Time Masking and Swin Transformer 3D Convolutional Model
by: Xiangliang Zhang, et al.
Published: (2025-04-01)

Mandarin speech reconstruction from surface electromyography based on generative adversarial networks
by: Fengji Li, et al.
Published: (2025-06-01)

Automating Speech Audiometry in Quiet and in Noise Using a Deep Neural Network
by: Hadrien Jean, et al.
Published: (2025-02-01)

GA_FastICA Algorithmfor Speech Separation
by: LAN Chao-feng, et al.
Published: (2022-12-01)

Urdu Lip Reading Systems for Digits in Controlled and Uncontrolled Environment
by: Amanullah Baloch, et al.
Published: (2025-01-01)

Methodological Foundations of Film Speech Analysis Using Corpora: Technical, Social, and Cultural-National Aspects
by: Ya. M. Alyunina
Published: (2024-03-01)

Quantitative results of SonoSpeech Cleft Pilot: a mixed-methods pilot randomised control trial of ultrasound visual biofeedback versus standard intervention for children with cleft palate ± cleft lip
by: Maria Cairney, et al.
Published: (2025-05-01)

Speech etiquette situation of apology in classes of Russian as a foreign language (on the example of Asya Petrova's story «Sorry, Fool»)
by: Zhanna K. Gaponova, et al.
Published: (2023-08-01)

Deep Learning Based Automatic Speech Recognition for Turkish
by: Hamit Erdem, et al.
Published: (2020-08-01)

CNN Based Automatic Speech Recognition: A Comparative Study
by: Hilal Ilgaz, et al.
Published: (2024-08-01)

Large language models and speech genre systematicity
by: Devyatkin, Dmitry Alekseevich, et al.
Published: (2025-02-01)

Cochleogram-Based Speech Emotion Recognition with the Cascade of Asymmetric Resonators with Fast-Acting Compression Using Time-Distributed Convolutional Long Short-Term Memory and Support Vector Machines
by: Cevahir Parlak
Published: (2025-03-01)

Deep Neural Network for Supervised Single-Channel Speech Enhancement
by: Nasir SALEEM, et al.
Published: (2019-01-01)

Deep learning techniques for speech emotion recognition: A review
by: Silviana Widya Lestari, et al.
Published: (2023-06-01)

End-to-end neuromorphic speech enhancement with PDM microphones
by: Sidi Yaya Arnaud Yarga, et al.
Published: (2025-01-01)

Can behavioral features reveal lying in an online personality questionnaire? The impact of mouse dynamics and speech
by: Eduard Kuric, et al.
Published: (2025-05-01)

Hate Speech Detection Using Machine Learning: A Survey
by: Seble, H.,, et al.
Published: (2023-09-01)

Spatiotemporal Feature Enhancement for Lip-Reading: A Survey
by: Yinuo Ma, et al.
Published: (2025-04-01)

Drivers of hate speech in political conversations on Twitter: the case of the 2022 Italian general election
by: Francesco Pierri
Published: (2024-10-01)

End-to-End Multi-Speaker FastSpeech2 With Hierarchical Decoder
by: Majid Adibian, et al.
Published: (2025-01-01)

Using casual speech phonology in synthetic speech
by: Linda SHOCKEY
Published: (2014-04-01)