Exploring MediaPipe optimization strategies for real-time sign language recognition

The present study meticulously investigates optimization strategies for real-time sign language recognition (SLR) employing the MediaPipe framework. We introduce an innovative multi-modal methodology, amalgamating four distinct Long Short-Term Memory (LSTM) models dedicated to processing skeletal c...

Full description

Saved in:

Bibliographic Details
Main Authors:	Phuoc Thanh Nguyen, Thanh Hoang Nguyen, Ngoc Xuan Nguyen Hoang, Huynh Thanh Binh Phan, Hoang Son Hai Vu, Hieu Nhan Huynh
Format:	Article
Language:	English
Published:	Can Tho University Publisher 2023-10-01
Series:	CTU Journal of Innovation and Sustainable Development
Subjects:	LSTM, MediaPipe, How2Sign, Indian Sign Language, ISL
Online Access:	https://ctujs.ctu.edu.vn/index.php/ctujs/article/view/716
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The present study meticulously investigates optimization strategies for real-time sign language recognition (SLR) employing the MediaPipe framework. We introduce an innovative multi-modal methodology, amalgamating four distinct Long Short-Term Memory (LSTM) models dedicated to processing skeletal coordinates ascertained from the MediaPipe framework. Rigorous evaluations were executed on esteemed sign language datasets. Empirical findings underscore that the multi-modal approach significantly elevates the accuracy of the SLR model while preserving its real-time capabilities. In comparative analyses with prevalent MediaPipe-based models, our multi-modal strategy consistently manifested superior performance metrics. A distinguishing characteristic of this approach is its inherent adaptability, facilitating modifications within the LSTM layers, rendering it apt for a myriad of challenges and data typologies. Integrating the MediaPipe framework with real-time SLR markedly amplifies recognition precision, signifying a pivotal advancement in the discipline.
ISSN:	2588-1418 2815-6412

Exploring MediaPipe optimization strategies for real-time sign language recognition

Similar Items