Exploring MediaPipe optimization strategies for real-time sign language recognition

The present study meticulously investigates optimization strategies for real-time sign language recognition (SLR) employing the MediaPipe framework. We introduce an innovative multi-modal methodology, amalgamating four distinct Long Short-Term Memory (LSTM) models dedicated to processing skeletal c...

Full description

Saved in:
Bibliographic Details
Main Authors: Phuoc Thanh Nguyen, Thanh Hoang Nguyen, Ngoc Xuan Nguyen Hoang, Huynh Thanh Binh Phan, Hoang Son Hai Vu, Hieu Nhan Huynh
Format: Article
Language:English
Published: Can Tho University Publisher 2023-10-01
Series:CTU Journal of Innovation and Sustainable Development
Subjects:
Online Access:https://ctujs.ctu.edu.vn/index.php/ctujs/article/view/716
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The present study meticulously investigates optimization strategies for real-time sign language recognition (SLR) employing the MediaPipe framework. We introduce an innovative multi-modal methodology, amalgamating four distinct Long Short-Term Memory (LSTM) models dedicated to processing skeletal coordinates ascertained from the MediaPipe framework. Rigorous evaluations were executed on esteemed sign language datasets. Empirical findings underscore that the multi-modal approach significantly elevates the accuracy of the SLR model while preserving its real-time capabilities. In comparative analyses with prevalent MediaPipe-based models, our multi-modal strategy consistently manifested superior performance metrics. A distinguishing characteristic of this approach is its inherent adaptability, facilitating modifications within the LSTM layers, rendering it apt for a myriad of challenges and data typologies. Integrating the MediaPipe framework with real-time SLR markedly amplifies recognition precision, signifying a pivotal advancement in the discipline.
ISSN:2588-1418
2815-6412