Smartphone video-based early diagnosis of blepharospasm using dual cross-attention modeling enhanced by facial pose estimation

Abstract Blepharospasm is a focal dystonia characterized by involuntary eyelid contractions that impair vision and social function. The subtle clinical signs of blepharospasm make early and accurate diagnosis difficult, delaying timely intervention. In this study, we propose a dual cross-attention d...

Full description

Saved in:
Bibliographic Details
Main Authors: Shenyu Huang, Boyuan Yang, Xiaoling Huang, Huina Zhang, Dong Luo, Guanchao Tong, Yijie Wang, Yongqing Shao, Menglu Chen, Qi Gao, Juan Ye
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:npj Digital Medicine
Online Access:https://doi.org/10.1038/s41746-025-01904-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Blepharospasm is a focal dystonia characterized by involuntary eyelid contractions that impair vision and social function. The subtle clinical signs of blepharospasm make early and accurate diagnosis difficult, delaying timely intervention. In this study, we propose a dual cross-attention deep learning framework that integrates temporal video features and facial landmark dynamics to assess blepharospasm severity, frequency, and diagnosis from smartphone-recorded facial videos. A retrospective dataset of 847 patient videos collected from two hospitals (2016–2023) was used for model development. The model achieved high accuracy for severity (0.828) and frequency (0.82), and moderate performance for diagnosis (0.674).SHAP analysis identified case-specific video fragments contributing to predictions, enhancing interpretability. In a prospective evaluation on an independent dataset (N = 179), AI assistance improved junior ophthalmologist’s diagnostic accuracy by up to 18.5%. These findings demonstrate the potential of an explainable, smartphone-compatible video model to support early detection and assessment of blepharospasm.
ISSN:2398-6352