Improving Facial Expression Recognition with a Focal Transformer and Partial Feature Masking Augmentation

With the advancement of deep learning (DL) and computer vision (CV) technologies, significant progress has been made in facial expression identification FER for real-world applications. However, FER still faces challenges such as occlusion and head pose variations, which make it difficult for FER mo...

Full description

Saved in:
Bibliographic Details
Main Authors: Liang-Ying Ke, Chia-Yu Liao, Chih-Hsien Hsia
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Engineering Proceedings
Subjects:
Online Access:https://www.mdpi.com/2673-4591/92/1/70
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:With the advancement of deep learning (DL) and computer vision (CV) technologies, significant progress has been made in facial expression identification FER for real-world applications. However, FER still faces challenges such as occlusion and head pose variations, which make it difficult for FER models to maintain stability and accuracy. In this study, we introduced a focal vision transformer (FViT) with partial feature masking (PFM) into FER. This method was found to efficiently simulate the challenges posed by occlusion and head pose variations by introducing PFM data augmentation. Parts of the image were randomly masked while preserving key facial expressions. The proposed FViT showed an accuracy of 89.08% on the real-world affective faces database, which includes scenarios with occlusion and head pose variations. PFM enhanced the model’s performance, too. The developed method effectively addresses the challenges of occlusion and head pose variations in FER.
ISSN:2673-4591