Video-Based Facial Emotion Recognition using YOLO and Vision Transformer

This paper presents a video-based facial emotion recognition (FER) approach that combines the YOLOv8 model for face detection with a pre-trained Vision Transformer (ViT) for emotion classification. Our methodology involves extracting the middle frame from each video in the RAVDESS dataset, which is then used for face detection using...

Bibliographic Details
Main Authors: Sareen Vidhi, Seeja K.R.
Format: Article
Language: English
Published: EDP Sciences 2025-01-01
Series: EPJ Web of Conferences
Online Access: https://www.epj-conferences.org/articles/epjconf/pdf/2025/13/epjconf_icetsf2025_01040.pdf