Camscribe: Enhanced Dashcam Video Descriptions Through Multimodal Spatiotemporal and Object Detection for Autonomous Vehicles

Generating accurate and coherent video descriptions requires a comprehensive understanding of multiple visual cues. Conventional video description models have relied predominantly on RGB and optical flow information, but these approaches face fundamental accuracy constraints, undersco...

Bibliographic Details
Main Authors: Muhammad Rafiq, Mankyu Sung, Ghazala Rafiq, Gyu Sang Choi
Format: Article
Language: English
Published: IEEE 2025-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/11006041/