Camscribe: Enhanced Dashcam Video Descriptions Through Multimodal Spatiotemporal and Object Detection for Autonomous Vehicles
The generation of accurate and coherent video descriptions necessitates comprehensive understanding of multiple visual cues. While conventional video description models have predominantly relied on RGB and optical flow information, yet these approaches face fundamental accuracy constraints, undersco...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11006041/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|