Leveraging Multimodal Large Language Models (MLLMs) for Enhanced Object Detection and Scene Understanding in Thermal Images for Autonomous Driving Systems

The integration of thermal imaging data with multimodal large language models (MLLMs) offers promising advancements for enhancing the safety and functionality of autonomous driving systems (ADS) and intelligent transportation systems (ITS). This study investigates the potential of MLLMs, specificall...

Full description

Saved in:
Bibliographic Details
Main Authors: Huthaifa I. Ashqar, Taqwa I. Alhadidi, Mohammed Elhenawy, Nour O. Khanfar
Format: Article
Language:English
Published: MDPI AG 2024-10-01
Series:Automation
Subjects:
Online Access:https://www.mdpi.com/2673-4052/5/4/29
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items