Leveraging Multimodal Large Language Models (MLLMs) for Enhanced Object Detection and Scene Understanding in Thermal Images for Autonomous Driving Systems
The integration of thermal imaging data with multimodal large language models (MLLMs) offers promising advancements for enhancing the safety and functionality of autonomous driving systems (ADS) and intelligent transportation systems (ITS). This study investigates the potential of MLLMs, specificall...
Saved in:
Main Authors: | Huthaifa I. Ashqar, Taqwa I. Alhadidi, Mohammed Elhenawy, Nour O. Khanfar |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-10-01
|
Series: | Automation |
Subjects: | |
Online Access: | https://www.mdpi.com/2673-4052/5/4/29 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
A Survey on MLLMs in Education: Application and Future Directions
by: Weicheng Xing, et al.
Published: (2024-12-01) -
EdgeNet: An End-to-End Deep Neural Network Pretrained with Synthetic Data for a Real-World Autonomous Driving Application
by: Leanne Miller, et al.
Published: (2024-12-01) -
GPT-4 enhanced multimodal grounding for autonomous driving: Leveraging cross-modal attention with large language models
by: Haicheng Liao, et al.
Published: (2024-12-01) -
Driving-Related Cognitive Abilities Prediction Based on Transformer’s Multimodal Fusion Framework
by: Yifan Li, et al.
Published: (2024-12-01) -
Leveraging Deep Learning and Multimodal Large Language Models for Near-Miss Detection Using Crowdsourced Videos
by: Shadi Jaradat, et al.
Published: (2025-01-01)