Multimodal Learning for Traffic Risk Prediction: Combining Aerial Imagery With Contextual Data

Accurately predicting traffic risks at urban intersections is essential for improving road safety. While traditional models use data sources like road traffic conditions, geometry, and signals, they often miss the spatial interactions between road networks and buildings. This study introduces a mult...

Full description

Saved in:
Bibliographic Details
Main Authors: Hanlin Tian, Yuxiang Feng, Mohammed Quddus, Yiannis Demiris, Panagiotis Angeloudis
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Open Journal of Intelligent Transportation Systems
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11017746/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurately predicting traffic risks at urban intersections is essential for improving road safety. While traditional models use data sources like road traffic conditions, geometry, and signals, they often miss the spatial interactions between road networks and buildings. This study introduces a multimodal deep learning framework that integrates aerial imagery, building footprint data, and traffic flow information to improve traffic risk prediction and better capture these complex relationships. By leveraging datasets from OpenStreetMap, the U.K. Traffic Count, and high-resolution aerial imagery, our approach creates a comprehensive representation of the urban environment, capturing intricate spatial relationships between road networks, surrounding structures, and traffic conditions. Using DeepLabV3+, UNet++, and SegFormer as baseline models, we demonstrate that combining building and traffic data enhances prediction accuracy compared to models relying solely on visual data. Our results show that the DeepLabV3+ model, when incorporating both building and traffic data, achieves the highest Intersection over Union (IoU) score of 0.4052 and the lowest Root Mean Square Error (RMSE) of 0.0907. These findings underscore the effectiveness of a multimodal approach in traffic risk assessment, offering a more precise tool for urban planning and traffic management interventions. The code and data used in this study are available at <uri>https://github.com/zachtian/Multimodal-Learning-for-Traffic-Risk-Prediction</uri>.
ISSN:2687-7813