A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting

Forecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial i...

Full description

Saved in:
Bibliographic Details
Main Authors: Feng Bian, Wensheng Zhang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11026008/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850161255476625408
author Feng Bian
Wensheng Zhang
author_facet Feng Bian
Wensheng Zhang
author_sort Feng Bian
collection DOAJ
description Forecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial interaction model. These sequential methods hinder communication between the two dimensions, reducing forecasting accuracy. To address this limitation, we propose a novel method called CAM (Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting). CAM independently captures temporal and spatial features and facilitates effective communication between them. Specifically, we utilize graph attention networks to capture spatial features and transformers to capture temporal features. The cross-dimensional attention mechanism enables features encoded in one dimension to query and retrieve relevant information from the other dimension. This mechanism allows features in both dimensions to interact effectively, making pedestrian trajectory forecasting easier. We evaluated our method on two well-known public datasets, ETH and UCY. The results show that our method improves forecasting accuracy, with the average displacement error (ADE) of 0.21 and final displacement error (FDE) of 0.41, improving the ADE and FDE by 16.0% and 6.8%, respectively.
format Article
id doaj-art-ac8914dd742f416b8a354b2b427c76bd
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-ac8914dd742f416b8a354b2b427c76bd2025-08-20T02:22:55ZengIEEEIEEE Access2169-35362025-01-011310029810030610.1109/ACCESS.2025.357689011026008A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory ForecastingFeng Bian0https://orcid.org/0000-0002-5573-5421Wensheng Zhang1https://orcid.org/0000-0002-5080-0714School of Traffic and Transportation, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, ChinaSchool of Traffic and Transportation, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, ChinaForecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial interaction model. These sequential methods hinder communication between the two dimensions, reducing forecasting accuracy. To address this limitation, we propose a novel method called CAM (Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting). CAM independently captures temporal and spatial features and facilitates effective communication between them. Specifically, we utilize graph attention networks to capture spatial features and transformers to capture temporal features. The cross-dimensional attention mechanism enables features encoded in one dimension to query and retrieve relevant information from the other dimension. This mechanism allows features in both dimensions to interact effectively, making pedestrian trajectory forecasting easier. We evaluated our method on two well-known public datasets, ETH and UCY. The results show that our method improves forecasting accuracy, with the average displacement error (ADE) of 0.21 and final displacement error (FDE) of 0.41, improving the ADE and FDE by 16.0% and 6.8%, respectively.https://ieeexplore.ieee.org/document/11026008/Attention mechanismautonomous drivingpedestrian trajectory forecastingtransformer
spellingShingle Feng Bian
Wensheng Zhang
A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
IEEE Access
Attention mechanism
autonomous driving
pedestrian trajectory forecasting
transformer
title A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_full A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_fullStr A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_full_unstemmed A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_short A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_sort cross dimensional attention mechanism for pedestrian trajectory forecasting
topic Attention mechanism
autonomous driving
pedestrian trajectory forecasting
transformer
url https://ieeexplore.ieee.org/document/11026008/
work_keys_str_mv AT fengbian acrossdimensionalattentionmechanismforpedestriantrajectoryforecasting
AT wenshengzhang acrossdimensionalattentionmechanismforpedestriantrajectoryforecasting
AT fengbian crossdimensionalattentionmechanismforpedestriantrajectoryforecasting
AT wenshengzhang crossdimensionalattentionmechanismforpedestriantrajectoryforecasting