A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting

Forecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial i...

Full description

Saved in:

Bibliographic Details
Main Authors:	Feng Bian, Wensheng Zhang
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Attention mechanism autonomous driving pedestrian trajectory forecasting transformer
Online Access:	https://ieeexplore.ieee.org/document/11026008/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850161255476625408
author	Feng Bian Wensheng Zhang
author_facet	Feng Bian Wensheng Zhang
author_sort	Feng Bian
collection	DOAJ
description	Forecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial interaction model. These sequential methods hinder communication between the two dimensions, reducing forecasting accuracy. To address this limitation, we propose a novel method called CAM (Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting). CAM independently captures temporal and spatial features and facilitates effective communication between them. Specifically, we utilize graph attention networks to capture spatial features and transformers to capture temporal features. The cross-dimensional attention mechanism enables features encoded in one dimension to query and retrieve relevant information from the other dimension. This mechanism allows features in both dimensions to interact effectively, making pedestrian trajectory forecasting easier. We evaluated our method on two well-known public datasets, ETH and UCY. The results show that our method improves forecasting accuracy, with the average displacement error (ADE) of 0.21 and final displacement error (FDE) of 0.41, improving the ADE and FDE by 16.0% and 6.8%, respectively.
format	Article
id	doaj-art-ac8914dd742f416b8a354b2b427c76bd
institution	OA Journals
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-ac8914dd742f416b8a354b2b427c76bd2025-08-20T02:22:55ZengIEEEIEEE Access2169-35362025-01-011310029810030610.1109/ACCESS.2025.357689011026008A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory ForecastingFeng Bian0https://orcid.org/0000-0002-5573-5421Wensheng Zhang1https://orcid.org/0000-0002-5080-0714School of Traffic and Transportation, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, ChinaSchool of Traffic and Transportation, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, ChinaForecasting pedestrian trajectories is crucial for autonomous driving systems but remains challenging due to complex spatial and temporal interactions. Most existing methods model these interactions separately; for example, they capture temporal features and then pass this information to a spatial interaction model. These sequential methods hinder communication between the two dimensions, reducing forecasting accuracy. To address this limitation, we propose a novel method called CAM (Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting). CAM independently captures temporal and spatial features and facilitates effective communication between them. Specifically, we utilize graph attention networks to capture spatial features and transformers to capture temporal features. The cross-dimensional attention mechanism enables features encoded in one dimension to query and retrieve relevant information from the other dimension. This mechanism allows features in both dimensions to interact effectively, making pedestrian trajectory forecasting easier. We evaluated our method on two well-known public datasets, ETH and UCY. The results show that our method improves forecasting accuracy, with the average displacement error (ADE) of 0.21 and final displacement error (FDE) of 0.41, improving the ADE and FDE by 16.0% and 6.8%, respectively.https://ieeexplore.ieee.org/document/11026008/Attention mechanismautonomous drivingpedestrian trajectory forecastingtransformer
spellingShingle	Feng Bian Wensheng Zhang A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting IEEE Access Attention mechanism autonomous driving pedestrian trajectory forecasting transformer
title	A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_full	A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_fullStr	A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_full_unstemmed	A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_short	A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting
title_sort	cross dimensional attention mechanism for pedestrian trajectory forecasting
topic	Attention mechanism autonomous driving pedestrian trajectory forecasting transformer
url	https://ieeexplore.ieee.org/document/11026008/
work_keys_str_mv	AT fengbian acrossdimensionalattentionmechanismforpedestriantrajectoryforecasting AT wenshengzhang acrossdimensionalattentionmechanismforpedestriantrajectoryforecasting AT fengbian crossdimensionalattentionmechanismforpedestriantrajectoryforecasting AT wenshengzhang crossdimensionalattentionmechanismforpedestriantrajectoryforecasting

A Cross-Dimensional Attention Mechanism for Pedestrian Trajectory Forecasting

Similar Items