DMformer: a transformer with denoising and multi-modal data fusion for enhancing BEV perception

Abstract Accurate and robust perception in the Bird’s Eye View (BEV) is essential for effective environmental understanding in autonomous driving systems. This study introduces DMFormer, an innovative multi-modal BEV perception framework that employs Transformer architecture and a diffusion denoisin...

Full description

Saved in:
Bibliographic Details
Main Authors: Xuefeng Bao, Feng Liu, Yunli Chen, Yong Li, Rui Tian
Format: Article
Language:English
Published: Springer 2025-07-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-025-01984-9
Tags: Add Tag
No Tags, Be the first to tag this record!