Applying auxiliary supervised depth-assisted transformer and cross modal attention fusion in monocular 3D object detection

Monocular 3D object detection is the most widely applied and challenging solution for autonomous driving, due to 2D images lacking 3D information. Existing methods are limited by inaccurate depth estimations by inequivalent supervised targets. The use of both depth and visual features also faces pro...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhijian Wang, Jie Liu, Yixiao Sun, Xiang Zhou, Boyan Sun, Dehong Kong, Jay Xu, Xiaoping Yue, Wenyu Zhang
Format: Article
Language:English
Published: PeerJ Inc. 2025-01-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-2656.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!