Dual-Priority Delayed Deep Double Q-Network (DPD3QN): A Dueling Double Deep Q-Network with Dual-Priority Experience Replay for Autonomous Driving Behavior Decision-Making

The behavior decision control of autonomous vehicles is a critical aspect of advancing autonomous driving technology. However, current behavior decision algorithms based on deep reinforcement learning still face several challenges, such as insufficient safety and sparse reward mechanisms. To solve t...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuai Li, Peicheng Shi, Aixi Yang, Heng Qi, Xinlong Dong
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/18/5/291
Tags: Add Tag
No Tags, Be the first to tag this record!