Reinforcement Learning Ramp Metering without Complete Information

This paper develops a model of reinforcement learning ramp metering (RLRM) without complete information, which is applied to alleviate traffic congestions on ramps. RLRM consists of prediction tools depending on traffic flow simulation and optimal choice model based on reinforcement learning theorie...

Full description

Saved in:
Bibliographic Details
Main Authors: Xing-Ju Wang, Xiao-Ming Xi, Gui-Feng Gao
Format: Article
Language:English
Published: Wiley 2012-01-01
Series:Journal of Control Science and Engineering
Online Access:http://dx.doi.org/10.1155/2012/208456
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper develops a model of reinforcement learning ramp metering (RLRM) without complete information, which is applied to alleviate traffic congestions on ramps. RLRM consists of prediction tools depending on traffic flow simulation and optimal choice model based on reinforcement learning theories. Moreover, it is also a dynamic process with abilities of automaticity, memory and performance feedback. Numerical cases are given in this study to demonstrate RLRM such as calculating outflow rate, density, average speed, and travel time compared to no control and fixed-time control. Results indicate that the greater is the inflow, the more is the effect. In addition, the stability of RLRM is better than fixed-time control.
ISSN:1687-5249
1687-5257