DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning
文献类型:会议论文
作者 | Jinhao Xi1,3![]() ![]() ![]() ![]() |
出版日期 | 2021-10-25 |
会议日期 | 19-22 September 2021 |
会议地点 | Indianapolis, IN, USA |
英文摘要 | Online Ride-hailing System improves the efficiency of vehicle utilization and the urban transportation. However, the imbalance between supply and demand is still a problem. To solve this problem and improve resource utilization efficiency, a Decentralized Deep Reinforcement Learning Method (DDRL) for vehicle repositioning is proposed. In DDRL, each vehicle is modeled as an independent agent and dispatched according to its own state to rebalance its local supply and demand. Thus, the global rebalance problem is divided into many small local rebalance problems. First, a new reward evaluation method is proposed and the long-term global reward in traditional reinforcement learning is transformed into many short-term local rewards. Second, a unified algorithm is designed by learning all the decentralized agents' sample data. Finally, the weight matrix of the state is introduced to magnify the differences between the states of adjacent vehicles. Experiments are carried out and the effectiveness of DDRL is verified. |
会议录出版者 | IEEE |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/52124] ![]() |
专题 | 自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队 |
通讯作者 | Fenghua Zhu |
作者单位 | 1.The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190 China 2.iFLYTEK CO. LTD, Hefei 230088, China 3.The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049 China |
推荐引用方式 GB/T 7714 | Jinhao Xi,Fenghua Zhu,Yuanyuan Chen,et al. DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning[C]. 见:. Indianapolis, IN, USA. 19-22 September 2021. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。