中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning

文献类型:会议论文

作者Jinhao Xi1,3; Fenghua Zhu1; Yuanyuan Chen1; Yisheng Lv1; Chang Tan2; Feiyue Wang1
出版日期2021-10-25
会议日期19-22 September 2021
会议地点Indianapolis, IN, USA
英文摘要

Online Ride-hailing System improves the efficiency of vehicle utilization and the urban transportation. However, the imbalance between supply and demand is still a problem. To solve this problem and improve resource utilization efficiency, a Decentralized Deep Reinforcement Learning Method (DDRL) for vehicle repositioning is proposed. In DDRL, each vehicle is modeled as an independent agent and dispatched according to its own state to rebalance its local supply and demand. Thus, the global rebalance problem is divided into many small local rebalance problems. First, a new reward evaluation method is proposed and the long-term global reward in traditional reinforcement learning is transformed into many short-term local rewards. Second, a unified algorithm is designed by learning all the decentralized agents' sample data. Finally, the weight matrix of the state is introduced to magnify the differences between the states of adjacent vehicles. Experiments are carried out and the effectiveness of DDRL is verified.

会议录出版者IEEE
语种英语
源URL[http://ir.ia.ac.cn/handle/173211/52124]  
专题自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队
通讯作者Fenghua Zhu
作者单位1.The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190 China
2.iFLYTEK CO. LTD, Hefei 230088, China
3.The School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049 China
推荐引用方式
GB/T 7714
Jinhao Xi,Fenghua Zhu,Yuanyuan Chen,et al. DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning[C]. 见:. Indianapolis, IN, USA. 19-22 September 2021.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。