HDRLM3D: A Deep Reinforcement Learning-Based Model with Human-like Perceptron and Policy for Crowd Evacuation in 3D Environments
文献类型:期刊论文
作者 | Zhang, Dong1,2; Li, Wenhang1; Gong, Jianhua1,2,3; Huang, Lin1; Zhang, Guoyong1; Shen, Shen4; Liu, Jiantao5; Ma, Haonan1,2 |
刊名 | ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION
![]() |
出版日期 | 2022-04-01 |
卷号 | 11期号:4页码:20 |
关键词 | crowd simulation agent-based model deep reinforcement learning perceptron policy |
DOI | 10.3390/ijgi11040255 |
通讯作者 | Li, Wenhang(liwh@aircas.ac.cn) |
英文摘要 | At present, a common drawback of crowd simulation models is that they are mainly simulated in (abstract) 2D environments, which limits the simulation of crowd behaviors observed in real 3D environments. Therefore, we propose a deep reinforcement learning-based model with human-like perceptron and policy for crowd evacuation in 3D environments (HDRLM3D). In HDRLM3D, we propose a vision-like ray perceptron (VLRP) and combine it with a redesigned global (or local) perceptron (GOLP) to form a human-like perception model. We propose a double-branch feature extraction and decision network (DBFED-Net) as the policy, which can extract features and make behavioral decisions. Moreover, we validate our method's ability to reproduce typical phenomena and behaviors through experiments in two different scenarios. In scenario I, we reproduce the bottleneck effect of crowds and verify the effectiveness and advantages of HDRLM3D by comparing it with real crowd experiments and classical methods in terms of density maps, fundamental diagrams, and evacuation times. In scenario II, we reproduce agents' navigation and obstacle avoidance behaviors and demonstrate the advantages of HDRLM3D for crowd simulation in unknown 3D environments by comparing it with other deep reinforcement learning-based models in terms of trajectories and numbers of collisions. |
WOS关键词 | BEHAVIOR ; DRIVEN |
资助项目 | National Natural Science Foundation of China[42171113] ; National Natural Science Foundation of China[41971361] ; National Key Technology R&D Program of China[2020YFC0833103] ; Pilot Fund of Frontier Science and Disruptive Technology of Aerospace Information Research Institute, Chinese Academy of Sciences[E0Z21101] |
WOS研究方向 | Computer Science ; Physical Geography ; Remote Sensing |
语种 | 英语 |
WOS记录号 | WOS:000787951100001 |
出版者 | MDPI |
资助机构 | National Natural Science Foundation of China ; National Key Technology R&D Program of China ; Pilot Fund of Frontier Science and Disruptive Technology of Aerospace Information Research Institute, Chinese Academy of Sciences |
源URL | [http://ir.igsnrr.ac.cn/handle/311030/175162] ![]() |
专题 | 中国科学院地理科学与资源研究所 |
通讯作者 | Li, Wenhang |
作者单位 | 1.Chinese Acad Sci, Natl Engn Res Ctr Geoinformat, Aerosp Informat Res Inst, Beijing 100094, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.Zhejiang CAS Applicat Ctr Geoinformat, Jiaxing 314199, Peoples R China 4.Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, CAS, Beijing 100101, Peoples R China 5.Shandong Jianzhu Univ, Sch Surveying & Geoinformat, Jinan 250101, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Dong,Li, Wenhang,Gong, Jianhua,et al. HDRLM3D: A Deep Reinforcement Learning-Based Model with Human-like Perceptron and Policy for Crowd Evacuation in 3D Environments[J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION,2022,11(4):20. |
APA | Zhang, Dong.,Li, Wenhang.,Gong, Jianhua.,Huang, Lin.,Zhang, Guoyong.,...&Ma, Haonan.(2022).HDRLM3D: A Deep Reinforcement Learning-Based Model with Human-like Perceptron and Policy for Crowd Evacuation in 3D Environments.ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION,11(4),20. |
MLA | Zhang, Dong,et al."HDRLM3D: A Deep Reinforcement Learning-Based Model with Human-like Perceptron and Policy for Crowd Evacuation in 3D Environments".ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 11.4(2022):20. |
入库方式: OAI收割
来源:地理科学与资源研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。