Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact
文献类型:期刊论文
作者 | Zhang Tiandong2,4![]() ![]() ![]() ![]() ![]() |
刊名 | IEEE Transactions on Instrumentation and Measurement
![]() |
出版日期 | 2023 |
页码 | 1-13 |
DOI | 10.1109/TIM.2023.3282297 |
英文摘要 | This paper aims to investigate the motion control method of a bionic underwater exploration robot (RoboDact). The robot is equipped with a double-joint tail fin and two undulating pectoral fins to obtain good mobility and stability. The hybrid propulsion mode helps perform stable and effective underwater exploration and measurement. To coordinate these two kinds of bionic propulsion fins and address the challenge of measurement noises and external disturbances during underwater exploration, a novel residual reinforcement learning method with parameter randomization (PR-RRL) is proposed. The control strategy is a weighted superposition of a feedback controller and a residual controller. The observation feedback controller based on active disturbance rejection control (ADRC) is adapted to improve stability and convergence. And the residual controller based on the soft actor-critic (SAC) algorithm is adapted to improve adaptability to uncertainties and disturbances. Moreover, the parameter randomization training strategy is proposed for adapting natural complicated scenarios by randomizing the partial dynamics of the underwater exploration robot during the training phase. Finally, the feasibility and efficacy of the presented motion control method are validated by comprehensive simulation tests and RoboDact prototype physical experiments. |
URL标识 | 查看原文 |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/51988] ![]() |
专题 | 智能机器人系统研究 |
通讯作者 | Wang Rui |
作者单位 | 1.Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences 2.State Key Laboratory of MultimodalArtificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences 3.Centrale Lille, CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille 4.School of Artificial Intelligence, University of Chinese Academy of Sciences |
推荐引用方式 GB/T 7714 | Zhang Tiandong,Wang Rui,Wang Shuo,et al. Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact[J]. IEEE Transactions on Instrumentation and Measurement,2023:1-13. |
APA | Zhang Tiandong,Wang Rui,Wang Shuo,Wang Yu,Zheng Gang,&Tan Min.(2023).Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact.IEEE Transactions on Instrumentation and Measurement,1-13. |
MLA | Zhang Tiandong,et al."Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact".IEEE Transactions on Instrumentation and Measurement (2023):1-13. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。