Learning Causal Dynamics Models in Object-Oriented Environments
文献类型:会议论文
作者 | Yu ZY(余忠蔚)![]() ![]() |
出版日期 | 2024-05 |
会议日期 | 2024-07-21 |
会议地点 | 奥地利, 维也纳 |
关键词 | 强化学习 因果模型 |
英文摘要 | Causal dynamics models (CDMs) have demonstrated significant potential in addressing various challenges in reinforcement learning. To learn CDMs, recent studies have performed causal discovery to capture the causal dependencies among environmental variables. However, the learning of CDMs is still confined to small-scale environments due to computational complexity and sample efficiency constraints. This paper aims to extend CDMs to large-scale object-oriented environments, which consist of a multitude of objects classified into different categories. We introduce the Object-Oriented CDM (OOCDM) that shares causalities and parameters among objects belonging to the same class. Furthermore, we propose a learning method for OOCDM that enables it to adapt to a varying number of objects. Experiments on large-scale tasks indicate that OOCDM outperforms existing CDMs in terms of causal discovery, prediction accuracy, generalization, and computational efficiency. |
会议录 | Proceedings of the 41st International Conference on Machine Learning
![]() |
源URL | [http://ir.ia.ac.cn/handle/173211/56584] ![]() |
专题 | 多模态人工智能系统全国重点实验室 |
通讯作者 | Xing DP(邢登鹏) |
作者单位 | 中国科学院自动化研究所 |
推荐引用方式 GB/T 7714 | Yu ZY,Ruan JQ,Xing DP. Learning Causal Dynamics Models in Object-Oriented Environments[C]. 见:. 奥地利, 维也纳. 2024-07-21. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。