中国科学院机构知识库网格系统: Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

文献类型：期刊论文


作者	Zizhang Qiu; Shouguang Wang; Dan You; MengChu Zhou
刊名	IEEE/CAA Journal of Automatica Sinica
出版日期	2024
卷号	11 期号:10 页码:2111-2122
关键词	Contract Bridge reinforcement learning search
ISSN号	2329-9266
DOI	10.1109/JAS.2024.124488
英文摘要	Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.
源URL	[http://ir.ia.ac.cn/handle/173211/58841]
专题	自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica
推荐引用方式 GB/T 7714	Zizhang Qiu,Shouguang Wang,Dan You,et al. Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search[J]. IEEE/CAA Journal of Automatica Sinica,2024,11(10):2111-2122.
APA	Zizhang Qiu,Shouguang Wang,Dan You,&MengChu Zhou.(2024).Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search.IEEE/CAA Journal of Automatica Sinica,11(10),2111-2122.
MLA	Zizhang Qiu,et al."Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search".IEEE/CAA Journal of Automatica Sinica 11.10(2024):2111-2122.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。