中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

文献类型:期刊论文

作者Zizhang Qiu; Shouguang Wang; Dan You; MengChu Zhou
刊名IEEE/CAA Journal of Automatica Sinica
出版日期2024
卷号11期号:10页码:2111-2122
关键词Contract Bridge reinforcement learning search
ISSN号2329-9266
DOI10.1109/JAS.2024.124488
英文摘要Contract Bridge, a four-player imperfect information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding presents a challenging aspect due to the need for information exchange with partners and interference with communication of opponents. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge software that has won multiple world championships, by a performance of 0.98 IMPs (international match points) per deal over 10 000 deals, with a much cost-effective approach. The performance significantly surpasses previous state-of-the-art (0.85 IMPs per deal). Note 0.1 IMPs per deal is a significant improvement in Bridge bidding.
源URL[http://ir.ia.ac.cn/handle/173211/58841]  
专题自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica
推荐引用方式
GB/T 7714
Zizhang Qiu,Shouguang Wang,Dan You,et al. Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search[J]. IEEE/CAA Journal of Automatica Sinica,2024,11(10):2111-2122.
APA Zizhang Qiu,Shouguang Wang,Dan You,&MengChu Zhou.(2024).Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search.IEEE/CAA Journal of Automatica Sinica,11(10),2111-2122.
MLA Zizhang Qiu,et al."Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search".IEEE/CAA Journal of Automatica Sinica 11.10(2024):2111-2122.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。