A Problem Case for UCT

Browne Cameron<sup>*</sup>

doi:10.1109/TCIAIG.2012.2220138

Source: IEEE Transactions on Computational Intelligence and AI in Games, 2013, 5(1): 69-74.


Abstract

This paper examines a simple 5×5 Hex position that not only completely defeats flat Monte Carlo search, but also initially defeats plain upper confidence bounds for trees (UCT) search until an excessive number of iterations are performed. The inclusion of domain knowledge during playouts significantly improves UCT performance, but a slight negative effect is shown for the rapid action value estimate (RAVE) heuristic under some circumstances. This example was drawn from an actual game during standard play, and highlights the dangers of relying on flat Monte Carlo and unenhanced UCT search even for rough estimates. A brief comparison is made with RAVE failure in Go.
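For readers unfamiliar with the two searches the abstract contrasts, the following is a minimal sketch of how their move choices can differ. It assumes the standard UCB1 formula commonly used for UCT selection and is not taken from the paper; the function names, the `(total_reward, visits)` statistics layout, and the exploration constant `c` are all illustrative assumptions.

```python
import math

def ucb1(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """UCB1 value of a child: mean reward plus an exploration bonus.

    The constant c is an assumed default, not a value from the paper.
    """
    if visits == 0:
        return math.inf  # unvisited children are tried first
    mean = total_reward / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(stats, c=math.sqrt(2)):
    """UCT-style selection: index of the child with the highest UCB1 value.

    stats is a list of (total_reward, visits) pairs, one per child.
    """
    parent_visits = sum(v for _, v in stats)
    return max(
        range(len(stats)),
        key=lambda i: ucb1(stats[i][0], stats[i][1], parent_visits, c),
    )

def flat_choice(stats):
    """Flat Monte Carlo choice: index of the child with the best mean reward."""
    return max(
        range(len(stats)),
        key=lambda i: stats[i][0] / stats[i][1] if stats[i][1] else 0.0,
    )
```

With hypothetical statistics such as `[(6, 10), (1, 2)]`, `flat_choice` prefers the first child on its higher mean, while `select_child` prefers the second because its small visit count earns a larger exploration bonus. Flat Monte Carlo has no such mechanism for revisiting a move whose early playouts were misleading.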

Publication date: 2013-03
