A Self-Acquiring Knowledge Process for MCTS

Fabbri Andre<sup>*</sup>; Armetta Frederic<sup>*</sup>; Duchene Eric<sup>*</sup>; Hassas Salima<sup>*</sup>

doi:10.1142/S0218213016600071

Abstract

MCTS (Monte Carlo Tree Search) is a well-known and efficient process for covering and evaluating a large range of states in combinatorial problems. We choose to study MCTS for the Computer Go problem, which is one of the most challenging problems in the field of Artificial Intelligence. For this game, a combinatorial approach alone does not always lead to a reliable evaluation of the game states. To enhance the ability of MCTS to tackle such problems, one can use game-specific knowledge to increase the accuracy of the game state evaluation. Such knowledge is not easy to acquire: it is the result of a constructivist learning mechanism based on the experience of the player. We therefore explore the idea of endowing MCTS with a process inspired by constructivist learning, so that it self-acquires knowledge from playing experience. In this paper, we propose a complementary process for MCTS called BHRF (Background History Reply Forest), which memorizes efficient patterns in order to promote their use during the MCTS process. Our experiments yield promising results and underline how self-acquired data can be useful for MCTS-based algorithms.

Publication date: 2016-02
