作者:
基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Teaching computer programs to play games through machine learning has been an important way to achieve better artificial intelligence (AI) in a variety of real-world applications. Monte Carlo Tree Search (MCTS) is one of the key AI techniques developed recently that enabled AlphaGo to defeat a legendary professional Go player. What makes MCTS particularly attractive is that it only understands the basic rules of the game and does not rely on expert-level knowledge. Researchers thus expect that MCTS can be applied to other complex AI problems where domain-specific expert-level knowledge is not yet available. So far there are very few analytic studies in the literature. In this paper, our goal is to develop analytic studies of MCTS to build a more fundamental understanding of the algorithms and their applicability in complex AI problems. We start with a simple version of MCTS, called random playout search (RPS), to play Tic-Tac-Toe, and find that RPS may fail to discover the correct moves even in a very simple game position of Tic-Tac-Toe. Both the probability analysis and simulation have confirmed our discovery. We continue our studies with the full version of MCTS to play Gomoku and find that while MCTS has shown great success in playing more sophisticated games like Go, it is not effective to address the problem of sudden death/win. The main reason that MCTS often fails to detect sudden death/win lies in the random playout search nature of MCTS, which leads to prediction distortion. Therefore, although MCTS in theory converges to the optimal minimax search, with real world computational resource constraints, MCTS has to rely on RPS as an important step in its search process, therefore suffering from the same fundamental prediction distortion problem as RPS does. By examining the detailed statistics of the scores in MCTS, we investigate a variety of scenarios where MCTS fails to detect sudden death/win. Finally, we propose an improved MCTS algorithm by incorporating minimax search to overcome prediction distortio
推荐文章
Monte-Carlo统计迭代图像重建算法
层析γ扫描
线性衰减系数
Monte-Carlo方法
图像重建
迭代法
Monte Carlo方法在气体动理论中的应用
Monte Carlo方法
气体动理论
麦克斯韦速率分布
改进Monte Carlo算法用于RFID标签的室内定位
通信技术
无线定位
Monte Carlo定位算法
RFID室内定位系统
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Prediction Distortion in Monte Carlo Tree Search and an Improved Algorithm
来源期刊 智能学习系统与应用(英文) 学科 医学
关键词 MONTE Carlo TREE SEARCH MINIMAX SEARCH Board GAMES Artificial
年,卷(期) 2018,(2) 所属期刊栏目
研究方向 页码范围 46-79
页数 34页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2018(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
MONTE
Carlo
TREE
SEARCH
MINIMAX
SEARCH
Board
GAMES
Artificial
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
智能学习系统与应用(英文)
季刊
2150-8402
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
166
总下载数(次)
0
总被引数(次)
0
论文1v1指导