Construction of Go Algorithm Based on Modern Heuristics and Playout
Project/Area Number | 16K00510 |
Research Category | Grant-in-Aid for Scientific Research (C) |
Allocation Type | Multi-year Fund |
Section | General |
Research Field | Entertainment and game informatics |
Research Institution | Aichi Institute of Technology |
Principal Investigator | ITOH Masaru, Aichi Institute of Technology, Faculty of Information Science, Professor (80221026) |
Research Collaborators | ITOH Arito / KAWAI Makoto / YAMAKAWA Yushi |
Project Period (FY) | 2016-04-01 – 2019-03-31 |
Project Status | Completed (Fiscal Year 2018) |
Budget Amount | ¥3,380,000 (Direct Cost: ¥2,600,000, Indirect Cost: ¥780,000)
Fiscal Year 2018: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2017: ¥780,000 (Direct Cost: ¥600,000, Indirect Cost: ¥180,000)
Fiscal Year 2016: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
|
Keywords | Go algorithm / Playout / Modern heuristics / Neural network / Deep learning / Monte Carlo tree search / Action value / Heuristic / Convolutional neural network / Go Text Protocol |
Outline of Final Research Achievements |
The current mainstream algorithm in computer Go is based on Monte Carlo tree search (MCTS). This search method generates a large number of playouts to determine the best next move, but those playouts are rarely reused afterward. I therefore proposed a method that obtains the best next move by building a game tree from the history of past playouts. It was confirmed that the proposed method can be applied to small problems such as tsume-go (life-and-death problems). I also proposed another algorithm based on deep learning and playouts. The deep learning was realized with multi-layer convolutional neural networks; the neural network is one of the representative methods of modern heuristics. In this Value-MCTS, the search does not use the fast-rollout win/loss result proposed by AlphaGo; that part is instead replaced by playouts. For node evaluation, the action value proposed by AlphaGo was adopted rather than the UCB1 value. It was shown statistically that, when the parameter values are set correctly, the proposed method is superior to existing Go software.
|
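The node-evaluation change described above can be illustrated with a short sketch. This is a minimal, hypothetical implementation of the AlphaGo-style selection rule (mean action value Q plus a prior-weighted exploration bonus, often called PUCT) that the summary says was adopted in place of UCB1; the class and constant names, and the value of `C_PUCT`, are assumptions for illustration, not the project's actual code.

```python
import math

C_PUCT = 1.5  # exploration constant (assumed value)

class Node:
    """A tree node holding AlphaGo-style statistics for one move."""
    def __init__(self, prior):
        self.prior = prior      # P(s,a): prior probability from a policy network
        self.visits = 0         # N(s,a): visit count
        self.value_sum = 0.0    # W(s,a): accumulated playout results
        self.children = {}      # move -> Node

    def q(self):
        # Mean action value Q(s,a) = W(s,a) / N(s,a)
        return self.value_sum / self.visits if self.visits else 0.0

def select_child(node):
    """Select the child maximizing Q(s,a) + U(s,a), instead of UCB1."""
    total_visits = sum(c.visits for c in node.children.values())

    def score(child):
        # U(s,a) = c_puct * P(s,a) * sqrt(sum_b N(s,b)) / (1 + N(s,a))
        u = C_PUCT * child.prior * math.sqrt(total_visits) / (1 + child.visits)
        return child.q() + u

    return max(node.children.items(), key=lambda kv: score(kv[1]))
```

In this scheme the prior steers early exploration, while the accumulated playout statistics dominate as visit counts grow; UCB1, by contrast, uses only visit counts and mean rewards.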
Academic Significance and Societal Importance of the Research Achievements |
The method of building a game tree from past playout history to find the best move worked effectively in narrow search spaces such as tsume-go. On the 9x9 board, however, it was completely ineffective. Although the idea is interesting, it turned out that it cannot become a method equal or superior to Monte Carlo tree search. The other Go algorithm, based on deep learning and playouts, was successfully run in a low-resource environment, and it was confirmed that it can beat existing open-source Go software with statistical significance. The various parameter values required by Value-MCTS, such as the node-expansion threshold and the mixing parameter, must be set appropriately, and tuning them is not easy. Reducing the execution time also remains a major challenge.
|
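The mixing parameter mentioned above plays the same role as AlphaGo's leaf-evaluation weight: it blends a value-network estimate with the outcome of a simulation. A minimal sketch, assuming the project mixes the value-network output with a playout result (its substitute for AlphaGo's fast rollout); the function name and the default weight of 0.5 are illustrative assumptions.

```python
def evaluate_leaf(value_net_output, playout_result, lam=0.5):
    """Mixed leaf evaluation in the AlphaGo style:
        V(s) = (1 - lam) * v(s) + lam * z
    where v(s) is the value-network estimate, z is the playout outcome
    (substituted here for AlphaGo's fast-rollout result), and lam is the
    mixing parameter. lam = 0.5 is an assumed default, not a tuned value.
    """
    return (1.0 - lam) * value_net_output + lam * playout_result
```

With lam = 0 the search trusts the value network alone; with lam = 1 it reduces to pure playout evaluation, which is why this parameter (together with the node-expansion threshold) must be tuned carefully.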
Report (4 results)
Research Products (10 results)