• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

AlphaZero toward Theoretical Values and Optimal Plays of Perfect Information Games

Research Project

Project/Area Number 20K19946
Research Category

Grant-in-Aid for Early-Career Scientists

Allocation TypeMulti-year Fund
Review Section Basic Section 62040:Entertainment and game informatics-related
Research InstitutionJapan Advanced Institute of Science and Technology

Principal Investigator

HSUEH Chu Hsuan  北陸先端科学技術大学院大学, 先端科学技術研究科, 助教 (30847497)

Project Period (FY) 2020-04-01 – 2022-03-31
Project Status Completed (Fiscal Year 2021)
Budget Amount *help
¥2,600,000 (Direct Cost: ¥2,000,000、Indirect Cost: ¥600,000)
Fiscal Year 2021: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
Fiscal Year 2020: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
KeywordsAlphaZero / ゲームの解析 / 最適戦略 / 理論値 / Tabular / ニューラルネットワーク / 完全情報ゲーム / 確率的なゲーム / Chinese dark chess / EinStein wurfelt nicht! / NoGo / 強化学習
Outline of Research at the Start

AlphaZero はゲームルールのみを知識として用い,ゼロからプロ棋士を上回る強さを持つことができる.しかし,最適戦略や理論値を学習できるのかどうかについては十分調べられていない.さらに,不確定要素を含むゲームへの適用も殆どない.本研究の目的は,AlpahZero 枠組みを用いて,完全情報ゲーム(決定的・確率的)の理論値と最適戦略を学習できるアルゴリズムにすることである.まずは規模が小さくて完全解析が可能なゲームを対象として研究を行い,それから完全解析が困難なゲームへの適用と分析を予定する.さまざまなゲームごとの特性と特徴抽出モデルが学習に与える影響を明らかにし,改善することを目的とする.

Outline of Final Research Achievements

AlphaZero outperformed professionals by learning from scratch based on self-play games, which only needed to know game rules. However, it is unclear whether AlphaZero can learn the optimal policies or theoretical values. In addition, there are only a few applications to games involving uncertainty.
This research targeted games on small scales at first, where each position’s optimal policy and theoretical value can be obtained. The results showed that the learning of AlphaZero under many settings could converge to the optimal policies or theoretical values. In addition, for a game on a larger scale and involving uncertainty, it was also confirmed that the program based on AlphaZero was strong enough to obtain the silver medal in a tournament.

Academic Significance and Societal Importance of the Research Achievements

AlphaZero のパラメータを丁寧に調べ,学習結果への影響を明らかにしたことは学術的意義があった.AlphaZero を適用する研究者には,パラメータに関する試行錯誤のコストが減ることを期待する.また,サイコロを振るような不確定要素を含むゲームにおいても,AlphaZero の適用に成功したことの示しに貢献した.
さらに,AlphaZero で学習した戦略と局面評価の質がいいことを示したことで,それらの戦略や局面評価の参考価値をより深めた.人間プレイヤ(特に強いプレイヤ)の上達に利用できることを考える.利用価値を深めたことは学術的意義にも社会的意義にも貢献したと考える.

Report

(4 results)
  • 2021 Annual Research Report   Final Research Report ( PDF )
  • 2020 Research-status Report
  • Products Report
  • Research Products

    (6 results)

All 2025 2023 2022 2020 Other

All Int'l Joint Research (2 results) Journal Article (2 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 2 results,  Open Access: 1 results) Presentation (2 results) (of which Int'l Joint Research: 2 results)

  • [Int'l Joint Research] 国立陽明交通大学/中央研究院/国立台北大学(その他の国・地域 台湾)

    • Related Report
      2021 Annual Research Report
  • [Int'l Joint Research] 国立交通大学/中央研究院/国立台北大学(その他の国・地域 台湾)

    • Related Report
      2020 Research-status Report
  • [Journal Article] Proposal and Generation of Endgame Puzzles for an Imperfect Information Game Geister2025

    • Author(s)
      Chu-Hsuan Hsueh, Takefumi Ishii, Tsuyoshi Hashimoto, Kokolo Ikeda
    • Journal Title

      Entertainment Computing

      Volume: 52 Pages: 100736-100736

    • DOI

      10.1016/j.entcom.2024.100736

    • Related Report
      Products Report
    • Peer Reviewed
  • [Journal Article] Analyses of Tabular AlphaZero on Strongly-Solved Stochastic Games2023

    • Author(s)
      Hsueh Chu-Hsuan、Ikeda Kokolo、Wu I-Chen、Chen Jr-Chang、Hsu Tsan-Sheng
    • Journal Title

      IEEE Access

      Volume: 11 Pages: 18157-18182

    • DOI

      10.1109/access.2023.3246638

    • Related Report
      Products Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] Graph Convolutional Networks for Turn-Based Strategy Games2022

    • Author(s)
      Wanxiang Li, Houkuan He, Chu-Hsuan Hsueh, and Kokolo Ikeda
    • Organizer
      The 14th International Conference on Agents and Artificial Intelligence
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Analyses of Tabular AlphaZero on NoGo2020

    • Author(s)
      Chu-Hsuan Hsueh, Kokolo Ikeda, Sang-Gyu Nam, and I-Chen Wu
    • Organizer
      2020 International Conference on Technologies and Applications of Artificial Intelligence
    • Related Report
      2020 Research-status Report
    • Int'l Joint Research

URL: 

Published: 2020-04-28   Modified: 2025-03-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi