Modeling Partial Observability in Real-World Problems and Developing Reinforcement Learning Algorithms for it, along with Theoretical Analysis

Research Project

Project/Area Number	24KJ0818
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Multi-year Fund
Section	Domestic
Review Section	Basic Section 61030: Intelligent informatics-related
Research Institution	The University of Tokyo
Principal Investigator	西森創一朗, The University of Tokyo, Graduate School of Frontier Sciences, JSPS Research Fellow (DC1)
Project Period (FY)	2024-04-23 – 2027-03-31
Project Status	Granted (Fiscal Year 2024)
Budget Amount	¥2,200,000 (Direct Cost: ¥2,200,000); Fiscal Year 2026: ¥600,000 (Direct Cost: ¥600,000); Fiscal Year 2025: ¥600,000 (Direct Cost: ¥600,000); Fiscal Year 2024: ¥1,000,000 (Direct Cost: ¥1,000,000)
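Outline of Research at the Start	Real-world decision-making involves partial observability: only part of the information needed to make a decision can be observed. This research project aims to appropriately model the partial observability we face in practice, to develop reinforcement learning algorithms that exploit the model's assumptions to achieve high performance, and to establish theoretical performance guarantees for those algorithms.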