2009 Fiscal Year Self-evaluation Report

Automatic and rapid realization of higher brain functions by partially observable Markov decision processes

Research Project

Project/Area Number	19700215
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Single-year Grants
Research Field	Sensitivity informatics/Soft computing
Research Institution	Tokyo Institute of Technology
Principal Investigator	ITOH Hideaki Tokyo Institute of Technology, 理工学部, 講師 (20345375)
Project Period (FY)	2007 – 2010
Keywords	POMDP / 確率的最適制御 / 高次脳機能 / 推論 / 報酬最大化 / 適応制御 / 階層モデル / 階層制御
Research Abstract	本研究の目的は、ゴール指向性推論・選択的注意・作業記憶の利用などの高次脳機能を包括的に実現するエージェントを設計することである。このような諸機能を設計者が作りこむのは容易ではなく、例えばいつ何に注意を向けるのがよいか、また、いつどのような推論をしたらよいかなどをあらかじめ設計者が決めておくのは困難である。そこで本研究では、報酬最大化原理に基づきエージェントが環境にあわせて自動的に必要な機能を発現するように設計する。このようなアプローチについては代表者によるものを含めてこれまでにも既にいくつかの研究がなされているが、本研究ではそれらを発展させ、複雑な機能であっても現実的な時間内に発現できるよう、部分観測マルコフ決定過程(POMDP)理論に基づく高性能なモデル有り学習法を用いて高速な実装を目指す。 POMDPに関しては近年よい解法が次々と開発されてきているが、問題によっては満足な解が得られないことも考えられる。これは予想される困難のうちで最も大きなものであるが、必要に応じて新たな手法やPOMDP以外の解法も検討する。また、環境モデルの学習手法としては代表者が研究した経験のあるダイナミックベイジアンネットワークのonline学習法を想定しているが、online学習は性能あるいは計算量の面で問題がある可能性も考えられる。その場合はbatch学習を採用するなど適切に対処する。

Research Products
(3 results)

All 2009 2008

All Presentation (3 results)

[Presentation] Comparison of Near-Threshold Characteristics of Flash Suppression and Forward Masking2009
- Author(s)
  Kenji Aoki, Hiroki Takahashi, Hideaki Itoh, Kiyohiko Nakamura
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Bangkok, Thailand
- Year and Date
  2009-12-03
[Presentation] Towards a Comparative Theory of the Primates' Tool-use Behavior, Towards a Comparative Theory of the Primates' Tool-use Behavior2008
- Author(s)
  Toshisada Mariyama, Hideaki Itoh
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Auckland, New Zealand
- Year and Date
  2008-11-26
[Presentation] 情動状態の思考による鎮静化現象の分析の試み2008
- Author(s)
  清川舞, 伊藤秀昭, 中村清彦
- Organizer
  「脳と心のメカニズム」冬のワークショップ
- Place of Presentation
  北海道
- Year and Date
  2008-01-10

2009 Fiscal Year Self-evaluation Report

Automatic and rapid realization of higher brain functions by partially observable Markov decision processes

Principal Investigator

ITOH Hideaki Tokyo Institute of Technology, 理工学部, 講師 (20345375)

Research Products

[Presentation] Comparison of Near-Threshold Characteristics of Flash Suppression and Forward Masking2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Towards a Comparative Theory of the Primates' Tool-use Behavior, Towards a Comparative Theory of the Primates' Tool-use Behavior2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 情動状態の思考による鎮静化現象の分析の試み2008

Author(s)

Organizer

Place of Presentation

Year and Date