A study of modular models of decision making in uncertain and non-stationary environments

Research Project

Project/Area Number	21300113
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Bioinformatics/Life informatics
Research Institution	Kyoto University
Principal Investigator	ISHII Shin 京都大学, 大学院・情報学研究科, 教授 (90294280)
Co-Investigator(Kenkyū-buntansha)	NAKAMURA Yutaka 大阪大学, 基礎工学研究科, 助教 (70403334) MAEDA Shinichi 京都大学, 大学院・情報学研究科, 助教 (20379530)
Co-Investigator(Renkei-kenkyūsha)	MORI Takeshi 大阪大学, 基礎工学研究科, 研究員 OSHIO Ritz 京都大学, 大学院・情報学研究科, 研究員 SHIKAUCHI Yumi 京都大学, 大学院・情報学研究科, 技術補佐員 MORIMOTO Satoshi 京都大学, 大学院・情報学研究科, 技術補佐員
Project Period (FY)	2009 – 2011
Project Status	Completed (Fiscal Year 2011)
Budget Amount *help	¥18,070,000 (Direct Cost: ¥13,900,000、Indirect Cost: ¥4,170,000) Fiscal Year 2011: ¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000) Fiscal Year 2010: ¥7,280,000 (Direct Cost: ¥5,600,000、Indirect Cost: ¥1,680,000) Fiscal Year 2009: ¥8,060,000 (Direct Cost: ¥6,200,000、Indirect Cost: ¥1,860,000)
Keywords	強化学習 / モジュールアーキテクチャ / 計算論的神経科学 / ロボット / 非侵襲脳計測
Research Abstract	We have developed statistical learning models, with a particular interest in reinforcement learning(RL), which can perform decision making in uncertain and even non-stationary environments. We have derived an RL method in which value function represented by a module structure can be online and efficiently approximated by adding new modules in an incremental fashion, and an optimal learning procedure of the value function based on the framework of semi-parametric statistics. As an application, we have succeeded in automatic control of non-holonomic systems by means of a policy-based RL method. In the human brain, we have found module-like structures which are activated when inferring a hierarchical inference task. Moreover, we have succeeded in decoding inference process based on the subject's behaviors and MRI scanned images.

Report

(4 results)

2011 Annual Research Report Final Research Report ( PDF )
2010 Annual Research Report
2009 Annual Research Report

Research Products
(52 results)

All 2012 2011 2010 2009 Other

All Journal Article (20 results) (of which Peer Reviewed: 10 results) Presentation (26 results) Book (4 results) Remarks (2 results)

[Journal Article] Robust reinforcement learning in sequential value function approximation2011
- Author(s)
  T. Mori, S. Ishii
- Journal Title
  
  IEEE Transactions on Systems, Man and Cybernetics
  
  Volume: 41(5) Pages: 1407-1416
- Related Report
  2011 Annual Research Report 2011 Final Research Report
[Journal Article] Bayesian normalized gaussian network and hierarchical model selection method2011
- Author(s)
  J. Yoshimoto, M. Sato, S. Ishii
- Journal Title
  
  ntelligent Automation and Soft Computing
  
  Volume: 17(1) Pages: 71-94
- Related Report
  2011 Final Research Report
[Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model2011
- Author(s)
  A. Sugahara, Y. Nakamura, I. Fukuyori, Y. Matsumoto, H. Ishiguro
- Journal Title
  
  Journal of Robotics and Mechatronics
  
  Volume: 22(3) Pages: 71-94
- Related Report
  2011 Final Research Report
[Journal Article] Generalized TD Learning2011
- Author(s)
  T.Ueno, S.Maeda, M.Kawanabe, S.Ishii
- Journal Title
  
  Journal of Machine Learning Research
  
  Volume: 12(6) Pages: 1977-2020
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Bayesian normalized gaussian network and hierarchical model selection metbod2011
- Author(s)
  J.Yoshimoto, M.Sato, S.Ishii
- Journal Title
  
  Intelligent Automation and Soft Computing
  
  Volume: 17(1) Issue: 12 Pages: 71-94
- DOI
  10.1371/journal.pone.0027950
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Visual attention model involving feature-based inhibition of return2010
- Author(s)
  S. Hotta, S. Oba, S. Ishii
- Journal Title
  
  Journal of Artificial Life and Robotics
  
  Volume: 15(2) Pages: 129-132
- NAID
  120002647329
- Related Report
  2011 Final Research Report
[Journal Article] Hierarchical rule switching in prefrontal cortex2010
- Author(s)
  W. Yoshida, H. Funakoshi, S. Ishii
- Journal Title
  
  Neuro Image
  
  Volume: 50(1) Pages: 314-322
- Related Report
  2011 Final Research Report
[Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model.2010
- Author(s)
  A.Sugahara, Y.Nakamura, I.Fukuyori, Y.Matsumoto, H.Ishiguro
- Journal Title
  
  Journal of Robotics and Mechatronics
  
  Volume: 22(3) Pages: 315-321
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Visual attention model involving feature-based inhibition of return.2010
- Author(s)
  S.Hotta, S.Oba, S.Ishii
- Journal Title
  
  Artificial Life and Robotics
  
  Volume: 15(2) Pages: 129-132
- NAID
  120002647329
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Hierarchical rule switching in prefrontal cortex2010
- Author(s)
  W.Yoshida, H.Funakoshi, S.Ishii
- Journal Title
  
  NeuroImage 50(1)
  
  Pages: 314-322
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] Boosting perceptual learning by fake feedback2009
- Author(s)
  K. Shibata, N. Yamagishi, S. Ishii, M. Kawato
- Journal Title
  
  Vision Research
  
  Volume: 49(21) Pages: 2574-2585
- Related Report
  2011 Final Research Report
[Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering2009
- Author(s)
  N. Yano, T. Shibata, S. Ishii
- Journal Title
  
  Journal of Artificial Life and Robotics
  
  Volume: 13 Pages: 522-535
- Related Report
  2011 Final Research Report
[Journal Article] Boosting perceptual learning by fake feedback2009
- Author(s)
  K.Shibata, N.Yamagishi, S.Ishii, M.Kawato
- Journal Title
  
  Vision Research 49(21)
  
  Pages: 2574-2585
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering2009
- Author(s)
  N.Yano, T.Shibata, S.Ishii
- Journal Title
  
  Journal of Artificial Life and Robotics 13
  
  Pages: 522-535
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] Asymptotic analysis of value prediction in well-specified and misspecified models
- Author(s)
  T. Ueno, S. Maeda, S. Ishii
- Journal Title
  
  Neural Networks
- Related Report
  2011 Final Research Report
[Journal Article] Low-dimensional feature representation for instrumental identification
- Author(s)
  M. Ishihara, S. Maeda, K. Ikeda, S. Ishii
- Journal Title
  
  SICE Journal of Control, Measurement, and System Integration
- NAID
  10031140141
- Related Report
  2011 Final Research Report
[Journal Article] Generalized TD Learning
- Author(s)
  T. Ueno, S. Maeda, M. Kawanabe, S. Ishii
- Journal Title
  
  Journal of Machine Learning Research
- Related Report
  2011 Final Research Report
[Journal Article] Asymptotic analys of value prediction in well-specified and misspecified models
- Author(s)
  T.Ueno, S.Maeda, S.Ishii
- Journal Title
  
  Neural Networks
  
  Volume: (掲載確定)(to appear)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Low-dimensional feature representation for instrumental identification
- Author(s)
  M.Ishihara, S.Maeda, K.Ikeda, S.Ishii
- Journal Title
  
  SICE Journal of Control, Measurement and System Integration
  
  Volume: (掲載確定)(to appear)
- NAID
  10031140141
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Generalized TD learning.
- Author(s)
  T.Ueno, S.Maeda, M.Kawanabe, S.Ishii
- Journal Title
  
  Journal of Machine Learning Research
  
  Volume: (to appear)
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Presentation] Machine learning methods for brain machine interface2012
- Author(s)
  S.Ishii
- Organizer
  statistics for Biomedical and Social Mathematical Sciences
- Place of Presentation
  Tokyo, Japan
- Year and Date
  2012-03-02
- Related Report
  2011 Annual Research Report
[Presentation] ネットワーク型知識に関する機械学習的アプローチ2012
- Author(s)
  石井信
- Organizer
  ブレインリーディングからブレインマシンインターフェースへ.平成23年度数学・数理科学と諸科学・産業との連携研究ワークショップ
- Place of Presentation
  Tokyo, Japan
- Year and Date
  2012-02-23
- Related Report
  2011 Annual Research Report
[Presentation] A control method for a redundant robot using stored instances2012
- Author(s)
  Y.Okadome, Y.Nakamura, H.Ishiguro
- Organizer
  International symposium on artificial life and robotics
- Place of Presentation
  Beppu, Japan
- Year and Date
  2012-01-19
- Related Report
  2011 Annual Research Report
[Presentation] Machine learning methods for brain machine interface2012
- Author(s)
  S. Ishii
- Organizer
  Statistics for Biomedical and Social Mathematical Sciences
- Place of Presentation
  Tokyo
- Related Report
  2011 Final Research Report
[Presentation] ブレインリーディングからブレインマシンインターフェースへ「ネットワーク型知識に関する機械学習的アプローチ」2012
- Author(s)
  石井信
- Organizer
  2011年度数学・数理科学と諸科学・産業との連携研究ワークショップ
- Place of Presentation
  Tokyo
- Related Report
  2011 Final Research Report
[Presentation] A control method for a redundant robot using stored instances2012
- Author(s)
  Y. Okadome, Y. Nakamura, H. Ishiguro
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu
- Related Report
  2011 Final Research Report
[Presentation] ネットワーク型ブレインマシンインターフェースに向けて2011
- Author(s)
  石井信
- Organizer
  Neuroscience 2011
- Place of Presentation
  Yokohama, Japan
- Year and Date
  2011-09-16
- Related Report
  2011 Annual Research Report
[Presentation] ネットワーク社会のブレインマシンインタフェース2011
- Author(s)
  石井信
- Organizer
  (社)電子情報通信学会通信ソサイエティ総会
- Place of Presentation
  Sapporo, Japan
- Year and Date
  2011-09-14
- Related Report
  2011 Annual Research Report
[Presentation] ネットワーク型ブレインマシンインターフェースに向けて2011
- Author(s)
  石井信
- Organizer
  Neuroscience 2011
- Place of Presentation
  Yokohama
- Related Report
  2011 Final Research Report
[Presentation] ネットワーク社会のブレインマシンインターフェース.(社)電子情報通信学会2011
- Author(s)
  石井信
- Organizer
  通信ソサイエティ総会
- Place of Presentation
  Sapporo
- Related Report
  2011 Final Research Report
[Presentation] Hidden Markov model for human decision process in a partially observable environment2011
- Author(s)
  M. Adomi, Y. Shikauchi, S. Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Sydney
- Related Report
  2011 Final Research Report
[Presentation] Sparse and low-rank estimation of time-varying Markov networks with alternating direction method of multipliers2010
- Author(s)
  J.Hirayama, A.Hyvarinen, S.Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Sydney, Australia
- Year and Date
  2010-11-22
- Related Report
  2010 Annual Research Report
[Presentation] Hidden Markov model for human decision process in a partially observable environment2010
- Author(s)
  M.Adomi, Y.Shikauchi, S.Ishii
- Organizer
  International Conference on Artificial Neural Networks
- Place of Presentation
  Thessaloniki, Greece
- Year and Date
  2010-09-17
- Related Report
  2010 Annual Research Report
[Presentation] Separation of exploration and exploitation in maze navigation task2010
- Author(s)
  Y.Shikauchi, M.Adomi, S.Ishii
- Organizer
  Neuro2010
- Place of Presentation
  Kobe, Japan
- Year and Date
  2010-09-01
- Related Report
  2010 Annual Research Report
[Presentation] Sparse and low-rank estimation of time-varying Markov networks with alternating direction method of multipliers2010
- Author(s)
  J. Hirayama, A. Hyvarinen, S. Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Sydney
- Related Report
  2011 Final Research Report
[Presentation] Separation of exploration and exploitation in maze navigation task2010
- Author(s)
  Y. Shikauchi, S. Ishii
- Organizer
  Neuro2010
- Place of Presentation
  Kobe
- Related Report
  2011 Final Research Report
[Presentation] Robust approximation in decomposed reinforcement learning2009
- Author(s)
  T.Mori, S.Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Bangkok, Thailand
- Year and Date
  2009-12-02
- Related Report
  2009 Annual Research Report
[Presentation] An additive reinforcement learning2009
- Author(s)
  T.Mori, S.Ishii
- Organizer
  International Conference on Artificial Neural Networks
- Place of Presentation
  Limassol, Cyprus
- Year and Date
  2009-09-14
- Related Report
  2009 Annual Research Report
[Presentation] Optimal online learning procedures for model-free policy evaluation2009
- Author(s)
  T.Ueno, M.Kawanabe, S.Maeda, S.Ishii
- Organizer
  European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
- Place of Presentation
  Ljubljana, Slovenija
- Year and Date
  2009-09-08
- Related Report
  2009 Annual Research Report
[Presentation] Visual attention model involving feature-based inhibition of return2009
- Author(s)
  S.Hotta, S.Oba, S.Ishii
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu, Japan
- Year and Date
  2009-02-05
- Related Report
  2009 Annual Research Report
[Presentation] Machine learning approarch to 9-DOF arm robot control2009
- Author(s)
  S.Nishioka, S.Maeda, S.Ishii
- Organizer
  International Symposium on Artificial Life and Robitics
- Place of Presentation
  Beppu, Japan
- Year and Date
  2009-02-04
- Related Report
  2009 Annual Research Report
[Presentation] Visual attention model involving feature-based inhibition of return2009
- Author(s)
  S. Hotta, S. Oba, S. Ishii
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu
- Related Report
  2011 Final Research Report
[Presentation] Machine learning approach to 9-DOF arm robot control2009
- Author(s)
  S. Nishioka, S. Maeda, S. Ishii
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu
- Related Report
  2011 Final Research Report
[Presentation] Robust approximation in decomposed reinforcement learning2009
- Author(s)
  T. Mori, S. Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Bangkok
- Related Report
  2011 Final Research Report
[Presentation] Optimal online learning procedures for model-free policy evaluation2009
- Author(s)
  T. Ueno, M. Kawanabe, S. Maeda, S. Ishii
- Organizer
  European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
- Place of Presentation
  Ljubljana
- Related Report
  2011 Final Research Report
[Presentation] An additive reinforcement learning2009
- Author(s)
  T. Mori, S. Ishii
- Organizer
  International Conference on Artificial Neural Networks
- Place of Presentation
  Ljubljana
- Related Report
  2011 Final Research Report
[Book] 科学2010
- Author(s)
  石井信
- Total Pages
  80
- Publisher
  岩波書店
- Related Report
  2011 Final Research Report
[Book] 価値と学習・よくわかる認知科学2010
- Author(s)
  石井信
- Publisher
  ミネルヴァ書房
- Related Report
  2011 Final Research Report
[Book] 科学, 80(12),分担執筆2010
- Author(s)
  石井信
- Total Pages
  1188
- Publisher
  岩波書店
- Related Report
  2010 Annual Research Report
[Book] 価値と学習・よくわかる認知科学分担執筆 (IV-7節)2010
- Author(s)
  石井信
- Total Pages
  3
- Publisher
  ミネルヴァ書房
- Related Report
  2009 Annual Research Report
[Remarks] ホームページ等論理生命学(石井)研究室HP
- URL
  http://hawaii.sys.i.kyoto-u.ac.jp/home
- Related Report
  2011 Final Research Report
[Remarks] 石井信HP
- URL
  http://hawaii.sys.i.kyoto-u.ac.jp/~ishii/
- Related Report
  2011 Final Research Report

A study of modular models of decision making in uncertain and non-stationary environments

Principal Investigator

ISHII Shin 京都大学, 大学院・情報学研究科, 教授 (90294280)

¥18,070,000 (Direct Cost: ¥13,900,000、Indirect Cost: ¥4,170,000)

Report

Research Products

[Journal Article] Robust reinforcement learning in sequential value function approximation2011

Author(s)

Journal Title

Related Report

[Journal Article] Bayesian normalized gaussian network and hierarchical model selection method2011

Author(s)

Journal Title

Related Report

[Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model2011

Author(s)

Journal Title

Related Report

[Journal Article] Generalized TD Learning2011

Author(s)

Journal Title

Related Report

[Journal Article] Bayesian normalized gaussian network and hierarchical model selection metbod2011

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Visual attention model involving feature-based inhibition of return2010

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Hierarchical rule switching in prefrontal cortex2010

Author(s)

Journal Title

Related Report

[Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model.2010

Author(s)

Journal Title

Related Report

[Journal Article] Visual attention model involving feature-based inhibition of return.2010

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Hierarchical rule switching in prefrontal cortex2010

Author(s)

Journal Title

Related Report

[Journal Article] Boosting perceptual learning by fake feedback2009

Author(s)

Journal Title

Related Report

[Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering2009

Author(s)

Journal Title

Related Report

[Journal Article] Boosting perceptual learning by fake feedback2009

Author(s)

Journal Title

Related Report

[Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering2009

Author(s)

Journal Title

Related Report

[Journal Article] Asymptotic analysis of value prediction in well-specified and misspecified models

Author(s)

Journal Title

Related Report

[Journal Article] Low-dimensional feature representation for instrumental identification

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Generalized TD Learning

Author(s)

Journal Title

Related Report

[Journal Article] Asymptotic analys of value prediction in well-specified and misspecified models

Author(s)