A study of modular models of decision making in uncertain and non-stationary environments

Research Project

Project/Area Number 21300113
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section General
Research Field Bioinformatics/Life informatics
Research Institution Kyoto University

Principal Investigator

ISHII Shin  Kyoto University, Graduate School of Informatics, Professor (90294280)

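Co-Investigator(Kenkyū-buntansha) NAKAMURA Yutaka  Osaka University, Graduate School of Engineering Science, Assistant Professor (70403334)
MAEDA Shinichi  Kyoto University, Graduate School of Informatics, Assistant Professor (20379530)
Co-Investigator(Renkei-kenkyūsha) MORI Takeshi  Osaka University, Graduate School of Engineering Science, Researcher
OSHIO Ritz  Kyoto University, Graduate School of Informatics, Researcher
SHIKAUCHI Yumi  Kyoto University, Graduate School of Informatics, Technical Assistant
MORIMOTO Satoshi  Kyoto University, Graduate School of Informatics, Technical Assistant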
Project Period (FY) 2009 – 2011
Project Status Completed (Fiscal Year 2011)
Budget Amount
¥18,070,000 (Direct Cost: ¥13,900,000, Indirect Cost: ¥4,170,000)
Fiscal Year 2011: ¥2,730,000 (Direct Cost: ¥2,100,000, Indirect Cost: ¥630,000)
Fiscal Year 2010: ¥7,280,000 (Direct Cost: ¥5,600,000, Indirect Cost: ¥1,680,000)
Fiscal Year 2009: ¥8,060,000 (Direct Cost: ¥6,200,000, Indirect Cost: ¥1,860,000)
Keywords Reinforcement learning / Modular architecture / Computational neuroscience / Robot / Non-invasive brain measurement
Research Abstract

We have developed statistical learning models, with a particular focus on reinforcement learning (RL), that can make decisions in uncertain and even non-stationary environments. We have derived an RL method in which a value function represented by a modular structure is approximated online and efficiently by adding new modules incrementally, as well as an optimal learning procedure for the value function based on the framework of semiparametric statistics. As an application, we have succeeded in the automatic control of non-holonomic systems by means of a policy-based RL method. In the human brain, we have found module-like structures that are activated while subjects perform a hierarchical inference task. Moreover, we have succeeded in decoding the inference process from the subjects' behavior and MRI images.
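The idea of approximating a value function online by adding modules incrementally can be illustrated with a small sketch. This is not the project's algorithm: it is plain TD(0) with Gaussian basis functions standing in for "modules", and a new module is added whenever no existing one responds strongly to the current state. The class name `ModularValueFunction`, the `novelty_threshold` rule, and the 1-D random-walk task are all assumptions introduced here for illustration.

```python
import math
import random


class ModularValueFunction:
    """V(s) = sum_k w_k * phi_k(s), with Gaussian modules added on demand.

    Hypothetical illustration; not the method derived in the project.
    """

    def __init__(self, width=0.1, novelty_threshold=0.5):
        self.width = width
        self.novelty_threshold = novelty_threshold
        self.centers = []  # one center per module
        self.weights = []  # one weight per module

    def features(self, s):
        # Activation of every existing module at state s.
        return [math.exp(-((s - c) ** 2) / (2 * self.width ** 2))
                for c in self.centers]

    def value(self, s):
        return sum(w * f for w, f in zip(self.weights, self.features(s)))

    def maybe_add_module(self, s):
        # Add a module centered at s when no existing module covers it well.
        if not self.centers or max(self.features(s)) < self.novelty_threshold:
            self.centers.append(s)
            self.weights.append(0.0)

    def td_update(self, s, reward, s_next, done, alpha=0.1, gamma=1.0):
        # One online TD(0) step on the current set of modules.
        self.maybe_add_module(s)
        phi = self.features(s)
        v_s = sum(w * f for w, f in zip(self.weights, phi))
        target = reward if done else reward + gamma * self.value(s_next)
        delta = target - v_s
        for k, f in enumerate(phi):
            self.weights[k] += alpha * delta * f
        return delta


def run_random_walk(episodes=2000, seed=0):
    """Random walk on positions 0..10; reward 1 only when reaching 10."""
    rng = random.Random(seed)
    vf = ModularValueFunction()
    for _ in range(episodes):
        i = 5
        while 0 < i < 10:
            j = i + rng.choice((-1, 1))
            done = j in (0, 10)
            reward = 1.0 if j == 10 else 0.0
            vf.td_update(i / 10, reward, j / 10, done)
            i = j
    return vf


if __name__ == "__main__":
    vf = run_random_walk()
    print("modules:", sorted(vf.centers))
    for s in (0.2, 0.5, 0.8):
        print(f"V({s}) = {vf.value(s):.2f}")
```

In this toy task the true value of a state equals its position, so the learned values should increase from left to right while the number of modules stays small because new ones are created only for poorly covered states.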

Report

(4 results)
  • 2011 Annual Research Report
  • Final Research Report (PDF)
  • 2010 Annual Research Report
  • 2009 Annual Research Report
  • Research Products

    (52 results)

All Journal Article (20 results) (of which Peer Reviewed: 10 results) Presentation (26 results) Book (4 results) Remarks (2 results)

  • [Journal Article] Robust reinforcement learning in sequential value function approximation (2011)

    • Author(s)
      T. Mori, S. Ishii
    • Journal Title

      IEEE Transactions on Systems, Man and Cybernetics

      Volume: 41(5) Pages: 1407-1416

    • Related Report
      2011 Annual Research Report 2011 Final Research Report
  • [Journal Article] Bayesian normalized Gaussian network and hierarchical model selection method (2011)

    • Author(s)
      J. Yoshimoto, M. Sato, S. Ishii
    • Journal Title

      Intelligent Automation and Soft Computing

      Volume: 17(1) Pages: 71-94

    • Related Report
      2011 Final Research Report
  • [Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model (2011)

    • Author(s)
      A. Sugahara, Y. Nakamura, I. Fukuyori, Y. Matsumoto, H. Ishiguro
    • Journal Title

      Journal of Robotics and Mechatronics

      Volume: 22(3) Pages: 315-321

    • Related Report
      2011 Final Research Report
  • [Journal Article] Generalized TD Learning (2011)

    • Author(s)
      T.Ueno, S.Maeda, M.Kawanabe, S.Ishii
    • Journal Title

      Journal of Machine Learning Research

      Volume: 12(6) Pages: 1977-2020

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Bayesian normalized Gaussian network and hierarchical model selection method (2011)

    • Author(s)
      J.Yoshimoto, M.Sato, S.Ishii
    • Journal Title

      Intelligent Automation and Soft Computing

      Volume: 17(1) Issue: 12 Pages: 71-94

    • DOI

      10.1371/journal.pone.0027950

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Visual attention model involving feature-based inhibition of return (2010)

    • Author(s)
      S. Hotta, S. Oba, S. Ishii
    • Journal Title

      Artificial Life and Robotics

      Volume: 15(2) Pages: 129-132

    • NAID

      120002647329

    • Related Report
      2011 Final Research Report
  • [Journal Article] Hierarchical rule switching in prefrontal cortex (2010)

    • Author(s)
      W. Yoshida, H. Funakoshi, S. Ishii
    • Journal Title

      NeuroImage

      Volume: 50(1) Pages: 314-322

    • Related Report
      2011 Final Research Report
  • [Journal Article] Generating circular motion of a human-like robotic arm using attractor selection model (2010)

    • Author(s)
      A.Sugahara, Y.Nakamura, I.Fukuyori, Y.Matsumoto, H.Ishiguro
    • Journal Title

      Journal of Robotics and Mechatronics

      Volume: 22(3) Pages: 315-321

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Visual attention model involving feature-based inhibition of return (2010)

    • Author(s)
      S.Hotta, S.Oba, S.Ishii
    • Journal Title

      Artificial Life and Robotics

      Volume: 15(2) Pages: 129-132

    • NAID

      120002647329

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Hierarchical rule switching in prefrontal cortex (2010)

    • Author(s)
      W.Yoshida, H.Funakoshi, S.Ishii
    • Journal Title

      NeuroImage

      Volume: 50(1) Pages: 314-322

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Boosting perceptual learning by fake feedback (2009)

    • Author(s)
      K. Shibata, N. Yamagishi, S. Ishii, M. Kawato
    • Journal Title

      Vision Research

      Volume: 49(21) Pages: 2574-2585

    • Related Report
      2011 Final Research Report
  • [Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering (2009)

    • Author(s)
      N. Yano, T. Shibata, S. Ishii
    • Journal Title

      Journal of Artificial Life and Robotics

      Volume: 13 Pages: 522-535

    • Related Report
      2011 Final Research Report
  • [Journal Article] Boosting perceptual learning by fake feedback (2009)

    • Author(s)
      K.Shibata, N.Yamagishi, S.Ishii, M.Kawato
    • Journal Title

      Vision Research

      Volume: 49(21) Pages: 2574-2585

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering (2009)

    • Author(s)
      N.Yano, T.Shibata, S.Ishii
    • Journal Title

      Journal of Artificial Life and Robotics

      Volume: 13 Pages: 522-535

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Asymptotic analysis of value prediction in well-specified and misspecified models

    • Author(s)
      T. Ueno, S. Maeda, S. Ishii
    • Journal Title

      Neural Networks

    • Related Report
      2011 Final Research Report
  • [Journal Article] Low-dimensional feature representation for instrumental identification

    • Author(s)
      M. Ishihara, S. Maeda, K. Ikeda, S. Ishii
    • Journal Title

      SICE Journal of Control, Measurement, and System Integration

    • NAID

      10031140141

    • Related Report
      2011 Final Research Report
  • [Journal Article] Generalized TD Learning

    • Author(s)
      T. Ueno, S. Maeda, M. Kawanabe, S. Ishii
    • Journal Title

      Journal of Machine Learning Research

    • Related Report
      2011 Final Research Report
  • [Journal Article] Asymptotic analysis of value prediction in well-specified and misspecified models

    • Author(s)
      T.Ueno, S.Maeda, S.Ishii
    • Journal Title

      Neural Networks

      Volume: (accepted, to appear)

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Low-dimensional feature representation for instrumental identification

    • Author(s)
      M.Ishihara, S.Maeda, K.Ikeda, S.Ishii
    • Journal Title

      SICE Journal of Control, Measurement, and System Integration

      Volume: (accepted, to appear)

    • NAID

      10031140141

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Generalized TD learning.

    • Author(s)
      T.Ueno, S.Maeda, M.Kawanabe, S.Ishii
    • Journal Title

      Journal of Machine Learning Research

      Volume: (to appear)

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Presentation] Machine learning methods for brain machine interface (2012)

    • Author(s)
      S.Ishii
    • Organizer
      Statistics for Biomedical and Social Mathematical Sciences
    • Place of Presentation
      Tokyo, Japan
    • Year and Date
      2012-03-02
    • Related Report
      2011 Annual Research Report
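  • [Presentation] A machine learning approach to network-type knowledge (2012)

    • Author(s)
      S. Ishii
    • Organizer
      From Brain Reading to Brain-Machine Interface. FY2011 Workshop on Collaborative Research between Mathematics/Mathematical Sciences and Other Sciences and Industry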
    • Place of Presentation
      Tokyo, Japan
    • Year and Date
      2012-02-23
    • Related Report
      2011 Annual Research Report
  • [Presentation] A control method for a redundant robot using stored instances (2012)

    • Author(s)
      Y.Okadome, Y.Nakamura, H.Ishiguro
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu, Japan
    • Year and Date
      2012-01-19
    • Related Report
      2011 Annual Research Report
  • [Presentation] Machine learning methods for brain machine interface (2012)

    • Author(s)
      S. Ishii
    • Organizer
      Statistics for Biomedical and Social Mathematical Sciences
    • Place of Presentation
      Tokyo
    • Related Report
      2011 Final Research Report
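  • [Presentation] From brain reading to brain-machine interface: "A machine learning approach to network-type knowledge" (2012)

    • Author(s)
      S. Ishii
    • Organizer
      FY2011 Workshop on Collaborative Research between Mathematics/Mathematical Sciences and Other Sciences and Industry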
    • Place of Presentation
      Tokyo
    • Related Report
      2011 Final Research Report
  • [Presentation] A control method for a redundant robot using stored instances (2012)

    • Author(s)
      Y. Okadome, Y. Nakamura, H. Ishiguro
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu
    • Related Report
      2011 Final Research Report
  • [Presentation] Toward a network-type brain-machine interface (2011)

    • Author(s)
      S. Ishii
    • Organizer
      Neuroscience 2011
    • Place of Presentation
      Yokohama, Japan
    • Year and Date
      2011-09-16
    • Related Report
      2011 Annual Research Report
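  • [Presentation] Brain-machine interface in the networked society (2011)

    • Author(s)
      S. Ishii
    • Organizer
      Institute of Electronics, Information and Communication Engineers (IEICE), Communications Society General Meeting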
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2011-09-14
    • Related Report
      2011 Annual Research Report
  • [Presentation] Toward a network-type brain-machine interface (2011)

    • Author(s)
      S. Ishii
    • Organizer
      Neuroscience 2011
    • Place of Presentation
      Yokohama
    • Related Report
      2011 Final Research Report
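  • [Presentation] Brain-machine interface in the networked society. Institute of Electronics, Information and Communication Engineers (IEICE) (2011)

    • Author(s)
      S. Ishii
    • Organizer
      Communications Society General Meeting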
    • Place of Presentation
      Sapporo
    • Related Report
      2011 Final Research Report
  • [Presentation] Hidden Markov model for human decision process in a partially observable environment (2011)

    • Author(s)
      M. Adomi, Y. Shikauchi, S. Ishii
    • Organizer
      International Conference on Neural Information Processing
    • Place of Presentation
      Sydney
    • Related Report
      2011 Final Research Report
  • [Presentation] Sparse and low-rank estimation of time-varying Markov networks with alternating direction method of multipliers (2010)

    • Author(s)
      J.Hirayama, A.Hyvarinen, S.Ishii
    • Organizer
      International Conference on Neural Information Processing
    • Place of Presentation
      Sydney, Australia
    • Year and Date
      2010-11-22
    • Related Report
      2010 Annual Research Report
  • [Presentation] Hidden Markov model for human decision process in a partially observable environment (2010)

    • Author(s)
      M.Adomi, Y.Shikauchi, S.Ishii
    • Organizer
      International Conference on Artificial Neural Networks
    • Place of Presentation
      Thessaloniki, Greece
    • Year and Date
      2010-09-17
    • Related Report
      2010 Annual Research Report
  • [Presentation] Separation of exploration and exploitation in maze navigation task (2010)

    • Author(s)
      Y.Shikauchi, M.Adomi, S.Ishii
    • Organizer
      Neuro2010
    • Place of Presentation
      Kobe, Japan
    • Year and Date
      2010-09-01
    • Related Report
      2010 Annual Research Report
  • [Presentation] Sparse and low-rank estimation of time-varying Markov networks with alternating direction method of multipliers (2010)

    • Author(s)
      J. Hirayama, A. Hyvarinen, S. Ishii
    • Organizer
      International Conference on Neural Information Processing
    • Place of Presentation
      Sydney
    • Related Report
      2011 Final Research Report
  • [Presentation] Separation of exploration and exploitation in maze navigation task (2010)

    • Author(s)
      Y. Shikauchi, S. Ishii
    • Organizer
      Neuro2010
    • Place of Presentation
      Kobe
    • Related Report
      2011 Final Research Report
  • [Presentation] Robust approximation in decomposed reinforcement learning (2009)

    • Author(s)
      T.Mori, S.Ishii
    • Organizer
      International Conference on Neural Information Processing
    • Place of Presentation
      Bangkok, Thailand
    • Year and Date
      2009-12-02
    • Related Report
      2009 Annual Research Report
  • [Presentation] An additive reinforcement learning (2009)

    • Author(s)
      T.Mori, S.Ishii
    • Organizer
      International Conference on Artificial Neural Networks
    • Place of Presentation
      Limassol, Cyprus
    • Year and Date
      2009-09-14
    • Related Report
      2009 Annual Research Report
  • [Presentation] Optimal online learning procedures for model-free policy evaluation (2009)

    • Author(s)
      T.Ueno, M.Kawanabe, S.Maeda, S.Ishii
    • Organizer
      European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
    • Place of Presentation
      Ljubljana, Slovenia
    • Year and Date
      2009-09-08
    • Related Report
      2009 Annual Research Report
  • [Presentation] Visual attention model involving feature-based inhibition of return (2009)

    • Author(s)
      S.Hotta, S.Oba, S.Ishii
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu, Japan
    • Year and Date
      2009-02-05
    • Related Report
      2009 Annual Research Report
  • [Presentation] Machine learning approach to 9-DOF arm robot control (2009)

    • Author(s)
      S.Nishioka, S.Maeda, S.Ishii
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu, Japan
    • Year and Date
      2009-02-04
    • Related Report
      2009 Annual Research Report
  • [Presentation] Visual attention model involving feature-based inhibition of return (2009)

    • Author(s)
      S. Hotta, S. Oba, S. Ishii
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu
    • Related Report
      2011 Final Research Report
  • [Presentation] Machine learning approach to 9-DOF arm robot control (2009)

    • Author(s)
      S. Nishioka, S. Maeda, S. Ishii
    • Organizer
      International Symposium on Artificial Life and Robotics
    • Place of Presentation
      Beppu
    • Related Report
      2011 Final Research Report
  • [Presentation] Robust approximation in decomposed reinforcement learning (2009)

    • Author(s)
      T. Mori, S. Ishii
    • Organizer
      International Conference on Neural Information Processing
    • Place of Presentation
      Bangkok
    • Related Report
      2011 Final Research Report
  • [Presentation] Optimal online learning procedures for model-free policy evaluation (2009)

    • Author(s)
      T. Ueno, M. Kawanabe, S. Maeda, S. Ishii
    • Organizer
      European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
    • Place of Presentation
      Ljubljana
    • Related Report
      2011 Final Research Report
  • [Presentation] An additive reinforcement learning (2009)

    • Author(s)
      T. Mori, S. Ishii
    • Organizer
      International Conference on Artificial Neural Networks
    • Place of Presentation
      Limassol
    • Related Report
      2011 Final Research Report
  • [Book] Kagaku (Science) (2010)

    • Author(s)
      S. Ishii
    • Total Pages
      80
    • Publisher
      Iwanami Shoten
    • Related Report
      2011 Final Research Report
  • [Book] Value and Learning / Yoku Wakaru Ninchi Kagaku (Easy-to-Understand Cognitive Science) (2010)

    • Author(s)
      S. Ishii
    • Publisher
      Minerva Shobo
    • Related Report
      2011 Final Research Report
  • [Book] Kagaku (Science), 80(12), contributing author (2010)

    • Author(s)
      S. Ishii
    • Total Pages
      1188
    • Publisher
      Iwanami Shoten
    • Related Report
      2010 Annual Research Report
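  • [Book] Value and Learning / Yoku Wakaru Ninchi Kagaku (Easy-to-Understand Cognitive Science), contributing author (Section IV-7) (2010)

    • Author(s)
      S. Ishii
    • Total Pages
      3
    • Publisher
      Minerva Shobo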
    • Related Report
      2009 Annual Research Report
  • [Remarks] Website: Integrated Systems Biology (Ishii) Laboratory homepage

    • URL

      http://hawaii.sys.i.kyoto-u.ac.jp/home

    • Related Report
      2011 Final Research Report
  • [Remarks] Shin Ishii's homepage

    • URL

      http://hawaii.sys.i.kyoto-u.ac.jp/~ishii/

    • Related Report
      2011 Final Research Report

Published: 2009-04-01   Modified: 2016-04-21  
