• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2013 Fiscal Year Final Research Report

Exploration of a Breakthrough Technology for Emergence of Symbol Processing by Neuro-based Reinforcement Learning Considering Time Axis

Research Project

  • PDF
Project/Area Number 23500245
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Perception information processing/Intelligent robotics
Research InstitutionOita University

Principal Investigator

SHIBATA Katsunari  大分大学, 工学部, 准教授 (10260522)

Project Period (FY) 2011 – 2013
Keywords知能ロボット / 強化学習 / ニューラルネット / 高次機能 / シンボル処理創発 / 因果トレース / 概念形成 / コミュニケーション学習
Research Abstract

Efficient learning in a huge amount of spatio-temporal information holds the key to the emergence of higher functions in the real world. In this research, handling of time axis was especially focused on. A novel idea named "Causality traces" is propounded which judge the importance of events "subjectively" and are used for retrospective learning. Its learning performance exceeds that with the conventional method in value learning. Next, based on the idea that "concept" is formed from the difference of necessary motions, it is confirmed that discrete and abstract internal state representations are autonomously formed in a recurrent neural network through reinforcement learning. Furthermore, in autonomous communication learning, it was shown that information about target movement could be transmitted after learning. A novel perspective could be introduced to the handling of the time axis, but for the emergence of symbol processing, learning of dynamics should be further improved.

  • Research Products

    (12 results)

All 2014 2013 2012 2011

All Journal Article (6 results) Presentation (6 results) (of which Invited: 1 results)

  • [Journal Article] Emergence of Flexible Prediction-Based Discrete Decision Making and Continuous Motion Generation through Actor-Q-Learning2013

    • Author(s)
      Katsunari Shibata and Kenta Goto
    • Journal Title

      Proc. of Int'l Conf. on Development and Learning and on Epigenetic Robotics (ICDL-Epirob)

      Volume: ID 15 (CDROM)

  • [Journal Article] Mohamad Faizal bin Samsudin and Katsunari Shibata, Emergence of Discrete and Abstract State Representation in Continuous Input Task through Reinforcement Learning, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012

    • Author(s)
      Yoshito Sawatsubashi, Mohamad Faizal bin Samsudin and Katsunari Shibata
    • Journal Title

      Proc. of RiTA

      Volume: 13-22

  • [Journal Article] Differential Trace in Learning of Value Function with a Neural Network, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012

    • Author(s)
      Katsunari Shibata and Shuji Enoki
    • Journal Title

      Proc. of RiTA

      Volume: 55-64

  • [Journal Article] Emergence of Multi-Step Discrete State Transition through Reinforcement Learning with a Recurrent Neural Network2012

    • Author(s)
      Mohamad Faizal bin Samsudin, Yoshito Sawatsubashi and Katsunari Shibata
    • Journal Title

      LNCS(Lecture Notes in Computer Science), Neural Information Processing, Proc. of ICONIP (Int'l Conf. on Neural Information Processing)

      Volume: 7664 Pages: 583-590

  • [Journal Article] Emergence of Color Constancy Illusion through Reinforcement Learning with a Neural Network2012

    • Author(s)
      Katsunari Shibata and Shunsuke Kurizaki
    • Journal Title

      Proc. of ICDL-EpiRob (Int'l Conf. on Development and Learning- Epigenetic Robotics)

    • DOI

      10.1109/DevLrn.2012.6400580

  • [Journal Article] Emergence of Purposive and Grounded Communication through Reinforcement Learning, LNCS(Lecture Notes in Computer Science)2011

    • Author(s)
      Katsunari Shibata and Kazuki Sasahara
    • Journal Title

      LNCS(Lecture Notes in Computer Science), Vol. 7064, Proc. of ICONIP (Int'l Conf. on Neural Information Processing)

      Volume: 7064 Pages: 66-75

  • [Presentation] 因果トレース - 並列かつ主観的時間スケールの導入による過去の処理の効率的学習2014

    • Author(s)
      柴田克成
    • Organizer
      電子情報通信学会ニューロコンピューティング研究会
    • Place of Presentation
      東京
    • Year and Date
      2014-03-18
  • [Presentation] 強化学習によるリカレントニューラルネットワーク内部での振動子創発の可能性2013

    • Author(s)
      品矢 裕介
    • Organizer
      第32回計測自動制御学会九州支部学術講演会
    • Place of Presentation
      長崎
    • Year and Date
      2013-12-01
  • [Presentation] RNN を用いた強化学習によるセンサ信号の時間変化を表すコミュニケーションの創発2013

    • Author(s)
      朱 祺
    • Organizer
      第32回計測自動制御学会九州支部学術講演会
    • Place of Presentation
      長崎
    • Year and Date
      2013-12-01
  • [Presentation] ニューラルネットを用いた価値関数の学習における 微分型トレースの提案2012

    • Author(s)
      榎修志
    • Organizer
      計測自動制御学会システム, 情報部門学術講演会
    • Place of Presentation
      名古屋
    • Year and Date
      2012-11-23
  • [Presentation] リカレントネットを用いた強化学習における 離散的かつ抽象的な状態表現の創発2012

    • Author(s)
      沢津橋由人
    • Organizer
      計測自動制御学会 システム, 情報部門学術講演会
    • Place of Presentation
      名古屋
    • Year and Date
      2012-11-23
  • [Presentation] あめとむちで知能を作る? -知能ロボットって本当に賢いの?2011

    • Author(s)
      柴田克成
    • Organizer
      SOFT九州支部夏季ワークショップ
    • Place of Presentation
      玉名(熊本県)
    • Year and Date
      2011-09-01
    • Invited

URL: 

Published: 2015-07-16  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi