2013 Fiscal Year Final Research Report

Exploration of a Breakthrough Technology for Emergence of Symbol Processing by Neuro-based Reinforcement Learning Considering Time Axis

Research Project

Project/Area Number	23500245
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Research Field	Perception information processing/Intelligent robotics
Research Institution	Oita University
Principal Investigator	SHIBATA Katsunari 大分大学, 工学部, 准教授 (10260522)
Project Period (FY)	2011 – 2013
Keywords	知能ロボット / 強化学習 / ニューラルネット / 高次機能 / シンボル処理創発 / 因果トレース / 概念形成 / コミュニケーション学習
Research Abstract	Efficient learning in a huge amount of spatio-temporal information holds the key to the emergence of higher functions in the real world. In this research, handling of time axis was especially focused on. A novel idea named "Causality traces" is propounded which judge the importance of events "subjectively" and are used for retrospective learning. Its learning performance exceeds that with the conventional method in value learning. Next, based on the idea that "concept" is formed from the difference of necessary motions, it is confirmed that discrete and abstract internal state representations are autonomously formed in a recurrent neural network through reinforcement learning. Furthermore, in autonomous communication learning, it was shown that information about target movement could be transmitted after learning. A novel perspective could be introduced to the handling of the time axis, but for the emergence of symbol processing, learning of dynamics should be further improved.

Research Products
(12 results)

All 2014 2013 2012 2011

All Journal Article (6 results) Presentation (6 results) (of which Invited: 1 results)

[Journal Article] Emergence of Flexible Prediction-Based Discrete Decision Making and Continuous Motion Generation through Actor-Q-Learning2013
- Author(s)
  Katsunari Shibata and Kenta Goto
- Journal Title
  
  Proc. of Int'l Conf. on Development and Learning and on Epigenetic Robotics (ICDL-Epirob)
  
  Volume: ID 15 (CDROM)
[Journal Article] Mohamad Faizal bin Samsudin and Katsunari Shibata, Emergence of Discrete and Abstract State Representation in Continuous Input Task through Reinforcement Learning, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012
- Author(s)
  Yoshito Sawatsubashi, Mohamad Faizal bin Samsudin and Katsunari Shibata
- Journal Title
  
  Proc. of RiTA
  
  Volume: 13-22
[Journal Article] Differential Trace in Learning of Value Function with a Neural Network, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012
- Author(s)
  Katsunari Shibata and Shuji Enoki
- Journal Title
  
  Proc. of RiTA
  
  Volume: 55-64
[Journal Article] Emergence of Multi-Step Discrete State Transition through Reinforcement Learning with a Recurrent Neural Network2012
- Author(s)
  Mohamad Faizal bin Samsudin, Yoshito Sawatsubashi and Katsunari Shibata
- Journal Title
  
  LNCS(Lecture Notes in Computer Science), Neural Information Processing, Proc. of ICONIP (Int'l Conf. on Neural Information Processing)
  
  Volume: 7664 Pages: 583-590
[Journal Article] Emergence of Color Constancy Illusion through Reinforcement Learning with a Neural Network2012
- Author(s)
  Katsunari Shibata and Shunsuke Kurizaki
- Journal Title
  
  Proc. of ICDL-EpiRob (Int'l Conf. on Development and Learning- Epigenetic Robotics)
- DOI
  10.1109/DevLrn.2012.6400580
[Journal Article] Emergence of Purposive and Grounded Communication through Reinforcement Learning, LNCS(Lecture Notes in Computer Science)2011
- Author(s)
  Katsunari Shibata and Kazuki Sasahara
- Journal Title
  
  LNCS(Lecture Notes in Computer Science), Vol. 7064, Proc. of ICONIP (Int'l Conf. on Neural Information Processing)
  
  Volume: 7064 Pages: 66-75
[Presentation] 因果トレース - 並列かつ主観的時間スケールの導入による過去の処理の効率的学習2014
- Author(s)
  柴田克成
- Organizer
  電子情報通信学会ニューロコンピューティング研究会
- Place of Presentation
  東京
- Year and Date
  2014-03-18
[Presentation] 強化学習によるリカレントニューラルネットワーク内部での振動子創発の可能性2013
- Author(s)
  品矢裕介
- Organizer
  第32回計測自動制御学会九州支部学術講演会
- Place of Presentation
  長崎
- Year and Date
  2013-12-01
[Presentation] RNN を用いた強化学習によるセンサ信号の時間変化を表すコミュニケーションの創発2013
- Author(s)
  朱祺
- Organizer
  第32回計測自動制御学会九州支部学術講演会
- Place of Presentation
  長崎
- Year and Date
  2013-12-01
[Presentation] ニューラルネットを用いた価値関数の学習における微分型トレースの提案2012
- Author(s)
  榎修志
- Organizer
  計測自動制御学会システム, 情報部門学術講演会
- Place of Presentation
  名古屋
- Year and Date
  2012-11-23
[Presentation] リカレントネットを用いた強化学習における離散的かつ抽象的な状態表現の創発2012
- Author(s)
  沢津橋由人
- Organizer
  計測自動制御学会システム, 情報部門学術講演会
- Place of Presentation
  名古屋
- Year and Date
  2012-11-23
[Presentation] あめとむちで知能を作る? -知能ロボットって本当に賢いの?2011
- Author(s)
  柴田克成
- Organizer
  SOFT九州支部夏季ワークショップ
- Place of Presentation
  玉名(熊本県)
- Year and Date
  2011-09-01
- Invited

2013 Fiscal Year Final Research Report

Exploration of a Breakthrough Technology for Emergence of Symbol Processing by Neuro-based Reinforcement Learning Considering Time Axis

Principal Investigator

SHIBATA Katsunari 大分大学, 工学部, 准教授 (10260522)

Research Products

[Journal Article] Emergence of Flexible Prediction-Based Discrete Decision Making and Continuous Motion Generation through Actor-Q-Learning2013

Author(s)

Journal Title

[Journal Article] Mohamad Faizal bin Samsudin and Katsunari Shibata, Emergence of Discrete and Abstract State Representation in Continuous Input Task through Reinforcement Learning, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012

Author(s)

Journal Title

[Journal Article] Differential Trace in Learning of Value Function with a Neural Network, Advances in Intelligent Systems and Computing, Robot Intelligence Technology and Applications 20122012

Author(s)

Journal Title

[Journal Article] Emergence of Multi-Step Discrete State Transition through Reinforcement Learning with a Recurrent Neural Network2012

Author(s)

Journal Title

[Journal Article] Emergence of Color Constancy Illusion through Reinforcement Learning with a Neural Network2012

Author(s)

Journal Title

DOI

[Journal Article] Emergence of Purposive and Grounded Communication through Reinforcement Learning, LNCS(Lecture Notes in Computer Science)2011

Author(s)

Journal Title

[Presentation] 因果トレース - 並列かつ主観的時間スケールの導入による過去の処理の効率的学習2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 強化学習によるリカレントニューラルネットワーク内部での振動子創発の可能性2013

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] RNN を用いた強化学習によるセンサ信号の時間変化を表すコミュニケーションの創発2013

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ニューラルネットを用いた価値関数の学習における 微分型トレースの提案2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] リカレントネットを用いた強化学習における 離散的かつ抽象的な状態表現の創発2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] あめとむちで知能を作る? -知能ロボットって本当に賢いの?2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ニューラルネットを用いた価値関数の学習における微分型トレースの提案2012

[Presentation] リカレントネットを用いた強化学習における離散的かつ抽象的な状態表現の創発2012