2008 Fiscal Year Annual Research Report

大脳情報処理模倣型強化学習システム

Research Project

Project/Area Number	20500207
Research Institution	Yamaguchi University
Principal Investigator	小林邦和 Yamaguchi University, 大学院・理工学研究科, 助教 (40263793)
Co-Investigator(Kenkyū-buntansha)	大林正直山口大学, 大学院・理工学研究科, 教授 (60213849) 呉本尭山口大学, 大学院・理工学研究科, 助教 (40294657)
Keywords	強化学習 / マルチエージェント / 大脳情報処理 / 状態予測 / 自律構成 / モジュール構造 / 協調行動 / ロボット
Research Abstract	平成20年度は主にマルチエージェントシステムの創発現象の定式化と計算機上の仮想環境における性能評価を行った.詳細は以下の通りである. 1.マルチエージェントシステムにおけるミクローマクロ・ループの現象について,統計物理学的手法を用いて定式化を行った.具体的には,マルチエージェントシステムにおける個々のエージェント(自律的に行動する主体)を統計物理におけるミクロな粒子,またエージェント同士の相互作用により生じる秩序を統計物理におけるマクロな現象と捉える. 2.上記の枠組みの下で,マルチエージェントシステムにおける協調行動の創発を目的とした新しい大脳情報処理模倣型強化学習システムを提案した.提案システムは,他エージェントの状態予測機能と階層とモジュールの自律構成機能を併せ持ち,外部環境の変化に柔軟に適応する能力を持っている.各モジュールは,次状態の予測を行う状態予測器と行動決定を行う行動制御器を内部に持つ.提案システムでは,ヒトの大脳における情報処理機構に倣い感覚器官から高次機能まで抽象レベルを階層状に表現し,また同一レベルにおいてモジュールを複数配置することで,状態空間の整理を行い,学習の効率化を図っている 3.計算機上の仮想環境におけるシミュレーションにより,提案法がマルチエージェント環境において,協調行動を創発していることを示した. 4.次年度以降に行う自律移動ロボットを用いた現実環境における創発現象(協調行動の創発)の実現のための,ハードウェア環境の構築を行った.

Research Products
(23 results)

All 2009 2008

All Journal Article (10 results) (of which Peer Reviewed: 10 results) Presentation (13 results)

[Journal Article] 状態予測型強化学習システム2008
- Author(s)
  小林邦和, 他
- Journal Title
  
  電気学会論文誌 128-C
  
  Pages: 1303-1311
- Peer Reviewed
[Journal Article] 一時的滞留機能を持つ過渡的カオス連想記憶モデル2008
- Author(s)
  大林正直, 他
- Journal Title
  
  電気学会論文誌 128-C
  
  Pages: 1852-1858
- Peer Reviewed
[Journal Article] A Dynamic Associative Memory System Adopting Amygdala Model2008
- Author(s)
  T. Kuremoto, et al.
- Journal Title
  
  Artificial Life and Robotics 13
  
  Pages: 478-482
- Peer Reviewed
[Journal Article] A Robust Reinforcement Learning Using Concept of Sliding Mode Control2008
- Author(s)
  M. Obayashi, et al.
- Journal Title
  
  Artificial Life and Robotics 13
  
  Pages: 526-530
- Peer Reviewed
[Journal Article] A Reinforcement Learning Method Based on Immune Network Adapted to Semi Markov Decision Process2008
- Author(s)
  N. Kogawa, et al.
- Journal Title
  
  Artificial Life and Robotics 13
  
  Pages: 538-542
- Peer Reviewed
[Journal Article] A Reinforcement Learning System for Swarm Behaviors2008
- Author(s)
  T. Kuremoto, et al.
- Journal Title
  
  Proceedings of the 2008 IEEE World Congress on Computational Intelligence
  
  Pages: 3710-3715
- Peer Reviewed
[Journal Article] A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space2008
- Author(s)
  T. Kuremoto, et al.
- Journal Title
  
  Lecture Notes in Computer Science 5227
  
  Pages: 675-683
- Peer Reviewed
[Journal Article] A Reinforcement Learning System with Chaotic Neural Networks-Based Adaptive Hierarchical Memory Structure for Autonomous Robots2008
- Author(s)
  M. Obayashi, et al.
- Journal Title
  
  Proceedings of International Conference on Control, Automation and Systems
  
  Pages: 69-74
- Peer Reviewed
[Journal Article] A Bayesian Local Linear Wavelet Neural Network2008
- Author(s)
  K. Kobayashi, et al.
- Journal Title
  
  Proceedings of International Conference on Neural Information Processing
  
  Pages: 113-114
- Peer Reviewed
[Journal Article] A Self-Organized Fuzzy-Neuro Reinforcement Learning System for Continuous State Space for Autonomous Robots2008
- Author(s)
  M. Obayashi, et al.
- Journal Title
  
  Proceedings of International Conference on Computational Intelligence for Modelling, Control and Automation
  
  Pages: 69-74
- Peer Reviewed
[Presentation] 大脳辺縁系モデルの構築2009
- Author(s)
  呉本尭, 他
- Organizer
  第21回自律分散システム・シンポジウム
- Place of Presentation
  とりぎん文化会舘(鳥取市)
- Year and Date
  2009-01-22
[Presentation] Transient-SOMの改良及び音声命令学習への応用2009
- Author(s)
  古本隆人, 他
- Organizer
  第21回自律分散システム・シンポジウム
- Place of Presentation
  とりぎん文化会舘(鳥取市)
- Year and Date
  2009-01-22
[Presentation] 階差時系列を用いたニューラルネットワークによる時系列予測2009
- Author(s)
  波多聡, 他
- Organizer
  第21回自律分散システム・シンポジウム
- Place of Presentation
  とりぎん文化会舘(鳥取市)
- Year and Date
  2009-01-22
[Presentation] 学習分類子システムを用いた強化学習(XCS QT)の性能評価2008
- Author(s)
  尾崎智香, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] 自律移動ロボットによるマップ作成とゴール探索2008
- Author(s)
  河村佳代子, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] アントコロニー最適化法におけるランダム選択率設定方法2008
- Author(s)
  永田昌彦, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] ランダムタイリングを用いたモジュール型強化学習2008
- Author(s)
  松井裕之, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] TD誤差を用いた強化学習のメタパラメータ学習法2008
- Author(s)
  溝上裕之, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] 不完全観測環境下におけるスライディングモード制御の概念を利用した強化学習システム2008
- Author(s)
  山田勝巳, 他
- Organizer
  第17回計測自動制御学会中国支部学術講演会
- Place of Presentation
  広島大学(東広島市)
- Year and Date
  2008-11-15
[Presentation] 局所線形ウェーブレットニューラルネットワークのベイズ的設計法2008
- Author(s)
  小林邦和, 他
- Organizer
  第18回日本神経回路学会合同大会
- Place of Presentation
  産業技術総合研究所(つくば市)
- Year and Date
  2008-09-25
[Presentation] アントコロニー最適化法における上位ランク限定ランダム選択方式2008
- Author(s)
  永田昌彦, 他
- Organizer
  電子・情報・システム部門大会
- Place of Presentation
  はこだて未来大学(函館市)
- Year and Date
  2008-08-21
[Presentation] 局所線形モデルを導入したウェーブレッドニューラルネットワークのベイズ的設計法2008
- Author(s)
  小林邦和, 他
- Organizer
  電子・情報・システム部門大会
- Place of Presentation
  はこだて未来大学(函館市)
- Year and Date
  2008-08-20
[Presentation] TD誤差に基づく強化学習のメタパラメータ学習法2008
- Author(s)
  溝上裕之, 他
- Organizer
  電子・情報・システム部門大会
- Place of Presentation
  はこだて未来大学(函館市)
- Year and Date
  2008-08-20

2008 Fiscal Year Annual Research Report

大脳情報処理模倣型強化学習システム

Principal Investigator

小林 邦和 Yamaguchi University, 大学院・理工学研究科, 助教 (40263793)

Research Products

[Journal Article] 状態予測型強化学習システム2008

Author(s)

Journal Title

[Journal Article] 一時的滞留機能を持つ過渡的カオス連想記憶モデル2008

Author(s)

Journal Title

[Journal Article] A Dynamic Associative Memory System Adopting Amygdala Model2008

Author(s)

Journal Title

[Journal Article] A Robust Reinforcement Learning Using Concept of Sliding Mode Control2008

Author(s)

Journal Title

[Journal Article] A Reinforcement Learning Method Based on Immune Network Adapted to Semi Markov Decision Process2008

Author(s)

Journal Title

[Journal Article] A Reinforcement Learning System for Swarm Behaviors2008

Author(s)

Journal Title

[Journal Article] A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space2008

Author(s)

Journal Title

[Journal Article] A Reinforcement Learning System with Chaotic Neural Networks-Based Adaptive Hierarchical Memory Structure for Autonomous Robots2008

Author(s)

Journal Title

[Journal Article] A Bayesian Local Linear Wavelet Neural Network2008

Author(s)

Journal Title

[Journal Article] A Self-Organized Fuzzy-Neuro Reinforcement Learning System for Continuous State Space for Autonomous Robots2008

Author(s)

Journal Title

[Presentation] 大脳辺縁系モデルの構築2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Transient-SOMの改良及び音声命令学習への応用2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 階差時系列を用いたニューラルネットワークによる時系列予測2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 学習分類子システムを用いた強化学習(XCS QT)の性能評価2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 自律移動ロボットによるマップ作成とゴール探索2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] アントコロニー最適化法におけるランダム選択率設定方法2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ランダムタイリングを用いたモジュール型強化学習2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] TD誤差を用いた強化学習のメタパラメータ学習法2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 不完全観測環境下におけるスライディングモード制御の概念を利用した強化学習システム2008

Author(s)

Organizer

Place of Presentation

Year and Date

小林邦和 Yamaguchi University, 大学院・理工学研究科, 助教 (40263793)