2016 Fiscal Year Research-status Report

強化学習を用いたサイバーフィジカルシステムのフレキシブルな開発技術

Research Project

Project/Area Number	16K06424
Research Institution	Osaka Prefecture University
Principal Investigator	松本啓之亮大阪府立大学, 工学(系)研究科(研究院), 教授 (90285304)
Project Period (FY)	2016-04-01 – 2019-03-31
Keywords	サイバーフィジカルシステム / 機械学習 / エージェント
Outline of Annual Research Achievements	サイバーフィジカルシステムをその構成単位ごとに自律的に行動するマルチエージェントシステムとしてモデル化した．各エージェントの構造は知的判断部，基本機能部，ネットワーク通信関連部からなる．ネットワーク通信によりプロトコルを介して結ばれるマルチエージェントシステムについて基本機能を定義し，その機能を実装した．定義された作業が協調をとりながら実行されることを確認するためプロトコルに準拠したテストを実施した．実フィールド環境では，不確実性や計測不能な未知のパラメータが存在するため，タスクの達成方法やゴールへの到達方法を事前にあらゆる場合を想定し，あらかじめ設定することは非常に困難となる．このため本研究では試行錯誤を通して環境に適応する学習制御の枠組みである強化学習を採用した．適用例として追跡問題に強化学習を適用した．すべてのハンタは獲物を捕まえるという共通の目的を持ち, ハンタがとれる行動はどのハンタも同じである．そのような環境において，ハンタが獲物を捕まえる際，各ハンタの適した行動は一致するものがあり, 他のハンタの行動を学習することにより，少ない試行回数で適した行動を学習できると考えられる．そこで, 本研究では他のハンタの行動履歴をもとに自身のQ値を更新する手法を考案した．実験の結果，提案手法は学習は早くなっているが, 最終的な学習結果は行動履歴を共有しない手法と比べて悪くなる傾向がある．提案手法は学習終盤に他ハンタの行動を学習したことで学習精度が劣化していると考えられる．このため，他のハンタの行動履歴を利用して学習する際の学習率をエピソード数に応じて減少させ, 学習が進むにつれて他のハンタの行動履歴による学習への影響を少なくする．これにより, 学習初期は他のハンタの行動履歴を活用し, 学習が進むと自分の履歴のみを利用した学習に近づくこととなる．
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 本研究のメインテーマの一つである強化学習アルゴリズムの中核部分について先行的に検討・実施し，実現の見通しが得られた．
Strategy for Future Research Activity	システムのモデル化と強化学習アルゴリズムの中核部分については見通しが得られたので，これらを基礎にして通信するための分散型システムアーキテクチャの設計や各エージェントを効率よく協調させサイバーフィジカルシステムをフレキシブルに開発するためのエージェントの知的判断部のモデル駆動開発による自動生成を目指す．
Causes of Carryover	高機能なコンピュータハードウェアが新年度以降に販売されることになったので，購入時期を遅らせたため．
Expenditure Plan for Carryover Budget	ネットワーク上で実用可能性を検証できる程度の規模をもつプロトタイプシステムを構築するため，サーバマシン，クライアントマシンおよびネットワーク部品等を購入する．

Research Products
(8 results)

All 2016

All Journal Article (3 results) (of which Peer Reviewed: 3 results, Acknowledgement Compliant: 1 results) Presentation (5 results) (of which Int'l Joint Research: 1 results)

[Journal Article] 劣個体分布に基づく DII analysis の提案と応用2016
- Author(s)
  長谷川拓，井上和之．荒木悠太，森直樹，松本啓之亮
- Journal Title
  
  進化計算学会論文誌
  
  Volume: Vol. 7 Pages: 13-23
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Analysis of Parameter-less Population Pyramid on the Local Distribution of Inferior Individuals2016
- Author(s)
  T. Hasegawa, Y. Araki, N. Mori and K. Matsumoto
- Journal Title
  
  Intelligent and Evolutionary Systems - Adaptation, Learning and Optimization
  
  Volume: 8 Pages: 149-164
- Peer Reviewed
[Journal Article] CMA-ES with Surrogate Model Adapting to Fitness Landscape2016
- Author(s)
  K. Tsukada, T. Hasegawa, N. Mori and K. Matsumoto
- Journal Title
  
  Intelligent and Evolutionary Systems - Adaptation, Learning and Optimization
  
  Volume: 8 Pages: 417-429
- Peer Reviewed
[Presentation] Learning Method by Sharing Activity Logs in Multiagent Environment2016
- Author(s)
  K. Matsumoto, T. Gohara, and N. Mori
- Organizer
  10th International Conference on Advanced Engineering Computing and Applications in Sciences
- Place of Presentation
  Venice, Italy
- Year and Date
  2016-10-09 – 2016-10-13
- Int'l Joint Research
[Presentation] 株式市場における人工市場と現実市場の類似度指標についての考察2016
- Author(s)
  住田和也，松本啓之亮，森直樹
- Organizer
  電気学会電子・情報・システム部門大会
- Place of Presentation
  神戸大学 (兵庫県神戸市)
- Year and Date
  2016-08-31 – 2016-09-03
[Presentation] アクティビティ図の再利用のための検索法2016
- Author(s)
  丸本晃大，松本啓之亮，森直樹
- Organizer
  第60回システム制御情報学会研究発表講演会
- Place of Presentation
  京都テルサ (京都府京都市)
- Year and Date
  2016-05-25 – 2016-05-27
[Presentation] 単語と画像の印象的相関関係に基づく顔画像生成システム2016
- Author(s)
  山本元気，松本啓之亮，森直樹
- Organizer
  第60回システム制御情報学会研究発表講演会
- Place of Presentation
  京都テルサ (京都府京都市)
- Year and Date
  2016-05-25 – 2016-05-27
[Presentation] 株価データの解析におけるDeep Learningの導入2016
- Author(s)
  住田和也，松本啓之亮，森直樹
- Organizer
  第60回システム制御情報学会研究発表講演会
- Place of Presentation
  京都テルサ (京都府京都市)
- Year and Date
  2016-05-25 – 2016-05-27

2016 Fiscal Year Research-status Report

強化学習を用いたサイバーフィジカルシステムのフレキシブルな開発技術

Principal Investigator

松本 啓之亮 大阪府立大学, 工学(系)研究科(研究院), 教授 (90285304)

Current Status of Research Progress

Reason

Research Products

[Journal Article] 劣個体分布に基づく DII analysis の提案と応用2016

Author(s)

Journal Title

[Journal Article] Analysis of Parameter-less Population Pyramid on the Local Distribution of Inferior Individuals2016

Author(s)

Journal Title

[Journal Article] CMA-ES with Surrogate Model Adapting to Fitness Landscape2016

Author(s)

Journal Title

[Presentation] Learning Method by Sharing Activity Logs in Multiagent Environment2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 株式市場における人工市場と現実市場の類似度指標についての考察2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] アクティビティ図の再利用のための検索法2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 単語と画像の印象的相関関係に基づく顔画像生成システム2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 株価データの解析におけるDeep Learningの導入2016

Author(s)

Organizer

Place of Presentation

Year and Date

松本啓之亮大阪府立大学, 工学(系)研究科(研究院), 教授 (90285304)