2007 Fiscal Year Annual Research Report

Research on Robust, Fast, and General-Purpose Reinforcement Learning Algorithms for Realizing Distributed Intelligence

Research Project

Project/Area Number	07J07695
Research Institution	Yokohama National University
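Principal Investigator	Takeshi Shibuya  Yokohama National University, Graduate School of Engineering, Research Fellow (DC1)
Keywords	Reinforcement learning / Curse of dimensionality / State space / Perceptual aliasing problem
Research Abstract	This fiscal year, two studies on reinforcement learning were carried out. The first concerns a self-organizing state-space construction method for robots equipped with range sensors. The second concerns methods for improving learning performance in complex-valued reinforcement learning. The self-organizing state-space method for robots with range sensors was devised to avoid the problem in reinforcement learning known as the "curse of dimensionality," in which learning stalls as the dimensionality or resolution of the sensors increases. To address this problem, we proposed a method that converts the data obtained by range measurement into a two-dimensional image and classifies the resulting images with a self-organizing map. Simulation experiments showed that the proposed method can classify the states of a vast state space into a number of states small enough for practical use. The methods for improving learning performance in complex-valued reinforcement learning were devised for learning in environments where the "perceptual aliasing problem" arises, in which learning stalls because the dimensionality or resolution of the sensors is insufficient. We have previously proposed complex-valued reinforcement learning for learning in such environments. This fiscal year, within that framework, we examined the adjustment of eligibility traces and the multiplexing of action values. For both the adjustment of eligibility traces and the multiplexing of action values, the results showed that learning becomes possible on "more complex tasks."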
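The first method in the abstract (range data converted to a two-dimensional image, then classified by a self-organizing map) can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the report: the function name `scan_to_image`, the class `TinySOM`, the 16x16 image size, the 5.0 maximum range, the one-dimensional SOM lattice of 8 units, and the random synthetic scans. It shows only the general idea of mapping a high-dimensional scan to a small discrete state index.

```python
import numpy as np


def scan_to_image(distances, max_range=5.0, size=16):
    """Rasterize one 360-degree range scan into a size x size binary image.

    Hypothetical helper: beams are assumed evenly spaced in bearing.
    """
    distances = np.asarray(distances, dtype=float)
    angles = np.linspace(0.0, 2.0 * np.pi, len(distances), endpoint=False)
    hit = distances < max_range  # beams that actually detected an obstacle
    x = distances[hit] * np.cos(angles[hit])
    y = distances[hit] * np.sin(angles[hit])
    # Map robot-centred coordinates in [-max_range, max_range] to pixel indices.
    col = np.clip(((x + max_range) / (2 * max_range) * size).astype(int), 0, size - 1)
    row = np.clip(((y + max_range) / (2 * max_range) * size).astype(int), 0, size - 1)
    image = np.zeros((size, size))
    image[row, col] = 1.0
    return image


class TinySOM:
    """A small self-organizing map with units arranged on a 1-D lattice."""

    def __init__(self, n_units, dim, seed=0):
        rng = np.random.default_rng(seed)
        self.weights = rng.random((n_units, dim))

    def winner(self, x):
        """Return the index of the best-matching unit, used here as the state."""
        return int(np.argmin(np.linalg.norm(self.weights - x, axis=1)))

    def train(self, data, epochs=5, lr0=0.5):
        n_units = len(self.weights)
        units = np.arange(n_units)
        total = epochs * len(data)
        step = 0
        for _ in range(epochs):
            for x in data:
                frac = step / total
                lr = lr0 * (1.0 - frac)                       # decaying learning rate
                sigma = max(n_units / 2.0 * (1.0 - frac), 0.5)  # shrinking neighbourhood
                bmu = self.winner(x)
                h = np.exp(-((units - bmu) ** 2) / (2.0 * sigma ** 2))
                self.weights += lr * h[:, None] * (x - self.weights)
                step += 1


# Synthetic data: 50 scans of 36 beams each (purely illustrative).
rng = np.random.default_rng(1)
scans = rng.uniform(0.5, 6.0, size=(50, 36))
images = np.array([scan_to_image(s).ravel() for s in scans])

som = TinySOM(n_units=8, dim=images.shape[1])
som.train(images)
state = som.winner(images[0])  # discrete state index for the first scan
```

In this sketch the winning unit's index would serve as the state fed to a tabular reinforcement learner, so the number of states equals the number of SOM units regardless of how many beams the sensor has.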

Research Products

(6 results)

Journal Article (2 results; of which Peer Reviewed: 2 results) / Presentation (4 results)

[Journal Article] Q-learning Using Action Values Represented by Complex Numbers (accepted for publication) 2008
- Author(s)
  Takeshi Shibuya
- Journal Title
  IEICE Transactions D (Japanese Edition) Vol.91-D No.5
- Peer Reviewed
[Journal Article] A method of generalization of state space construction for multi robots with different sensor configurations (accepted for publication) 2008
- Author(s)
  Takeshi SHIBUYA
- Journal Title
  IEEJ Transactions of Electrical & Electronic Engineering 7
- Peer Reviewed
[Presentation] An Experimental Study on Multiplexing Action Values for Complex-Valued Reinforcement Learning 2008
- Author(s)
  Takeshi Shibuya
- Organizer
  Intelligent Systems Symposium
- Place of Presentation
  Tokyo
- Year and Date
  2008-03-17
[Presentation] Experimental Study of the Eligibility Traces in Complex Valued Reinforcement Learning 2007
- Author(s)
  Takeshi Shibuya
- Organizer
  IEEE International Conference on Systems, Man and Cybernetics
- Place of Presentation
  Montreal, Canada
- Year and Date
  2007-10-09
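[Presentation] On the Effect of Multiplexing Action Values in Complex-Valued Reinforcement Learning 2007
- Author(s)
  Takeshi Shibuya
- Organizer
  Forum on Information Technology
- Place of Presentation
  Toyota, Aichi Prefecture
- Year and Date
  2007-09-05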
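[Presentation] Swing-Up Control of the Acrobot Using Complex-Valued Reinforcement Learning 2007
- Author(s)
  Takeshi Shibuya
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence
- Place of Presentation
  Miyazaki, Miyazaki Prefecture
- Year and Date
  2007-06-22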
