2002 Fiscal Year Annual Research Report

Research on the Formation of Body Dynamics and Pattern Generation in Motor and Behavioral Learning

Research Project

Project/Area Number	14350227
Research Category	Grant-in-Aid for Scientific Research (B)
Research Institution	Tokyo Institute of Technology
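Principal Investigator	伊藤宏司 (Koji Ito), Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (30023310)
Co-Investigator(Kenkyū-buntansha)	近藤敏之 (Toshiyuki Kondo), Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Research Associate (60323820); 柴田克成 (Katsunari Shibata), Oita University, Faculty of Engineering, Associate Professor (10260522); 松野文俊 (Fumitoshi Matsuno), Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Associate Professor (00190489)
Keywords	Motor learning / Body dynamics / Biological systems / Robotics / Self-organization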
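Research Abstract	[1] Formation of internal dynamics based on dynamic relationships, and generation of body movement patterns: The spatiotemporal movement patterns of each body part must be generated internally, in a feedforward manner, from the dynamic relationship with the target object. Prediction of environmental change and adjustment of body dynamics play an important role here. The key question is how to control, as learning progresses, the balance between constraining the state space in the early stage of learning and predicting the state with an internal model (whose error is large early in learning). This fiscal year, taking a hint from the muscle co-activation seen in the early stage of human motor learning (for example, stiffening the arms and legs), we developed a learning algorithm that simultaneously acquires constraints on the joint degrees of freedom and the environmental dynamics through adjustment of body dynamics, together with an internal model based on those constraints.
[2] Acquisition of intelligent behavior adapted to environmental change: We focus on a mobile robot as a controlled system with high-dimensional, continuous state inputs and outputs, and apply reinforcement learning to identify its sensor-to-action mapping. As one way to solve the resulting problem of allocating computational resources, we propose an evolutionary recruitment strategy that simultaneously performs a local search over the structural parameters (the centers and standard deviations of the RBFs) of a learner implemented as an NGnet. This fiscal year we implemented the proposed strategy on a computer and applied it to an online function approximation task in which the shape of the target function changes dynamically at fixed time intervals, confirming its effectiveness. We also applied the method to a behavior acquisition problem for a mobile robot. Models of the robot and a cylindrical object to be transported (hereafter called the Peg) were implemented in a computer simulator, and the robot learned a pushing behavior from various initial poses. The robot's sensor-to-action mapping was implemented with the NGnet described above, and its structural parameters were adjusted online by the proposed method. The simulation results confirmed that, because the structural parameters of the basis functions making up the learner are tuned progressively as learning proceeds, the proposed method can build the controller from fewer basis functions than conventional methods, in which the designer must specify in advance the size of the basis functions to be added. Furthermore, a verification experiment on the Peg-pushing task with a real robot confirmed that the robot can learn the desired task after a little under one hour of trials.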
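
The idea in [2] of a normalized Gaussian network (NGnet) whose RBF centers and standard deviations are tuned by local search can be illustrated with a minimal sketch. This is not the authors' evolutionary recruitment strategy: the constant per-unit weights, the accept-if-not-worse perturbation rule, the toy sine target, and every function and parameter name below are assumptions made only for illustration.

```python
import math
import random


def ngnet_activations(x, centers, sigmas):
    """Gaussian response of each unit, normalized so the responses sum to 1."""
    g = []
    for c, s in zip(centers, sigmas):
        d2 = sum(((xi - ci) / si) ** 2 for xi, ci, si in zip(x, c, s))
        g.append(math.exp(-0.5 * d2))
    total = sum(g)
    if total == 0.0:  # every unit underflowed: fall back to uniform weights
        return [1.0 / len(g)] * len(g)
    return [gi / total for gi in g]


def ngnet_output(x, centers, sigmas, weights):
    """Output as an activation-weighted sum of constant per-unit weights (a simplification)."""
    a = ngnet_activations(x, centers, sigmas)
    return sum(ai * wi for ai, wi in zip(a, weights))


def mse(samples, centers, sigmas, weights):
    """Mean squared error over (input, target) pairs."""
    errs = [(ngnet_output(x, centers, sigmas, weights) - y) ** 2 for x, y in samples]
    return sum(errs) / len(errs)


def local_search_step(samples, centers, sigmas, weights, rng, step=0.1):
    """Perturb one unit's center and width; keep the change only if the error does not grow."""
    i = rng.randrange(len(centers))
    new_centers = [list(c) for c in centers]
    new_sigmas = [list(s) for s in sigmas]
    for d in range(len(new_centers[i])):
        new_centers[i][d] += rng.gauss(0.0, step)
        # multiplicative noise keeps the standard deviation positive
        new_sigmas[i][d] = max(1e-3, new_sigmas[i][d] * math.exp(rng.gauss(0.0, step)))
    old_err = mse(samples, centers, sigmas, weights)
    new_err = mse(samples, new_centers, new_sigmas, weights)
    if new_err <= old_err:
        return new_centers, new_sigmas
    return centers, sigmas


# Toy 1-D target (hypothetical): half a sine wave sampled on [0, 1].
rng = random.Random(0)
samples = [((k / 10.0,), math.sin(math.pi * k / 10.0)) for k in range(11)]
centers = [[0.2], [0.8]]
sigmas = [[0.3], [0.3]]
weights = [1.0, 0.0]

before = mse(samples, centers, sigmas, weights)
for _ in range(200):
    centers, sigmas = local_search_step(samples, centers, sigmas, weights, rng)
after = mse(samples, centers, sigmas, weights)
```

Because a perturbation is kept only when the error does not increase, `after` can never exceed `before`. The number of basis functions stays fixed in this sketch, whereas the strategy described in the abstract also decides when to recruit new ones.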

Research Products
(6 results)

All Publications (6 results)

[Publications] 柴田克成, 上田雅英, 伊藤宏司: "A Model of the Emergence and Differentiation of Individuality and Sociality through Reinforcement Learning" (in Japanese) Transactions of the Society of Instrument and Control Engineers. 39 (to appear). (2003)
[Publications] Jun Izawa, Toshiyuki Kondo, Koji Ito: "Biological Robot Arm Motion through Reinforcement Learning" Proceedings of International Conference on Robotics and Automation (ICRA). (CD-ROM). WAII-11-3 (2002)
[Publications] Yoshiyuki Wakamatsu, Toshiyuki Kondo, Koji Ito: "A Computational Emotion Model based on the Prosodic Component of Speech Sounds" Proceedings of International Conference on Robotics and Automation (ICRA). (CD-ROM). WP-5II-7 (2002)
[Publications] Mikiko Hori, Toshiyuki Kondo, Koji Ito: "EMG controlled manipulation with variable viscoelastic characteristics" Proceedings of 2002 IEEE International Conference on Systems, Man and Cybernetics (SMC'02). (CD-ROM). MP2J2 (2002)
[Publications] Toshiyuki Kondo, Koji Ito: "A Reinforcement Learning with Adaptive State Space Recruitment Strategy for Real Autonomous Mobile Robots" Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'02). (CD-ROM). 393 (2002)
[Publications] Edwardo A.Y.Murakami, T.Yamada, T.Kondo, K.Ito: "Performance Evaluation of Bilateral Micro-Teleoperation Systems based on Man-Machine Dynamic Characteristics" Proceedings of SICE Annual Conference 2002. (CD-ROM). (2002)
