ヒトの行動学習・発達規範の計算エネルギーコスト制約に基づく三層ロボット継続学習

研究課題

研究課題/領域番号	23K24926
補助金の研究課題番号	22H03670 (2022-2023)
研究種目	基盤研究(B)
配分区分	基金 (2024) 補助金 (2022-2023)
応募区分	一般
審査区分	小区分61050:知能ロボティクス関連
研究機関	大阪大学
研究代表者	OZTOP Erhan 大阪大学, 先導的学際研究機構, 特任教授(常勤) (90542217)
研究期間 (年度)	2022-04-01 – 2025-03-31
研究課題ステータス	交付 (2024年度)
配分額 *注記	14,950千円 (直接経費: 11,500千円、間接経費: 3,450千円) 2024年度: 4,160千円 (直接経費: 3,200千円、間接経費: 960千円) 2023年度: 4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円) 2022年度: 6,240千円 (直接経費: 4,800千円、間接経費: 1,440千円)
キーワード	計算エネルギーコスト / 継続的な行動学習 / 概念形成 / 内発的動機 / スキル移転 / Lifelong Robot Learning / Learning Progress / Knowledge Transfer / Multitask Learning / Symbol Formation / Interleaved Learning
研究開始時の研究の概要	生物にはエネルギーコストの制限があり,生涯を通じて効率的で有用な行動の学習と発達を可能にする．この制約は，ロボットにおいては，計算エネルギーコスト（CEC,computational energy cost）制約に対応すると考えられるが，ヒトと同様にこの制約がロボットの継続的な行動学習・発達にどのようにうまく機能するかが，本研究のテーマである．これを検証するために，ロボットの行動学習・発達機構として三層構造（エネルギーコスト最小化するニューラルネットワーク,　CEC基づいた内発的動機, 概念形成）を想定する．各層でこの制約に基づく計算手法をロボットに実装され,ヒトに類似した学習行動が生成する．
研究実績の概要	1) Different variations of multi-task learning model with bidirectional skill transfer is explored. One-way skill transfer from literature is generalized to bidirectional transfer, and how human-like learning can be realized via ‘interleaved learning’, for effective lifelong robot learning (LRL) is studied. In the current approaches, task order is specified, and tasks are learned to completion. Humans can switch tasks during learning and obtain skill transfer leverage. In LRL model, we realized such a mechanism and showed that it leads to effective learning. 2) A novel intrinsic motivation (IM) signal is proposed that combines computational energy cost (CEC) and learning progress (LP). Existing cognitive models disregards the cost of computation, yet the human brain must consider this. The work on CEC-aware task selection and network loss definition is tested on a new set of robotic tasks. The simulations has shown that a nice trade-off between learning accuracy and energy consumption is possible. 3) The LRL model is generalized to reinforcement learning (RL) domain. For doing so, a new LP signal is proposed, namely ‘expected total reward progress’, which is shown to facilitate effective learning when used as a signal for task selection in an interleaved manner. 4) Additional work on symbolic representation with attention layers is conducted. Also, interplay between CEC and robotic trust is explored with collaborators. Another direction explored is to consider prediction uncertainty as another IM signal that can be used by robots for lifelong learning.
現在までの達成度 (区分)	現在までの達成度 (区分) 3: やや遅れている理由 The planned work items for second year and their current status assessment is as follows: Integration of Neural Computational Cost (NCC): NCC is integrated into Lifelong Robot Learning (LRL) for ‘task selection’ and ‘neural network loss computation’. It took time to find a good balance between learning progress and NCC. Overall the NCC work can be considered on time. Integration of Symbol/Concept based Knowledge transfer: Work in this direction is conducted with collaborators and theoretical results are obtained; however, the integration of this knowledge in LRL architecture is delayed. More Complex Multi-task Learning Scenarios: We have switched to more complex task scenarios but are still in the action-effect prediction domain. The arbitrariness in error definition of learning tasks makes it difficult to combine very different tasks in a single task arbitration mechanism. This still is an open problem, and work is being conducted on this. Overall work towards task complexification can be considered on time. Incorporation of Reinforcement Learning (RL) Tasks: This direction has been established and a paper is submitted so it is on time.
今後の研究の推進方策	The third year of the project will focus on these research items: Integration of Symbol/Concept based Knowledge transfer :The existing know-how on symbol and concept formation will be integrated into the LRL model. Research will be conducted on how to represent knowledge in a resource economical way and how to efficiently access to that knowledge. Real Robot deployment: The perception and action primitives of the Torobo robot is tuned for the tasks used in the simulations. However, due to the complexities faced during modeling and task simulations, the real hardware experiments are left to the last year of the project. So, the goal is realize some of the simulated tasks on the real robot. Heterogeneous Multi-task Learning Scenarios: The current task scenarios are homogenous in that they are based on action-effect prediction learning. A new approach to address the arbitrariness in error definition of the tasks is needed. Effort will be spent on developing a heterogeneous multi-task learning framework with effective solutions.

報告書

(2件)

2023 実績報告書
2022 実績報告書

研究成果
(16件)

すべて 2024 2023 2022 その他

すべて国際共同研究 (3件) 雑誌論文 (6件) (うち国際共著 2件、査読あり 2件) 学会発表 (6件) (うち国際学会 6件) 学会・シンポジウム開催 (1件)

[国際共同研究] Bogazici University/Ozyegin University(トルコ)
- 関連する報告書
  2023 実績報告書
[国際共同研究] Tilburg University(オランダ)
- 関連する報告書
  2023 実績報告書
[国際共同研究] Bogazici University/Ozyegin University(トルコ)
- 関連する報告書
  2022 実績報告書
[雑誌論文] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning2024
- 著者名/発表者名
  Ada Suzan Ece、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 号: 4 ページ: 3116-3123
- DOI
  10.1109/lra.2024.3363530
- 関連する報告書
  2023 実績報告書
- 査読あり / 国際共著
[雑誌論文] Discovering Predictive Relational Object Symbols With Symbolic Attentive Layers2024
- 著者名/発表者名
  Ahmetoglu Alper、Celik Batuhan、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 号: 2 ページ: 1977-1984
- DOI
  10.1109/lra.2024.3350994
- 関連する報告書
  2023 実績報告書
- 査読あり / 国際共著
[雑誌論文] Correspondence Learning Between Morphologically Different Robots via Task Demonstrations2024
- 著者名/発表者名
  Aktas Hakan、Nagai Yukie、Asada Minoru、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 号: 5 ページ: 4463-4470
- DOI
  10.1109/lra.2024.3382534
- 関連する報告書
  2023 実績報告書
[雑誌論文] Trust in robot-robot scaffolding2023
- 著者名/発表者名
  Kirtay Murat、Hafner Verena V.、Asada Minoru、Oztop Erhan
- 雑誌名
  
  IEEE Transactions on Cognitive and Developmental Systems
  
  巻: 1 号: 4 ページ: 1-1
- DOI
  10.1109/tcds.2023.3235974
- 関連する報告書
  2022 実績報告書
[雑誌論文] DeepSym: Deep Symbol Generation and Rule Learning for Planning from Unsupervised Robot Interaction2022
- 著者名/発表者名
  Ahmetoglu Alper、Seker M. Yunus、Piater Justus、Oztop Erhan、Ugur Emre
- 雑誌名
  
  Journal of Artificial Intelligence Research
  
  巻: 75 ページ: 709-745
- DOI
  10.1613/jair.1.13754
- 関連する報告書
  2022 実績報告書
[雑誌論文] Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction2022
- 著者名/発表者名
  Kirtay Murat、Hafner Verena V.、Asada Minoru、Kuhlen Anna K.、Oztop Erhan
- 雑誌名
  
  IEEE Proceedings on Humanoids 2022, Ginowan, Japan
  
  巻: 1 ページ: 1-1
- DOI
  10.1109/humanoids53995.2022.10000205
- 関連する報告書
  2022 実績報告書
[学会発表] Human-in-the-Loop Training Leads to Faster Skill Acquisition and Adaptation in Reinforcement Learning-based Robot Control2024
- 著者名/発表者名
  Yilmaz D, Ugurlu B, Oztop E
- 学会等名
  18th IEEE International Conference on Advanced Motion Control (AMC2024), Kyoto, Japan
- 関連する報告書
  2023 実績報告書
- 国際学会
[学会発表] A Model for Cognitively Valid Lifelong Learning2023
- 著者名/発表者名
  Say H, Oztop E
- 学会等名
  IEEE International Conference on Robotics and Biomimetics (ROBIO 2023), Koh Samui, Thailand
- 関連する報告書
  2023 実績報告書
- 国際学会
[学会発表] Developmental Scaffolding with Large Language Models2023
- 著者名/発表者名
  Celik MB, Ahmetoglu A, Ugur E, Oztop E
- 学会等名
  23rd IEEE International Conference on Development and Learning (ICDL 2023), Macau, China
- 関連する報告書
  2023 実績報告書
- 国際学会
[学会発表] Interplay between neural computational energy and multimodal processing in robot-robot interaction2023
- 著者名/発表者名
  Kirtay M, Hafner VV, Asada M, Oztop E
- 学会等名
  23rd IEEE International Conference on Development and Learning (ICDL 2023), Macau, China
- 関連する報告書
  2023 実績報告書
- 国際学会
[学会発表] Context Based Echo State Networks for Robot Movement Primitives2023
- 著者名/発表者名
  Amirshirzad N, Asada M, Oztop E
- 学会等名
  32nd IEEE International Conference on Robot & Human Interactive Communication (RO-MAN) Busan, South Korea
- 関連する報告書
  2023 実績報告書
- 国際学会
[学会発表] Bimanual rope manipulation skill synthesis through context dependent correction policy learning from human demonstration2023
- 著者名/発表者名
  Akbulut Baturhan, Tuba Girgin, Mehrabi Arash, Ugur Emre, Oztop Erhan
- 学会等名
  IEEE International Conference on Robotics and Automation (ICRA2023)
- 関連する報告書
  2022 実績報告書
- 国際学会
[学会・シンポジウム開催] IEEE IROS 2022 Workshop on “Lifelong Learning of High-level Cognitive and Reasoning Skills” (https://lifelongrobotics.github.io/)2022
- 関連する報告書
  2022 実績報告書

ヒトの行動学習・発達規範の計算エネルギーコスト制約に基づく三層ロボット継続学習

研究代表者

OZTOP Erhan 大阪大学, 先導的学際研究機構, 特任教授(常勤) (90542217)

14,950千円 (直接経費: 11,500千円、間接経費: 3,450千円)

現在までの達成度 (区分)

理由

報告書

研究成果

[国際共同研究] Bogazici University/Ozyegin University(トルコ)

関連する報告書

[国際共同研究] Tilburg University(オランダ)

関連する報告書

[国際共同研究] Bogazici University/Ozyegin University(トルコ)

関連する報告書

[雑誌論文] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning2024

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Discovering Predictive Relational Object Symbols With Symbolic Attentive Layers2024

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Correspondence Learning Between Morphologically Different Robots via Task Demonstrations2024

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Trust in robot-robot scaffolding2023

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] DeepSym: Deep Symbol Generation and Rule Learning for Planning from Unsupervised Robot Interaction2022

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction2022

著者名/発表者名

雑誌名

DOI

関連する報告書

[学会発表] Human-in-the-Loop Training Leads to Faster Skill Acquisition and Adaptation in Reinforcement Learning-based Robot Control2024

著者名/発表者名

学会等名

関連する報告書

[学会発表] A Model for Cognitively Valid Lifelong Learning2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Developmental Scaffolding with Large Language Models2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Interplay between neural computational energy and multimodal processing in robot-robot interaction2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Context Based Echo State Networks for Robot Movement Primitives2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Bimanual rope manipulation skill synthesis through context dependent correction policy learning from human demonstration2023

著者名/発表者名

学会等名

関連する報告書

[学会・シンポジウム開催] IEEE IROS 2022 Workshop on “Lifelong Learning of High-level Cognitive and Reasoning Skills” (https://lifelongrobotics.github.io/)2022

関連する報告書