2023 年度実績報告書

ヒトの行動学習・発達規範の計算エネルギーコスト制約に基づく三層ロボット継続学習

研究課題

研究課題/領域番号	22H03670
配分区分	補助金
研究機関	大阪大学
研究代表者	OZTOP Erhan 大阪大学, 先導的学際研究機構, 特任教授(常勤) (90542217)
研究期間 (年度)	2022-04-01 – 2025-03-31
キーワード	Lifelong Robot Learning / Learning Progress / Knowledge Transfer / Multitask Learning / Symbol Formation / Interleaved Learning
研究実績の概要	1) Different variations of multi-task learning model with bidirectional skill transfer is explored. One-way skill transfer from literature is generalized to bidirectional transfer, and how human-like learning can be realized via ‘interleaved learning’, for effective lifelong robot learning (LRL) is studied. In the current approaches, task order is specified, and tasks are learned to completion. Humans can switch tasks during learning and obtain skill transfer leverage. In LRL model, we realized such a mechanism and showed that it leads to effective learning. 2) A novel intrinsic motivation (IM) signal is proposed that combines computational energy cost (CEC) and learning progress (LP). Existing cognitive models disregards the cost of computation, yet the human brain must consider this. The work on CEC-aware task selection and network loss definition is tested on a new set of robotic tasks. The simulations has shown that a nice trade-off between learning accuracy and energy consumption is possible. 3) The LRL model is generalized to reinforcement learning (RL) domain. For doing so, a new LP signal is proposed, namely ‘expected total reward progress’, which is shown to facilitate effective learning when used as a signal for task selection in an interleaved manner. 4) Additional work on symbolic representation with attention layers is conducted. Also, interplay between CEC and robotic trust is explored with collaborators. Another direction explored is to consider prediction uncertainty as another IM signal that can be used by robots for lifelong learning.
現在までの達成度 (区分)	現在までの達成度 (区分) 3: やや遅れている理由 The planned work items for second year and their current status assessment is as follows: Integration of Neural Computational Cost (NCC): NCC is integrated into Lifelong Robot Learning (LRL) for ‘task selection’ and ‘neural network loss computation’. It took time to find a good balance between learning progress and NCC. Overall the NCC work can be considered on time. Integration of Symbol/Concept based Knowledge transfer: Work in this direction is conducted with collaborators and theoretical results are obtained; however, the integration of this knowledge in LRL architecture is delayed. More Complex Multi-task Learning Scenarios: We have switched to more complex task scenarios but are still in the action-effect prediction domain. The arbitrariness in error definition of learning tasks makes it difficult to combine very different tasks in a single task arbitration mechanism. This still is an open problem, and work is being conducted on this. Overall work towards task complexification can be considered on time. Incorporation of Reinforcement Learning (RL) Tasks: This direction has been established and a paper is submitted so it is on time.
今後の研究の推進方策	The third year of the project will focus on these research items: Integration of Symbol/Concept based Knowledge transfer :The existing know-how on symbol and concept formation will be integrated into the LRL model. Research will be conducted on how to represent knowledge in a resource economical way and how to efficiently access to that knowledge. Real Robot deployment: The perception and action primitives of the Torobo robot is tuned for the tasks used in the simulations. However, due to the complexities faced during modeling and task simulations, the real hardware experiments are left to the last year of the project. So, the goal is realize some of the simulated tasks on the real robot. Heterogeneous Multi-task Learning Scenarios: The current task scenarios are homogenous in that they are based on action-effect prediction learning. A new approach to address the arbitrariness in error definition of the tasks is needed. Effort will be spent on developing a heterogeneous multi-task learning framework with effective solutions.

研究成果
(10件)

すべて 2024 2023 その他

すべて国際共同研究 (2件) 雑誌論文 (3件) (うち国際共著 2件、査読あり 2件) 学会発表 (5件) (うち国際学会 5件)

[国際共同研究] Bogazici University/Ozyegin University(トルコ)
- 国名
  トルコ
- 外国機関名
  Bogazici University/Ozyegin University
[国際共同研究] Tilburg University(オランダ)
- 国名
  オランダ
- 外国機関名
  Tilburg University
[雑誌論文] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning2024
- 著者名/発表者名
  Ada Suzan Ece、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 ページ: 3116～3123
- DOI
  10.1109/LRA.2024.3363530
- 査読あり / 国際共著
[雑誌論文] Discovering Predictive Relational Object Symbols With Symbolic Attentive Layers2024
- 著者名/発表者名
  Ahmetoglu Alper、Celik Batuhan、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 ページ: 1977～1984
- DOI
  10.1109/LRA.2024.3350994
- 査読あり / 国際共著
[雑誌論文] Correspondence Learning Between Morphologically Different Robots via Task Demonstrations2024
- 著者名/発表者名
  Aktas Hakan、Nagai Yukie、Asada Minoru、Oztop Erhan、Ugur Emre
- 雑誌名
  
  IEEE Robotics and Automation Letters
  
  巻: 9 ページ: 4463～4470
- DOI
  10.1109/LRA.2024.3382534
[学会発表] Human-in-the-Loop Training Leads to Faster Skill Acquisition and Adaptation in Reinforcement Learning-based Robot Control2024
- 著者名/発表者名
  Yilmaz D, Ugurlu B, Oztop E
- 学会等名
  18th IEEE International Conference on Advanced Motion Control (AMC2024), Kyoto, Japan
- 国際学会
[学会発表] A Model for Cognitively Valid Lifelong Learning2023
- 著者名/発表者名
  Say H, Oztop E
- 学会等名
  IEEE International Conference on Robotics and Biomimetics (ROBIO 2023), Koh Samui, Thailand
- 国際学会
[学会発表] Developmental Scaffolding with Large Language Models2023
- 著者名/発表者名
  Celik MB, Ahmetoglu A, Ugur E, Oztop E
- 学会等名
  23rd IEEE International Conference on Development and Learning (ICDL 2023), Macau, China
- 国際学会
[学会発表] Interplay between neural computational energy and multimodal processing in robot-robot interaction2023
- 著者名/発表者名
  Kirtay M, Hafner VV, Asada M, Oztop E
- 学会等名
  23rd IEEE International Conference on Development and Learning (ICDL 2023), Macau, China
- 国際学会
[学会発表] Context Based Echo State Networks for Robot Movement Primitives2023
- 著者名/発表者名
  Amirshirzad N, Asada M, Oztop E
- 学会等名
  32nd IEEE International Conference on Robot & Human Interactive Communication (RO-MAN) Busan, South Korea
- 国際学会

2023 年度 実績報告書

ヒトの行動学習・発達規範の計算エネルギーコスト制約に基づく三層ロボット継続学習

研究代表者

OZTOP Erhan 大阪大学, 先導的学際研究機構, 特任教授(常勤) (90542217)

現在までの達成度 (区分)

理由

研究成果

[国際共同研究] Bogazici University/Ozyegin University(トルコ)

国名

外国機関名

[国際共同研究] Tilburg University(オランダ)

国名

外国機関名

[雑誌論文] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning2024

著者名/発表者名

雑誌名

DOI

[雑誌論文] Discovering Predictive Relational Object Symbols With Symbolic Attentive Layers2024

著者名/発表者名

雑誌名

DOI

[雑誌論文] Correspondence Learning Between Morphologically Different Robots via Task Demonstrations2024

著者名/発表者名

雑誌名

DOI

[学会発表] Human-in-the-Loop Training Leads to Faster Skill Acquisition and Adaptation in Reinforcement Learning-based Robot Control2024

著者名/発表者名

学会等名

[学会発表] A Model for Cognitively Valid Lifelong Learning2023

著者名/発表者名

学会等名

[学会発表] Developmental Scaffolding with Large Language Models2023

著者名/発表者名

学会等名

[学会発表] Interplay between neural computational energy and multimodal processing in robot-robot interaction2023

著者名/発表者名

学会等名

[学会発表] Context Based Echo State Networks for Robot Movement Primitives2023

著者名/発表者名

学会等名

2023 年度実績報告書