2020 Fiscal Year Annual Research Report

Natural reinforcement learning integrating intrinsic motivation and sociality

Research Project

Project/Area Number	20H04259
Research Institution	Tokyo Denki University
Principal Investigator	高橋達二東京電機大学, 理工学部, 教授 (00514514)
Co-Investigator(Kenkyū-buntansha)	太田宏之防衛医科大学校(医学教育部医学科進学課程及び専門課程、動物実験施設、共同利用研究施設、病院並びに防衛, 薬理学, 講師 (20535190) 浦上大輔日本大学, 生産工学部, 准教授 (40458196) 甲野佑東京電機大学, 理工学部, 講師 (10870313) 大用庫智関西学院大学, 総合政策学部, 講師 (60755685) 玉造晃弘東京電機大学, 理工学部, 研究員 (10876361)
Project Period (FY)	2020-04-01 – 2023-03-31
Keywords	強化学習 / 社会学習 / バンディット問題 / 模倣学習 / 限定合理性 / 満足化 / 行動経済学 / 採餌行動
Outline of Annual Research Achievements	本研究の実施項目は、大別して (T) 自然強化学習理論の理論的確立 (S) 社会学習の新しいモデリング (A) 応用における有効性の実証 (X) 理論の実験的検証 (人間・マウスなど) である。それぞれ、ジャーナル論文が 5, 1, 1, 2 編ずつ出版された。(T)においては、主観リグレットという概念により、限定合理性理論のモデルである満足化の実装、プロスペクト理論的な現象（リスク態度の反射効果）、合理的な採餌行動などの現象が再現できることが分かり、大きな進展があった。 (S) では、エミュレーション的競争（コンペティションとは異なる）により、分業が自己組織化されることが分かった。(A)の応用に関しては新しくベクトル量子化においても結果が出た。(X)では、マウスに関して、本研究の理論を一般化しうる興味深い結果が得られた。
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason 「自然強化学習」に関して、計画以上に理論的に大きな進展と社会学習の新しいモデリングが成功した他、共同研究を通じて応用における有効性の実証も行えた。理論の実験的検証についてもマウスについて新しい結果が出、論文は注目を集めている。
Strategy for Future Research Activity	コロナを理由に延期（繰り越し）を行った実験的研究は実施する。他に関しては順調に進展しており、論文出版を中心に行う。

Research Products
(9 results)

All 2022 2021 2020

All Journal Article (8 results) (of which Peer Reviewed: 8 results, Open Access: 7 results) Book (1 results)

[Journal Article] Softsatisficing: Risk-sensitive softmax action selection2022
- Author(s)
  Kamiya Takumi、Takahashi Tatsuji
- Journal Title
  
  Biosystems
  
  Volume: 213 Pages: 104633～104633
- DOI
  10.1016/j.biosystems.2022.104633
- Peer Reviewed / Open Access
[Journal Article] Conveying Intention by Motions With Awareness of Information Asymmetry2022
- Author(s)
  Fukuchi Yosuke、Osawa Masahiko、Yamakawa Hiroshi、Takahashi Tatsuji、Imai Michita
- Journal Title
  
  Frontiers in Robotics and AI
  
  Volume: 9 Pages: 783863
- DOI
  10.3389/frobt.2022.783863
- Peer Reviewed / Open Access
[Journal Article] The asymmetric learning rates of murine exploratory behavior in sparse reward environments2021
- Author(s)
  Ohta Hiroyuki、Satori Kuniaki、Takarada Yu、Arake Masashi、Ishizuka Toshiaki、Morimoto Yuji、Takahashi Tatsuji
- Journal Title
  
  Neural Networks
  
  Volume: 143 Pages: 218～229
- DOI
  10.1016/j.neunet.2021.05.030
- Peer Reviewed / Open Access
[Journal Article] Self-incremental learning vector quantization with human cognitive biases2021
- Author(s)
  Manome Nobuhito、Shinohara Shuji、Takahashi Tatsuji、Chen Yu、Chung Ung-il
- Journal Title
  
  Scientific Reports
  
  Volume: 11 Pages: 3910
- DOI
  10.1038/s41598-021-83182-4
- Peer Reviewed / Open Access
[Journal Article] 不定自然変換理論に基づく比喩理解モデルの計算論的実装の試み2021
- Author(s)
  池田駿介、布山美慕、西郷甲矢人、高橋達二
- Journal Title
  
  認知科学
  
  Volume: 28 Pages: 39～56
- DOI
  10.11225/cs.2020.065
- Peer Reviewed / Open Access
[Journal Article] A category theoretic approach to metaphor comprehension: Theory of indeterminate natural transformation2020
- Author(s)
  Fuyama Miho、Saigo Hayato、Takahashi Tatsuji
- Journal Title
  
  Biosystems
  
  Volume: 197 Pages: 104213～104213
- DOI
  10.1016/j.biosystems.2020.104213
- Peer Reviewed / Open Access
[Journal Article] A new method of Bayesian causal inference in non-stationary environments2020
- Author(s)
  Shinohara Shuji、Manome Nobuhito、Suzuki Kouta、Chung Ung-il、Takahashi Tatsuji、Okamoto Hiroshi、Gunji Yukio Pegio、Nakajima Yoshihiro、Mitsuyoshi Shunji
- Journal Title
  
  PLOS ONE
  
  Volume: 15 Pages: e0233559
- DOI
  10.1371/journal.pone.0233559
- Peer Reviewed / Open Access
[Journal Article] Extended Bayesian inference incorporating symmetry bias2020
- Author(s)
  Shinohara Shuji、Manome Nobuhito、Suzuki Kouta、Chung Ung-il、Takahashi Tatsuji、Gunji Pegio-Yukio、Nakajima Yoshihiro、Mitsuyoshi Shunji
- Journal Title
  
  Biosystems
  
  Volume: 190 Pages: 104104～104104
- DOI
  10.1016/j.biosystems.2020.104104
- Peer Reviewed
[Book] Logic and Uncertainty in the Human Mind2020
- Author(s)
  Takahashi Tatsuji、Oyo Kuratomo、Tamatsukuri Akihiro、Higuchi Kohki
- Total Pages
  20
- Publisher
  Routledge

2020 Fiscal Year Annual Research Report

Natural reinforcement learning integrating intrinsic motivation and sociality

Principal Investigator

高橋 達二 東京電機大学, 理工学部, 教授 (00514514)

Current Status of Research Progress

Reason

Research Products

[Journal Article] Softsatisficing: Risk-sensitive softmax action selection2022

Author(s)

Journal Title

DOI

[Journal Article] Conveying Intention by Motions With Awareness of Information Asymmetry2022

Author(s)

Journal Title

DOI

[Journal Article] The asymmetric learning rates of murine exploratory behavior in sparse reward environments2021

Author(s)

Journal Title

DOI

[Journal Article] Self-incremental learning vector quantization with human cognitive biases2021

Author(s)

Journal Title

DOI

[Journal Article] 不定自然変換理論に基づく比喩理解モデルの計算論的実装の試み2021

Author(s)

Journal Title

DOI

[Journal Article] A category theoretic approach to metaphor comprehension: Theory of indeterminate natural transformation2020

Author(s)

Journal Title

DOI

[Journal Article] A new method of Bayesian causal inference in non-stationary environments2020

Author(s)

Journal Title

DOI

[Journal Article] Extended Bayesian inference incorporating symmetry bias2020

Author(s)

Journal Title

DOI

[Book] Logic and Uncertainty in the Human Mind2020

Author(s)

Total Pages

Publisher

高橋達二東京電機大学, 理工学部, 教授 (00514514)