Emergence of Grammar in Imitation Learning using Deep Learning

Publicly Offered Research

Project Area	Studies of Language Evolution for Co-creative Human Communication
Project/Area Number	18H05057
Research Category	Grant-in-Aid for Scientific Research on Innovative Areas (Research in a proposed research area)
Allocation Type	Single-year Grants
Research Institution	Hokkaido University
Principal Investigator	飯塚博幸北海道大学, 情報科学研究院, 准教授 (30396832)
Project Period (FY)	2018-04-01 – 2020-03-31
Project Status	Completed (Fiscal Year 2019)
Budget Amount *help	¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000) Fiscal Year 2019: ¥1,950,000 (Direct Cost: ¥1,500,000、Indirect Cost: ¥450,000) Fiscal Year 2018: ¥1,950,000 (Direct Cost: ¥1,500,000、Indirect Cost: ¥450,000)
Keywords	敵対的模倣学習 / LSTM / カオス / 文法 / 深層学習 / ニューラルネットワーク / 複雑化 / 模倣学習
Outline of Annual Research Achievements	初年度において関数の表現が豊かなフィードフォワードニューラルネットワークを用いても，相手を真似したいが，相手からは真似されたくない状況である敵対的模倣学習によって時系列が複雑化することが明らかになった．最終年度は，まず，敵対的模倣が複雑化に寄与しているのかを確かめるために，異なる真似関係構造がもたらす時系列の複雑さを明らかにした．相互作用の種類として，２体のそれぞれのエージェントに，模倣のみ（真似したい），被模倣のみ（真似されたくない），敵対的模倣の３種類を用いてシミュレーションを行った．結果として，少なくとも一方が敵対的模倣のときのみ時系列は複雑化し，そのときに相手が被模倣もしくは敵対的である必要があることが明らかとなった．両者の真似されたくないというのが複雑化の必要条件であり，かつ，少なくともどちらか一方が真似したいという敵対的模倣が必要であるとわかった．さらに，敵対的模倣学習がもたらす時系列の時間方向の複雑化を示すために，記憶を保持できるリカレントニューラルネットワークの一つであるLSTMを用いてモデルの構築とシミュレーションを行った．結果，初年度のモデルと同様に，リアプノフ指数が正となり，時系列が複雑化することがわかった．そこで，記憶のないモデル時に見られなかったような時間方向への構造化を定量的に示すために，ZIP圧縮を用いて生成時系列を圧縮させた．時系列がカオスになっているエポックのうち，その中でも相対的に圧縮できるパターンが生まれていることが明らかとなった．このときの時系列の遷移パターンを可視化するために，N-gramモデルを用いて時系列を有限状態文法で表し，ノード数・パス数ともに小さくなっていることを示し，より単純なグラフになっていることがわかった．記憶を持つ場合には，空間方向だけでなく時間方向も扱えるため，時間方向の複雑化・構造化が生じた．
Research Progress Status	令和元年度が最終年度であるため、記入しない。
Strategy for Future Research Activity	令和元年度が最終年度であるため、記入しない。

Report

(2 results)

2019 Annual Research Report
2018 Annual Research Report

Research Products
(7 results)

All 2020 2019 2018

All Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (6 results) (of which Int'l Joint Research: 6 results)

[Journal Article] Complexity of bird song caused by adversarial imitation learning2020
- Author(s)
  Seiya Yamazaki, Hiroyuki Iizuka, Masahito Yamamoto
- Journal Title
  
  Artificial Life and Robotics
  
  Volume: 25(1) Issue: 1 Pages: 124-132
- DOI
  10.1007/s10015-019-00559-5
- NAID
  120006955201
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Presentation] Analysis of Time Series Trained by Adversarial Imitation Learning in Discrete State Neural Network Model2020
- Author(s)
  Seiya Yamazaki, Hiroyuki Iizuka, Masahito Yamamoto
- Organizer
  Proceedings of the 2020 IEEE/SICE International Symposium on System Integration
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Analysis of Time Series Generated By Long Short-Term Memory Trained with Adversarial Imitation Learning2019
- Author(s)
  Seiya Yamazaki, Hiroyuki Iizuka, Masahito Yamamoto
- Organizer
  Proceedings of 2019 IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2019), 2019 IEEE Symposium on Artificial Life
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Adversarial Imitation Learning of Bird Song Modeled with Recurrent Neural Network2019
- Author(s)
  Seiya Yamazaki, Hiroyuki Iizuka, Masahito Yamamoto
- Organizer
  The 24th International Symposium on Artificial Life and Robotics 2019 (AROB 2019), 162-167
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Emergence of Chaotic Time Series by Adversarial Imitation Learning2018
- Author(s)
  Seiya Yamazaki, Hiroyuki Iizuka, Masahito Yamamoto
- Organizer
  Proceedings of the 2018 Conference on Artificial Life, 659-664
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Differentiation of communication signals to establish cooperation using Deep Q-Network2018
- Author(s)
  Hironobu Horiuchi, Hiroyuki Iizuka, Masahito Yamamoto
- Organizer
  Proceedings of IES 2018 (the 22nd Asia Pacific Symposium on Intelligent and Evolutionary Systems), 156--162
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 敵対的模倣学習におけるカオス時系列の創発要因2018
- Author(s)
  山嵜聖也, 飯塚博幸, 山本雅人
- Organizer
  情報処理北海道シンポジウム2018, 161-166
- Related Report
  2018 Annual Research Report
- Int'l Joint Research

Emergence of Grammar in Imitation Learning using Deep Learning

Principal Investigator

飯塚 博幸 北海道大学, 情報科学研究院, 准教授 (30396832)

¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000)

Report

Research Products

[Journal Article] Complexity of bird song caused by adversarial imitation learning2020

Author(s)

Journal Title

DOI

NAID

Related Report

[Presentation] Analysis of Time Series Trained by Adversarial Imitation Learning in Discrete State Neural Network Model2020

Author(s)

Organizer

Related Report

[Presentation] Analysis of Time Series Generated By Long Short-Term Memory Trained with Adversarial Imitation Learning2019

Author(s)

Organizer

Related Report

[Presentation] Adversarial Imitation Learning of Bird Song Modeled with Recurrent Neural Network2019

Author(s)

Organizer

Related Report

[Presentation] Emergence of Chaotic Time Series by Adversarial Imitation Learning2018

Author(s)

Organizer

Related Report

[Presentation] Differentiation of communication signals to establish cooperation using Deep Q-Network2018

Author(s)

Organizer

Related Report

[Presentation] 敵対的模倣学習におけるカオス時系列の創発要因2018

Author(s)

Organizer

Related Report

飯塚博幸北海道大学, 情報科学研究院, 准教授 (30396832)