2017 Fiscal Year Annual Research Report

Elucidation of the Mathematical Basis and Neural Mechanisms of Multi-layer Representation Learning

Planned Research

Project Area	Correspondence and Fusion of Artificial Intelligence and Brain Science
Project/Area Number	16H06563
Research Institution	Okinawa Institute of Science and Technology Graduate University
Principal Investigator	銅谷賢治沖縄科学技術大学院大学, 神経計算ユニット, 教授 (80188846)
Project Period (FY)	2016-06-30 – 2021-03-31
Keywords	ディープラーニング / 強化学習 / モジュール自己組織化
Outline of Annual Research Achievements	1) 多階層表現学習の数理基盤：ディープラーニングを強化学習に用いる場合に、学習の安定性を保証しながらデータ効率を改善するため、Conservative Value Iterationと呼ぶあらたな学習アルゴリズムを提案し、その収束速度の数理解析を行った。それをもとにサンプルノイズや近似誤差のもとで安定に効率良い探索と学習が可能なパラメタ設定をシミュレーション実験により明らかにした。 2) 多階層表現学習の神経機構：大脳基底核での情報表現の獲得機構を明らかにするため、線条体の異なるコンパートメントの細胞を区別した光学神経活動計測実験を行った結果、行動学習の進行にともないストライオソームの細胞が報酬予測的な応答を獲得することを明らかにし論文発表を行った。大脳皮質と大脳基底核の各領域の情報表現の違いを明らかにするため、強化学習課題中のラットの神経活動計測データに対してタスクレベル、空間レベル、身体運動レベルの変数による回帰分析を行い、多層的な情報表現の局在を解析した。 3) 全脳レベルのモジュール自己組織化：大脳皮質の神経回路において、ボトムアップの感覚情報とトップダウンの予測情報が統合されるメカニズムを明らかにするため、マウスにレバーの微小な動きを識別させ操作を行わせる新たな行動課題を開発し、それを実行中のマウス大脳皮質の異なる層の神経活動をプリズム内視鏡を用いて同時計測する実験を開始した。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 1) 多階層表現学習の数理基盤：学習の安定性を保ちつつデータ効率を上げる新たな強化学習アルゴリズムについての国際学会論文の投稿を行った。 2) 多階層表現学習の神経機構：大脳基底核の線条体のストライオソームのニューロンが、報酬予測に関わる活動を示すという知見についてeNeuro誌に論文発表を行った。 3) 全脳レベルのモジュール自己組織化：マウスの新たな行動課題と光学神経活動記録システムが立ち上がり、計測実験を開始した。
Strategy for Future Research Activity	1) 多階層表現学習の数理基盤：提案した新たな学習アルゴリズムが、ディープネットワークとの組み合わせのもとで安定に効率よく動作することをAtariゲームなどの課題で実証する。 2) 多階層表現学習の神経機構：階層強化学習モデルにラットやマウスが行うのと同じ行動課題を学習させ、そこで獲得される情報表現を大脳皮質と大脳基底核の神経活動データと比較し解析を行う。３）全脳レベルのモジュール自己組織化：大脳皮質でのボトムアップの感覚情報とトップダウンの予測情報が統合されるメカニズムについての光学神経活動計測実験を進め、一次感覚野の各層の動的な情報表現の違いを明らかにする。

Research Products
(39 results)

All 2018 2017 Other

All Journal Article (4 results) (of which Peer Reviewed: 4 results, Open Access: 4 results) Presentation (32 results) (of which Int'l Joint Research: 16 results, Invited: 14 results) Remarks (2 results) Funded Workshop (1 results)

[Journal Article] Connectivity inference from neural recording data: Challenges, mathematical bases and research directions2018
- Author(s)
  Magrans de Abril I, Yoshimoto J, Doya K
- Journal Title
  
  Neural Networks
  
  Volume: 102 Pages: 120-137
- DOI
  10.1016/j.neunet.2018.02.016
- Peer Reviewed / Open Access
[Journal Article] Reward-predictive neural activities in striatal striosome compartments2018
- Author(s)
  Yoshizawa T, Ito M, Doya K
- Journal Title
  
  eNeuro
  
  Volume: 5 Pages: 0367
- DOI
  10.1523/ENEURO.0367-17.2018.
- Peer Reviewed / Open Access
[Journal Article] Sigmoid-weighted linear units for neural network function approximation in reinforcement learning2018
- Author(s)
  Elfwing S, Uchibe E, Doya K
- Journal Title
  
  Neural Networks
  
  Volume: 2017 Specail issue Pages: 30297-6
- DOI
  10.1016/j.neunet.2017.12.012.
- Peer Reviewed / Open Access
[Journal Article] Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions2017
- Author(s)
  Tokuda T, Yoshimoto J, Shimizu Y Okada G, Takamura M, Okamoto Y, Yamawaki S, Doya K
- Journal Title
  
  PLoS One
  
  Volume: 12 Pages: e0186566
- DOI
  10.1371/journal.pone.0186566
- Peer Reviewed / Open Access
[Presentation] Cell-type specific calcium imaging of striatal neurons in the striosome compartments during an odor-conditioning task2018
- Author(s)
  Yoshizawa T, Ito M, Doya K
- Organizer
  Gordon Research Conference
- Int'l Joint Research
[Presentation] Experimental investigation of hierarchical Bayesian inference in sensory and motor cortices2018
- Author(s)
  Zobnin S, Li Yuzhe, Doya K
- Organizer
  脳と心のメカニズム第18回冬のワークショップ
[Presentation] 報酬待機行動を制御するセロトニンの役割ーoptogeneticsによる検証ー2018
- Author(s)
  宮崎勝彦
- Organizer
  セロトニン研究会
- Invited
[Presentation] Imaging the neural circuit for mental simulation2018
- Author(s)
  Doya K
- Organizer
  COSYNE 2018 Workshop Session: Concepts, attention, and consciousness in (reinforcement) learning 3.5
- Int'l Joint Research
[Presentation] Neural circuits for reinforcement learning and mental simulation2018
- Author(s)
  Doya K
- Organizer
  Canonical Computation in Brains and Machines, New York University
- Int'l Joint Research
[Presentation] What should we further learn from the brain?2018
- Author(s)
  Doya K
- Organizer
  Joint Workshop of Korean AI flagship Project and Japanese AI and Brain Scinece Project
[Presentation] Neural circuits for reinforcement learning and mental simulation2018
- Author(s)
  Doya K
- Organizer
  Brain and AI Symposium Korea
- Int'l Joint Research / Invited
[Presentation] Neural circuits for reinforcement learning and mental simulation2018
- Author(s)
  Doya K
- Organizer
  Seminar at Cold Spring Harbor Laboratory
[Presentation] How does the brain wire up itself on the fly?2018
- Author(s)
  Doya K
- Organizer
  Seminar at Institute for Advnced Study, Princeton
- Invited
[Presentation] Neural Circuit for Mental Simulation2018
- Author(s)
  Doya K
- Organizer
  The Neuroscience and Social Decision Making Talk Series
[Presentation] Robust and Efficient Off-Policy Policy Evaluation via Enhanced Action-Gap2017
- Author(s)
  Kozuno T, Uchibe E, Doya K
- Organizer
  The 20th Information-Based Induction Sciences Workshop
[Presentation] The optimal-baseline estimator is not the optimal baseline-estimator2017
- Author(s)
  Parmas P, Peters J, Doya, K
- Organizer
  Information-based induction sciences workshop (IBIS)
[Presentation] Adaptation of Optimization Algorithms to Problem Domains by Transfer Learning2017
- Author(s)
  Reinke C, Doya K
- Organizer
  2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)
- Int'l Joint Research
[Presentation] Average Reward Optimization with Multiple Discounting Reinforcement Learners2017
- Author(s)
  Reinke C, Uchibe E, Doya K
- Organizer
  ICONIP (Lecture Notes in Computer Science)
- Int'l Joint Research
[Presentation] Fast Adaptation of Behavior to Changing Goals with a Gamma Ensemble2017
- Author(s)
  Reinke C, Uchibe E, Doya K
- Organizer
  3rd Multidisciplinary conference on reinforcement learning and decision making
- Int'l Joint Research
[Presentation] Coding of value information in the striatal striosome compartment2017
- Author(s)
  Yoshizawa T, Ito M, Doya K
- Organizer
  44th Naito Conference
[Presentation] Neural representation of sensory-state value in the striatal striosome compartment2017
- Author(s)
  Yoshizawa T, Ito M, Doya K
- Organizer
  Society for Neuroscience 47th Annual Meeting
- Int'l Joint Research
[Presentation] Experimental investigation of hierarchical Bayesian inference in sensory and motor cortices2017
- Author(s)
  Zobnin S, Li Yuzhe, Doya K
- Organizer
  新学術領域研究「人工知能と脳科学」第3回領域会議
[Presentation] Neural coding, brain imaging and information extraction by circuit modeling2017
- Author(s)
  Doya K
- Organizer
  Neuroscience 2017 Sattelite Symposium / CREST Symposium
- Int'l Joint Research / Invited
[Presentation] Reinforcement learning: basic concepts and recent advances2017
- Author(s)
  Doya K
- Organizer
  Workshop on "Human & Machine Learning"
- Int'l Joint Research / Invited
[Presentation] Neural mechanisms of reinforcement learning and mental simulation2017
- Author(s)
  Doya K
- Organizer
  Workshop on "Human & Machine Learning"
- Int'l Joint Research / Invited
[Presentation] What should we further learn from the brain?2017
- Author(s)
  Doya K
- Organizer
  Korean AI Flagship Project Workshop
- Invited
[Presentation] Exploring the deep brain network for reinforcement learning2017
- Author(s)
  Doya K
- Organizer
  Neuroscience 2017, 40th Annual Meeting of the Japan Neuroscience Societ,Luncheon Seminar
- Int'l Joint Research / Invited
[Presentation] Decoding the contents of mental simulation2017
- Author(s)
  Doya K
- Organizer
  Neuroscience 2017 Sattelite Symposium: Computational Principles of the Nervous System
- Int'l Joint Research / Invited
[Presentation] What should We further Learn from the Brain?2017
- Author(s)
  Doya K
- Organizer
  Brain-AI Workshop, NYU Shanhai
- Int'l Joint Research / Invited
[Presentation] BRAINｘBayes2017
- Author(s)
  銅谷賢治
- Organizer
  新学術領域研究「人工知能と脳科学」若手サマースクール
[Presentation] 大脳基底核の機能モデル・病態モデルと実験的検証2017
- Author(s)
  Doya K
- Organizer
  第32回日本大脳基底核研究会 (JBAGS2017)
- Invited
[Presentation] 「脳内シミュレーション」の神経回路を可視化する2017
- Author(s)
  銅谷賢治
- Organizer
  新適塾「脳はおもしろい」第１8回会合
[Presentation] Artificial Intelligence and Brain Science2017
- Author(s)
  Doya K
- Organizer
  Seminar at Kyungpook National University
[Presentation] What can we further learn from the brain?2017
- Author(s)
  Doya K
- Organizer
  24th International Conference on Neural Information Processing ICONIP2017
- Int'l Joint Research / Invited
[Presentation] Neural circuits for reinforcement learning and mental simulation2017
- Author(s)
  Doya K
- Organizer
  SCiNDU: Systems & Computational Neuroscience Down Under
- Int'l Joint Research / Invited
[Presentation] 脳内シミュレーションの神経機構2017
- Author(s)
  銅谷賢治
- Organizer
  第32回日本整形外科学会基礎学術集会
- Invited
[Remarks] 人工知能と脳科学の対照と融合
- URL
  http://www.brain-ai.jp/jp/
[Remarks] 沖縄科学技術大学院大学　神経計算ユニット
- URL
  https://groups.oist.jp/ncu
[Funded Workshop] 第40回日本神経科学大会2017

2017 Fiscal Year Annual Research Report

Elucidation of the Mathematical Basis and Neural Mechanisms of Multi-layer Representation Learning

Principal Investigator

銅谷 賢治 沖縄科学技術大学院大学, 神経計算ユニット, 教授 (80188846)

Current Status of Research Progress

Reason

Research Products

[Journal Article] Connectivity inference from neural recording data: Challenges, mathematical bases and research directions2018

Author(s)

Journal Title

DOI

[Journal Article] Reward-predictive neural activities in striatal striosome compartments2018

Author(s)

Journal Title

DOI

[Journal Article] Sigmoid-weighted linear units for neural network function approximation in reinforcement learning2018

Author(s)

Journal Title

DOI

[Journal Article] Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions2017

Author(s)

Journal Title

DOI

[Presentation] Cell-type specific calcium imaging of striatal neurons in the striosome compartments during an odor-conditioning task2018

Author(s)

Organizer

[Presentation] Experimental investigation of hierarchical Bayesian inference in sensory and motor cortices2018

Author(s)

Organizer

[Presentation] 報酬待機行動を制御するセロトニンの役割ーoptogeneticsによる検証ー2018

Author(s)

Organizer

[Presentation] Imaging the neural circuit for mental simulation2018

Author(s)

Organizer

[Presentation] Neural circuits for reinforcement learning and mental simulation2018

Author(s)

Organizer

[Presentation] What should we further learn from the brain?2018

Author(s)

Organizer

[Presentation] Neural circuits for reinforcement learning and mental simulation2018

Author(s)

Organizer

[Presentation] Neural circuits for reinforcement learning and mental simulation2018

Author(s)

Organizer

[Presentation] How does the brain wire up itself on the fly?2018

Author(s)

Organizer

[Presentation] Neural Circuit for Mental Simulation2018

Author(s)

Organizer

[Presentation] Robust and Efficient Off-Policy Policy Evaluation via Enhanced Action-Gap2017

Author(s)

Organizer

[Presentation] The optimal-baseline estimator is not the optimal baseline-estimator2017

Author(s)

Organizer

[Presentation] Adaptation of Optimization Algorithms to Problem Domains by Transfer Learning2017

Author(s)

Organizer

[Presentation] Average Reward Optimization with Multiple Discounting Reinforcement Learners2017

Author(s)

Organizer

[Presentation] Fast Adaptation of Behavior to Changing Goals with a Gamma Ensemble2017

Author(s)

Organizer

[Presentation] Coding of value information in the striatal striosome compartment2017

Author(s)

Organizer

[Presentation] Neural representation of sensory-state value in the striatal striosome compartment2017

Author(s)

Organizer

[Presentation] Experimental investigation of hierarchical Bayesian inference in sensory and motor cortices2017

Author(s)

Organizer

[Presentation] Neural coding, brain imaging and information extraction by circuit modeling2017

Author(s)

Organizer

銅谷賢治沖縄科学技術大学院大学, 神経計算ユニット, 教授 (80188846)

[Remarks] 沖縄科学技術大学院大学　神経計算ユニット