2014 Fiscal Year Annual Research Report

予測と意思決定のための機械学習理論の構築とその神経回路での実現

Planned Research

Project Area	Elucidation of neural computation for prediction and decision making: toward better human understanding and applications
Project/Area Number	23120004
Research Institution	The University of Tokyo
Principal Investigator	杉山将東京大学, 新領域創成科学研究科, 教授 (90334515)
Co-Investigator(Kenkyū-buntansha)	森本淳株式会社国際電気通信基礎技術研究所, 脳情報通信総合研究所, 研究室長 (10505986)
Project Period (FY)	2011-04-01 – 2016-03-31
Keywords	予測 / 意志決定 / 機械学習 / 特徴選択 / 強化学習 / ロボット制御
Outline of Annual Research Achievements	本年度は，特徴選択および特徴抽出に関して，エントロピー正則化に基づく距離計量学習アルゴリズムの開発，特徴選択に用いる統計的従属性尺度の推定アルゴリズムの開発を行い，論文を出版した．また，モデルベース強化学習において重要な働きをする状態遷移確率（条件付き確率密度）推定における特徴選択および特徴抽出に関して，特徴選択および特徴抽出手法を適用した後に条件付き確率密度を推定するという従来の二段階のアプローチではなく，特徴選択および特徴抽出と条件付き確率密度の推定を同時に行うための基礎研究を行った．強化学習に関して，条件付き確率の直接推定に基づくモデルベース強化学習アルゴリズムを開発し，その有効性を計算機シミュレーションにより実証した．また，報酬が時間とともに任意に変化するという非常に厳しい状況下でのオンライン強化学習アルゴリズムを開発し，その性質を理論的に明らかにした．更に，昨年度開発した標本再利用型モデルフリー強化学習アルゴリズムを実ヒューマノイドロボット制御に応用し，その有用性を実証した．高次元・実環境での強化学習アルゴリズムの改良のためにモデル予測制御の援用を予定しており，その基礎的検討を行った．具体的には，２足歩行ロボットモデルの特異摂動系への変換にもとづいて，評価時間を力学系の時定数にあわせて２段階用意することにより効率的な最適制御軌道の生成が可能となった．
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason 本年度は，計画した研究課題を達成しただけでなく，当初計画していなかった新しい成果が多数得られ，トップレベルの国際会議に複数の論文を発表することができた．また，高いレベルの国際会議で最優秀論文賞を獲得した．
Strategy for Future Research Activity	最終年度の来年度は，計画に従い研究を遂行していくことに加え，これまでの研究成果を総括するとともに，将来の更なる発展に向けて議論する．

Research Products
(26 results)

All 2015 2014 Other

All Journal Article (9 results) (of which Peer Reviewed: 9 results, Acknowledgement Compliant: 9 results) Presentation (14 results) Book (1 results) Remarks (2 results)

[Journal Article] Creating the brain and interacting with the brain and Integrated approach to understanding the brain2015
- Author(s)
  J. Morimoto and M. Kawato
- Journal Title
  
  Journal of the Royal Society Interface
  
  Volume: 12 Pages: -
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] A fault tolerant approach for biosignal-based robot control2015
- Author(s)
  J. Furukawa, T. Noda, T. Teramae and J. Morimoto
- Journal Title
  
  Advanced Robotics
  
  Volume: - Pages: -
- DOI
  10.1080/01691864.2014.996603
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] An EMG-driven assist system for vertical component weight bearing force2015
- Author(s)
  J. Furukawa, T. Noda, T. Teramae, and J. Morimoto
- Journal Title
  
  IEEE Systems Journal
  
  Volume: - Pages: -
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Constrained least-squares density-difference estimation.2014
- Author(s)
  Nguyen, T. D., du Plessis, M. C., Kanamori, T., & Sugiyama, M.
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E97-D Pages: 1822-1829
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Computationally efficient estimation of squared-loss mutual information with multiplicative kernel models.2014
- Author(s)
  Sakai, T. & Sugiyama, M.
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E97-D Pages: 968-971
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.2014
- Author(s)
  Tangkaratt, V., Mori, S., Zhao, T., Morimoto, J., & Sugiyama, M.
- Journal Title
  
  Neural Networks
  
  Volume: 57 Pages: 128-140
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Information-theoretic semi-supervised metric learning via entropy regularization.2014
- Author(s)
  Niu, G., Dai, B., Yamada, M., & Sugiyama, M.
- Journal Title
  
  Neural Computation
  
  Volume: 26 Pages: 1717-1762
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Decoding the ERD/ERS: influence of afferent input induced by a leg assistive robot2014
- Author(s)
  G. Lisi, T. Noda and J. Morimoto
- Journal Title
  
  Frontiers in Systems Neuroscience
  
  Volume: 8 Pages: 1-12
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] rain-machine interfacing control of whole-body humanoid motion2014
- Author(s)
  K. Bouyarmane, J. Vaillant, N. Sugimoto, F. Keith, J. Furukawa and J. Morimoto
- Journal Title
  
  Frontiers in Systems Neurosciences
  
  Volume: 8 Pages: 1-10
- Peer Reviewed / Acknowledgement Compliant
[Presentation] Analysis of variational Bayesian latent Dirichlet allocation: Weaker sparsity than MAP.2014
- Author(s)
  Nakajima, S., Sato, I., Sugiyama, M., Watanabe, K., & Kobayashi, H.
- Organizer
  Neural Information Processing Systems (NIPS2014)
- Place of Presentation
  Montreal, Quebec, Canada
- Year and Date
  2014-12-08 – 2014-12-11
[Presentation] Analysis of learning from positive and unlabeled data.2014
- Author(s)
  du Plessis, M. C., Niu, G., & Sugiyama, M.
- Organizer
  Neural Information Processing Systems (NIPS2014)
- Place of Presentation
  Montreal, Quebec, Canada
- Year and Date
  2014-12-08 – 2014-12-11
[Presentation] Efficient reuse of previous experiences in humanoid motor learning.2014
- Author(s)
  Sugimoto, N., Tangkaratt, V., Wensveen, T., Zhao, T., Sugiyama, M., & Morimoto, J.
- Organizer
  IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS2014)
- Place of Presentation
  Madrid, Spain
- Year and Date
  2014-11-18 – 2014-11-20
[Presentation] Observing human movements to construct a humanoid interface2014
- Author(s)
  Y. Ariki, T. Inamura and J. Morimoto
- Organizer
  IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS2014)
- Place of Presentation
  Madrid, Spain
- Year and Date
  2014-11-18 – 2014-11-20
[Presentation] Style-Phase Adaptation of Human and Humanoid Biped Walking Patterns in Real Systems2014
- Author(s)
  T. Matsubara, D. Uto, T. Noda, T. Teramae and J. Morimoto
- Organizer
  IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS2014)
- Place of Presentation
  Madrid, Spain
- Year and Date
  2014-11-18 – 2014-11-20
[Presentation] Clustering via mode seeking by direct estimation of the gradient of a log-density.2014
- Author(s)
  Sasaki, H., Hyvarinen, A., & Sugiyama, M.
- Organizer
  European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD2014)
- Place of Presentation
  Nancy, France
- Year and Date
  2014-09-15 – 2014-09-19
[Presentation] An online policy gradient algorithm for continuous state and action Markov decision processes.2014
- Author(s)
  Ma, Y., Zhao, T., Hatano, K., & Sugiyama, M.
- Organizer
  European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD2014)
- Place of Presentation
  Nancy, France
- Year and Date
  2014-09-15 – 2014-09-19
[Presentation] Development of an upper limb exoskeleton powered via pneumatic electric hybrid actuators with bowden cable2014
- Author(s)
  T. Noda, T. Teramae, B. Ugurlu, and J. Morimoto
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2014)
- Place of Presentation
  Chicago, Illinois, USA
- Year and Date
  2014-09-14 – 2014-09-18
[Presentation] Transductive learning with multi-class volume approximation.2014
- Author(s)
  Niu, G., Dai, B., du Plessis, M. C., & Sugiyama, M.
- Organizer
  International Conference on Machine Learning (ICML2014)
- Place of Presentation
  Beijing, China
- Year and Date
  2014-06-21 – 2014-06-26
[Presentation] Online approach for altering robot behaviors based on human in the loop coaching gestures2014
- Author(s)
  T. Petric, A. Gams, L. Zlajpah, A. Ude, and J. Morimoto
- Organizer
  IEEE International Conference on Robotics and Automation (ICRA2014)
- Place of Presentation
  Hong Kong, China
- Year and Date
  2014-05-31 – 2014-06-07
[Presentation] Orientation in Cartesian Space Dynamic Movement Primitives2014
- Author(s)
  A. Ude, B. Nemec, T. Petric, and J. Morimoto
- Organizer
  IEEE International Conference on Robotics and Automation (ICRA2014)
- Place of Presentation
  Hong Kong, China
- Year and Date
  2014-05-31 – 2014-06-07
[Presentation] Optimal Control Approach for Pneumatic Artificial Muscle with using Pressure-Force Conversion Model2014
- Author(s)
  T. Teramae, T. Noda and J. Morimoto
- Organizer
  IEEE International Conference on Robotics and Automation (ICRA2014)
- Place of Presentation
  Hong Kong, China
- Year and Date
  2014-05-31 – 2014-06-07
[Presentation] Analysis of empirical MAP and empirical partially Bayes: Can they be alternatives to variational Bayes?2014
- Author(s)
  Nakajima, S. & Sugiyama, M.
- Organizer
  International Conference on Artificial Intelligence and Statistics (AISTATS2014)
- Place of Presentation
  Reykjavik, Iceland
- Year and Date
  2014-04-22 – 2014-04-24
[Presentation] Bias reduction and metric learning for nearest-neighbor estimation of Kullback-Leibler divergence.2014
- Author(s)
  Noh, Y.-K., Sugiyama, M., Liu, S., du Plessis, M. C., Park, F. C., & Lee, D. D.
- Organizer
  International Conference on Artificial Intelligence and Statistics (AISTATS2014)
- Place of Presentation
  Reykjavik, Iceland
- Year and Date
  2014-04-22 – 2014-04-24
[Book] Statistical Reinforcement Learning: Modern Machine Learning Approaches2015
- Author(s)
  Sugiyama, M.
- Total Pages
  206
- Publisher
  Chapman and Hall/CRC
[Remarks]
- URL
  http://www.ms.k.u-tokyo.ac.jp
[Remarks]
- URL
  http://www.cns.atr.jp/~xmorimo/

2014 Fiscal Year Annual Research Report

予測と意思決定のための機械学習理論の構築とその神経回路での実現

Principal Investigator

杉山 将 東京大学, 新領域創成科学研究科, 教授 (90334515)

Current Status of Research Progress

Reason

Research Products

[Journal Article] Creating the brain and interacting with the brain and Integrated approach to understanding the brain2015

Author(s)

Journal Title

[Journal Article] A fault tolerant approach for biosignal-based robot control2015

Author(s)

Journal Title

DOI

[Journal Article] An EMG-driven assist system for vertical component weight bearing force2015

Author(s)

Journal Title

[Journal Article] Constrained least-squares density-difference estimation.2014

Author(s)

Journal Title

[Journal Article] Computationally efficient estimation of squared-loss mutual information with multiplicative kernel models.2014

Author(s)

Journal Title

[Journal Article] Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.2014

Author(s)

Journal Title

[Journal Article] Information-theoretic semi-supervised metric learning via entropy regularization.2014

Author(s)

Journal Title

[Journal Article] Decoding the ERD/ERS: influence of afferent input induced by a leg assistive robot2014

Author(s)

Journal Title

[Journal Article] rain-machine interfacing control of whole-body humanoid motion2014

Author(s)

Journal Title

[Presentation] Analysis of variational Bayesian latent Dirichlet allocation: Weaker sparsity than MAP.2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Analysis of learning from positive and unlabeled data.2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Efficient reuse of previous experiences in humanoid motor learning.2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Observing human movements to construct a humanoid interface2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Style-Phase Adaptation of Human and Humanoid Biped Walking Patterns in Real Systems2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Clustering via mode seeking by direct estimation of the gradient of a log-density.2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] An online policy gradient algorithm for continuous state and action Markov decision processes.2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Development of an upper limb exoskeleton powered via pneumatic electric hybrid actuators with bowden cable2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Transductive learning with multi-class volume approximation.2014

Author(s)

Organizer

Place of Presentation

Year and Date

杉山将東京大学, 新領域創成科学研究科, 教授 (90334515)