Data assimilation based reinforcement learning

Research Project

Project/Area Number	25730135
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Intelligent informatics
Research Institution	The University of Tokyo (2015) Osaka University (2013-2014)
Principal Investigator	Ueno Tsuyoshi The University of Tokyo, Graduate School of Frontier Sciences, Project Researcher (90615824)
Project Period (FY)	2013-04-01 – 2016-03-31
Project Status	Completed (Fiscal Year 2015)
Budget Amount	¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000); Fiscal Year 2015: ¥780,000 (Direct Cost: ¥600,000, Indirect Cost: ¥180,000); Fiscal Year 2014: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000); Fiscal Year 2013: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000)
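Keywords	Reinforcement learning / Data assimilation / Machine learning / Artificial intelligence / Bayesian optimization / Drug discovery / Stochastic optimal control / Optimal control / Statistical learning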
Outline of Final Research Achievements	Learning action strategies from computer simulations has the potential to increase productivity dramatically, because it avoids the expensive process of collecting data through real experiments. However, the behavior of a simulation often differs from that of the actual environment, so it is not uncommon for an action strategy obtained from simulation to be useless in practical applications. In this project, we developed a new framework, data assimilation reinforcement learning (DARL), which combines data assimilation with reinforcement learning. By learning the action strategy and the computer simulation simultaneously, DARL can obtain a good action strategy from a small number of experiments. We also applied DARL to materials design and drug discovery problems and confirmed its effectiveness compared with existing methods.

Report

(4 results)

2015 Annual Research Report, Final Research Report (PDF)
2014 Research-status Report
2013 Research-status Report

Research Products
(8 results)

Int'l Joint Research (1 result) Journal Article (3 results) (of which Int'l Joint Research: 1 result, Peer Reviewed: 2 results, Open Access: 1 result, Acknowledgement Compliant: 1 result) Presentation (4 results) (of which Invited: 2 results)

[Int'l Joint Research] Trevor David Rhone / Harvard University (USA)
- Related Report
  2015 Annual Research Report
[Journal Article] COMBO: An Efficient Bayesian Optimization Library for Materials Science (2016)
- Author(s)
  Tsuyoshi Ueno, Trevor David Rhone, T. Mizoguchi, Zhufeng Hou, Koji Tsuda
- Journal Title
  
  Materials Discovery
  
  Volume: In press
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
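[Journal Article] Business Applications of Maximal Clique Enumeration Technology and Software Tools (2014)
- Author(s)
  Tsuyoshi Ueno
- Journal Title
  
  Journal of the Institute of Electronics, Information and Communication Engineers (IEICE)
  
  Volume: 92 Pages: 1103-1106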
- Related Report
  2014 Research-status Report
[Journal Article] Active learning for noisy oracle via density power divergence (2013)
- Author(s)
  Y. Sogawa, T. Ueno, Y. Kawahara, T. Washio
- Journal Title
  
  Neural Networks
  
  Volume: 46 Pages: 133-143
- Related Report
  2013 Research-status Report
- Peer Reviewed
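[Presentation] Policy Search via Probabilistic Inference (2014)
- Author(s)
  Tsuyoshi Ueno
- Organizer
  Annual Conference of the Robotics Society of Japan
- Place of Presentation
  Kitakyushu, Fukuoka Prefecture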
- Year and Date
  2014-09-04 – 2014-09-06
- Related Report
  2014 Research-status Report
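[Presentation] Policy Search Based on e-Projection (2014)
- Author(s)
  Tsuyoshi Ueno
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence
- Place of Presentation
  Matsuyama, Ehime Prefecture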
- Year and Date
  2014-05-12 – 2014-05-15
- Related Report
  2014 Research-status Report 2013 Research-status Report
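[Presentation] Control by Learning: Reinforcement Learning (2013)
- Author(s)
  Tsuyoshi Ueno
- Organizer
  Symposium of the Measurement, Control, and Systems Engineering Division
- Place of Presentation
  Chiba, Chiba Prefecture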
- Related Report
  2013 Research-status Report
- Invited
[Presentation] Semiparametric Statistical Inference to Reinforcement Learning (2013)
- Author(s)
  Tsuyoshi Ueno
- Organizer
  Bernoulli Society Satellite Meeting to the ISI World Statistics Congress 2013
- Place of Presentation
  Bunkyo, Tokyo
- Related Report
  2013 Research-status Report
- Invited
