• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Data assimilation based reinforcement learning

Research Project

Project/Area Number 25730135
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation TypeMulti-year Fund
Research Field Intelligent informatics
Research InstitutionThe University of Tokyo (2015)
Osaka University (2013-2014)

Principal Investigator

Ueno Tsuyoshi  東京大学, 新領域創成科学研究科, 特任研究員 (90615824)

Project Period (FY) 2013-04-01 – 2016-03-31
Project Status Completed (Fiscal Year 2015)
Budget Amount *help
¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000)
Fiscal Year 2015: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2014: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2013: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords強化学習 / データ同化 / 機械学習 / 人工知能 / ベイズ最適化 / 創薬 / 確率最適制御 / 最適制御 / 統計学習
Outline of Final Research Achievements

Learning action strategies from computer simulations has a potential to achieving the drastic productivity increases because it has no necessity to perform an expensive process, i.e., real experiments for collecting the data. However, the behavior of simulations often differ from that of actual environments; thus, it is not rare that the action strategy obtained from the simulation makes no sense in practical applications. In this project, we developed a new framework, so-called data assimilation reinforcement learning (DARL) which incorporates data assimilation and reinforcement learning. DARL can provide the good action strategy in the small number of experiments by learning not only the action strategy but also the computer simulation simultaneously. We have also applied DARL to material design and drug discovery problems and confirmed its effectiveness compared with current methods.

Report

(4 results)
  • 2015 Annual Research Report   Final Research Report ( PDF )
  • 2014 Research-status Report
  • 2013 Research-status Report
  • Research Products

    (8 results)

All 2016 2014 2013 Other

All Int'l Joint Research (1 results) Journal Article (3 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 2 results,  Open Access: 1 results,  Acknowledgement Compliant: 1 results) Presentation (4 results) (of which Invited: 2 results)

  • [Int'l Joint Research] Trevor David Rhoneb/Harvard University(米国)

    • Related Report
      2015 Annual Research Report
  • [Journal Article] COMBO: An Efficient Bayesian Optimization Library for Materials Science Materials Discovery2016

    • Author(s)
      Tsuyoshi Ueno, Trevor David Rhone, T. Mizoguchi, Zhufeng Hou Koji Tsuda
    • Journal Title

      Materials Discovery

      Volume: 印刷中

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
  • [Journal Article] 極大クリーク列挙技術のビジネス応用と ソフトウェアツール2014

    • Author(s)
      植野剛
    • Journal Title

      電子情報通信学会誌,

      Volume: 92 Pages: 1103-1106

    • Related Report
      2014 Research-status Report
  • [Journal Article] Active learning for noisy oracle via density power divergence2013

    • Author(s)
      Y. Sogawa, T. Ueno, Y. Kawahara, T. Washio,
    • Journal Title

      Neural Networks

      Volume: 46 Pages: 133-143

    • Related Report
      2013 Research-status Report
    • Peer Reviewed
  • [Presentation] 確率推論による方策探索法2014

    • Author(s)
      植野 剛
    • Organizer
      日本ロボット学会 学術講演会
    • Place of Presentation
      福岡県北九州市
    • Year and Date
      2014-09-04 – 2014-09-06
    • Related Report
      2014 Research-status Report
  • [Presentation] e射影に基づく方策探索法2014

    • Author(s)
      植野 剛
    • Organizer
      人工知能学会全国大会
    • Place of Presentation
      愛媛県松山市
    • Year and Date
      2014-05-12 – 2014-05-15
    • Related Report
      2014 Research-status Report 2013 Research-status Report
  • [Presentation] 学習による制御: 強化学習2013

    • Author(s)
      植野 剛
    • Organizer
      計測・制御・システム工学部会シンポジウム
    • Place of Presentation
      千葉県千葉市
    • Related Report
      2013 Research-status Report
    • Invited
  • [Presentation] Semiparametric Statistical Inference to Reinforcement Leanrning2013

    • Author(s)
      Tsuyoshi Ueno
    • Organizer
      Bernoulli Society Satellite Meeting to the ISI World Statistics Congress 2013
    • Place of Presentation
      東京都文京区
    • Related Report
      2013 Research-status Report
    • Invited

URL: 

Published: 2014-07-25   Modified: 2022-01-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi