Data-Driven Learning Optimal Control for Stochastic Systems

Research Project

Project/Area Number	18H05899
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Single-year Grants
Review Section	0301:Mechanics of materials, production engineering, design engineering, fluid engineering, thermal engineering, mechanical dynamics, robotics, aerospace engineering, marine and maritime engineering, and related fields
Research Institution	The Institute of Statistical Mathematics
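Principal Investigator	Mitsuru Toyoda (豊田充), The Institute of Statistical Mathematics, School of Statistical Thinking, Project Assistant Professor (40826939)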
Project Period (FY)	2018-08-24 – 2020-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount	¥2,340,000 (Direct Cost: ¥1,800,000, Indirect Cost: ¥540,000) Fiscal Year 2018: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Keywords	Control theory / Optimal control / Stochastic systems / Logical systems / Discrete systems / Boolean networks / Bayesian optimization / Finite-time optimal control
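Outline of Annual Research Achievements	This project studied algorithms for dynamical systems that optimize control while learning the dynamics of the controlled plant and the cost function from observed data. Two classes of plants were examined: continuous-time dynamics described by differential equations, and probabilistic logical systems whose discrete logical states undergo stochastic transitions. (1) For dynamics described by continuous-time differential equations, Bayesian optimization based on Gaussian processes, previously used to optimize static functions, was extended to optimal control problems, yielding a data-driven method for solving finite-time optimal control problems. An efficient method for computing the gradient of the cost function, drawing on results from dynamical systems theory, showed that the computation is feasible, and the optimality attained by applying the algorithm was also evaluated. (2) For probabilistic Boolean networks, which are probabilistic logical systems whose binary logical states follow stochastic transitions, a basic result was obtained: a method for Bayesian estimation of the probabilities in the model from measured data. In addition, whereas earlier control formulations assumed that the model parameters called selection probabilities are given deterministically, a new problem setting was introduced as an optimal control problem with probabilistically uncertain selection probabilities, and an algorithm that performs optimal control while estimating them was investigated. As a by-product, a method that treats conventional controllability analysis as an optimal control problem was devised, showing that controllability can be discussed in a broader framework than in previous work.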
Research Progress Status	Not entered because fiscal year 2018 (Heisei 30) is the final year.
Strategy for Future Research Activity	Not entered because fiscal year 2018 (Heisei 30) is the final year.
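
The Bayesian estimation of selection probabilities mentioned in item (2) of the outline can be illustrated with a minimal sketch. It is not the project's algorithm: it assumes a single binary node with two hypothetical candidate update rules (`f1`, `f2`), a Beta prior on the probability that `f1` is selected, and fully observed state transitions. Under those assumptions the posterior is again a Beta distribution, obtained by counting the transitions that only one candidate can explain.

```python
from fractions import Fraction


def f1(x):
    """Hypothetical candidate rule 1: logical NOT of the current state."""
    return 1 - x


def f2(x):
    """Hypothetical candidate rule 2: constant 1, whatever the current state."""
    return 1


def beta_posterior(a, b, transitions):
    """Update a Beta(a, b) prior on p = P(f1 is selected).

    transitions is a list of observed (state, next_state) pairs.
    """
    for x, x_next in transitions:
        y1, y2 = f1(x), f2(x)
        if y1 == y2:
            # Both candidates predict the same next state, so this
            # observation carries no information about p.
            continue
        if x_next == y1:
            a += 1  # only f1 explains the transition
        elif x_next == y2:
            b += 1  # only f2 explains the transition
        else:
            raise ValueError("transition is inconsistent with both rules")
    return a, b


def posterior_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return Fraction(a, a + b)


# Illustrative data, not taken from the project.
data = [(0, 1), (1, 0), (1, 1), (1, 0)]
a, b = beta_posterior(1, 1, data)  # uniform Beta(1, 1) prior
print(a, b, posterior_mean(a, b))
```

The first pair in `data` is skipped because both rules map state 0 to 1; the remaining three pairs update the counts. A multi-node network with more than two candidate rules per node would need a Dirichlet prior in place of the Beta prior.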

Report

(1 results)

2018 Annual Research Report

Research Products
(7 results)


All Int'l Joint Research (1 results) Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (5 results) (of which Int'l Joint Research: 2 results)

[Int'l Joint Research] Dalian University of Technology (China)
- Related Report
  2018 Annual Research Report
[Journal Article] Bayesian Optimization for Continuous-time Optimal Control Problem with Unknown Cost Function (2019)
- Author(s)
  Mitsuru Toyoda
- Journal Title
  Transactions of the Society of Instrument and Control Engineers
  Volume: 55 Issue: 2 Pages: 100-109
- DOI
  10.9746/sicetr.55.100
- NAID
  130007601712
- ISSN
  0453-4654, 1883-8189
- Related Report
  2018 Annual Research Report
- Peer Reviewed
[Presentation] MCMC Based Selection Probability Estimation (2019)
- Author(s)
  Mitsuru Toyoda and Yuhu Wu
- Organizer
  2019 12th Asian Control Conference (ASCC)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Terminal Cost Optimization of Probabilistic Boolean Control Network with Beta Distributed Selection Probabilities (2019)
- Author(s)
  Mitsuru Toyoda and Yuhu Wu
- Organizer
  38th Chinese Control Conference (CCC2019)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
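[Presentation] Approximate Solution Method for the Optimal Energy Management Problem of HEVs (2019)
- Author(s)
  Mitsuru Toyoda, Fuguo Xu, Tielong Shen
- Organizer
  Society of Automotive Engineers of Japan (JSAE) 2019 Spring Congress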
- Related Report
  2018 Annual Research Report
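[Presentation] Finite-Time Reachability Probability Maximization Control of Probabilistic Boolean Networks (2018)
- Author(s)
  Mitsuru Toyoda, Yuhu Wu
- Organizer
  Proceedings of the 61st Japan Joint Automatic Control Conference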
- Related Report
  2018 Annual Research Report
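[Presentation] Optimal Control of Probabilistic Boolean Networks with Parameters Following Beta Prior Distributions (2018)
- Author(s)
  Mitsuru Toyoda, Yuhu Wu
- Organizer
  Proceedings of the 6th Multi-Symposium on Control Systems (10th Plant Modeling Symposium)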
- Related Report
  2018 Annual Research Report
