Construction and development of the stochastic control theory for multivalued stochastic differential equations

Research Project

Project/Area Number	20K03754
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Review Section	Basic Section 12040:Applied mathematics and statistics-related
Research Institution	Hiroshima City University
Principal Investigator	Tanaka Teruo 広島市立大学, 情報科学研究科, 教授 (80227149)
Project Period (FY)	2020-04-01 – 2023-03-31
Project Status	Completed (Fiscal Year 2022)
Budget Amount *help	¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000) Fiscal Year 2022: ¥520,000 (Direct Cost: ¥400,000、Indirect Cost: ¥120,000) Fiscal Year 2021: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2020: ¥520,000 (Direct Cost: ¥400,000、Indirect Cost: ¥120,000)
Keywords	集合値確率過程 / 確率制御問題 / マルコフ決定過程 / 最適停止問題 / 分数型評価基準 / 預言者の不等式 / 最適政策 / 最適停止規則 / パラメトリック法 / Dinkebachアルゴリズム / 多価確率微分方程式 / マルコフ過程 / 確率制御理論 / 集合値解析学
Outline of Research at the Start	確率制御理論では、制御過程、状態過程、評価関数の3要素が重要である。本研究では、制御過程を集合値確率過程、状態過程を多価確率微分方程式、評価関数を制御過程と状態過程に依存する集合値関数（集合値確率変数）の期待値とすることにより新たな制御問題の定式化を与える。最適制御の存在を証明し、最適値関数の特徴付けを行うことにより、集合値確率制御理論を構築し、さらに、状態制約をもつ確率制御問題、不規則集合移動体の最適探索問題等へ応用する。
Outline of Final Research Achievements	(1)For a compact convex set valued stochastic process, we have studied the theory of set valued Markov processes by using the method of embedding the a family of compact convex sets to some Banach space, and identifying a set valued Markov process and a vector valued Markov process. (2)We have studied stochastic control problems (Markov decision processes, optimal stopping problems) under fractional criterion, proved the existence of an optimal control (optimal policy, optimal stopping rule), and given the characterization of optimal value. We also investigated the efficiency of Dinkelbach algorithm in order to seek an optimal control. (3)We have studied the difference comparison and ratio comparison of prophet inequalities for multiparameter optimal stopping problem, and drive a universal constant and an optimization problem in order to seek the universal constant.
Academic Significance and Societal Importance of the Research Achievements	確率制御理論では、制御過程、状態過程、評価関数の3要素が重要である。従来研究は、制御過程はスカラー値又はベクトル値確率過程、状態過程は制御過程を変数として含む確率微分方程式で記述されるスカラー値又はベクトル値確率過程評価関数、制御過程と状態過程に依存する汎関数の期待値によって定式化を与え、最適制御の存在を証明し、最適値関数の特徴付けを行うことであった。本研究は、集合値確率過程の性質を考察すること、評価関数を分数型にすることよって定式化を与え、最適制御の存在を証明し最適値関数の特徴付けを行うこと、および最適停止問題に対する預言者の不等式を考察することである。

Report

(4 results)

2022 Annual Research Report Final Research Report ( PDF )
2021 Research-status Report
2020 Research-status Report

Research Products
(7 results)

All 2023 2022

All Journal Article (3 results) (of which Peer Reviewed: 3 results) Presentation (4 results)

[Journal Article] A discrete time Markov decision process with a fractional discounted reward2023
- Author(s)
  Teruo Tanaka
- Journal Title
  
  Journal of Information and Optimization Sciences（掲載予定）
  
  Volume: -
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] Continuous time optimal stopping problems with fractional rewards2023
- Author(s)
  Teruo Tanaka
- Journal Title
  
  Journal of Information and Optimization Sciences（掲載予定）
  
  Volume: -
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] Discrete time multiparameter optimal stopping problems with fractional rewards2023
- Author(s)
  Teruo Tanaka
- Journal Title
  
  Journal of Information and Optimization Sciences（掲載予定）
  
  Volume: -
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Presentation] 最適停止問題の預言者の不等式―差の評価ー2023
- Author(s)
  田中輝雄
- Organizer
  日本数学会2023年度年会
- Related Report
  2022 Annual Research Report
[Presentation] 最適停止問題の預言者の不等式―比の評価ー2022
- Author(s)
  田中輝雄
- Organizer
  日本数学会2022年度秋季総合分科会
- Related Report
  2022 Annual Research Report
[Presentation] 分数型ペイオフを持つ微分ゲームについて2022
- Author(s)
  朝田智也，田中輝雄
- Organizer
  日本オペレーションズ・リサーチ学会2022年春季研究発表会
- Related Report
  2021 Research-status Report
[Presentation] 分数型評価基準のマルコフ決定過程2022
- Author(s)
  田中輝雄
- Organizer
  日本数学会2022年度年会
- Related Report
  2021 Research-status Report

Construction and development of the stochastic control theory for multivalued stochastic differential equations

Principal Investigator

Tanaka Teruo 広島市立大学, 情報科学研究科, 教授 (80227149)

¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)

Report

Research Products

[Journal Article] A discrete time Markov decision process with a fractional discounted reward2023

Author(s)

Journal Title

Related Report

[Journal Article] Continuous time optimal stopping problems with fractional rewards2023

Author(s)

Journal Title

Related Report

[Journal Article] Discrete time multiparameter optimal stopping problems with fractional rewards2023

Author(s)

Journal Title

Related Report

[Presentation] 最適停止問題の預言者の不等式―差の評価ー2023

Author(s)

Organizer

Related Report

[Presentation] 最適停止問題の預言者の不等式―比の評価ー2022

Author(s)

Organizer

Related Report

[Presentation] 分数型ペイオフを持つ微分ゲームについて2022

Author(s)

Organizer

Related Report

[Presentation] 分数型評価基準のマルコフ決定過程2022

Author(s)

Organizer

Related Report