1998 Fiscal Year Final Research Report Summary

Studies on Mathematical Structure of Dynamic Programming

Research Project

Project/Area Number	09640243
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	General mathematics (including Probability theory/Statistical mathematics)
Research Institution	Chiba University
Principal Investigator	KURANO Masami Chiba Univ., Faculty of Education, Prof., 教育学部, 教授 (70029487)
Co-Investigator(Kenkyū-buntansha)	YOSHIDA Yuji Kitakyushu Univ., Faculty of Economics, Prof., 経済学部, 教授 (90192426) KADOTA Yoshinobu Wakayama Univ., Faculty of Education, Prof., 教育学部, 教授 (90116294) MARUYAMA Ken-ichi Chiba Univ., Faculty of Education, Asist.Prof., 教育学部, 助教授 (70173961) KENMOCHI Nobuyuki Chiba Univ., Faculty of Education, Prof., 教育学部, 教授 (00033887) UZAWA Masakazu Chiba Univ., Faculty of Education, Prof., 教育学部, 教授 (80009026)
Project Period (FY)	1997 – 1998
Keywords	Dynamic Programming / Markov Decision Processes / Fuzzy Dynamic System / Markov Set-Chain / Fuzzy Stopping / Optimality Equation / general Utility / Optimal Policy
Research Abstract	In this project, our objective is to develope the structural study of Dynamic Programming(DP in short) and establish DP method which is more robust or more flexible in the sense that it is reasonably efficient in rough approximation and allows for fluctuating factors in sequential decision processes. For this purpose. we tried to develope analytical studies on various mathematical decion model. (1) Markov set-chain model As a model which is robust for rough approximation of the transition matrix in Markov decision processes, we introduced a decision model, called a controlled Markov set-chain, and derived a DP equation by which Pareto optimal policies was constructed. Some computational results are included. (2) Fuzzy dynamic systems and stopping problem The ergodic theorem for the dynamic system with fuzzy state and fuzzy transition is developed and the existence and uniqueness of solutions of the corresponding DP equations is proved. Also, a stopping problem for dynamic fuzzy system is formulated and solved by an extended UP method. (3) General utility model A stopped Markov decision process is analysed under general utility. The corresponding DP equation is described by a family of distributions and more usefull in application of DP.

Research Products
(15 results)

All Other

All Publications (15 results)

[Publications] 蔵野正美: "A fuzzy relational equation in dynamic fuzzy systems" Fuzzy Sets and Systems. 101. 439-443 (1999)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 蔵野正美: "Controlled Markov set-chains with discounting" Journal of Applied Probability. 35. 293-302 (1998)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 吉田祐治: "A monotone fuzzy stopping time in dynamic fuzzy systems" To appear in Bulletin of Informatics and Cyber. 31. (1999)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 蔵野正美: "Constrained MDPs with compact State and action spaces" To appear in Optimization.
- Description
  「研究成果報告書概要(和文)」より
[Publications] 蔵野正美: "The time average reward for some dynamic fuzzy systems" To appear in Computers and Mathematics with Applications.
- Description
  「研究成果報告書概要(和文)」より
[Publications] 剱持信幸: "Parablic PDEs with hysteresis and quasivariational inequalities" Nonlinear Analysis. 34. 665-686 (1998)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Kurano, M., Song, J.and Hosaka, M.: "Controlled Markov set-chains with discounting" Journal of Applied Probabiliy. Vol.35. 293-302 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Yoshida, Y.: "The optimal stopped fuzzy rewards in some continuous-time dynamic fuzzy rewards" Mathmatical and Computer Modelling. Vol.26. 53-66 (1997)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Yoshida, Y., Yasuda, M., Nakagami, J.and Kurano, M.: "A limit theorem in dynamic fuzzy systems with a monotone propery" Fuzzy Sets and Systems. Vol.94. 109-119 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Yoshida, Y: "The recurrence of dynamic fuzzy systems" Fuzzy Sets and Systems. Vol.95. 319-332 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kadota, Y., Kurano, M.and Yasuda, M.: "On the general utility of discounted Markov decision processes" International Transactions in Operational Research. Vol.5. 27-34 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kenmochi, N., Koyama, T.and Meyer, G.H.: "Parabolic PDEs with hysteresis and quasivariational inequalities" Nonlinear Analysis. Vol.34. 665-686 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kurano, M., Yasuda, M., Nakagami, J.and Yoshida, Y.: "A fuzzy relational equation in dynamic fuzzy systems" Fuzzy Sets and Systems. Vol.101. 439-443 (1999)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kurano, M., Nakagami, J.and Horiguchi, M.: "Controlled Markov set-chains with set-valued reward." Proceeding of International Conf erence on Nonlinear Analysis and Convex Analysis(NACA98). (To appear).
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Yoshida, Y., Yasuda, M., Nakagami, J.and Kurano, M.: "A monotone fuzzy stopping time in dynamic fuzzy systems" Bulletin of Informatics and Cybernetics. Vol.31(To appear). (1999)
- Description
  「研究成果報告書概要(欧文)」より

1998 Fiscal Year Final Research Report Summary

Studies on Mathematical Structure of Dynamic Programming

Principal Investigator

KURANO Masami Chiba Univ., Faculty of Education, Prof., 教育学部, 教授 (70029487)

Research Products

[Publications] 蔵野 正美: "A fuzzy relational equation in dynamic fuzzy systems" Fuzzy Sets and Systems. 101. 439-443 (1999)

Description

[Publications] 蔵野 正美: "Controlled Markov set-chains with discounting" Journal of Applied Probability. 35. 293-302 (1998)

Description

[Publications] 吉田 祐治: "A monotone fuzzy stopping time in dynamic fuzzy systems" To appear in Bulletin of Informatics and Cyber. 31. (1999)

Description

[Publications] 蔵野 正美: "Constrained MDPs with compact State and action spaces" To appear in Optimization.

Description

[Publications] 蔵野 正美: "The time average reward for some dynamic fuzzy systems" To appear in Computers and Mathematics with Applications.

Description

[Publications] 剱持 信幸: "Parablic PDEs with hysteresis and quasivariational inequalities" Nonlinear Analysis. 34. 665-686 (1998)

Description

[Publications] Kurano, M., Song, J.and Hosaka, M.: "Controlled Markov set-chains with discounting" Journal of Applied Probabiliy. Vol.35. 293-302 (1998)

Description

[Publications] Yoshida, Y.: "The optimal stopped fuzzy rewards in some continuous-time dynamic fuzzy rewards" Mathmatical and Computer Modelling. Vol.26. 53-66 (1997)

Description

[Publications] Yoshida, Y., Yasuda, M., Nakagami, J.and Kurano, M.: "A limit theorem in dynamic fuzzy systems with a monotone propery" Fuzzy Sets and Systems. Vol.94. 109-119 (1998)

Description

[Publications] Yoshida, Y: "The recurrence of dynamic fuzzy systems" Fuzzy Sets and Systems. Vol.95. 319-332 (1998)

Description

[Publications] Kadota, Y., Kurano, M.and Yasuda, M.: "On the general utility of discounted Markov decision processes" International Transactions in Operational Research. Vol.5. 27-34 (1998)

Description

[Publications] Kenmochi, N., Koyama, T.and Meyer, G.H.: "Parabolic PDEs with hysteresis and quasivariational inequalities" Nonlinear Analysis. Vol.34. 665-686 (1998)

Description

[Publications] Kurano, M., Yasuda, M., Nakagami, J.and Yoshida, Y.: "A fuzzy relational equation in dynamic fuzzy systems" Fuzzy Sets and Systems. Vol.101. 439-443 (1999)

Description

[Publications] Kurano, M., Nakagami, J.and Horiguchi, M.: "Controlled Markov set-chains with set-valued reward." Proceeding of International Conf erence on Nonlinear Analysis and Convex Analysis(NACA98). (To appear).

Description

[Publications] Yoshida, Y., Yasuda, M., Nakagami, J.and Kurano, M.: "A monotone fuzzy stopping time in dynamic fuzzy systems" Bulletin of Informatics and Cybernetics. Vol.31(To appear). (1999)

Description

[Publications] 蔵野正美: "A fuzzy relational equation in dynamic fuzzy systems" Fuzzy Sets and Systems. 101. 439-443 (1999)

[Publications] 蔵野正美: "Controlled Markov set-chains with discounting" Journal of Applied Probability. 35. 293-302 (1998)

[Publications] 吉田祐治: "A monotone fuzzy stopping time in dynamic fuzzy systems" To appear in Bulletin of Informatics and Cyber. 31. (1999)

[Publications] 蔵野正美: "Constrained MDPs with compact State and action spaces" To appear in Optimization.

[Publications] 蔵野正美: "The time average reward for some dynamic fuzzy systems" To appear in Computers and Mathematics with Applications.

[Publications] 剱持信幸: "Parablic PDEs with hysteresis and quasivariational inequalities" Nonlinear Analysis. 34. 665-686 (1998)