
Studies on Learning Algorithms for Flexibly Structured Decision Process Models

Research Project

Project/Area Number 18540111
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field General mathematics (including Probability theory/Statistical mathematics)
Research Institution Chiba University

Principal Investigator

KURANO Masami  Chiba University, Faculty of Education, Professor (70029487)

Co-Investigator (Kenkyū-buntansha) YASUDA Masami  Chiba University, Faculty of Science, Professor (00041244)
NAKAGAMI Jun-ichi  Chiba University, Faculty of Science, Professor (30092076)
KADOTA Yoshinobu  Wakayama University, Faculty of Education, Professor (90116294)
YOSHIDA Yuji  University of Kitakyushu, Faculty of Economics and Business Administration, Professor (90192426)
IWAMURA Kakuzo  Josai University, Faculty of Science, Lecturer (00077918)
Project Period (FY) 2006 – 2007
Project Status Completed (Fiscal Year 2007)
Budget Amount
¥2,930,000 (Direct Cost: ¥2,600,000, Indirect Cost: ¥330,000)
Fiscal Year 2007: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2006: ¥1,500,000 (Direct Cost: ¥1,500,000)
Keywords Flexibly structured model / Markov decision process / learning algorithm / Fuzzy model / Reinforcement learning / Adaptive policy / Credibilistic process / Genetic algorithm / Markov decision model / Neuro-dynamic programming / Optimality equation
Research Abstract

In this project, our objective was to establish adaptive and reinforcement learning algorithms for uncertain decision processes with a more flexible and soft structure. The main research results are as follows.

1. Further studies on the construction and analysis of flexibly structured models
(a) By investigating the possibility and credibility of fuzziness and applying the corresponding extension theorem, we succeeded in constructing a credibilistic process from given conditional credibility measures. This makes an axiomatic development of decision processes under a fuzzy environment possible.
(b) We succeeded in deriving flexible optimality equations, for an absorbing semi-Markov game with general utility functions, that determine the optimal strategies.
(c) Concerning Bayesian analysis for a quality control problem, we proposed a new control chart with a more flexible structure, in which prior information about the unknown parameter is expressed by an interval of measures. The efficiency of the new chart is shown by comparison with the usual one.

2. Learning algorithms for adaptive Markov decision processes (MDPs)
We developed a pattern-matrix learning algorithm for adaptive MDPs. It learns the structure (pattern) of the transition matrices from the observed data and uses this information to construct an adaptive policy based on the temporal difference (TD) method. The method is, in essence, applicable to the multichain case.

3. Applications of reinforcement learning methods
(a) We investigated the convergence of the TD and Actor-Critic algorithms, which are applicable to various models of neuro-dynamic programming, and confirmed their efficiency through numerical experiments.
(b) To solve several operational research models under fuzzy environments, we developed a hybrid intelligent algorithm that integrates fuzzy simulation and a genetic algorithm. Its efficiency is verified by numerical examples.

Report

(3 results)
  • 2007 Annual Research Report   Final Research Report Summary
  • 2006 Annual Research Report
  • Research Products

    (16 results)

Journal Article (13 results) (of which Peer Reviewed: 6 results) Presentation (3 results)

  • [Journal Article] Fuzzy facility location-allocation problem under the Hurwicz criterion (2008)

    • Author(s)
      IWAMURA Kakuzo (co-authored)
    • Journal Title

      European J. of Operational Research (to appear) 184

      Pages: 627-635

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Fuzzy optimality relation for perceptive MDPs - the average case (2007)

    • Author(s)
      KURANO Masami (co-authored)
    • Journal Title

      Fuzzy Sets and Systems 158

      Pages: 1905-1912

    • Description
      From the "Final Research Report Summary (Japanese)"
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] A structured pattern matrix algorithm for multichain Markov decision processes (2007)

    • Author(s)
      Iki, T. (co-authored)
    • Journal Title

      Mathematical Methods of Operations Research 66

      Pages: 545-555

    • Description
      From the "Final Research Report Summary (Japanese)"
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Fuzzy optimality relation for perceptive MDPs - the average case (2007)

    • Author(s)
      Kurano, M., Yasuda, M., Nakagami, J., and Yoshida, Y.
    • Journal Title

      Fuzzy Sets and Systems Vol. 158

      Pages: 1905-1912

    • Description
      From the "Final Research Report Summary (English)"
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] A structured pattern matrix algorithm for multichain Markov decision processes (2007)

    • Author(s)
      Iki, T., Horiguchi, M., and Kurano, M.
    • Journal Title

      Mathematical Methods of Operations Research Vol. 66

      Pages: 545-555

    • Description
      From the "Final Research Report Summary (English)"
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Fuzzy optimality relation for perceptive MDPs - the average case (2007)

    • Author(s)
      KURANO Masami (co-authored)
    • Journal Title

      Fuzzy Sets and Systems 158

      Pages: 1905-1912

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] New models for shortest path problem with fuzzy arc lengths (2007)

    • Author(s)
      IWAMURA Kakuzo (co-authored)
    • Journal Title

      Applied Mathematical Modelling 31

      Pages: 259-269

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A fuzzy perceptive value for multi-variate stopping problem with a monotone rule (2007)

    • Author(s)
      KURANO Masami (co-authored)
    • Journal Title

      Bulletin of Informatics and Cybernetics (in press)

    • NAID

      120001944228

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A structured pattern matrix algorithm for multichain Markov decision processes (2007)

    • Author(s)
      Iki, T. (co-authored)
    • Journal Title

      Mathematical Methods of Operations Research (in press)

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A fuzzy approach to Markov decision processes with uncertain transition probabilities (2006)

    • Author(s)
      KURANO Masami (co-authored)
    • Journal Title

      Fuzzy Sets and Systems 157

      Pages: 2674-2682

    • Description
      From the "Final Research Report Summary (Japanese)"
    • Related Report
      2007 Final Research Report Summary 2006 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A fuzzy approach to Markov decision processes with uncertain transition probabilities (2006)

    • Author(s)
      Kurano, M., Yasuda, M., Nakagami, J., and Yoshida, Y.
    • Journal Title

      Fuzzy Sets and Systems Vol. 157

      Pages: 2674-2682

    • Description
      From the "Final Research Report Summary (English)"
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] A new evaluation of mean value for fuzzy numbers and its application to American put option under uncertainty (2006)

    • Author(s)
      YOSHIDA Yuji (co-authored)
    • Journal Title

      Fuzzy Sets and Systems 157

      Pages: 2614-2626

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A learning algorithm for communicating Markov decision processes with unknown transition matrices

    • Author(s)
      Iki, T. (co-authored)
    • Journal Title

      Bulletin of Informatics and Cybernetics (in press)

    • Related Report
      2006 Annual Research Report
  • [Presentation] Adaptive Markov decision processes based on temporal difference method (2007)

    • Author(s)
      Iki, T. (co-authored)
    • Organizer
      Mathematical Society of Japan
    • Place of Presentation
      Tohoku University
    • Year and Date
      2007-09-24
    • Description
      From the "Final Research Report Summary (Japanese)"
    • Related Report
      2007 Final Research Report Summary
  • [Presentation] Adaptive Markov decision processes based on temporal difference method (2007)

    • Author(s)
      Iki, T., Horiguchi, M., Yasuda, M., and Kurano, M.
    • Organizer
      Mathematical Society of Japan, Autumn Meeting
    • Place of Presentation
      Tohoku University
    • Year and Date
      2007-09-24
    • Description
      From the "Final Research Report Summary (English)"
    • Related Report
      2007 Final Research Report Summary
  • [Presentation] Adaptive Markov decision processes based on temporal difference method (2007)

    • Author(s)
      Iki, T. (joint presentation)
    • Organizer
      Mathematical Society of Japan
    • Place of Presentation
      Tohoku University
    • Year and Date
      2007-09-24
    • Related Report
      2007 Annual Research Report

Published: 2006-04-01   Modified: 2016-04-21  
