• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2013 Fiscal Year Final Research Report

Plateau Phenomena of the Learning Dynamics and Stabilities of the Local Minima of the Error Function in Machine Learning

Research Project

  • PDF
Project/Area Number 21500222
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Sensitivity informatics/Soft computing
Research InstitutionTokai University

Principal Investigator

OZEKI Tomoko  東海大学, 情報理工学部, 教授 (10407992)

Project Period (FY) 2009-04-01 – 2014-03-31
Keywords知能情報処理 / 機械学習 / 多層パーセプトロン / 隠れマルコフモデル / 強化学習
Research Abstract

Machine learning is one of the theories to construct the systems that can learn the data given from outside world like human brains. The algorithms of machine learning are divided into three categories such as supervised learning, unsupervised learning and reinforcement learning. In this research, we have investigated the dynamics of supervised learning and reinforcement learning and proposed some improvements. (1) We have investigated the relation between the singular structure of the parameter space of hidden Markov models and the trajectories of the learning dynamics. (2) We have proposed the reinforcement learning algorithm that can adapt to the changing environments and investigated the learning dynamics.

  • Research Products

    (7 results)

All 2014 2013 2012

All Journal Article (3 results) (of which Peer Reviewed: 1 results) Presentation (4 results)

  • [Journal Article] Concurrent Q LearningにおけるRelaxationの改良2013

    • Author(s)
      村上和謙, 尾関智子
    • Journal Title

      東海大学紀要情報理工学部

      Volume: 13 Pages: 9-14

    • Peer Reviewed
  • [Journal Article] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013

    • Author(s)
      Kazunori Murakami, Tomoko Ozeki
    • Journal Title

      Neural Information Processing Lecture Notes in Computer Science

      Volume: 8227 Pages: 84-91

  • [Journal Article] Concurrent Q-LearningにおけるRelaxationの改良2012

    • Author(s)
      村上和謙, 尾関智子
    • Journal Title

      信学技報

      Volume: 112(480) Pages: 209-213

  • [Presentation] Concurrent Q-Learningにおける適格度トレースの影響2014

    • Author(s)
      村上和謙, 尾関智子
    • Organizer
      電子情報通信学会
    • Place of Presentation
      新潟大学
    • Year and Date
      20140300
  • [Presentation] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013

    • Author(s)
      Kazunori Murakami, Tomoko Ozeki
    • Organizer
      ICONIP2013
    • Place of Presentation
      Daegu, Korea
    • Year and Date
      20131100
  • [Presentation] Concurrent Q-LearningとSarsa, Q学習の動的環境への適応2012

    • Author(s)
      村上和謙, 尾関智子
    • Organizer
      IBIS2012
    • Place of Presentation
      筑波大学
    • Year and Date
      20121100
  • [Presentation] 動的環境におけるTD誤差を用いた強化学習メタパラメータ学習法2012

    • Author(s)
      村上和謙, 尾関智子
    • Organizer
      電子情報通信学会
    • Place of Presentation
      岡山大学
    • Year and Date
      20120300

URL: 

Published: 2015-07-16  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi