2013 Fiscal Year Final Research Report

Plateau Phenomena of the Learning Dynamics and Stabilities of the Local Minima of the Error Function in Machine Learning

Research Project

Project/Area Number	21500222
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Sensitivity informatics/Soft computing
Research Institution	Tokai University
Principal Investigator	OZEKI Tomoko 東海大学, 情報理工学部, 教授 (10407992)
Project Period (FY)	2009-04-01 – 2014-03-31
Keywords	知能情報処理 / 機械学習 / 多層パーセプトロン / 隠れマルコフモデル / 強化学習
Research Abstract	Machine learning is one of the theories to construct the systems that can learn the data given from outside world like human brains. The algorithms of machine learning are divided into three categories such as supervised learning, unsupervised learning and reinforcement learning. In this research, we have investigated the dynamics of supervised learning and reinforcement learning and proposed some improvements. (1) We have investigated the relation between the singular structure of the parameter space of hidden Markov models and the trajectories of the learning dynamics. (2) We have proposed the reinforcement learning algorithm that can adapt to the changing environments and investigated the learning dynamics.

Research Products
(7 results)

All 2014 2013 2012

All Journal Article (3 results) (of which Peer Reviewed: 1 results) Presentation (4 results)

[Journal Article] Concurrent Q LearningにおけるRelaxationの改良2013
- Author(s)
  村上和謙, 尾関智子
- Journal Title
  
  東海大学紀要情報理工学部
  
  Volume: 13 Pages: 9-14
- Peer Reviewed
[Journal Article] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013
- Author(s)
  Kazunori Murakami, Tomoko Ozeki
- Journal Title
  
  Neural Information Processing Lecture Notes in Computer Science
  
  Volume: 8227 Pages: 84-91
[Journal Article] Concurrent Q-LearningにおけるRelaxationの改良2012
- Author(s)
  村上和謙, 尾関智子
- Journal Title
  
  信学技報
  
  Volume: 112(480) Pages: 209-213
[Presentation] Concurrent Q-Learningにおける適格度トレースの影響2014
- Author(s)
  村上和謙, 尾関智子
- Organizer
  電子情報通信学会
- Place of Presentation
  新潟大学
- Year and Date
  20140300
[Presentation] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013
- Author(s)
  Kazunori Murakami, Tomoko Ozeki
- Organizer
  ICONIP2013
- Place of Presentation
  Daegu, Korea
- Year and Date
  20131100
[Presentation] Concurrent Q-LearningとSarsa, Q学習の動的環境への適応2012
- Author(s)
  村上和謙, 尾関智子
- Organizer
  IBIS2012
- Place of Presentation
  筑波大学
- Year and Date
  20121100
[Presentation] 動的環境におけるTD誤差を用いた強化学習メタパラメータ学習法2012
- Author(s)
  村上和謙, 尾関智子
- Organizer
  電子情報通信学会
- Place of Presentation
  岡山大学
- Year and Date
  20120300

2013 Fiscal Year Final Research Report

Plateau Phenomena of the Learning Dynamics and Stabilities of the Local Minima of the Error Function in Machine Learning

Principal Investigator

OZEKI Tomoko 東海大学, 情報理工学部, 教授 (10407992)

Research Products

[Journal Article] Concurrent Q LearningにおけるRelaxationの改良2013

Author(s)

Journal Title

[Journal Article] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013

Author(s)

Journal Title

[Journal Article] Concurrent Q-LearningにおけるRelaxationの改良2012

Author(s)

Journal Title

[Presentation] Concurrent Q-Learningにおける適格度トレースの影響2014

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Improvement of the Relaxation Procedure in Concurrent Q-Learning2013

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Concurrent Q-LearningとSarsa, Q学習の動的環境への適応2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 動的環境におけるTD誤差を用いた強化学習メタパラメータ学習法2012

Author(s)

Organizer

Place of Presentation

Year and Date