Application of Asymptotic Optimal Strategy to Dynamic Adaptive Learning Algorithm

Research Project

Project/Area Number	15K00344
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Research Field	Soft computing
Research Institution	Osaka Prefecture University
Principal Investigator	Notsu Akira 大阪府立大学, 人間社会システム科学研究科, 准教授 (40405345)
Co-Investigator(Kenkyū-buntansha)	本多克宏大阪府立大学, 工学(系)研究科(研究院), 教授 (80332964)
Project Period (FY)	2015-04-01 – 2018-03-31
Project Status	Completed (Fiscal Year 2017)
Budget Amount *help	¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000) Fiscal Year 2017: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000) Fiscal Year 2016: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000) Fiscal Year 2015: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords	強化学習 / 最適化問題 / 漸近最適戦略 / 自己組織化マップ / 意思決定 / クラスタリング / オンライン型 / 認知モデル
Outline of Final Research Achievements	In this subject, we have studied a method for stochastically optimal selection in reinforcement learning and optimization problems. When there are multiple choices, it is necessary to judge based on how much past experience and how much good results can be expected. In this research, we were able to devise several frameworks for introducing the optimal strategy while confirming that it is the same in reinforcement learning and optimization problems. In particular, from the viewpoint of Bayesian estimation, the reinforcement learning algorithm was fundamentally reviewed and the reconstruction showed that the conventional general idea of separating learning from decision making was wrong. In addition, we also gave research results on the method of estimating the state of the learner without applying computational load.

Report

(4 results)

2017 Annual Research Report Final Research Report ( PDF )
2016 Research-status Report
2015 Research-status Report

Research Products
(27 results)

All 2018 2017 2016 2015 Other

All Journal Article (9 results) (of which Peer Reviewed: 9 results, Open Access: 1 results, Acknowledgement Compliant: 2 results) Presentation (17 results) (of which Int'l Joint Research: 4 results) Remarks (1 results)

[Journal Article] Deterministic annealing process for pLSA-induced fuzzy co-clustering and cluster splitting characteristics2018
- Author(s)
  T. Goshima, K. Honda, S. Ubukata, A. Notsu
- Journal Title
  
  International Journal of Approximate Reasoning
  
  Volume: 95 Pages: 185-193
- DOI
  10.1016/j.ijar.2018.02.005
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] FCM-type Fuzzy Coclustering for Three-mode Cooccurrence Data: 3FCCM and 3Fuzzy CoDoK2017
- Author(s)
  K. Honda, Y. Suzuki, S. Ubukata, A. Notsu
- Journal Title
  
  Advances in Fuzzy Systems
  
  Volume: 2017 Pages: 1-8
- DOI
  10.1155/2017/9842127
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] Visual Assessment of Co-cluster Structure through Cooccurrence-Sensitive Ordering2017
- Author(s)
  K. Honda, T. Sako, S. Ubukata, A. Notsu
- Journal Title
  
  Proc. of Joint 17th World Congress of International Fuzzy Systems Association and 9th International Conference on Soft Computing and Intelligent Systems
  
  Volume: 50 Pages: 1-6
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] A Fuzzy Co-clustering Model for Three-modes Relational Cooccurrence Data2017
- Author(s)
  K. Honda, Y. Suzuki, M. Nishioka, S. Ubukata, A. Notsu
- Journal Title
  
  Proc. of 2017 IEEE International Conference on Fuzzy Systems
  
  Volume: F-0272 Pages: 1-6
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] Phase Transition in pLSA-induced Fuzzy Co-clustering Based on Tuning of Intrinsic Fuzziness2017
- Author(s)
  T. Goshima, K. Honda, S. Ubukata, A. Notsu
- Journal Title
  
  Proc. of the 18th International Symposium on Advanced Intelligent Systems
  
  Volume: T2c-1 Pages: 243-249
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] Designation of Candidate Solutions in Differential Evolution Based on Bandit Algorithm2017
- Author(s)
  M. Sakakibara, A. Notsu, S. Ubukata, K. Honda
- Journal Title
  
  Proc. of the 18th International Symposium on Advanced Intelligent Systems
  
  Volume: F1c-2 Pages: 471-478
- NAID
  40021220989
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] Beta Distribution Propagating Reinforcement Learning Based on Prospect Theory for the Efficient Exploration and Exploitation2017
- Author(s)
  野津亮, 生方誠希, 本多克宏
- Journal Title
  
  Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
  
  Volume: 29 Issue: 1 Pages: 507-516
- DOI
  10.3156/jsoft.29.1_507
- NAID
  130005243096
- ISSN
  1347-7986, 1881-7203
- Related Report
  2016 Research-status Report
- Peer Reviewed
[Journal Article] Visualization of Learning Process in “State and Action” Space Using Self-Organizing Maps2016
- Author(s)
  A. Notsu, Y. Hattori, S. Ubukata, K. Honda
- Journal Title
  
  Journal of Advanced Computational Intelligence and Intelligent Informatics
  
  Volume: 20 Issue: 6 Pages: 983-991
- DOI
  10.20965/jaciii.2016.p0983
- NAID
  130007673111
- ISSN
  1343-0130, 1883-8014
- Year and Date
  2016-11-20
- Related Report
  2016 Research-status Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] バンディットアルゴリズムに基づいた汎用最適化手法の開発2016
- Author(s)
  野津亮, 河上寛和, 本多克宏, 生方誠希
- Journal Title
  
  知能と情報（日本知能情報ファジィ学会誌）
  
  Volume: 28 Pages: 522-534
- NAID
  130005128539
- Related Report
  2015 Research-status Report
- Peer Reviewed / Acknowledgement Compliant
[Presentation] 3モード共起関係データの組織間協調型ファジィ共クラスタリング2018
- Author(s)
  松崎正太郎，本多克宏，生方誠希，野津亮
- Organizer
  平成29年度計測自動制御学会関西支部・システム制御情報学会若手研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Spectral Ordering に基づく共クラスター構造の視覚化2017
- Author(s)
  佐古拓也，本多克宏，生方誠希，野津亮
- Organizer
  第61回システム制御情報学会研究発表講演会
- Related Report
  2017 Annual Research Report
[Presentation] バンディットアルゴリズムに基づいた差分進化における解集団の生成2017
- Author(s)
  榊原雅也，野津亮，生方誠希，本多克宏
- Organizer
  第61回システム制御情報学会研究発表講演会
- Related Report
  2017 Annual Research Report
[Presentation] pLSAの発展によるファジィ共クラスタリングにおけるファジィ度と相転移に関する考察2017
- Author(s)
  五島隆文，本多克宏，生方誠希，野津亮
- Organizer
  第33回ファジィシステムシンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] 共クラスター構造の視覚的な把握と都道府県間人口移動データ分析への応用2017
- Author(s)
  佐古拓也，本多克宏，生方誠希，野津亮
- Organizer
  第33回ファジィシステムシンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] トンプソンサンプリングにおけるサンプリングの省略2017
- Author(s)
  野津亮，柳川綾香，生方誠希，本多克宏
- Organizer
  第33回ファジィシステムシンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] 成長型自己組織化マップによる強化学習2017
- Author(s)
  野津亮
- Organizer
  第27回インテリジェント・システム・シンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] 自己組織化マップを用いた強化学習結果の抽象化とその利用2016
- Author(s)
  野津亮，近藤佑紀，生方誠希，本多克宏
- Organizer
  第26回インテリジェント・システム・シンポジウム
- Place of Presentation
  大阪
- Year and Date
  2016-10-27
- Related Report
  2016 Research-status Report
[Presentation] 認知特性に基づいたバンディットアルゴリズムの頑強性2016
- Author(s)
  菊田美月，野津亮，生方誠希，本多克宏
- Organizer
  第32回ファジィシステムシンポジウム
- Place of Presentation
  佐賀
- Year and Date
  2016-08-31
- Related Report
  2016 Research-status Report
[Presentation] Application of the UCT Algorithm for Noisy Optimization Problems2016
- Author(s)
  A. Notsu, S. Kane, S. Ubukata, K. Honda
- Organizer
  Joint 8th International Conference on Soft Computing and Intelligent Systems and 17th International Symposium on Advanced Intelligent Systems
- Place of Presentation
  sapporo, hokkaido, japan
- Year and Date
  2016-08-25
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Presentation] Performance Investigation of UCB Policy in Q-Learning2015
- Author(s)
  K. Saito, A. Notsu, S. Ubukata and K. Honda
- Organizer
  International Conference on Machine Learning and Applications
- Place of Presentation
  Pullman Hotel，マイアミ，アメリカ
- Year and Date
  2015-12-09
- Related Report
  2015 Research-status Report
- Int'l Joint Research
[Presentation] Proposal of Grid Area Search with UCB for Discrete Optimization Problem2015
- Author(s)
  A. Notsu, K. Saito, Y. Nohara, S. Ubukata and K. Honda
- Organizer
  Integrated Uncertainty in Knowledge Modelling and Decision Making
- Place of Presentation
  SUNRISE HOTEL，ニャチャン，ベトナム
- Year and Date
  2015-10-15
- Related Report
  2015 Research-status Report
- Int'l Joint Research
[Presentation] FCM-type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes2015
- Author(s)
  A. Notsu, T. Ueno, Y. Hattori, S. Ubukata and K. Honda
- Organizer
  Integrated Uncertainty in Knowledge Modelling and Decision Making
- Place of Presentation
  SUNRISE HOTEL，ニャチャン，ベトナム
- Year and Date
  2015-10-15
- Related Report
  2015 Research-status Report
- Int'l Joint Research
[Presentation] Q学習におけるUCB行動選択手法の性能に関する調査2015
- Author(s)
  斉藤晃貴，野津亮，生方誠希，本多克宏
- Organizer
  第25回インテリジェント・システム・シンポジウム
- Place of Presentation
  東北大学片平さくらホール（宮城県仙台市）
- Year and Date
  2015-09-24
- Related Report
  2015 Research-status Report
[Presentation] 強化学習における自己組織化マップを用いた状態と行動の学習プロセスの可視化2015
- Author(s)
  服部雄市，野津亮，生方誠希，本多克宏
- Organizer
  第25回インテリジェント・システム・シンポジウム
- Place of Presentation
  東北大学片平さくらホール（宮城県仙台市）
- Year and Date
  2015-09-24
- Related Report
  2015 Research-status Report
[Presentation] Q学習におけるファジィ共クラスタリングによる知識の圧縮と再利用2015
- Author(s)
  服部雄市，野津亮，生方誠希，本多克宏，上野貴紀
- Organizer
  第31回ファジィシステムシンポジウム
- Place of Presentation
  電気通信大学（東京都調布市）
- Year and Date
  2015-09-02
- Related Report
  2015 Research-status Report
[Presentation] UCBによる離散最適化問題の探索と活用の調整2015
- Author(s)
  斉藤晃貴，野津亮，野原由布美，生方誠希，本多克宏
- Organizer
  第31回ファジィシステムシンポジウム
- Place of Presentation
  電気通信大学（東京都調布市）
- Year and Date
  2015-09-02
- Related Report
  2015 Research-status Report
[Remarks] 人間情報システム研究グループ
- URL
  http://www.cs.osakafu-u.ac.jp/hi/
- Related Report
  2017 Annual Research Report

Application of Asymptotic Optimal Strategy to Dynamic Adaptive Learning Algorithm

Principal Investigator

Notsu Akira 大阪府立大学, 人間社会システム科学研究科, 准教授 (40405345)

¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000)

Report

Research Products

[Journal Article] Deterministic annealing process for pLSA-induced fuzzy co-clustering and cluster splitting characteristics2018

Author(s)

Journal Title

DOI

Related Report

[Journal Article] FCM-type Fuzzy Coclustering for Three-mode Cooccurrence Data: 3FCCM and 3Fuzzy CoDoK2017

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Visual Assessment of Co-cluster Structure through Cooccurrence-Sensitive Ordering2017

Author(s)

Journal Title

Related Report

[Journal Article] A Fuzzy Co-clustering Model for Three-modes Relational Cooccurrence Data2017

Author(s)

Journal Title

Related Report

[Journal Article] Phase Transition in pLSA-induced Fuzzy Co-clustering Based on Tuning of Intrinsic Fuzziness2017

Author(s)

Journal Title

Related Report

[Journal Article] Designation of Candidate Solutions in Differential Evolution Based on Bandit Algorithm2017

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Beta Distribution Propagating Reinforcement Learning Based on Prospect Theory for the Efficient Exploration and Exploitation2017

Author(s)

Journal Title

DOI

NAID

ISSN

Related Report

[Journal Article] Visualization of Learning Process in “State and Action” Space Using Self-Organizing Maps2016

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] バンディットアルゴリズムに基づいた汎用最適化手法の開発2016

Author(s)

Journal Title

NAID

Related Report

[Presentation] 3モード共起関係データの組織間協調型ファジィ共クラスタリング2018

Author(s)

Organizer

Related Report

[Presentation] Spectral Ordering に基づく共クラスター構造の視覚化2017

Author(s)

Organizer

Related Report

[Presentation] バンディットアルゴリズムに基づいた差分進化における解集団の生成2017

Author(s)

Organizer

Related Report

[Presentation] pLSAの発展によるファジィ共クラスタリングにおけるファジィ度と相転移に関する考察2017

Author(s)

Organizer

Related Report

[Presentation] 共クラスター構造の視覚的な把握と都道府県間人口移動データ分析への応用2017

Author(s)

Organizer

Related Report

[Presentation] トンプソンサンプリングにおけるサンプリングの省略2017

Author(s)

Organizer

Related Report

[Presentation] 成長型自己組織化マップによる強化学習2017

Author(s)

Organizer