• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Application of Asymptotic Optimal Strategy to Dynamic Adaptive Learning Algorithm

Research Project

Project/Area Number 15K00344
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Soft computing
Research InstitutionOsaka Prefecture University

Principal Investigator

Notsu Akira  大阪府立大学, 人間社会システム科学研究科, 准教授 (40405345)

Co-Investigator(Kenkyū-buntansha) 本多 克宏  大阪府立大学, 工学(系)研究科(研究院), 教授 (80332964)
Project Period (FY) 2015-04-01 – 2018-03-31
Project Status Completed (Fiscal Year 2017)
Budget Amount *help
¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000)
Fiscal Year 2017: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
Fiscal Year 2016: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2015: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords強化学習 / 最適化問題 / 漸近最適戦略 / 自己組織化マップ / 意思決定 / クラスタリング / オンライン型 / 認知モデル
Outline of Final Research Achievements

In this subject, we have studied a method for stochastically optimal selection in reinforcement learning and optimization problems. When there are multiple choices, it is necessary to judge based on how much past experience and how much good results can be expected. In this research, we were able to devise several frameworks for introducing the optimal strategy while confirming that it is the same in reinforcement learning and optimization problems. In particular, from the viewpoint of Bayesian estimation, the reinforcement learning algorithm was fundamentally reviewed and the reconstruction showed that the conventional general idea of separating learning from decision making was wrong. In addition, we also gave research results on the method of estimating the state of the learner without applying computational load.

Report

(4 results)
  • 2017 Annual Research Report   Final Research Report ( PDF )
  • 2016 Research-status Report
  • 2015 Research-status Report
  • Research Products

    (27 results)

All 2018 2017 2016 2015 Other

All Journal Article (9 results) (of which Peer Reviewed: 9 results,  Open Access: 1 results,  Acknowledgement Compliant: 2 results) Presentation (17 results) (of which Int'l Joint Research: 4 results) Remarks (1 results)

  • [Journal Article] Deterministic annealing process for pLSA-induced fuzzy co-clustering and cluster splitting characteristics2018

    • Author(s)
      T. Goshima, K. Honda, S. Ubukata, A. Notsu
    • Journal Title

      International Journal of Approximate Reasoning

      Volume: 95 Pages: 185-193

    • DOI

      10.1016/j.ijar.2018.02.005

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] FCM-type Fuzzy Coclustering for Three-mode Cooccurrence Data: 3FCCM and 3Fuzzy CoDoK2017

    • Author(s)
      K. Honda, Y. Suzuki, S. Ubukata, A. Notsu
    • Journal Title

      Advances in Fuzzy Systems

      Volume: 2017 Pages: 1-8

    • DOI

      10.1155/2017/9842127

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Visual Assessment of Co-cluster Structure through Cooccurrence-Sensitive Ordering2017

    • Author(s)
      K. Honda, T. Sako, S. Ubukata, A. Notsu
    • Journal Title

      Proc. of Joint 17th World Congress of International Fuzzy Systems Association and 9th International Conference on Soft Computing and Intelligent Systems

      Volume: 50 Pages: 1-6

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Fuzzy Co-clustering Model for Three-modes Relational Cooccurrence Data2017

    • Author(s)
      K. Honda, Y. Suzuki, M. Nishioka, S. Ubukata, A. Notsu
    • Journal Title

      Proc. of 2017 IEEE International Conference on Fuzzy Systems

      Volume: F-0272 Pages: 1-6

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Phase Transition in pLSA-induced Fuzzy Co-clustering Based on Tuning of Intrinsic Fuzziness2017

    • Author(s)
      T. Goshima, K. Honda, S. Ubukata, A. Notsu
    • Journal Title

      Proc. of the 18th International Symposium on Advanced Intelligent Systems

      Volume: T2c-1 Pages: 243-249

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Designation of Candidate Solutions in Differential Evolution Based on Bandit Algorithm2017

    • Author(s)
      M. Sakakibara, A. Notsu, S. Ubukata, K. Honda
    • Journal Title

      Proc. of the 18th International Symposium on Advanced Intelligent Systems

      Volume: F1c-2 Pages: 471-478

    • NAID

      40021220989

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Beta Distribution Propagating Reinforcement Learning Based on Prospect Theory for the Efficient Exploration and Exploitation2017

    • Author(s)
      野津 亮, 生方誠希, 本多克宏
    • Journal Title

      Journal of Japan Society for Fuzzy Theory and Intelligent Informatics

      Volume: 29 Issue: 1 Pages: 507-516

    • DOI

      10.3156/jsoft.29.1_507

    • NAID

      130005243096

    • ISSN
      1347-7986, 1881-7203
    • Related Report
      2016 Research-status Report
    • Peer Reviewed
  • [Journal Article] Visualization of Learning Process in “State and Action” Space Using Self-Organizing Maps2016

    • Author(s)
      A. Notsu, Y. Hattori, S. Ubukata, K. Honda
    • Journal Title

      Journal of Advanced Computational Intelligence and Intelligent Informatics

      Volume: 20 Issue: 6 Pages: 983-991

    • DOI

      10.20965/jaciii.2016.p0983

    • NAID

      130007673111

    • ISSN
      1343-0130, 1883-8014
    • Year and Date
      2016-11-20
    • Related Report
      2016 Research-status Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] バンディットアルゴリズムに基づいた汎用最適化手法の開発2016

    • Author(s)
      野津 亮, 河上 寛和, 本多克宏, 生方誠希
    • Journal Title

      知能と情報(日本知能情報ファジィ学会誌)

      Volume: 28 Pages: 522-534

    • NAID

      130005128539

    • Related Report
      2015 Research-status Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Presentation] 3モード共起関係データの組織間協調型ファジィ共クラスタリング2018

    • Author(s)
      松崎 正太郎,本多 克宏,生方 誠希,野津 亮
    • Organizer
      平成29年度計測自動制御学会関西支部・システム制御情報学会若手研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] Spectral Ordering に基づく共クラスター構造の視覚化2017

    • Author(s)
      佐古拓也,本多克宏,生方誠希,野津亮
    • Organizer
      第61回システム制御情報学会研究発表講演会
    • Related Report
      2017 Annual Research Report
  • [Presentation] バンディットアルゴリズムに基づいた差分進化における解集団の生成2017

    • Author(s)
      榊原 雅也,野津 亮,生方 誠希,本多 克宏
    • Organizer
      第61回システム制御情報学会研究発表講演会
    • Related Report
      2017 Annual Research Report
  • [Presentation] pLSAの発展によるファジィ共クラスタリングにおけるファジィ度と相転移に関する考察2017

    • Author(s)
      五島 隆文,本多 克宏,生方 誠希,野津 亮
    • Organizer
      第33回ファジィシステムシンポジウム
    • Related Report
      2017 Annual Research Report
  • [Presentation] 共クラスター構造の視覚的な把握と都道府県間人口移動データ分析への応用2017

    • Author(s)
      佐古 拓也,本多 克宏,生方 誠希,野津 亮
    • Organizer
      第33回ファジィシステムシンポジウム
    • Related Report
      2017 Annual Research Report
  • [Presentation] トンプソンサンプリングにおけるサンプリングの省略2017

    • Author(s)
      野津 亮,柳川 綾香,生方 誠希,本多 克宏
    • Organizer
      第33回ファジィシステムシンポジウム
    • Related Report
      2017 Annual Research Report
  • [Presentation] 成長型自己組織化マップによる強化学習2017

    • Author(s)
      野津 亮
    • Organizer
      第27回インテリジェント・システム・シンポジウム
    • Related Report
      2017 Annual Research Report
  • [Presentation] 自己組織化マップを用いた強化学習結果の抽象化とその利用2016

    • Author(s)
      野津 亮,近藤 佑紀,生方 誠希,本多 克宏
    • Organizer
      第26回インテリジェント・システム・シンポジウム
    • Place of Presentation
      大阪
    • Year and Date
      2016-10-27
    • Related Report
      2016 Research-status Report
  • [Presentation] 認知特性に基づいたバンディットアルゴリズムの頑強性2016

    • Author(s)
      菊田 美月,野津 亮,生方 誠希,本多 克宏
    • Organizer
      第32回ファジィシステムシンポジウム
    • Place of Presentation
      佐賀
    • Year and Date
      2016-08-31
    • Related Report
      2016 Research-status Report
  • [Presentation] Application of the UCT Algorithm for Noisy Optimization Problems2016

    • Author(s)
      A. Notsu, S. Kane, S. Ubukata, K. Honda
    • Organizer
      Joint 8th International Conference on Soft Computing and Intelligent Systems and 17th International Symposium on Advanced Intelligent Systems
    • Place of Presentation
      sapporo, hokkaido, japan
    • Year and Date
      2016-08-25
    • Related Report
      2016 Research-status Report
    • Int'l Joint Research
  • [Presentation] Performance Investigation of UCB Policy in Q-Learning2015

    • Author(s)
      K. Saito, A. Notsu, S. Ubukata and K. Honda
    • Organizer
      International Conference on Machine Learning and Applications
    • Place of Presentation
      Pullman Hotel,マイアミ,アメリカ
    • Year and Date
      2015-12-09
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] Proposal of Grid Area Search with UCB for Discrete Optimization Problem2015

    • Author(s)
      A. Notsu, K. Saito, Y. Nohara, S. Ubukata and K. Honda
    • Organizer
      Integrated Uncertainty in Knowledge Modelling and Decision Making
    • Place of Presentation
      SUNRISE HOTEL,ニャチャン,ベトナム
    • Year and Date
      2015-10-15
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] FCM-type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes2015

    • Author(s)
      A. Notsu, T. Ueno, Y. Hattori, S. Ubukata and K. Honda
    • Organizer
      Integrated Uncertainty in Knowledge Modelling and Decision Making
    • Place of Presentation
      SUNRISE HOTEL,ニャチャン,ベトナム
    • Year and Date
      2015-10-15
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] Q学習におけるUCB行動選択手法の性能に関する調査2015

    • Author(s)
      斉藤 晃貴,野津 亮,生方 誠希,本多 克宏
    • Organizer
      第25回インテリジェント・システム・シンポジウム
    • Place of Presentation
      東北大学片平さくらホール(宮城県仙台市)
    • Year and Date
      2015-09-24
    • Related Report
      2015 Research-status Report
  • [Presentation] 強化学習における自己組織化マップを用いた状態と行動の学習プロセスの可視化2015

    • Author(s)
      服部 雄市,野津 亮,生方 誠希,本多 克宏
    • Organizer
      第25回インテリジェント・システム・シンポジウム
    • Place of Presentation
      東北大学片平さくらホール(宮城県仙台市)
    • Year and Date
      2015-09-24
    • Related Report
      2015 Research-status Report
  • [Presentation] Q学習におけるファジィ共クラスタリングによる知識の圧縮と再利用2015

    • Author(s)
      服部 雄市,野津 亮,生方 誠希,本多 克宏,上野 貴紀
    • Organizer
      第31回ファジィシステムシンポジウム
    • Place of Presentation
      電気通信大学(東京都調布市)
    • Year and Date
      2015-09-02
    • Related Report
      2015 Research-status Report
  • [Presentation] UCBによる離散最適化問題の探索と活用の調整2015

    • Author(s)
      斉藤 晃貴,野津 亮,野原 由布美,生方 誠希,本多 克宏
    • Organizer
      第31回ファジィシステムシンポジウム
    • Place of Presentation
      電気通信大学(東京都調布市)
    • Year and Date
      2015-09-02
    • Related Report
      2015 Research-status Report
  • [Remarks] 人間情報システム研究グループ

    • URL

      http://www.cs.osakafu-u.ac.jp/hi/

    • Related Report
      2017 Annual Research Report

URL: 

Published: 2015-04-16   Modified: 2019-03-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi