2023 Fiscal Year Annual Research Report

Research on the innovative evolution of deep reinforcement learning based on the profit sharing principle and its application to real problems

Research Project

Project/Area Number	21K12024
Research Institution	National Institution for Academic Degrees and Quality Enhancement of Higher Education
Principal Investigator	宮崎和光独立行政法人大学改革支援・学位授与機構, 研究開発部, 教授 (20282866)
Co-Investigator(Kenkyū-buntansha)	山口周独立行政法人大学改革支援・学位授与機構, 研究開発部, 特任教授 (10182437) 原田拓東京理科大学, 創域理工学部経営システム工学科, 准教授 (70256668) 小玉直樹明治大学, 理工学部, 助教 (60908747)
Project Period (FY)	2021-04-01 – 2024-03-31
Keywords	深層強化学習 / 利益分配原理 / 深層経験強化型学習 / スマートエネルギーシステム / 道路交通信号機制御 / ツイートデータ / 意識的意思決定システム
Outline of Annual Research Achievements	本研究課題では、初年度に、主目標である「利益分配原理（PS原理）に基づく、学習結果のばらつきを抑えた深層経験強化学習（DeePS)の基本設計」を完成させた。その後、最終年度にSSI優秀論文賞を受賞した「意識的意思決定システムのマルチエージェント環境下への拡張」等を通じ、副目標である「PS原理と適格度トレースとの関係を整理し、DeePSのMDPsを超えるクラスでの有効性を明らかにする」こと、及び「DeePSとマルチエージェント環境下での間接報酬に関する定理との関係を整理し、DeePSのマルチエージェント環境下での有効性を明らかにする」ことに寄与する成果を得た。さらに、当初想定していた応用例のうち「スマートエネルギーシステム」については初年度に達成し、「カリキュラム分析支援システム」については、最終年度に公表した学術論文「Proposal of a Course-Classification-Support System using a Deep Learning and its Evaluation when combined with Reinforcement Learning」において、カリキュラム分析の中心となる「科目分類支援システム」への強化学習の組み込みに成功した。加えて、当初は予定していなかった「道路交通信号機制御」に適用するとともに、「ツイートデータ」や「Angry Bird AI Competition」への適用も開始した。以上より、本研究課題の目的は十分に達成されたと考える。特に、深層強化学習の中心的手法として知られるDeep Q-Networkやその派生手法に対して、必要とされる試行錯誤回数の観点で圧倒的に有利なPS原理に基づく手法の有効性を、複数の応用例を通じ提示できた意義は大きく、今後の実問題への適用の拡充に貢献する重要な成果が得られたと言える。

Research Products
(11 results)

All 2024 2023

All Journal Article (4 results) (of which Peer Reviewed: 4 results, Open Access: 2 results) Presentation (7 results) (of which Int'l Joint Research: 4 results)

[Journal Article] Proposal of a Course-Classification Support System Using Deep Learning and its Evaluation When Combined with Reinforcement Learning2024
- Author(s)
  Miyazaki Kazuteru、Yamaguchi Shu、Mori Rie、Yoshikawa Yumiko、Saito Takanori、Suzuki Toshiya
- Journal Title
  
  Journal of Advanced Computational Intelligence and Intelligent Informatics
  
  Volume: 28 Pages: 454～467
- DOI
  10.20965/jaciii.2024.p0454
- Peer Reviewed / Open Access
[Journal Article] Suppression of negative tweets using reinforcement learning systems2024
- Author(s)
  Miyazaki Kazuteru、Miyazaki Hitomi
- Journal Title
  
  Cognitive Systems Research
  
  Volume: 84 Pages: -
- DOI
  10.1016/j.cogsys.2023.101207
- Peer Reviewed
[Journal Article] Performance evaluation of character-level CNNs using tweet data and analysis for weight perturbations2024
- Author(s)
  Miyazaki Kazuteru、Ida Masaaki
- Journal Title
  
  Artificial Life and Robotics
  
  Volume: 29 Pages: 266～273
- DOI
  10.1007/s10015-024-00944-9
- Peer Reviewed
[Journal Article] Enhanced Naive Agent in Angry Birds AI Competition via Exploitation-oriented Learning2024
- Author(s)
  Miyazaki Kazuteru
- Journal Title
  
  Journal of Robotics and Mechatronics
  
  Volume: 掲載予定 Pages: -
- Peer Reviewed / Open Access
[Presentation] Application of Deep Reinforcement Learning to Decentralized Control of Traffic Signals Considering Fairness in a Road Traffic Network Including Intersections Without Traffic Signals2024
- Author(s)
  Shirasaka Shogo、Kodama Naoki、Harada Taku
- Organizer
  The 10th IEEJ International Workshop on Sensing, Actuation, Motion Control, and Optimization
- Int'l Joint Research
[Presentation] Suppression of Negative Tweets using Reinforcement Learning Systems in a Multi-Agent Environment2023
- Author(s)
  Miyazaki Kazuteru、Miyazaki Hitomi
- Organizer
  2023 Annual International Conference on Brain-Inspired Cognitive Architectures for Artificial Intelligence, the 14th Annual Meeting of the BICA Society (BICA*AI 2023)
- Int'l Joint Research
[Presentation] Competencies to Be Cultivated in Higher Education and Their Evaluation in the Era of Generative AI: Through the Experiences With Self-Study Degree-Awarding Program in NIAD-QE2023
- Author(s)
  Yamada Nodoka、Sakaguchi Kikue、Nakamura Yu、Miyazaki Kazuteru、Yamaguchi Shu
- Organizer
  The 15th Higher Education International Conference, ARTIFICIAL INTELLIGENCE AND PEDAGOGICAL TRANSFORMATION: IMPLICATIONS FOR HIGHER EDUCATION QUALITY ASSURANCE
- Int'l Joint Research
[Presentation] Rule-based generation of synthetic genetic circuits2023
- Author(s)
  Yamamura Masayuki、Sekine Ryoji、Miyazaki Kazuteru、Okuda Sota、Kodama Naoki、Kiga Daisuke
- Organizer
  15th International Workshop on Bio-Design Automation (IWBDA 2023)
- Int'l Joint Research
[Presentation] 意識的意思決定システムのマルチエージェント環境下への拡張2023
- Author(s)
  宮崎和光
- Organizer
  計測自動制御学会システム・情報部門学術講演会2023 (SSI2023)
[Presentation] 燃料消費および走行時間を考慮したハイブリッド自動車走行制御に対する深層強化学習の適用2023
- Author(s)
  LI ZHAOXI、原田拓
- Organizer
  計測自動制御学会　システム・情報部門学術講演会2023 (SSI2023)
[Presentation] 機械学習手法を利用したBioDOS にとって有用な論文の発見2023
- Author(s)
  宮崎和光、木賀大介、安田翔也、濱田立輝、小玉直樹、山村雅幸
- Organizer
  電気学会システム/制御合同研究会

2023 Fiscal Year Annual Research Report

Research on the innovative evolution of deep reinforcement learning based on the profit sharing principle and its application to real problems

Principal Investigator

宮崎 和光 独立行政法人大学改革支援・学位授与機構, 研究開発部, 教授 (20282866)

Research Products

[Journal Article] Proposal of a Course-Classification Support System Using Deep Learning and its Evaluation When Combined with Reinforcement Learning2024

Author(s)

Journal Title

DOI

[Journal Article] Suppression of negative tweets using reinforcement learning systems2024

Author(s)

Journal Title

DOI

[Journal Article] Performance evaluation of character-level CNNs using tweet data and analysis for weight perturbations2024

Author(s)

Journal Title

DOI

[Journal Article] Enhanced Naive Agent in Angry Birds AI Competition via Exploitation-oriented Learning2024

Author(s)

Journal Title

[Presentation] Application of Deep Reinforcement Learning to Decentralized Control of Traffic Signals Considering Fairness in a Road Traffic Network Including Intersections Without Traffic Signals2024

Author(s)

Organizer

[Presentation] Suppression of Negative Tweets using Reinforcement Learning Systems in a Multi-Agent Environment2023

Author(s)

Organizer

[Presentation] Competencies to Be Cultivated in Higher Education and Their Evaluation in the Era of Generative AI: Through the Experiences With Self-Study Degree-Awarding Program in NIAD-QE2023

Author(s)

Organizer

[Presentation] Rule-based generation of synthetic genetic circuits2023

Author(s)

Organizer

[Presentation] 意識的意思決定システムのマルチエージェント環境下への拡張2023

Author(s)

Organizer

[Presentation] 燃料消費および走行時間を考慮したハイブリッド自動車走行制御に対する深層強化学習の適用2023

Author(s)

Organizer

[Presentation] 機械学習手法を利用したBioDOS にとって有用な論文の発見2023

Author(s)

Organizer

宮崎和光独立行政法人大学改革支援・学位授与機構, 研究開発部, 教授 (20282866)